From patchwork Fri Feb 2 14:08:44 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Roman Pen X-Patchwork-Id: 10196795 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id F41F46037D for ; Fri, 2 Feb 2018 14:10:54 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id E26B228E78 for ; Fri, 2 Feb 2018 14:10:54 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id D717028E7A; Fri, 2 Feb 2018 14:10:54 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.9 required=2.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,RCVD_IN_DNSWL_HI autolearn=ham version=3.3.1 Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 076FF28E72 for ; Fri, 2 Feb 2018 14:10:54 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752019AbeBBOKx (ORCPT ); Fri, 2 Feb 2018 09:10:53 -0500 Received: from mail-wm0-f67.google.com ([74.125.82.67]:53697 "EHLO mail-wm0-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751960AbeBBOKs (ORCPT ); Fri, 2 Feb 2018 09:10:48 -0500 Received: by mail-wm0-f67.google.com with SMTP id t74so12934334wme.3 for ; Fri, 02 Feb 2018 06:10:47 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=profitbricks-com.20150623.gappssmtp.com; s=20150623; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=PnW5RpVw+b/Lj7FhkFgikmcleJTF9li4rmCUuCQDXU0=; b=lic1wpo45zMKwHkjy8dl+8u5wUgglkgGAfAJmy+dmZBaELkSigpt0bP1xd9ZhOM60o LmYefZcfNU3b79DDvU6Nv4fgJbOIt0n7szJbuUbm8LY2TneUBEAHDrQngD+7TUc3Tuih JqhblCFqmDm6juTCPbh4Kc/5qtbXpthIsUzWWK/c+QruCd1cI3wTGE9GWMkebjw3p7dZ HoKiOXmQhpbKanii+Gpnz5277qiTlKoJc67eU4Y+zuVXcFwQaTsq1yhrOk1DzCIEgiOP f09hHL3UAz1lo8DYKVp/ZPQzJsjqUnu8VF0tImhVOjrfUbrm5W0glcbA4E/O8W3V9Ksf HrBg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=PnW5RpVw+b/Lj7FhkFgikmcleJTF9li4rmCUuCQDXU0=; b=dEMdfDtPNkRYPxXJldSXoMLU1CI0so9BAUK7EuIXRIWSKdN2R7PRoiFPOJbbDsvKM+ lWW3aUjawh50ifhfUIJ1xH9KPXWIvVoyxAWpUPk9o0fcTproykx07wcbCR4IJDEkQST5 xv2Ba2u2xwzSLP71YLZ8Uv5xfiGs4opF60n6IYJ30DYTT3x5na8cI6yTl3M86Ob9ttux 7ofPDSmMYmpz9In4NCOc3+E+Jqy3ARDm2hepEmy3SXIiEuoH8zr23oO60KCES75JZDkn V+EvkYnFuHlr+Tz5nfBzOUQtybXqZuXQrw/aQ7g0GwKnaey1wPQhnRzwMQqtsR7S8wTC PmHg== X-Gm-Message-State: AKwxytcCu0wbGaj1YSFe0y/acNAhNPhROr5Jj8sBIdZQVobCcFJcVZIZ KrFV575dKfrNqkxzxaNIumIUyNxy X-Google-Smtp-Source: AH8x226NHOD5dyeFgOTUC+d8baLhr41b5lY5CKpQiPYuAlwqm+D76hfXsBuaC9a2/1PBlkAtgM1ESQ== X-Received: by 10.28.191.148 with SMTP id o20mr30226081wmi.63.1517580646580; Fri, 02 Feb 2018 06:10:46 -0800 (PST) Received: from pb.pb.local ([62.217.45.26]) by smtp.gmail.com with ESMTPSA id v186sm798819wmf.17.2018.02.02.06.10.45 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 02 Feb 2018 06:10:45 -0800 (PST) From: Roman Pen To: linux-block@vger.kernel.org, linux-rdma@vger.kernel.org Cc: Jens Axboe , Christoph Hellwig , Sagi Grimberg , Bart Van Assche , Or Gerlitz , Roman Pen , Danil Kipnis , Jack Wang Subject: [PATCH 04/24] ibtrs: client: private header with client structs and functions Date: Fri, 2 Feb 2018 15:08:44 +0100 Message-Id: <20180202140904.2017-5-roman.penyaev@profitbricks.com> X-Mailer: git-send-email 2.13.1 In-Reply-To: <20180202140904.2017-1-roman.penyaev@profitbricks.com> References: <20180202140904.2017-1-roman.penyaev@profitbricks.com> Sender: linux-block-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-block@vger.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP This header describes main structs and functions used by ibtrs-client module, mainly for managing IBTRS sessions, creating/destroying sysfs entries, accounting statistics on client side. Signed-off-by: Roman Pen Signed-off-by: Danil Kipnis Cc: Jack Wang --- drivers/infiniband/ulp/ibtrs/ibtrs-clt.h | 338 +++++++++++++++++++++++++++++++ 1 file changed, 338 insertions(+) diff --git a/drivers/infiniband/ulp/ibtrs/ibtrs-clt.h b/drivers/infiniband/ulp/ibtrs/ibtrs-clt.h new file mode 100644 index 000000000000..b57af19ac833 --- /dev/null +++ b/drivers/infiniband/ulp/ibtrs/ibtrs-clt.h @@ -0,0 +1,338 @@ +/* + * InfiniBand Transport Layer + * + * Copyright (c) 2014 - 2017 ProfitBricks GmbH. All rights reserved. + * Authors: Fabian Holler + * Jack Wang + * Kleber Souza + * Danil Kipnis + * Roman Penyaev + * Milind Dumbare + * + * Copyright (c) 2017 - 2018 ProfitBricks GmbH. All rights reserved. + * Authors: Danil Kipnis + * Roman Penyaev + * + * This program is free software; you can redistribute it and/or + * modify it under the terms of the GNU General Public License + * as published by the Free Software Foundation; either version 2 + * of the License, or (at your option) any later version. + * + * This program is distributed in the hope that it will be useful, + * but WITHOUT ANY WARRANTY; without even the implied warranty of + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the + * GNU General Public License for more details. + * + * You should have received a copy of the GNU General Public License + * along with this program; if not, see . + */ + +#ifndef IBTRS_CLT_H +#define IBTRS_CLT_H + +#include "ibtrs-pri.h" + +/** + * enum ibtrs_clt_state - Client states. + */ +enum ibtrs_clt_state { + IBTRS_CLT_CONNECTING, + IBTRS_CLT_CONNECTING_ERR, + IBTRS_CLT_RECONNECTING, + IBTRS_CLT_CONNECTED, + IBTRS_CLT_CLOSING, + IBTRS_CLT_CLOSED, + IBTRS_CLT_DEAD, +}; + +static inline const char *ibtrs_clt_state_str(enum ibtrs_clt_state state) +{ + switch (state) { + case IBTRS_CLT_CONNECTING: + return "IBTRS_CLT_CONNECTING"; + case IBTRS_CLT_CONNECTING_ERR: + return "IBTRS_CLT_CONNECTING_ERR"; + case IBTRS_CLT_RECONNECTING: + return "IBTRS_CLT_RECONNECTING"; + case IBTRS_CLT_CONNECTED: + return "IBTRS_CLT_CONNECTED"; + case IBTRS_CLT_CLOSING: + return "IBTRS_CLT_CLOSING"; + case IBTRS_CLT_CLOSED: + return "IBTRS_CLT_CLOSED"; + case IBTRS_CLT_DEAD: + return "IBTRS_CLT_DEAD"; + default: + return "UNKNOWN"; + } +} + +enum ibtrs_fast_reg { + IBTRS_FAST_MEM_NONE, + IBTRS_FAST_MEM_FR, + IBTRS_FAST_MEM_FMR +}; + +enum ibtrs_mp_policy { + MP_POLICY_RR, + MP_POLICY_MIN_INFLIGHT, +}; + +struct ibtrs_clt_stats_reconnects { + int successful_cnt; + int fail_cnt; +}; + +struct ibtrs_clt_stats_wc_comp { + u32 cnt; + u64 total_cnt; +}; + +struct ibtrs_clt_stats_cpu_migr { + atomic_t from; + int to; +}; + +struct ibtrs_clt_stats_rdma { + struct { + u64 cnt; + u64 size_total; + } dir[2]; + + u64 failover_cnt; +}; + +struct ibtrs_clt_stats_rdma_lat { + u64 read; + u64 write; +}; + +#define MIN_LOG_SG 2 +#define MAX_LOG_SG 5 +#define MAX_LIN_SG BIT(MIN_LOG_SG) +#define SG_DISTR_SZ (MAX_LOG_SG - MIN_LOG_SG + MAX_LIN_SG + 2) + +#define MAX_LOG_LAT 16 +#define MIN_LOG_LAT 0 +#define LOG_LAT_SZ (MAX_LOG_LAT - MIN_LOG_LAT + 2) + +struct ibtrs_clt_stats_pcpu { + struct ibtrs_clt_stats_cpu_migr cpu_migr; + struct ibtrs_clt_stats_rdma rdma; + u64 sg_list_total; + u64 sg_list_distr[SG_DISTR_SZ]; + struct ibtrs_clt_stats_rdma_lat rdma_lat_distr[LOG_LAT_SZ]; + struct ibtrs_clt_stats_rdma_lat rdma_lat_max; + struct ibtrs_clt_stats_wc_comp wc_comp; +}; + +struct ibtrs_clt_stats { + bool enable_rdma_lat; + struct ibtrs_clt_stats_pcpu __percpu *pcpu_stats; + struct ibtrs_clt_stats_reconnects reconnects; + atomic_t inflight; +}; + +struct ibtrs_clt_con { + struct ibtrs_con c; + unsigned cpu; + atomic_t io_cnt; + struct ibtrs_fr_pool *fr_pool; + int cm_err; +}; + +struct ibtrs_clt_io_req { + struct list_head list; + struct ibtrs_iu *iu; + struct scatterlist *sglist; /* list holding user data */ + unsigned int sg_cnt; + unsigned int sg_size; + unsigned int data_len; + unsigned int usr_len; + void *priv; + bool in_use; + struct ibtrs_clt_con *con; + union { + struct ib_pool_fmr **fmr_list; + struct ibtrs_fr_desc **fr_list; + }; + void *map_page; + struct ibtrs_tag *tag; + u16 nmdesc; + enum dma_data_direction dir; + ibtrs_conf_fn *conf; + unsigned long start_time; +}; + +struct ibtrs_clt_sess { + struct ibtrs_sess s; + struct ibtrs_clt *clt; + wait_queue_head_t state_wq; + enum ibtrs_clt_state state; + atomic_t connected_cnt; + struct mutex init_mutex; + struct ibtrs_clt_io_req *reqs; + struct ib_fmr_pool *fmr_pool; + struct delayed_work reconnect_dwork; + struct work_struct close_work; + unsigned reconnect_attempts; + bool established; + u64 *srv_rdma_addr; + u32 srv_rdma_buf_rkey; + u32 max_io_size; + u32 max_req_size; + u32 chunk_size; + u32 max_desc; + size_t queue_depth; + enum ibtrs_fast_reg fast_reg_mode; + u64 mr_page_mask; + u32 mr_page_size; + u32 mr_max_size; + u32 max_pages_per_mr; + int max_sge; + struct kobject kobj; + struct kobject kobj_stats; + struct ibtrs_clt_stats stats; + struct list_head __percpu + *mp_skip_entry; +}; + +struct ibtrs_clt { + struct list_head /* __rcu */ paths_list; + size_t paths_num; + struct ibtrs_clt_sess + __percpu * __rcu *pcpu_path; + + bool opened; + uuid_t paths_uuid; + int paths_up; + struct mutex paths_mutex; + struct mutex paths_ev_mutex; + char sessname[NAME_MAX]; + short port; + unsigned max_reconnect_attempts; + unsigned reconnect_delay_sec; + unsigned max_segments; + void *tags; + unsigned long *tags_map; + size_t queue_depth; + size_t max_io_size; + wait_queue_head_t tags_wait; + size_t pdu_sz; + void *priv; + link_clt_ev_fn *link_ev; + struct kobject kobj; + struct kobject kobj_paths; + enum ibtrs_mp_policy mp_policy; +}; + +static inline struct ibtrs_clt_con *to_clt_con(struct ibtrs_con *c) +{ + if (unlikely(!c)) + return NULL; + + return container_of(c, struct ibtrs_clt_con, c); +} + +static inline struct ibtrs_clt_sess *to_clt_sess(struct ibtrs_sess *s) +{ + if (unlikely(!s)) + return NULL; + + return container_of(s, struct ibtrs_clt_sess, s); +} + +/** + * list_next_or_null_rr - get next list element in round-robin fashion. + * @pos: entry, starting cursor. + * @head: head of the list to examine. This list must have at least one + * element, namely @pos. + * @member: name of the list_head structure within typeof(*pos). + * + * Important to understand that @pos is a list entry, which can be already + * removed using list_del_rcu(), so if @head has become empty NULL will be + * returned. Otherwise next element is returned in round-robin fashion. + */ +#define list_next_or_null_rcu_rr(pos, head, member) ({ \ + typeof(pos) ________next = NULL; \ + \ + if (!list_empty(head)) \ + ________next = (pos)->member.next != (head) ? \ + list_entry_rcu((pos)->member.next, \ + typeof(*pos), member) : \ + list_entry_rcu((pos)->member.next->next, \ + typeof(*pos), member); \ + ________next; \ +}) + +/* See ibtrs-log.h */ +#define TYPES_TO_SESSNAME(obj) \ + LIST(CASE(obj, struct ibtrs_clt_sess *, s.sessname), \ + CASE(obj, struct ibtrs_clt *, sessname)) + +#define TAG_SIZE(clt) (sizeof(struct ibtrs_tag) + (clt)->pdu_sz) +#define GET_TAG(clt, idx) ((clt)->tags + TAG_SIZE(clt) * idx) + +int ibtrs_clt_reconnect_from_sysfs(struct ibtrs_clt_sess *sess); +int ibtrs_clt_disconnect_from_sysfs(struct ibtrs_clt_sess *sess); +int ibtrs_clt_create_path_from_sysfs(struct ibtrs_clt *clt, + struct ibtrs_addr *addr); +int ibtrs_clt_remove_path_from_sysfs(struct ibtrs_clt_sess *sess, + const struct attribute *sysfs_self); + +void ibtrs_clt_set_max_reconnect_attempts(struct ibtrs_clt *clt, int value); +int ibtrs_clt_get_max_reconnect_attempts(const struct ibtrs_clt *clt); + +/* ibtrs-clt-stats.c */ + +int ibtrs_clt_init_stats(struct ibtrs_clt_stats *stats); +void ibtrs_clt_free_stats(struct ibtrs_clt_stats *stats); + +void ibtrs_clt_decrease_inflight(struct ibtrs_clt_stats *s); +void ibtrs_clt_inc_failover_cnt(struct ibtrs_clt_stats *s); + +void ibtrs_clt_update_rdma_lat(struct ibtrs_clt_stats *s, bool read, + unsigned long ms); +void ibtrs_clt_update_wc_stats(struct ibtrs_clt_con *con); +void ibtrs_clt_update_all_stats(struct ibtrs_clt_io_req *req, int dir); + +int ibtrs_clt_reset_sg_list_distr_stats(struct ibtrs_clt_stats *stats, + bool enable); +int ibtrs_clt_stats_sg_list_distr_to_str(struct ibtrs_clt_stats *stats, + char *buf, size_t len); +int ibtrs_clt_reset_rdma_lat_distr_stats(struct ibtrs_clt_stats *stats, + bool enable); +ssize_t ibtrs_clt_stats_rdma_lat_distr_to_str(struct ibtrs_clt_stats *stats, + char *page, size_t len); +int ibtrs_clt_reset_cpu_migr_stats(struct ibtrs_clt_stats *stats, bool enable); +int ibtrs_clt_stats_migration_cnt_to_str(struct ibtrs_clt_stats *stats, char *buf, + size_t len); +int ibtrs_clt_reset_reconnects_stat(struct ibtrs_clt_stats *stats, bool enable); +int ibtrs_clt_stats_reconnects_to_str(struct ibtrs_clt_stats *stats, char *buf, + size_t len); +int ibtrs_clt_reset_wc_comp_stats(struct ibtrs_clt_stats *stats, bool enable); +int ibtrs_clt_stats_wc_completion_to_str(struct ibtrs_clt_stats *stats, char *buf, + size_t len); +int ibtrs_clt_reset_rdma_stats(struct ibtrs_clt_stats *stats, bool enable); +ssize_t ibtrs_clt_stats_rdma_to_str(struct ibtrs_clt_stats *stats, + char *page, size_t len); +bool ibtrs_clt_sess_is_connected(const struct ibtrs_clt_sess *sess); +int ibtrs_clt_reset_all_stats(struct ibtrs_clt_stats *stats, bool enable); +ssize_t ibtrs_clt_reset_all_help(struct ibtrs_clt_stats *stats, + char *page, size_t len); + +/* ibtrs-clt-sysfs.c */ + +int ibtrs_clt_create_sysfs_module_files(void); +void ibtrs_clt_destroy_sysfs_module_files(void); + +int ibtrs_clt_create_sysfs_root_folders(struct ibtrs_clt *clt); +int ibtrs_clt_create_sysfs_root_files(struct ibtrs_clt *clt); +void ibtrs_clt_destroy_sysfs_root_folders(struct ibtrs_clt *clt); +void ibtrs_clt_destroy_sysfs_root_files(struct ibtrs_clt *clt); + +int ibtrs_clt_create_sess_files(struct ibtrs_clt_sess *sess); +void ibtrs_clt_destroy_sess_files(struct ibtrs_clt_sess *sess, + const struct attribute *sysfs_self); + +#endif /* IBTRS_CLT_H */