From patchwork Wed Jun 6 15:24:55 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Roman Pen X-Patchwork-Id: 10450451 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id CBBE960146 for ; Wed, 6 Jun 2018 15:26:00 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id BD276297CC for ; Wed, 6 Jun 2018 15:26:00 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id BBD4E297D0; Wed, 6 Jun 2018 15:26:00 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-7.9 required=2.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID, MAILING_LIST_MULTI, RCVD_IN_DNSWL_HI autolearn=unavailable version=3.3.1 Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id E8EDC297B4 for ; Wed, 6 Jun 2018 15:25:56 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932295AbeFFPZp (ORCPT ); Wed, 6 Jun 2018 11:25:45 -0400 Received: from mail-wr0-f194.google.com ([209.85.128.194]:37344 "EHLO mail-wr0-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932257AbeFFPZf (ORCPT ); Wed, 6 Jun 2018 11:25:35 -0400 Received: by mail-wr0-f194.google.com with SMTP id d8-v6so6760199wro.4 for ; Wed, 06 Jun 2018 08:25:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=profitbricks-com.20150623.gappssmtp.com; s=20150623; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=J4wbDYtIWWRmVFiTiva0C8BX8lFeQSRFtuFdxYiZcWQ=; b=fhdZycXFz3wqrS0MzKg3U4W1uV/+SvA1G0PN8bZRiazkIPOmGKCwpNEIb0y+6CRZjb y3foT1n1R+qvsIkYdO8FjJRpnw/MA2bBYcOKnxRVQOYJkEul4OgYnzjhC3xs5zWVRHTa 7lJA4+5w03+O6Il8z8OhwjuJ3EPIjZJo98Y3pPTIepT/VF0taDmlFBY2oqmdqg9+8Mdl To87kOsYsXeKYfU9CuPNxPMQj+HvZJQKCBrTFqETDRP1Mjf7j2nvA+h6poVL3NzqTdku uNha47Tkcp2Y+knzneLkxwYqeDWgBg42PTxB/a7wj6hbajOVyWCn4jtNE0xX9IcIPGUU 7zig== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=J4wbDYtIWWRmVFiTiva0C8BX8lFeQSRFtuFdxYiZcWQ=; b=s5KxCnpFcDtlSWtOy6NVAQWKkxbh2M3kZewI9X2pZlvfl2GG02Bp0v+0DwtqD/NOCZ FHJasmErP/AuUCD2pWj8Uusc9gnlsa/MfXBHYvIkuAzLp7nLix3ThQ3khTDIKT8/9l+t 7784prdYUhI2Ui65z9S0Tc8ZJZxVHWQoRihsG44LyD2M0TV5GDUHae2LVftEcwtf1I5J gKf7qLW4Jd8ieCF2rCIljqhRn9CrjhCab/dtdYmvoZvzDq0NqUVQXYOvb7HjsMIIyHY/ 14uTPYtjZjPhJ1sL9EJCsH2rJotdzSG7L6bhqFQgTwX6eKkVpwFJGao1EBUlUBFi6rZx Porg== X-Gm-Message-State: APt69E2xPX43zsKSfnf2hUhf8ZEP+SQ3ydA5cjNVkgeFXdhkz79f6LCr sDJLYeSFicV266GSwDwYp5q4yFA0KBQ= X-Google-Smtp-Source: ADUXVKJpWsGJ3V+30riLdLuIEe+4sqIEGBaK1ItlXOEg3EOHSjphx+AgOhPimgjgzn5nKBw8QxlxIQ== X-Received: by 2002:adf:9441:: with SMTP id 59-v6mr2580197wrq.274.1528298733792; Wed, 06 Jun 2018 08:25:33 -0700 (PDT) Received: from pb.pb.local ([62.217.45.26]) by smtp.gmail.com with ESMTPSA id n11-v6sm18645834wro.13.2018.06.06.08.25.32 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 06 Jun 2018 08:25:33 -0700 (PDT) From: Roman Pen To: linux-block@vger.kernel.org, linux-rdma@vger.kernel.org Cc: Jens Axboe , Christoph Hellwig , Sagi Grimberg , Bart Van Assche , Or Gerlitz , Doug Ledford , Danil Kipnis , Jack Wang , Roman Pen Subject: [PATCH v3 05/25] ibtrs: client: private header with client structs and functions Date: Wed, 6 Jun 2018 17:24:55 +0200 Message-Id: <20180606152515.25807-6-roman.penyaev@profitbricks.com> X-Mailer: git-send-email 2.13.1 In-Reply-To: <20180606152515.25807-1-roman.penyaev@profitbricks.com> References: <20180606152515.25807-1-roman.penyaev@profitbricks.com> Sender: linux-block-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-block@vger.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP This header describes main structs and functions used by ibtrs-client module, mainly for managing IBTRS sessions, creating/destroying sysfs entries, accounting statistics on client side. Signed-off-by: Roman Pen Signed-off-by: Danil Kipnis Cc: Jack Wang --- drivers/infiniband/ulp/ibtrs/ibtrs-clt.h | 315 +++++++++++++++++++++++++++++++ 1 file changed, 315 insertions(+) create mode 100644 drivers/infiniband/ulp/ibtrs/ibtrs-clt.h diff --git a/drivers/infiniband/ulp/ibtrs/ibtrs-clt.h b/drivers/infiniband/ulp/ibtrs/ibtrs-clt.h new file mode 100644 index 000000000000..3212a33a0bf5 --- /dev/null +++ b/drivers/infiniband/ulp/ibtrs/ibtrs-clt.h @@ -0,0 +1,315 @@ +/* + * InfiniBand Transport Layer + * + * Copyright (c) 2014 - 2017 ProfitBricks GmbH. All rights reserved. + * Authors: Fabian Holler + * Jack Wang + * Kleber Souza + * Danil Kipnis + * Roman Penyaev + * Milind Dumbare + * + * Copyright (c) 2017 - 2018 ProfitBricks GmbH. All rights reserved. + * Authors: Danil Kipnis + * Roman Penyaev + * Swapnil Ingle + * + * This program is free software; you can redistribute it and/or + * modify it under the terms of the GNU General Public License + * as published by the Free Software Foundation; either version 2 + * of the License, or (at your option) any later version. + * + * This program is distributed in the hope that it will be useful, + * but WITHOUT ANY WARRANTY; without even the implied warranty of + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the + * GNU General Public License for more details. + * + * You should have received a copy of the GNU General Public License + * along with this program; if not, see . + */ + +#ifndef IBTRS_CLT_H +#define IBTRS_CLT_H + +#include +#include "ibtrs-pri.h" + +/** + * enum ibtrs_clt_state - Client states. + */ +enum ibtrs_clt_state { + IBTRS_CLT_CONNECTING, + IBTRS_CLT_CONNECTING_ERR, + IBTRS_CLT_RECONNECTING, + IBTRS_CLT_CONNECTED, + IBTRS_CLT_CLOSING, + IBTRS_CLT_CLOSED, + IBTRS_CLT_DEAD, +}; + +static inline const char *ibtrs_clt_state_str(enum ibtrs_clt_state state) +{ + switch (state) { + case IBTRS_CLT_CONNECTING: + return "IBTRS_CLT_CONNECTING"; + case IBTRS_CLT_CONNECTING_ERR: + return "IBTRS_CLT_CONNECTING_ERR"; + case IBTRS_CLT_RECONNECTING: + return "IBTRS_CLT_RECONNECTING"; + case IBTRS_CLT_CONNECTED: + return "IBTRS_CLT_CONNECTED"; + case IBTRS_CLT_CLOSING: + return "IBTRS_CLT_CLOSING"; + case IBTRS_CLT_CLOSED: + return "IBTRS_CLT_CLOSED"; + case IBTRS_CLT_DEAD: + return "IBTRS_CLT_DEAD"; + default: + return "UNKNOWN"; + } +} + +enum ibtrs_mp_policy { + MP_POLICY_RR, + MP_POLICY_MIN_INFLIGHT, +}; + +struct ibtrs_clt_stats_reconnects { + int successful_cnt; + int fail_cnt; +}; + +struct ibtrs_clt_stats_wc_comp { + u32 cnt; + u64 total_cnt; +}; + +struct ibtrs_clt_stats_cpu_migr { + atomic_t from; + int to; +}; + +struct ibtrs_clt_stats_rdma { + struct { + u64 cnt; + u64 size_total; + } dir[2]; + + u64 failover_cnt; +}; + +struct ibtrs_clt_stats_rdma_lat { + u64 read; + u64 write; +}; + +#define MIN_LOG_SG 2 +#define MAX_LOG_SG 5 +#define MAX_LIN_SG BIT(MIN_LOG_SG) +#define SG_DISTR_SZ (MAX_LOG_SG - MIN_LOG_SG + MAX_LIN_SG + 2) + +#define MAX_LOG_LAT 16 +#define MIN_LOG_LAT 0 +#define LOG_LAT_SZ (MAX_LOG_LAT - MIN_LOG_LAT + 2) + +struct ibtrs_clt_stats_pcpu { + struct ibtrs_clt_stats_cpu_migr cpu_migr; + struct ibtrs_clt_stats_rdma rdma; + u64 sg_list_total; + u64 sg_list_distr[SG_DISTR_SZ]; + struct ibtrs_clt_stats_rdma_lat rdma_lat_distr[LOG_LAT_SZ]; + struct ibtrs_clt_stats_rdma_lat rdma_lat_max; + struct ibtrs_clt_stats_wc_comp wc_comp; +}; + +struct ibtrs_clt_stats { + bool enable_rdma_lat; + struct ibtrs_clt_stats_pcpu __percpu *pcpu_stats; + struct ibtrs_clt_stats_reconnects reconnects; + atomic_t inflight; +}; + +struct ibtrs_clt_con { + struct ibtrs_con c; + unsigned cpu; + atomic_t io_cnt; + int cm_err; +}; + +/** + * ibtrs_tag - tags the memory allocation for future RDMA operation + */ +struct ibtrs_tag { + enum ibtrs_clt_con_type con_type; + unsigned int cpu_id; + unsigned int mem_id; + unsigned int mem_off; +}; + +struct ibtrs_clt_io_req { + struct list_head list; + struct ibtrs_iu *iu; + struct scatterlist *sglist; /* list holding user data */ + unsigned int sg_cnt; + unsigned int sg_size; + unsigned int data_len; + unsigned int usr_len; + void *priv; + bool in_use; + struct ibtrs_clt_con *con; + struct ibtrs_sg_desc *desc; + struct ib_sge *sge; + struct ibtrs_tag *tag; + enum dma_data_direction dir; + ibtrs_conf_fn *conf; + unsigned long start_jiffies; + + struct ib_mr *mr; + struct ib_cqe inv_cqe; + struct completion inv_comp; + int inv_errno; + bool need_inv_comp; + bool need_inv; +}; + +struct ibtrs_rbuf { + u64 addr; + u32 rkey; +}; + +struct ibtrs_clt_sess { + struct ibtrs_sess s; + struct ibtrs_clt *clt; + wait_queue_head_t state_wq; + enum ibtrs_clt_state state; + atomic_t connected_cnt; + struct mutex init_mutex; + struct ibtrs_clt_io_req *reqs; + struct delayed_work reconnect_dwork; + struct work_struct close_work; + unsigned reconnect_attempts; + bool established; + struct ibtrs_rbuf *rbufs; + size_t max_io_size; + u32 max_hdr_size; + u32 chunk_size; + size_t queue_depth; + u32 max_pages_per_mr; + int max_sge; + struct kobject kobj; + struct kobject kobj_stats; + struct ibtrs_clt_stats stats; + /* cache hca_port and hca_name to display in sysfs */ + u8 hca_port; + char hca_name[IB_DEVICE_NAME_MAX]; + struct list_head __percpu + *mp_skip_entry; +}; + +struct ibtrs_clt { + struct list_head /* __rcu */ paths_list; + size_t paths_num; + struct ibtrs_clt_sess + __rcu * __percpu *pcpu_path; + + bool opened; + uuid_t paths_uuid; + int paths_up; + struct mutex paths_mutex; + struct mutex paths_ev_mutex; + char sessname[NAME_MAX]; + short port; + unsigned max_reconnect_attempts; + unsigned reconnect_delay_sec; + unsigned max_segments; + void *tags; + unsigned long *tags_map; + size_t queue_depth; + size_t max_io_size; + wait_queue_head_t tags_wait; + size_t pdu_sz; + void *priv; + link_clt_ev_fn *link_ev; + struct device dev; + struct kobject kobj_paths; + enum ibtrs_mp_policy mp_policy; +}; + +static inline struct ibtrs_clt_con *to_clt_con(struct ibtrs_con *c) +{ + return container_of(c, struct ibtrs_clt_con, c); +} + +static inline struct ibtrs_clt_sess *to_clt_sess(struct ibtrs_sess *s) +{ + return container_of(s, struct ibtrs_clt_sess, s); +} + +/* See ibtrs-log.h */ +#define TYPES_TO_SESSNAME(obj) \ + LIST(CASE(obj, struct ibtrs_clt_sess *, s.sessname), \ + CASE(obj, struct ibtrs_clt *, sessname)) + +#define TAG_SIZE(clt) (sizeof(struct ibtrs_tag) + (clt)->pdu_sz) +#define GET_TAG(clt, idx) ((clt)->tags + TAG_SIZE(clt) * idx) + +int ibtrs_clt_reconnect_from_sysfs(struct ibtrs_clt_sess *sess); +int ibtrs_clt_disconnect_from_sysfs(struct ibtrs_clt_sess *sess); +int ibtrs_clt_create_path_from_sysfs(struct ibtrs_clt *clt, + struct ibtrs_addr *addr); +int ibtrs_clt_remove_path_from_sysfs(struct ibtrs_clt_sess *sess, + const struct attribute *sysfs_self); + +void ibtrs_clt_set_max_reconnect_attempts(struct ibtrs_clt *clt, int value); +int ibtrs_clt_get_max_reconnect_attempts(const struct ibtrs_clt *clt); + +/* ibtrs-clt-stats.c */ + +int ibtrs_clt_init_stats(struct ibtrs_clt_stats *stats); +void ibtrs_clt_free_stats(struct ibtrs_clt_stats *stats); + +void ibtrs_clt_decrease_inflight(struct ibtrs_clt_stats *s); +void ibtrs_clt_inc_failover_cnt(struct ibtrs_clt_stats *s); + +void ibtrs_clt_update_rdma_lat(struct ibtrs_clt_stats *s, bool read, + unsigned long ms); +void ibtrs_clt_update_wc_stats(struct ibtrs_clt_con *con); +void ibtrs_clt_update_all_stats(struct ibtrs_clt_io_req *req, int dir); + +int ibtrs_clt_reset_sg_list_distr_stats(struct ibtrs_clt_stats *stats, + bool enable); +int ibtrs_clt_stats_sg_list_distr_to_str(struct ibtrs_clt_stats *stats, + char *buf, size_t len); +int ibtrs_clt_reset_rdma_lat_distr_stats(struct ibtrs_clt_stats *stats, + bool enable); +ssize_t ibtrs_clt_stats_rdma_lat_distr_to_str(struct ibtrs_clt_stats *stats, + char *page, size_t len); +int ibtrs_clt_reset_cpu_migr_stats(struct ibtrs_clt_stats *stats, bool enable); +int ibtrs_clt_stats_migration_cnt_to_str(struct ibtrs_clt_stats *stats, char *buf, + size_t len); +int ibtrs_clt_reset_reconnects_stat(struct ibtrs_clt_stats *stats, bool enable); +int ibtrs_clt_stats_reconnects_to_str(struct ibtrs_clt_stats *stats, char *buf, + size_t len); +int ibtrs_clt_reset_wc_comp_stats(struct ibtrs_clt_stats *stats, bool enable); +int ibtrs_clt_stats_wc_completion_to_str(struct ibtrs_clt_stats *stats, char *buf, + size_t len); +int ibtrs_clt_reset_rdma_stats(struct ibtrs_clt_stats *stats, bool enable); +ssize_t ibtrs_clt_stats_rdma_to_str(struct ibtrs_clt_stats *stats, + char *page, size_t len); +bool ibtrs_clt_sess_is_connected(const struct ibtrs_clt_sess *sess); +int ibtrs_clt_reset_all_stats(struct ibtrs_clt_stats *stats, bool enable); +ssize_t ibtrs_clt_reset_all_help(struct ibtrs_clt_stats *stats, + char *page, size_t len); + +/* ibtrs-clt-sysfs.c */ + +int ibtrs_clt_create_sysfs_root_folders(struct ibtrs_clt *clt); +int ibtrs_clt_create_sysfs_root_files(struct ibtrs_clt *clt); +void ibtrs_clt_destroy_sysfs_root_folders(struct ibtrs_clt *clt); +void ibtrs_clt_destroy_sysfs_root_files(struct ibtrs_clt *clt); + +int ibtrs_clt_create_sess_files(struct ibtrs_clt_sess *sess); +void ibtrs_clt_destroy_sess_files(struct ibtrs_clt_sess *sess, + const struct attribute *sysfs_self); + +#endif /* IBTRS_CLT_H */