Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01426 |
Symbol | rhsDE |
ID | 8116142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1481789 |
End bp | 1485439 |
Gene Length | 3651 bp |
Protein Length | 1216 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644847669 |
Product | hypothetical protein |
Protein accession | YP_002999242 |
Protein GI | 251784938 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTACCGGGTG CTGAGGACGT ACTGCCAGCG CCGCTGCCGC CGTACCGGGT GCTTACCGGG CTGGCAGACC GTTTCGGGCG GACGCTGACG TACCGGCGTG AGGCCGCAGG CGACCTGGCC GGGGAAATCA CCGGCGTGAC GGACGGTGCC GGGCGGGAGT TCCGTCTGGT GCTGACCACG CAGGCGCAGC GTGCGGAAGA GGCCCGCACC TCTTCGCTAT CTTCTTCTGA CAGTTCCCGC CCTCTCTCAG CCTCAGCGTT CCCCGACACA CTGCCCGGTA CCGAATACGG CCCCGACAGG GGTATCCGCC TTTCGGCGGT GTGGCTGATG CACGACCCGG CATACCCGGA GAGCCTGCCC GCTGCGCCAC TGGTGCGGTA CACGTATACG GAAGCCGGTG AACTGCTGGC GGTATATGAC CGCAGCAATA CGCAGGTGCG CGCTTTCACG TATGACGCGC AGCACCCGGG CCGGATGGTG GCGCACCGTT ACGCGGGAAG GCCGGAGATG CGCTACCGCT ACGACGATAC GGGGCGGGTG GTGGAGCAAC TGAACCCGGC AGGGTTAAGC TACCGCTATC TTTATGAGCA GGACCGCATC ACCGTCACCG ACAGCCTGAA CCGGCGTGAG GTGCTGCATA CAGAAGGCGG GGCCGGGCTG AAACGGGTGG TGAAAAAAGA ACTGGCGGAC GGCAGCGTCA CGCGCAGCGG GTATGACGCG GCAGGAAGGC TCACGGCGCA GACGGACGCG GCGGGACGGA GGACAGAGTA CGGTCTGAAT GTGGTGTCCG GCGATATCAC GGACATCACC ACACCGGACG GGCGGGAGAC GAAATTTTAC TATAACGACG GGAACCAGCT GACGGCGGTG GTGTACCCGG ACGGGCTGGA GAGCCGCCGG GAATATGATG AACCGGGCAG GCTGGTATCG GAGACATCGC GCAGCGGGGA GACAGTACGC TACCGCTACG ATGACGCGCA CAGTGAGTTA CCGGCGACGA CAACGGATGC GACGGGCAGC ACCCGGCAGA TGACGTGGAG CCGCTACGGG CAGTTGCTGG CGTTCACTGA CTGCTCGGGC TACCAGACCC GCTATGAATA CGACCGCTTC GGCCAGATGA CGGCGGTTCA CCGCGAGGAA GGTATCAGCC TTTACCGCCA CTATGACAAC CGTGGCCGGT TAACCTCGGT GAAAGACGCA CAGGGCCGTG AAACGCGGTA TGAATACAAC GCCGCAGGCG ACCTGACTGC CGTCATTACC CCGGACGGCA ACCGGAGCGA GACACAGTAC GATGCGTGGG GAAAGGCGGT CAGCACCACG CAGGGCGGGC TGACGCGCAG TATGGAGTAC GATGCTGCCG GACGTGTCAT CAGCCTGACC AACGAGAACG GCAGCCACAG CGTCTTCAGT TACGATGCGC TGGACCGGCT GGTACAGCAG GGCGGCTTTG ACGGGCGGAC GCAACGTTAT CATTATGACC TGACCGGAAA ACTCACGCAG AGTGAGGATG AGGGACTTGT CATCCTCTGG TACTACGATG CGTCGGACCG CATCACGCAC CGCACGGTGA ACGGCGAACC GGCAGAGCAG TGGCAGTATG ATGGCCACGG CTGGCTGACA GACATCAGCC ACCTGAGCGA AGGCCACCGT GTTGCCGTCC ACTATGGCTA TGACGATAAA GGCCGCCTGA CCGGCGAATG CCAGACGGTG GAGAACCCGG AGACGGGGGA ACTGCTGTGG CAGCATGAGA CGAAACACGC ATACAACGAG CAGGGGCTGG CAAACCGCGT CACGCCGGAC AGCCTGCCGC CGGTGGAGTG GCTGACGTAT GGCAGCGGTT ACCTGGCGGG CATGAAGCTG GGCGGGACGC CGCTGGTCGA GTATACGCGG GACAGGCTGC ACCGTGAGAC GGTGCGCAGC TTCGGCAGCA TGGCAGGCAG TAATGCCGCA TACGAACTGA CCAGCACATA CACCCCCGCA GGCCAGTTAC AGAGCCAGCA CCTGAACAGC CTGGTATATG ACCGTGACTA CGGGTGGAGT GACAACGGCG ACCTGGTGCG CATCAGCGGC CCGCGACAGA CGCGGGAATA CGGCTACAGC GCCACGGGCA GGCTGGAGAG TGTGCGCACC CTCGCACCAG ACCTGGACAT CCGCATCCCG TATGCCACGG ACCCGGCGGG CAACCGGCTG CCGGACCCGG AGCTGCACCC GGACAGTACA CTCACAGTGT GGCCGGATAA CCGCATCGCG GAGGATGCGC ACTATGTCTA CCGCCACGAT GAATACGGCA GGCTGACGGA GAAGACGGAC CGCATCCCGG CGGGTGTTAT CCGGACGGAC GACGAGCGGA CCCACCACTA CCACTACGAC AGCCAGCACC GCCTGGTGTT CTACACGCGG ATACAGCATG GCGAGCCACT GGTCGAGAGC CGCTACCTCT ACGACCCGCT GGGACGGCGA ATGGCAAAAC GGGTCTGGCG GCGGGAGCGT GACCTGACGG GGTGGATGTC GCTGTCGCGT AAACCGGAGG TGACGTGGTA TGGCTGGGAC GGAGACAGGC TGACGACGGT GCAGACTGAC ACCACACGTA TCCAGACGGT ATACGAGCCG GGAAGCTTCA CGCCGCTCAT CCGGGTCGAG ACAGAGAACG GCGAGCGGGA AAAAGCGCAG CGGCGCAGCC TGGCAGAGAC GCTCCAGCAG GAAGGGAGTG AGAACGGCCA CGGCGTGGTG TTCCCGGCTG AACTGGTGCG GCTGCTGGAC AGGCTGGAGG AAGAAATCCG GGCAGACCGC GTGAGCAGTG AAAGCCGGGC GTGGCTTGCG CAGTGCGGGC TGACGGTGGA GCAACTGGCC AGACAGGTGG AGCCGGAATA CACACCGGCG CGAAAAGTTC ATTTTTACCA CTGCGACCAC CGGGGCCTGC CGCTGGCGCT CATCAGCGAA GACGGCAATA CGGCGTGGCG CGGGGAGTAT GATGAATGGG GCAACCAGCT TAATGAGGAG AACCCGCATC ACCTGCACCA GCCGTACCGT CTGCCAGGGC AGCAGCATGA TGAGGAGTCG GGGCTGTACT ATAACCGTCA CCGGCACTAC GATCCGTTGC AGGGGCGGTA TATCACCCCG GACCCGATTG GGTTGAGAGG TGGATGGAAT ATGTATCAGT ATCCGTTGAA TCCCATACAA GTGATAGACC CAATGGGGTT AGATGCGATT GAGAATATGA CATCAGGTGG ACTAATTTAT GCCGTATCTG GTGTACCTGG ATTGATTGCT GCAAACAGCA TTACTAACAG TGCTTACCAG TTCGGTTATG ATATGGATGC TATTGTTGGC GGAGCTCATA ATGGGGCCGC CGATGCAATG AGACATTGTT ACTTGATGTG TCGAATGACT AAGACATTTG GATCAACAAT AGCTGACGTG ATAGGTAAAA ATCATGAGGC GGCAGGGGAT AGACAAGGTC AGCCAGCTAA AAAAAGAATC ATGGATCTTA AAAATAACAC TGTCGGTATT GCTTGTGGCG ATTTTTCTGC CAAATGTAGC GATGCATGTA TTGAAAAATA TAACACTGGG CAACTCTTCG GGTTAGATGG TATAAAAGCA GATAATCCAA TAAAAGCAAA GCAAGGGAGT TCAGATGCTT CAAATTATTA G
|
Protein sequence | VPGAEDVLPA PLPPYRVLTG LADRFGRTLT YRREAAGDLA GEITGVTDGA GREFRLVLTT QAQRAEEART SSLSSSDSSR PLSASAFPDT LPGTEYGPDR GIRLSAVWLM HDPAYPESLP AAPLVRYTYT EAGELLAVYD RSNTQVRAFT YDAQHPGRMV AHRYAGRPEM RYRYDDTGRV VEQLNPAGLS YRYLYEQDRI TVTDSLNRRE VLHTEGGAGL KRVVKKELAD GSVTRSGYDA AGRLTAQTDA AGRRTEYGLN VVSGDITDIT TPDGRETKFY YNDGNQLTAV VYPDGLESRR EYDEPGRLVS ETSRSGETVR YRYDDAHSEL PATTTDATGS TRQMTWSRYG QLLAFTDCSG YQTRYEYDRF GQMTAVHREE GISLYRHYDN RGRLTSVKDA QGRETRYEYN AAGDLTAVIT PDGNRSETQY DAWGKAVSTT QGGLTRSMEY DAAGRVISLT NENGSHSVFS YDALDRLVQQ GGFDGRTQRY HYDLTGKLTQ SEDEGLVILW YYDASDRITH RTVNGEPAEQ WQYDGHGWLT DISHLSEGHR VAVHYGYDDK GRLTGECQTV ENPETGELLW QHETKHAYNE QGLANRVTPD SLPPVEWLTY GSGYLAGMKL GGTPLVEYTR DRLHRETVRS FGSMAGSNAA YELTSTYTPA GQLQSQHLNS LVYDRDYGWS DNGDLVRISG PRQTREYGYS ATGRLESVRT LAPDLDIRIP YATDPAGNRL PDPELHPDST LTVWPDNRIA EDAHYVYRHD EYGRLTEKTD RIPAGVIRTD DERTHHYHYD SQHRLVFYTR IQHGEPLVES RYLYDPLGRR MAKRVWRRER DLTGWMSLSR KPEVTWYGWD GDRLTTVQTD TTRIQTVYEP GSFTPLIRVE TENGEREKAQ RRSLAETLQQ EGSENGHGVV FPAELVRLLD RLEEEIRADR VSSESRAWLA QCGLTVEQLA RQVEPEYTPA RKVHFYHCDH RGLPLALISE DGNTAWRGEY DEWGNQLNEE NPHHLHQPYR LPGQQHDEES GLYYNRHRHY DPLQGRYITP DPIGLRGGWN MYQYPLNPIQ VIDPMGLDAI ENMTSGGLIY AVSGVPGLIA ANSITNSAYQ FGYDMDAIVG GAHNGAADAM RHCYLMCRMT KTFGSTIADV IGKNHEAAGD RQGQPAKKRI MDLKNNTVGI ACGDFSAKCS DACIEKYNTG QLFGLDGIKA DNPIKAKQGS SDASNY
|
| |