Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1767 |
Symbol | |
ID | 6143039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1776915 |
End bp | 1782869 |
Gene Length | 5955 bp |
Protein Length | 1984 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616643 |
Product | putative autotransporter |
Protein accession | YP_001743821 |
Protein GI | 170680870 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.209421 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAGA AAACGTTATT ATCCGCCTGT ATCGCGCTGG CTTTAAGCGG ATCGGGCTGG GCTGCTGACA TTGATGATAC AGATAGTGCA ACTCGTCAGC GTAAGGAGAC AAGGATACCC TGTCCGACCG CTCACTCCTC TGAAAAACTA AGTCCGCAAC AACTAAAATC GCTCCCGTCT GAATGTTCTA CAACCAATGA CAACAACCTC TATTCCTTGA TTGCCGTTGG CGCTACTTCA CTAATCACCA CTCTTGCAGT CCTTGAACTA AACCACGATG ACGGTAACCA CGCTCATTCC TCTGACAATC CTCCAGTACC GCCTGATGAC GATAATGGCG GCAACACACC GGACGATGGC GGCAATACCC CTGACGATGG TGGGAATACT CCTGACGATG GCGGCAACAC ACCGGATGAT GGTGGCAACA CACCGGATGA TGGTGGCAAC ACACCGGATG ATGGTGGCAA CACACCGGAC GATGGCGGCA ATACCCCTGA CGATGGTGGG AATACTCCTG ACGATGGCGG CAATACTCCT GACGATGGCG GCAACACACC GGACGATGGC GGCAATACCC CTGACGATGG CGGCAATACC CCTGACGATG GTGGGAATAC TCCTGACGAT GGCGGCAACA CACCGGATGA TGGTGGCAAT ACCCCAGACG ATGGCGGCAA CGTCACCCCG CCCAAAGAGC CTAAAATCTT CAATAACAAT GTCACGTTCG ACGAAGATAA AGGCACGCTG AAAATTCGTA ACGCCACCTT TACCTACAGC AAAAACACCG ATGGAACTTA TACCCTGACG GCTGGAGATG GCCGGACCAC TGTTGTACAA GGCTGGGATG TCGACACGGC TGCCAATACT GTAGAAATTA CTGGCGTGAA TACCCAGGGC GGTATGACGT GGCGTTACGG TAAAGACGGC ATTATCTATA TCACTAAAAC CGTCGGCGCA ACGGTGGATG ACCCCGCGAG CAGCAACGTA TTTAACCTCA GCGATGCGGT GCTCACTGAC CAGGGCGGTA ATGCCGCGCT GAATGGCGCA ACGGTGATTG AAATTAACGG CAGCAGGATC GTCCTTAATA ACGACGGTGA CATTTCTGCT ACGGGAAAAG ATTCTGTGGT TGTGGCGATG ACCGGAAACG ACATTACCGT TAACAATAAC GGCCATATGG TTGTCGATGG CGGCACCGCG GGCGTGGTTA ACGGCGATCG TGCGATCCTG AATAACCGTG GTGATGCGGT GATCACTAAC GGGGGCGCGG GTGTCATTGT GACGGGCGAC AACGCGGTCA TCAACAACAC GGGGCAAAGC GATATTGATG GCGACAACTC GGTATCGGTA AAAGTCGCAG GAAACGCAAC CAGAATCAAA ATGGAAGGCG GGCTCAACGT CAGCGGTGGC GCGCATGGCA TTGATGCCAC CGGTGACAAT AACGAAGTCA GCAACAAGGG CAACATTTCG GTGGTGGATG CGCATTCCAC GGGCGTGCTG CTGAACGGCG ATCGCGCGTC ATTCGTTAAT ATGGGTGATA TTAACGTTAG CGGAGGTGCA GCGGACGATC ACGCTATTGG TGTGCAGATT AACGGCGATA ACAGTACCTT TATCAACGTT GGAGATTTAA ATGCCGACGA CAGGGCGACC GGGGTGAAAA TTACCGGTGA CGCCAGCGAC ATTGCCCTGG CCGGGGCGAT GCATGTCGGG GATTTTGCCT CCGGGCTGGA GGTGACAGGT AAGAACAATG ACCTGTCGTT GTCCACCAAT ATGATGGATG TCACCGGGAA ACAGTCCACC GGGGTGACCA TTACAGGTGA TGAAAATACC ATTGATATCA CCGGTGATAT GGTCGTTGAT CAAAACTCTG TCGGGACGAA GATTGTTGGT GATCGCGTTT CGTTGCAACA GAAAGGTGAT ATTACCGTTA ACGGTGCGGG GCACGGCGTT GAAGTCAGCG GTAGTAAGGC GACAATCAAT AATCAGGGCA AACTGACCGT AAAAGATCAA GACTCAATCG GTATCGCTAT CACCGGGGAT GACGCGCAAT TTACCACCGT GGGTGAGATT GATGTCTCGC TGAATGGTAC GGGTGTTGCG ATCAGCGGCG ATCGTGAACA AGTGAATTTG AGCGGTGATA TAAATATCAT TCAGGAGCGT GACGACAGCG GTACTTTCCA GGGCGGGACG GGCATCAGCA TAATGAGCAA CGACAGCAGC ATGCTGTTGG CGGGAAACAT TAATGTGACG TCCAGCATAG GGGACCAACC GTCCACCTCA CACCTGTCGT TAACCGGCGT TACCATCGGG GGGGAAAACA ATACTGTCGA TCTGCAAGGC GACGTCAATA TTACTGTCGA TTCGGGTTAC CCGGGGGATG AAAACAGTCT TTACGGCGTG GTGGTTTCGG GGTCAAATAA CATCATCAGC CTCGATGGTG GTATCAATAT TTCCGGCGAT AGTGGCGGAC ATGTTGTCAA AGGCGTGCAG GTAACCGGGA ATAATACCGT CAATATTAGT GGTCATTCCG TAATGAATAC CCGACAGGTA TTGGGAGCCT TCTCTTTGAT TTCGGTGGCT GATGGCGGGA ATGTCGTATT TGATGAAAAT GCGATAACCG AGATCCAAAG TTCAGTGAGA AATACTACTG CCCCTCTATT TGTTGATTCA TCGTTAATTG TTGCTAAGGG AAATCAGTCT GTTATTCAGA ACAAAGGCAT TATTAACACA ACCGATGCCC AGGGATTAAT GATGGCGAAT TCGGGGGCGA AGGTTGTCAA TGCCGGGGTG ATTAATATCA GGCCAGATGC TGAATCACAC AGTTTCTTTG CTGGAATGCT GGCAAGAGGC GATGACTCGC AGGCAAAAAA TGTCTCCGGA GGGACAATCA ATCTGACATC CATTACCCAG CCTTCTTACG GCGGAGGATT TAGTGAATAC CCGGTGAAAT GGTATTTCAA CACGGGCTAT GCATTGCTGG CCAGTAATTA CGGTTCGGTA ATTAATGAAG CGGGAGCGAC CATTAATTTG CATGGGGCAG GAACCTATGG TGTTTCCGCG TCAAAAGGAA CGGCGACCAA TGCGGGCGAG ATAAATGTCG ACGGCTTTGT ACCAACAGTA GATGAAAACG GTAATATTAG TGATGAAACT TACTGGCAGA CCAATTCCGT ATATCTGATG GGCGGCGGGA TGTTGGCCGG TTCAACGGAT ACCGGCAACG GTGATGCAAA AGCGGTGAAT ACCGGTACCA TCAACGTTAA GAATGAAGGC TTCGGCATGC TGGCAATGAG CGGCGGGACC GTTGTAAACC AGGGGACAAT TAATCTCACT GCCGATGAGG GCGTGACGAA ACAGCAAGAT AACCAACTGT TCGCGCTGGG TGCAATTCAA CGTGGTCTGG CGATTAATGA CCAGAATGGG GTTATTAATA TTAACACTGA TATTGGCCAG GCGTTTTATA AAGATAGCAC CGGCACCATT CTCAACTACG GCAAAATTAA TCTTTTCGGT AATCCAATGG ATGAAAGTGA TTCCCATATG GGCGTTGCGC CGGATGACAA AGATGTTCTC AGTGAGCTGT CTGGTAGCGG CGAGAGCATC AGCAAAACCA CAACTGCCGA CGGTTTTATC GCCGTAAATG ACCTGGCAAA CTATGGCGAC GAAACGCTTA ATGGCGACGT TACAGCAAAC GGCTGGATAT TCAACCAACC TGATGCCAGC CTGACAATCA ACGGCGAACT GAGCGTAAAC CAGGGGTTGG AAAACAGTGG TCACCTGACA ACCGATCTGC TGACGTTGAA TGGCGCGGTT TCCTTCTTTA ATGAAGGCGA ATTTAGTGGT TCAATTACCG GAAACAGCTA TCAACAGAAT GTGGTGAATA CCGGTGAAAT GACGGTGACC GAAGATGGCC ACTCACTTGT CAACGGAAGT TTCCTGTTCT TCAATGAAGC TGGCGCGACC CTGACAAATA GCGGTAATGC GGTAACGGGC GGGGAAAACG CGATTATTCA TGTAACCAGA ACCGGTGATT CTGTTTCGCA GGTTAACCGT GGCACCATCA CGGCCATAAA TGGCTATAGC GCAATCAAAA CTGAAAATAC TGGCTCGTAC AGTAACGGGA AATGGATTTG GAATACGGAA ACGGGAGTGA TAAATGGTAT TAATCCTGCG GCTCCGTTGG TTGATTTAGG ACGAGGTTAT AATTTTTCTA ACGCTGGCGT TATTAATGTT CAGGGCGATA ATGCGGTGGG CATCAGTGGC GGCATAACCA GCTATACTGT CAAGTTGGTG AACAGTGGCA CCATTAATGT GGGTACTGAA CAGGGGCAAC TGGATGGAAC CAACGGCGAA GGCTTAATTG GTATTAAGGG CAATGGTAAG GACACGACAA TTAATAACAC GCAAACGGGT GTGATTAATG TCTATGCCGA TAACTCCTGG GCGTTTGGTG GGCAAACGAA AGCCATCATT AATAACGGTG AAATTAATTT GCTGTGTGAC ACCGGATGCG ATATTTATGC TCCTGGAACG ACGGGTACGA AAAACGATCA CAACGGTACT GCGGATATCA CCGTACCAGA GGCATCAACC ACGCCGTCAC AAGGTAATGT TCCAACGCCG CCTGCCGATT CAAACGCACC GCAACTGCTG AGCAACTATA CCATCGGTAC CAACAGCGAC GGGAGTTCGG GTACGCTCAG CGCGAATAAT CTGGTTATTG GCGACAACGT GAGTGTTAAC GCCGGGTTTA GTGCCGGAAC AGCGGACACC ACCGTTGTCG TTAACGATGT ATTCAAAGGC GAAAACATCA GCGGTGTAGA TAACATTGTT TCCTCTACGG TCGTCTGGAC TGCCAAAGGC AGTACCGACG CCAGCGGCAA CGTTGACGTG ACCATGAGCA AAAACGCCTA CACCGATGTA GCGACCGATG GCTCAGTGAG TGACGTGGCG AAGGCACTTG ATGCGGGTTA TACCAACAAC GAGCTGTACA CCAGTCTGAA CGTGGGCACC ACCGCCGAAC TGAACAGCGC GCTGAAACAA ATCAGTGGTA GCCAGGCGAC CACCGTATTC CGTGAGGCGC GCGTGTTAAG CAACCGCTTC AGCATGCTGG CGGATGCAGC GCCGAAGATG GGCAACGGTC TGGCGTTCAA CGTGGTGGCG AAAGGTGATC CGCGTGCCGA ACTGGGTAAC AATACCGAGT ACGACATGCT GGCGCTGCGT AAAACAGTTG ACCTGAGCGA AAGCCAGACC ATGAGCCTGG AATACGGTAT CGCGCGTCTG GATGGTGACG GTGCGCAAAA AGCAGGCGAC AACGGAGTAA CCGGCGGCTA CAGCCAGTTC TTTGGACTGA AACATCAGAT GTCCTTCGAC AATGGCATGA ACTGGAATAA CGCGCTGCGT TACGATATTC ATCAACTGGA CAGCAGCCGC TCGGTGGCTT ACGGCGACGT GAGCAAAACG GCGGATACCA ACGTGAAACA GCAGTACCTG GAGTTCCGTA GCGAAGGGGC GAAAACCTTT GAACCGCGCG AAGGGCTGAA AATCACCCCG TATGCCGGTG TGAAACTGCG TCACACGCTG GAGGGCGGCT ATCAGGAACG CAATGCCGGA GACTTCAACC TGAGCATGAA CAGTGGCAGC GAAACGGCGG TGGACAGCAT CGTCGGGCTG AAACTGGACT ACGCAGGCAA AGACGGCTGG AGTGCGAACG CCACTCTGGA AGGTGGGCCG AACCTGAGCT ACGCGAAGAG CCAGCGCACG GCAAGCCTGG CAGGCGCAGG CAGCCAGCAC TTTAATGTCG ATGATGGACA GAAGGGCGGC AGTATCAACA GCCTGGCAAG CGTCGGCGTG AAGTACAGCA GCAAAGAGAG CTCGCTGAAT CTGGATGCCT ATCACTGGAA AGAAGACGGC GTCAGCGATA AAGGCGTGAT GCTCAATTTC AAGAAAACGT TCTAA
|
Protein sequence | MQKKTLLSAC IALALSGSGW AADIDDTDSA TRQRKETRIP CPTAHSSEKL SPQQLKSLPS ECSTTNDNNL YSLIAVGATS LITTLAVLEL NHDDGNHAHS SDNPPVPPDD DNGGNTPDDG GNTPDDGGNT PDDGGNTPDD GGNTPDDGGN TPDDGGNTPD DGGNTPDDGG NTPDDGGNTP DDGGNTPDDG GNTPDDGGNT PDDGGNTPDD GGNTPDDGGN TPDDGGNVTP PKEPKIFNNN VTFDEDKGTL KIRNATFTYS KNTDGTYTLT AGDGRTTVVQ GWDVDTAANT VEITGVNTQG GMTWRYGKDG IIYITKTVGA TVDDPASSNV FNLSDAVLTD QGGNAALNGA TVIEINGSRI VLNNDGDISA TGKDSVVVAM TGNDITVNNN GHMVVDGGTA GVVNGDRAIL NNRGDAVITN GGAGVIVTGD NAVINNTGQS DIDGDNSVSV KVAGNATRIK MEGGLNVSGG AHGIDATGDN NEVSNKGNIS VVDAHSTGVL LNGDRASFVN MGDINVSGGA ADDHAIGVQI NGDNSTFINV GDLNADDRAT GVKITGDASD IALAGAMHVG DFASGLEVTG KNNDLSLSTN MMDVTGKQST GVTITGDENT IDITGDMVVD QNSVGTKIVG DRVSLQQKGD ITVNGAGHGV EVSGSKATIN NQGKLTVKDQ DSIGIAITGD DAQFTTVGEI DVSLNGTGVA ISGDREQVNL SGDINIIQER DDSGTFQGGT GISIMSNDSS MLLAGNINVT SSIGDQPSTS HLSLTGVTIG GENNTVDLQG DVNITVDSGY PGDENSLYGV VVSGSNNIIS LDGGINISGD SGGHVVKGVQ VTGNNTVNIS GHSVMNTRQV LGAFSLISVA DGGNVVFDEN AITEIQSSVR NTTAPLFVDS SLIVAKGNQS VIQNKGIINT TDAQGLMMAN SGAKVVNAGV INIRPDAESH SFFAGMLARG DDSQAKNVSG GTINLTSITQ PSYGGGFSEY PVKWYFNTGY ALLASNYGSV INEAGATINL HGAGTYGVSA SKGTATNAGE INVDGFVPTV DENGNISDET YWQTNSVYLM GGGMLAGSTD TGNGDAKAVN TGTINVKNEG FGMLAMSGGT VVNQGTINLT ADEGVTKQQD NQLFALGAIQ RGLAINDQNG VININTDIGQ AFYKDSTGTI LNYGKINLFG NPMDESDSHM GVAPDDKDVL SELSGSGESI SKTTTADGFI AVNDLANYGD ETLNGDVTAN GWIFNQPDAS LTINGELSVN QGLENSGHLT TDLLTLNGAV SFFNEGEFSG SITGNSYQQN VVNTGEMTVT EDGHSLVNGS FLFFNEAGAT LTNSGNAVTG GENAIIHVTR TGDSVSQVNR GTITAINGYS AIKTENTGSY SNGKWIWNTE TGVINGINPA APLVDLGRGY NFSNAGVINV QGDNAVGISG GITSYTVKLV NSGTINVGTE QGQLDGTNGE GLIGIKGNGK DTTINNTQTG VINVYADNSW AFGGQTKAII NNGEINLLCD TGCDIYAPGT TGTKNDHNGT ADITVPEAST TPSQGNVPTP PADSNAPQLL SNYTIGTNSD GSSGTLSANN LVIGDNVSVN AGFSAGTADT TVVVNDVFKG ENISGVDNIV SSTVVWTAKG STDASGNVDV TMSKNAYTDV ATDGSVSDVA KALDAGYTNN ELYTSLNVGT TAELNSALKQ ISGSQATTVF REARVLSNRF SMLADAAPKM GNGLAFNVVA KGDPRAELGN NTEYDMLALR KTVDLSESQT MSLEYGIARL DGDGAQKAGD NGVTGGYSQF FGLKHQMSFD NGMNWNNALR YDIHQLDSSR SVAYGDVSKT ADTNVKQQYL EFRSEGAKTF EPREGLKITP YAGVKLRHTL EGGYQERNAG DFNLSMNSGS ETAVDSIVGL KLDYAGKDGW SANATLEGGP NLSYAKSQRT ASLAGAGSQH FNVDDGQKGG SINSLASVGV KYSSKESSLN LDAYHWKEDG VSDKGVMLNF KKTF
|
| |