Gene EcSMS35_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1767 
Symbol 
ID6143039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1776915 
End bp1782869 
Gene Length5955 bp 
Protein Length1984 aa 
Translation table11 
GC content52% 
IMG OID641616643 
Productputative autotransporter 
Protein accessionYP_001743821 
Protein GI170680870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.209421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAGA AAACGTTATT ATCCGCCTGT ATCGCGCTGG CTTTAAGCGG ATCGGGCTGG 
GCTGCTGACA TTGATGATAC AGATAGTGCA ACTCGTCAGC GTAAGGAGAC AAGGATACCC
TGTCCGACCG CTCACTCCTC TGAAAAACTA AGTCCGCAAC AACTAAAATC GCTCCCGTCT
GAATGTTCTA CAACCAATGA CAACAACCTC TATTCCTTGA TTGCCGTTGG CGCTACTTCA
CTAATCACCA CTCTTGCAGT CCTTGAACTA AACCACGATG ACGGTAACCA CGCTCATTCC
TCTGACAATC CTCCAGTACC GCCTGATGAC GATAATGGCG GCAACACACC GGACGATGGC
GGCAATACCC CTGACGATGG TGGGAATACT CCTGACGATG GCGGCAACAC ACCGGATGAT
GGTGGCAACA CACCGGATGA TGGTGGCAAC ACACCGGATG ATGGTGGCAA CACACCGGAC
GATGGCGGCA ATACCCCTGA CGATGGTGGG AATACTCCTG ACGATGGCGG CAATACTCCT
GACGATGGCG GCAACACACC GGACGATGGC GGCAATACCC CTGACGATGG CGGCAATACC
CCTGACGATG GTGGGAATAC TCCTGACGAT GGCGGCAACA CACCGGATGA TGGTGGCAAT
ACCCCAGACG ATGGCGGCAA CGTCACCCCG CCCAAAGAGC CTAAAATCTT CAATAACAAT
GTCACGTTCG ACGAAGATAA AGGCACGCTG AAAATTCGTA ACGCCACCTT TACCTACAGC
AAAAACACCG ATGGAACTTA TACCCTGACG GCTGGAGATG GCCGGACCAC TGTTGTACAA
GGCTGGGATG TCGACACGGC TGCCAATACT GTAGAAATTA CTGGCGTGAA TACCCAGGGC
GGTATGACGT GGCGTTACGG TAAAGACGGC ATTATCTATA TCACTAAAAC CGTCGGCGCA
ACGGTGGATG ACCCCGCGAG CAGCAACGTA TTTAACCTCA GCGATGCGGT GCTCACTGAC
CAGGGCGGTA ATGCCGCGCT GAATGGCGCA ACGGTGATTG AAATTAACGG CAGCAGGATC
GTCCTTAATA ACGACGGTGA CATTTCTGCT ACGGGAAAAG ATTCTGTGGT TGTGGCGATG
ACCGGAAACG ACATTACCGT TAACAATAAC GGCCATATGG TTGTCGATGG CGGCACCGCG
GGCGTGGTTA ACGGCGATCG TGCGATCCTG AATAACCGTG GTGATGCGGT GATCACTAAC
GGGGGCGCGG GTGTCATTGT GACGGGCGAC AACGCGGTCA TCAACAACAC GGGGCAAAGC
GATATTGATG GCGACAACTC GGTATCGGTA AAAGTCGCAG GAAACGCAAC CAGAATCAAA
ATGGAAGGCG GGCTCAACGT CAGCGGTGGC GCGCATGGCA TTGATGCCAC CGGTGACAAT
AACGAAGTCA GCAACAAGGG CAACATTTCG GTGGTGGATG CGCATTCCAC GGGCGTGCTG
CTGAACGGCG ATCGCGCGTC ATTCGTTAAT ATGGGTGATA TTAACGTTAG CGGAGGTGCA
GCGGACGATC ACGCTATTGG TGTGCAGATT AACGGCGATA ACAGTACCTT TATCAACGTT
GGAGATTTAA ATGCCGACGA CAGGGCGACC GGGGTGAAAA TTACCGGTGA CGCCAGCGAC
ATTGCCCTGG CCGGGGCGAT GCATGTCGGG GATTTTGCCT CCGGGCTGGA GGTGACAGGT
AAGAACAATG ACCTGTCGTT GTCCACCAAT ATGATGGATG TCACCGGGAA ACAGTCCACC
GGGGTGACCA TTACAGGTGA TGAAAATACC ATTGATATCA CCGGTGATAT GGTCGTTGAT
CAAAACTCTG TCGGGACGAA GATTGTTGGT GATCGCGTTT CGTTGCAACA GAAAGGTGAT
ATTACCGTTA ACGGTGCGGG GCACGGCGTT GAAGTCAGCG GTAGTAAGGC GACAATCAAT
AATCAGGGCA AACTGACCGT AAAAGATCAA GACTCAATCG GTATCGCTAT CACCGGGGAT
GACGCGCAAT TTACCACCGT GGGTGAGATT GATGTCTCGC TGAATGGTAC GGGTGTTGCG
ATCAGCGGCG ATCGTGAACA AGTGAATTTG AGCGGTGATA TAAATATCAT TCAGGAGCGT
GACGACAGCG GTACTTTCCA GGGCGGGACG GGCATCAGCA TAATGAGCAA CGACAGCAGC
ATGCTGTTGG CGGGAAACAT TAATGTGACG TCCAGCATAG GGGACCAACC GTCCACCTCA
CACCTGTCGT TAACCGGCGT TACCATCGGG GGGGAAAACA ATACTGTCGA TCTGCAAGGC
GACGTCAATA TTACTGTCGA TTCGGGTTAC CCGGGGGATG AAAACAGTCT TTACGGCGTG
GTGGTTTCGG GGTCAAATAA CATCATCAGC CTCGATGGTG GTATCAATAT TTCCGGCGAT
AGTGGCGGAC ATGTTGTCAA AGGCGTGCAG GTAACCGGGA ATAATACCGT CAATATTAGT
GGTCATTCCG TAATGAATAC CCGACAGGTA TTGGGAGCCT TCTCTTTGAT TTCGGTGGCT
GATGGCGGGA ATGTCGTATT TGATGAAAAT GCGATAACCG AGATCCAAAG TTCAGTGAGA
AATACTACTG CCCCTCTATT TGTTGATTCA TCGTTAATTG TTGCTAAGGG AAATCAGTCT
GTTATTCAGA ACAAAGGCAT TATTAACACA ACCGATGCCC AGGGATTAAT GATGGCGAAT
TCGGGGGCGA AGGTTGTCAA TGCCGGGGTG ATTAATATCA GGCCAGATGC TGAATCACAC
AGTTTCTTTG CTGGAATGCT GGCAAGAGGC GATGACTCGC AGGCAAAAAA TGTCTCCGGA
GGGACAATCA ATCTGACATC CATTACCCAG CCTTCTTACG GCGGAGGATT TAGTGAATAC
CCGGTGAAAT GGTATTTCAA CACGGGCTAT GCATTGCTGG CCAGTAATTA CGGTTCGGTA
ATTAATGAAG CGGGAGCGAC CATTAATTTG CATGGGGCAG GAACCTATGG TGTTTCCGCG
TCAAAAGGAA CGGCGACCAA TGCGGGCGAG ATAAATGTCG ACGGCTTTGT ACCAACAGTA
GATGAAAACG GTAATATTAG TGATGAAACT TACTGGCAGA CCAATTCCGT ATATCTGATG
GGCGGCGGGA TGTTGGCCGG TTCAACGGAT ACCGGCAACG GTGATGCAAA AGCGGTGAAT
ACCGGTACCA TCAACGTTAA GAATGAAGGC TTCGGCATGC TGGCAATGAG CGGCGGGACC
GTTGTAAACC AGGGGACAAT TAATCTCACT GCCGATGAGG GCGTGACGAA ACAGCAAGAT
AACCAACTGT TCGCGCTGGG TGCAATTCAA CGTGGTCTGG CGATTAATGA CCAGAATGGG
GTTATTAATA TTAACACTGA TATTGGCCAG GCGTTTTATA AAGATAGCAC CGGCACCATT
CTCAACTACG GCAAAATTAA TCTTTTCGGT AATCCAATGG ATGAAAGTGA TTCCCATATG
GGCGTTGCGC CGGATGACAA AGATGTTCTC AGTGAGCTGT CTGGTAGCGG CGAGAGCATC
AGCAAAACCA CAACTGCCGA CGGTTTTATC GCCGTAAATG ACCTGGCAAA CTATGGCGAC
GAAACGCTTA ATGGCGACGT TACAGCAAAC GGCTGGATAT TCAACCAACC TGATGCCAGC
CTGACAATCA ACGGCGAACT GAGCGTAAAC CAGGGGTTGG AAAACAGTGG TCACCTGACA
ACCGATCTGC TGACGTTGAA TGGCGCGGTT TCCTTCTTTA ATGAAGGCGA ATTTAGTGGT
TCAATTACCG GAAACAGCTA TCAACAGAAT GTGGTGAATA CCGGTGAAAT GACGGTGACC
GAAGATGGCC ACTCACTTGT CAACGGAAGT TTCCTGTTCT TCAATGAAGC TGGCGCGACC
CTGACAAATA GCGGTAATGC GGTAACGGGC GGGGAAAACG CGATTATTCA TGTAACCAGA
ACCGGTGATT CTGTTTCGCA GGTTAACCGT GGCACCATCA CGGCCATAAA TGGCTATAGC
GCAATCAAAA CTGAAAATAC TGGCTCGTAC AGTAACGGGA AATGGATTTG GAATACGGAA
ACGGGAGTGA TAAATGGTAT TAATCCTGCG GCTCCGTTGG TTGATTTAGG ACGAGGTTAT
AATTTTTCTA ACGCTGGCGT TATTAATGTT CAGGGCGATA ATGCGGTGGG CATCAGTGGC
GGCATAACCA GCTATACTGT CAAGTTGGTG AACAGTGGCA CCATTAATGT GGGTACTGAA
CAGGGGCAAC TGGATGGAAC CAACGGCGAA GGCTTAATTG GTATTAAGGG CAATGGTAAG
GACACGACAA TTAATAACAC GCAAACGGGT GTGATTAATG TCTATGCCGA TAACTCCTGG
GCGTTTGGTG GGCAAACGAA AGCCATCATT AATAACGGTG AAATTAATTT GCTGTGTGAC
ACCGGATGCG ATATTTATGC TCCTGGAACG ACGGGTACGA AAAACGATCA CAACGGTACT
GCGGATATCA CCGTACCAGA GGCATCAACC ACGCCGTCAC AAGGTAATGT TCCAACGCCG
CCTGCCGATT CAAACGCACC GCAACTGCTG AGCAACTATA CCATCGGTAC CAACAGCGAC
GGGAGTTCGG GTACGCTCAG CGCGAATAAT CTGGTTATTG GCGACAACGT GAGTGTTAAC
GCCGGGTTTA GTGCCGGAAC AGCGGACACC ACCGTTGTCG TTAACGATGT ATTCAAAGGC
GAAAACATCA GCGGTGTAGA TAACATTGTT TCCTCTACGG TCGTCTGGAC TGCCAAAGGC
AGTACCGACG CCAGCGGCAA CGTTGACGTG ACCATGAGCA AAAACGCCTA CACCGATGTA
GCGACCGATG GCTCAGTGAG TGACGTGGCG AAGGCACTTG ATGCGGGTTA TACCAACAAC
GAGCTGTACA CCAGTCTGAA CGTGGGCACC ACCGCCGAAC TGAACAGCGC GCTGAAACAA
ATCAGTGGTA GCCAGGCGAC CACCGTATTC CGTGAGGCGC GCGTGTTAAG CAACCGCTTC
AGCATGCTGG CGGATGCAGC GCCGAAGATG GGCAACGGTC TGGCGTTCAA CGTGGTGGCG
AAAGGTGATC CGCGTGCCGA ACTGGGTAAC AATACCGAGT ACGACATGCT GGCGCTGCGT
AAAACAGTTG ACCTGAGCGA AAGCCAGACC ATGAGCCTGG AATACGGTAT CGCGCGTCTG
GATGGTGACG GTGCGCAAAA AGCAGGCGAC AACGGAGTAA CCGGCGGCTA CAGCCAGTTC
TTTGGACTGA AACATCAGAT GTCCTTCGAC AATGGCATGA ACTGGAATAA CGCGCTGCGT
TACGATATTC ATCAACTGGA CAGCAGCCGC TCGGTGGCTT ACGGCGACGT GAGCAAAACG
GCGGATACCA ACGTGAAACA GCAGTACCTG GAGTTCCGTA GCGAAGGGGC GAAAACCTTT
GAACCGCGCG AAGGGCTGAA AATCACCCCG TATGCCGGTG TGAAACTGCG TCACACGCTG
GAGGGCGGCT ATCAGGAACG CAATGCCGGA GACTTCAACC TGAGCATGAA CAGTGGCAGC
GAAACGGCGG TGGACAGCAT CGTCGGGCTG AAACTGGACT ACGCAGGCAA AGACGGCTGG
AGTGCGAACG CCACTCTGGA AGGTGGGCCG AACCTGAGCT ACGCGAAGAG CCAGCGCACG
GCAAGCCTGG CAGGCGCAGG CAGCCAGCAC TTTAATGTCG ATGATGGACA GAAGGGCGGC
AGTATCAACA GCCTGGCAAG CGTCGGCGTG AAGTACAGCA GCAAAGAGAG CTCGCTGAAT
CTGGATGCCT ATCACTGGAA AGAAGACGGC GTCAGCGATA AAGGCGTGAT GCTCAATTTC
AAGAAAACGT TCTAA
 
Protein sequence
MQKKTLLSAC IALALSGSGW AADIDDTDSA TRQRKETRIP CPTAHSSEKL SPQQLKSLPS 
ECSTTNDNNL YSLIAVGATS LITTLAVLEL NHDDGNHAHS SDNPPVPPDD DNGGNTPDDG
GNTPDDGGNT PDDGGNTPDD GGNTPDDGGN TPDDGGNTPD DGGNTPDDGG NTPDDGGNTP
DDGGNTPDDG GNTPDDGGNT PDDGGNTPDD GGNTPDDGGN TPDDGGNVTP PKEPKIFNNN
VTFDEDKGTL KIRNATFTYS KNTDGTYTLT AGDGRTTVVQ GWDVDTAANT VEITGVNTQG
GMTWRYGKDG IIYITKTVGA TVDDPASSNV FNLSDAVLTD QGGNAALNGA TVIEINGSRI
VLNNDGDISA TGKDSVVVAM TGNDITVNNN GHMVVDGGTA GVVNGDRAIL NNRGDAVITN
GGAGVIVTGD NAVINNTGQS DIDGDNSVSV KVAGNATRIK MEGGLNVSGG AHGIDATGDN
NEVSNKGNIS VVDAHSTGVL LNGDRASFVN MGDINVSGGA ADDHAIGVQI NGDNSTFINV
GDLNADDRAT GVKITGDASD IALAGAMHVG DFASGLEVTG KNNDLSLSTN MMDVTGKQST
GVTITGDENT IDITGDMVVD QNSVGTKIVG DRVSLQQKGD ITVNGAGHGV EVSGSKATIN
NQGKLTVKDQ DSIGIAITGD DAQFTTVGEI DVSLNGTGVA ISGDREQVNL SGDINIIQER
DDSGTFQGGT GISIMSNDSS MLLAGNINVT SSIGDQPSTS HLSLTGVTIG GENNTVDLQG
DVNITVDSGY PGDENSLYGV VVSGSNNIIS LDGGINISGD SGGHVVKGVQ VTGNNTVNIS
GHSVMNTRQV LGAFSLISVA DGGNVVFDEN AITEIQSSVR NTTAPLFVDS SLIVAKGNQS
VIQNKGIINT TDAQGLMMAN SGAKVVNAGV INIRPDAESH SFFAGMLARG DDSQAKNVSG
GTINLTSITQ PSYGGGFSEY PVKWYFNTGY ALLASNYGSV INEAGATINL HGAGTYGVSA
SKGTATNAGE INVDGFVPTV DENGNISDET YWQTNSVYLM GGGMLAGSTD TGNGDAKAVN
TGTINVKNEG FGMLAMSGGT VVNQGTINLT ADEGVTKQQD NQLFALGAIQ RGLAINDQNG
VININTDIGQ AFYKDSTGTI LNYGKINLFG NPMDESDSHM GVAPDDKDVL SELSGSGESI
SKTTTADGFI AVNDLANYGD ETLNGDVTAN GWIFNQPDAS LTINGELSVN QGLENSGHLT
TDLLTLNGAV SFFNEGEFSG SITGNSYQQN VVNTGEMTVT EDGHSLVNGS FLFFNEAGAT
LTNSGNAVTG GENAIIHVTR TGDSVSQVNR GTITAINGYS AIKTENTGSY SNGKWIWNTE
TGVINGINPA APLVDLGRGY NFSNAGVINV QGDNAVGISG GITSYTVKLV NSGTINVGTE
QGQLDGTNGE GLIGIKGNGK DTTINNTQTG VINVYADNSW AFGGQTKAII NNGEINLLCD
TGCDIYAPGT TGTKNDHNGT ADITVPEAST TPSQGNVPTP PADSNAPQLL SNYTIGTNSD
GSSGTLSANN LVIGDNVSVN AGFSAGTADT TVVVNDVFKG ENISGVDNIV SSTVVWTAKG
STDASGNVDV TMSKNAYTDV ATDGSVSDVA KALDAGYTNN ELYTSLNVGT TAELNSALKQ
ISGSQATTVF REARVLSNRF SMLADAAPKM GNGLAFNVVA KGDPRAELGN NTEYDMLALR
KTVDLSESQT MSLEYGIARL DGDGAQKAGD NGVTGGYSQF FGLKHQMSFD NGMNWNNALR
YDIHQLDSSR SVAYGDVSKT ADTNVKQQYL EFRSEGAKTF EPREGLKITP YAGVKLRHTL
EGGYQERNAG DFNLSMNSGS ETAVDSIVGL KLDYAGKDGW SANATLEGGP NLSYAKSQRT
ASLAGAGSQH FNVDDGQKGG SINSLASVGV KYSSKESSLN LDAYHWKEDG VSDKGVMLNF
KKTF