Gene EcSMS35_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0402 
Symbol 
ID6146526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp414664 
End bp416808 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content47% 
IMG OID641615298 
Productouter membrane autotransporter 
Protein accessionYP_001742505 
Protein GI170683055 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGGAAA GTTCTGGTTA TGGTCATTTC GGTAACTCTG GTGAGCCGAG TGATTATGCT 
GGTCCGGGTG ACGTTGCATT GTCTTTCACT GACAGTACTT CAGACTATGC AATGAAGAAC
AATGTATATT TCAGCAATTC TACTCTGGTG GGTGATGTTG CGTTTACCAG CACCTGGAAT
GCTAATTTTG ATCCAAGTGG TCATGATTCT AACGGTGACG GCGTAAAAGA TACCAATGCT
GGTTGGGTTG ATGATAGCCT GAACGTTGAT GAGCTGAATA TTACTCTTGA TAATGGAAGC
AAGTGGGTTG GACAGGCTAC ATTCAATGCT GAAACCATTT CTCCAGACAC AATGTATGAT
GTTGCAACTA ACAGTTTGAC CCCTGGTGGA ACAGCTGAAG CTAATGGTTG GAATCGCATC
ATCGATAATA AAGTATTCCA GAGTGGTGTA TTTAACGTAG CGTTGAACAA CGGTTCTGAA
TGGGATACTA CAGGTCGTTC CGTCGTTGAT ACCTTGACAG TTAATAATGC TTCTCAGGTT
AATGTTTCGG AATCTAAATT AACTTCAGAT ACTATCGATT TAACTAACGG TTCTTCGCTG
AACATTGGTG AAGATGGTTA CGTTGATACC GATCATCTGA CTATTAACTC CTACAGTACT
GTTGCGTTGA CCGAATCTAC AGGTTGGGGT GCTGATTACA ACCTGTACGC CAATACCATC
ACCGTAACTA ACGGTGGTGT ATTGGATGTG AACGTTGATC AGTTCGATAC TGAAGCTTTC
CGTACTGACA AACTGGAACT GACCAGCGGC AACATTGCTG ACAATAATGG TAATGTAGTA
TCCGGTGTAT TCGATATCCA CAGCAGCGAT TATGTTCTGA ACGCTGACTT GGTGAACGAC
CGTACCTGGG ATACTTCCAA GTCTAACTAC GGCTACGGTA TTGTTGCTAT GAACTCTGAT
GGTCATCTGA CTATCAATGG TAACGGCGAC ATGAACAACG GCGACGAACT GGATAACAGC
TCTGTAGACA ACGTTGTTGC TGCAACCGGT AACTATAAAG TTCGTATCGA CAACGCAACT
GGCGCTGGCG CAATCGCTGA TTACAAAGAT AAAGAAATTA TCTACGTAAA CGACGTCAAC
AGCAACGCGA CCTTCTCTGC TGCTAACAAA GCTGACCTGG GTGCATACAC CTATCAGGCT
GAACAGCGTG GTAACACCGT TGTTCTGCAA CAGATGGAGC TGACCGACTA CGCTAACATG
GCGCTGAGCA TCCCGTCTGC GAACACCAAT ATCTGGAACC TGGAACAAGA CACCGTTGGT
ACTCGTCTGA CCAACTCTCG TCATGGCCTG GCTGATAACG GCGGCGCATG GGTAAGCTAC
TTCGGTGGTA ACTTCAACGG CGACAACGGC ACCATCAACT ATGATCAGGA TGTTAACGGC
ATCATGGTCG GTGTTGATAC CAAAATTGAC GGTAACAACG CTAAGTGGAT CGTCGGTGCG
GCTGCAGGCT TCGCTAAAGG TGACATGAAT GACCGTTCTG GTCAGGTGGA TCAAGACAGC
CAGACTGCCT ACATCTACTC TTCTGCTCAC TTCGCGAACA ACGTCTTTGT TGATGGTAGC
TTGAGCTACT CTCACTTCAA CAACGACCTG TCTGCAACCA TGAGCAACGG TACTTATGTT
GATGGTAGCA CCAACTCCGA CGCTTGGGGC TTCGGTTTGA AAGCCGGTTA CGACTTCAAA
CTGGGTGATG CTGGTTACGT GACTCCTTAC GGCAGCGTTT CTGGTCTGTT CCAGTCTGGT
GATGACTACC AGCTGAGCAA CAACATGAAA GTTGACGGTC AGTCTTACGA CAGCATGCGT
TATGAACTGG GTGTAGATGC AGGTTATACC TTCACCTACA GCGAAGACCA GGCTCTGACT
CCGTACTTCA AACTGGCTTA CGTTTACGAC GACTCTAACA ACCATAACGA TGTGAACGGT
GATTCCATCG ATAACGGTAC TGAAGGGTCT GCGGTACGTG TTGGTCTGGG TACTCAGTTC
AGCTTCACCA AGAACTTCAG CGCCTATACC GATGCTAACT ACCTCGGTGG TGGTGACGTA
GATCAAGACT GGTCCGCGAA CGTGGGTGTT AAATATACCT GGTAA
 
Protein sequence
MLESSGYGHF GNSGEPSDYA GPGDVALSFT DSTSDYAMKN NVYFSNSTLV GDVAFTSTWN 
ANFDPSGHDS NGDGVKDTNA GWVDDSLNVD ELNITLDNGS KWVGQATFNA ETISPDTMYD
VATNSLTPGG TAEANGWNRI IDNKVFQSGV FNVALNNGSE WDTTGRSVVD TLTVNNASQV
NVSESKLTSD TIDLTNGSSL NIGEDGYVDT DHLTINSYST VALTESTGWG ADYNLYANTI
TVTNGGVLDV NVDQFDTEAF RTDKLELTSG NIADNNGNVV SGVFDIHSSD YVLNADLVND
RTWDTSKSNY GYGIVAMNSD GHLTINGNGD MNNGDELDNS SVDNVVAATG NYKVRIDNAT
GAGAIADYKD KEIIYVNDVN SNATFSAANK ADLGAYTYQA EQRGNTVVLQ QMELTDYANM
ALSIPSANTN IWNLEQDTVG TRLTNSRHGL ADNGGAWVSY FGGNFNGDNG TINYDQDVNG
IMVGVDTKID GNNAKWIVGA AAGFAKGDMN DRSGQVDQDS QTAYIYSSAH FANNVFVDGS
LSYSHFNNDL SATMSNGTYV DGSTNSDAWG FGLKAGYDFK LGDAGYVTPY GSVSGLFQSG
DDYQLSNNMK VDGQSYDSMR YELGVDAGYT FTYSEDQALT PYFKLAYVYD DSNNHNDVNG
DSIDNGTEGS AVRVGLGTQF SFTKNFSAYT DANYLGGGDV DQDWSANVGV KYTW