Gene EcolC_3258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3258 
Symbol 
ID6066834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3565225 
End bp3568227 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content47% 
IMG OID641602673 
Productouter membrane autotransporter 
Protein accessionYP_001726207 
Protein GI170021253 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.372144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCCT GGAAAAAGAA ACTTGTAGTA TCACAATTAG CATTGGCTTG CACTCTGGCT 
ATCACCTCTC AGGCTAATGC AGCGACCAAC GATATTTCTG GTCAAACTTA CAATACTTTC
CATCACTACA ACGACGCCAC CTATGCTGAT GACGTTTACT ATGATGGTTA TGTAGGCTGG
AACAACTATG CCGCTGATAG CTATTACAAC GGCGATATCT ACCCGGTCAT TAATAACGCT
ACCGTTAACG GCGTGATTTC TACCTACTAT CTGGACGACG GTATTTCTAC CAATACCAAC
GCCAATAGTC TGACAATCAA AAACAGCACT ATTCACGGTA TGATTTATTC CGAGTGCATG
ACAACTGATT GTGCTGATCG TGCGGATGAT TACTACCATG ACCGTCTGGC GCTGACTGTT
GATAATTCAA CGATCGACGA CAACTACGAA CACTACACAT ATAACGGCAC TTACAATAAT
GCCGCTGATA CTCATGTTGT AGACGTTTAC AACATTGGTA CGGCTATTAC TCTGGATCAG
GAAGTTGATC TGTCCATCAC TAATAACTCT CACGTTGCGG GTATTACGTT GACTCAGGGT
TATGAGTGGG AAGATATTGA CGACAACACT GTCAGCACTG GCGTAAACAG CAGCGAAGTG
TTTAATAACA CCATCACTGT TAAAGATTCT ACCGTAACCT CTGGTTCATG GACTGATGAA
GGTACTACTG GTTGGTTCGG TAATACAGGT AATGCCAGCG ATTATAGTGG TAAATCTAAC
TTTGTCACCG TAGATACTGA CGGAGATGGT GTTGCTGATA GCACTATTGC AAGCTGGGAT
GATGTTGCAT TAGCTGTAGT TGCACATCCG AATGCTGATA ATGCTATGCA GACCACCGCT
GACTTTAGTA ACTCTACTTT GATGGGCGAT GTAATCTTCT CTAGCAACTT TGATGAAAAC
TTCTTCCCAC GTGGTGCAGA TAGTTATCGC GATGCTGATG GTGAAGTAGA CACCAACGGT
TGGGATGGAA CAGATCGCCT GGATCTGACA TTAAATAACG GTAGTAAATG GGTAGGGGCA
GCACAATCTG TTCATCAAAC TGGTTCTATT GATGTTGATG GAGATGGTAA AGGTGACATC
GCTACATACG GTGTTGGCAC CGAGGCAACT GCAACTCTGA TTGATATTGA GGATAATAGC
CTGTGGCCGT TATCAACCGT TGGTGTTGAA AACGATGATA CAAGTTACAG TGAATTCGAT
CATATTACTG GTAATCAGGT TTACCAGAGC GGTCTGTTCA ATGTGACCCT GAATACCGGT
TCACAGTGGG ATACCACCAA AACTTCTCTG ATTGATACTC TGAGCATCAA CAGTGGTTCA
ACTGTTAATG TTGCTGATTC AACGCTGATT TCTGACTCTA TCTCTCTGAC AGGTCTTTCT
GCGCTGAACA TCAACGAAGA TGGTCATGTT GCAACTGATT CACTGACTGT CGACAACAGT
ACCGTAACCA TTTCTGATGA AGTTTCTGCT GGTTGGGCTG TAGGTGATGC TGCTCTGTAT
GCCAACAACA TCAAAGTGAC TAACGACGGT ATTCTGGATG TAGGTAACAC TGCGGCGAAT
GCTCTGCAGG TTGATACTCT GAACCTGACC AGCACTACTG ATACCAGTGG TAACATTCAC
GCTGGTGTGT TCAACATCGA AAGCAACCGC TTCGTACTTG ATGCAGACCT GACCAACGAC
CGTACCAACG ATACTACCAA GTCAAACTAC GGTTATGGCT TAATCGCAAT GAACTCTGAT
GGTCACCTGA CCATTAACGG TAACGGCGAT AACGACAACA CTGCTTCTAT CGAAGCTGGT
CAGAACGAAG TTGATAACAA CGGTGACCAT GTTGCAGCTG CAACCGGTAA CTACAAAGTT
CGTATCGACA ACGCTACTGG TGCTGGTTCT ATCGCTGACT ACAACGGCAA CGAGCTGATC
TACGTCAACG ACAAAAATAG CAACGCGACC TTCTCTGCTG CTAACAAAGC TGACCTGGGT
GCATACACCT ATCAGGCTGA ACAGCGCGGT AACACCGTTG TTCTGCAACA GATGGAGCTG
ACCGACTACG CTAACATGGC GCTGAGCATC CCTTCTGCGA ACACCAATAT CTGGAACCTG
GAACAAGACA CCGTTGGTAC TCGTCTGACC AACTCTCGTC ATGGCCTGGC TGATAACGGC
GGCGCATGGG TAAGCTACTT CGGTGGTAAC TTCAACGGCG ACAACGGCAC CATCAACTAT
GATCAGGATG TTAACGGCAT CATGGTCGGT GTTGATACCA AAATTGACGG TAACAACGCT
AAGTGGATCG TCGGTGCGGC TGCAGGCTTC GCTAAAGGTG ACATGAATGA CCGTTCTGGT
CAGGTAGATC AAGACAGCCA GACTGCCTAC ATCTACTCTT CTGCTCACTT CGCGAACAAC
GTCTTTGTTG ATGGTAGCTT GAGCTACTCT CACTTCAACA ACGACCTGTC TGCAACCATG
AGCAACGGTA CTTACGTTGA CGGTAGCACC AACTCCGACG CTTGGGGCTT CGGCTTGAAA
GCCGGTTACG ACTTCAAACT GGGTGATGCT GGTTATGTGA CTCCTTACGG CAGCATTTCT
GGTCTGTTCC AGTCTGGTGA TGACTACCAG CTGAGCAACG ACATGAAAGT TGACGGTCAG
TCTTACGACA GCATGCGTTA TGAACTGGGT GTAGATGCAG GTTATACCTT CACCTACAGC
GAAGACCAGG CTCTGACTCC GTACTTCAAA CTGGCTTACG TCTACGACGA CTCTAACAAC
GATAACGATG TGAACGGTGA TTCCATCGAT AACGGTACTG AAGGGTCTGC GGTACGTGTT
GGTCTGGGTA CTCAGTTCAG CTTCACCAAG AACTTCAGCG CCTATACCGA TGCTAACTAC
CTCGGTGGTG GTGACGTAGA TCAAGATTGG TCCGCGAACG TGGGTGTTAA ATATACCTGG
TAA
 
Protein sequence
MHSWKKKLVV SQLALACTLA ITSQANAATN DISGQTYNTF HHYNDATYAD DVYYDGYVGW 
NNYAADSYYN GDIYPVINNA TVNGVISTYY LDDGISTNTN ANSLTIKNST IHGMIYSECM
TTDCADRADD YYHDRLALTV DNSTIDDNYE HYTYNGTYNN AADTHVVDVY NIGTAITLDQ
EVDLSITNNS HVAGITLTQG YEWEDIDDNT VSTGVNSSEV FNNTITVKDS TVTSGSWTDE
GTTGWFGNTG NASDYSGKSN FVTVDTDGDG VADSTIASWD DVALAVVAHP NADNAMQTTA
DFSNSTLMGD VIFSSNFDEN FFPRGADSYR DADGEVDTNG WDGTDRLDLT LNNGSKWVGA
AQSVHQTGSI DVDGDGKGDI ATYGVGTEAT ATLIDIEDNS LWPLSTVGVE NDDTSYSEFD
HITGNQVYQS GLFNVTLNTG SQWDTTKTSL IDTLSINSGS TVNVADSTLI SDSISLTGLS
ALNINEDGHV ATDSLTVDNS TVTISDEVSA GWAVGDAALY ANNIKVTNDG ILDVGNTAAN
ALQVDTLNLT STTDTSGNIH AGVFNIESNR FVLDADLTND RTNDTTKSNY GYGLIAMNSD
GHLTINGNGD NDNTASIEAG QNEVDNNGDH VAAATGNYKV RIDNATGAGS IADYNGNELI
YVNDKNSNAT FSAANKADLG AYTYQAEQRG NTVVLQQMEL TDYANMALSI PSANTNIWNL
EQDTVGTRLT NSRHGLADNG GAWVSYFGGN FNGDNGTINY DQDVNGIMVG VDTKIDGNNA
KWIVGAAAGF AKGDMNDRSG QVDQDSQTAY IYSSAHFANN VFVDGSLSYS HFNNDLSATM
SNGTYVDGST NSDAWGFGLK AGYDFKLGDA GYVTPYGSIS GLFQSGDDYQ LSNDMKVDGQ
SYDSMRYELG VDAGYTFTYS EDQALTPYFK LAYVYDDSNN DNDVNGDSID NGTEGSAVRV
GLGTQFSFTK NFSAYTDANY LGGGDVDQDW SANVGVKYTW