Gene EcolC_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1052 
Symbol 
ID6066296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1141442 
End bp1146022 
Gene Length4581 bp 
Protein Length1526 aa 
Translation table11 
GC content49% 
IMG OID641600464 
Productouter membrane autotransporter 
Protein accessionYP_001724046 
Protein GI170019092 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGGA CCAGTCCCTA TTATTGTCGC CGCTCAGTAC TTTCCTTATT GATATCTGCC 
TTGATATATG CCCCGCCCGG GATGGCTGCC TTCACTACTA ATGTTATTGG TGTGGTAAAC
GATGAGACTG TAGATGGCAA CCAAAAAGTG GATGAACGAG GTACAACAAA TAACACTCAT
ATTATCAACC ATGGCCAGCA GAATGTTCAT GGCGGGGTAT CTAATGGAAG TCTTATTGAA
TCTGGTGGAT ATCAAGATAT AGGAAGTCAT AACAATTTTG TGGGGCAGGC TAATAATACA
ACCATTAACG GTGGCAGACA GTCAATTCAT GACGGGGGTA TTTCCACAGG TACGACAATC
GAGAGTGGCA ATCAGGACGT TTATAAAGGG GGTATCAGCA ATGGAACGAC AATTAAGGGC
GGTGCTTCAC GCGTAGAGGG AGGGAGTGCG AATGGAATAC TCATTGATGG TGGTAGCCAG
ATAGTAAAAG TTCAAGGGCA TGCTGATGGT ACAACGATAA ATAAGTCTGG CTCTCAGGAC
GTAGTACAAG GAAGTCTGGC AACGAACACA ACCATAAATG GTGGTCGACA GTATGTTGAA
CAGAGCACAG TAGAAACAAC AACCATTAAA AATGGCGGTG AGCAAAGAGT ATATGAGAGC
CGTGCGCTGG ACACGACGAT TGAAGGCGGA ACTCAGTCTC TGAATAGTAA GTCAACGGCA
AAAAATACGC ATATCTATTC TGGTGGCACG CAAATTGTTG ATAACACCAG CACCTCGGAT
GTTATTGAAG TTTATTCTGG TGGCGTGCTT GATGTTAGTG GTGGTACGGC AACAAATGTT
ACCCAGCACG ATGGTGCAAT TTTAAAAACT AACACTAACG GTACGACGGT GAGCGGTACG
AATAGTGAAG GTGCATTCTC CATCCACAAT CACGTGGCAG ACAATGTGTT GCTGGAAAAC
GGTGGTCATT TAGACATAAA CGCATATGGT TCGGCAAACA AGACGATTAT TAAAGATAAA
GGAACAATGT CAGTTTTAAC CAATGCTAAA GCTGATGCGA CCCGAATAGA TAATGGCGGG
GTTATGGATG TTGCAGGAAA CGCGACAAAT ACCATAATTA ATGGTGGCAC ACAGAATATT
AATAATTATG GCATAGCCAC AGGCACCAAT ATCAACAGCG GAACGCAAAA TATCAAAAGC
GGCGGGAAAG CTGACACAAC AATTATATCC TCCGGGAGCC GGCAGGTTGT TGAGAAAGAT
GGTACGGCAA TTGGCAGCAA TATTAGCGCC GGAGGCTCGC TGATTGTCTA TACCGGCGGT
ATTGCACATG GGGTTAACCA GGAGACGGGC AGTGCTTTAG TTGCCAACAC GGGTGCAGGG
ACTGATATCG AAGGATACAA CAAGCTCTCT CACTTCACTA TTACCGGAGG GGAGGCTAAT
TATGTTGTGC TGGAAAATAC CGGCGAACTG ACGGTAGTGG CTAAAACCTC GGCGAAAAAT
ACTACCATTG ATACTGGCGG TAAGCTGATT GTCCAGAAGG AGGCTAAAAC AGATAGCACC
AGACTTAATA ATGGCGGCGT TCTGGAGGTT CAGGACGGTG GTGAGGCTAA GCATGTTGAG
CAACAATCCG GCGGCGCATT AATTGCTTCC ACGACCTCCG GAACACTTAT CGAAGGAACC
AACAGTTATG GTGATGCTTT CTACATCAGG AATTCAGAAG CTAAAAATGT AGTGCTGGAA
AACGCTGGCT CATTAACAGT CGTCACTGGT TCCCGGGCAG TTGACACGAT TATTAATGCC
AACGGCAAAA TGGATGTTTA TGGAAAAGAT GTTGGCACTG TACTCAATAG TGCTGGCACC
CAAACAATAT ATGCCAGTGC CACTTCTGAT AAAGCAAATA TCAAAGGTGG CAAGCAAACG
GTATATGGTT TAGCCACTGA AGCAAATATC GAAAGTGGTG AACAAATTGT TGATGGTGGG
TCAACAGAGA AAACACACAT CAATGGTGGC ACGCAAACCG TTCAGAATTA TGGTAAGGCA
ATCAATACCG ATATCGTCTC TGGCCTACAA CAAATTATGG CAAACGGGAC AGCGGAAGGT
TCCATTATTA ATGGGGGTTC ACAGGTAGTT AATGAGGGCG GTCTGGCTGA AAACTCGGTG
CTTAATGACG GCGGCACACT CGATGTGCGG GAGAAAGGCA GCGCAACGGG GATACAGCAG
AGTAGCCAGG GCGCTTTGGT TGCAACCACC AGGGCGACGC GGGTCACAGG AACACGCGCG
GATGGCGTCG CGTTCAGCAT CGAGCAGGGT GCGGCGAACA ATATCCTGCT GGCAAATGGC
GGCGTGTTAA CCGTGGAGTC AGACACCTCT TCTGACAAAA CACAGGTCAA TATGGGCGGA
CGGGAGATCG TCAAAACAAA AGCCACTGCG ACAGGCACGA CGCTCACCGG CGGTGAACAA
ATTGTCGAGG GTGTGGCGAA TGAGACAACA ATTAACGACG GCGGAATACA AACAGTTTCA
GCTAACGGAG AGGCAATAAA AACAAAGATC AATGAAGGCG GTACGCTGAC AGTCAACGAT
AATGGCAAAG CGACAGATAT CGTCCAGAAC AGCGGTGCCG CTCTCCAGAC GAGCACGGCT
AACGGTATTG AAATCAGCGG TACTCACCAG TACGGTACTT TTTCCATTTC CGGCAATTTA
GCGACCAATA TGTTGCTGGA AAATGGCGGT AATTTATTGG TATTAGCAGG TACCGAAGCT
CGCGACTCCA CGGTTGGCAA GGGTGGGGCA ATGCAAAACC TGGGTCAGGA CTCCGCCACA
AAGGTTAACT CTGGCGGGCA ATATACCCTT GGGCGGTCAA AAGATGAGTT TCAGGCTCTG
GCCCGGGCAG AAGATCTCCA GGTCGCTGGC GGTACGGCAA TCGTCTACGC AGGTACGCTG
GCGGATGCAT CGGTCAGTGG CGCGACAGGA AGCCTGTCGT TAATGACGCC ACGGGATAAT
GTTACGCCAG TTAAACTCGA AGGGGCGGTC CGGATTACCG ATAGCGCGAC ATTGACTCTG
GGAAATGGCG TCGATACCAC GCTTGCCGAC CTGACGGCTG CCAGCCGGGG CAGTGTCTGG
CTTAACAGCA ATAATTCCTG TGCAGGTACC AGCAACTGCG AATATAGAGT AAACAGTTTG
CTACTCAACG ACGGTGATGT TTATTTGTCA GCACAAACAG CAGCGCCTGC CACAACTAAC
GGTATCTACA ATACGCTGAC AACCAATGAA CTTTCCGGTA GCGGTAATTT CTACCTGCAT
ACCAACGTTG CAGGCTCCCG GGGCGATCAA CTGGTCGTCA ACAACAACGC CACTGGTAAT
TTTAAAATCT TTGTTCAGGA TACCGGCGTC AGCCCACAGT CTGACGACGC GATGACGCTG
GTGAAAACAG GGGGAGGGGA TGCTTCGTTT ACGCTGGGCA ATACCGGCGG TTTCGTTGAT
CTTGGGACCT ATGAGTATGT CCTGAAAAGT GACGGCAACA GCAACTGGAA CCTGACCAAT
GATGTCAAAC CCAACCCGGC CCCCATCCCA AATCCAAAGC CAGACCCAAA ACCCGATCCA
AAGCCAGACC CAAATCCAAA ACCAGACCCT ACTCCCGATC CAACGCCGAC ACCCGTTCCG
GAGAAACGCA TTACGCCTTC TACGGCAGCC GTACTCAATA TGGCAGCAAC ATTACCGTTG
GTATTTGATG CTGAGCTAAA CAGTATTCGC GAGCGGTTGA ACATAATGAA AGCGAGTCCA
CACAACAATA ATGTCTGGGG GGCGACGTAT AACACCCGTA ATAATGTCAC CACCGATGCG
GGTGCCGGGT TTGAGCAGAC GCTGACCGGA ATGACAGTGG GGATCGACAG CCGTAATGAT
ATTCCTGAAG GAATTACCAC GCTAGGCGCT TTTATGGGCT ATTCCCATTC ACATATCGGT
TTTGATCGCG GAGGACATGG CAGTGTGGGC AGTTATTCTC TGGGCGGCTA TGCCAGTTGG
GAACATGAAA GTGGTTTCTA TCTGGACGGT GTCGTGAAGC TGAACCGTTT TAAAAGTAAC
GTAGCAGGTA AAATGAGCAG CGGTGGAGCC GCCAATGGCA GTTACCACAG CAACGGGCTG
GGCGGTCACA TTGAAACCGG GATGCGATTT ACCGATGGTA ACTGGAACCT GACGCCGTAT
GCATCGTTAA CGGGGTTCAC CGCTGATAAC CCCGAATATC ATTTATCCAA TGGCATGAAA
TCGAAATCAG TCGATACCCG CAGTATATAT CGTGAACTGG GCGCAACGCT GAGTTACAAC
ATGCGTCTGG GGAACGGTAT GGAAGTTGAG CCGTGGCTGA AGGCGGCTGT GCGCAAAGAA
TTTGTCGATG ATAACCGGGT GAAAGTGAAT AGTGACGGTA ATTTCGTCAA TTATTTGTCG
GGCAGACGTG GAATATACCA GGCAGGTATT AAAGCCTCAT TCAGCAGTAC GTTAAGCGGG
CATCTTGGGG TGGGGTATAG CCATAGTGCC GGTGTGGAAT CCCCGTGGAA CGCGGTAGCT
GGTGTGAACT GGTCGTTCTG A
 
Protein sequence
MNRTSPYYCR RSVLSLLISA LIYAPPGMAA FTTNVIGVVN DETVDGNQKV DERGTTNNTH 
IINHGQQNVH GGVSNGSLIE SGGYQDIGSH NNFVGQANNT TINGGRQSIH DGGISTGTTI
ESGNQDVYKG GISNGTTIKG GASRVEGGSA NGILIDGGSQ IVKVQGHADG TTINKSGSQD
VVQGSLATNT TINGGRQYVE QSTVETTTIK NGGEQRVYES RALDTTIEGG TQSLNSKSTA
KNTHIYSGGT QIVDNTSTSD VIEVYSGGVL DVSGGTATNV TQHDGAILKT NTNGTTVSGT
NSEGAFSIHN HVADNVLLEN GGHLDINAYG SANKTIIKDK GTMSVLTNAK ADATRIDNGG
VMDVAGNATN TIINGGTQNI NNYGIATGTN INSGTQNIKS GGKADTTIIS SGSRQVVEKD
GTAIGSNISA GGSLIVYTGG IAHGVNQETG SALVANTGAG TDIEGYNKLS HFTITGGEAN
YVVLENTGEL TVVAKTSAKN TTIDTGGKLI VQKEAKTDST RLNNGGVLEV QDGGEAKHVE
QQSGGALIAS TTSGTLIEGT NSYGDAFYIR NSEAKNVVLE NAGSLTVVTG SRAVDTIINA
NGKMDVYGKD VGTVLNSAGT QTIYASATSD KANIKGGKQT VYGLATEANI ESGEQIVDGG
STEKTHINGG TQTVQNYGKA INTDIVSGLQ QIMANGTAEG SIINGGSQVV NEGGLAENSV
LNDGGTLDVR EKGSATGIQQ SSQGALVATT RATRVTGTRA DGVAFSIEQG AANNILLANG
GVLTVESDTS SDKTQVNMGG REIVKTKATA TGTTLTGGEQ IVEGVANETT INDGGIQTVS
ANGEAIKTKI NEGGTLTVND NGKATDIVQN SGAALQTSTA NGIEISGTHQ YGTFSISGNL
ATNMLLENGG NLLVLAGTEA RDSTVGKGGA MQNLGQDSAT KVNSGGQYTL GRSKDEFQAL
ARAEDLQVAG GTAIVYAGTL ADASVSGATG SLSLMTPRDN VTPVKLEGAV RITDSATLTL
GNGVDTTLAD LTAASRGSVW LNSNNSCAGT SNCEYRVNSL LLNDGDVYLS AQTAAPATTN
GIYNTLTTNE LSGSGNFYLH TNVAGSRGDQ LVVNNNATGN FKIFVQDTGV SPQSDDAMTL
VKTGGGDASF TLGNTGGFVD LGTYEYVLKS DGNSNWNLTN DVKPNPAPIP NPKPDPKPDP
KPDPNPKPDP TPDPTPTPVP EKRITPSTAA VLNMAATLPL VFDAELNSIR ERLNIMKASP
HNNNVWGATY NTRNNVTTDA GAGFEQTLTG MTVGIDSRND IPEGITTLGA FMGYSHSHIG
FDRGGHGSVG SYSLGGYASW EHESGFYLDG VVKLNRFKSN VAGKMSSGGA ANGSYHSNGL
GGHIETGMRF TDGNWNLTPY ASLTGFTADN PEYHLSNGMK SKSVDTRSIY RELGATLSYN
MRLGNGMEVE PWLKAAVRKE FVDDNRVKVN SDGNFVNYLS GRRGIYQAGI KASFSSTLSG
HLGVGYSHSA GVESPWNAVA GVNWSF