Gene EcolC_1418 details

Gene Information

Locus tag: EcolC_1418
Symbol:
ID: 6067759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1552399
End bp: 1556103
Gene Length: 3705 bp
Protein Length: 1234 aa
Translation table: 11
GC content: 52%
IMG OID: 641600837
Product: adhesin
Protein accession: YP_001724408
Protein GI: 170019454
COG category: [M] Cell wall/membrane/envelope biogenesis
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01376] Chlamydial polymorphic outer membrane protein repeat
    [TIGR01414] outer membrane autotransporter barrel domain
    [TIGR02601] autotransporter-associated beta strand repeat
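
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted as a residue. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1552399
end_bp = 1556103

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 3705 1234
```

Under those assumptions the result agrees with the listed Gene Length (3705 bp) and Protein Length (1234 aa).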


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000765921
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTGCAT CTCTTTTCTC TGCTAACGGT GTCGCGGCGG TCACTGATTC ATGCCAGGGA 
TATGATGTCA AAGCGAGTTG TCAGGCCAGC AGGCAAAGCC TTTCAGGCAT TACGCAGGAC
TGGAGTATCG CTGATGGGCA ATGGCTGGTT TTTTCGGATA TGACCAATAA CGCCAGCGGT
GGGGCCGTAT TTTTGCAACA AGGAGCGGAA TTTTCACTAT TACCAGAAAA TGAAACTGGA
ATGACTCTGT TTGCCAATAA CACCGTTACA GGAGAATATA ATAACGGCGG GGCCATATTT
GCTAAAGAAA ACTCAACGCT GAATCTTACT GATGTTATTT TTTCCGGTAA CGTCGCAGGC
GGCTATGGTG GCGCAATCTA TTCTTCTGGT ACTAACGATA CTGGTGCCGT CGATTTACGT
GTCACTAACG CCATGTTTCG CAATAACATC GCTAATGATG GCAAAGGTGG CGCAATTTAT
ACCATTAATA ATGACGTTTA TTTAAGTGAT GTTATTTTTG ATAACAACCA GGCATATACA
TCAACAAGTT ACAGTGATGG CGATGGCGGG GCAATCGATG TTACCGATAA TAATAGCGAC
AGCAAGCATC CTTCAGGTTA TACGATAGTA AATAACACTG CCTTTACAAA TAACACTGCC
GAAGGTTATG GCGGGGCGAT ATATACCAAT AGCGTGACGG CTCCCTATCT TATTGATATT
TCTGTTGATG ACAGCTACAG CCAGAACGGA GGCGTGTTAG TCGATGAGAA CAATAGCGCA
GCAGGCTATG GAGATGGTCC TTCCTCTGCG GCGGGTGGCT TTATGTATCT CGGCTTAAGT
GAAGTTACCT TTGATATTGC CGACGGAAAA ACGCTGGTTA TTGGCAATAC AGAGAATGAC
GGAGCTGTTG ACTCTATTGC TGGTACCGGG TTAATCACCA AAACAGGTTC CGGCGATCTG
GTACTTTATG CAGATAACAA TGACTTTACT GGTGAGATGC AGATTGAAAA CGGTGAAGTT
ACCCTGGGCC GCAGCAACTC CCTGATGAAT GTCGGCGATA CGCATTGCCA GGACGATCCG
CAAGACTGCT ACGGTCTGAC GATAGGGAGT ATTGATCAGT ATCAGAATCA GGCTGAGCTA
AACGTTGGCT CGACCCAACA AACTTTTGTG CACGCATTGA CGGGCTTTCA GAATGGCACT
TTAAATATCG ATGCTGGTGG CAACGTTACT GTTAATCAGG GCAGTTTTGC TGGCATCATC
GAAGGTGCTG GTCAGCTCAC CATTGCGCAA AACGGCAGCT ACGTGCTGGC AGGGGCGCAG
TCGATGGCGC TAACCGGCGA TATAGTCGTT GATGATGGTG CGGTGCTTTC GCTGGAAGGC
GACGCGGCAG ATCTTACCGC TCTCCAGGAC GATCCGCAGT CGATCGTGTT AAACGGCGGT
GTGCTCGATC TCTCTGATTT CTCCACCTGG CAGAGCGGCA CATCATACAA CGATGGCCTT
GAAGTCAGTG GCAGCAGCGG AACGGTTATC GGCAGTCAGG ATGTGGTAGA TCTTGCAGGT
GGCGACAATT TGCATATCGG CGGCGACGGG AAAGATGGCG TCTACGTGGT GGTCGATGCG
AGCGACGGGC AGGTAAGTCT GGCAAACAAT AATAGTTATT TGGGCACAAC ACAAATCGCC
TCCGGTACGC TGATGGTGAG CGACAACTCG CAGCTTGGAG ATACCCACTA TAACCGCCAG
GTTATCTTTA CCGATAAGCA ACAAGAAAGC GTGATGGAGA TTACCTCCGA CGTTGACACG
CGTTCAGATG CGGCAGGCCA CGGACGTGAT ATTGAAATGC GCGCCGACGG TGAAGTGGCA
GTTGATGCGG GGGTAGACAC GCAGTGGGGC GCACTGATGG CTGACAGCAG CGGGCAGCAT
CAGGATGAGG GTAGCACATT GACTAAAACG GGGGCTGGTA CACTGGAGCT GACCGCCAGC
GGTACAACGC AGTCGGCGGT ACGTGTCGAA GAAGGCACCC TGAAAGGTGA TGTTGCTGAT
ATCCTTCCTT ATGCTTCGTC ACTGTGGGTT GGTGATGGGG CAACGTTCGT TACTGGCGCG
GATCAGGATA TTCAGTCAAT TGATGCTATT TCCAGCGGCA CTATCGACAT CAGCGATGGT
ACGGTTTTGC GCCTGACCGG GCAGGATACT TCCGTCGCCC TTAATGCCTC ACTATTTAAC
GGCGATGGGA CGCTGGTGAA TGCCACCGAT GGTGTGACGT TGACAGGTGA GCTTAATACC
AACCTTGAAA CTGACAGCCT GACTTATCTT TCCAACGTGA CGGTTAATGG CAATCTGACC
AATACGTCCG GTGCGGTTAG CCTGCAAAAT GGCGTCGCTG GCGATACGCT GACGGTAAAC
GGTGATTATA CCGGCGGCGG TACGCTACTG CTCGATAGCG AATTAAACGG CGATGACTCG
GTAAGCGATC AATTGGTGAT GAACGGTAAT ACTGCTGGCA ACACAACTGT GGTGGTTAAC
TCCATTACAG GGATTGGTGA GCCGACATCG ACAGGCATTA AAGTGGTTGA TTTCGCAGCT
GATCCCACGC AGTTTCAAAA CAATGCGCAG TTCAGTCTGG CAGGCAGCGG CTACGTCAAT
ATGGGAGCGT ATGACTACAC GCTGGTGGAA GATAACAACG ACTGGTATCT GCGATCGCAA
GAAGTAACGC CGCCATCGCC ACCTGAGCCA GACCCGACTC CCGATCCTGA TCCCACGCCG
GATCCTGACC CAACCCCCGA CCCGGCCCCT GTGCCTGTTT ACCAGCCGGT GCTGAATGCC
AAAGTTGGCG GTTATCTCAA TAACCTGCGG GCGGCAAATC AGGCGTTTAT GATGGAGCGA
CGCGATCACG CAGGTGGCGA TGGTCAGACG CTGAATTTAC GTGTTATCGG CGGAGATTAT
CATTACACAG CAGCGGGGCA ACTGGCTCAA CATGAAGACA CTTCTACGGT GCAACTTAGC
GGCGATCTGT TTAGCGGGCG CTGGGGCACG GATGGCGAGT GGATGCTTGG GATTGTTGGT
GGCTACAGCG ATAACCAGGG CGACAGCCGC TCGAATATGA CCGGAACTCG CGCCGATAAC
CAGAACCACG GTTATGCCGT TGGGCTGACA TCAAGCTGGT TTCAGCACGG TAATCAGAAG
CAAGGGGCCT GGCTGGATAG CTGGCTGCAA TACGCGTGGT TTAGCAATGA TGTTTCCGAA
CAAGAAGATG GCACAGATCA TTACCACTCG TCGGGGATTA TCGCCTCGCT GGAGGCGGGG
TATCAGTGGT TACCGGGGCG TGGTGTGGTG ATTGAACCGC AGGCGCAGGT GATTTATCAG
GGCGTGCAGC AGGATGATTT TACCGCCGCT AACCGTGCGC GCGTGTCACA ATCGCAGGGT
GATGATATTC AGACGCGGCT GGGTTTACAC AGCGAATGGC GTACCGCTGT TCATGTCATA
CCAACATTAG ATCTGAATTA TTATCACGAT CCCCATTCGA CGGAAATTGA AGAGGATGGC
AGCACTATCA GTGACGATGC GGTGAAGCAA CGGGGTGAAA TAAAAGTGGG AGTCAGGGGC
AATATCAGTC AGCGAGTGTC CCTGCGCGGC AGCGTGGCGT GGCAGAAAGG GAGTGATGAT
TTTGCCCAGA CGGCAGGGTT TTTGTCGATG ACGGTGAAAT GGTAA
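
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of that cleanup, not the method the database itself uses; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks from a printed sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGATTGCAT CTCTTTTCTC TGCTAACGGT GTCGCGGCGG TCACTGATTC ATGCCAGGGA"
seq = clean_sequence(first_line)

print(len(seq), seq[:3])  # prints: 60 ATG
print(round(gc_fraction(seq) * 100), "% GC in this fragment")
```

Passing the full sequence text through the same two functions would give values to compare against the Gene Length and GC content fields.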
 
Protein sequence
MIASLFSANG VAAVTDSCQG YDVKASCQAS RQSLSGITQD WSIADGQWLV FSDMTNNASG 
GAVFLQQGAE FSLLPENETG MTLFANNTVT GEYNNGGAIF AKENSTLNLT DVIFSGNVAG
GYGGAIYSSG TNDTGAVDLR VTNAMFRNNI ANDGKGGAIY TINNDVYLSD VIFDNNQAYT
STSYSDGDGG AIDVTDNNSD SKHPSGYTIV NNTAFTNNTA EGYGGAIYTN SVTAPYLIDI
SVDDSYSQNG GVLVDENNSA AGYGDGPSSA AGGFMYLGLS EVTFDIADGK TLVIGNTEND
GAVDSIAGTG LITKTGSGDL VLYADNNDFT GEMQIENGEV TLGRSNSLMN VGDTHCQDDP
QDCYGLTIGS IDQYQNQAEL NVGSTQQTFV HALTGFQNGT LNIDAGGNVT VNQGSFAGII
EGAGQLTIAQ NGSYVLAGAQ SMALTGDIVV DDGAVLSLEG DAADLTALQD DPQSIVLNGG
VLDLSDFSTW QSGTSYNDGL EVSGSSGTVI GSQDVVDLAG GDNLHIGGDG KDGVYVVVDA
SDGQVSLANN NSYLGTTQIA SGTLMVSDNS QLGDTHYNRQ VIFTDKQQES VMEITSDVDT
RSDAAGHGRD IEMRADGEVA VDAGVDTQWG ALMADSSGQH QDEGSTLTKT GAGTLELTAS
GTTQSAVRVE EGTLKGDVAD ILPYASSLWV GDGATFVTGA DQDIQSIDAI SSGTIDISDG
TVLRLTGQDT SVALNASLFN GDGTLVNATD GVTLTGELNT NLETDSLTYL SNVTVNGNLT
NTSGAVSLQN GVAGDTLTVN GDYTGGGTLL LDSELNGDDS VSDQLVMNGN TAGNTTVVVN
SITGIGEPTS TGIKVVDFAA DPTQFQNNAQ FSLAGSGYVN MGAYDYTLVE DNNDWYLRSQ
EVTPPSPPEP DPTPDPDPTP DPDPTPDPAP VPVYQPVLNA KVGGYLNNLR AANQAFMMER
RDHAGGDGQT LNLRVIGGDY HYTAAGQLAQ HEDTSTVQLS GDLFSGRWGT DGEWMLGIVG
GYSDNQGDSR SNMTGTRADN QNHGYAVGLT SSWFQHGNQK QGAWLDSWLQ YAWFSNDVSE
QEDGTDHYHS SGIIASLEAG YQWLPGRGVV IEPQAQVIYQ GVQQDDFTAA NRARVSQSQG
DDIQTRLGLH SEWRTAVHVI PTLDLNYYHD PHSTEIEEDG STISDDAVKQ RGEIKVGVRG
NISQRVSLRG SVAWQKGSDD FAQTAGFLSM TVKW