Gene EcHS_A2373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2373 
Symbol 
ID5592447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2381399 
End bp2385151 
Gene Length3753 bp 
Protein Length1250 aa 
Translation table11 
GC content51% 
IMG OID640921500 
Productadhesin 
Protein accessionYP_001459034 
Protein GI157161716 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat
[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATTA TCTTTCTACG CAAGGAGTAT TTATCTTTAC TCCCGTCAAT GATTGCATCT 
CTTTTCTCTG CTAACGGTGT CGCGGCGGTC ACTGATTCAT GCCAGGGATA TGATGTCAAA
GCGAGTTGTC AGGCCAGCAG GCAAAGCCTT TCAGGCATTA CGCAGGACTG GAGTATCGCT
GATGGGCAAT GGCTGGTTTT TTCGGATATG ACCAATAACG CCAGCGGTGG GGCCGTATTT
TTGCAACAAG GAGCGGAATT TTCACTATTA CCAGAAAATG AAACTGGAAT GACTCTGTTT
GCCAATAACA CCGTTACAGG AGAATATAAT AACGGCGGGG CCATATTTGC TAAAGAAAAC
TCAACGCTGA ATCTTACTGA TGTTATTTTT TCCGGTAACG TCGCAGGCGG CTATGGTGGC
GCAATCTATT CTTCTGGTAC TAACGATACT GGTGCCGTCG ATTTACGTGT CACTAACGCC
ATGTTTCGCA ATAACATCGC TAATGATGGC AAAGGTGGCG CAATTTATAC CATTAATAAT
GACGTTTATT TAAGTGATGT TATTTTTGAT AACAACCAGG CATATACATC AACAAGTTAC
AGTGATGGCG ATGGCGGGGC AATCGATGTT ACCGATAATA ATAGCGACAG CAAGCATCCT
TCAGGTTATA CGATAGTAAA TAACACTGCC TTTACAAATA ACACTGCCGA AGGTTATGGC
GGGGCGATAT ATACCAATAG CGTGACGGCT CCCTATCTTA TTGATATTTC TGTTGATGAC
AGCTACAGCC AGAACGGAGG CGTGTTAGTC GATGAGAACA ATAGCGCAGC AGGCTATGGA
GATGGTCCTT CCTCTGCGGC GGGTGGCTTT ATGTATCTCG GCTTAAGTGA AGTTACCTTT
GATATTGCCG ACGGAAAAAC GCTGGTTATT GGCAATACAG AGAATGACGG AGCTGTTGAC
TCTATTGCTG GTACCGGGTT AATCACCAAA ACAGGTTCCG GCGATCTGGT ACTTAATGCA
GATAACAATG ACTTTACTGG TGAGATGCAG ATTGAAAACG GTGAAGTTAC CCTGGGCCGC
AGCAACTCCC TGATGAATGT CGGCGATACG CATTGCCAGG ACGATCCGCA AGACTGCTAC
GGTCTGACGA TAGGGAGTAT TGATCAGTAT CAGAATCAGG CTGAGCTAAA CGTTGGCTCG
ACCCAACAAA CTTTTGTGCA CGCATTGACG GGCTTTCAGA ATGGCACTTT AAATATCGAT
GCTGGTGGCA ACGTTACTGT TAATCAGGGC AGTTTTGCTG GCATCATCGA AGGTGCTGGT
CAGCTCACCA TTGCGCAAAA CGGCAGCTAC GTGCTGGCAG GGGCGCAGTC GATGGCGCTA
ACCGGCGATA TAGTCGTTGA TGATGGTGCG GTGCTTTCGC TGGAAGGCGA CGCGGCAGAT
CTTACCGCTC TCCAGGACGA TCCGCAGTCG ATCGTGTTAA ACGGCGGTGT GCTCGATCTC
TCTGATTTCT CCACCTGGCA GAGCGGCACA TCATACAACG ATGGCCTTGA AGTCAGTGGC
AGCAGCGGAA CGGTTATCGG CAGTCAGGAT GTGGTAGATC TTGCAGGTGG CGACAATTTG
CATATCGGCG GCGACGGGAA AGATGGCGTC TACGTGGTGG TCGATGCGAG CGACGGGCAG
GTAAGTCTGG CAAACAATAA TAGTTATTTG GGCACAACAC AAATCGCCTC CGGTACGCTG
ATGGTGAGCG ACAACTCGCA GCTTGGAGAT ACCCACTATA ACCGCCAGGT TATCTTTACC
GATAAGCAAC AAGAAAGCGT GATGGAGATT ACCTCCGACG TTGACACGCG TTCAGATGCG
GCAGGCCACG GACGTGATAT TGAAATGCGC GCCGACGGTG AAGTGGCAGT TGATGCGGGG
GTAGACACGC AGTGGGGCGC ACTGATGGCT GACAGCAGCG GGCAGCATCA GGATGAGGGT
AGCACATTGA CTAAAACGGG GGCTGGTACA CTGGAGCTGA CCGCCAGCGG TACAACGCAG
TCGGCGGTAC GTGTCGAAGA AGGCACCCTG AAAGGTGATG TTGCTGATAT CCTTCCTTAT
GCTTCGTCAC TGTGGGTTGG TGATGGGGCA ACGTTCGTTA CTGGCGCGGA TCAGGATATT
CAGTCAATTG ATGCTATTTC CAGCGGCACT ATCGACATCA GCGATGGTAC GGTTTTGCGC
CTGACCGGGC AGGATACTTC CGTCGCCCTT AATGCCTCAC TATTTAACGG CGATGGGACG
CTGGTGAATG CCACCGATGG TGTGATGTTG ACAGGTGAGC TTAATACCAA CCTTGAAACT
GACAGCCTGA CTTATCTTTC CAACGTGACG GTTAATGGCA ATCTGACCAA TACGTCCGGT
GCGGTTAGCC TGCAAAATGG CGTCGCTGGC GATACGCTGA CGGTAAACGG TGATTATACC
GGCGGCGGTA CGCTACTGCT CGATAGCGAA TTAAACGGCG ATGACTCGGT AAGCGATCAA
TTGGTGATGA ACGGTAATAC TGCTGGCAAC ACAACTGTGG TGGTTAACTC CATTACAGGG
ATTGGTGAGC CGACATCGAC AGGCATTAAA GTGGTTGATT TCGCAGCTGA TCCCACGCAG
TTTCAAAACA ATGCGCAGTT CAGTCTGGCA GGCAGCGGCT ACGTCAATAT GGGAGCGTAT
GACTACACGC TGGTGGAAGA TAACAACGAC TGGTATCTGC GATCGCAAGA AGTAACGCCG
CCATCGCCAC CTGATCCAGA CCCGACTCCC GATCCTGATC CCACGCCGGA TCCTGATCCA
ACACCCGACC CGGAACCTAC GCCTGCTTAC CAGCCGGTGT TGAATGCCAA AGTTGGCGGT
TATCTCAATA ACCTGCGGGC GGCAAATCAG GCGTTTATGA TGGAGCGACG CGATCACGCA
GGTGGCGATG GTCAGACGCT GAATTTACGT GTTATCGGCG GAGATTATCA TTACACAGCA
GCGGGGCAAC TGGCTCAACA TGAAGACACT TCTACGGTGC AACTTAGCGG CGATCTGTTT
AGCGGGCGCT GGGGCACGGA TGGCGAGTGG ATGCTTGGGA TTGTTGGTGG CTACAGCGAT
AACCAGGGCG ACAGCCGCTC GAATATGACC GGAACTCGCG CCGATAACCA GAACCACGGT
TATGCCGTTG GGCTGACATC AAGCTGGTTT CAGCACGGTA ATCAGAAGCA AGGGGCCTGG
CTGGATAGCT GGCTGCAATA CGCGTGGTTT AGCAATGATG TTTCCGAACA AGAAGATGGC
ACAGATCATT ACCACTCGTC GGGGATTATC GCCTCGCTGG AGGCGGGGTA TCAGTGGTTA
CCGGGGCGTG GTGTGGTGAT TGAACCGCAG GCGCAGGTGA TTTATCAGGG CGTGCAGCAG
GATGATTTTA CCGCCGCTAA CCGTGCGCGC GTGTCACAAT CGCAGGGTGA TGATATTCAG
ACGCGGCTGG GTTTACACAG CGAATGGCGT ACCGCTGTTC ATGTCATACC AACATTAGAT
CTGAATTATT ATCACGATCC CCATTCGACG GAAATTGAAG AGGATGGCAG CACTATCAGT
GACGATGCGG TGAAGCAACG GGGTGAAATA AAAGTGGGAG TCACGGGCAA TATCAGTCAG
CGAGTGTCCC TGCGCGGCAG CGTGGCGTGG CAGAAAGGGA GTGATGATTT TGCCCAGACG
GCAGGGTTTT TGTCGATGAC GGTGAAATGG TAA
 
Protein sequence
MRIIFLRKEY LSLLPSMIAS LFSANGVAAV TDSCQGYDVK ASCQASRQSL SGITQDWSIA 
DGQWLVFSDM TNNASGGAVF LQQGAEFSLL PENETGMTLF ANNTVTGEYN NGGAIFAKEN
STLNLTDVIF SGNVAGGYGG AIYSSGTNDT GAVDLRVTNA MFRNNIANDG KGGAIYTINN
DVYLSDVIFD NNQAYTSTSY SDGDGGAIDV TDNNSDSKHP SGYTIVNNTA FTNNTAEGYG
GAIYTNSVTA PYLIDISVDD SYSQNGGVLV DENNSAAGYG DGPSSAAGGF MYLGLSEVTF
DIADGKTLVI GNTENDGAVD SIAGTGLITK TGSGDLVLNA DNNDFTGEMQ IENGEVTLGR
SNSLMNVGDT HCQDDPQDCY GLTIGSIDQY QNQAELNVGS TQQTFVHALT GFQNGTLNID
AGGNVTVNQG SFAGIIEGAG QLTIAQNGSY VLAGAQSMAL TGDIVVDDGA VLSLEGDAAD
LTALQDDPQS IVLNGGVLDL SDFSTWQSGT SYNDGLEVSG SSGTVIGSQD VVDLAGGDNL
HIGGDGKDGV YVVVDASDGQ VSLANNNSYL GTTQIASGTL MVSDNSQLGD THYNRQVIFT
DKQQESVMEI TSDVDTRSDA AGHGRDIEMR ADGEVAVDAG VDTQWGALMA DSSGQHQDEG
STLTKTGAGT LELTASGTTQ SAVRVEEGTL KGDVADILPY ASSLWVGDGA TFVTGADQDI
QSIDAISSGT IDISDGTVLR LTGQDTSVAL NASLFNGDGT LVNATDGVML TGELNTNLET
DSLTYLSNVT VNGNLTNTSG AVSLQNGVAG DTLTVNGDYT GGGTLLLDSE LNGDDSVSDQ
LVMNGNTAGN TTVVVNSITG IGEPTSTGIK VVDFAADPTQ FQNNAQFSLA GSGYVNMGAY
DYTLVEDNND WYLRSQEVTP PSPPDPDPTP DPDPTPDPDP TPDPEPTPAY QPVLNAKVGG
YLNNLRAANQ AFMMERRDHA GGDGQTLNLR VIGGDYHYTA AGQLAQHEDT STVQLSGDLF
SGRWGTDGEW MLGIVGGYSD NQGDSRSNMT GTRADNQNHG YAVGLTSSWF QHGNQKQGAW
LDSWLQYAWF SNDVSEQEDG TDHYHSSGII ASLEAGYQWL PGRGVVIEPQ AQVIYQGVQQ
DDFTAANRAR VSQSQGDDIQ TRLGLHSEWR TAVHVIPTLD LNYYHDPHST EIEEDGSTIS
DDAVKQRGEI KVGVTGNISQ RVSLRGSVAW QKGSDDFAQT AGFLSMTVKW