Gene ECH74115_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2121 
Symbol 
ID6969395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2030001 
End bp2031056 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content48% 
IMG OID643386019 
Productprotein HipA 
Protein accessionYP_002270508 
Protein GI209400663 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.218209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCAGAAA TAGGGCGAGA CAGCGTTGGT GCCGTGACGT TAATACCCGA AGACGAAACC 
GTAACGCGTC CGATAATGGC ATGGGAAAAG CTTACTGAAG TCAGACTTGA AGAAGTATTA
ACGGCTTATA AAGCAGATAT CCCGCTAGGC ATGATTAGAG AAGAAAATGA CTTTCGCATC
TCGGTTGCTG GCGCACAGGA GAAGACAGCA CTGCTCAGAA TAGGCAATGA CTGGTGCATT
CCGAAAGGAA TAACGCCGAC GACGCACATC ATTAAATTAC CGATTGGGGA AATCAGGCAG
CCCAATGCGA CGCTCGATCT CAGCCAAAGC GTTGATAATG AGTATTACTG TCTGCTGCTG
GCGAAAGAAC TTGGGTTGAA TGTCCCGGAC GCAGAAATCA TTAAAGCGGG AAATGTGCGC
GCGTTAGCGG TCGAACGTTT TGACAGGCGT TGGAATGCTG AGCGAACGGT TTTACTTCGC
TTGCCACAGG AGGATATGTG TCAGACATTC GGTTTACCTT CATCGGTGAA ATATGAATCA
GATGGAGGCC CAGGCATCGC GCGGATCATG GCTTTTTTGA TGGGGTTCAG CGAGGCGCTT
CGCGATCGTT ATGATTTTAT GAAATTCCAG GTCTTCCAGT GGTTGATTGG CGCAACGGAT
GGCCATGCAA AAAACTTCTC CGTATTTATT CAGGCTGGCG GCAGTTATCG ACTCACGCCA
TTTTACGACA TCATTTCAGC ATTTCCGGTC CTTGGCGGTA CGGGAATACA CATCAGCGAT
CTCAAACTGG CAATGGGGCT TAACGCATCC AAAGGCAAAA AAACGGCAAT CGATAAAATT
TATCCGCGAC ATTTTTTGGC GACAGCAAAG GTGCTGAGAT TCCCGGAAGT GCAGATGCAT
GAAATCCTGA GTGACTTTGC CAGAATGATT CCGGCAGCAC TGGATAACGT GAAGACTTCA
TTACCGACAG ATTTTCCAGA GAACGTGGTG ACGGCAGTTG AAACCAATGT GTTGAGGTTG
CACGGTCGGT TAAGCCGAGA ATACGGTATT AAGTAA
 
Protein sequence
MSEIGRDSVG AVTLIPEDET VTRPIMAWEK LTEVRLEEVL TAYKADIPLG MIREENDFRI 
SVAGAQEKTA LLRIGNDWCI PKGITPTTHI IKLPIGEIRQ PNATLDLSQS VDNEYYCLLL
AKELGLNVPD AEIIKAGNVR ALAVERFDRR WNAERTVLLR LPQEDMCQTF GLPSSVKYES
DGGPGIARIM AFLMGFSEAL RDRYDFMKFQ VFQWLIGATD GHAKNFSVFI QAGGSYRLTP
FYDIISAFPV LGGTGIHISD LKLAMGLNAS KGKKTAIDKI YPRHFLATAK VLRFPEVQMH
EILSDFARMI PAALDNVKTS LPTDFPENVV TAVETNVLRL HGRLSREYGI K