Gene Information |
Locus tag | ECH74115_2121 |
Symbol | |
ID | 6969395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2030001 |
End bp | 2031056 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643386019 |
Product | protein HipA |
Protein accession | YP_002270508 |
Protein GI | 209400663 |
COG category | [R] General function prediction only |
COG ID | [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes |
TIGRFAM ID | |
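The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the reported protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2030001, 2031056)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch reproduces the listed Gene Length (1056 bp) and Protein Length (351 aa).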
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.218209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCAGAAA TAGGGCGAGA CAGCGTTGGT GCCGTGACGT TAATACCCGA AGACGAAACC GTAACGCGTC CGATAATGGC ATGGGAAAAG CTTACTGAAG TCAGACTTGA AGAAGTATTA ACGGCTTATA AAGCAGATAT CCCGCTAGGC ATGATTAGAG AAGAAAATGA CTTTCGCATC TCGGTTGCTG GCGCACAGGA GAAGACAGCA CTGCTCAGAA TAGGCAATGA CTGGTGCATT CCGAAAGGAA TAACGCCGAC GACGCACATC ATTAAATTAC CGATTGGGGA AATCAGGCAG CCCAATGCGA CGCTCGATCT CAGCCAAAGC GTTGATAATG AGTATTACTG TCTGCTGCTG GCGAAAGAAC TTGGGTTGAA TGTCCCGGAC GCAGAAATCA TTAAAGCGGG AAATGTGCGC GCGTTAGCGG TCGAACGTTT TGACAGGCGT TGGAATGCTG AGCGAACGGT TTTACTTCGC TTGCCACAGG AGGATATGTG TCAGACATTC GGTTTACCTT CATCGGTGAA ATATGAATCA GATGGAGGCC CAGGCATCGC GCGGATCATG GCTTTTTTGA TGGGGTTCAG CGAGGCGCTT CGCGATCGTT ATGATTTTAT GAAATTCCAG GTCTTCCAGT GGTTGATTGG CGCAACGGAT GGCCATGCAA AAAACTTCTC CGTATTTATT CAGGCTGGCG GCAGTTATCG ACTCACGCCA TTTTACGACA TCATTTCAGC ATTTCCGGTC CTTGGCGGTA CGGGAATACA CATCAGCGAT CTCAAACTGG CAATGGGGCT TAACGCATCC AAAGGCAAAA AAACGGCAAT CGATAAAATT TATCCGCGAC ATTTTTTGGC GACAGCAAAG GTGCTGAGAT TCCCGGAAGT GCAGATGCAT GAAATCCTGA GTGACTTTGC CAGAATGATT CCGGCAGCAC TGGATAACGT GAAGACTTCA TTACCGACAG ATTTTCCAGA GAACGTGGTG ACGGCAGTTG AAACCAATGT GTTGAGGTTG CACGGTCGGT TAAGCCGAGA ATACGGTATT AAGTAA
Protein sequence | MSEIGRDSVG AVTLIPEDET VTRPIMAWEK LTEVRLEEVL TAYKADIPLG MIREENDFRI SVAGAQEKTA LLRIGNDWCI PKGITPTTHI IKLPIGEIRQ PNATLDLSQS VDNEYYCLLL AKELGLNVPD AEIIKAGNVR ALAVERFDRR WNAERTVLLR LPQEDMCQTF GLPSSVKYES DGGPGIARIM AFLMGFSEAL RDRYDFMKFQ VFQWLIGATD GHAKNFSVFI QAGGSYRLTP FYDIISAFPV LGGTGIHISD LKLAMGLNAS KGKKTAIDKI YPRHFLATAK VLRFPEVQMH EILSDFARMI PAALDNVKTS LPTDFPENVV TAVETNVLRL HGRLSREYGI K
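The gene sequence begins with TTG, yet the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this. The helper name `translate_cds` is hypothetical, the codon table is built from the standard amino-acid string for table 11, and the start-codon set is an assumption based on that table.

```python
# Codon table for translation table 11, built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumed start codons for table 11; each is read as Met when it initiates a CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(spaced_sequence: str) -> str:
    """Translate a CDS given 5'->3' on its coding strand.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Translation stops at the first stop codon, which is not included.
    """
    seq = "".join(spaced_sequence.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 39 bases of the gene sequence in the record above.
print(translate_cds("TTGTCAGAAA TAGGGCGAGA CAGCGTTGGT GCCGTGACG"))
```

For the first 39 bases of the gene sequence, this yields MSEIGRDSVGAVT, matching the start of the listed protein sequence.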