Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3959 |
Symbol | gutQ |
ID | 6969541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3657977 |
End bp | 3658942 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643387727 |
Product | D-arabinose 5-phosphate isomerase |
Protein accession | YP_002272170 |
Protein GI | 209396393 |
COG category | [M] Cell wall/membrane/envelope biogenesis [T] Signal transduction mechanisms |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAG CACTACTGAA CGCGGGACGT CAGACGTTAA TGCTGGAATT GCAGGAAGCA AGCCGTTTAC CGGAACGTCT GGGCGATGAT TTTGTTCGCG CCGCCAATAT CATCCTGCAC TGCGAAGGCA AAGTGGTGGT TTCGGGAATT GGCAAATCGG GCCACATTGG TAAGAAAATC GCCGCAACGC TTGCCAGTAC CGGCACACCG GCTTTTTTTG TCCATCCGGC AGAAGCGCTG CATGGCGATC TGGGGATGAT CGAAAGCCGC GATGTGATGC TGTTTATCTC TTACTCCGGT GGGGCGAAGG AACTGGATCT GATTATTCCG CGTCTGGAAG ACAAATCTAT CGCGCTGCTG GCGATGACCG GCAAACCGAC GTCACCGCTG GGCCTGGCGG CGAAAGCGGT GCTGGATATC TCCGTAGAAC GCGAAGCCTG CCCGATGCAC CTTGCGCCGA CCTCCAGCAC CGTCAATACC CTGATGATGG GTGACGCGCT GGCGATGGCG GTCATGCAGG CGCGCGGATT TAATGAAGAA GATTTTGCCC GCTCCCACCC TGCTGGAGCA CTGGGCGCTC GCTTGCTGAA TAAAGTGCAT CATCTGATGC GCCGTGACGA TGCCATCCCA CAGGTGGCGT TAACCGCCAG CGTGATGGAT GCTATGCTGG AACTCAGCCG CACCGGTCTG GGGCTGGTGG CGGTATGTGA CGCTCAACAA CAGGTACAAG GCGTCTTTAC CGACGGCGAT TTACGTCGCT GGCTGGTTGG CGGCGGCGCA CTCACAACGC CAGTCAATGA AGCGATGACG GTCGGCGGCA CCACGTTGCA ATCGCAAAGT CGCGCCATCG ACGCCAAAGA GATCCTGATG AAGCGCAAAA TCACTGCCGC ACCGGTGGTG GATGAAAACG GCAAACTCAC CGGCGCAATA AACCTGCAGG ATTTCTATCA GGCCGGGATT ATTTAA
|
Protein sequence | MSEALLNAGR QTLMLELQEA SRLPERLGDD FVRAANIILH CEGKVVVSGI GKSGHIGKKI AATLASTGTP AFFVHPAEAL HGDLGMIESR DVMLFISYSG GAKELDLIIP RLEDKSIALL AMTGKPTSPL GLAAKAVLDI SVEREACPMH LAPTSSTVNT LMMGDALAMA VMQARGFNEE DFARSHPAGA LGARLLNKVH HLMRRDDAIP QVALTASVMD AMLELSRTGL GLVAVCDAQQ QVQGVFTDGD LRRWLVGGGA LTTPVNEAMT VGGTTLQSQS RAIDAKEILM KRKITAAPVV DENGKLTGAI NLQDFYQAGI I
|
| |