Gene Information |
Locus tag | EcE24377A_2993 |
Symbol | gutQ |
ID | 5588744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2990482 |
End bp | 2991447 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640926640 |
Product | D-arabinose 5-phosphate isomerase |
Protein accession | YP_001464016 |
Protein GI | 157158310 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
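
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, although it is consistent with the lengths listed in this record) and that the last codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive.
start_bp = 2990482
end_bp = 2991447

# With inclusive coordinates, the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 966
print(remainder)       # 0
print(protein_length)  # 321
```

Both results agree with the Gene Length (966 bp) and Protein Length (321 aa) fields.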
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAG CACTACTGAA CGCGGGACGT CAGACGTTAA TGCTGGAGTT GCAGGAAGCA AGCCGTTTAC CGGAACGTCT GGGCGATGAT TTTGTTCGCG CCGCCAATAT CATCCTGCAC TGCGAAGGCA AAGTGGTGGT TTCGGGAATT GGCAAATCGG GCCACATTGG TAAGAAAATC GCCGCAACAC TTGCCAGTAC CGGCACACCG GCTTTTTTTG TCCATCCGGC AGAAGCACTG CACGGCGATC TGGGGATGAT TGAAAGCCGC GATGTGATGC TGTTTATCTC TTACTCCGGT GGCGCGAAGG AACTGGATCT GATTATTCCG CGTCTGGAAG ATAAATCTAT CGCGCTGCTG GCGATGACCG GCAAACCGAC GTCACCGCTG GGCCTGGCGG CAAAAGCGGT GCTGGATATC TCCGTAGAAC GCGAAGCCTG CCCGATGCAC CTTGCGCCGA CCTCCAGCAC CGTCAATACC CTGATGATGG GTGACGCGCT GGCGATGGCG GTCATGCAGG CGCGCGGTTT TAATGAAGAA GATTTTGCCC GCTCCCACCC AGCTGGGGCA CTAGGCGCTC GCTTGCTGAA TAAAGTGCAT CATCTGATGC GCCGTGATGA CGCTATCCCG CAGGTAGCGT TAGCCGCCAG CGTGATGGAT GCGATGCTGG AGCTCAGCCG CACCGGTCTG GGGCTGGTGG CGGTATGTGA CGCTCAACAA CAGGTACAAG GCGTCTTTAC CGACGGCGAT TTACGTCGCT GGCTGGTTGG CGGCGGCGCA CTCACCACGC CAGTTAATGA AGCGATGACG ACAGGCGGCA CCACATTACA GGCGCAAAGC CGTGCCATCG ACGCCAAAGA GATCCTGATG AAGCGCAAAA TCACTGCCGC ACCGGTGGTA GATGAAAACG GCAAACTCAC CGGCGCAATT AACTTGCAGG ATTTCTATCA GGCCGGGATT ATTTAA
|
Protein sequence | MSEALLNAGR QTLMLELQEA SRLPERLGDD FVRAANIILH CEGKVVVSGI GKSGHIGKKI AATLASTGTP AFFVHPAEAL HGDLGMIESR DVMLFISYSG GAKELDLIIP RLEDKSIALL AMTGKPTSPL GLAAKAVLDI SVEREACPMH LAPTSSTVNT LMMGDALAMA VMQARGFNEE DFARSHPAGA LGARLLNKVH HLMRRDDAIP QVALAASVMD AMLELSRTGL GLVAVCDAQQ QVQGVFTDGD LRRWLVGGGA LTTPVNEAMT TGGTTLQAQS RAIDAKEILM KRKITAAPVV DENGKLTGAI NLQDFYQAGI I
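
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes that the sequences are shown in space-separated blocks of 10 characters and that translation table 11 assigns the same amino acid to each codon as the standard genetic code (it differs in the set of permitted start codons, which this sketch does not handle). The function names `clean`, `translate`, and `gc_content` are hypothetical helpers, not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a forward-strand CDS, dropping one trailing stop codon.

    Alternative start codons (e.g. GTG, TTG) are not converted to M here.
    """
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGAGTGAAG CACTACTGAA CGCGGGACGT"))  # MSEALLNAGR
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CAGGCCGGGATTATTTAA"))  # QAGII
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to check the record for internal consistency; `gc_content` can be compared with the GC content field in the same manner.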
|
| |