Gene Information |
Locus tag | EcolC_1004 |
Symbol | gutQ |
ID | 6067627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1093284 |
End bp | 1094249 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641600412 |
Product | D-arabinose 5-phosphate isomerase |
Protein accession | YP_001724000 |
Protein GI | 170019046 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
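The coordinate, length, and GC fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1093284
END_BP = 1094249
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start_bp, end_bp):
    """Length of a closed, 1-based interval [start_bp, end_bp] (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp):
    """Number of amino acids for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence):
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)
```

With these definitions, `span_length(START_BP, END_BP)` reproduces the 966 bp gene length, and `expected_protein_length(GENE_LENGTH_BP)` reproduces the 321 aa protein length. Passing the Gene sequence field to `gc_percent` gives a value to compare against the reported GC content.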
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000464147 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000496759 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGAAG CACTACTGAA CGCGGGACGT CAGACGTTAA TGCTGGAGTT GCAGGAAGCA AGCCGTTTAC CGGAACGTCT GGGCGATGAT TTTGTTCGCG CCGCCAATAT CATCCTGCAC TGCGAAGGCA AAGTGGTGGT TTCGGGAATT GGCAAATCGG GCCACATTGG TAAGAAAATC GCCGCAACAC TTGCCAGTAC CGGCACACCG GCTTTTTTTG TCCATCCGGC AGAAGCACTG CACGGCGATC TGGGGATGAT TGAAAGCCGC GATGTGATGC TGTTTATCTC TTACTCCGGT GGCGCGAAGG AACTGGATCT GATTATTCCG CGTCTGGAAG ATAAATCTAT CGCGCTGCTG GCGATGACCG GCAAACCGAC GTCACCGCTG GGCCTGGCGG CAAAAGCGGT GCTGGATATC TCCGTAGAAC GCGAAGCCTG CCCGATGCAC CTTGCGCCGA CCTCCAGCAC CGTCAATACC CTGATGATGG GTGACGCGCT GGCGATGGCG GTCATGCAGG CGCGCGGATT TAATGAAGAA GATTTTGCCC GCTCCCACCC AGCCGGGGCA CTGGGCGCTC GCTTGCTGAA TAAAGTGCAT CATCTGATGC GCCGTGACGA TGCCATCCCA CAGGTGGCGT TAACCGCCAG CGTGATGGAT GCGATGCTGG AACTCAGCCG CACCGGTCTG GGGCTGGTGG CGGTATGTGA CGCTCAACAA CAGGTACAAG GCGTCTTTAC CGACGGCGAT TTACGTCGCT GGCTGGTAGG CGGCGGGGCA CTCACCACGC CAGTTAATGA AGCGATGACG ACAGGCGGCA CCACGTTACA GGCGCAAAGT CGCGCCATCG ACGCCAAAGA GATCCTGATG AAGCGCAAAA TCACTGCCGC ACCGGTGGTG GATGAAAACG GCAAACTCAC CGGCGCAATT AACCTGCAGG ATTTCTATCA GGCCGGGATT ATTTAA
|
Protein sequence | MSEALLNAGR QTLMLELQEA SRLPERLGDD FVRAANIILH CEGKVVVSGI GKSGHIGKKI AATLASTGTP AFFVHPAEAL HGDLGMIESR DVMLFISYSG GAKELDLIIP RLEDKSIALL AMTGKPTSPL GLAAKAVLDI SVEREACPMH LAPTSSTVNT LMMGDALAMA VMQARGFNEE DFARSHPAGA LGARLLNKVH HLMRRDDAIP QVALTASVMD AMLELSRTGL GLVAVCDAQQ QVQGVFTDGD LRRWLVGGGA LTTPVNEAMT TGGTTLQAQS RAIDAKEILM KRKITAAPVV DENGKLTGAI NLQDFYQAGI I
|
| |
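The protein sequence above can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do that in plain Python. It assumes the gene sequence is already given on the coding strand (it starts with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard amino-acid assignments, shared by translation table 11,
# listed in NCBI order with bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    Codons containing anything other than A, C, G, T are reported as 'X'.
    """
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE.get(bases[i:i + 3], "X")
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)
```

For example, the first three 10-base blocks of the gene sequence, `translate("ATGAGTGAAG CACTACTGAA CGCGGGACGT")`, give `MSEALLNAGR`, which matches the first block of the protein sequence.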