Gene Information |
Locus tag | EcSMS35_2832 |
Symbol | gutQ |
ID | 6144460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2905040 |
End bp | 2906005 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641617700 |
Product | D-arabinose 5-phosphate isomerase |
Protein accession | YP_001744855 |
Protein GI | 170680279 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [T] Signal transduction mechanisms |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation; [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
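
The coordinate and length fields in this record are related by simple arithmetic. The sketch below is not part of the source record and assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above (assumed 1-based, inclusive coordinates).
START_BP = 2905040
END_BP = 2906005


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len)                  # 966
print(protein_length(gene_len))  # 321
```

Both results agree with the Gene Length and Protein Length fields listed above.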
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.00103059 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGAAG CACTACTGAA CGCGGGACGT CAGACGTTAA TGCTGGAGTT ACAGGAAGCA AGCCGTTTAC CGGAACGTCT GGGCGATGAT TTTGTTCGCG CCGCCAATAT CATCCTGCAC TGCGAAGGCA AAGTGGTGGT TTCGGGAATT GGCAAATCGG GCCACATTGG TAAGAAAATC GCCGCAACGC TTGCCAGCAC CGGCACTCCT GCCTTTTTTG TCCATCCGGC AGAGGCATTG CACGGCGACC TGGGGATGAT CGAAAGCCGC GATGTGATGC TGTTTATCTC TTACTCCGGT GGCGCGAAGG AACTGGATCT GATTATTCCG CGTCTGGAAG ATAAATCTAT CGCGCTGCTG GCGATGACCG GCAAACCGAC GTCACCGCTG GGCCTGGCGG CGAAAGCGGT GCTGGATATC TCCGTAGAAC GCGAAGCCTG CCCGATGCAT CTTGCGCCGA CCTCCAGCAC CGTGAATACC CTGATGATGG GCGACGCGTT GGCGATGGCA GTCATGCAGG CGCGTGGTTT TAATGAAGAA GATTTTGCCC GCTCCCACCC AGCCGGGGCA CTGGGCGCTC GCTTGCTGAA TAAAGTGCAT CATCTGATGC GCCGTGATGA CGCTATTCCG CAGGTAGCGT TAACCGCCAG CGTGATGGAT GCGATGCTGG AACTCAGCCG CACCGGTCTG GGGCTGGTGG CGGTATGTGA CGCTCAGCAA CAGGTACAAG GCGTCTTTAC CGACGGTGAT TTACGTCGCT GGCTGGTTGG CGGCGGCGCA CTCACCACGC CAGTCAATGA AGCGATGACG GTCGGCGGCA CCACGTTGCA ATCGCAAAGC CGTGCCATCG ACGCCAAAGA AATCCTGATG AAACGCAAAA TCACCGCCGC ACCGGTAGTG GATGAAAACG GCAAACTCAC CGGCGCAATT AACCTGCAGG ATTTCTATCA GGCCGGGATT ATTTAA
|
Protein sequence | MSEALLNAGR QTLMLELQEA SRLPERLGDD FVRAANIILH CEGKVVVSGI GKSGHIGKKI AATLASTGTP AFFVHPAEAL HGDLGMIESR DVMLFISYSG GAKELDLIIP RLEDKSIALL AMTGKPTSPL GLAAKAVLDI SVEREACPMH LAPTSSTVNT LMMGDALAMA VMQARGFNEE DFARSHPAGA LGARLLNKVH HLMRRDDAIP QVALTASVMD AMLELSRTGL GLVAVCDAQQ QVQGVFTDGD LRRWLVGGGA LTTPVNEAMT VGGTTLQSQS RAIDAKEILM KRKITAAPVV DENGKLTGAI NLQDFYQAGI I
|
| |
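
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch, not the database's own tooling, of how one might strip the spacing, compute GC content, and translate the gene. It assumes the usual codon ordering (bases in T, C, A, G order) and relies on the fact that translation table 11 assigns the same amino acids as the standard code, differing only in which codons may act as starts. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final six bases of the gene sequence above.
print(translate("ATGAGTGAAG CACTACTGAA CGCGGGACGT"))  # MSEALLNAGR
print(translate("ATTTAA"))                            # I
print(gc_percent("ATGAGTGAAG"))                       # 40.0
```

The first ten residues and the final residue match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would be the way to check the GC content and Protein sequence fields.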