Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3575 |
Symbol | |
ID | 5594631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3552932 |
End bp | 3554158 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922692 |
Product | putative mutase |
Protein accession | YP_001460173 |
Protein GI | 157162855 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1015] Phosphopentomutase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGAT TTGTGGTGTT AGTGATTGAT AGCTTTGGCG TAGGGGCAAT GAAAGATGTC ACGCTGGTGC GTCCGCAAGA TGCGGGAGCG AATACCTGTG GTCACATCCT GAGCCAGTTG CCGCATTTGC AGCTACCAAC GCTGGAGAAG CTGGGGCTAA TCAACGCATT GGGTTATGCG CCAGGCGATA TGCAGCCGTC AGATTCCGCA ACCTGGGGCG TGGCAGAGCT GCAACATGAA GGTGGCGATA CCTTTATGGG GCATCAGGAA ATTTTAGGCA CGCGCCCGTT ACCGCCGCTG CGGATGCCTT TTTGCGATGT GATTGACCGT GTTGAGCAGG CATTAGTATC CGCCGGTTGG CAGGTGGAGC GCCGTGGCGA TGAACTGCAA TTTCTGTGGG TCAATCAGGC GGTTGCGATT GGCGATAATC TCGAGGCGGA TTTAGGCCAG GTCTATAACA TTACCGCCAA TCTCTCTGTG ATCTCTTTTG ACGACGCAAT CAAAATTGGT CGTATCGTGC GTGAGCAGGT ACAGGTCGGT CGGGTCATTA CATTTGGTGG CCTGTTAACC GACAGTCAAC GCATTCTCGA TGCCGCAGAA AGCAAAGAAG GGCGCTTTAT TGGTATCAAT GCGCCGCGTT CTGGCGCTTA TGACAACGGT TTCCAGGTCG TGCATATGGG CTATGGCGTC GATGAAAAAG TGCAGGTGCC ACAAAAACTG TATGAAGCAG GCGTGCCAAC CGTGCTGGTG GGTAAGGTGG CAGATATCGT CAACAATCCT TATGGCGTGA GCTGGCAAAA TCTGGTGGAT AGCCAGCGGA TTATGGATAT CACCCTCAAC GAATTTAACA CCCATCCGAC GGCGTTTATT TGCACCAACA TTCAGGAAAC CGACCTCGCT GGTCATGCAG AAGACGTCGC ACGTTATGCC GAACGTTTGC AGGTCGTTGA CCGTAACCTT GCCCGGCTTG TTGAGGCGAT GCAGCCAGAT GATTGCCTGG TCGTGATGGC GGATCACGGC AACGATCCGA CCATTGGTCA CAGCCACCAT ACCCGCGAAG TGGTGCCAGT GCTGGTTTAT CAGCAAGGGA TGATCGCTAC GCAGCTCGGT GTGCGCACCA CGCTTTCTGA TGTGGGGGCT ACCGTGTGTG AATTTTTCCG CGCGCCACCG CCACAAAATG GTCGCTCTTT TCTTTCCTCC CTCCGGTTTG CAGGAGACAC CCTATGA
|
Protein sequence | MARFVVLVID SFGVGAMKDV TLVRPQDAGA NTCGHILSQL PHLQLPTLEK LGLINALGYA PGDMQPSDSA TWGVAELQHE GGDTFMGHQE ILGTRPLPPL RMPFCDVIDR VEQALVSAGW QVERRGDELQ FLWVNQAVAI GDNLEADLGQ VYNITANLSV ISFDDAIKIG RIVREQVQVG RVITFGGLLT DSQRILDAAE SKEGRFIGIN APRSGAYDNG FQVVHMGYGV DEKVQVPQKL YEAGVPTVLV GKVADIVNNP YGVSWQNLVD SQRIMDITLN EFNTHPTAFI CTNIQETDLA GHAEDVARYA ERLQVVDRNL ARLVEAMQPD DCLVVMADHG NDPTIGHSHH TREVVPVLVY QQGMIATQLG VRTTLSDVGA TVCEFFRAPP PQNGRSFLSS LRFAGDTL
|
| |