Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0847 |
Symbol | |
ID | 5833337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 923526 |
End bp | 924632 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641366629 |
Product | CDP-glucose 4,6-dehydratase |
Protein accession | YP_001638323 |
Protein GI | 163850280 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02622] CDP-glucose 4,6-dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.307113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.210188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCGC TCAACCCCGA TCCTGCCTTC TGGGCGGGCA AGCGCGTGCT GCTCACCGGG CATACCGGCT TCAAGGGCGC GTGGCTGAGC CTGTGGCTCG CCCGGCTCGG CGCCCGCGTC ACCGGCTTCG CCCTTCCCCC TGAGACGCGG CCGAACCTGT TCGAGGCGAT CGCATTCCCG TCCGAGGACT CGCGCATCGG CGACATCCGC GATTTGCCGG CGCTCGCCGC GGCCGTGGCG GCTGCCGAGC CGGAGATCGT GATCCACATG GCGGCGCAGG CCCTGGTGCG GCCCTCCTAT ACCGATCCGG TGGGGACGTT CGCGATCAAC ACCATGGGCA GTGTTCACCT GCTGGAAGCG GTGCGGTTGG CCCCGAGCGT GCGCGCCGTC GTCGTCGTGA CGAGCGACAA GGCCTACGAG AACCGCGAAT GGCCCTATGC CTATCGCGAG ACCGAGGCGA TGGGCGGGCG CGATCCCTAC AGCGCCTCGA AGGGCTGTGC CGAACTCGTA ACGAGTGCCT ATCGCGCCTC GTTCTTCGGC GCGGGCGGCC ATCCGGCCCG GATCGCCAGC GCGCGGGCCG GCAACGTCAT CGGCGGCGGC GATTGGTCCC TCGACCGGCT GATCCCCGAT ATCGTGCGCG CCTTCGAGGC CGGGGACTCG GTCGAGATCC GCGCGCCGCA CGCGATCCGC CCGTGGCAGC ACGTGCTGGA ACCGCTGGCC GGCTACCTCA GGCTCGCCGA ATGCCTCGCG GGCGCCGACG GCGCCGCCTT CGCGGAGGGC TGGAATCTCG GGCCGGCGGA CGAGGATTGC CGGCCGGTCT CGTACCTCGT GGAGCGGCTG GCGCAGGGCT GGGGCGGGGG AGCCGGCTGG CACCTCTCGC AGAAGACCCA TCCTCACGAG GCGACATATC TCAAGGTCGA TGCCTCCAAG GCCCGCGCCC GCCTCGGCTG GGACCGGCGG CTGACCCTCG ACACGGCGCT CGACTGGACC GCCGCGTGGT ATCGCGCGGC CGCTTCCGGT GCCGATCCCC GCGCTCTGGC CGAGGCTGAG ATCGCGCGCT ACGAGGCGCT GGGCCAGCCT GGAGCAAAAG CCGGAGTCCA AGCGTGA
|
Protein sequence | MAALNPDPAF WAGKRVLLTG HTGFKGAWLS LWLARLGARV TGFALPPETR PNLFEAIAFP SEDSRIGDIR DLPALAAAVA AAEPEIVIHM AAQALVRPSY TDPVGTFAIN TMGSVHLLEA VRLAPSVRAV VVVTSDKAYE NREWPYAYRE TEAMGGRDPY SASKGCAELV TSAYRASFFG AGGHPARIAS ARAGNVIGGG DWSLDRLIPD IVRAFEAGDS VEIRAPHAIR PWQHVLEPLA GYLRLAECLA GADGAAFAEG WNLGPADEDC RPVSYLVERL AQGWGGGAGW HLSQKTHPHE ATYLKVDASK ARARLGWDRR LTLDTALDWT AAWYRAAASG ADPRALAEAE IARYEALGQP GAKAGVQA
|
| |