Gene Information |
Locus tag | Msil_3835 |
Symbol | |
ID | 7090763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 4197935 |
End bp | 4199173 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643467120 |
Product | 1-deoxy-D-xylulose 5-phosphate reductoisomerase |
Protein accession | YP_002364079 |
Protein GI | 217979932 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0743] 1-deoxy-D-xylulose 5-phosphate reductoisomerase |
TIGRFAM ID | [TIGR00243] 1-deoxy-D-xylulose 5-phosphate reductoisomerase |
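The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either point, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4197935
END_BP = 4199173
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Codons that encode amino acids, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # coordinates vs. gene length
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # gene length vs. protein length
```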
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0104927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG TCGTTCAACT GCGGCCCTCC CGCTCCGGCG AGGCGAGTTC GCCGGACGCC GATTCGCGCC GCGTCGTGCT GCTCGGCGCG ACCGGCTCCA TCGGCCGCTC GACCGTCGAG ATCATCAACG GCGCCAACGG CGCCTTCAGC GTGGCCGCCG TCGCCGGCGG CAGCGACGCC AAGGCGCTCG CCGCCGTCGC CATCGAGCTT GGCGCGGAAT TCGCCGCCCT TGCGGATCCG TCCGGCTATG CGGATTTGAA AGCGGCGCTG TCTGGAACCG CGATCGAGGC GGCGGCCGGT CCCGAGGCGG TGATCGAAGC GGCGCTGCGG CCGGCCGACA TCGTTGTCGG CGCCATCGCC GGCGCCGCCG GCGTCGCGCC GACCTTCGCC GCGCTGGCCG CCGGGCGCAT CATCGCGCTC GCGAATAAGG AATGCCTCGT CTGCGCCGGG CCGGCCTTCA TGCGTCAGGC GGCCGCGTCG GGAACCCGGC TTTTGCCGGT CGACAGCGAG CACAACGCCA TTTTTCAGGC GATGGGCGAC GCCGAATTGT CGTCCGTCGA GATGATCACC CTGACGGCCT CCGGCGGACC ATTCCGAACC TGGAGCGTCG AGGCCATCGC CGCCGCGACG CCGGAGCAGG CGCTCGCCCA TCCAAACTGG TCGATGGGAC CGAAAGTCAC GATCGATTCG GCCGGCCTGA TGAACAAGGG GCTCGAAGTC ATCGAGGCGC ATTATCTGTT CGGGATCGAG ACGGCGCGGC TCGACGTGCT GGTTCACGCG CAATCGGTGG TGCACGGCCT CGTCGCCTTC TCAGACGGAT CGGTCTCGGC CGGACTCGCC GCGCCCGACA TGAAGGTGCC GATCGCCCAT TGCCTGTCGC ATCCGCGGCG TCTCGTTACC AAGGCGCGCC GGCTCGATCT CGCGGCGATC GGCCAGCTCA CCTTCGAGCG TCCGGATTTT AACCGCTTTC CGGCCCTGCG CGTCGCCCTC GACGCCTTGC GCGCCGGCCG CGGTCTTCCT ACTGTATTGA ACGCCGCCAA TGAGATTGCC GTGGAGGCGT TCCTGAAGCG GCGCATTTCG TTCCACGAGA TCGCCAAAAT CGTCGAGCAG GCTTGCGAGG CGGCGCTGTC CGATGGCGTC GCGCGCGAGC CGGAGACGAT CGACGAGGCG CTCGCCATTG ATTTCGCCGT GCGCGAACGC ACGCGCGCCC GTTTGCCGGG GACCGCGGCG GCGTCATAA
|
Protein sequence | MKKVVQLRPS RSGEASSPDA DSRRVVLLGA TGSIGRSTVE IINGANGAFS VAAVAGGSDA KALAAVAIEL GAEFAALADP SGYADLKAAL SGTAIEAAAG PEAVIEAALR PADIVVGAIA GAAGVAPTFA ALAAGRIIAL ANKECLVCAG PAFMRQAAAS GTRLLPVDSE HNAIFQAMGD AELSSVEMIT LTASGGPFRT WSVEAIAAAT PEQALAHPNW SMGPKVTIDS AGLMNKGLEV IEAHYLFGIE TARLDVLVHA QSVVHGLVAF SDGSVSAGLA APDMKVPIAH CLSHPRRLVT KARRLDLAAI GQLTFERPDF NRFPALRVAL DALRAGRGLP TVLNAANEIA VEAFLKRRIS FHEIAKIVEQ ACEAALSDGV AREPETIDEA LAIDFAVRER TRARLPGTAA AS
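Both sequences are printed in space-separated blocks of 10 residues, so the spacing has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes a GC fraction, and translates the nucleotides with the codon assignments used by translation table 11. Table 11 also permits alternative start codons that are read as Met at the initiation position; the sketch ignores this because this gene starts with ATG. All function names are hypothetical, and only the first three blocks and the final nine bases of the gene sequence are used as input.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-residue blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    head = join_blocks("ATGAAAAAAG TCGTTCAACT GCGGCCCTCC")  # first 30 bases of the gene
    tail = "GCGTCATAA"                                      # last 9 bases of the gene
    print(translate(head))   # compare with the first block of the protein sequence
    print(translate(tail))   # compare with the last residues of the protein sequence
    print(gc_fraction(head))
```

Applying `join_blocks` to the full gene sequence and passing the result to `gc_fraction` and `translate` would allow a comparison with the GC content and protein sequence listed in the record.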