Gene Information |
Locus tag | Msil_3234 |
Symbol | |
ID | 7090649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3546858 |
End bp | 3548708 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643466542 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002363503 |
Protein GI | 217979356 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
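The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS span includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record itself. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3546858
end_bp = 3548708
gene_length_bp = 1851
protein_length_aa = 616

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the final codon is the stop codon, which is not counted in the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.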
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.00391519 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTCCCT ATCGCTCTCG CACCACGACG CATGGACGCA ATATGGCCGG CGCGCGCGGA CTTTGGCGCG CCACCGGCAT GAAAGACGGC GATTTCGGCA AGCCGATCAT CGCCGTCGCC AATTCCTTCA CGCAATTCGT GCCGGGCCAC GTCCATCTGA AGGACCTTGG CCAGCTGGTC GCGCGCGAGA TCGAGGCGGC GGGCGGAGTC GCTAAGGAAT TCAACACCAT TGCGGTCGAC GACGGCATCG CCATGGGTCA TGACGGCATG CTCTATAGCC TGCCCTCGCG CGAGATCATC GCCGACTCGG TCGAATATAT GGTCAACGCC CATTGCGCCG ACGCGATCGT CTGCATCTCG AATTGTGACA AGATCACGCC CGGCATGCTG ATGGCCTCGC TCCGTCTGAA CATCCCGGTC GTGTTCGTTT CCGGCGGCCC GATGGAGGCC GGCAAAGTGT TGCTCGGCGG CAAGACGAAG GCGCTCGATC TCGTCGACGC CATGGTCGCC GCCGCCGACG ACAAAGTGTC TGAAGCCGAT GTCGCGGCGA TCGAGCGCTC GGCCTGCCCG ACCTGCGGCT CGTGCTCGGG CATGTTCACC GCCAATTCGA TGAATTGCCT CACCGAGGCG CTCGGTCTTG CGCTGCCGGG AAATGGCTCG ATGCTGGCGA CGCATGGCGA TCGCAAGCGC CTCTTCGTCG AGGCCGGTCA TCTCATCGTC GACCTTGCCA GGCGCTATTA CGAGCAGGAC GATTCGTCGG TGCTGCCGCG CTCGATCGCG AGCTTTGCCG CTTTCGAGAA TGCGATGACG CTCGACATCT CGATGGGCGG CTCGACCAAT ACCGTCCTGC ATCTTCTCGC CGCCGCGCAT GAAGGCGAGA TCGACTTTAC CATGGCCGAC ATCGACCGAC TGTCGCGGCG CGTTCCCGTC CTTTGCAAGG TCGCGCCCGC GGTCGCCGAC GTGCATGTCG AGGATGTGCA TCGCGCCGGC GGCGTCATGG CGATCCTCGG CGAACTCGAA CGCGCCGGGC TGATCCATGG CGATCTGCCT GTGGTGCACG CGCCGAGCCT TAAGGAGGCG CTGGAACGCT GGGATCTCCG GCGCACGTCC AGCGAATCCG TCACTGAATT TTTCCGCGCC GCGCCGGGCG GCGTGCCGAC CCAGGTCGCA TTCAGCCAGA ACGCGCGCTG GAAAGAGACC GATGTCGACC GCGCAGGCGG CGTCATCCGC GACGTCGAAC ACGCCTTTTC CAAGGATGGC GGCCTCGCCG TGCTCTATGG CAATCTTGCC GAGGATGGCG CAATCGTGAA GACGGCCGGC GTGGACGCGT CCATCCTCGT CTTTTCCGGC CCCGCGCGCG TGTTCGAGAG TCAGGACGCC GCCGTCGAGG CGATTCTCGC CAATCAGATC AAGCCGGGCG ACGTCCTGGT GATCCGTTAT GAAGGGCCGC GCGGCGGACC CGGCATGCAG GAAATGCTCT ATCCGACCAG CTATCTGAAA TCGAAAGGCC TTGGCAAAGC CTGCGCCCTG ATCACCGACG GGCGGTTTTC CGGCGGCACT TCAGGTCTCT CGATCGGCCA TGTGTCGCCG GAAGCGGCGG AAGGCGGTTT GATCGGCCTC GTCGAGGAGG GCGATTCGAT CCAGATCGAC ATCCCGAACC GGCGCCTGCA TCTCGACATT TCCGATGAGG CGCTCGCCCA TCGCCGCACC GCCATGGCGG AAAAGGGCAA GGGCGCATGG AAGCCGGCGC ATCGGACGCG AAAAGTCTCG ACCGCGCTCA GGGCCTACGC GGCGATGGCG 
ACCAGCGCCG CGCGGGGCGC CGTGCGCGAC GTCGATCAGC TGTTTCACTA A
|
Protein sequence | MPPYRSRTTT HGRNMAGARG LWRATGMKDG DFGKPIIAVA NSFTQFVPGH VHLKDLGQLV AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSREII ADSVEYMVNA HCADAIVCIS NCDKITPGML MASLRLNIPV VFVSGGPMEA GKVLLGGKTK ALDLVDAMVA AADDKVSEAD VAAIERSACP TCGSCSGMFT ANSMNCLTEA LGLALPGNGS MLATHGDRKR LFVEAGHLIV DLARRYYEQD DSSVLPRSIA SFAAFENAMT LDISMGGSTN TVLHLLAAAH EGEIDFTMAD IDRLSRRVPV LCKVAPAVAD VHVEDVHRAG GVMAILGELE RAGLIHGDLP VVHAPSLKEA LERWDLRRTS SESVTEFFRA APGGVPTQVA FSQNARWKET DVDRAGGVIR DVEHAFSKDG GLAVLYGNLA EDGAIVKTAG VDASILVFSG PARVFESQDA AVEAILANQI KPGDVLVIRY EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAAEGGLIGL VEEGDSIQID IPNRRLHLDI SDEALAHRRT AMAEKGKGAW KPAHRTRKVS TALRAYAAMA TSAARGAVRD VDQLFH
|
| |
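As a rough consistency check between the two sequence fields, the ends of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for all 64 codons (table 11 differs only in which codons may act as starts), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same amino acid
# assignments as the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 21 nucleotides, copied from the Gene sequence field.
first_30 = "ATGCCTCCCT ATCGCTCTCG CACCACGACG"
last_21 = "GTCGATCAGC TGTTTCACTA A"

print(translate(first_30))  # MPPYRSRTTT
print(translate(last_21))   # VDQLFH*
```

The first ten residues and the final six residues agree with the Protein sequence field, and the gene ends in the stop codon TAA.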