Gene Msil_3234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3234 
Symbol 
ID7090649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3546858 
End bp3548708 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content65% 
IMG OID643466542 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002363503 
Protein GI217979356 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00391519 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTCCCT ATCGCTCTCG CACCACGACG CATGGACGCA ATATGGCCGG CGCGCGCGGA 
CTTTGGCGCG CCACCGGCAT GAAAGACGGC GATTTCGGCA AGCCGATCAT CGCCGTCGCC
AATTCCTTCA CGCAATTCGT GCCGGGCCAC GTCCATCTGA AGGACCTTGG CCAGCTGGTC
GCGCGCGAGA TCGAGGCGGC GGGCGGAGTC GCTAAGGAAT TCAACACCAT TGCGGTCGAC
GACGGCATCG CCATGGGTCA TGACGGCATG CTCTATAGCC TGCCCTCGCG CGAGATCATC
GCCGACTCGG TCGAATATAT GGTCAACGCC CATTGCGCCG ACGCGATCGT CTGCATCTCG
AATTGTGACA AGATCACGCC CGGCATGCTG ATGGCCTCGC TCCGTCTGAA CATCCCGGTC
GTGTTCGTTT CCGGCGGCCC GATGGAGGCC GGCAAAGTGT TGCTCGGCGG CAAGACGAAG
GCGCTCGATC TCGTCGACGC CATGGTCGCC GCCGCCGACG ACAAAGTGTC TGAAGCCGAT
GTCGCGGCGA TCGAGCGCTC GGCCTGCCCG ACCTGCGGCT CGTGCTCGGG CATGTTCACC
GCCAATTCGA TGAATTGCCT CACCGAGGCG CTCGGTCTTG CGCTGCCGGG AAATGGCTCG
ATGCTGGCGA CGCATGGCGA TCGCAAGCGC CTCTTCGTCG AGGCCGGTCA TCTCATCGTC
GACCTTGCCA GGCGCTATTA CGAGCAGGAC GATTCGTCGG TGCTGCCGCG CTCGATCGCG
AGCTTTGCCG CTTTCGAGAA TGCGATGACG CTCGACATCT CGATGGGCGG CTCGACCAAT
ACCGTCCTGC ATCTTCTCGC CGCCGCGCAT GAAGGCGAGA TCGACTTTAC CATGGCCGAC
ATCGACCGAC TGTCGCGGCG CGTTCCCGTC CTTTGCAAGG TCGCGCCCGC GGTCGCCGAC
GTGCATGTCG AGGATGTGCA TCGCGCCGGC GGCGTCATGG CGATCCTCGG CGAACTCGAA
CGCGCCGGGC TGATCCATGG CGATCTGCCT GTGGTGCACG CGCCGAGCCT TAAGGAGGCG
CTGGAACGCT GGGATCTCCG GCGCACGTCC AGCGAATCCG TCACTGAATT TTTCCGCGCC
GCGCCGGGCG GCGTGCCGAC CCAGGTCGCA TTCAGCCAGA ACGCGCGCTG GAAAGAGACC
GATGTCGACC GCGCAGGCGG CGTCATCCGC GACGTCGAAC ACGCCTTTTC CAAGGATGGC
GGCCTCGCCG TGCTCTATGG CAATCTTGCC GAGGATGGCG CAATCGTGAA GACGGCCGGC
GTGGACGCGT CCATCCTCGT CTTTTCCGGC CCCGCGCGCG TGTTCGAGAG TCAGGACGCC
GCCGTCGAGG CGATTCTCGC CAATCAGATC AAGCCGGGCG ACGTCCTGGT GATCCGTTAT
GAAGGGCCGC GCGGCGGACC CGGCATGCAG GAAATGCTCT ATCCGACCAG CTATCTGAAA
TCGAAAGGCC TTGGCAAAGC CTGCGCCCTG ATCACCGACG GGCGGTTTTC CGGCGGCACT
TCAGGTCTCT CGATCGGCCA TGTGTCGCCG GAAGCGGCGG AAGGCGGTTT GATCGGCCTC
GTCGAGGAGG GCGATTCGAT CCAGATCGAC ATCCCGAACC GGCGCCTGCA TCTCGACATT
TCCGATGAGG CGCTCGCCCA TCGCCGCACC GCCATGGCGG AAAAGGGCAA GGGCGCATGG
AAGCCGGCGC ATCGGACGCG AAAAGTCTCG ACCGCGCTCA GGGCCTACGC GGCGATGGCG
ACCAGCGCCG CGCGGGGCGC CGTGCGCGAC GTCGATCAGC TGTTTCACTA A
 
Protein sequence
MPPYRSRTTT HGRNMAGARG LWRATGMKDG DFGKPIIAVA NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSREII ADSVEYMVNA HCADAIVCIS
NCDKITPGML MASLRLNIPV VFVSGGPMEA GKVLLGGKTK ALDLVDAMVA AADDKVSEAD
VAAIERSACP TCGSCSGMFT ANSMNCLTEA LGLALPGNGS MLATHGDRKR LFVEAGHLIV
DLARRYYEQD DSSVLPRSIA SFAAFENAMT LDISMGGSTN TVLHLLAAAH EGEIDFTMAD
IDRLSRRVPV LCKVAPAVAD VHVEDVHRAG GVMAILGELE RAGLIHGDLP VVHAPSLKEA
LERWDLRRTS SESVTEFFRA APGGVPTQVA FSQNARWKET DVDRAGGVIR DVEHAFSKDG
GLAVLYGNLA EDGAIVKTAG VDASILVFSG PARVFESQDA AVEAILANQI KPGDVLVIRY
EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAAEGGLIGL
VEEGDSIQID IPNRRLHLDI SDEALAHRRT AMAEKGKGAW KPAHRTRKVS TALRAYAAMA
TSAARGAVRD VDQLFH