Gene M446_5056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5056 
Symbol 
ID6134784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5539236 
End bp5540969 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content71% 
IMG OID641645192 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001771817 
Protein GI170743162 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.644776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0033961 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGCGC GTCACACCAT CGACAAGTCG AGGCTCCCCA GCCGCCACGT CACCGAGGGA 
CCGGCGCGGG CGCCGCACCG CTCCTACCTC TACGCGATGG GGCTCACGCG CGAGCAGATC
CACCAGCCTC TGGTCGGCGT CGCCTCGTGC TGGAACGAGG CCGCGCCCTG CAACATCGCC
CTGATGCGCC AAGCGCAGGC GGTGAAGAAG GGCGTGGCCG CCGCCGCCGG CACGCCGCGC
GAATTCTGCA CCATCACGGT GACGGACGGC ATCGCCATGG GCCACCGGGG CATGCGCGCC
TCCCTGCCCT CCCGCGAGGT CATCGCGGAT TCGGTCGAGC TGACCATGCG CGGCCACGCC
TACGACGCCC TCGTGGGCCT CGCCGGCTGC GACAAGTCCC TGCCCGGCAT GATGATGGCG
ATGGTGCGCC TCAACGTGCC GTCGATCTTC ATCTATGGCG GCTCGATCCT GCCCGGCACC
TTCCGGGGCC GGCCCGTCAC CGTCCAGGAC CTGTTCGAGG CGGTGGGCAA GCACTCGGTC
GGCGCGATGA GCGACGAGGA TCTCGACGAG CTGGAGCAGG TCGCCTGCCC CTCGGCCGGG
GCCTGCGGGG CGCAGTTCAC CGCCAACACC ATGGCGACCG TCTCCGAGGC GATCGGCCTC
GCGCTGCCCT ACTCGGCCGG CGCGCCGGCG CCCTACGAGA TCCGCGACAA GTTCTGCGCC
GCGGCCGGCG AGATGGTGAT GGACCTGCTC GCCAGGAACA TCCGCCCGCG CGACATCGTC
ACCCGCCGGG CGCTGGAGAA CGCCGCCACC GTGGTGGCGG CCTCCGGCGG CTCGACCAAC
GCGGCGCTGC ACCTGCCGGC GATCGCGCAC GAGGCGGGCA TCTCCTTCGA CCTGTTCGAC
GTCGCCGAGA TCTTCAAGCG CACCCCCTAC GTCGCGGACC TGAAGCCGGG CGGGCGCTAC
GTCGCCAAGG ACCTGTTCGA GGTCGGCGGC ATCCCGCTGC TGATGAAGAC CCTCCTCGAC
CACGGCTTCC TGCACGGCGA CTGCATGACC GTGACGGGCC GCACCATCGC CGAGAACCTC
GCCAAGGTCG CCTGGAACGA CCAGCAGGAC GTGGTGCGCC CGGCCAACAC CCCGATCACC
CCGACCGGCG GCGTGGTCGG CCTGAAGGGC AACCTCGCCC CCGAGGGCGC GATCGTGAAG
GTGGCCGGCA TGGCGCCGGA CCGCCAGGTC TTCGCCGGTC CCGCCAGGGT CTTCGACACC
GAGGAGGCCT GCTTCGAGGC GGTGCAGAAC CGCCAGTACA AGGAGGGCGA CGTTCTCGTC
ATCCGCTACG AGGGTCCGAA GGGCGGCCCC GGCATGCGCG AGATGCTGGC GACCACGGCC
GCCCTCTACG GCCAGGGCAT GGGCGACAAG GTCGCGCTCA TCACCGACGG TCGCTTCTCC
GGCGCGACCC GCGGCTTCTG CGTCGGCCAT GTCGGCCCGG AGGCGGCGGT GGGCGGCCCG
ATCGGGCTGC TCAAGGACGG CGACATCATC CGCCTCGACG CGATCCAGGG CACGCTCACG
GTCGACCTCT CGGACGAGGA ACTGGCCGAG CGGCGCAAGG CCTGGGCCCC GCGCGGCAAC
GAGGCGACCT CCGGCTATCT CTGGAAATAC GCGCAGACCG TCGGCCCGGC GGTGAACGGC
GCGGTGACGC ATCCGGGCGG TGCCCAGGAG ACGCTCGCCT ATGCGGATGT GTGA
 
Protein sequence
MDARHTIDKS RLPSRHVTEG PARAPHRSYL YAMGLTREQI HQPLVGVASC WNEAAPCNIA 
LMRQAQAVKK GVAAAAGTPR EFCTITVTDG IAMGHRGMRA SLPSREVIAD SVELTMRGHA
YDALVGLAGC DKSLPGMMMA MVRLNVPSIF IYGGSILPGT FRGRPVTVQD LFEAVGKHSV
GAMSDEDLDE LEQVACPSAG ACGAQFTANT MATVSEAIGL ALPYSAGAPA PYEIRDKFCA
AAGEMVMDLL ARNIRPRDIV TRRALENAAT VVAASGGSTN AALHLPAIAH EAGISFDLFD
VAEIFKRTPY VADLKPGGRY VAKDLFEVGG IPLLMKTLLD HGFLHGDCMT VTGRTIAENL
AKVAWNDQQD VVRPANTPIT PTGGVVGLKG NLAPEGAIVK VAGMAPDRQV FAGPARVFDT
EEACFEAVQN RQYKEGDVLV IRYEGPKGGP GMREMLATTA ALYGQGMGDK VALITDGRFS
GATRGFCVGH VGPEAAVGGP IGLLKDGDII RLDAIQGTLT VDLSDEELAE RRKAWAPRGN
EATSGYLWKY AQTVGPAVNG AVTHPGGAQE TLAYADV