Gene Mnod_0453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_0453 
Symbol 
ID7302615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp508211 
End bp509926 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content71% 
IMG OID643598232 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002495798 
Protein GI220920497 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.107529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGG GGTTGCGCAA GGGGCTCACG AGCTACGGCG ATGCCGGTTT CTCGCTCTTC 
CTCCGCAAGG CCTTCATCAA GGCCATGGGC TATTCCGACG ACGCGCTGAA CCGGCCGATC
GTCGGCATCA CCAACACCTA TAGCGACTAC AATCCCTGCC ACGGCAATGG TCCGGCGCTG
GTCGAGGCGG CCAAGCGCGG CGTGATGCTG GCTGGCGCCA TGCCGATGGT GTTCCCGACC
ATCTCGATCC ACGAGAGCTT CGCCCATCCG ACCTCCATGT TCCTGCGCAA CCTGATGGCG
ATGGACACGG AGGAGATGAT CCGGGCCCAG CCGATGGACG CTGTCATCGT CATCGGCGGC
TGCGACAAGA CCCTGCCGGC CCAGATCATG GCCGCGGCGA GCGTCGACCT GCCGACCGTG
GTGATCCCGG TCGGGCCCAT GGTGGTGGGG CACCACAAGG GCGAAGTGCT CGGCGCCTGC
ACGGATTGCC GCCGCCTCTG GGCCGCCCAT CGGGCCGGCG CGATCGACGA GCAGGAGATC
GAGATGGTGA ACGGGCGCCT CGCGCCCTCC GTCGGCACCT GCATGGTGAT GGGCACGGCC
TCCACCATGG CCTGCCTCAC CGAGGCGATG GGTCTGTCGC TGCCGCTCTC GGCGACGATT
CCCGCCCCTC ATGCGGAGCG CGTGCGCCTC GCCGAGGCGA GCGGGCGGCG TGCGGCCGAG
ATGGCGGTCG CCGGGGGGCC GCGTCCGAGC GCCATCCTGA CGCCGGCCGC CTTCCGCAAC
GCGCAGACCG TGCTCCAGGC GATCGGCGGC TCGACGAACG GGCTCATCCA CCTCACGGCC
ATAGCCAGGC GCGTCCGCGC CGAGATCGAC CTCGACGCCT TCGACGCGAT CGGCCGCGCG
GTGCCGGTGC TGGTCGACCT CAAACCCTCG GGCGACCACT ACATGGAGCA TTTCCACCAT
GCGGGCGGGG TGCCCCGCCT GCTTGCGGAA CTCGGCGACC TCATCGACCT CGACGTGCCG
ACCGTGGCGG GAGAAAAACT GCGCGACGTC GTGGCGGCGG CCGAGATTGT GCCGGGCCAG
ACCGTCATCC GCAGCCCTGC CGATCCGATC AAGCCGACGG GCGGGCTCGC CGTGCTGCGC
GGCAACCTCG CGCCCCGGGG CGCGCTCATC AAGCACGCGG CGGCGAGCGA GCGCCTCCTC
CAGCACACGG GGCGGGCGCT GGTGTTCGAA TCGATCCCCG AGATGGCCGC GCGGATCGAC
GACCCGGATC TCGACGTCTC GCCCGACGAC GTGCTGGTGC TGTGCAATGC CGGTCCCAAG
GGGGCGCCCG GCATGCCGGA GGCGGGCTAC CTGCCGATCC CCAAGAAGCT CGCTCGCCAG
GGCGTGAAGG ACATGGTGCG CATCTCGGAT GCGCGGATGA GCGGCACGGC CTTCGGCACG
ATCGTGCTGC ACGTCACCCC GGAATCCGCG GTGGGCGGCC CGCTCGCCCT GGTGCGGACC
GGCGACAGGA TCCGCCTCGA CGTGGCCGGG CGGCGCATCG ACCTCCTGGT GGACGAGGCG
GAACTCGCCC GGCGCGCGGC GGCGCTGCCG GCGCCGCCGC GGCCCGCCTG GGCCGAGCGG
GGCTATGCCC GCCTGTTCCA CGACACGGTC ACGCAGGCGG ACGAGGGCTG CGACTTCGAT
TTCATGCGGC CGGGCGGGGC TGCTCAGGGA AACTGA
 
Protein sequence
MSKGLRKGLT SYGDAGFSLF LRKAFIKAMG YSDDALNRPI VGITNTYSDY NPCHGNGPAL 
VEAAKRGVML AGAMPMVFPT ISIHESFAHP TSMFLRNLMA MDTEEMIRAQ PMDAVIVIGG
CDKTLPAQIM AAASVDLPTV VIPVGPMVVG HHKGEVLGAC TDCRRLWAAH RAGAIDEQEI
EMVNGRLAPS VGTCMVMGTA STMACLTEAM GLSLPLSATI PAPHAERVRL AEASGRRAAE
MAVAGGPRPS AILTPAAFRN AQTVLQAIGG STNGLIHLTA IARRVRAEID LDAFDAIGRA
VPVLVDLKPS GDHYMEHFHH AGGVPRLLAE LGDLIDLDVP TVAGEKLRDV VAAAEIVPGQ
TVIRSPADPI KPTGGLAVLR GNLAPRGALI KHAAASERLL QHTGRALVFE SIPEMAARID
DPDLDVSPDD VLVLCNAGPK GAPGMPEAGY LPIPKKLARQ GVKDMVRISD ARMSGTAFGT
IVLHVTPESA VGGPLALVRT GDRIRLDVAG RRIDLLVDEA ELARRAAALP APPRPAWAER
GYARLFHDTV TQADEGCDFD FMRPGGAAQG N