Gene Mnod_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1036 
Symbol 
ID7302580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp1107162 
End bp1108826 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content71% 
IMG OID643598785 
Producturocanate hydratase 
Protein accessionYP_002496347 
Protein GI220921046 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0275392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGCC TCGACAATGC CCGCATCGTC CGCGCTCCCC GCGGCCCCGC CCTCACGGCC 
AAGAGCTGGC TCACCGAGGC GCCCCTGCGG ATGCTGATGA ACAACCTCGA TCCGGATGTC
GCGGAGCGGC CGGGCGACCT CGTCGTCTAT GGCGGCATCG GCCGGGCGGC GCGGGACTGG
GCCAGCTTCG ACCGGATCGT CGCCGCGCTC AAGGACCTCG ACGAGGACCA GACGCTCCTC
GTGCAGTCGG GTAAGCCGGT CGGAATCTTC CGCACCCATC CGGACGCGCC GCGGGTGCTG
ATCGCCAATT CCAACCTCGT GCCGCACTGG GCGACCTGGG CCCATTTCCA CGAGCTCGAT
CGCAAGGGCC TGATGATGTA CGGCCAGATG ACGGCCGGCT CCTGGATCTA CATCGGCAGC
CAGGGCATCG TGCAGGGCAC CTACGAGACC TTCGTGGAGA TGGGCCGCCA GCATTACGGC
GGCGACCTCG CGGGGCGCTG GATCCTGACC GCGGGCCTCG GGGGCATGGG CGGCGCGCAG
CCGCTCGCCG CCACCATGGC CGGGGCCTCC TGCCTCGCCG TCGAGTGCCG GGCATCGAGC
ATCGAGTTCC GCCTGCGCAC GGGCTATGTC GACGTCCAGG CCCGCGACCT CGACGAGGCG
CTCGCCCTGA TCGACGAATC CTGCCGGGCG CGGGTGCCCC GCTCCGTGGC GCTCATCGGC
AACGCCGCCG AGGTCTTCGC CGAGATCCAG CGCCGTGGCG TGCGGCCGGA TTGCGTCACC
GACCAGACCT CCGCGCACGA CCCCGTCAAT GGCTACCTGC CCCGGGGCTG GAGCATCGCC
GAGTGGGAGG CGCGGCGCGA GAGCGACCCG GACGGGGTCG CGGCGGCCGC CAAGCGCTCC
ATGGCCGAGC AGGTGCGGGT GATGCTGGCC TTCCACCGGG CCGGCGTGCC GACCGTCGAT
TACGGCAACA ACATCCGGCA GATGGCGCTG GAGGAAGGGG TGGCGGACGC CTTCGCCTTC
CCGGGCTTCG TGCCGGCCTA TATCCGCCCG CTCTTCTGCC GCGGCGTCGG GCCGTTCCGC
TGGTGCGCCC TCTCGGGCGA TCCGGAGGAC ATTTACCGCA CCGACGCCAA GGTGAAGCAG
CTTCTGCCCG ACAATGCCCA CCTGCACCGC TGGCTCGACA TGGCCCGGGA CAAGATCCGG
TTCCAGGGCC TGCCGGCGCG GATCTGCTGG GTGGGCCTGG GCGACCGCCA CCGGCTCGGC
CTTGCCTTCA ACGCCATGGT GCGCAGCGGG GAGCTCAAGG CGCCGATCGT GATCGGGCGC
GACCACCTCG ATTCCGGCTC CGTCGCCTCC CCCAACCGGG AGACGGAGGC GATGCGCGAC
GGCTCGGACG CGGTCTCGGA CTGGCCGCTT CTCAACGCCC TCCTCAACAC CGCCTCGGGC
GCCACCTGGG TGTCGCTCCA CCACGGCGGC GGGGTCGGGA TGGGCTTCTC GCAGCATGCC
GGCATGGTGA TCGTCTGCGA CGGCAGCGAG GCCGCGGACC GGCGCCTGGA GCGGGTGCTG
TGGAACGATC CGGCCACGGG CGTGATGCGC CACGCCGATG CCGGCTACCC GGAGGCGATC
GCCTGTGCGC GGGAACAGGG GTTGGTCCTG CCGAGCCTGG GCTAG
 
Protein sequence
MTRLDNARIV RAPRGPALTA KSWLTEAPLR MLMNNLDPDV AERPGDLVVY GGIGRAARDW 
ASFDRIVAAL KDLDEDQTLL VQSGKPVGIF RTHPDAPRVL IANSNLVPHW ATWAHFHELD
RKGLMMYGQM TAGSWIYIGS QGIVQGTYET FVEMGRQHYG GDLAGRWILT AGLGGMGGAQ
PLAATMAGAS CLAVECRASS IEFRLRTGYV DVQARDLDEA LALIDESCRA RVPRSVALIG
NAAEVFAEIQ RRGVRPDCVT DQTSAHDPVN GYLPRGWSIA EWEARRESDP DGVAAAAKRS
MAEQVRVMLA FHRAGVPTVD YGNNIRQMAL EEGVADAFAF PGFVPAYIRP LFCRGVGPFR
WCALSGDPED IYRTDAKVKQ LLPDNAHLHR WLDMARDKIR FQGLPARICW VGLGDRHRLG
LAFNAMVRSG ELKAPIVIGR DHLDSGSVAS PNRETEAMRD GSDAVSDWPL LNALLNTASG
ATWVSLHHGG GVGMGFSQHA GMVIVCDGSE AADRRLERVL WNDPATGVMR HADAGYPEAI
ACAREQGLVL PSLG