Gene Mnod_5839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_5839 
Symbol 
ID7303256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp5939462 
End bp5940469 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content75% 
IMG OID643603457 
Product4-hydroxythreonine-4-phosphate dehydrogenase 
Protein accessionYP_002500970 
Protein GI220925668 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1995] Pyridoxal phosphate biosynthesis protein 
TIGRFAM ID[TIGR00557] 4-hydroxythreonine-4-phosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACA GCCTCGCCCT GACGCAGGGC GACCCGGCCG GGATCGGCCT CGAGATCACG 
CTCAAGGCCT GGGCGGCCCG GGAGGCGGCA GGCCTCGCGC CCTTCTTCCT GATCGGCGAT
CCCGATCTCG TCGCGGAGCG CGCCGCGCGG CTCGGGCTCG CGGTGCGGAT AGCGCGCGTC
GACCCGGAGG GGGCCGCGGC GGCCTTTCCC GGTGCGCTTC CGGTCGTGCC CCTGCCGGAC
CGGGTGCGGG CCGAGCCCGG CCGTCCGGAT CCCGGAACCG CCGCCGCGAC GCTCGCCTCG
ATCGAGACGG CGGTGCGCTT CGTGCGCGAG GGCCGGGCCG CTTCCCTGGT CACGAATCCG
ATCGCCAAGC ACGTGCTGTA TGCGGCGGGC TTCCGCCATC CGGGCCACAC CGAATATCTG
GCGGCGCTCG CGGCCGGTCC CGACGGTGCG GTGCCGCGCC CGGTGATGCT GCTGTGGTCG
GAGCTGCTCG CCGTGGTGCC GCTGACCATT CACGTGCCGC TGCGGCGCGT CCCCGACCTT
CTGACCCCGG ATCTCGTCAT CGAGACCGCC CGCATCGTCG ACCGGGACCT GCGGGCGCGC
TTCGGCCGCC TGAGCCCGCG TCTCGTGCTC GCGGGCCTCA ACCCGCATGC GGGCGAGGAG
GGCAGCATCG GGACGGAGGA CCGCGACGTG CTGGCCCCCG CGGTGGCGCG GCTGCGGGAC
GAGGGGATCG ACATCCGCGG CCCGCTTCCT GCGGACACGC TCTTCCACGC ACGGGCGCGG
GCGGCCTACG ACGTGGCGCT CGCCCCGACC CACGACCAGG CGCTGATCCC GATCAAGACG
CTCGCCTTCG ACGAGGGCGT GAACGTCACG CTCGGCCTGC CCTTCCTGCG CACCTCGCCC
GACCACGGCA CCGCCTTCGA CATTGCCGGG CAGGGGATCG CCAAGCCCGA CAGCCTGATC
GCGGCGCTCC GGCTCGCCGG GCGGCTCACG GCCGCACGGC CCGCATGA
 
Protein sequence
MSHSLALTQG DPAGIGLEIT LKAWAAREAA GLAPFFLIGD PDLVAERAAR LGLAVRIARV 
DPEGAAAAFP GALPVVPLPD RVRAEPGRPD PGTAAATLAS IETAVRFVRE GRAASLVTNP
IAKHVLYAAG FRHPGHTEYL AALAAGPDGA VPRPVMLLWS ELLAVVPLTI HVPLRRVPDL
LTPDLVIETA RIVDRDLRAR FGRLSPRLVL AGLNPHAGEE GSIGTEDRDV LAPAVARLRD
EGIDIRGPLP ADTLFHARAR AAYDVALAPT HDQALIPIKT LAFDEGVNVT LGLPFLRTSP
DHGTAFDIAG QGIAKPDSLI AALRLAGRLT AARPA