Gene Mext_1984 details

Gene Information

Locus tag: Mext_1984
Symbol:
ID: 5831372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2215730
End bp: 2217025
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 71%
IMG OID: 641367785
Product: histidinol dehydrogenase
Protein accession: YP_001639454
Protein GI: 163851411
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
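
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no amino acid. The variable names are hypothetical and simply mirror the field labels.

```python
# Values copied from the Gene Information fields above.
start_bp = 2215730
end_bp = 2217025
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read in codons of three bases. Assumption: the last
# codon is a stop codon and is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp, codons, remainder, expected_protein_length)
# prints: 1296 432 0 431
```

Under these assumptions the span matches the reported gene length, and 432 codons minus one stop codon gives the reported 431 aa.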


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGTC TCGACAGCCG CTCTCCCGAT TTCGCGGAGG CGTTCAAGCG CCTGCTCGGC 
CTCAAGCGCG AGATCTCCGA GGATGTGGAC GAGACCGTGC GCGGCATCAT CGCGGGGGTC
GTCTCCGGCG GCGACGCGGC GCTCGTCGAT TACACCCGCC AGTTCGACCG GCTCGGCCAG
GATTTTGCGC CGGCCTCCCT GCGCATCACC GCCGAAGAGG TGGAGGCGGC GGTGGACGCC
TGCCCGGCCG AGGCTCGCGC TGCCCTGGCT CTCGCGGCCG AGCGGATCGA GGCCTATCAC
CGCCGCCAGA TCCCCGAGGA TCACCTCTCC ACCGACGATC TTGGTGTCAC CGCCGGTTGG
CGCTGGACTG CGATCGAATC GGTCGGGCTC TACGTGCCCG GCGGCACCGC GAGCTATCCC
TCCTCGGTGC TGATGAACGC GGTGCCGGCG CGCGTCGCGG GCGTGCCGCG CATCGTCATG
GTGGTGCCGA CCCCCGAGGG CCAGCTCAAC CCGCTGGTGC TCGCCGCGGC CAAACTCTCC
GGCGTCACCG AGATCTACCG GGTCGGCGGA GCGCAGGCGG TGGCCGCACT CGCCTACGGC
ACGGAAACCA TCGCGCCGGT GGCCAAGATC GTCGGCCCCG GCAATGCCTG GGTCGCGGCG
GCCAAGCGCC GGGTGTTTGG GCAGGTTGGC ATCGACATGA TCGCCGGCCC CTCCGAAGTG
CTGATCCTGG CCGACCGCCA CGCCAACCCC GACTGGATCG CCGCCGACCT GCTGGCCCAG
GCCGAGCACG ACACCGCGGC GCAGGCCGTG CTCGTCACCG ATTCGGACGA GTTGGCCGAC
GCGACCGAGG CCGCGGTCGA GCGCGCGCTC GCAACCCTGA AGCGGGCCGA GATCGCCCGC
GCCAGTTGGC GCGATTACGG CGCGATCATC CGCGTCCGGG ACTTCGATGA GGCAGTGACG
CTCGTGGACG CCATCGCGCC CGAACATCTC GAGATCGAGA CCGAGGACGC CGACGCGTTG
TCGCTGAAAA TCCGGAACGC GGGGGCGATC TTCCTCGGCG CGCACACGCC CGAAGCCATC
GGCGATTATG TCGGCGGCCC GAACCACGTG CTGCCGACCG CCCGCTCGGC GCGGTTCTCC
TCGGGGCTCG GGGTGCTCGA CTTCATGAAG CGCACCTCGA TCCTGCGCTG CGATCCGGCC
GCCTTGCGGG CGCTCGGGCC TGCCGCGATC GCTCTCGGCG AGTCCGAGGG TCTCGACGGG
CATGCCCGCT CCGTGTCGAT CCGGCTCAAT CTCTAG
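
The GC content reported in the gene information (71%) is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The helper name `gc_content` is hypothetical, and the function strips the spaces and line breaks used to group the sequence into blocks of ten bases. Only the first two blocks are used in the example, so the printed value is not the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    # Remove the spaces and newlines used for display grouping.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence above (20 bases, 12 of them G or C).
first_blocks = "ATGATCCGTC TCGACAGCCG"
print(f"{gc_content(first_blocks):.2f}")
# prints: 0.60
```

Passing the full gene sequence to the same function should give a value that can be compared with the reported 71%.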
 
Protein sequence
MIRLDSRSPD FAEAFKRLLG LKREISEDVD ETVRGIIAGV VSGGDAALVD YTRQFDRLGQ 
DFAPASLRIT AEEVEAAVDA CPAEARAALA LAAERIEAYH RRQIPEDHLS TDDLGVTAGW
RWTAIESVGL YVPGGTASYP SSVLMNAVPA RVAGVPRIVM VVPTPEGQLN PLVLAAAKLS
GVTEIYRVGG AQAVAALAYG TETIAPVAKI VGPGNAWVAA AKRRVFGQVG IDMIAGPSEV
LILADRHANP DWIAADLLAQ AEHDTAAQAV LVTDSDELAD ATEAAVERAL ATLKRAEIAR
ASWRDYGAII RVRDFDEAVT LVDAIAPEHL EIETEDADAL SLKIRNAGAI FLGAHTPEAI
GDYVGGPNHV LPTARSARFS SGLGVLDFMK RTSILRCDPA ALRALGPAAI ALGESEGLDG
HARSVSIRLN L
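
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a hedged illustration, not the database's own procedure. It builds the codon table from the usual compact TCAG ordering, relying on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA to protein, stopping at the first stop codon."""
    # Remove the spaces and newlines used for display grouping.
    bases = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGATCCGTC TCGACAGCCG CTCTCCCGAT"))
# prints: MIRLDSRSPD

# Last 36 bases of the gene sequence, which begin in frame at position 1261.
print(translate("CATGCCCGCT CCGTGTCGAT CCGGCTCAAT CTCTAG"))
# prints: HARSVSIRLNL
```

Both results match the start and end of the protein sequence listed above, and the final TAG codon is the stop that accounts for the one-codon difference between 432 codons and 431 amino acids.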