Gene MCA1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1963 
SymbolhisD 
ID3102435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2115248 
End bp2116558 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content65% 
IMG OID637171118 
Producthistidinol dehydrogenase 
Protein accessionYP_114396 
Protein GI53803969 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.318488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAG TAAAAATCAA ACGGCTTTAC ACCGGCGATG CGGACTTTGC ATCGCAACTG 
GACAGGCTGC TTGCCTGGAG CGAAAGCGAG GACACCGACA TCCACCAGCG CGTGACCGAG
ATCATCGGCT GCATCCGCCG CGATGGCGAT GCGGCCCTGG TGGAGCTCAC GGCCCGTTTC
GACCATTTCG TCGTGGATAC CGCTGCGGCG CTCGAGCTGC CGCGTGACGT GCTGGAAGCG
GCCTGGCAGG CGCTGCCCGC CGAACAAGCC AAAGCCCTGC GGGAAGCGGC GGAGCGCATC
CGGGCCTACG CCGAGCGGCA AAAGCTCGAT TCCTGGGACT ACCGTGAAGC CGACGGCACT
TTGCTGGGAC AGAAGATCAC GCCGCTCGAC CGGGTCGGCC TGTATGTACC CGGTGGCAAG
GCCGCATATC CTTCCTCGGT ACTGATGAAT GCGGTTCCTG CCAAGGTGGC GGGCGTGCCG
GAACTCATCA TGGCGGTGCC GGCTCCGCGG GGAGAGCTGA ACGCCCTGGT GCTGGCTGCC
GCCTATATTT CCGGAGTGGA CCGGGTTTTC CGCATCGGTG GCGCACAGGC CGTCGCCGCC
CTGGCTTATG GGACGGAAAC GGTGCCGCGG GTCGACAAGA TCGTCGGCCC CGGTAACATC
TATGTGGCGA CCGCCAAAAA GCTGGTGTTC GGCCAAGTCG GGATCGACAT GGTCGCAGGC
CCCTCGGAGA TCCTGGTGAT CTCGGACGGA CGGACCGACC CGGACTGGAT CGCCATGGAT
CTGTTTTCGC AAGCCGAGCA TGACGAGGAC GCCCAGGCGA TCCTGATCAG CCCGGATGCA
GCCCATCTGG AGGCGGTACA GGCAAGCATC GAGCGGCTGT TGCCCGGCAT GGAGCGCGCC
GAGGTCATCC GCACCTCGCT GGAGCGGCGC GGCGGCATGA TCCTGGTCGA TGATCTGGAG
CAGGCGGCGG CGGTCGCCAA TCGCATCGCG CCGGAACATC TGGAGCTTTC GGTGGAGAGC
CCGGAGGTCC TGGTGGAGTC GATCCGCAAT GCCGGGGCCA TCTTCATGGG GCGCTATACC
GCGGAAGCGC TCGGCGATTA CTGTGCCGGT CCCAACCACG TCCTGCCGAC TTCGGGCACG
GCGCGCTTCT CGTCGCCGCT GGGCGTCTAT GATTTCCAGA AGCGTTCCAG CCTGATCTAC
TGTTCGCCAG ACGGCGCAGA CCAACTGGGC CGTACCGCTT CGCTGCTGGC CTGGGGCGAA
GGGCTGGGGG CGCATGCCCG TTCGGCCGAA TATCGGATCA GGCACCATTA A
 
Protein sequence
MTEVKIKRLY TGDADFASQL DRLLAWSESE DTDIHQRVTE IIGCIRRDGD AALVELTARF 
DHFVVDTAAA LELPRDVLEA AWQALPAEQA KALREAAERI RAYAERQKLD SWDYREADGT
LLGQKITPLD RVGLYVPGGK AAYPSSVLMN AVPAKVAGVP ELIMAVPAPR GELNALVLAA
AYISGVDRVF RIGGAQAVAA LAYGTETVPR VDKIVGPGNI YVATAKKLVF GQVGIDMVAG
PSEILVISDG RTDPDWIAMD LFSQAEHDED AQAILISPDA AHLEAVQASI ERLLPGMERA
EVIRTSLERR GGMILVDDLE QAAAVANRIA PEHLELSVES PEVLVESIRN AGAIFMGRYT
AEALGDYCAG PNHVLPTSGT ARFSSPLGVY DFQKRSSLIY CSPDGADQLG RTASLLAWGE
GLGAHARSAE YRIRHH