Gene P9301_16791 details


Gene Information

Locus tag: P9301_16791
Symbol: hisD
ID: 4912697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1410728
End bp: 1412014
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 33%
IMG OID: 640161276
Product: histidinol dehydrogenase
Protein accession: YP_001091903
Protein GI: 126697017
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
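
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, but both agree with its numbers. The function names are hypothetical helpers for illustration.

```python
# Values copied from the record above.
START_BP = 1410728
END_BP = 1412014


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1287
print(protein_length(length))  # 428
```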


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.557567
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAA TAAATAATAA AACAGAAGCT ATTGAAGAAT TGAAGAGGAT TTCTACCCGA 
ACTAATTCAG AAAATAATAA TAAGATAAAT AAAATTGTTG AAGAAATTCT TCAAGAGGTA
AAAATTTCTG GAGATAACGC AGTTGAAAAA TATACCAAAA AATTTGATGG TTTCGATCCT
AACCCTATGC AAGTGAATGC GGATCAAATA AAGAATGCAT GGGATGAAAT TGATAACAAT
TTGAAACGCT CACTTGAGGA GGCTCACAAA AGAATTCAAA AATTCCATGA AAAAGAAATT
CCTCAATCTT TCACGATAAA AGGTGAATAT GGTGATATTG TCCAAAGAAG ATGGAAACCA
GTTAAAAATG CAGGTATTTA TATTCCTGGA GGAAGAGCTG CTTATCCAAG TACTGTATTA
ATGAATGCAA TTCCTGCAAA AGTAGCAGGA GTAGCTGAAA TTATAATGGT ATCTCCTGGA
AATAAAGAAG GGGAAATAAA CAAAACTGTT TTAGCTGCAG CTCACTTATC AGGTATCAAT
AAAGTATTTA GAATTGGAGG AGCTCAAGCG ATTGGTGCAT TAGCATTTGG CACAAATCAA
ATCAACAAAG TTGATGTTAT ATCAGGTCCA GGGAATATAT ATGTTACAAC TGCAAAAAAA
CTAATTTATG GCTCAACAGG AATTGATTCA TTAGCTGGTC CAAGTGAAAT ATTAATCATT
GCAGATGAAA CAGCTCAAAG CACTCATATA GCATCTGATC TACTAGCGCA AGCAGAACAT
GATCCTTTGG CTTCATCAAT ACTACTAACT ACATCAAAGA ACCAGGCAAA AGAAGTTTTA
GAAGAACTTT ATAAAAAAAT TGAGGACCAT CCAAGAAAAG AAATTTGCAT GCAATCAATC
AACAATTGGG GTTCAATTGT GATTTGCGAG AATTCTGAAC AATGTGTTGA ACTAAGCAAT
AACTTTGCCC CTGAACACCT AGAAATTCTT ACTGTAGATC CAAAAAAAAT TCTTGCAGGT
ATAGAGAATG CAGGAGCAAT TTTTTTAGGA AAATGGACTC CAGAAGCTGT TGGAGATTAT
CTTGCTGGAC CAAATCATAC TTTACCCACA TCAGGTAATT CTAGATTTAG CGGTTCTTTA
GGGGTTGAAA CTTTTATGAA AAATTCTTCA ATAATAGAAT TTAATGAAGA AAGTTTAAAA
GTTAATAGCC TTGATATTAT TAATCTAGCT AAAAGTGAGG GCTTGCATAG TCACGCTAAC
TCAGTACAAA TAAGATTTGA AGATTAG
 
Protein sequence
MKIINNKTEA IEELKRISTR TNSENNNKIN KIVEEILQEV KISGDNAVEK YTKKFDGFDP 
NPMQVNADQI KNAWDEIDNN LKRSLEEAHK RIQKFHEKEI PQSFTIKGEY GDIVQRRWKP
VKNAGIYIPG GRAAYPSTVL MNAIPAKVAG VAEIIMVSPG NKEGEINKTV LAAAHLSGIN
KVFRIGGAQA IGALAFGTNQ INKVDVISGP GNIYVTTAKK LIYGSTGIDS LAGPSEILII
ADETAQSTHI ASDLLAQAEH DPLASSILLT TSKNQAKEVL EELYKKIEDH PRKEICMQSI
NNWGSIVICE NSEQCVELSN NFAPEHLEIL TVDPKKILAG IENAGAIFLG KWTPEAVGDY
LAGPNHTLPT SGNSRFSGSL GVETFMKNSS IIEFNEESLK VNSLDIINLA KSEGLHSHAN
SVQIRFED
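
The relationship between the gene sequence and the protein sequence can be checked with a short script. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons), strips the 10-base spacing used in the listing, and translates until the first stop codon. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order.
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listing above.
first_row = "ATGAAGATAA TAAATAATAA AACAGAAGCT ATTGAAGAAT TGAAGAGGAT TTCTACCCGA"
print(translate(first_row))  # MKIINNKTEAIEELKRISTR
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows comparison with the listed protein sequence and the GC content field.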