Gene P9515_16681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_16681 
SymbolhisD 
ID4720471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1462212 
End bp1463498 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content35% 
IMG OID640081360 
Producthistidinol dehydrogenase 
Protein accessionYP_001011982 
Protein GI123966901 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTA TAAATAATAA ACAAAAGGCT TTGGAAGAGT TGAAAAGAAT TTCTCAAAGA 
ACTACATCCG GCGACAATAG AAAAATAAAT TCATTAGTTG AAGAAATTCT TGAAGAGGTG
AAAATTAATG GAGATAAAGC TGTAGAAAAA TATACAAAAA AGTTTGATGG ATTCCATCCT
AAACCGATGC AAGTTAGTGC TATGGATTTA AAGTCTGCTT GGGATGAGAC AGACCATCAT
TTAAAAAAAT CATTAGAAAT TGCCTACCAA AGAATTCAAA AATTCCATGA AAAAGAAATA
CCTGAATCAT TCACTATAAA AGGAGAATAT GGTGACTCCG TTCAAAGAAA ATGGATGCCT
GTAAAAAATG CTGGCTTATA CATTCCGGGG GGACGAGCCG CATATCCAAG TACTGTTTTA
ATGAATGCGA TACCTGCAAA AGTTGCTGGA GTGAAAGAAA TTTCAATGGT ATCTCCCGGA
AATCAAGAAG GAAAAATAAA TAAAACTGTT TTAGCTGCTG CCTACTTATC AGGGATTGAT
AAAGTTTTTA GAATTGGAGG AGCACAGGCA ATTGGCGCCT TAGCCTTTGG CACAAAGCAA
ATAAATAGAG TTGATGTTAT TTCTGGACCA GGAAATATCT ATGTTACTGC TGCTAAAAAG
TTAATTTATG GATTGACTGG AATTGATTCT TTAGCAGGCC CAAGTGAAAT TTTAATCATC
GCGGATAGAA CAGCTAAAAG TTCTCAAATA GCATCAGATT TATTAGCCCA AGCAGAGCAT
GATCCATTAG CCTCCTCAAT ACTTTTGACT ACATCAAAGG GTCAAGCTCA AGAAGTCTTT
GATGAGGTCT TTAAGCAATT AGAGCATCAC CCCAGAAAAG AAATTTGTAT TCAATCAATT
GAAAATTGGG GATTAATTGC TATTTGCGAA AATTTAGAAT CATGCATTGA ACTCAGCAAT
GAGTTTGCCC CAGAACATCT AGAGATTATT ACTATTGATC CAAAAAAGAT ACTTAAAACT
ATCGAAAATG CAGGCGCGAT TTTTTTAGGA AATTGGACAC CAGAAGCTGT AGGTGATTAT
TTGGCTGGAC CAAACCATAC TTTACCCACA TGCGGTAATG CCAGGTTTAG TGGATCATTA
GGTGTTGAAA CTTTCATGAA GAATTCATCA ATAATTGAAT TTAATGAAAA AAGTTTAAAA
ATCAATAGCT CAGATATTAT AAACTTGGCA AACAGCGAAG GTCTCCATAG TCATGCGAAT
TCAGTAAAAA TTAGATTTGA GGATTAA
 
Protein sequence
MKIINNKQKA LEELKRISQR TTSGDNRKIN SLVEEILEEV KINGDKAVEK YTKKFDGFHP 
KPMQVSAMDL KSAWDETDHH LKKSLEIAYQ RIQKFHEKEI PESFTIKGEY GDSVQRKWMP
VKNAGLYIPG GRAAYPSTVL MNAIPAKVAG VKEISMVSPG NQEGKINKTV LAAAYLSGID
KVFRIGGAQA IGALAFGTKQ INRVDVISGP GNIYVTAAKK LIYGLTGIDS LAGPSEILII
ADRTAKSSQI ASDLLAQAEH DPLASSILLT TSKGQAQEVF DEVFKQLEHH PRKEICIQSI
ENWGLIAICE NLESCIELSN EFAPEHLEII TIDPKKILKT IENAGAIFLG NWTPEAVGDY
LAGPNHTLPT CGNARFSGSL GVETFMKNSS IIEFNEKSLK INSSDIINLA NSEGLHSHAN
SVKIRFED