Gene A9601_16921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_16921 
SymbolhisD 
ID4718422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1434952 
End bp1436238 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content33% 
IMG OID640079418 
Producthistidinol dehydrogenase 
Protein accessionYP_001010082 
Protein GI123969224 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCA TAAATAATAA AGAAGATGCT ATCCAAGAAT TAAAAAGGAT TTCCACTCGA 
ACTAACTCAG AAAACAACCA CAAAATAAAT GCAATAGTCG AAGAAATTCT TCAAGAAGTA
AAAAATTATG GAGATACAGC AGTAAAAAAA TATACCAAAA AATTTGATGG TTTCAACCCT
GAACCTATGC AAGTAAGTTC GAACCAAATA AAAAATGCAT GGAATGAAAT TGATAACAAT
TTGAAGCGCT CTCTTGAGGT AGCTCATAAA AGAATACAAA AATTCCATGA AAAAGAAATT
CCTAAATCTT TTACAATAAA AGGTGAATAT GGTGATACAG TCCAAAGAAG ATGGCGACCA
GTTAAAAATG CAGGTATTTA TATTCCTGGA GGCAGAGCTG CTTATCCAAG CACTGTATTA
ATGAATGCAA TACCTGCGAA AGTAGCAGGA GTTAAGGAAA CTATAATGGT ATCTCCTGGG
AATAAAGAAG GAGAAATAAA CAAAACTGTT TTAGCAGCAG CTCACTTATC AGGAATCAAT
AAAGTATTTA GAATTGGAGG AGCTCAAGCA ATTGGTGCTT TAGCCTTTGG CACAAATCAA
ATCAATAAAG TTGATGTTAT TTCAGGTCCA GGGAATATTT ATGTTACAAC TGCGAAAAAA
CTTATTTATG GTTCTACAGG GATTGATTCT TTAGCTGGTC CAAGTGAAAT ATTAATCATT
GCAGATGAAA CAGCTCAAAG CACTTATATA GCGTCTGATC TACTAGCCCA AGCAGAACAT
GATCCTTTAG CTTCATCAAT CCTTCTAACT ACATCAAAAA ATCAGGCACA AGAAGTTTTA
GAAGAACTTT ATAAAAAAAT AGATGATCAT CCAAGAAAAG AAATTTGCAT GCAATCAATT
AAAAATTGGG GGTTAATTGT GATTTGTGAG AATTATGAAT CATGTGTTGA ACTAAGTAAT
AACTTTGCCC CTGAACACCT AGAAATTCTT ACTATAAATT CAAAAAAAAT TCTTGCAGAT
ATAGATAATG CAGGAGCGAT ATTTTTGGGG AAATGGACAC CAGAAGCTGT TGGAGATTAT
CTTGCTGGAC CAAATCATAC TTTACCCACA TCAGGAAATT CTAGATTCAG CGGTTCTTTG
GGGGTTGAAA CTTTTATGAA AAATACTTCA ATAATAGAAT TTAATGAAGA AAGTTTAAAA
ATCAATAGCC CGGATATTAT AAATCTTGCT AATAGTGAGG GCTTACACAG CCACGCTAAC
TCAGTACAAA TAAGATTTGA AGATTAG
 
Protein sequence
MKIINNKEDA IQELKRISTR TNSENNHKIN AIVEEILQEV KNYGDTAVKK YTKKFDGFNP 
EPMQVSSNQI KNAWNEIDNN LKRSLEVAHK RIQKFHEKEI PKSFTIKGEY GDTVQRRWRP
VKNAGIYIPG GRAAYPSTVL MNAIPAKVAG VKETIMVSPG NKEGEINKTV LAAAHLSGIN
KVFRIGGAQA IGALAFGTNQ INKVDVISGP GNIYVTTAKK LIYGSTGIDS LAGPSEILII
ADETAQSTYI ASDLLAQAEH DPLASSILLT TSKNQAQEVL EELYKKIDDH PRKEICMQSI
KNWGLIVICE NYESCVELSN NFAPEHLEIL TINSKKILAD IDNAGAIFLG KWTPEAVGDY
LAGPNHTLPT SGNSRFSGSL GVETFMKNTS IIEFNEESLK INSPDIINLA NSEGLHSHAN
SVQIRFED