Gene P9211_16101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_16101 
SymbolhisD 
ID5731277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1441389 
End bp1442660 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content40% 
IMG OID641285988 
Producthistidinol dehydrogenase 
Protein accessionYP_001551495 
Protein GI159904151 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCC TGGAAAGGAT ATCCAATAGA ACTTGCGGAG ACGGTCAAAA AAAGGCTGTT 
CTTACAGTAG AAAAAATCCT TGAACGAGTA AAAAAGGATG GGGATACAGC TCTGATTGAA
TACACTAAAA AATTTGATGG ATTTGACCCT GATCCCCTTG AGGTTCCTTT AGAAGCAATA
GAAAGAGCAT GGGAAGAAAC TCCAAAGCCA CTTCAAGATG CTCTAATTAC TGCAAAACAT
AGAATCCAAG ACTTCCACCA AAAGCAAATT CCAAAAAATA TTCTTTTTAA AGGGATTGAG
GGTGAGACAT TAGGTAGGCG ATGGCAACCT GTTCAAAAAG CAGGCATCTA TATTCCTGGA
GGAAGAGCTT CTTATCCCAG CACAGTTCTT ATGAACGCAA TTCCTGCATC TGTAGCGGGA
GTAAAAGAAA TTATTATGGT TTCTCCTGGA GGTTCTAATG GTCTTGTGAA TCAAACTGTA
CTCGCTGCGG CTTATATCAC AGGAATAAAA ACAGTTTTTA GGATAGGCGG GGCTCAAGCC
ATTGGAGCAA TGGCATATGG AACAAACACA ATTCCAAAAG TTAATGTAAT AAGTGGTCCC
GGCAATTTAT ATGTCACTTT GGCGAAGAAA TACGTTTATG GAGATGTTGG AATTGATGCT
CTTGCCGGTC CCAGTGAAGT ACTAATTATT GCCGATAATA GTGCCGATGT TCGTCATATT
GCTGCTGATT TACTTGCACA AGCAGAGCAT GATCCATTGG CAGCGACAAT CTTGCTAACA
ACTAACTCCG TTCTTGCTGA AAAGATTGAT GATGAAATTA TGGAGCAGTT AGAAGAGCAT
CCACGAAAAG AAATATGTCT TAAGGCACTT AAAGACTGGG GCCTAATTGT AATTTGCAAT
GACTTAGAAA CTTGTGCAAA GTTAACAGAT TACTTTGCAC CTGAGCATCT AGAATTATTA
CTAAAAATGC CATATCAGGT GGCGGATAAA ATCAATAATG CTGGTGCAAT TTTCATAGGC
GCTTGGAGCC CAGAAGCTAC TGGAGACTAT CTTGCTGGTC CAAACCACAC ATTACCAACA
TCAGGAACTG CAAGATTTAG TGGGGCTTTA GGGGTGGAGA CTTTCATGAA GAACACTTCA
ATTATTGAAT TTAATAAACA AGCTTTCGAT AAGAATAGTA AGGCAATTAT TGAGCTTGCG
AATAGCGAAG GACTTCATAG TCATGCGAAG TCAATAGAGA TTAGACTCTC TAAATCTTCT
GAAGAGATTT AA
 
Protein sequence
MTALERISNR TCGDGQKKAV LTVEKILERV KKDGDTALIE YTKKFDGFDP DPLEVPLEAI 
ERAWEETPKP LQDALITAKH RIQDFHQKQI PKNILFKGIE GETLGRRWQP VQKAGIYIPG
GRASYPSTVL MNAIPASVAG VKEIIMVSPG GSNGLVNQTV LAAAYITGIK TVFRIGGAQA
IGAMAYGTNT IPKVNVISGP GNLYVTLAKK YVYGDVGIDA LAGPSEVLII ADNSADVRHI
AADLLAQAEH DPLAATILLT TNSVLAEKID DEIMEQLEEH PRKEICLKAL KDWGLIVICN
DLETCAKLTD YFAPEHLELL LKMPYQVADK INNAGAIFIG AWSPEATGDY LAGPNHTLPT
SGTARFSGAL GVETFMKNTS IIEFNKQAFD KNSKAIIELA NSEGLHSHAK SIEIRLSKSS
EEI