Gene PCC8801_0205 details


Gene Information

Locus tag: PCC8801_0205
Symbol: hisD
ID: 7103541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 199991
End bp: 201295
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 47%
IMG OID: 643473319
Product: histidinol dehydrogenase
Protein accession: YP_002370465
Protein GI: 218245094
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
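
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(199991, 201295)
print(length)                     # 1305
print(protein_length_aa(length))  # 434
```

Both results agree with the Gene Length and Protein Length fields listed in the record.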


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAGAA TTATTAAACT TGCCCAACTT ACCCCCGCAG AACGCGCTAA ATTGCAACGA 
CGGGCAGAAC TTGATATCGA TCAGGTGTTA CCTATCGCCC AACAAGTGAT TACGGCGATC
GCCCAAAAGG GTGATGCAGG GGTAATTGAA TATGCGCGTA AATTTGATTA TCCTGGGGCA
ACGGCTACTA ATATTAAGGT AACAGAAGAA GAGTTTACTC AGGCAAGAGA ATTAGTCGAA
CCAGAGGTTA AACGCGCGGT TGAACAGGCG TTCCGCAACA TCAAAGCCGT TCACCAAGGG
CAAATGCCCC AACCCATCCA CCTAGCCGAA ATCGATAGCG GCATTTTCGC CGGAGAAAAG
ATCACTCCTA TCCCTAGGGT GGGGTTGTAT GTTCCTAGAG GGCGTGGGGC TTTTCCTTCG
ATTATGTTAA TGTTAACTAT CCCGGCCATG GTAGCCGGGG TAGAAAAAAT CGTCGTGTGT
ACGCCTCCTG ATAAAGAAGG GAGGGTTGAA CCCGTTTCTC TCTATGTGGC TGAAATGGCA
GGGGTTAAGG AAGTCTATAA ACTCGGTGGG GTTCAAGCTT TAGCGGCGAT CGCCCTCGGA
ACGGAAACTG TCCCTAAAGT AGACAAACTT ATTGGACCAT GTAGTGTCTA TGGGGCAGCC
GCTAAACGCC TTTTGATCGG TACGGTGGAT GTGGGACTCC CCGCCGGACC CAGTGAAGGC
ATTATCCTCG CCGATGAAAC GACTGATCCC CAGTTAGCAG CGTTGGATTT ATTGATTGAA
GCTGAACACG GGTCAGACTC AGCAGCGTTA TTAGTGACCC ACAGCGAAAC AGTTGCCCAA
AAAGCCAGTC AGTTTGCCTT AGAATACCTT GAAAAACTTC CCCAATGGCG CAAGCAATTT
TGTGAAGATG GATTGGCTGA CTATGGGGGC ATTATTTTAA CGTCTAGTTT ACAAGAATCT
ATTGATTTTG TTAATGACTA TGCCCCCGAA CATTTAGAAG TTTTAGTCGA AGATCCCCTC
AGTTTACTCG GAAAAATTAA CAATGCAGGG GAGATTTTAT TAGGAAAATA TACGCCATCT
TCGGCAGCTA CTTATGCGAT CGGGGTTAAT GCAGTATTAC CGACGGGAGG CTTTGCACGG
TCTTATTCGG CGGTATCAGT GTTTGATTTT CTCAAGCGAT CAACCGTCGC TTATTTAACG
TCTCAAGGCT TCGAGACTGT TAAACAAACA GCCAAAACTT TAGCCACTTA TGAAGAATTT
CCGGCTCATG GAATGGCCAT TACAGAACGA GATAAGTTAC TATAG
 
Protein sequence
MVRIIKLAQL TPAERAKLQR RAELDIDQVL PIAQQVITAI AQKGDAGVIE YARKFDYPGA 
TATNIKVTEE EFTQARELVE PEVKRAVEQA FRNIKAVHQG QMPQPIHLAE IDSGIFAGEK
ITPIPRVGLY VPRGRGAFPS IMLMLTIPAM VAGVEKIVVC TPPDKEGRVE PVSLYVAEMA
GVKEVYKLGG VQALAAIALG TETVPKVDKL IGPCSVYGAA AKRLLIGTVD VGLPAGPSEG
IILADETTDP QLAALDLLIE AEHGSDSAAL LVTHSETVAQ KASQFALEYL EKLPQWRKQF
CEDGLADYGG IILTSSLQES IDFVNDYAPE HLEVLVEDPL SLLGKINNAG EILLGKYTPS
SAATYAIGVN AVLPTGGFAR SYSAVSVFDF LKRSTVAYLT SQGFETVKQT AKTLATYEEF
PAHGMAITER DKLL
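
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted in the space-separated blocks shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (for example GTG or TTG) are not converted
    to M here; this gene starts with ATG, so that case does not arise.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record above.
first_row = "ATGGTCAGAA TTATTAAACT TGCCCAACTT ACCCCCGCAG AACGCGCTAA ATTGCAACGA"
print(translate(first_row))  # MVRIIKLAQLTPAERAKLQR
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.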