Gene Cpha266_0801 details

Gene Information

Locus tag: Cpha266_0801
Symbol: hisD
ID: 4570220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 912494
End bp: 913777
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 51%
IMG OID: 639765395
Product: histidinol dehydrogenase
Protein accession: YP_911276
Protein GI: 119356632
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
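
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 912494, End bp 913777.
length = gene_length(912494, 913777)
print(length, protein_length(length))
```

With the values listed here, this gives 1284 bp and 427 aa, which agrees with the Gene Length and Protein Length fields.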


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTAAAAA TATTCTGTTT TGCCCAGGAT GGTGAGGCTC TGAAAAAACA GTTGAGCCGT 
AGCGTTTCTT TTGATTCCGG AGTTCAGGAT GTCGTCAATG ATATTCTTGA ACAGGTTCGC
ACCCGGGGTG ACGCAGCACT TCTTGAGTAT ACCGAGCGCT TTCAGGGAGC CCGGCTTAAT
GAAATGATGG TTTCCGGCGA AGAGATCAGA AGTGCTTATG AGCAGGTGGA TTCCGGATTT
CTTGAGGTGA TGCATGAGGC CTATCGCAAT ATTACCCGAT TTCACCAGCA CGAGGTGGAA
AACAGTTTTT TTTATGAAGG CGCGGGGGGC GTAATTCTCG GCCAGAGGGT TACTCCGATG
CAGCGGGCAC TCCTGTATGT TCCCGGAGGG ATGGCCTCTT ATCCTTCATC GCTCTTGATG
AATGCTGCGC CTGCAAAGGT TGCCGGCGTG AGGGATATTG TTGTTACCAC CCCTTGCGAT
CCGGATGGAC GTGTGAATCC GCATATTCTT GCGGCTGCCG CAGTTGCCGG AATTGACTCT
GTTTACAAAC TTGGAGGGGC TCAGGCAATC GCCGCGTTTG CCTATGGTAC GGAGAGCATT
CCGAAAGTGG ATATTATTAC CGGACCTGGT AACAAGTATG TTGCGCTTGC AAAAAAACAG
GTGTTTGGAC ATGTCTCCAT CGACAGCATT GCCGGTCCGT CAGAAGTTGT TGTGATTGCC
GATGAAACAG CAAACCCGGA CTTTATCGTT CTTGATATGT TTGCACAGGC AGAGCATGAT
GCGGATGCTT CGGCGGTGCT TATTACTCCA TCCGAATCGC TTGCAGCAGC CGTTCGCGAG
ACTGCCCGCC GGAGGATCGG CTCGATGCTG CGCAAGGATA TCATCGGTTC AGCTCTTGAA
AAAAACGGCG CTATCGTGAT CGTTCAATCA ATAGAGGAGG CCTGTTTGGT GTCGGATATG
ATTGCGCCGG AACATCTCGA ACTGCATGTC GACCGTCCAT GGGATCTGCT GCCGAGCATC
AATCATGCCG GGGCAGTGTT CATAGGGAGT TACTCCTGTG AAACGGTAGG GGATTATTTC
GCAGGCCCGA ACCACACGCT TCCGACCAAC GGGACGGCCC GATTTTTTTC TCCGCTTTCT
GTTCGTGATT TTGTCAAGCA CACCTCAATT ATTTCCTACT CAAAAAAACA GCTTCAGGAA
TGTGGTAAAA AAATTGCCGC ATTTGCCGAT TATGAGGGTC TTCAGGCTCA TGCTGAGGCT
GTCCGCGTAA GACTTGATTC GTGA
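
A GC content figure such as the one reported for this gene can be recomputed from the sequence as displayed. The following is a minimal sketch, assuming GC content is simply the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name, and the spaces and line breaks used for display are stripped before counting.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First display line of the gene sequence, spacing included.
first_line = "GTGTTAAAAA TATTCTGTTT TGCCCAGGAT GGTGAGGCTC TGAAAAAACA GTTGAGCCGT"
print(f"{gc_content(first_line):.0%}")
```

Passing the whole gene sequence block instead of a single line gives the per-gene value to compare with the GC content field.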
 
Protein sequence
MLKIFCFAQD GEALKKQLSR SVSFDSGVQD VVNDILEQVR TRGDAALLEY TERFQGARLN 
EMMVSGEEIR SAYEQVDSGF LEVMHEAYRN ITRFHQHEVE NSFFYEGAGG VILGQRVTPM
QRALLYVPGG MASYPSSLLM NAAPAKVAGV RDIVVTTPCD PDGRVNPHIL AAAAVAGIDS
VYKLGGAQAI AAFAYGTESI PKVDIITGPG NKYVALAKKQ VFGHVSIDSI AGPSEVVVIA
DETANPDFIV LDMFAQAEHD ADASAVLITP SESLAAAVRE TARRRIGSML RKDIIGSALE
KNGAIVIVQS IEEACLVSDM IAPEHLELHV DRPWDLLPSI NHAGAVFIGS YSCETVGDYF
AGPNHTLPTN GTARFFSPLS VRDFVKHTSI ISYSKKQLQE CGKKIAAFAD YEGLQAHAEA
VRVRLDS
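
The protein begins with M although the gene begins with GTG. Under translation table 11, GTG is one of several codons that can act as an initiation codon, and an initiation codon is translated as methionine. The sketch below illustrates this with a hand-built codon table in the NCBI base order (TCAG); `translate_cds` is a hypothetical helper written for this illustration, not a library function.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        codon = bases[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine regardless of its
            # usual meaning.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate_cds("GTGTTAAAAA TATTCTGTTT TGCCCAGGAT"))  # MLKIFCFAQD
```

The same function applied to the last 24 bases of the gene sequence yields VRVRLDS and stops at the terminal TGA, matching the end of the protein sequence.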