Gene Cag_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1289 
SymbolhisD 
ID3747438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1754009 
End bp1755295 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content49% 
IMG OID637773827 
Producthistidinol dehydrogenase 
Protein accessionYP_379593 
Protein GI78189255 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTACCAC TTTATCGTTT TCCGCAAGAG GCTTCGGCGT TACAAGAGCG TTTAGTGCGC 
CATGTTTCTT TTGATGAGGC TGCCCATAAG GCGGTTGATG AAATTCTTGC GAAGGTGCGC
CAACAGGGCG ACCGTGCGGT GCTTAATTAT ACTGAGCAGT TTCAAGGTGT GCGTTTAACT
TCCATGCAGG TTGATGAAGA GGCTATTGAG ATGGCGTATC GCCATGCTGA TCCTTCGCTG
ATTGCTACGT TGCATGAAGC CTATGCTAAT ATTGTGCGTT TTCATGAGCA TGAGGTGGAG
CGTAGCTTTT TTTATGAAGC TGAAGGTGGC GTGTTGCTTG GGCAAAGGGT TCGCCCTATG
GAACGTGCAA TGCTCTATGT TCCAGGCGGC AAAGCTGCAT ATCCCTCTTC GCTTTTGATG
AATGCTGCAC CTGCAAAAGT GGCTGGTGTT TGTGAGATTG CTGTAACCAC GCCATGCGAT
GCAACGGGGG TGGTTAATCC AACAATTTTA GCAGCAGCAA AGGTTGCGGG TATTTCTTCC
ATTTATAAGA TAGGAGGAGC GCAAGCTGTT GCGGCGTTTG CGTATGGCAC CGAGTCCATT
CCAAAGGTTG ATATTATCAC CGGTCCAGGT AACAAGTATG TAGCGCTTGC TAAAAAGCAG
GTTTTTGGTC ATGTTGCTAT TGATAGCATT GCTGGTCCAT CTGAAGTGGT AATTATTGCT
GACGAGTCGG CACATGCGGA GTTTGTGGCG TTAGATATGT TTGCGCAAGC TGAACACGAC
CCTGATGCTT CGGCGGTTTT AATTACTACC TCCGAAAGCT TTGCTCAAGC GGTGCAGCAA
GCGGTGGCTT CGCTGTTACC CACTATGCTT CGCCACGAAA CCATTGCAAG CTCTTTGTTG
CACAATGGCG CAATGGTGTT AGTGCCATCT TTGGACGATG CGTGCGCTGT TTCCGATATG
TTAGCGCCCG AACATCTTGA ATTGCATGTA GTGCAACCGT GGGATATTCT CCCTAAGCTC
AAACATGCAG GCGCAATTTT TATGGGGAGT TATTCATGCG AAACTATTGG CGACTACTTT
GCAGGACCTA ATCACACCTT GCCAACAAGC GGCACGGCGC GCTTTTTCTC GCCGCTGTCG
GTGCGCGATT TTGTAAAGCA TACCTCCATT ATTTCCTACT CGCCTGAGCA GTTGCGCTCT
AAAGGGGCAC AAATTGCCGC GTTTGCTGAT GCTGAAGGAT TGCAAGCTCA TGCTGAAGCG
GTTCGTGTGC GCTTAAAAAC CTTATAG
 
Protein sequence
MLPLYRFPQE ASALQERLVR HVSFDEAAHK AVDEILAKVR QQGDRAVLNY TEQFQGVRLT 
SMQVDEEAIE MAYRHADPSL IATLHEAYAN IVRFHEHEVE RSFFYEAEGG VLLGQRVRPM
ERAMLYVPGG KAAYPSSLLM NAAPAKVAGV CEIAVTTPCD ATGVVNPTIL AAAKVAGISS
IYKIGGAQAV AAFAYGTESI PKVDIITGPG NKYVALAKKQ VFGHVAIDSI AGPSEVVIIA
DESAHAEFVA LDMFAQAEHD PDASAVLITT SESFAQAVQQ AVASLLPTML RHETIASSLL
HNGAMVLVPS LDDACAVSDM LAPEHLELHV VQPWDILPKL KHAGAIFMGS YSCETIGDYF
AGPNHTLPTS GTARFFSPLS VRDFVKHTSI ISYSPEQLRS KGAQIAAFAD AEGLQAHAEA
VRVRLKTL