Gene Syncc9902_0602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0602 
SymbolhisD 
ID3742644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp607811 
End bp609151 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content59% 
IMG OID637770773 
Producthistidinol dehydrogenase 
Protein accessionYP_376614 
Protein GI78184179 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGGGGAA TGAACCCAAA TCGTTCCTTG CCTGAGAAGA GTCCGGCGGG TTTTTCACTC 
CGCATCGTGC GAGATCCGGA GCAGGCCAAA GGAGAACTCC AGCGCCTCGT CCAACGGACG
GCTCATGCCC AACAACGAGA TGCACAGTCA CGTGTGGACA CGATTCTGTC CGAAGTCCGA
GCCCGGGGGG ATGCCGCAGT TGTCGAATTC ACAGAGCGCT TTGACGGCTT CCGTCCAAAT
CCGGTCGCTG TTCCCCAGGA GCAGCTCGAG CGTTCATGGC GCAACCTGCC GGCCAACCTT
CAGGACGCCC TAGAGCTCGC GCATCGGCGC ATCACCGACT TTCATCAACG TCAGCGGCCA
TCGGATATCG CCACCGAGGG GCCCTATGGA GAACGGCTTG GACGCCGCTG GAGGCCGGTG
GATCGAGCTG GTTTATACGT GCCAGGAGGT CGAGCCTCGT ATCCCAGCAC AGTCTTGATG
AATGCTGTAC CAGCGAAGGT TGCTGGGGTC AAAGAGGTTG TGATCTGTTC GCCGGCAGGT
CGCGACGGAA CGGTCAATCC GGTGGTTCTC GCGGCCTCGC ATCTCGCAGG GGTTCGAACC
GTCTTTCGCC TTGGTGGGGC CCAAGCGATC GCAGCTATGG CCTACGGAAC AAACAGCGTT
CCCAAGGTGG ACGTGATCAG CGGACCTGGA AATCTCTACG TCACCTTGGC CAAACAAGCT
GTCTATGGCC AAGTGGGCAT CGACTCCCTG GCGGGGCCTA GTGAAGTTTT AGTGATCGCT
GATCAAAGCG CCAAACCTGA TCAAGTTGCC GCAGACCTGT TAGCCCAAGC CGAGCATGAT
CCGCTCGCCG CTGCGGTTCT GATCACCACC AATCCAGCCT TGGCTGAGCA GATTCCCCAT
GAAATCGAGC AACAACTCGA GGGGCACCCG CGCCGCGAAA TCTGCGAGGC CTCAATCAGC
AACTGGGGGT TGGTGGTGGT CTGCGACGAC CTCGAAAGCT GCGCCGAGTT GAGCGACAGC
TTCGCTCCAG AGCATTTGGA ACTGTTGGTG GAACGTCCCC AGGCGTTGGC TGAACGGATC
CAACATGCCG GTGCCATTTT TCTGGGCCCT TGGTCACCGG AGGCGGTTGG TGATTATTTA
GCCGGTCCGA ATCACACCCT GCCCACCTGT GGCGCAGCTC GTTTTAGTGG AGCCCTAAGT
GTTGAAACAT TCATGCGTCA CACGTCTTTG ATCGGTTTTA ACCGCGCGGC CCTTGAAGCC
ACAGGATCAG CTGTTCAAGA ACTGGCGACG AGCGAAGGAC TCCATAGCCA TGCCGAATCC
GTAAGGCGTC GGCTCAACTA A
 
Protein sequence
MRGMNPNRSL PEKSPAGFSL RIVRDPEQAK GELQRLVQRT AHAQQRDAQS RVDTILSEVR 
ARGDAAVVEF TERFDGFRPN PVAVPQEQLE RSWRNLPANL QDALELAHRR ITDFHQRQRP
SDIATEGPYG ERLGRRWRPV DRAGLYVPGG RASYPSTVLM NAVPAKVAGV KEVVICSPAG
RDGTVNPVVL AASHLAGVRT VFRLGGAQAI AAMAYGTNSV PKVDVISGPG NLYVTLAKQA
VYGQVGIDSL AGPSEVLVIA DQSAKPDQVA ADLLAQAEHD PLAAAVLITT NPALAEQIPH
EIEQQLEGHP RREICEASIS NWGLVVVCDD LESCAELSDS FAPEHLELLV ERPQALAERI
QHAGAIFLGP WSPEAVGDYL AGPNHTLPTC GAARFSGALS VETFMRHTSL IGFNRAALEA
TGSAVQELAT SEGLHSHAES VRRRLN