Gene Sare_3422 details


Gene Information

Locus tag: Sare_3422
Symbol: hisD
ID: 5704031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3948474
End bp: 3949796
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 72%
IMG OID: 641272849
Product: histidinol dehydrogenase
Protein accession: YP_001538215
Protein GI: 159038962
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
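
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 3948474
end_bp = 3949796
reported_gene_length = 1323     # bp
reported_protein_length = 440   # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length counts the terminal stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions both comparisons hold: 1323 bp gives 441 codons, one of which is the stop codon, leaving 440 amino acids.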


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.025652
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGAACC GGATCGACCT TCGTGGCGGG GTTCGTGACC CGCGTCGCCT GCTGCCCCGT 
GCCCGGCTCG ATGTCTCCGC GGCCGTCGAG CGGATCCGTC CGCTCGTGGC GGAGGTGCGG
GAGCATGGCT ATCCGGCGAT CCGGGCGGCG AGCGAACGTT TCGACGGGGT GTCCCCGGCG
GTGCTGCGGG TGCCGGCCGA GATGGTCGCC GAGGCCGAGG GGACGCTCGA TCCGCAGGTC
CGTGCCGCGT TGGTGGAGTC GATCGACCGG GCCCGCCGGG TGCACGCCGC CCAGCGCCGA
AGCGACCACA CCACGCAGGT CGTGCCGGGC GGCACGGTCA CCGAGCGCTG GTTGCCGGTC
GACCGGGTCG GCCTCTACGT GCCCGGCGGT CTGGCGATGT ACCCGTCGAC GGTGGTGATG
AACGTGGTGC CCGCGCAGGA GGCCGGGGTG CGTTCGTTGG TCGTGGCCAG TCCACCGCAG
AAGGACAACG GTGGCTGGCC CGACCCGCGG GTGCTCGCCG CCTGTGCTCT GCTCGGCGTG
GATGAGGTGT ACGCCGTCGG CGGCGCGCAG GCGGTGGCGA TGCTGGCATA CGGCAGTTCG
GTTGACCCCG ATGGCGCCAC CCGCTGCGAT CCGGTCGACT TGATCACTGG CCCCGGCAAC
ATCTGGGTCA CCGCCGCCAA GCGGCTGCTG CGGGGTGTGG TGGGCATCGA CGCCGAGGCC
GGCCCCACCG AGATCGCGAT ACTGGCCGAC CACACCGCCG ATCCGGTGCA CGTGGCCGCT
GACCTGATCA GCCAGGCCGA GCACGACCCG CTCGCGGCGA GCGTGCTGGT CACGCCGTCG
ATGGAGCTGG CCGACGCGGT GGACCGGGAG CTGACCCGCC AGGTCGCGGC GGCCAAGCAC
ACCGAGCGGA TCGGCACGGC GCTCACCGGT GAGCAGAGCG GCATCGTGCT CGTTGATGAC
CTGGCGGCGG GGCTGCGGGT GGTTGACGCG TACGCGGCCG AGCATCTGGA GATTCAGACC
GAGAACGCCC GCGAGTGGGC GCTGCGGGTA CGCAACGCCG GGGCGATCTT CGTCGGTGCC
TGGTCGCCGG TGTCGCTTGG TGACTACTGC GCCGGCTCCA ACCATGTACT GCCCACCGGT
GGGTGCGCCC GGCACTCGTC GGGCCTGTCG GTGCAGTCCT TCCTGCGCGG TGTTCACCTG
GTGGAGTACA CGCGGGATGC TCTGCGGGAG GCGGCGCCGC ACGTGGTCGC CCTGGCGACG
GTGGAGGACC TGCCGGCGCA CGGCCAGGCG GTGTCCGTCC GGCTGCCGGG GGAGGCGTCG
TGA
 
Protein sequence
MLNRIDLRGG VRDPRRLLPR ARLDVSAAVE RIRPLVAEVR EHGYPAIRAA SERFDGVSPA 
VLRVPAEMVA EAEGTLDPQV RAALVESIDR ARRVHAAQRR SDHTTQVVPG GTVTERWLPV
DRVGLYVPGG LAMYPSTVVM NVVPAQEAGV RSLVVASPPQ KDNGGWPDPR VLAACALLGV
DEVYAVGGAQ AVAMLAYGSS VDPDGATRCD PVDLITGPGN IWVTAAKRLL RGVVGIDAEA
GPTEIAILAD HTADPVHVAA DLISQAEHDP LAASVLVTPS MELADAVDRE LTRQVAAAKH
TERIGTALTG EQSGIVLVDD LAAGLRVVDA YAAEHLEIQT ENAREWALRV RNAGAIFVGA
WSPVSLGDYC AGSNHVLPTG GCARHSSGLS VQSFLRGVHL VEYTRDALRE AAPHVVALAT
VEDLPAHGQA VSVRLPGEAS
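
The gene sequence starts with GTG while the protein sequence starts with M. Under translation table 11, GTG can act as an alternative start codon, and an initiator codon is read as methionine. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the space-separated sequence blocks. The function names are hypothetical, and the code assumes that the first codon of the input is the start codon. The amino acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is the start codon, so it is emitted as M
    even when it is an alternative start such as GTG.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "GTGTTGAACC GGATCGACCT TCGTGGCGGG"
print(translate_cds(prefix))  # MLNRIDLRGG
print(round(gc_fraction(prefix), 2))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would allow the reported GC content and protein length to be checked in the same way.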