Gene Saro_2920 details


Gene Information

Locus tag: Saro_2920
Symbol: hisD
ID: 3917355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3137953
End bp: 3139242
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 69%
IMG OID: 640445698
Product: histidinol dehydrogenase
Protein accession: YP_498189
Protein GI: 87200932
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
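
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that contributes no residue; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3137953
end_bp = 3139242

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1290 429
```

Under these assumptions the result matches the listed Gene Length (1290 bp) and Protein Length (429 aa).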


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCTGCGCC TCAGGTCCGG CGATCCCGAT TTCGCCCGCG CCTTCGCACG CCTCGTCAAG 
GAGCGTCGTG AAAGCGATGA CAACGTCGCG CGCGATGTCC AGACCATCGT CGAGGACGTC
CGCCTGCGCG GGGACGAGGC CCTTGCCGAA TATACTGCAA GGTTCGATGG CCACGCCCCC
GCCGACGATG ACTGGCGCAT CTCGCCCGAG GCGTGCCGCG AAGCCTTCGA CGCGCTCAAG
CCCGACCTTC GCGCCGCGCT CGAACTCGCG GCGGAACGCA TCCGCGAATA TCACAAGGCC
CAGCTTCCCG GGGACCGCGA CTACACCGAC GCCACCGGCA TGCGTCTCGG CGCGCGCTGG
CGGCCCGTCG ATGGCGCCGG TCTCTATGTG CCCGGAGGCC GCGCCGCCTA TCCCTCGTCG
CTGCTGATGA ACGCCATCCC CGCCAAGGTC GCGGGGGTCG AGCGTCTGGT CGTCGTCACC
CCCACGCCGA AGGGCGAGGC GAACCCCCTG GTCCTTGCCG CCGCGCACCT TGCCGGCGCG
GACGAAGTCT GGCGCGTCGG CGGGGCGCAG GCGGTTGCCG CGCTCGCCTA TGGCACGAAG
CGGATCAGGC CGGTCGACGT CATCACCGGT CCGGGCAACG CCTGGGTGGC CGAGGCCAAG
CGCCAGCTCT TCGGCGTCGT GGGCATCGAC ATGGTCGCCG GGCCTTCGGA AATCCTCGTC
ATCGCCGATG CGAAGAACGA TCCGCAATGG ATTGCCGCCG ACCTCCTCAG CCAGGCCGAG
CACGATCCCG TCGCGCAGTC GATCCTCATC ACGGACGACG CCGCATTCGC CGATCAGGTG
GCCGACATGG TCGGAGTCGA GATCGCCATG CTGCCCACGG CAAAGGTCGC CAAGGCAAGC
TGGGACGCGC ATGGCGTGAT CATCGTGGTG GACTCGCTCG ACGAAGCGCC CGCGCTCGCC
AATGCGCTCG CCGCCGAACA CGTCGAGATC GCCACCGACG ATCCCCAGGC CCTGTTCGAC
CGCATCCGCC ACGCCGGATC GGTCTTCCTC GGTCGCATGA CCCCCGAAGC CGTTGGCGAT
TACGTTGCGG GACCGAACCA CGTCCTGCCG ACCGGTCGCC GCGCGCGCTT TTCTTCGGGC
CTCTCCGTTC TGGATTTCAT GAAACGCACC AGCTTCATCG CGGCCACTGA CGCCGCGCTC
GCCGCCGTCG GCCCCGCAGC CGTGGCCCTT GCCGAGGCAG AAGGCCTTGC CGCCCACGCC
CGCTCCATCG AACTGAGGCT CGGCATATGA
 
Protein sequence
MLRLRSGDPD FARAFARLVK ERRESDDNVA RDVQTIVEDV RLRGDEALAE YTARFDGHAP 
ADDDWRISPE ACREAFDALK PDLRAALELA AERIREYHKA QLPGDRDYTD ATGMRLGARW
RPVDGAGLYV PGGRAAYPSS LLMNAIPAKV AGVERLVVVT PTPKGEANPL VLAAAHLAGA
DEVWRVGGAQ AVAALAYGTK RIRPVDVITG PGNAWVAEAK RQLFGVVGID MVAGPSEILV
IADAKNDPQW IAADLLSQAE HDPVAQSILI TDDAAFADQV ADMVGVEIAM LPTAKVAKAS
WDAHGVIIVV DSLDEAPALA NALAAEHVEI ATDDPQALFD RIRHAGSVFL GRMTPEAVGD
YVAGPNHVLP TGRRARFSSG LSVLDFMKRT SFIAATDAAL AAVGPAAVAL AEAEGLAAHA
RSIELRLGI
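
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons by hand. The following is a minimal sketch, not the database's own method: it builds the standard codon assignments (which, to the best of our knowledge, translation table 11 shares for elongation codons, differing only in the permitted start codons) and translates the first and last 30 nucleotides of the gene sequence. The function name `translate` and the variables `first_block` and `last_block` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last 30 nucleotides of the gene sequence listed above.
# Both blocks start at a codon boundary (offsets 0 and 1260).
first_block = "ATGCTGCGCC TCAGGTCCGG CGATCCCGAT"
last_block = "CGCTCCATCG AACTGAGGCT CGGCATATGA"

print(translate(first_block))  # MLRLRSGDPD
print(translate(last_block))   # RSIELRLGI
```

Both results agree with the start and end of the protein sequence shown above; the final `TGA` codon is the stop.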