Gene Saro_3423 details


Gene Information

Locus tag: Saro_3423
Symbol:
ID: 5077572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 23612
End bp: 24943
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 66%
IMG OID: 640481147
Product: histidinol dehydrogenase
Protein accession: YP_001165809
Protein GI: 146275649
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
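
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not contribute a residue; the variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 23612
end_bp = 24943

# An inclusive range covers end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Each codon is 3 nucleotides; the last one is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)  # 1332 444 443
```

The results agree with the listed gene length (1332 bp) and protein length (443 aa).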


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.542405
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCATCT GGCTCAAGCG CGGCGCGGCT GCGGACGTGA AGGCTCAAAG CGACCGGAAG 
GTCCGCGACA TCGTGGAAGC GGCGCTGGCC GACATCGAGC AGCGGGGCGA CGATGCCGTG
CGCGAGATGA GCGTGAAGTT CGATGGCTGG GACCGCGACG ACTATCGCCT CTCCCAGGCA
GAGATCGATG CGGCGGTCGA CAGCCTGAGC CCGCAGGAAC GCAAGGACAT CGAATTCGCG
CAGGCGCAGG TGCGCAACTT CGCGCAGATC CAGCGCGAAA GCGTGAAGGA CGTCGAGGTG
GAAACCATGC CCGGCGTCGT CCTCGGCCAC CGGAACGTGC CGATCCAGGC GGCCGGCTGC
TATGTCCCGG GCGGCAAGTA TCCGCTGCTC GCCTCGGCGC ACATGTCGGT CATCACTGCC
AAGGTCGCGG GCGTGCCGCG CGTGATCACT TGCGCGCCGC CCTTCCAGGG CAAGCCCGCG
CGCGCCATCG TCGCCGCGCA GGCCATGGCC GGGGCCGACG CGATCTACGC GCTGGGCGGC
ATCCAGGCGA TCGGCGCGAT GGCCATCGGC ACGCAGAGCA TCGATCCGGT CGATATCCTT
GTCGGCCCGG GCAACGCATT CGTCGCAGAG GCCAAGCGCC AGCTTTTCGG CCGCGTCGGC
ATAGACCTCT TCGCCGGCCC CACCGAAACG CTCATCATTG CCGACGAGAT CGGCTGCGAT
CCCGAAATGG CCGCGACCGA CATCCTTGGC CAGGTAGAGC ACGGACCGGA CAGCCCCGGC
GTACTGCTGA CAAATTCGGA AAAGCTCGCC CGCGAAACGA TGGCCGAGAT AGAGCGCCTG
CTCCGGATCC TGCCCACCGC CGACCATGCA CGCAAGGCAT GGGAAACCTT CGGCGAGGTG
ATCGTGGCCG AAAGCTACGA GGAAATGGTC CGCATCGCCG ACGACATCGC CAGCGAGCAC
GTACAGGTGA TGACCGCCGA CCCCGATTAC TTCCTGAACA ACATGACCAA TTATGGCGCG
CTGTTCCTCG GGCCCCGAAC CAACGTTTCC TTCGGCGACA AGGTGATCGG CACAAACCAC
ACGCTGCCGA CGAAAAAGGC GGCACGCTAT ACCGGGGGTC TGTGGGTCGG CAAGTTCCTC
AAGACCTGCA CCTACCAGAA GGTCCTGACC GACGAGGCAT CGGCGCTGGT GGGCGAATAT
TGCAGCCGCC TCTGCGCACT GGAAGGCTTT GCGGGACACG GTGAACAGGC CAACCTGCGC
GTGCGCCGCT ATGGCGGGCG CAACGTCCCC TATGCCGGAC AGGCGGAGCC GAGCGAGCTG
ACGCGGGCAT GA
 
Protein sequence
MAIWLKRGAA ADVKAQSDRK VRDIVEAALA DIEQRGDDAV REMSVKFDGW DRDDYRLSQA 
EIDAAVDSLS PQERKDIEFA QAQVRNFAQI QRESVKDVEV ETMPGVVLGH RNVPIQAAGC
YVPGGKYPLL ASAHMSVITA KVAGVPRVIT CAPPFQGKPA RAIVAAQAMA GADAIYALGG
IQAIGAMAIG TQSIDPVDIL VGPGNAFVAE AKRQLFGRVG IDLFAGPTET LIIADEIGCD
PEMAATDILG QVEHGPDSPG VLLTNSEKLA RETMAEIERL LRILPTADHA RKAWETFGEV
IVAESYEEMV RIADDIASEH VQVMTADPDY FLNNMTNYGA LFLGPRTNVS FGDKVIGTNH
TLPTKKAARY TGGLWVGKFL KTCTYQKVLT DEASALVGEY CSRLCALEGF AGHGEQANLR
VRRYGGRNVP YAGQAEPSEL TRA
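
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last blocks of the gene. This is a minimal sketch, not the database's own pipeline: the helper `translate` is a hypothetical name, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs only in the set of permitted start codons). The whitespace between 10-letter blocks is stripped before translation.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with codons ordered by
# first, second, then third base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()   # drop block spacing and newlines
    usable = len(seq) - len(seq) % 3     # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides and final 12 nucleotides of the gene sequence above.
first_block = translate("ATGGCCATCT GGCTCAAGCG CGGCGCGGCT")
last_block = translate("ACGCGGGCAT GA")

print(first_block)  # MAIWLKRGAA
print(last_block)   # TRA*
```

The first block reproduces the opening residues of the protein sequence (MAIWLKRGAA), and the last block gives its final three residues followed by the TGA stop codon.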