Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3423 |
Symbol | |
ID | 5077572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 23612 |
End bp | 24943 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640481147 |
Product | histidinol dehydrogenase |
Protein accession | YP_001165809 |
Protein GI | 146275649 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.542405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCATCT GGCTCAAGCG CGGCGCGGCT GCGGACGTGA AGGCTCAAAG CGACCGGAAG GTCCGCGACA TCGTGGAAGC GGCGCTGGCC GACATCGAGC AGCGGGGCGA CGATGCCGTG CGCGAGATGA GCGTGAAGTT CGATGGCTGG GACCGCGACG ACTATCGCCT CTCCCAGGCA GAGATCGATG CGGCGGTCGA CAGCCTGAGC CCGCAGGAAC GCAAGGACAT CGAATTCGCG CAGGCGCAGG TGCGCAACTT CGCGCAGATC CAGCGCGAAA GCGTGAAGGA CGTCGAGGTG GAAACCATGC CCGGCGTCGT CCTCGGCCAC CGGAACGTGC CGATCCAGGC GGCCGGCTGC TATGTCCCGG GCGGCAAGTA TCCGCTGCTC GCCTCGGCGC ACATGTCGGT CATCACTGCC AAGGTCGCGG GCGTGCCGCG CGTGATCACT TGCGCGCCGC CCTTCCAGGG CAAGCCCGCG CGCGCCATCG TCGCCGCGCA GGCCATGGCC GGGGCCGACG CGATCTACGC GCTGGGCGGC ATCCAGGCGA TCGGCGCGAT GGCCATCGGC ACGCAGAGCA TCGATCCGGT CGATATCCTT GTCGGCCCGG GCAACGCATT CGTCGCAGAG GCCAAGCGCC AGCTTTTCGG CCGCGTCGGC ATAGACCTCT TCGCCGGCCC CACCGAAACG CTCATCATTG CCGACGAGAT CGGCTGCGAT CCCGAAATGG CCGCGACCGA CATCCTTGGC CAGGTAGAGC ACGGACCGGA CAGCCCCGGC GTACTGCTGA CAAATTCGGA AAAGCTCGCC CGCGAAACGA TGGCCGAGAT AGAGCGCCTG CTCCGGATCC TGCCCACCGC CGACCATGCA CGCAAGGCAT GGGAAACCTT CGGCGAGGTG ATCGTGGCCG AAAGCTACGA GGAAATGGTC CGCATCGCCG ACGACATCGC CAGCGAGCAC GTACAGGTGA TGACCGCCGA CCCCGATTAC TTCCTGAACA ACATGACCAA TTATGGCGCG CTGTTCCTCG GGCCCCGAAC CAACGTTTCC TTCGGCGACA AGGTGATCGG CACAAACCAC ACGCTGCCGA CGAAAAAGGC GGCACGCTAT ACCGGGGGTC TGTGGGTCGG CAAGTTCCTC AAGACCTGCA CCTACCAGAA GGTCCTGACC GACGAGGCAT CGGCGCTGGT GGGCGAATAT TGCAGCCGCC TCTGCGCACT GGAAGGCTTT GCGGGACACG GTGAACAGGC CAACCTGCGC GTGCGCCGCT ATGGCGGGCG CAACGTCCCC TATGCCGGAC AGGCGGAGCC GAGCGAGCTG ACGCGGGCAT GA
|
Protein sequence | MAIWLKRGAA ADVKAQSDRK VRDIVEAALA DIEQRGDDAV REMSVKFDGW DRDDYRLSQA EIDAAVDSLS PQERKDIEFA QAQVRNFAQI QRESVKDVEV ETMPGVVLGH RNVPIQAAGC YVPGGKYPLL ASAHMSVITA KVAGVPRVIT CAPPFQGKPA RAIVAAQAMA GADAIYALGG IQAIGAMAIG TQSIDPVDIL VGPGNAFVAE AKRQLFGRVG IDLFAGPTET LIIADEIGCD PEMAATDILG QVEHGPDSPG VLLTNSEKLA RETMAEIERL LRILPTADHA RKAWETFGEV IVAESYEEMV RIADDIASEH VQVMTADPDY FLNNMTNYGA LFLGPRTNVS FGDKVIGTNH TLPTKKAARY TGGLWVGKFL KTCTYQKVLT DEASALVGEY CSRLCALEGF AGHGEQANLR VRRYGGRNVP YAGQAEPSEL TRA
|
| |