Gene Ndas_3192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3192 
Symbol 
ID9247049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3816422 
End bp3817717 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content75% 
IMG OID 
Producthistidinol dehydrogenase 
Protein accessionYP_003681106 
Protein GI297562132 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.142432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAGTC GAATCGACCT CCGAGGCACC CAAGGCGACC CGCGTCAGGC CCTTCCGCGC 
GCGGAACTGG ACGTGGCGGC CGCCGCCGAG CGCGTACGCC CCCTGTGCGA GGACGTGCGC
CATCGCGGTG TCGAGGCCCT GGTCGAACTC ACCGAGCGCT TCGACGGCGT CAGACTGACC
GACATCCGCG TGCCCAAGGA CGCGATCGAG GCCGCCCTGG ACGGCCTCGA CCCCGCCGTG
CGCGCCGCGC TGGAGGAGTC CATCCGCCGC GCCCGCGCGG TCCACCGCGA CCAGCGCCGC
ACCGACCACA CCACCCGCGT CGTGCCCGGG GGCACCGTCA CCGAGAGGTG GATCCCCGTC
GACCGCGTCG GCCTCTACGT GCCGGGCGGC CGGGCCGTCT ACCCCTCCAG CGTCGTCATG
AACGTCGTCC CCGCCCAGGA GGCGGGGGTG CGCTCCCTGG CGGTCACCTC GCCGCCCCAG
AGCGCCTTCG GCGGGCTGCC CCACCCGACC ATCCTCGCCG CCTGCGCCCT GCTCGGCGTC
GACGAGGTCT ACGCCGTCGG GGGCGCCCAG GCGATCGCCA TGTTCGCCTA CGGCGCGGGC
CCCTGCGAGC GCGCCGACAT GGTCACCGGC CCCGGCAACA TCTGGGTGGC CGCGGCCAAA
CGCCTGCTCA AGGGCGTCAT CGGCATCGAC GCCGAGGCCG GTCCCACCGA GATCGCGATC
CTCGCCGACG CCACCGCCAA CCCCGACTAC GTCGCCGCCG ACCTGATCAG CCAGGCCGAG
CACGACGTCG TCGCCGCCTC CGTCCTGGTC ACCCCGGACG AGGCGCTCGC CGAGGCGGTC
ACCGACCGCC TCGCCGCCCG CGTGGCCGCC ACCAAGCACG GCGACCGCGT CCGCGAGGCC
CTGTCCGGCC CGCAGTCCGG CATCGTCCTG GTCGACGACC TCGACCACGG CCTCGCCGTC
GTCAACGCCT ACGCCGCCGA GCACCTGGAG GTCATGACCG CCGACGCCGC CGCGTGTGCC
GCGCGCGTGC GCAACGCGGG CGCGATCTTC GTCGGCGACT TCTCGCCGGT CTCCCTGGGC
GACTACGCGG CCGGGTCCAA CCACGTGCTG CCCACCGGAG GCTGCGCCTG CCACACCGGC
GGCCTGAGCG TGCAGACCTT CCTACGCGGC GTGCACGTGG TCGAGTACGA CCGCGAGGCG
CTGACCGACG TCGCCCACCA CGTCATCGCC CTGGCCAACG CCGAGGACCT GCCCGCGCAC
GGCGAGGCCG TCGCCGCGCG CACGGACCCG GCCTGA
 
Protein sequence
MISRIDLRGT QGDPRQALPR AELDVAAAAE RVRPLCEDVR HRGVEALVEL TERFDGVRLT 
DIRVPKDAIE AALDGLDPAV RAALEESIRR ARAVHRDQRR TDHTTRVVPG GTVTERWIPV
DRVGLYVPGG RAVYPSSVVM NVVPAQEAGV RSLAVTSPPQ SAFGGLPHPT ILAACALLGV
DEVYAVGGAQ AIAMFAYGAG PCERADMVTG PGNIWVAAAK RLLKGVIGID AEAGPTEIAI
LADATANPDY VAADLISQAE HDVVAASVLV TPDEALAEAV TDRLAARVAA TKHGDRVREA
LSGPQSGIVL VDDLDHGLAV VNAYAAEHLE VMTADAAACA ARVRNAGAIF VGDFSPVSLG
DYAAGSNHVL PTGGCACHTG GLSVQTFLRG VHVVEYDREA LTDVAHHVIA LANAEDLPAH
GEAVAARTDP A