Gene Ndas_2334 details

Gene Information

Locus tag: Ndas_2334
Symbol:
ID: 9246184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2785086
End bp: 2786456
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: homoserine dehydrogenase, NAD-binding protein
Protein accession: YP_003680262
Protein GI: 297561288
COG category:
COG ID:
TIGRFAM ID:
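
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2785086, End bp 2786456.
length = gene_length(2785086, 2786456)
print(length, protein_length(length))  # 1371 456
```

Both results agree with the Gene Length and Protein Length fields of the record.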


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.670502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.115253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCCA CCGACGCCAC CACGGCGCCG CCACCGCCCG TGCGCTACGC GCTCACCGGA 
GCCCGCGGCG GTTTCGCGCG CACCCTGCTC GCCCAGACCC CGCGGATGGA GCGGCTGCGG
CCCTCGGTCC TGTGCGACCT GGACACCGAG GGGACGATCG CGCTGTGCGC CGAGCTCGGC
TACCCGGCCG ACGCGCTGAC GGTGTGCTCC AGCGCGGACG AGGTGGCCGC GCTGGCCGAC
GGGCGGATCG CGGTGATCGC CGACGCCCGC CTCCTCGAAC ACGCCGACTA CGACGTGCTC
GTCGAGGCCA CCGGCAGCCC CGAGGCGGGC ACCCGGGCCG CGCGGACCGC CATCGCGGCC
GGACGCCACG TGGTCATGGT CTCCAAGGAG GTCGACTCCG TTTCCGGGAT CGCGCTGGCC
GACCTCGCCC GGGAACGGGG CGTGGTCTAC ACCCCCGGGA TCGGGGACCA GCCCGCCAAC
CTCATCGAGT GGTACGAGCG GACCCGCCTG CTGGGCCTGG ACGTCGTCGC GATCGGCAAG
TCGGGCGAGT ACGACCTGGT CTTCGACCCG GCCACGGGCC GCGTGCGCCA GCTCGACCAG
GAGATCGACG CCCCCGACCT GGCCGACCTG CTGACCCTGG GCGAGGACGT GCCCGCCACG
CTCGCCGCCC GCGCCCGCGC CGTGGCGGGC CTGTCGCGCT CGGCCGCCGC CGACTACTGC
GAGATGGGGG TCGTCGCCAA CCACACGGGC CTGGTGCCCG ACACCGAGGA GCTGCACTAC
CCCGTCGCGC GCGTCGCCGA ACTCGCCGAC GTCTACTCCC TCGCCGAGGA CGGGGGGATC
CTCTCCCGGT CCGGCGCCGT GGACGTCTTC AGCGTGCTCC GCCTGCCCGA GGAGGCCTCC
TTCGCCGGAG GGGTCTTCGC CGTCGTCCGC ACCGGCGACC CCGTCTCCTG GGCACTGCTC
GCGCAGAAGG GCCACGTGGT CTCGCGCTCG GGACGCTACG CCTGCCTCTA CCTGCCCTAC
CACCTCATGG GCGTGGAGAC CCCGCTGAGC CTGCTGGACG CCGTGGACCG CCGCCGCGCG
GTGACCCCCC GGCACCCGGC CCCGCACGCG GTCCTGGCCG GGCGCGCCCG GCGCGACCTC
CCCGCCGGGA CCAGGCTGGA CATGGGCGGA CACCACCACG ACGTCCAGGG GGTGGCCCCC
GTCCTGCTGG ACGCCGCCGA CGCCCCCGAC GACGTCGCAC CGCTCTACCT GGCGGCCCAC
GCCGCCCTGG GCCGCGACGT CGCGGCGGGC GCACTGGTGC GCCTGGACGA CCTGGCCGAC
GCCAGCGCCC CCCTCCTGGA CGCCTGGCGC CACGCCCGCA CGCACCTGTG A
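
The gene sequence is listed in space-separated blocks of ten bases, so the blocks have to be joined before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGCAAGCCA CCGACGCCAC CACGGCGCCG CCACCGCCCG TGCGCTACGC GCTCACCGGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing to `clean_sequence` should give a string whose length matches the Gene Length field.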
 
Protein sequence
MQATDATTAP PPPVRYALTG ARGGFARTLL AQTPRMERLR PSVLCDLDTE GTIALCAELG 
YPADALTVCS SADEVAALAD GRIAVIADAR LLEHADYDVL VEATGSPEAG TRAARTAIAA
GRHVVMVSKE VDSVSGIALA DLARERGVVY TPGIGDQPAN LIEWYERTRL LGLDVVAIGK
SGEYDLVFDP ATGRVRQLDQ EIDAPDLADL LTLGEDVPAT LAARARAVAG LSRSAAADYC
EMGVVANHTG LVPDTEELHY PVARVAELAD VYSLAEDGGI LSRSGAVDVF SVLRLPEEAS
FAGGVFAVVR TGDPVSWALL AQKGHVVSRS GRYACLYLPY HLMGVETPLS LLDAVDRRRA
VTPRHPAPHA VLAGRARRDL PAGTRLDMGG HHHDVQGVAP VLLDAADAPD DVAPLYLAAH
AALGRDVAAG ALVRLDDLAD ASAPLLDAWR HARTHL
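
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino acid string used for NCBI translation table 11, whose amino acid assignments are the same as the standard code. The `translate` function is an illustrative helper, not a database or library API, and it stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence up to, not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base block layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCAAGCCA CCGACGCCAC CACGGCGCCG"))  # MQATDATTAP
```

The output matches the first block of the protein sequence. This gene starts with ATG, so the sketch does not handle the alternative start codons that table 11 permits; a gene starting with one of those would need its first residue set to M.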