Gene Ndas_1337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1337 
Symbol 
ID9245187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1643732 
End bp1645192 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content73% 
IMG OID 
ProductAldehyde dehydrogenase (NAD(+)) 
Protein accessionYP_003679275 
Protein GI297560301 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCA ACGACTACAC GAACATGTTC TACGTGAACG GGCGCTGGAT CCGCTCGCAC 
GGTACGCGGC AGGTCGTGGT GACCAACCCG GCCACCGAGG AGAGCCTGGG CCGGGTGACT
CTGGGCGACG TCACCGACGT CGAGGCGGTG GTCGACGCCG CCCGCCGGGC CGCGCCCGGC
TGGGCCGGGA CGCCCGTGTC CGAGCGCGCG GCGCTGCTGC GCGCCGTCGC GGCCGAGCTG
GCCCTGCGCC AGGAGGAGAT CGCGCGGTTG GAGACGGCGG AGGTCGGTTC CCCGATCACC
CTGTCCCGCC GGGCGCACGC GCAGAGCCCG ATCCACCTCT TCGCCTCGGC CGCCGACCTG
GTCGAGCGGT CCGAGCCGGA CGAGACGATC CCCGGCGCGA CGGTGCTGCG CGAACCGTAC
GGTGTGGTCG GCGCGATCAC CCCGTGGAAC TACCCGCTGC ACCAGAGCGC GGCCAAGATA
GCCCCGGCGC TGGCGGCCGG GAACACCGTC GTGCACAAGC CCAGTGAGAC CACGCCGCTG
GGGGCCTACG CCCTGGCCGA GGCGATCGAA TCGGCGGGGC TGCCGCCCGG CGTGTTCAAC
ATGGTCATGG GTGACGGGGC GACCGTCGGA GCGCGCGTCG CCGGTCACCC CGACGTCGAC
CTCGTCTCGT TCACCGGTTC CACCCGGGCC GGGGTCCGGG TGGCGGCTGA GGCGGCGGCC
ACCGTCAAGA AGGTCTCCCT GGAACTGGGC GGCAAGAGCC CCGCCGTCAT CCTGCCGGGC
GCGCCGCTGC GCCCGGCCGT GCGGCGGGCA CTGCGCTCGG GTTTCCTCAA CTCCGGCCAG
ACGTGCATGG CCCTGACCCG GATCCTCGTC GACCGGGCGC GCCTGGCCGA GGCCGAGGAG
ATCGTCCGCG ACGCGGTCGC CGACTACGTC GTGGGCGACC CGACCGACCC GGACACCGAG
TACGGTCCGC TCGTCTCGAA GGCGCAGCGG GACCGCGTGC GCGACTACGT CCGCAGGGGG
CAGCGGGAGG GCCTGCGCCT GATCACCGGT GGTCCCGACC GCCCGGCCGC GTTGAGCCGC
GGCTACTACC TGCCGCTGAC GGTCTTCTCC GACGTCCCGC CCACCTCGGC GCTGGTGACC
GACGAGATCT TCGGGCCCGT GCTGGTGATC CAGGTCTACG ACTCGGTGGG CGAGGCCGTC
GATCTGGCCA ACCGCACGCC CTACGGCCTG TGCGCCGGGG TGTGGGGCGC CGACCGCGCC
GAGGCCGTCG AGGTGGCGGG GCGGTTGCAG GTCGGCCAGG TCTTCGTCAA CGGCGCCGGG
TTCAATCCGG ACGTCCCGTT CGGCGGCTTC AAGCGGTCGG GGATCGGCCG CGAGTACGGG
CGCTACGGGC TGGAGGAGTT CCAGCAGACC AAGGGGCTGG TGTTCGGCGC CGACGCTGTC
GGCTGTGGTG GATACCGCTG A
 
Protein sequence
MQTNDYTNMF YVNGRWIRSH GTRQVVVTNP ATEESLGRVT LGDVTDVEAV VDAARRAAPG 
WAGTPVSERA ALLRAVAAEL ALRQEEIARL ETAEVGSPIT LSRRAHAQSP IHLFASAADL
VERSEPDETI PGATVLREPY GVVGAITPWN YPLHQSAAKI APALAAGNTV VHKPSETTPL
GAYALAEAIE SAGLPPGVFN MVMGDGATVG ARVAGHPDVD LVSFTGSTRA GVRVAAEAAA
TVKKVSLELG GKSPAVILPG APLRPAVRRA LRSGFLNSGQ TCMALTRILV DRARLAEAEE
IVRDAVADYV VGDPTDPDTE YGPLVSKAQR DRVRDYVRRG QREGLRLITG GPDRPAALSR
GYYLPLTVFS DVPPTSALVT DEIFGPVLVI QVYDSVGEAV DLANRTPYGL CAGVWGADRA
EAVEVAGRLQ VGQVFVNGAG FNPDVPFGGF KRSGIGREYG RYGLEEFQQT KGLVFGADAV
GCGGYR