Gene Ndas_5405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5405 
Symbol 
ID9249308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp583217 
End bp584806 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003683290 
Protein GI297564317 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.264091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAG AGTCTGATGA GACGAGGCTG TTTGTGACTA ACGATCCGGG GCAAACCGGA 
CCCGTGAACG AACTCTACGT AACCCTTCCG TCCGTGCCGT ACCTGGCGGA ACCGGGGTCG
GCGACGGCCA CCAGCCTCTA CATCGACGGT CGGTGGCGGG CGGCCGGCAA CGGCCGGGTG
CGGGAGATCC TGAACCCCGC CGACGCCTCC GTCCTGACCA TCGTCAGCGA GGGCGGAAGG
GCCGACTCCG AGGAGGCCAT CGCCGCGGCC CGCCGCGCCT TCGACGGCGG CGAGTGGCCC
CGCACCCCCG CCGGGGAGCG CGGACGCGTC CTCGACCGCA TCGCCGACCT GCTCCAGCGC
GACCGCGAGG AGATCGCGGT CATGGAGTCC CTCGACACCG GCAAGACCAT CGAGGAGGGC
GGGATCGACG TCGACGACGT CACCGGCGTC TTCCGCTACT ACGCCGGTCT CGCCGACAAG
GACACGGGCC GCCTGGTCTC CGCGCCCGAG GGCGTGCACA GCAAGGTGGT CTACGAGCCC
GTCGGCGTCT GCGGCATGAT CACGCCCTGG AACTACCCTC TGCTCCAGCT CGCCTGGAAG
ATGGCCCCGG CCCTGGCCGC GGGCAACACC ATGGTGGTCA AGCCCAGTGA GATCACACCG
GTCACCACCG CCAAGCTGGT CGAGCTCACT ACCGAGGCGG GCGTCCCGGC GGGCGTGGTC
AACCTGGTCA CGGGCAGCGG CCCCGACGCG GGCGCCCCGC TGTCCGAGCA CCCCGACGTG
GACCTGATCT CCTTCACCGG CGGTCTGGCC ACCGGCAGGC GGATCATGGC GGCGGCCTCC
GAGACGGTCA AGAAGATCGC CCTGGAACTC GGCGGCAAGA ACCCCAACAT CATCTTCCCG
GACGTGGACC TGGACACCGC CGTGGACTAC GCGCTCAGCG CCGCGTTCTT CCACTCCGGG
CAGGTCTGCT CGGCCGGGGC GCGCCTCATC GTGCACAACG ACGTCCACGA CGCCTTCACC
ACCGAACTCG CCCGCCGCGC CGAGGCCATC CGCATCGGCC GAGGCCAGGA CGAGGGCGTG
CGCTGCGGCC CGCTGGTGTC GGCCGAGCAC CGCGCCAAGG TGGAGGCCGC GGTCGCTCGC
GGCGTCGAGG AGGGCGCCCG GATCATCGCC GGGGGCAGGC GGCCCGACGA CCCGGACCTC
GCGCAGGGCT ACTTCTACCG GCCCACCGTG TTCGTGGACT GCGACCGGGC CATGGACATC
GTCCAGACCG AGGTGTTCGG CCCGGTCGTG ACCGTGGAGC GGTTCGAGAC CGAGCAGCAG
GCCGTCGAGC TGGGCAACGA CACCGACTAC GGCCTCTCCG GCGGAGTGTG GACCGACGAC
ACCGCCCGCG GGGAGCGCGT CGCGGCGGCC CTGCGCCACG GCACCGTCTG GATCAACGAC
TACGGCCCCT ACTTCCCCGG CGCCGAGTGG GGCGGCTACG GCCGCAGCGG GATCGGCCGC
GAACTCGGAC TCGCGGGCCT GGACGAGTAC CGCGAGGCCA AGCACGTCTA CCGCAACCTG
TCCCCCGAAC CGCAGCGCTG GTTCGGCTGA
 
Protein sequence
MAEESDETRL FVTNDPGQTG PVNELYVTLP SVPYLAEPGS ATATSLYIDG RWRAAGNGRV 
REILNPADAS VLTIVSEGGR ADSEEAIAAA RRAFDGGEWP RTPAGERGRV LDRIADLLQR
DREEIAVMES LDTGKTIEEG GIDVDDVTGV FRYYAGLADK DTGRLVSAPE GVHSKVVYEP
VGVCGMITPW NYPLLQLAWK MAPALAAGNT MVVKPSEITP VTTAKLVELT TEAGVPAGVV
NLVTGSGPDA GAPLSEHPDV DLISFTGGLA TGRRIMAAAS ETVKKIALEL GGKNPNIIFP
DVDLDTAVDY ALSAAFFHSG QVCSAGARLI VHNDVHDAFT TELARRAEAI RIGRGQDEGV
RCGPLVSAEH RAKVEAAVAR GVEEGARIIA GGRRPDDPDL AQGYFYRPTV FVDCDRAMDI
VQTEVFGPVV TVERFETEQQ AVELGNDTDY GLSGGVWTDD TARGERVAAA LRHGTVWIND
YGPYFPGAEW GGYGRSGIGR ELGLAGLDEY REAKHVYRNL SPEPQRWFG