Gene Ndas_3628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3628 
Symbol 
ID9247497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4352166 
End bp4353590 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content75% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003681534 
Protein GI297562560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACGC GTCCGTTCTG GCTGGCCGGC ACCGCCGCGA CCGGCGACAC CGAGATCACC 
GTCGTCAACC CCTACACCGG CGAGGACGCG GGTACCGTCT CCGTGCCCAC CGCCGAGCAG
ATCGAGGAGG CGGTGGCCGC CGCGCACGCG GTCGCCGCCG AGGCCGCCGC CCTGCCCGCC
CACGTGCGGG CCGACGCCCT CGACCACGTC TCCCGGCGCA TCGGCGAGCG CGCCGAGGAG
ATCGCCCGCA CCATCACCGC CGAGAGCGGC AAGCCGATCA AGTGGGCCCG CGCCGAGGCC
GGACGCGCCG TGTCCGTGTT CCGCTGGGCG GCCGAGGAGG CCCGCCGCGA CAGCGGTGAG
CTCCAGCGCC TGGACACCGA CCCGGGCGGC ACCGGCCGCA TGGCCCTGGT GCGCCGCGTC
CCCAAGGGCC CGGTCCTGGG CATCTCGCCG TTCAACTTCC CCCTCAACCT CGTCGCCCAC
AAGGTCGCCC CGGCCATCGC CGTCGGCGCG CCGATCATCC TCAAGCCCGC CCCCGCCACC
CCGATGACCG CCCTGCTCCT GGGCGAGATC ATCGCCGAGA CGGACCTGCC CGGCGGCATG
GTGTCGGTGC TGCCGATGCC CAACGAGCTC GCCGCCCCCC TCATCACCGA CGAACGCCTG
CCCGTCATCT CCTTCACCGG CGGCCCCTTC GGCTGGCAGC TGCCCCGCCT GGCCCCGCAC
AAGCACGTCA CCCTGGAACT GGGCGGCAAC GCCGCCGCCG TGGTGCTCGC CGACGCCGAC
CTGGACTGGG CGGCCCAGCG GGTGGCCCTG TTCGGCAACA ACCAGGCCGG ACAGGTGTGC
ATCGGCGTGC AGCGCGTCAT CGTCGAGGAC GCCGTGTACG ACGAGTTCGT CCCACGCCTG
GTGGAGCGGG TGGAGGCCCT CGGCGTGGGC GACCCCGCCG ACCCGGCCAC CGACGTGGGC
CCGCTGGTGG ACGAGGCCGC GGCCGAGCGC GTGGCCTCCT GGATCGACGA GGCCGTCACC
GCCGGGGCCA AGCTCCTCAC CGGCGGCACG CGCGACGGCG TGACCGTGGC GCCGACCGTG
CTCGCCGAGG CCCCGGACGA CTCCCGGGTG GTGCGCCAGG AGGTCTTCGG CCCGGTGCTG
GTCCTCCAGC GCGCGGCCGA CACCGACGCC GCCTTCGCCG CGGTCAACGA CAGCGACTTC
GGGCTCCAGG CGGGCGTGTT CACCCGCGAC CTGCCCACCG CGTTCCGCGC CCACCGCGAG
CTGGAGGTCG GCGGCGTGGT CATCGGCGAC GTGCCCACGT TCCGGGCCGA CCAGATGCCC
TACGGCGGCG TCAAGGGGTC GGGTGTGGGC AAGGAGGGCG TGCGCGCCGC CATGACCGAC
CTGTCCTACG AGCGGGTCCT GGTTCTGACC GGAATCGACC TGTAG
 
Protein sequence
MSTRPFWLAG TAATGDTEIT VVNPYTGEDA GTVSVPTAEQ IEEAVAAAHA VAAEAAALPA 
HVRADALDHV SRRIGERAEE IARTITAESG KPIKWARAEA GRAVSVFRWA AEEARRDSGE
LQRLDTDPGG TGRMALVRRV PKGPVLGISP FNFPLNLVAH KVAPAIAVGA PIILKPAPAT
PMTALLLGEI IAETDLPGGM VSVLPMPNEL AAPLITDERL PVISFTGGPF GWQLPRLAPH
KHVTLELGGN AAAVVLADAD LDWAAQRVAL FGNNQAGQVC IGVQRVIVED AVYDEFVPRL
VERVEALGVG DPADPATDVG PLVDEAAAER VASWIDEAVT AGAKLLTGGT RDGVTVAPTV
LAEAPDDSRV VRQEVFGPVL VLQRAADTDA AFAAVNDSDF GLQAGVFTRD LPTAFRAHRE
LEVGGVVIGD VPTFRADQMP YGGVKGSGVG KEGVRAAMTD LSYERVLVLT GIDL