Gene Ndas_3410 details

Gene Information

Locus tag: Ndas_3410
Symbol:
ID: 9247277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4076539
End bp: 4077960
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003681321
Protein GI: 297562347
COG category:
COG ID:
TIGRFAM ID:
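
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumed to be present)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(4076539, 4077960)
print(length)                           # 1422
print(expected_protein_length(length))  # 473
```

Both values agree with the Gene Length and Protein Length fields of the record.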


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATCCC TCTACATCGA CGGCGCCTGG CGCGACTCCG CCTCCAGCGA GGCACTGGAC 
GTGGTCAACC CGGCGACCGA GCAGGTCATC GACACCGTTC CGGCCGGGGC CGCCGAGGAC
GTCGACGCCG CGGTCGCGGC GGCAGCGGCG GCCCTGCCCG CCTGGTCCGC CCTCACGCCC
GGGCAGCGCG TCACCCACCT GGCCAAGGCC CTGGAGCTGT TCAACGCCCG CATCGACGAC
ATCGCCGCGG AGCTCACCCG GGACATGGGC GCCCCCGCGG TGTTCGCACG CAAGGTCCAG
GCGGGCCTGC CCGCCCTCAT GTTCCAGACC TACATCGACC TGGTCGAGGA GAGCGGCGAG
CGCTACTTCG GCGGCGAGCG GGTGGGCAAC TCGCTCATCG TGCGCGAGCC CGTCGGCGTG
GTCGGCGCCA TCACCCCGTG GAACTACCCG CTCCACCAGA TCGTGCTCAA GGTCGTCCCC
GCCCTCCTGG CGGGCAACAC CGTCGTCCTC AAGCCCAGCG AGGTCGCCCC GCTCAGCGCC
TACGCCCTCA CCGAGGTGTT CCACGAGGCG GGCCTGCCCG CGGGCGTGTT CAACCTGGTG
TCGGGCACCG GCCCGGTCGT GGGCGAGGCC ATCGCCGCCC ACCCCCGCGT GGACATGGTG
TCCTTCACCG GCTCCACGCG CGCCGGGACC CGGGTCAGCC AGGTCGCCGC GGAGACCGTC
AAGAAGGTCG CCCTGGAGCT GGGCGGCAAG TCCCCCAACG TCATCCTGCC CGACGCCGAC
CTGGTCAAGG CGGTCAAGCG CGGCGTCGCC GACGTCATGC GCAACACCGG GCAGAGCTGC
AACGCGCTCA CCCGCATGCT GGTGCACCGC GACTCCTACG AGGAGGCCGT CGCCCTGGCC
GCCGAGTCCG CGGCCAAGTA CGCGCCCGGC GACCCCGCGG ACGAGGCCAC CCGCATGGGC
CCGCTGGTCT CCGCCGACCA GCTGGAGCGG GTCCGCTCCT ACCTCGCGCT CGGAGTGGAG
GAGGGCGCCC GCCTGGTCAC CGGCGGCCCC GAACCGGTCC GGGGCCGCCC GGACGGCTAC
TACGTCAACC CCACGGTCTT CGCCGACGTG AGCAACGACA TGCGCGTCGC CCAGGAGGAG
ATCTTCGGGC CCGTGCTGGT GCTGATCCCC TACGACACGG AGGAGGAGGC CGTCGCCATC
GCCAACGACA CCGTGTACGG GCTCAACGCC GCGGTGTGGT CCGGCGACCC CGAGCGCGGC
CTGGCCGTCG CCCGGCGCCT GCGGGCCGGA CAGGTGGAGG TCAACGGCGG CGCCCTCAAC
CCCCGCGCCC CCTTCGGCGG CTACAAGCGC TCCGGCAACG GCCGTGAGTG GGGCGCCCAC
GGCCTGGAGG AGTTCTGCGA GGTCAAGGCC GTCCAGCTGT GA
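
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names are hypothetical, and the example uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGCGATCCC TCTACATCGA CGGCGCCTGG CGCGACTCCG CCTCCAGCGA GGCACTGGAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to all lines of the gene sequence should give a value comparable to the GC content field of the record.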
 
Protein sequence
MRSLYIDGAW RDSASSEALD VVNPATEQVI DTVPAGAAED VDAAVAAAAA ALPAWSALTP 
GQRVTHLAKA LELFNARIDD IAAELTRDMG APAVFARKVQ AGLPALMFQT YIDLVEESGE
RYFGGERVGN SLIVREPVGV VGAITPWNYP LHQIVLKVVP ALLAGNTVVL KPSEVAPLSA
YALTEVFHEA GLPAGVFNLV SGTGPVVGEA IAAHPRVDMV SFTGSTRAGT RVSQVAAETV
KKVALELGGK SPNVILPDAD LVKAVKRGVA DVMRNTGQSC NALTRMLVHR DSYEEAVALA
AESAAKYAPG DPADEATRMG PLVSADQLER VRSYLALGVE EGARLVTGGP EPVRGRPDGY
YVNPTVFADV SNDMRVAQEE IFGPVLVLIP YDTEEEAVAI ANDTVYGLNA AVWSGDPERG
LAVARRLRAG QVEVNGGALN PRAPFGGYKR SGNGREWGAH GLEEFCEVKA VQL
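
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a simplified illustration, not the database's own method. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and since this gene starts with ATG no special start-codon handling is included. The `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGCGATCCC TCTACATCGA CGGCGCCTGG"))  # MRSLYIDGAW
```

The output matches the first ten residues of the protein sequence shown above.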