Gene Ndas_1935 details

Gene Information

Locus tag: Ndas_1935
Symbol:
ID: 9245785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2358176
End bp: 2359723
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003679868
Protein GI: 297560894
COG category:
COG ID:
TIGRFAM ID:
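
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length), and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2358176
END_BP = 2359723


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the listed Gene Length and Protein Length fields.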


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.174652
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATTT CCGCCCTCCC CACCACCGCC TCCCTCGCCG ACCGCGCGCG GGCCGCCCTG 
GACCGGTGCC GCGTCGCCGC GCCCCAGGAA CCGGCCGGGG CCGGCGGCGC GGCGAGGTCG
CCCATCACCG GCCGGGAGCT CTTCGCCCTG GGCCACGAGT CCGCCGCCGA CGTCGAGCAG
GCCGTCACCG AGGCCCGGGA GGCCTTCGCC GCCTGGCGCG ACACCCCCGC CCCGGTGCGC
GGGCAGCTCG TCCACAGGCT CGGCGAGCTG CTGCGCGAGC ACAAGGCCGA CCTGGCCGAG
CTGGTGACCA TCGAGGCGGG CAAGATCCGC TCCGAGGCCC TGGGCGAGGT CCAGGAGATG
ATCGACGTCT GCGACCTGGC CGTGGGCATG TCCCGGCAGC TGTACGGGCG CACCATGCCC
TCCGAGCGGC CCGGCCACCG GCTCATGGAG ACCTGGCACC CGCTGGGCGT GGTCGGCGTG
ATCACCGCCT TCAACTTCCC CGTGGCGGTG TGGTCGTGGA ACACGTGCGT GGCCCTGGTG
TGCGGCAACA CCGTGGTCTG GAAGCCCTCG GAGCTGACCC CGCTCACCGC GCTGGCCTGC
CACGGCCTGC TCATGCGCGC GGCGCAGGAG GTCGGCGCGC CCGCCCGGGG TCTGCACCGG
GTCGTGCTGG GCGGCCGCGA GATCGGCCGG GCCCTGGCCG ACGACCCCCG GGTGGCCCTG
CTGTCGGCCA CGGGATCGAC CGCCATGGGC CGCGAGGTGG CGCCCCGGGT GGCCGCCCGC
ATGGGCCGCT ACCTGCTGGA ACTGGGCGGC AACAACGCGG CCGTGGTGGC CCCCTCCGCC
GACCTGGACC TGGTGGTGCG CGGCAGCGTC TTCTCCGCCG CGGGCACGGC GGGTCAGCGC
TGCACGACGC TGCGCCGCCT CATCGTGCAC GAGGACGTGG CCGAGGAGGT CACCCGGCGG
ATCGTGGACG CCTACAAGCA GCTGCGCGAC CGCGTCGGCG ACCCCTTCGC GGAGTCGACG
CTGGTGGGTC CGCTGGTGGG CGAGCGCGGG TACGCCGCGA TGCGCTCGGC GCTGGAGCGC
GCCGCGGCCG AGGGCGGCCA GGTGCTGGTC GGCGGCTCCC GGGTCCTGGA GGAGGAGGCC
GCCGACGCCT ACTACGTCGA GCCCGCGGTG GTGCGCATGC CCGGCCAGAC CGCCGTCGTG
CGCGAGGAGA CCTTCGCGCC GATCCTGTAC GTGATGCCCT ACCGCACGCT GGAGGAGGCG
GTGGAGCTGC ACAACGGCGT GCCCCAGGGG CTGTCGTCGT CGATCTTCAC CCAGGACCAG
TCCGAGGCCG AGCGGTTCCT GTCCGCGGCC GGTTCGGACT GCGGGATCGT CAACGTCAAC
ATCGGCACCT CCGGCGCGGA GATCGGCGGC GCCTTCGGCG GGGAGAAGGA CACCGGCGGC
GGCCGCGAGT CGGGTTCGGA CTCCTGGAAG GCCTACATGC GCCGGGCGAC CAACACGGTC
AACTACGGCG GCGAGCTGCC GCTGGCCCAG GGCGTGAGCT TCCTCTGA
 
Protein sequence
MPISALPTTA SLADRARAAL DRCRVAAPQE PAGAGGAARS PITGRELFAL GHESAADVEQ 
AVTEAREAFA AWRDTPAPVR GQLVHRLGEL LREHKADLAE LVTIEAGKIR SEALGEVQEM
IDVCDLAVGM SRQLYGRTMP SERPGHRLME TWHPLGVVGV ITAFNFPVAV WSWNTCVALV
CGNTVVWKPS ELTPLTALAC HGLLMRAAQE VGAPARGLHR VVLGGREIGR ALADDPRVAL
LSATGSTAMG REVAPRVAAR MGRYLLELGG NNAAVVAPSA DLDLVVRGSV FSAAGTAGQR
CTTLRRLIVH EDVAEEVTRR IVDAYKQLRD RVGDPFAEST LVGPLVGERG YAAMRSALER
AAAEGGQVLV GGSRVLEEEA ADAYYVEPAV VRMPGQTAVV REETFAPILY VMPYRTLEEA
VELHNGVPQG LSSSIFTQDQ SEAERFLSAA GSDCGIVNVN IGTSGAEIGG AFGGEKDTGG
GRESGSDSWK AYMRRATNTV NYGGELPLAQ GVSFL
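
As a rough illustration of how the gene and protein sequences relate, the following sketch translates the first 60 bases of the gene sequence and computes their GC fraction. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for non-start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence, copied from the record above.
FIRST_LINE = "ATGCCGATTT CCGCCCTCCC CACCACCGCC TCCCTCGCCG ACCGCGCGCG GGCCGCCCTG"


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


fragment = clean(FIRST_LINE)
print(translate(fragment))
print(round(gc_fraction(fragment) * 100, 1))
```

The translated fragment can be compared with the first 20 residues of the protein sequence above. The GC value printed here covers only these 60 bases, so it need not equal the 75% reported for the whole gene.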