Gene Ndas_2498 details


Gene Information

Locus tag: Ndas_2498
Symbol:
ID: 9246348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2960986
End bp: 2962635
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003680423
Protein GI: 297561449
COG category:
COG ID:
TIGRFAM ID:
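
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2960986
end_bp = 2962635

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the last codon being a stop codon
# that does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1650 549
```

Under these assumptions the computed values agree with the listed Gene Length (1650 bp) and Protein Length (549 aa).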


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAGC ACACTCCGGA CCCGAAGGGC ACGAACACGG GCACGGCCGC GGACGTCCTC 
GGACCGCTGG TGGCCTGCGC TCCCGTCGAG GGCGGGGCCG GGACCCTGCG CGCGACCGAC
CCCGCCACGG GGGAGGAGTT CGGCGAGCCC GTGGGCCTGG TGGACTCCGG CCAGATCCAG
GAGGCCACCC GCGCCGCCGA GCAGGCCCTG GACGCCTTCC GCGCCCAGTC GCCAGCCGAG
CGCGCCGACT TCCTGCGCCG CGTCGCCGAC AACATCGACG CCCTGGGCGA CGCGCTCGTG
GACCGGGCGG TGCGCGAGAG CGGCCTGCCG CGCCAGCGCC TGACGGGGGA GCGGGCCCGC
ACCACCGGCC AGCTGCGCAT GTTCGCCGAC GTCGTGGCCC AGGGCGACGC CCTGGGCGCC
CGCATCGACC CGGCGCTGCC CGACCGCACC CCCCAGCCCC GCCCCGACCT GCGCCTGGCG
CACATCCCCG TCGGCCCCGT GGTGGTCTTC GGCGCGAGCA ACTTCCCCCT GGCCTTCTCC
ACCGCCGGGG GCGACACGGC CGCGGCCCTG GCCGCCGGGT GCCCGGTGAT CGTCAAGGGC
CACAACGCCC ACCCCGGCAC CGCCGCGCTG GTCGGGCGCG CCGTGGCCGA CGCGGTGCGC
GAGAGCGGCC TGCCCGGCGG CGTGTTCTCC CTGTTGTTCG GGGAGGGCAA CGGCATCGGA
CAGGAGCTGG TCGCCGACCC GCGCGTGAAG GCCGTGGCCT TCACCGGGTC GCGGGGCGGC
GGGCTGGCCC TGATGCGGGT GGCGGCCGAG CGCCCCGAGC CGATCCCGGT CTTCGCCGAG
ATGTCCTCGG TCAACCCCGT GTTCGTGCTG CCCGGCGCGC TCGCCGGGCA GGGCGCCCAG
GACCTGGCCG GGGCCTACGT CGCCTCGCTC ACCCTGGGAT CGGGGCAGTT CTGCACCAAC
CCCGGGCTGG TGTTCGTCCC CTCCACCCCG GACGGGGACC GGTTCGTGGA AGCCGCCGCG
CGCCTGGTGG CCGACGCCAC CGGCCAGACG ATGCTCACCG CGCCCATCGC CGCGGCCTTC
CGCGACGGGG TGGAGGCGCT GGAGGGGCGT TCGGAGGTCG TGCTGCGGGC CAGGGGCGGC
GAGGGGGAGG GCCCCAACGC GCCGGCCCCG GCCCTGGCCG AGGTGTCCCT GGCCGACCTC
ACGGCCGACC CGCGCCTGAG CGAGGAGGTC TTCGGCGCCG CGGGCACGGT GGTGCGCTAC
CCCGACGCCG CCGCGCTGCC CGCCGCGCTG GAGGGGCTGG AGGGCCAGCT CACCGCGACC
CTGCACGCCG ACACCGGCGA CGCCGACGAC CTGGCGGCGG CGCGCGCCCT GCTGCCGGTG
CTGGAGCGCC GCGCGGGCCG CGTCCTGTTC GGGGGCTGGC CCACTGGGGT GGAGGTGACC
CACGCGATGG TCCACGGCGG CCCCTTCCCC GCCACCTCCG ACGGCCGGGG CACCTCGGTG
GGCAGCCTGG CCCTGCACCG GTTCCTGCGG CCGGTCAGCT ACCAGGACAT GCCCGACGCC
CTGCTGCCGC CCGCACTGCG CCAGGACAAC CCCTGGCGGC TGAACCGACG AGTCGAGGGC
ACCGTCGTGC CCGGAGGAGA CCCCGCATGA
 
Protein sequence
MAEHTPDPKG TNTGTAADVL GPLVACAPVE GGAGTLRATD PATGEEFGEP VGLVDSGQIQ 
EATRAAEQAL DAFRAQSPAE RADFLRRVAD NIDALGDALV DRAVRESGLP RQRLTGERAR
TTGQLRMFAD VVAQGDALGA RIDPALPDRT PQPRPDLRLA HIPVGPVVVF GASNFPLAFS
TAGGDTAAAL AAGCPVIVKG HNAHPGTAAL VGRAVADAVR ESGLPGGVFS LLFGEGNGIG
QELVADPRVK AVAFTGSRGG GLALMRVAAE RPEPIPVFAE MSSVNPVFVL PGALAGQGAQ
DLAGAYVASL TLGSGQFCTN PGLVFVPSTP DGDRFVEAAA RLVADATGQT MLTAPIAAAF
RDGVEALEGR SEVVLRARGG EGEGPNAPAP ALAEVSLADL TADPRLSEEV FGAAGTVVRY
PDAAALPAAL EGLEGQLTAT LHADTGDADD LAAARALLPV LERRAGRVLF GGWPTGVEVT
HAMVHGGPFP ATSDGRGTSV GSLALHRFLR PVSYQDMPDA LLPPALRQDN PWRLNRRVEG
TVVPGGDPA
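
The sequences above are displayed in blocks of ten characters, so they need the spaces and line breaks removed before any downstream use. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation of the kind that could be compared against the GC content field. `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of the source database, and only the first displayed row of the gene sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGGCCGAGC ACACTCCGGA CCCGAAGGGC ACGAACACGG GCACGGCCGC GGACGTCCTC"

cleaned = clean_sequence(first_row)
print(len(cleaned), round(gc_fraction(cleaned), 2))
```

To check the whole gene, paste the full Gene sequence block into one string in place of `first_row`; the cleaned length should then equal the listed Gene Length.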