Gene Ndas_1930 details


Gene Information

Locus tag: Ndas_1930
Symbol:
ID: 9245780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2350963
End bp: 2352405
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Aldehyde Dehydrogenase
Protein accession: YP_003679863
Protein GI: 297560889
COG category:
COG ID:
TIGRFAM ID:
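
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon while the protein length does not. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2350963, 2352405)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the two results agree with the listed gene length (1443 bp) and protein length (480 aa).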


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: 0.68692
Num covering fosmid clones: 15
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA CCCAAGGCGC CCCCGCCGCC GAGGCCGCCC CCCGGCCCCC CGCCGAGCCG 
CTCACCCTGA CCACCGAGAT CCCACCGGTG GTCGCCCGCC TGCGCGCCGC CTTCGCCTCC
GGCCGCACCA AGCCCGTCGC CTGGCGCCGC GCCCAGCTGC GCGCGCTGCG CCGCATGCTC
ACCGAGGAGC GCACCGCGTT CGAACGCGTG CTCAAGGCCG ACCTCGGCAA GAGCCCCATC
GAGGCCCACA CCACCGAGAT CGGCTTCGTG GTCAACGAGA TCGACCACAC CCTCAGGCAC
CTGGCCTCCT GGCTGCGCCC GCAGCGGGTG CCCGTGCCCG TCGCCCTGGC CCCGGCCAGG
GCCCGCCGCG TGCGCGAGCC GCTGGGCACC GTGCTGATCA TCGCCCCGTG GAACTACCCG
GTGAACCTGT CCCTGGCGCC CCTGGTCGGT GCCCTCGCCG CGGGCAACGC CGCCCTGGTC
AAGCCCAGCG AACTGGCCCC GGCCACCTCC GCCGCCCTGG CCGAGCTGCT GCCCCGCTAC
CTGGACACCG AGGCGGTCGC GGTCGTGGAG GGCGGCATCC CCGAGAGCAC CGCCCTGCTC
GATGAGCGCT TCGACCACAT CTTCTACACC GGCAACGGCA CCGTGGCCCG CATCGTCATG
GCCGCCGCCG CCAAGCACCT GACCCCCGTC ACCCTGGAGC TGGGCGGCAA GAGCCCGGCC
ATCGTCGAAC CCGGGGTGGA CCTGGCCACC ACCGCCCGCC GCCTGGCCTG GGGCAAGTTC
ACCAACACCG GTCAGACCTG CGTGGCCCCC GACTACGTGC TCGCCGTCGG CGACACCGCC
GAACCGCTCC AGCGCGAGCT GACCGCCGCC ATCACCGAGA TGTTCGGCGA GGACCCCTCA
CGCAGCGCCG ACTACGGGCG CATCGTCAAC GAGCGCCACT TCGCCCGGAT CACCGCCCTG
CTGGGCAGCG GCACCGTGGT CACCGGCGGA CAGCACGACA TCGACCGCCT CTACGTCGCC
CCCACCGTCC TGGCCGACGT GGACCCCGAC TCCCCGGTGA TGTCGGAGGA GATCTTCGGC
CCCGTCCTGC CGGTCCTGCG GGTCCCCGAC CTGGACGCGG CCATCGCCTT CGTCAACGCA
CGCGACAAGC CGCTGGCGCT GTACGGCTTC ACCGACTCCG AGGAGACCAA GCGCCGCCTG
ACCACCGAGA CCTCCTCGGG CGGCCTGGCC TTCGGTCTGC CGATCGCCCA CCTGGCCGTT
CCCGACCTGC CCTTCGGCGG CGTGGGGGAC AGCGGTATGG GCGCCTACCA CTCCGCGGCC
TCCCTGGACA CCTTCTCGCA CACCAAGTCG GTGCTGGACA AGTCGCTGTT CATGGACACC
ATGCGCATCG CCTACGCGCC CGTCACCGAC CTCAAGCAGA AGCTGCTCCG CCGGCTCCTG
TGA
 
Protein sequence
MTDTQGAPAA EAAPRPPAEP LTLTTEIPPV VARLRAAFAS GRTKPVAWRR AQLRALRRML 
TEERTAFERV LKADLGKSPI EAHTTEIGFV VNEIDHTLRH LASWLRPQRV PVPVALAPAR
ARRVREPLGT VLIIAPWNYP VNLSLAPLVG ALAAGNAALV KPSELAPATS AALAELLPRY
LDTEAVAVVE GGIPESTALL DERFDHIFYT GNGTVARIVM AAAAKHLTPV TLELGGKSPA
IVEPGVDLAT TARRLAWGKF TNTGQTCVAP DYVLAVGDTA EPLQRELTAA ITEMFGEDPS
RSADYGRIVN ERHFARITAL LGSGTVVTGG QHDIDRLYVA PTVLADVDPD SPVMSEEIFG
PVLPVLRVPD LDAAIAFVNA RDKPLALYGF TDSEETKRRL TTETSSGGLA FGLPIAHLAV
PDLPFGGVGD SGMGAYHSAA SLDTFSHTKS VLDKSLFMDT MRIAYAPVTD LKQKLLRRLL
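
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the sequence is read in frame from its first base, that whitespace in the blocks above is only layout, and that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, written for this example.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in groups of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied with their original spacing.
print(translate("ATGACCGACA CCCAAGGCGC CCCCGCCGCC"))  # prints MTDTQGAPAA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input gives a value to compare against the GC content field.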