Gene Ndas_4883 details


Gene Information

Locus tag: Ndas_4883
Symbol:
ID: 9248770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 15594
End bp: 17117
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Aldehyde dehydrogenase (NAD(+))
Protein accession: YP_003682772
Protein GI: 297563799
COG category:
COG ID:
TIGRFAM ID:
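
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state this, but the convention reproduces the listed gene length) and that the final codon is a stop codon excluded from the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 15594
END_BP = 17117


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one trailing stop codon is not counted."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1524 507
```

Both results agree with the Gene Length and Protein Length fields of the record.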


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCT ACGCACCCCC GGGCCAGCCC GGAAGCGTCG TCGAGTACGC CGCCCGCTAC 
GACAACTGGA TCGGGGGCGA GTGGGTCAGG CCGGTCCGGG GCCGCTACTT CGAGAACCCC
AGCCCCGTCA ACGGCCGCGT CTTCACCGAG GTCGCCCGCA GCGGCGCGGA GGACGTGGAA
CTGGCCCTGG ACGCGGCCCA CGGCGCCGCC CCCGCGTGGG GCCGCACCTC CGCCGCCGAG
CGGGCCCTGG TCCTCAACCG GATCGCCGAC CGCGTCGAGG AGAACCTGGA GAGGCTCGCC
GTCGCCGAGT CCTGGGAGAA CGGCAAGCCC GTCCGCGAGT GCCTGGCCGC CGACCTGCCG
CTGGCCGTGG ACCACTTCCG CTACTTCGCC GGGGCGATCC GCGCGCAGGA GGGGCACACC
TCCCAGATCG ACGGCGACAC CGTCGCCTAC CACTTCCAGG AGCCCCTGGG CGTGGTCGGC
CAGATCATCC CGTGGAACTT CCCGCTGCTC ATGGCCACCT GGAAGCTCGC GCCCGCGCTG
GCCGCCGGGA ACGCGGTCGT GCTCAAGCCC GCCGAGCAGA CCCCCGCGTC GATCCTGCTG
CTCATGGAGC TGGTCGCCGA CCTGCTGCCG CCCGGCGTGG TCAACGTCGT CAACGGCTTC
GGCGCGGAGG CGGGCAAACC GCTGGCCAGC AGCCCCCGCG TCAGCAAGGT CGCCTTCACC
GGCGAGACCA CCACCGGCCG CCTCATCATG CAGTACGCGT CGGAGAACCT CATCCCGGTC
ACCCTGGAGC TGGGCGGCAA GAGCCCGAAC ATCTTCTTCG CCGACGTGGC CGCGGCCGAC
GACGCCTTCT ACGACAAGGC CCTGGAGGGC TTCACCCTCT TCGCCCTCAA CCAGGGCGAG
GTGTGCACCT GCCCCTCGCG GGCCCTGGTG CAGGACGCCG TCTACGACCG CTTCATGGGC
GACGCCCTGG CCCGCGTCGG CCGGATCCGG CAGGGGAACC CGCTGGACAC CGACACCATG
GTCGGCGCCC AGGCCAGCAA CGACCAGCTG GAGAAGATCC TGTCCTACAT CGACATCGGC
CGCCGGGAGG GGGCCGCGGT GCTCGCCGGA GGGGAGCGGG TCGATCCCGG CGGAGACCTG
TCCGGCGGCT ACTACGTCGC GCCGACCGTC TTCGAGGGCC ACAACGGCAT GCGGATCTTC
CAGGAGGAGA TCTTCGGCCC GGTGGTGTCG GTGGCCCGCT TCGACGACTA CGACGACGCC
CTCAAGACCG CCAACGACAC CCTCTACGGG CTGGGGGCGG GGGTGTGGTC GCGCGACGGC
AACACCGCCT ACCGCGCGGG CCGCGACATC CAGGCGGGCC GCGTGTGGGT GAACAACTAC
CACTCCTACC CGGCGCACGC GGCCTTCGGC GGGTACAAGC AGTCCGGCAT CGGCCGCGAG
AACCACAAGA TGATGCTCGA CCACTACCAG CAGACCAAGA ACCTGCTGGT CAGCTACTCC
GACAAGGCGA TGGGGCTGTT CTGA
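
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before it is analysed. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip the block formatting and compute a GC percentage that can be compared with the GC content field of the record. Only the first line of the sequence is used as sample input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence; paste every line to check the whole gene.
first_line = "ATGGCGATCT ACGCACCCCC GGGCCAGCCC GGAAGCGTCG TCGAGTACGC CGCCCGCTAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the complete sequence gives a value to compare against the rounded percentage in the record.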
 
Protein sequence
MAIYAPPGQP GSVVEYAARY DNWIGGEWVR PVRGRYFENP SPVNGRVFTE VARSGAEDVE 
LALDAAHGAA PAWGRTSAAE RALVLNRIAD RVEENLERLA VAESWENGKP VRECLAADLP
LAVDHFRYFA GAIRAQEGHT SQIDGDTVAY HFQEPLGVVG QIIPWNFPLL MATWKLAPAL
AAGNAVVLKP AEQTPASILL LMELVADLLP PGVVNVVNGF GAEAGKPLAS SPRVSKVAFT
GETTTGRLIM QYASENLIPV TLELGGKSPN IFFADVAAAD DAFYDKALEG FTLFALNQGE
VCTCPSRALV QDAVYDRFMG DALARVGRIR QGNPLDTDTM VGAQASNDQL EKILSYIDIG
RREGAAVLAG GERVDPGGDL SGGYYVAPTV FEGHNGMRIF QEEIFGPVVS VARFDDYDDA
LKTANDTLYG LGAGVWSRDG NTAYRAGRDI QAGRVWVNNY HSYPAHAAFG GYKQSGIGRE
NHKMMLDHYQ QTKNLLVSYS DKAMGLF