Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4883 |
Symbol | |
ID | 9248770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 15594 |
End bp | 17117 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Aldehyde dehydrogenase (NAD(+)) |
Protein accession | YP_003682772 |
Protein GI | 297563799 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATCT ACGCACCCCC GGGCCAGCCC GGAAGCGTCG TCGAGTACGC CGCCCGCTAC GACAACTGGA TCGGGGGCGA GTGGGTCAGG CCGGTCCGGG GCCGCTACTT CGAGAACCCC AGCCCCGTCA ACGGCCGCGT CTTCACCGAG GTCGCCCGCA GCGGCGCGGA GGACGTGGAA CTGGCCCTGG ACGCGGCCCA CGGCGCCGCC CCCGCGTGGG GCCGCACCTC CGCCGCCGAG CGGGCCCTGG TCCTCAACCG GATCGCCGAC CGCGTCGAGG AGAACCTGGA GAGGCTCGCC GTCGCCGAGT CCTGGGAGAA CGGCAAGCCC GTCCGCGAGT GCCTGGCCGC CGACCTGCCG CTGGCCGTGG ACCACTTCCG CTACTTCGCC GGGGCGATCC GCGCGCAGGA GGGGCACACC TCCCAGATCG ACGGCGACAC CGTCGCCTAC CACTTCCAGG AGCCCCTGGG CGTGGTCGGC CAGATCATCC CGTGGAACTT CCCGCTGCTC ATGGCCACCT GGAAGCTCGC GCCCGCGCTG GCCGCCGGGA ACGCGGTCGT GCTCAAGCCC GCCGAGCAGA CCCCCGCGTC GATCCTGCTG CTCATGGAGC TGGTCGCCGA CCTGCTGCCG CCCGGCGTGG TCAACGTCGT CAACGGCTTC GGCGCGGAGG CGGGCAAACC GCTGGCCAGC AGCCCCCGCG TCAGCAAGGT CGCCTTCACC GGCGAGACCA CCACCGGCCG CCTCATCATG CAGTACGCGT CGGAGAACCT CATCCCGGTC ACCCTGGAGC TGGGCGGCAA GAGCCCGAAC ATCTTCTTCG CCGACGTGGC CGCGGCCGAC GACGCCTTCT ACGACAAGGC CCTGGAGGGC TTCACCCTCT TCGCCCTCAA CCAGGGCGAG GTGTGCACCT GCCCCTCGCG GGCCCTGGTG CAGGACGCCG TCTACGACCG CTTCATGGGC GACGCCCTGG CCCGCGTCGG CCGGATCCGG CAGGGGAACC CGCTGGACAC CGACACCATG GTCGGCGCCC AGGCCAGCAA CGACCAGCTG GAGAAGATCC TGTCCTACAT CGACATCGGC CGCCGGGAGG GGGCCGCGGT GCTCGCCGGA GGGGAGCGGG TCGATCCCGG CGGAGACCTG TCCGGCGGCT ACTACGTCGC GCCGACCGTC TTCGAGGGCC ACAACGGCAT GCGGATCTTC CAGGAGGAGA TCTTCGGCCC GGTGGTGTCG GTGGCCCGCT TCGACGACTA CGACGACGCC CTCAAGACCG CCAACGACAC CCTCTACGGG CTGGGGGCGG GGGTGTGGTC GCGCGACGGC AACACCGCCT ACCGCGCGGG CCGCGACATC CAGGCGGGCC GCGTGTGGGT GAACAACTAC CACTCCTACC CGGCGCACGC GGCCTTCGGC GGGTACAAGC AGTCCGGCAT CGGCCGCGAG AACCACAAGA TGATGCTCGA CCACTACCAG CAGACCAAGA ACCTGCTGGT CAGCTACTCC GACAAGGCGA TGGGGCTGTT CTGA
|
Protein sequence | MAIYAPPGQP GSVVEYAARY DNWIGGEWVR PVRGRYFENP SPVNGRVFTE VARSGAEDVE LALDAAHGAA PAWGRTSAAE RALVLNRIAD RVEENLERLA VAESWENGKP VRECLAADLP LAVDHFRYFA GAIRAQEGHT SQIDGDTVAY HFQEPLGVVG QIIPWNFPLL MATWKLAPAL AAGNAVVLKP AEQTPASILL LMELVADLLP PGVVNVVNGF GAEAGKPLAS SPRVSKVAFT GETTTGRLIM QYASENLIPV TLELGGKSPN IFFADVAAAD DAFYDKALEG FTLFALNQGE VCTCPSRALV QDAVYDRFMG DALARVGRIR QGNPLDTDTM VGAQASNDQL EKILSYIDIG RREGAAVLAG GERVDPGGDL SGGYYVAPTV FEGHNGMRIF QEEIFGPVVS VARFDDYDDA LKTANDTLYG LGAGVWSRDG NTAYRAGRDI QAGRVWVNNY HSYPAHAAFG GYKQSGIGRE NHKMMLDHYQ QTKNLLVSYS DKAMGLF
|
| |