Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4806 |
Symbol | |
ID | 9248689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5696069 |
End bp | 5697379 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | catalytic domain of components of various dehydrogenase complexes |
Protein accession | YP_003682696 |
Protein GI | 297563722 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA TCCAGATGCC GCGCCTCTCC GACACCATGG AGGAAGGCGT CATCAGCACG TGGGTCAAGA ACGTGGGCGA CAAGGTCGCC TCCGGTGACG TCCTGGTCGA GATCGAGACC GACAAGGCCG TCATGGAGTA CGAGGCCTAC GAGGACGGCT ACCTGGTCAA GCAGTCCGTC TCCGAGGGCG AGACGGTGCC GATCGGCGCG GTCATCGGCG TGATCGCCGA CTCCCCGGAC GCGGTACCCG AGGACTCCGG CGACGGCGGC TCGGAGCCCG AGGCCGCGCC CGCCGAGGAG GAGCAGGGCG AGAAGGCGGA GGAGATCCAG GAGGCCGCCG AGGGCACCGA GGCCGAGAGC GCCGGGGAGT CCGCCGCCTC CTCCGGCGAT GGGGCCGCGC GCCCGCGCAC CTCCCCGCTG GCCCGGCGTC TGGCCAAGGA GTACGGCCTG GACATCAACA GGATCCAGGG GTCGGGCCCC AAGGGCCGGA TCGTGCGCGC CGACATCGAG GCCGCCCGGG AAGGCGGTGC CGCCGAGCAG GCCGCACCCG CCGCGCAGCC CAAGGAGGAG GCCAAGCCCG CCGCGGAGAA GGCGGCGACC GCTCCCGCCT TCGACGACGG CCGCGCCTCC GAGGAGCTCA AGGTCAGCAA CGTGCGCAAG GTGATCGCGC GCCGCCTGAC CGAGAGCAAG CAGACGGTGC CGCACTTCTA CCTGCGCCGC ACGATCGACG CCGAGGCGCT CAAGGCCTTC CGCGCGCAGA TCAACGAGCA GCTGTCCAGC ACGGGCGTGA AGGTCAGCTT CAACGACCTG ATCGTCAAGG CCAGCGCGAC GGCGCTGAAG CTGCACCCGG CGGTGAACAC CTCGTGGGTG GACGACAAGC TGCTCCAGCA CCACCGGGTC AACGTCGGCG TGGCCGTGGC CGTGGACGCC GGGCTCGTGG TGCCGGTGCT GCACGACACC GACAAGGCGA CGCTGTCGGA GATCTCCACG CGCACGCGCG AGCTGGCGGG CAAGGCCCGT GACGGCAAGC TCAAGCCGCA GGAGATGAGC GGCGGCACGT TCAGCGTGTC CAACCTGGGC ATGTTCGGCG TGGACAGCTT CTCCGCGGTG ATCAACCCGC CGGAGGCGGC CATCCTCGCG GTCGGCGCGA TGCGCCAGGA GCCGGTGGTC GTGGACGGCG AGGTCGTCGT GCGCAACCGG ATCTCCCTGG AGCTGTCGGT GGACCACCGC GCGGTGGACG GCGCCGTGGG CGCCGCGTTC CTCAAGGACC TCGCGGAGAT CCTGGAAGAG CCGATGCGGA TCATCCTGTA G
|
Protein sequence | MSEIQMPRLS DTMEEGVIST WVKNVGDKVA SGDVLVEIET DKAVMEYEAY EDGYLVKQSV SEGETVPIGA VIGVIADSPD AVPEDSGDGG SEPEAAPAEE EQGEKAEEIQ EAAEGTEAES AGESAASSGD GAARPRTSPL ARRLAKEYGL DINRIQGSGP KGRIVRADIE AAREGGAAEQ AAPAAQPKEE AKPAAEKAAT APAFDDGRAS EELKVSNVRK VIARRLTESK QTVPHFYLRR TIDAEALKAF RAQINEQLSS TGVKVSFNDL IVKASATALK LHPAVNTSWV DDKLLQHHRV NVGVAVAVDA GLVVPVLHDT DKATLSEIST RTRELAGKAR DGKLKPQEMS GGTFSVSNLG MFGVDSFSAV INPPEAAILA VGAMRQEPVV VDGEVVVRNR ISLELSVDHR AVDGAVGAAF LKDLAEILEE PMRIIL
|
| |