Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3217 |
Symbol | |
ID | 9247074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3847601 |
End bp | 3848581 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Alcohol dehydrogenase zinc-binding domain protein |
Protein accession | YP_003681131 |
Protein GI | 297562157 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAGAG TCGTCGTCTT CGACGAGTTC GGCGGTCCGG AAACCATGCA CATCGTCGAA GAACCGGTCT CCGAGCCCGG TTCCGGCGAG GTAAGGGTCA GGATCGAAGC CTTCGCCGTC AACCCGCTCG ACCAGATGAT GCGTTCCGGT ACCTCGCCCG CACCTGTCCC CCTGCCACAC GCTCGCCTCG GTATCGAAGG AACCGGCGTC GTCGACGCAC TCGGCCCGCG GGTCCCGGGG CTGAAGATCG GCGACCCGGT CATCCTCACG GCCATACCGG ACGCCGGCGT CCGGGGCAGT TACGCCGAAT ACACCACGGT CCCCGCCGAC AGGGTCATCG TCCGGCCTGC CTCACTCGGG GTCGCGGAGG CGGCAGCGAT ATGGGTGGCC TTCTCCACCG CCTTCGGCGC GCTCGTCGAG AAGGCGCGGA TGCTGCCCGG CGACCACGTC CTCATCACAG CCGCGTCCGG CAGCGTCGGA CGGGCGGCGG TACAGATCGC CAACCAGATC GGCGCTGTTC CCATCGCCGT CACCCGGAGC GCCGCGAAGA AGGACGTCCT GCTCGCAGCG GGTGCGGCCG CAGCCATCGC CACCGATGAG GCGGACATCG CCGAAGCCGT CCACCACCAC ACCGGCGGAA CCGGCGCCGA CATCATCCTC GATCTCGTCA TGGGCCCCGG CCAGCAGGAC CTCCTGGCCG CGGCCCGCCC CGGTGGAACC CTGGTCGCCG CGGGCTTCCT GGACCCCCAG CCCGCGCCCT TCCCGACAGG CGCGCCTCTG ACGGTTTTCA GCTACCAGAG CTTCGAGCAC ACTCTCGACG ATGTCGTGGT CAAGCGCATG TCGGCTTTCC TGAACGCCGG TGTACGCCTC GGGGCACTAC AGCCAGCCAT CGACAGGGTG TTCACCCTCG ACGACATCGT CGAGGCACAT CGCCACCTCG AGAAGGGGAT TCACACCGGC AAGATCGTCG TCACGACATA G
|
Protein sequence | MPRVVVFDEF GGPETMHIVE EPVSEPGSGE VRVRIEAFAV NPLDQMMRSG TSPAPVPLPH ARLGIEGTGV VDALGPRVPG LKIGDPVILT AIPDAGVRGS YAEYTTVPAD RVIVRPASLG VAEAAAIWVA FSTAFGALVE KARMLPGDHV LITAASGSVG RAAVQIANQI GAVPIAVTRS AAKKDVLLAA GAAAAIATDE ADIAEAVHHH TGGTGADIIL DLVMGPGQQD LLAAARPGGT LVAAGFLDPQ PAPFPTGAPL TVFSYQSFEH TLDDVVVKRM SAFLNAGVRL GALQPAIDRV FTLDDIVEAH RHLEKGIHTG KIVVTT
|
| |