Gene Information |
Locus tag | Ndas_3535 |
Symbol | |
ID | 9247404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4245466 |
End bp | 4246476 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Alcohol dehydrogenase zinc-binding domain protein |
Protein accession | YP_003681442 |
Protein GI | 297562468 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.582496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.719008 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGTCA CCAGCAGGGA GGTCCACCTG GTGGCCCGTC CCGTCGGCGA GCCCGAGCCC ACCGACTTCT CCCTCGTGGA GACCACCGTC GCCGACCCCG GTCCCGGGCA GGTCCTGGTG CGCAACGACT GGATGTCCGT GGACCCGTAC ATGCGCGGCC GCATGAACGA CGCCAAGTCC TACGTCCCCC CGTTCCGGCT CGGCGAGCCG ATGGACGGCG GCGCCGTGGG CGTGGTCACC GCCTCCGGCA GCGACGACGT CCCCGTGGGC ACCACCGTCC TGCACTCGGC CGGATGGCGC GAGTACGCGC TGCTGCCCGC GGATTCCGTG CGCGCGGTGG ACGCCTCCCT GGCACCCGCC GAGGCCTACC TCGGCGTGCT GGGCATGATC GGCCTCACCG CCTACGCGGG CCTGACCGAG ATCGCCCCGG TGCGCGAGGG CGACGTGGTG TTCGTCTCCG GCGCCGCGGG CGCGGTCGGC TCCGCCGCCG GCCAGATCGC CCGCCAGCTG GGCGCGTCCC GGGTGGTCGG GTCCGCGGGC GGCCCGGAGA AGAAGCGCCG CCTCCTGGAG GACTTCGGCT TCGACGCCGC CATCGACTAC CGCGAGGGCC GCCTGGAGGA GCAGCTCGCC GAGGCCGCGC CCGAGGGGAT CGACGTCTAC TTCGACAACG TCGGCGGCGA CCACCTGAGG GCCGCCATCG CCGCGATGCG CAACCACGGC CGGATCGCCC TGTGCGGCGC GATCTCCCAG TACAACGCCA CCAAGCCCGA GCCCGGCCCC GACAACCTCT TCCTGGCCGT CGGCAAGCGC CTCACCCTGC GCGGGTTCAT CGCCGGAGAC CACGGCCACC TGATGAAGGA GTACGCCGAG CGCGCCTCCG GGTGGATCGT CGACGGCAGG CTGCGCAGCG AGCAGACCGT CGTCGACGGC ATCGACAACG CCGTGCGGGC CTTCCTCGGC ATGATGCGGG GCGCCAACAC GGGCAAGATG CTGGTCCACC TCACACCCTG A
Protein sequence | MSVTSREVHL VARPVGEPEP TDFSLVETTV ADPGPGQVLV RNDWMSVDPY MRGRMNDAKS YVPPFRLGEP MDGGAVGVVT ASGSDDVPVG TTVLHSAGWR EYALLPADSV RAVDASLAPA EAYLGVLGMI GLTAYAGLTE IAPVREGDVV FVSGAAGAVG SAAGQIARQL GASRVVGSAG GPEKKRRLLE DFGFDAAIDY REGRLEEQLA EAAPEGIDVY FDNVGGDHLR AAIAAMRNHG RIALCGAISQ YNATKPEPGP DNLFLAVGKR LTLRGFIAGD HGHLMKEYAE RASGWIVDGR LRSEQTVVDG IDNAVRAFLG MMRGANTGKM LVHLTP
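The coordinate, length, and GC fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the helper names `check_cds_record` and `gc_fraction` are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon.

```python
def check_cds_record(start_bp: int, end_bp: int,
                     gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if coordinates, gene length, and protein length agree.

    Hypothetical helper. Assumes 1-based inclusive coordinates and a CDS
    whose last codon is a stop codon that is not counted in the protein.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a CDS should be a whole number of codons
        return False
    return span // 3 - 1 == protein_length_aa  # drop the stop codon


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace.

    Hypothetical helper. The sequence field above is printed in
    space-separated blocks of ten, so whitespace is stripped first.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record above: 4246476 - 4245466 + 1 = 1011 bp,
# and 1011 / 3 - 1 = 336 aa.
print(check_cds_record(4245466, 4246476, 1011, 336))
```

Passing the full "Gene sequence" field to `gc_fraction` gives a value that can be compared with the reported GC content.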