Gene Information |
Locus tag | Ndas_1589 |
Symbol | |
ID | 9245439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1944721 |
End bp | 1945905 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Alcohol dehydrogenase GroES domain protein |
Protein accession | YP_003679524 |
Protein GI | 297560550 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
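The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_1589.
length = gene_length(1944721, 1945905)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed gene length (1185 bp) and protein length (394 aa).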
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0618655 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCAG TGGTGTGGAA CGGGACCAGG AACGTCGACA CCGTGACCGT TCCCGATCCG CGGATCGAGG AGCCCGGTGA CGCCCTCGTC CGGATCACCA GTTCGGGCCT GTGCGGATCC GACCTGCACC TGTACGAGGT GCTCGGGCCG TTCATGACCC CGGGGGACAT CCTCGGACAC GAACCCATGG GCGTGGTCGA GGAGGTCGGT TCCGGCGTCA CCTCCCTCAC CCCCGGCCAG CGGGTCGTCA TCCCCTTCCA GATCTCCTGC GGGCACTGCC TCATGTGCGA CACCGGCCTC CAGAGCCAGT GCGAGAACAC CCAGGTCGAG GAGCAGGGCA TGGGCGCCGC GCTCTTCGGC TACAGCAAGC TCTACGGCTC GGTGCCCGGA GCGCAGGCCG AGTACCTGCG GGTTCCGCGC GCCGAGACCA CCGCGGTCCC GGTGCCCGAC CAGGGCCCCG ACGACCGCTA CCTGTTCCTG TCCGACGTGC TGCCGACCGC CTGGCAGGCC GTCCGCTACG CCGACGTCCC CGAGGGCGGG TCGGTGGCCG TCCTGGGGCT GGGGCCGATC GGCGACATGT GCTGCCGGGT CGCCCGCCAC CTGGGCGCGG GCCGGGTGTT CGGCGTGGAC CCGGTGCCGG AGCGGCGCGC CCGCGCCGCC GCCCGGGACG TGGAGGTGTT CGACTCCTCC AAGGGCACCG ACGACGTGGT CCAGGAGATC CGCGACCGTA CGGACGGGCG CGGCCCGGAC GCGGTCATCG ACGCGGTCGG CATGGAGGCG GCCGGGCACG GCTCGGCCAA GTTCGCGCAG CGCGTGGCCA ACCTCATGCC CCGGGGCGTG GCGGCCAAGA TGATGGAGAC GGCCGGGGTG GACCGGCTGA CCGCCCTGCA CACCGCCATC GACCTGGTGC GGCGCGGCGG GACCGTCTCC CTGATCGGGG TGTACGGCGG CATGGCCGAC CCGATGCCGA TGCTCACGCT CTTCGACAAG CAGATCCAGC TGCGGATGGG GCAGGCCAAC GTGCGCCGGT GGGTGCCGGA GATCCTGCCG CTGCTGGAGG GGTCCGACCC GCTGGGGGTG GACGACTTCG CCACCCACCA CGTGGGCCTG GACGCGGCCT CGCTGGCCTA CGAGAAGTTC CAGAAGAAGC AGGACGGCGT GTTCAAGGTC GTCTTCCGGC CCTGA
|
Protein sequence | MKAVVWNGTR NVDTVTVPDP RIEEPGDALV RITSSGLCGS DLHLYEVLGP FMTPGDILGH EPMGVVEEVG SGVTSLTPGQ RVVIPFQISC GHCLMCDTGL QSQCENTQVE EQGMGAALFG YSKLYGSVPG AQAEYLRVPR AETTAVPVPD QGPDDRYLFL SDVLPTAWQA VRYADVPEGG SVAVLGLGPI GDMCCRVARH LGAGRVFGVD PVPERRARAA ARDVEVFDSS KGTDDVVQEI RDRTDGRGPD AVIDAVGMEA AGHGSAKFAQ RVANLMPRGV AAKMMETAGV DRLTALHTAI DLVRRGGTVS LIGVYGGMAD PMPMLTLFDK QIQLRMGQAN VRRWVPEILP LLEGSDPLGV DDFATHHVGL DAASLAYEKF QKKQDGVFKV VFRP
|
| |
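The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not a definitive tool: it strips the whitespace, computes the GC fraction, and translates a CDS. It assumes an uppercase, unambiguous A/C/G/T sequence. The codon table string uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        a, b, c = (BASES.index(base) for base in s[i:i + 3])
        aa = AMINO_ACIDS[16 * a + 4 * b + c]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAAGGCAG TGGTGTGGAA CGGGACCAGG"
print(translate(first_blocks))
print(gc_fraction(first_blocks))
```

Translating these first three blocks gives MKAVVWNGTR, the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence fields in the record.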