Gene Information |
Locus tag | Ndas_1689 |
Symbol | |
ID | 9245539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2061904 |
End bp | 2062953 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Alcohol dehydrogenase GroES domain protein |
Protein accession | YP_003679624 |
Protein GI | 297560650 |
COG category | |
COG ID | |
TIGRFAM ID | |
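
The coordinate and length fields above agree with one another, and the arithmetic is easy to check. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2061904
end_bp = 2062953

# Assumption: coordinates are 1-based and inclusive, so the span includes both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the span works out to 1050 bp, or 350 codons, which leaves 349 residues once the stop codon is excluded, matching the Gene Length and Protein Length rows.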
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.458756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.196927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTCG TTCACGCCTA CACGGCCTCG TCGGCGACCG CGCCCCTGGC GCCGGGCACG ATCGAGCGCA GGGAGGTCGG CCCGAAGGAC GTCCTGATCG ACATCGCCTG GGCGGGGATC TGCCACTCCG ACATCCACAC CGTCCGCGGG GACTGGGGCG AGGTCCCCTA CCCGTTGACC GTGGGGCACG AGATCGCCGG TGTGGTCGCC GAGGTCGGCT CCGAGGTCAC CCGCCACAAG GTCGGCGACC GGGTGGGAGT GGGCTGCATG GTCGACTCCT GCCGGGAGTG CGCGAACTGC CTGGCCGGTG TGGAGCAGTA CTGCCTGCGC GGGTTCACCG ACACCTACAA CGGCACCGAC CGGGACGGGA CCGTGACCCA GGGCGGGTAC TCCCAGCGCA TCGTCGTGGA CGAGCACTTC GCGCTGCGCG TCCCCGAGGC CATCCCCTTC GAGAAGGCCG CACCGCTGCT GTGCGCGGGG ATCACCACCT ACTCGCCGCT GCGCAACTGG AACGCCGGGC CGGGCAGGAA GGTCGCCGTG GTCGGGCTGG GCGGCCTCGG GCACATGGCG GTCAAGCTCG CCCACGCCAT GGGGGCGGAG GTGACGGTGC TCTCGCAGAG CATGAAGAAG CGCGAGGACG GGCTGCGGTT CGGCGCCGAC CACTACCACG CCACCAGCGA CCCGGACACC TTCGAGCGGC TGGCCAACAC CTTCGACCTG ATCGTCAACA CGGTCAGCGC GCCCATCGAC CTGGACGCCT ACCTGAACCT GCTGGCCCTG GACGGCGCGA TCGTGAGCGT GGGCGCGCCG CCGGAGCCGG TGGCGGTGAC GCTGTTCACG CTGTTCGAGA ACCGCCGCTC GTTCGCCGGT TCCAAGATCG GCGGTATCGC CCAGACCCAG GAGATGCTGG ACTTCTGCGC CGAGCACGGC ATCGCCCCCG AGGTCGAGAT CGTCCGCGCC GACCAGATCA ACGAGGCATG GGAGCGGGTG CTCGCCTCGG ACGTGCGGTA CCGGTTCGTC ATCGACGCCT CGACCCTGGG CGGTGCCTGA
|
Protein sequence | MSLVHAYTAS SATAPLAPGT IERREVGPKD VLIDIAWAGI CHSDIHTVRG DWGEVPYPLT VGHEIAGVVA EVGSEVTRHK VGDRVGVGCM VDSCRECANC LAGVEQYCLR GFTDTYNGTD RDGTVTQGGY SQRIVVDEHF ALRVPEAIPF EKAAPLLCAG ITTYSPLRNW NAGPGRKVAV VGLGGLGHMA VKLAHAMGAE VTVLSQSMKK REDGLRFGAD HYHATSDPDT FERLANTFDL IVNTVSAPID LDAYLNLLAL DGAIVSVGAP PEPVAVTLFT LFENRRSFAG SKIGGIAQTQ EMLDFCAEHG IAPEVEIVRA DQINEAWERV LASDVRYRFV IDASTLGGA
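
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus two common checks, GC fraction and reverse complement. The helper names `clean_sequence`, `gc_fraction`, and `reverse_complement` are hypothetical, and the example uses only the first three blocks of the gene sequence. Because the record lists the strand as "-" and the displayed sequence begins with ATG, it presumably shows the coding strand, in which case the reverse complement would correspond to the replicon's forward strand over the same coordinates; that reading is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Complement map for unambiguous DNA bases only (no IUPAC ambiguity codes).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# First three blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGTCCCTCG TTCACGCCTA CACGGCCTCG")
print(len(fragment), round(gc_fraction(fragment), 3))
print(reverse_complement(fragment))
```

Running `gc_fraction` on the full cleaned gene sequence is one way to compare against the GC content row, bearing in mind that the record reports a rounded percentage.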