Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5034 |
Symbol | |
ID | 9248923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 175539 |
End bp | 177164 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | Cyclohexanone monooxygenase |
Protein accession | YP_003682921 |
Protein GI | 297563948 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.623836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCAACG CGAAGAACAC CGATCTCGAC GCGATCGTCG TCGGTGCCGG GTTCGCCGGC ATCTACGCGC TGCACAAGCT CCGCAACGAA CTGGGCCTGT CCGTCCGGGC CTTCGACAAG GCGGGCGGGG TCGGGGGCAC CTGGTACTGG AACCGCTACC CCGGCGCCAT GTCCGACAGC GAGGGCTTCA TCTACCAGTA CTCCTTCGAC CGCGACCTGC TGCGGGAGTG GACCTGGAAG AAGCGCTACC TGTCCCAGGC GGAGATCCTG GGCTACCTGG AGGCGGTCGT GGAGCGGCAC GACCTCGCCA GGGACATCCA GCTCAACACC GGCGTCGAGA CGCTGGTCTA CGACGAGGCC GCCGCCCTGT GGACCGCGAC CACCAGCGAC GGCCAGACCC TCACCGCCCG CTACGTGGTG ACCGCCCTCG GACCGCTGTC CACCTCCCAC TTCCCCGACT TCAAGGGCCG CGACAGCTTC CAGGGCCGCC TGGTCCACAC CGGCTCCTGG CCCGACGACC TCGACATCGA GGGCAAGAGG GTCGGCGTCA TCGGCACCGG CTCCACCGGA ACCCAGTTCA TCTGCGCGGC CTCGAAGGTG GCCGGGCAGC TCACCGTGTT CCAGCGGACC CCCCAGTACA ACGTGCCCTC GGGCAACGCC GAGGTGGACG AGGCCTACTT CACCGACCTG CGCGGCCGCT ACGACCAGGT CTGGGAGCAG GCCAAGAAGT CCCGCGTGGC GTGCGGCTTC GAGGAGAGCG AGATCGCCGC GATGAGCGTC TCCGAGGAGG AGCGCCGACG CGTCTTCCAG GAGAACTGGG ACCGGGGCAA CGGCTTCCGC TTCATGTTCG GCACCTTCTC CGACATCATC TTCGACCCCA GGGCCAACGA GGCGGCGGCC GACTTCATCC GGTCCAAGAT CCGGGAGATC GTCAAGGACC CAGAGACCGC CCGCAAGCTC CAGCCGACCG ACTACTACGC CAAGCGCCCG GTCTGCAACG AGGACTACTA CGAGTCCTAC AACCGGGACA ACGTCAGCCT GGTGAGCCTC AAGGAGACCC CGATCCGGGA GTTCACCCCC ACGGGGATCG TGACCGAGGA CGGGGTGGAG CACGAGCTGG ACGTCGTCGT CTTCGCGACC GGTTTCGAGG CCGTCGAGGG CAGCTACCGG CAGATGGAGA TCCGCGGCCG GGGCGGCGTG ACCATCGAGG AGCACTGGGG GGACACGCCC GCCAGCTACC TCGGGGTCAA CGTCTCGGGC TTCCCCAACA TGTTCATGGT CTACGGCCCC AACAGCGTCT TCAGCAACCT GCCCACGGCC ATCGAGACCC AGGTCGAGTG GATCACCGAC CTGGTCCGGA TGATGGAGGA GCGCGACCTG ACCTCCATCG AGCCGACCCC GGAGGCGGAG GAGGGCTGGA CCGAGCTGTG CACGCAGATC GCCGACCACT CCCTGTTCCC CAAGGTCAAC TCCTGGATCT TCGGGGCCAA CATCCCGGGC AAGAAGAAGC GGGTCCTGTT CTACTTCGCG GGGCTCGGCA ACTACCGCCA GAAGCTCGGT GACGTGGCCG CGGCCGACTA CGAGGGCTTC ATGCTCAAGG GCAACCCCTC GGTGGTGACC GCCTGA
|
Protein sequence | MTNAKNTDLD AIVVGAGFAG IYALHKLRNE LGLSVRAFDK AGGVGGTWYW NRYPGAMSDS EGFIYQYSFD RDLLREWTWK KRYLSQAEIL GYLEAVVERH DLARDIQLNT GVETLVYDEA AALWTATTSD GQTLTARYVV TALGPLSTSH FPDFKGRDSF QGRLVHTGSW PDDLDIEGKR VGVIGTGSTG TQFICAASKV AGQLTVFQRT PQYNVPSGNA EVDEAYFTDL RGRYDQVWEQ AKKSRVACGF EESEIAAMSV SEEERRRVFQ ENWDRGNGFR FMFGTFSDII FDPRANEAAA DFIRSKIREI VKDPETARKL QPTDYYAKRP VCNEDYYESY NRDNVSLVSL KETPIREFTP TGIVTEDGVE HELDVVVFAT GFEAVEGSYR QMEIRGRGGV TIEEHWGDTP ASYLGVNVSG FPNMFMVYGP NSVFSNLPTA IETQVEWITD LVRMMEERDL TSIEPTPEAE EGWTELCTQI ADHSLFPKVN SWIFGANIPG KKKRVLFYFA GLGNYRQKLG DVAAADYEGF MLKGNPSVVT A
|
| |