Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2946 |
Symbol | |
ID | 8448559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3226250 |
End bp | 3227671 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645042031 |
Product | 9-cis-epoxycarotenoid dioxygenase |
Protein accession | YP_003202273 |
Protein GI | 258653117 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000652034 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000168571 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCAGCA CCCGCTCGGC GCCCCCGCCG GTTGACCCAA CCACCAATCC GCACCTGCAG GGCGTCTTCG CGCCCACCAC CACGGAGATC GACGTCGACG GCCTGCGCAT CGACGGCGAG CTGCCGGCCG GGATCGACGG TGACTACGTG CGTAACGGCC CCAACCCCCG GTTCACCCCC ATCGGCGGCT ACCTGTACCC GATCGACGGC GACGGCATGC TGCACCGCGT CCGGTTCCGG GACGGCCGGG CCGGCTACAC CAACCGGTTC GTCCGCACCC CGGCGGTGGT GGCCGAGGAG GCCGCGGGCC GGGCGCTGTG GCCGGGGTTG ATGCGGATGG ACTACCAGCC CGGCGCGGAT CAGGTCGGGC CCGACCTGGC CGGCACCGTC AAGGACCTGC CGGGCATCAA CGTGGTCCGC CACGCCGGGC GGCTGCTGGC CCTGGCCGAG TCGGCCAATC CGTTCCTCAT GTCACCCGAG TTGGCCACCA TCGGTCGGGA GACCTTCTGC GGCACGATTC CGGTCGGCAT CACCGCGCAT CCGAAGGTGG ATCCGGCCAC CAGCGACATG GTCGTGTTCT GCTACCAGCT CGAAGCACCA TTTCTCACCT GGGCGGTGAT CGGCCCGGAC GGCTCCACTG TTCGCCCGCC GACACCGGTG GCCGGACTGG ACCGGCCCGC GATGATCCAC GACATGGCGA TCACCGATCG CTACGTCGTC GTGGTGGTGG CGCCGTTCTA CTTCGACCCG GCCGGCGCGG CGACCGGCGG ATCGCTGCTG TCCTGGCGAC CCGACGACGG CACCCGCATC GCCCTGATCC CGCGGGACGG CGGGCCGGTG CGGTGGGCGG CGACCGAGTC GTTCTGGTTG TGGCACACCG CGAACGCGCA TGATCGGGCC GACGGTCAGG TGGTGCTGGA CTACGCGCGG TGGTCCGCGC CGGGCGGGCT GGTGCCGGGT GTGCGACCGG CCGGTGGTCT GGCCCGGATG GTGATCGACC CCGGCACCGG GCGGGTGCGG CACGAGACCC TGGTCGACCG GTCCATGGAG TTCCCCCGGA TCGACGACCG GACGATCGCC GCCGACCATC GGCAGATTGC CACCTCGCTC AAGGGCGGAG CCCGTTCGCT GCCGTCCGGG GACGCGGACA CCCTCGGCTG GTTCGACGCG GGCACCGGGT CCTTCGCCAC CTGGGACGCC GGCGACCTGT CCGTCGGTGA GCAATGCTTC GTGCCCACGC CGGGTGATGC CGACGCCTCC CACGGCTGGT GGTTGTGCCT CGCCACCGAT CGCACCGATC TGACCAGCCG CCTGCTGGTG ATTCCGGCGG CCGATCCCCG GTCCGGTCCC GTGGCCACCG TGCACCTGCC GCAGCGAGTG CCGGCCGGCC TGCACGGCGC GTGGCTGCCG ACGCAGGAGT GA
|
Protein sequence | MISTRSAPPP VDPTTNPHLQ GVFAPTTTEI DVDGLRIDGE LPAGIDGDYV RNGPNPRFTP IGGYLYPIDG DGMLHRVRFR DGRAGYTNRF VRTPAVVAEE AAGRALWPGL MRMDYQPGAD QVGPDLAGTV KDLPGINVVR HAGRLLALAE SANPFLMSPE LATIGRETFC GTIPVGITAH PKVDPATSDM VVFCYQLEAP FLTWAVIGPD GSTVRPPTPV AGLDRPAMIH DMAITDRYVV VVVAPFYFDP AGAATGGSLL SWRPDDGTRI ALIPRDGGPV RWAATESFWL WHTANAHDRA DGQVVLDYAR WSAPGGLVPG VRPAGGLARM VIDPGTGRVR HETLVDRSME FPRIDDRTIA ADHRQIATSL KGGARSLPSG DADTLGWFDA GTGSFATWDA GDLSVGEQCF VPTPGDADAS HGWWLCLATD RTDLTSRLLV IPAADPRSGP VATVHLPQRV PAGLHGAWLP TQE
|
| |