Gene Namu_2946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2946 
Symbol 
ID8448559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3226250 
End bp3227671 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content72% 
IMG OID645042031 
Product9-cis-epoxycarotenoid dioxygenase 
Protein accessionYP_003202273 
Protein GI258653117 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000652034 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000168571 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAGCA CCCGCTCGGC GCCCCCGCCG GTTGACCCAA CCACCAATCC GCACCTGCAG 
GGCGTCTTCG CGCCCACCAC CACGGAGATC GACGTCGACG GCCTGCGCAT CGACGGCGAG
CTGCCGGCCG GGATCGACGG TGACTACGTG CGTAACGGCC CCAACCCCCG GTTCACCCCC
ATCGGCGGCT ACCTGTACCC GATCGACGGC GACGGCATGC TGCACCGCGT CCGGTTCCGG
GACGGCCGGG CCGGCTACAC CAACCGGTTC GTCCGCACCC CGGCGGTGGT GGCCGAGGAG
GCCGCGGGCC GGGCGCTGTG GCCGGGGTTG ATGCGGATGG ACTACCAGCC CGGCGCGGAT
CAGGTCGGGC CCGACCTGGC CGGCACCGTC AAGGACCTGC CGGGCATCAA CGTGGTCCGC
CACGCCGGGC GGCTGCTGGC CCTGGCCGAG TCGGCCAATC CGTTCCTCAT GTCACCCGAG
TTGGCCACCA TCGGTCGGGA GACCTTCTGC GGCACGATTC CGGTCGGCAT CACCGCGCAT
CCGAAGGTGG ATCCGGCCAC CAGCGACATG GTCGTGTTCT GCTACCAGCT CGAAGCACCA
TTTCTCACCT GGGCGGTGAT CGGCCCGGAC GGCTCCACTG TTCGCCCGCC GACACCGGTG
GCCGGACTGG ACCGGCCCGC GATGATCCAC GACATGGCGA TCACCGATCG CTACGTCGTC
GTGGTGGTGG CGCCGTTCTA CTTCGACCCG GCCGGCGCGG CGACCGGCGG ATCGCTGCTG
TCCTGGCGAC CCGACGACGG CACCCGCATC GCCCTGATCC CGCGGGACGG CGGGCCGGTG
CGGTGGGCGG CGACCGAGTC GTTCTGGTTG TGGCACACCG CGAACGCGCA TGATCGGGCC
GACGGTCAGG TGGTGCTGGA CTACGCGCGG TGGTCCGCGC CGGGCGGGCT GGTGCCGGGT
GTGCGACCGG CCGGTGGTCT GGCCCGGATG GTGATCGACC CCGGCACCGG GCGGGTGCGG
CACGAGACCC TGGTCGACCG GTCCATGGAG TTCCCCCGGA TCGACGACCG GACGATCGCC
GCCGACCATC GGCAGATTGC CACCTCGCTC AAGGGCGGAG CCCGTTCGCT GCCGTCCGGG
GACGCGGACA CCCTCGGCTG GTTCGACGCG GGCACCGGGT CCTTCGCCAC CTGGGACGCC
GGCGACCTGT CCGTCGGTGA GCAATGCTTC GTGCCCACGC CGGGTGATGC CGACGCCTCC
CACGGCTGGT GGTTGTGCCT CGCCACCGAT CGCACCGATC TGACCAGCCG CCTGCTGGTG
ATTCCGGCGG CCGATCCCCG GTCCGGTCCC GTGGCCACCG TGCACCTGCC GCAGCGAGTG
CCGGCCGGCC TGCACGGCGC GTGGCTGCCG ACGCAGGAGT GA
 
Protein sequence
MISTRSAPPP VDPTTNPHLQ GVFAPTTTEI DVDGLRIDGE LPAGIDGDYV RNGPNPRFTP 
IGGYLYPIDG DGMLHRVRFR DGRAGYTNRF VRTPAVVAEE AAGRALWPGL MRMDYQPGAD
QVGPDLAGTV KDLPGINVVR HAGRLLALAE SANPFLMSPE LATIGRETFC GTIPVGITAH
PKVDPATSDM VVFCYQLEAP FLTWAVIGPD GSTVRPPTPV AGLDRPAMIH DMAITDRYVV
VVVAPFYFDP AGAATGGSLL SWRPDDGTRI ALIPRDGGPV RWAATESFWL WHTANAHDRA
DGQVVLDYAR WSAPGGLVPG VRPAGGLARM VIDPGTGRVR HETLVDRSME FPRIDDRTIA
ADHRQIATSL KGGARSLPSG DADTLGWFDA GTGSFATWDA GDLSVGEQCF VPTPGDADAS
HGWWLCLATD RTDLTSRLLV IPAADPRSGP VATVHLPQRV PAGLHGAWLP TQE