Gene Namu_3841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3841 
Symbol 
ID8449460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4212828 
End bp4213859 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content72% 
IMG OID645042891 
Productintradiol ring-cleavage dioxygenase 
Protein accessionYP_003203127 
Protein GI258653971 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3485] Protocatechuate 3,4-dioxygenase beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.353412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.439298 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAG CACCGCAGAA CACCCCGACT TACGAGGGTC GAGCCCTCCC CCGGCCGGAC 
GAGGAGGTCA TGGACCAGGG CCTGTCCTTC GATATGCAGA CGATGCTCGG CCGCCGGCAG
ATGCTCCGGG CGGTCGGCCT GGGGGCGGTC GCCGTCGGGC TGGCCGCCTG CGGCGCCGGC
ACGACCACCG CATCAACGTC GGCGTCGACC AACGGGTCGA CGACCGGGTC GACCGCCGGG
TCGACCGCCG CCAGCCCGTC GAGCAGTGCG GCCGGCAGCG CCAGCAGTTC GAGCGCGGCC
GCCGCGACGC TGACCGAGAT CCCGGACGAG ACCGCCGGCC CGTACCCCGG GGACGGCTCC
AACGGGCCGG ACGTGCTGTC CCAGAGCGGC ATCGTGCGCA GCGACATCCG CTCCAGCTTT
GCCGGCTCGA CCGGCGTCGC CGAGGGCGTG CCGATGACGC TCACCCTGAG CATCAAGGAC
ATGGTCAACG GCAACGCCCC GTTCGCCGGG GTCGCGGTCT ACGTCTGGCA TTGCGATCGG
GAGGGCCGGT ACTCCCTGTA CAGCGACGGG GTGACCGAGG AGAACTACCT GCGCGGCGTC
CAGGTCGCCG GAGCGGACGG CACGGTCAGC TTCACCAGCA TCTACCCGGC CACCTACTCC
GGCCGCTGGC CGCACATCCA CTTCGAGGTC TACCCGGACG TCGCCAGCAT CACCGACTCG
ACCAACGCCA TCGCCACCTC GCAGGTGGCC ATGCCCAGCG ACGTCAGCGA CCTGGTCTAC
CAGCAACCCG GGTACGAGCA GTCGATCATC AACAAGGCCC AGATCAGCCT GACCAGCGAC
AACGTCTTCG GCGACGACGG CGGCATCCAC GAGCTGGGCA CCGCGACCGG GGACGTCGCC
AGCGGCTACC ACGTCACTCT GGACGTGCCG GTCGACACCT CGACCGCGCC GACGGCCGGC
AGCGCCCCCG CGGGCGGCCC CGGCGGTGGC CAGGGCGGTG GCCAAGGTGG TGGCCAGCCG
CCGAGCCGCT GA
 
Protein sequence
MRRAPQNTPT YEGRALPRPD EEVMDQGLSF DMQTMLGRRQ MLRAVGLGAV AVGLAACGAG 
TTTASTSAST NGSTTGSTAG STAASPSSSA AGSASSSSAA AATLTEIPDE TAGPYPGDGS
NGPDVLSQSG IVRSDIRSSF AGSTGVAEGV PMTLTLSIKD MVNGNAPFAG VAVYVWHCDR
EGRYSLYSDG VTEENYLRGV QVAGADGTVS FTSIYPATYS GRWPHIHFEV YPDVASITDS
TNAIATSQVA MPSDVSDLVY QQPGYEQSII NKAQISLTSD NVFGDDGGIH ELGTATGDVA
SGYHVTLDVP VDTSTAPTAG SAPAGGPGGG QGGGQGGGQP PSR