Gene Namu_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2052 
Symbol 
ID8447661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2264605 
End bp2265894 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content77% 
IMG OID645041175 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003201421 
Protein GI258652265 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.171524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00281905 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGGCCG AGGGTTCGCT GGACAGCCCG GCCGCGGCCG ACCCCGGCCC CGGCCCGGCC 
ATCGACCCCG CTCGCCTGCC CGGCCCGACG CCCTCGTCGG TGCTGCGGGC GGTCTACTCC
GCGCTCGGCG GCCCGCCGGC CGCGGTGCAC GGGGTCAGCG AGGAGACGCT GCACCGGGTC
GAGCGGGCCG GTGGGCAACT CGCCCGGCGG GCCCTGTCGC TGATGGACCA GCGACTGCCC
TGGTTCCGCT CGTTGCCGGC CGAGCAGCGG TCCTGGGTCA CCCTGGTCGC CCAGGCCGGC
ATCTCCGGCT ACGTCGTCTG GGCACAGGCC AGCAGCGACG AGTACCGGAT CACCGGGGAG
GTGTTCGGCA CCGCCCCCCG TGAGCTGGTC CGCGCGGTGT CCCTGCGCCG CACGGTGGAG
CTGGTCCGGG TGGCCATCAC CGTCGCCGAG CAGGACCTGC CCGCGCTGGC CGCGGATGAC
GCCGAGCGGA TCGCGCTGCG CGACTCGCTG CTGCGCTACA GCCGGGAGAT CGCCTTCGCC
GCGGCCGAGG TGTACGCGGC GGCGGCCGAG ACCCGTGGCG CGTGGGACGC GCGGGTAGAG
GCGGCGGTCG TCGACGGGGT GGTCCGCGGC GAGGACATCG GCACCCTGTC CTCCCGGGCC
GCCGCGCTGA ACTGGGATCC CACCGCCGAC ACGGTCGTCG TCGCCGGGGC CGCCCCGACC
GGCGACCGGG CCGACGCGGT GGCCGCGGTC ACCGACTGGG CCGCCGTCAG CGGACGGCCG
GCGATGGCCG GTGTGCACGG CGATCGCCTG GTGCTGGTGC TGGCTGGGGC CGAGCCGCCG
GCGGACACCG TCGCCCAGCT GTTCGGCGAG GGTCCGGTGG TCCGTGGCCG CCCCGGCAGC
GGCCTGCGCG CCGCGATCAA CTCCGCCGCC GACGCCCTGG CCGGCCTGGA CGTGGTCGCC
GCCTGGCCGG ACGCCCCGCG GATGATCGAC GCCGACGACC TGCTGGTCGA GCGGGTGCTG
GCCGGCGACG CCCAGGCCGC GGCCCGGTTG CGCGCCGCCG TCTACGCCCC CCTGGCCGCC
TCCCCGCACC TGCTCTCGAC GGTGGACGCC TACCTGGCCA CCGGCGGCGC CCTGGAGCCC
ACCGCCCGCA ACCTGTTCGT CCACCCCAAC ACCGTCCGCT ACCGCCTGCA CCGCGTCGCC
GACCTGACCG GACGCGACCC CTGGGAGCCC CGGGACCTGC TGGTCCTGCA GACCGCGGTC
ATCCTCGGCC GGCTGGCCGC CGCCCCCTGA
 
Protein sequence
MTAEGSLDSP AAADPGPGPA IDPARLPGPT PSSVLRAVYS ALGGPPAAVH GVSEETLHRV 
ERAGGQLARR ALSLMDQRLP WFRSLPAEQR SWVTLVAQAG ISGYVVWAQA SSDEYRITGE
VFGTAPRELV RAVSLRRTVE LVRVAITVAE QDLPALAADD AERIALRDSL LRYSREIAFA
AAEVYAAAAE TRGAWDARVE AAVVDGVVRG EDIGTLSSRA AALNWDPTAD TVVVAGAAPT
GDRADAVAAV TDWAAVSGRP AMAGVHGDRL VLVLAGAEPP ADTVAQLFGE GPVVRGRPGS
GLRAAINSAA DALAGLDVVA AWPDAPRMID ADDLLVERVL AGDAQAAARL RAAVYAPLAA
SPHLLSTVDA YLATGGALEP TARNLFVHPN TVRYRLHRVA DLTGRDPWEP RDLLVLQTAV
ILGRLAAAP