Gene Namu_5266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5266 
Symbol 
ID8450897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5880945 
End bp5882246 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content74% 
IMG OID645044297 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003204521 
Protein GI258655365 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones82 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG CCACCCGGGA GGCCGTCGAG GCGGTGTGGC GCATCGAGTC GGCCCGGCTG 
ATCGCCGGGC TGGCCCGGGT CACCGGCGAC CTGGGCCTGG CCGAGGAACT CGCCCAGGAC
GCGCTGGTCA CCGCCCTGGA ACAGTGGCCG AGCGGCGGCA TCCCGGCCAA CCCGGCCGGG
TGGCTGATGA CCACGGCCAA ACGGTTGGCG ATCGACGCCC ACCGGCGCCG GATCGTGTAC
CAGCGCAAAC TGGAGACGAT CGGCCGCGCA CAGCCCGGCC ATGATCTGGC CGACGAGATC
GGCCGGGCGA TCGACGATGC GCTGGACGAC CACATCGGCG ACGACCTGCT GCGGCTGATC
TTCACCGCGG CCCACCCGGC CCTGTCCACC GAATCCCGGG TCGCCCTGAC CCTGCGCTGC
CTGTGCGGTC TGTCCACCCA GGAGATCGGT CGGGCCTACC TGGTGAGCAC CGCCACCGCG
GCCGCCCGGA TCGTCCGCGC CAAACGGACG CTCGCCGCGC AGTCCGTGCG CTTCGAGCTG
CCCGACCCGG CGGCGATGGC CGATCGGCTG GCCTCGGTCC TCGAGGTCAT CTACCTGATC
TTCAACGAGG GGTACTCGGC CTCGTCCGGG GACGACTGGA TGCGGCCGGC GCTGTGCGAG
GAGGCCTTGC GGCTGGGCCG GGTGCTCGCC GGGATGGTGC CGGACGAGCC GGAGGTGCAC
GGGCTCGTCG CCCTGATGGA GATCCAGGCG TCCAGAACCG CGGCCCGGAC CGGACGCGAA
GGGGAACCGG TCCTGCTGGC CGACCAGGAC CGGCGCGCGT GGGACCGGTT GCTGATCCGC
CGCGGCCTGG CCGCCCTGCA GCGGGCCGAG TCCCTGTCCA AGCCGATCGG GCCGTACACC
CTGCAGGCCG CGATCGCCGC CTGTCACGCC CGGGCGACCC GGGCGCAGGA CACCGACTGG
GCCCGAATCG CCTCGTTGTA CGTCGTCCTG GCTCATGTGT GGCCCTCACC GGTGGTCGAG
CTGAACCGGG CCGTCGCGGT GGGGATGGCG GACGGTCCGG CGGCCGGACT GGCCGTCCTC
GACGCCGCCG TGAACCGGGG CGAGCTGGGC GACTACCCGC TGGCGCACGC CGTCCGCGGC
GACCTGCTAG CCCGCTCGGG GAATCACGGC GAGGCCGCGC AGCACTTCCA CCGGGCCGCC
GAGCTGACCC GCAACGCCGG CGAACGAAAG GTGTTCACCC GGCGCGCCGC CGAGCTGGCG
GCGCCCGAGG TCAGCGCCGG GTGTCGTCCG CCCGGAAGTT GA
 
Protein sequence
MSTATREAVE AVWRIESARL IAGLARVTGD LGLAEELAQD ALVTALEQWP SGGIPANPAG 
WLMTTAKRLA IDAHRRRIVY QRKLETIGRA QPGHDLADEI GRAIDDALDD HIGDDLLRLI
FTAAHPALST ESRVALTLRC LCGLSTQEIG RAYLVSTATA AARIVRAKRT LAAQSVRFEL
PDPAAMADRL ASVLEVIYLI FNEGYSASSG DDWMRPALCE EALRLGRVLA GMVPDEPEVH
GLVALMEIQA SRTAARTGRE GEPVLLADQD RRAWDRLLIR RGLAALQRAE SLSKPIGPYT
LQAAIAACHA RATRAQDTDW ARIASLYVVL AHVWPSPVVE LNRAVAVGMA DGPAAGLAVL
DAAVNRGELG DYPLAHAVRG DLLARSGNHG EAAQHFHRAA ELTRNAGERK VFTRRAAELA
APEVSAGCRP PGS