Gene Namu_2337 details


Gene Information

Locus tag: Namu_2337
Symbol:
ID: 8447948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2577481
End bp: 2578740
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 73%
IMG OID: 645041458
Product: putative RNA polymerase, sigma-24 subunit, ECF subfamily
Protein accession: YP_003201702
Protein GI: 258652546
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
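
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function and constant names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2577481
END_BP = 2578740
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the reported gene length matches the coordinate span, and the reported protein length matches the codon count minus the stop codon.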


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0000298794
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000148317
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGCGATC CGGTCGCCGA TGCGGTGACG CGGCTGGCGC GAGAGGACAG CGGCCGGATG 
GTCGCGATTC TCGCGCGCCG GTTCGGCGAC CTCGACGTCG CCGACGAATC GGTGCAGGAC
GCCCTGATGG AGGCGGTGCA GGCCTGGCCG GGCACCGGCG TCCCGGACAA TCCGGCCGGG
TGGCTGCGCA CGGTCGCCAC CCGCAAGGCC ATCGACCGGA TCCGCCGGGC CGGGTCGGCC
TACCGCCGCA CGTTGGCCGT TGCGCGGGAC CTGCTGGCCG AGCCGGTGGA CGGTCCGGAC
CCCGAGGAAG ATCCGGGGGA AGAGCTCATG ATCGACGACG ACGCGATCCG GGATCAGCAG
TTGCGGCTGA TCCTGCTGTG CTGCCATCCG GCGCTGCACC CCGATGCGCA GGTGGCCCTG
ACCCTGCGCC TGGTCGGAGG GCTGTCCACG GCCGAGATCG CGGCCGGCTT CCTGGCGCCG
GAGGCGACGA TCGCGCAACG GATCGTGCGG GCCAAGCGCA AGATCCGGGA GGCCCGGATC
CCGCTGCGGC GGCCCGACGA CCTGGCCGAA CGGGTCGAGG TGGTGCTCGC CGTGCTCTAC
CTGGTCTTCA ACGAGGGTTA CCTGTCCCGC CGGTCGGCCG GCGCCCGGGT CAACCTGATG
GACGAGGCGA TCCGGTTGAC CGAGGTGCTG GCCGACCTGC TGCCGCAGGA GCCGGAAATC
GCCGGCCTGC TGGCCCTGGA ACTGTTCCAA CGTTCGCGCA CCGGTGCCCG CTTCGATGCG
GCCGGCGACC TGGTGCTGCT GGAGGATCAG GATCGCTCGT CCTGGGACCT GGCGATGATC
AACCGGGCCA ACCGGGTGCT CGGCCCGGCG CTGGGCCGGA TGCGGCCCGG GGTGTACCAG
GTCCAGGCCC TCATCGCGGC CCAGCACGCG AACGCGCGGA CCGCCGCCGA CACCGACTGG
CCTGCGATCG CGACGCTCTA CGGGCAGCTG CTGGCCATGA CCGGCTCGCC GGTGGTCGCC
CTGAACCATG CGGTCGCGGT CGGACTGGCC GACGGGCCGG ATGCGGGGTT GGCCCGGCTG
GATCAGCTCA CCGGTCTGGA CGGCTATCAC CTGTTGCCGG CCGCGCGGGC CGAGATGCTG
GTGCGCGCCG GGCGTCCGGC CGAGGCGGTG GCGCAGTTCG ACGCGGCCCT GCGGCTGGTC
GGCGGCCAGA CCGAGCGGCG TCATCTGCAG CGCCGGCGGG ATTGCCTGCG GAGCGGCTGA
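
The GC content field in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G and T bases; the spaces that split the sequence into blocks of ten are ignored. The name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters (for example the block
    separators used in this record) are skipped.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, copied from the record.
    print(gc_percent("GTGAGCGATC CGGTCGCCGA"))
```

Passing the full 1260 bp gene sequence to the same function should give a value comparable to the GC content listed under Gene Information, if the record uses the same definition.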
 
Protein sequence
MSDPVADAVT RLAREDSGRM VAILARRFGD LDVADESVQD ALMEAVQAWP GTGVPDNPAG 
WLRTVATRKA IDRIRRAGSA YRRTLAVARD LLAEPVDGPD PEEDPGEELM IDDDAIRDQQ
LRLILLCCHP ALHPDAQVAL TLRLVGGLST AEIAAGFLAP EATIAQRIVR AKRKIREARI
PLRRPDDLAE RVEVVLAVLY LVFNEGYLSR RSAGARVNLM DEAIRLTEVL ADLLPQEPEI
AGLLALELFQ RSRTGARFDA AGDLVLLEDQ DRSSWDLAMI NRANRVLGPA LGRMRPGVYQ
VQALIAAQHA NARTAADTDW PAIATLYGQL LAMTGSPVVA LNHAVAVGLA DGPDAGLARL
DQLTGLDGYH LLPAARAEML VRAGRPAEAV AQFDAALRLV GGQTERRHLQ RRRDCLRSG
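
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 (as listed in the record) shares the standard codon-to-amino-acid assignments and reads an alternative start codon such as GTG as methionine when it is the first codon. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical, and the start codon set is stated from memory of NCBI table 11 and should be treated as an assumption.

```python
from itertools import product

# Standard codon assignments in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    A recognised start codon in the first position is read as 'M'.
    Whitespace between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "GTGAGCGATC CGGTCGCCGA TGCGGTGACG CGGCTGGCGC GAGAGGACAG CGGCCGGATG"
LAST_LINE = "GGCGGCCAGA CCGAGCGGCG TCATCTGCAG CGCCGGCGGG ATTGCCTGCG GAGCGGCTGA"

if __name__ == "__main__":
    print(translate_cds(FIRST_LINE))
    print(translate_cds(LAST_LINE))
```

Each line of the gene sequence holds 60 bases, so every line starts in frame and can be translated on its own. Note that the start-codon rule applies to the first codon of whatever fragment is passed in, so it is only meaningful for fragments that begin at the true start of the CDS or with a codon outside the start set.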