Gene Namu_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3337 
Symbol 
ID8448952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3674683 
End bp3675981 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content76% 
IMG OID645042414 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003202654 
Protein GI258653498 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00156813 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0107915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG GCGCGGCCGC TGCGGCCGGG GTCGCGGTGG CCAGGGCCCG GACGGAACTG 
TCCGGCCGGG TCCTGGCCGT TACCCTGCGG GCGGTGGCCG ACGCCGACAT CGCCGAGGAG
GCGACCGCCG AGGCTTTCCT GTCGGCGGTC GAGACCTGGC CGCGGACCGG CGTTCCCGAG
TCGCCGGAGG CCTGGCTGAT CACCGTCGCC CGGCGCAAGG CCCTGGATCG GCTGCGCCGC
GACTCACTGG GTCGCCGGCT GGCCCTGCGG ATGCTGCCGC CGCAACCGCA CCCGGCCGCG
GACAGCGGCA TCGACGTCGG GGTCATCGGG GACGAGGAAC TGCGCCTGGT GGTGCTGTGC
GCCGATCCCC GGCTGCCCGA GTCCGATCAG ATCGCGCTCA CCCTCAAATG GGCGTGCGGG
GCCAGCACGG CGGCCATCGC GGCGGCCCAC CTGGTCAGCA CGCCGACGAT GGCGGCCCGG
CTGACCCGGG CCCGCAAGAA GCTGGCCGCG GCCGGTCCGC GGCTGGACCT GCCGGACGAC
GCGACGGTCG ACGCGCGGCT GCCGGCGGTC TGCCGCGTGC TGCACTTGGC CTACACCCTG
GGTCACACGG CGGCCGACGG CGCCGAGCTG ACCGACGAGG ACCTGACCGG CCGGGCCGTG
CACCTGACCC GCACGTTGCA CCGGCAGCGC CCCAAGGACG CCGAGATCAC CGGTCTGCTC
GCGCTGGTGC TGCTCGGCCA GGCCCGGTCG AGCAGCCGCA TCGTCGACGG CCGGCAGGTG
CTGCTGGCCG ACGCGGACCG CACGGCCTGG GACCGAGCGC TGGCGGACGA GGGCCTGGCC
CTGTCCGAGC TGGCCCTGGC CCAGGGCATC GTGGCCGGCG CCCGGCCCGG GCCGCTGGCG
TTGCAGGCGG CGATCTCGGC GGCGCACACC CGGGCGTCCT CGTTCGCCGA CACCGACTGG
CGGCTGATCG TGCAGCTCTA CGGCCTGCTG CTGACGGCCG AGCCCAGCCA CACGCACGCG
CTGGGCCGGT GCGTGGCGGT GTCCTACCTC TACGGTCCGC AGGTCGGGCT GGCCGACCTC
GACGGGGTGA TGGCGGACGG GGTCCTGAAC CGCTACCCGT ACGCCCATGC CGCCCGCGCC
CAGCTGCTCG AGCGGGCCGG CCGGCCCGAG GACGCCGTGC GGGCCTGGCA GGCGGCGGCC
GCGACCGGGC GCACCGACGC GGAACGGGCG TATTTCACCG AGCGGGCCGG GAACAGTCGC
GCCCCGGCCG TGGTTGCCAC TGAGCGTGAA TACTCCTGA
 
Protein sequence
MSTGAAAAAG VAVARARTEL SGRVLAVTLR AVADADIAEE ATAEAFLSAV ETWPRTGVPE 
SPEAWLITVA RRKALDRLRR DSLGRRLALR MLPPQPHPAA DSGIDVGVIG DEELRLVVLC
ADPRLPESDQ IALTLKWACG ASTAAIAAAH LVSTPTMAAR LTRARKKLAA AGPRLDLPDD
ATVDARLPAV CRVLHLAYTL GHTAADGAEL TDEDLTGRAV HLTRTLHRQR PKDAEITGLL
ALVLLGQARS SSRIVDGRQV LLADADRTAW DRALADEGLA LSELALAQGI VAGARPGPLA
LQAAISAAHT RASSFADTDW RLIVQLYGLL LTAEPSHTHA LGRCVAVSYL YGPQVGLADL
DGVMADGVLN RYPYAHAARA QLLERAGRPE DAVRAWQAAA ATGRTDAERA YFTERAGNSR
APAVVATERE YS