Gene Namu_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1556 
Symbol 
ID8447154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1716682 
End bp1717971 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content74% 
IMG OID645040683 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003200940 
Protein GI258651784 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00576969 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0421989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACG ACCCCGCTCC GCGCATCGAC ACCGCCCCGC GCATCGGTTC CGTCCCGGCC 
CGGAGCATCG AGCAGGTGTT CCGGGCCGAG CACGGACGGC TGGTCGCCAC CCTGATCCGT
CGCTTCAGCG ACATCGACCT GGCCGAGGAA GCGCTCGGCG ACGCGCTGGT CGTCGCGCTG
CAGACCTGGC CCGAGCAGGG CCTGCCGGCC AATCCCGGCG GCTGGCTCAC CACCACGGCC
ACCAACCGGG CCATCGACCG GATCCGTCGT GAGTCCACCC GGGACGCTCG GCACCGGCAG
GCCAGCGCCC AGACCCTGAC CCAGCACGAC CCGATCGCCG AGCGCGACCG GCAGGAGGAG
GAGGTCGGCG TGATCACCGA CGACCGGCTC CGGCTGATCT TCACCTGCTG CCACCCGGCG
CTGGCCCCGG AGGCGCGGGT CGCGCTCACC CTGCGCCTGC TCGGCGGGCT GACCAGCACC
GAGATCGCGC AGGCGTTCCT GGTCCCGGAG CCGACCATGG CCCAGCGGCT GACCCGGGCC
AAGCGCAAGA TCAAGACGGC CGGCATCCCG TACCGGGTGC CGCAGGCGGA GGACCTGCCG
GCCCGGCTGG CCGGTGTGCT GGCGGTGCTC TACCTGATCT TCAACGAGGG CTATCTGGCC
AGCTCCGGCG ACCAGCCGGT GCGCGACGAC CTGTGCGCCG AGGCGATCCG GCTCACCCGG
GTGGTCCGCG GCCTGCTCCC CGACGAACCC GAGGTCGCCG GGCTGCTCGC GCTGATGCTG
CTCACCCAGG CCCGGCGGGC CACCCGGGTG GCCGGCGGGG TGCTGGTGCC GCTGGACGAG
CAGGACCGGA CGGCCTGGTC CCGGGACCTG ATCGGGCAGG GGCACGAGCT GGTCCGCGAG
TGCCTGCGCC GCAACCGACC CGGCCAGTAC CAGCTGCTGG CCGCGATCAA CGCGGTGCAC
ACCGACGCGC CGACCGCCGC GGACACCGAC TGGGGCCAGA TCGTCGCGCT GTACGACCAG
CTGCGCCGGG TGCACCCGTC ACCGATCGTC GAGCTGAACC GGGCGGTCGC GGTCGCCGAG
CTGGACGGTC CGGCGGTCGG GCTGGCCCTG GTCGAACCCC TGGACCTGGC CGGATACCAC
CCGTGGCACG TCGCCCGGGC CGATCTGCTG CGCCGGCTCG ACCGGCCGCA GGACGCCGCG
GCCGCCTACG AACAGGCCCT GGGCATGACC GAGAACGAGG CCGAACGGGC CTTCCTGCGC
CGCAAGCAGC GCGAGCTGAC CGGCCACTGA
 
Protein sequence
MPDDPAPRID TAPRIGSVPA RSIEQVFRAE HGRLVATLIR RFSDIDLAEE ALGDALVVAL 
QTWPEQGLPA NPGGWLTTTA TNRAIDRIRR ESTRDARHRQ ASAQTLTQHD PIAERDRQEE
EVGVITDDRL RLIFTCCHPA LAPEARVALT LRLLGGLTST EIAQAFLVPE PTMAQRLTRA
KRKIKTAGIP YRVPQAEDLP ARLAGVLAVL YLIFNEGYLA SSGDQPVRDD LCAEAIRLTR
VVRGLLPDEP EVAGLLALML LTQARRATRV AGGVLVPLDE QDRTAWSRDL IGQGHELVRE
CLRRNRPGQY QLLAAINAVH TDAPTAADTD WGQIVALYDQ LRRVHPSPIV ELNRAVAVAE
LDGPAVGLAL VEPLDLAGYH PWHVARADLL RRLDRPQDAA AAYEQALGMT ENEAERAFLR
RKQRELTGH