Gene Mkms_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3149 
Symbol 
ID4610984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3296821 
End bp3297963 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content72% 
IMG OID639792820 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_939133 
Protein GI119869181 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.328513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGGCCG CCCTCGCGGC GCCCACCGGC GACATCGCCG CGGCCGAGGA CGCGCTCGCC 
GATGCCTTCG AACGCGCGCT CCGGAGATGG CCGGTCGACG GTATCCCGGC CGAACCCGCC
GCCTGGGTGA TCACCGTCGC CCGCAACAGA TTGCGCGACC GCTGGCGCTC GGCCGGTCAC
CGCAGAGCCG CTCGTCTCGA CGAGAACCTC GACGTGACAG CGGAATCCGT CGACTGGCCG
GCCATCCCGG ACAAACGCCT GGAGCTCATG CTGGTCTGTG CGCATCCCTC GGTGGCGGTC
AACGTCCGTA CGCCGCTGAT GCTGCAGGTG GTCATGGGTG TCGACGCGGC GGCGATCGCC
GAGGCGTTCG CCGTCGAACC GGCGACCATG GCGCAGCGGC TCGTACGGGC CAAGCGGCGT
ATCCGCGACA CGGGTGTGCC ATTCACCCTG CCGGAACGTG ACGATCTGGC CGAGCGGCTG
CCCGCCGTGC TCGAATCGGT CTACGGCGTC TATGCCATCG ACTGGCAGCG CGGCCCACCC
GACGACCCGG GGGATTCGTT GGCCGCCGAG GCGTTGCACC TGACCGCCCT GCTGACCGAG
TTGCTGCCCG CCGATCCGGA GGTGCTCGGC CTGGCCGCGC TGGTGTGTTT CGGCGAGGCG
CGCCGCCCCG CGCGGCGTGG GGTCGAGGGC GCGTTCGTCG GCCTCGACGA TCAGGACAGT
GGGCGGTGGG ACCACGAGTT GATCGCCCGG GCCGAGGATC TGCTGCGGCG CGCGCACACC
CACCGGCGGC CGGGCCGGTT CCAGTACGAG GCGGCCATCC ACTCGGCACA CTGTCACCGC
CCGGTGGATC GGCGGGCGCT GCGCAAGCTC TATCTGGCCC TGCTGCGGGT GGCGCCGTCA
CTCGGTGCGG CGGTGGCGCT GGCGGCCCTC GACGGCGAGA TCGACGGGCC GGACGCCGGT
CTGCGGGCAC TCGCGGCGAT CGATGACCCT GCGCTCGACC GGTTTCAACC GGCGTGGACC
ACCCGCGCAC ACCTTCTCGA GCGCGCGGGC CGAACGGCCG AGGCAAATAT CGCCTACCAG
CGGGCACTCG CGATCACCAG CAACCCCGCA CTGAGAGCGC ATCTACGGCA ACGCCTGCGG
TGA
 
Protein sequence
MLAALAAPTG DIAAAEDALA DAFERALRRW PVDGIPAEPA AWVITVARNR LRDRWRSAGH 
RRAARLDENL DVTAESVDWP AIPDKRLELM LVCAHPSVAV NVRTPLMLQV VMGVDAAAIA
EAFAVEPATM AQRLVRAKRR IRDTGVPFTL PERDDLAERL PAVLESVYGV YAIDWQRGPP
DDPGDSLAAE ALHLTALLTE LLPADPEVLG LAALVCFGEA RRPARRGVEG AFVGLDDQDS
GRWDHELIAR AEDLLRRAHT HRRPGRFQYE AAIHSAHCHR PVDRRALRKL YLALLRVAPS
LGAAVALAAL DGEIDGPDAG LRALAAIDDP ALDRFQPAWT TRAHLLERAG RTAEANIAYQ
RALAITSNPA LRAHLRQRLR