Gene Mmcs_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3089 
Symbol 
ID4111921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3268539 
End bp3269681 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content72% 
IMG OID638032219 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_640252 
Protein GI108800055 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.614785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGGCCG CCCTCGCGGC GCCCACCGGC GACATCGCCG CGGCCGAGGA CGCGCTCGCC 
GATGCCTTCG AACGCGCGCT CCGGAGATGG CCGGTCGACG GTATCCCGGC CGAACCCGCC
GCCTGGGTGA TCACCGTCGC CCGCAACAGA TTGCGCGACC GCTGGCGCTC GGCCGGTCAC
CGCAGAGCCG CTCGTCTCGA CGAGAACCTC GACGTGACAG CGGAATCCGT CGACTGGCCG
GCCATCCCGG ACAAACGCCT GGAGCTCATG CTGGTCTGTG CGCATCCCTC GGTGGCGGTC
AACGTCCGTA CGCCGCTGAT GCTGCAGGTG GTCATGGGTG TCGACGCGGC GGCGATCGCC
GAGGCGTTCG CCGTCGAACC GGCGACCATG GCGCAGCGGC TCGTACGGGC CAAGCGGCGT
ATCCGCGACA CGGGTGTGCC ATTCACCCTG CCGGAACGTG ACGATCTGGC CGAGCGGCTG
CCCGCCGTGC TCGAATCGGT CTACGGCGTC TATGCCATCG ACTGGCAGCG CGGCCCACCC
GACGACCCGG GGGATTCGTT GGCCGCCGAG GCGTTGCACC TGACCGCCCT GCTGACCGAG
TTGCTGCCCG CCGATCCGGA GGTGCTCGGC CTGGCCGCGC TGGTGTGTTT CGGCGAGGCG
CGCCGCCCCG CGCGGCGTGG GGTCGAGGGC GCGTTCGTCG GCCTCGACGA TCAGGACAGT
GGGCGGTGGG ACCACGAGTT GATCGCCCGG GCCGAGGATC TGCTGCGGCG CGCGCACACC
CACCGGCGGC CGGGCCGGTT CCAGTACGAG GCGGCCATCC ACTCGGCACA CTGTCACCGC
CCGGTGGATC GGCGGGCGCT GCGCAAGCTC TATCTGGCCC TGCTGCGGGT GGCGCCGTCA
CTCGGTGCGG CGGTGGCGCT GGCGGCCCTC GACGGCGAGA TCGACGGGCC GGACGCCGGT
CTGCGGGCAC TCGCGGCGAT CGATGACCCT GCGCTCGACC GGTTTCAACC GGCGTGGACC
ACCCGCGCAC ACCTTCTCGA GCGCGCGGGC CGAACGGCCG AGGCAAATAT CGCCTACCAG
CGGGCACTCG CGATCACCAG CAACCCCGCA CTGAGAGCGC ATCTACGGCA ACGCCTGCGG
TGA
 
Protein sequence
MLAALAAPTG DIAAAEDALA DAFERALRRW PVDGIPAEPA AWVITVARNR LRDRWRSAGH 
RRAARLDENL DVTAESVDWP AIPDKRLELM LVCAHPSVAV NVRTPLMLQV VMGVDAAAIA
EAFAVEPATM AQRLVRAKRR IRDTGVPFTL PERDDLAERL PAVLESVYGV YAIDWQRGPP
DDPGDSLAAE ALHLTALLTE LLPADPEVLG LAALVCFGEA RRPARRGVEG AFVGLDDQDS
GRWDHELIAR AEDLLRRAHT HRRPGRFQYE AAIHSAHCHR PVDRRALRKL YLALLRVAPS
LGAAVALAAL DGEIDGPDAG LRALAAIDDP ALDRFQPAWT TRAHLLERAG RTAEANIAYQ
RALAITSNPA LRAHLRQRLR