Gene Mmcs_5294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5294 
Symbol 
ID4114121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5579842 
End bp5581008 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content73% 
IMG OID638034450 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_642451 
Protein GI108802254 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCGCCG ACCCGCTGGA CGCGGCGCGG TTGCGCGACC TGATTCCCGG CGTGCTGGCC 
GCCCTCGTCC ACCGCGGGGC GGACTTCGCG ACCGCCGAGG ACGCAGTGCA GGAGGCGTTG
ATCCGAGCCG TCGAGACCTG GCCGGAGCAT CCGCCCGACG AGCCGAAGGG CTGGCTGATC
ACGACGGCGT GGCGCCGGTT CCTCGATCTC TCCCGGTCGG ACACTGCGCG TCGCCGGCGG
GAGGAACGGG TGTCCAACGA GCCGCCGCCG GGCCCGACCG AATCGGCCGA CGACACGCTG
CAGCTCTGTT TCCTGTGCGC CCATCCCAGT CTCACTCCGG CTTCGGCAGT GGCTCTGACG
CTGCGGGCCG TCGGCGGCTT GACCACCCGG CAGATCGCGC AGGCCTACCT GGTGCCCGAA
GCGACGATGG CGCAGCGGAT CAGCCGCGCC AAGCGCACCG TGAGCGGTGT CCGGCTGGAC
AGCCCCGGCG ACCTGCGCAC GGTGTGCCGG GTGCTGTACC TGATCTTCAA CGAGGGCTAC
AGCGGCGACG TCGACCTCGC CGGGGAGGCG ATCCGGTTGG CCCGTCAGCT TGCGCGCATG
ACCGACGATC CGGAGGTCGC CGGACTGCTC GCGCTGTTCC TGCTCCACCA CGCGCGCCGG
CCCGCGCGGA TCCGGGCCGA CGGCAGCCTG GTGCCGTTGG CCGACCAGGA CCGCAGCCGG
TGGCGACGTG ACCTGATCGC AGAGGGCGTG ACGATCCTGC AGGCCGCCCT GGCCCGCGAC
CGGCTCGGCG AGTACCAGGC CCAGGCCGCG ATCGCGGCCC TGCACGCCGA CGCCCGCACC
GTCGAAGAGA CCGACTGGGT GCAGATCGTC GAGTGGTACG ACGAGCTGGT CCGGCTCACC
GACAGCCCCG TCGTCCGGCT CAACCGGGCG GTCGCCGTCG GAGAGGCGGA CGGGCCGCGG
GCGGGACTCG CGGCGCTCGC CGAACTCGAC CCGTCACTGC CGCGGTACAG CGCATCGGCC
GCCCACCTCC ACGAGCGGGC GGGGGAAATC GCCACGGCCG CAGAGCTTTA CGTGCAGGCC
GCGAATCAGG CGCAGAACCT CGCCGAGCGG AACCATCTCA CGGTCCGCGC GGCCGCCCTC
CGTCAGCGCC TCGCGGGTGA CATCTAG
 
Protein sequence
MAADPLDAAR LRDLIPGVLA ALVHRGADFA TAEDAVQEAL IRAVETWPEH PPDEPKGWLI 
TTAWRRFLDL SRSDTARRRR EERVSNEPPP GPTESADDTL QLCFLCAHPS LTPASAVALT
LRAVGGLTTR QIAQAYLVPE ATMAQRISRA KRTVSGVRLD SPGDLRTVCR VLYLIFNEGY
SGDVDLAGEA IRLARQLARM TDDPEVAGLL ALFLLHHARR PARIRADGSL VPLADQDRSR
WRRDLIAEGV TILQAALARD RLGEYQAQAA IAALHADART VEETDWVQIV EWYDELVRLT
DSPVVRLNRA VAVGEADGPR AGLAALAELD PSLPRYSASA AHLHERAGEI ATAAELYVQA
ANQAQNLAER NHLTVRAAAL RQRLAGDI