Gene Mmcs_0738 details


Gene Information

Locus tag: Mmcs_0738
Symbol:
ID: 4109583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 783382
End bp: 784632
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 71%
IMG OID: 638029863
Product: RNA polymerase ECF-subfamily sigma factor
Protein accession: YP_637914
Protein GI: 108797717
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
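
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 783382
END_BP = 784632


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1251 416
```

Both results agree with the Gene Length and Protein Length fields listed in the record.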


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.481415
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTTTCTG ACATCCAGCA GACCGTCGAC GCGGTGTGGC GGATGGAGGC CGCCAAGATC 
GTCGCCACGC TCACCCGCAC CGTCGGCGAC GTCGGCTTGG CCGAGGATCT CGCCGCCGAC
GCCCTCGTCG ACGCCCTCAC CCAGTGGCCG TCGACCGGCG TGCCGAACAA CCCGGGCGCC
TGGCTGACCA CCGTCGCCAA ACGCAAGGCG ATCGACCACT GGCGGCGGCG CGACAACCTC
GACGCGAAGT ACACCGAACT GGCCCGCGAC CTCGAAACCC ACCTCGACGA ACCCGCCTGG
GACCCCGACC ACATCGACGA CGACGTCCTG CGGCTCATCT TCATCGCGGC CCACCCGGTG
CTGTCGCGGG AGAACCAGAT CGCGCTGACC CTTCGCGTCA TCGGCGGCCT GACCACCGAG
GAGATCGCCA GGGCCTTCCT GACGCCGAAA GCCACGGTGG CCCAACGCAT CGTGCGGGCC
AAGAAGACAC TGGCCGACGC GGAGGTCCCG TTCGAGGTGC CGCCCCGCGA GCAGTACCCG
CAACGCCTGT CGGCGGTGTT GAGCGTCATC TACCTCATCT ACAACGAGGG GTATTCGGCG
TCCTCCGGGC AGCGTTGGAT CCGCGACGAA CTGTGCCGCG AGGCGCTGCG CCTGGGCCGC
GTCCTCGCCG CGCTGGTGCC CGACGAACCC GAGGCCCACG GGCTGGTGGC GCTGATGGAA
CTGCAGAGCT CGCGGTTCGC GGCCCGTACC GACGACGCGG GCCGCCCGAT CCTGCTCGAG
GACCAGGACC GGACGAAATG GGACCGCGCC CAGATCGGCC GGGGCGTCGC GGCGCTGCGG
CGTGCGGTGG CGGCCATCGA CCGGCGAGGC ACCGGGTGGG GCCCGTACGC GCTGCAGGCC
GCGCTCGCCG AATGCCACGC CACCGCACCG TCGACCGCCG AGACCGACTG GCGGCGCATC
GTGACCATCT ACGACGCGCT GCTGCAGATC ACGCCCTCGG CCGTCGTCGA ACTCAACCGC
GCCGTGGCCG TCGCGATGGC CGACGGGGCC GCCGCCGCGC TCGAGATCGT CGACGGGATC
ACCGGGCTCG CCGACTCCTA TCTATTGCCC AGCGTGCGCG GCGAACTGCT GGCCCGCCTG
GGCCGCGCCG ACGAGGCCGC CGCTCAGTTC GACCGGGCCG CCGCACTCGC CGACAACGAG
CGCGAACGAG ACGTGTTGTC CGACAAAGCC GCCCGGGTAC GCCAGAGGTG A
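
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch, assuming the sequence text has been pasted into a string; the helper names are illustrative, and only the first two blocks of the sequence are used here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = "GTGGTTTCTG ACATCCAGCA"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # prints: 20 0.5
```

Passing the full 1251 bp sequence to the same two functions gives a value that can be compared with the GC content field of the record.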
 
Protein sequence
MVSDIQQTVD AVWRMEAAKI VATLTRTVGD VGLAEDLAAD ALVDALTQWP STGVPNNPGA 
WLTTVAKRKA IDHWRRRDNL DAKYTELARD LETHLDEPAW DPDHIDDDVL RLIFIAAHPV
LSRENQIALT LRVIGGLTTE EIARAFLTPK ATVAQRIVRA KKTLADAEVP FEVPPREQYP
QRLSAVLSVI YLIYNEGYSA SSGQRWIRDE LCREALRLGR VLAALVPDEP EAHGLVALME
LQSSRFAART DDAGRPILLE DQDRTKWDRA QIGRGVAALR RAVAAIDRRG TGWGPYALQA
ALAECHATAP STAETDWRRI VTIYDALLQI TPSAVVELNR AVAVAMADGA AAALEIVDGI
TGLADSYLLP SVRGELLARL GRADEAAAQF DRAAALADNE RERDVLSDKA ARVRQR
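
The protein sequence can be related back to the gene sequence by translation. The gene begins with GTG while the protein begins with M, which fits the use of alternative start codons under translation table 11. The following is a minimal sketch, assuming the table's amino acid assignments match the standard genetic code and treating only ATG, GTG and TTG as start codons (the table lists further alternatives); the names are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (assumed to match
# the amino acid assignments of translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Assumption: only the most common start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a CDS, reading a start codon as M and stopping at a stop codon."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    first = codons[0]
    residues = ["M" if first in START_CODONS else CODON_TABLE[first]]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("GTGGTTTCTG ACATCCAGCA GACCGTCGAC"))  # prints: MVSDIQQTVD
```

The printed residues match the first block of the protein sequence listed above.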