Gene Mmcs_5234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5234 
Symbol 
ID4114062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5519949 
End bp5521496 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content72% 
IMG OID638034391 
Productputative DNA-binding protein 
Protein accessionYP_642392 
Protein GI108802195 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.198671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCACTT TGGACCGGCT CGTCAATGTG CTCGGCGGTT ACGGCGTCCA GTTCCGGGCG 
GGTTCGGCGC CGCGCTCGAC CGAGTTGCGC ACCGTGGTGA TCCACGAGGA TCGCCACGTC
GTCGGCGACG TCCTGCTGGC GGTCGGAGCC GATTCGGTGG CCACCGCACT CGAATGGGCG
CGCGCCGCAC GGGCGGCGGT GGTGCTGGTC CGCGGCGACG ACGTCGGTGT GGACGCGACA
CCGGCCGGCG GGCCCGCGGT TCTCACGGTC GACCCCGACG TGTCCTGGAG TGAGTTGGCG
GCGCTGGTGT TCGGCCTGGT GCTCGAGGGG CGCGAGACGG AGTCCGGGCG CGGCCCGACC
GATCTGTTCG CGCTGGCCGA CAGCCTGGCC GACGCGATCG GCGGTGCGGT CACCATCGAG
GACCGGCACT GGCGGGTGCT GGCCTACTCG CGGATGCAGC AACACGCCGA TGACGCGCGC
GTCGCGACCA TCCTCGGTAG GCAGGCCCCC GACAGACTGC GGGCGCTGTT CACCGAACGC
GGCGTAGCCC GGCACCTCGC CAACTCGGAT GAACCGATGT TCGTGGCCCC CGCACCCGCC
GACGGGCTCG CCGGCCGGAT GGTGATCGCG GCCCGCGCCG GTCGCGAACT GCTCGGCTCG
GTGTGGGTGG CCTGCGCGGA GGAGTTGCGC GGTGACCAGC TGCGCGCGTT GGCCGACGGC
GCCCGCATGG TCGCACTGCA CCTGTTGCGG TCGCGGGCCA GCGCCGACCT CGAGCGCCAG
GTGGAATCCG ATCTGGTGAT CGGTCTGCTG GAGGGCACCG TCGACGCCCC GACGGTGGTG
AGCAAGCTGG CGTTGCCGCC TGCGGGACTG CGGGTCATCG CGCTGCGCGC CCGCCTCGGC
GAGGAACGCC ACGCGGCGCT GCTGTTGGCC TTCGAACGCG CGACCACGGG TTTCGGGTGG
TCGCGGCCCG GCCGCTCCAC GCTGTCGGCC ACCACCGTCT ACACCGTGTT GCCCAGCGAA
CCGGCGGAGA CCGCGCGCCG CTGGGTGGAC AGCCTGCGGG CCGCACTGCC GGAACGGGCC
GCCATCCTCG CCGGAATCAG CAGTACGGCA ACGGTTTTGG AACTGCCGAC GGCTCGTGAC
GAGGCCGACG AGTGCCTGGC GCTGCACGAA CTGCAGGGCG GCGTCGGCGA GGCGCCCGCC
TACGACGAGT CCTGGGACGA CATCGTGCTG CGGCGGCTGC GGATCGCCGC GCGCGTCGGC
CGCACCCCGC AACGCGGACC GGTGGCCGAC CTGCGGCGCC ACGACGAGCA TCACGGGACC
CGCTACGTGG ACACGCTGCG CGCCTGGCTG GCCGCGCAGG GAGATCTGCA CGAGGCGGCC
GAGCGCCTGG GCGTGCACGA GAACACCGTG CGCTACCGGC TGCGCAAGAT GGCCGAGGTC
ACCGACCTCG ACCTGACCGA CGCGCGCAAG CGGCTGGCCA TGACGGTCGA ACTCGCCGCT
ACAGACGACG ACGGTTTCAC GTTGTCGGAG GCCGACAAAA TTTCGTGA
 
Protein sequence
MVTLDRLVNV LGGYGVQFRA GSAPRSTELR TVVIHEDRHV VGDVLLAVGA DSVATALEWA 
RAARAAVVLV RGDDVGVDAT PAGGPAVLTV DPDVSWSELA ALVFGLVLEG RETESGRGPT
DLFALADSLA DAIGGAVTIE DRHWRVLAYS RMQQHADDAR VATILGRQAP DRLRALFTER
GVARHLANSD EPMFVAPAPA DGLAGRMVIA ARAGRELLGS VWVACAEELR GDQLRALADG
ARMVALHLLR SRASADLERQ VESDLVIGLL EGTVDAPTVV SKLALPPAGL RVIALRARLG
EERHAALLLA FERATTGFGW SRPGRSTLSA TTVYTVLPSE PAETARRWVD SLRAALPERA
AILAGISSTA TVLELPTARD EADECLALHE LQGGVGEAPA YDESWDDIVL RRLRIAARVG
RTPQRGPVAD LRRHDEHHGT RYVDTLRAWL AAQGDLHEAA ERLGVHENTV RYRLRKMAEV
TDLDLTDARK RLAMTVELAA TDDDGFTLSE ADKIS