Gene Mmcs_5038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5038 
Symbol 
ID4113867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5338110 
End bp5339045 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content72% 
IMG OID638034196 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_642198 
Protein GI108802001 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGTCC TGCTTGTCAT GGTGGTGTCG CTGGTCGGCC TGACCGTGTG GGTCGACACG 
TCACTGCAGC GCATCCCCGC CCTGGCCGCC TATCCCGACC GGCCGGCCGC CGGTCGCGGC
ACCACCTGGC TGCTGGTCGG CTCCGACAGC CGCGCCGGCC TCGACGCCGA ACAGCAGGCC
CAGCTCGCCA CCGGCGGTGA CGTCGGCAAC GGACGCACCG ACACGATCAT GCTCGTCCAC
CTCCCCGGCC TGACCTCGAG CGCACCCGCG ACCATGGTGT CGATCCCGCG CGACTCCTAT
GTGCCGATCC CCGGGTACGG CGAGGACAAG ATCAACGCCG CATTCGCGCT GGGCGGCGCG
CCGCTGCTCG CCCAGACCGT CGAGCAGGCC ACCGGTATGC GCCTCGACCA CTACGCCGAG
GTCGGATTCG ACGGGTTCGC CTCGGTCGTC GACGCCGTCG GCGGCGTGAC GATGTGCCCG
GCGGAGCCCA TCAACGATCC GCTGGCCGGG ATCGACCTGC CCGCCGGATG TCAGGAACTC
GACGGGCGCA ATGCGCTCGG CTTCGTGCGC ACTCGCGCCA CCCCGCGCGC CGACCTGGAC
CGGATGACCC ACCAGCGGGA GTTCATGTCC GCGCTGCTGC ATCGCGCGGC CAGCCCGGCG
GTCCTGCTCA ACCCGCTGCG CTGGTATCCG ATGGCGAGCG CGGCCGGCGG CGCACTGACC
GTCGACACCG GTGCGCACGT TTGGGATCTC GCCCGGCTCG GCTGGGCGCT GCGCGGTGAT
CTGACCACCA CGACGGTGCC CATCGGGGAG TTCACCGACG GCGGTGCCGG CGCCGTCGTG
GTCTGGGACA GCGAGGCCGC CGGACGCCTC TTCGACGCGC TGTCAACCGA CACGCCGATC
CCCGCCGACG TGCTCGACAC CACACCGGGC GGCTGA
 
Protein sequence
MAVLLVMVVS LVGLTVWVDT SLQRIPALAA YPDRPAAGRG TTWLLVGSDS RAGLDAEQQA 
QLATGGDVGN GRTDTIMLVH LPGLTSSAPA TMVSIPRDSY VPIPGYGEDK INAAFALGGA
PLLAQTVEQA TGMRLDHYAE VGFDGFASVV DAVGGVTMCP AEPINDPLAG IDLPAGCQEL
DGRNALGFVR TRATPRADLD RMTHQREFMS ALLHRAASPA VLLNPLRWYP MASAAGGALT
VDTGAHVWDL ARLGWALRGD LTTTTVPIGE FTDGGAGAVV VWDSEAAGRL FDALSTDTPI
PADVLDTTPG G