Gene Mmcs_3900 details

Gene Information

Locus tag: Mmcs_3900
Symbol:
ID: 4112730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4154639
End bp: 4156567
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 71%
IMG OID: 638033043
Product: serine phosphatase
Protein accession: YP_641061
Protein GI: 108800864
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR00229] PAS domain S-box
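
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4154639
end_bp = 4156567

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1929
print(protein_length)  # 642
```

Both results match the Gene Length (1929 bp) and Protein Length (642 aa) fields.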


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGTTCG TCGTCGGGTG TCTGTTCGCC GTCCTCGTCG TCGGGTGGCT ATCGCTGGTG 
ACCCAGCCGA CGGCGCTGGC CAGCACGGCG TGGTGGCCCG TCGCCGGGAT CGCGGTCGGG
TTGGGCATCC GCTTCCCGCG CCGTCAGGTG TGGGCGCCGG CCGCCGCCGT CGCCGCGATC
ACCCTTCCGC TCCTGCTCTG GGCGGGACGG CCCGCTGCGC TCGCGACCGC GCTCGCATTC
GCCGTCGCCC TCGAGATGGT GATCGCCACC CTCATCCTCA GGGGGCGCCA CGATCGCCTG
CCGAGCCTGT CGGAACCGCG CGACCTCGGC CGGTTACTGG TGGCCGTCGC CTCGGCGGCG
ATCGTCTACG ACGTGGTCGG CGCCGGCGCC ACCTACCTGC TCGCCGATTC CACCGAGGCG
TGGATCCGGT TCGTCACGTC CGCGCCGAAG CACGCGGCCG GGATGCTGCT GCTGGTCCCG
CTGTTCATGC ACCTGCCGCG CCGCCCCCGG CCTGCGGGTC CGGTCGAGAC CGTCGCGCAG
GTCGTGACGA CCCTGGGCCT GGTCACCTTC GTGTTCGCGT TCAACCCCGG AATGCCGCTG
TCCTTCCTCC CCTTCATGCC ACTGGTGTGG GTGGCGATCC GACTGACGAC CCGGGAACTG
ATCCTGCTGA TGCTGGCGAT CGCCGTCATC GCCTCGGCCG GCAGCGCGTA CGGCACCGGG
CCGTTCGCGT TCAACCTCCT CGCACCGGAG GTGGGCAACC TCGTCCTGCA GGTCTTCGAG
CTGTCGATGG TGGTCGTCTT CCTCTCGCTC TCGCTCGCGG TCGGCCACGA GCGGACCACC
GCACGGCGCC TCAACGAGAG CGAGGAGTTG TTCCGCCGGA TCTTCGAAAC GTCGGTGGCC
GGGATGCTGA TCGCCACCCG TGCCGCCACG GGATGGAAGG TGTTGCGCGC CAACGACTCC
GCGGTGGCCA TCATCCCCGG TCTCGCCGAC GCGTCGGCCG AGCTCACCGA TCTGTTGGGC
GAGGAGGCCA CCGCCGCGCT CTCGGCGGAA GCCGACGCGC TCACCGAGGG CAACGCGCGC
CTGACGCTGA CCACCGGCAC CGAGCGGATC CTCAACGTCA GCATCTCCCC GATCAGCGTC
GACGGGGACA GCAGGACCCT CGCGCTGCAG TTCTACGACA TCACCGAGGC GATGCGCGCC
CGCAGGCTGG AACAGGAGGA ACTCGAGCGC GCCGCCGAAG TGCAACGTGC CCTCCTGCCC
GGGACGCTCC CACCCACCCC CGGGTGGACT TCCGGTGCGG CTTCGGTGCC GGCCAGACAG
GTCGGCGGGG ACTTCTACGA CATCCGGGTC CAGGTCCCGC ACGTGGTCCT CAGCCTCGGC
GACGTCATGG GTAAGGGCAT GGGCGCGGGA ATGTTGGCCG CCGCGACCAG AGCCGCGCTG
CGCGCCACCG ACCCCGAGCT CAGTCCATCG GCCGCCGTGA GCCACATGGC CGGGGTCGTC
GATCACGACC TGCAACGCAC CAGCGCCTTC ATCACGCTGA CCTACGTCCT CGTCGACCTC
GTCACCGGCG ACTTCCGCGT CGCCGACGCC GGGCACGGAC TGCACTTCGT CGTCCGGACC
GGATCGGGTC TGGTGGAGCG CACCGCCTCC AGCGATATGC CGGTGGGGCT CGACAGCGGC
TGGGGCGAGA AGCGCGGAGC GCTCCAGCCC GGCGACGCGA TCCTCCTCGT CAGCGACGGC
GTGATGGACC TGTGGGGCGG CTCCGTCGAA GAGCTGTCGG ATGCCGTGGC ACAGTGCGCC
CGGCAGCACG GCACGAGCCC GCAGGCGTTG GTCGACGCCC TGTGCGCGCG GGCGAACGGT
GATCTGGACC GCGATGATGT GACGGCCGTC GTCCTGCGGC GGGAACCGGT GGACGTGGCG
GCACGCTGA
 
Protein sequence
MVFVVGCLFA VLVVGWLSLV TQPTALASTA WWPVAGIAVG LGIRFPRRQV WAPAAAVAAI 
TLPLLLWAGR PAALATALAF AVALEMVIAT LILRGRHDRL PSLSEPRDLG RLLVAVASAA
IVYDVVGAGA TYLLADSTEA WIRFVTSAPK HAAGMLLLVP LFMHLPRRPR PAGPVETVAQ
VVTTLGLVTF VFAFNPGMPL SFLPFMPLVW VAIRLTTREL ILLMLAIAVI ASAGSAYGTG
PFAFNLLAPE VGNLVLQVFE LSMVVVFLSL SLAVGHERTT ARRLNESEEL FRRIFETSVA
GMLIATRAAT GWKVLRANDS AVAIIPGLAD ASAELTDLLG EEATAALSAE ADALTEGNAR
LTLTTGTERI LNVSISPISV DGDSRTLALQ FYDITEAMRA RRLEQEELER AAEVQRALLP
GTLPPTPGWT SGAASVPARQ VGGDFYDIRV QVPHVVLSLG DVMGKGMGAG MLAAATRAAL
RATDPELSPS AAVSHMAGVV DHDLQRTSAF ITLTYVLVDL VTGDFRVADA GHGLHFVVRT
GSGLVERTAS SDMPVGLDSG WGEKRGALQP GDAILLVSDG VMDLWGGSVE ELSDAVAQCA
RQHGTSPQAL VDALCARANG DLDRDDVTAV VLRREPVDVA AR
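
The gene and protein sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own pipeline, of how the blocks could be joined, the GC fraction computed, and the coding sequence translated. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as the standard code, and treats ATG, GTG, and TTG as start codons (table 11 lists further, rarer ones). The function names `clean_sequence`, `gc_fraction`, and `translate_cds` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the three most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon such as GTG is read as methionine
        # when it initiates translation.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = clean_sequence(
    "GTGGTGTTCG TCGTCGGGTG TCTGTTCGCC GTCCTCGTCG TCGGGTGGCT ATCGCTGGTG"
)
print(translate_cds(first_line))  # MVFVVGCLFAVLVVGWLSLV
print(gc_fraction(first_line))    # 0.65
```

The translated fragment matches the first twenty residues of the protein sequence, including the leading M that comes from the GTG start codon. Passing the full cleaned gene sequence to `gc_fraction` and `translate_cds` would allow the GC content and Protein Length fields to be checked in the same way.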