Gene Mkms_3974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3974 
Symbol 
ID4611914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4189064 
End bp4191049 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content71% 
IMG OID639793658 
Productstage II sporulation E family protein 
Protein accessionYP_939956 
Protein GI119870004 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCCTG ATCAGGGGGC GCTGGGAGAT CACCGGGTTC GCTCACGAAA ACCGGCCGTG 
GTGTTCGTCG TCGGGTGTCT GTTCGCCGTC CTCGTCGTCG GGTGGCTATC GCTGGTGACC
CAGCCGACGG CGCTGGCCAG CACGGCGTGG TGGCCCGTCG CCGGGATCGC GGTCGGGTTG
GGCATCCGCT TCCCGCGCCG TCAGGTGTGG GCGCCGGCCG CCGCCGTCGC CGCGATCACC
CTTCCGCTCC TGCTCTGGGC GGGACGGCCC GCTGCGCTCG CGACCGCGCT CGCATTCGCC
GTCGCCCTCG AGATGGTGAT CGCCACCCTC ATCCTCAGGG GGCGCCACGA TCGCCTGCCG
AGCCTGTCGG AACCGCGCGA CCTCGGCCGG TTACTGGTGG CCGTCGCCTC GGCGGCGATC
GTCTACGACG TGGTCGGCGC CGGCGCCACC TACCTGCTCG CCGATTCCAC CGAGGCGTGG
ATCCGGTTCG TCACGTCCGC GCCGAAGCAC GCGGCCGGGA TGCTGCTGCT GGTCCCGCTG
TTCATGCACC TGCCGCGCCG CCCCCGGCCT GCGGGTCCGG TCGAGACCGT CGCGCAGGTC
GTGACGACCC TGGGCCTGGT CACCTTCGTG TTCGCGTTCA ACCCCGGAAT GCCGCTGTCC
TTCCTCCCCT TCATGCCACT GGTGTGGGTG GCGATCCGAC TGACGACCCG GGAACTGATC
CTGCTGATGC TGGCGATCGC CGTCATCGCC TCGGCCGGCA GCGCGTACGG CACCGGGCCG
TTCGCGTTCA ACCTCCTCGC ACCGGAGGTG GGCAACCTCG TCCTGCAGGT CTTCGAGCTG
TCGATGGTGG TCGTCTTCCT CTCGCTCTCG CTCGCGGTCG GCCACGAGCG GACCACCGCA
CGGCGCCTCA ACGAGAGCGA GGAGTTGTTC CGCCGGATCT TCGAAACGTC GGTGGCCGGG
ATGCTGATCG CCACCCGTGC CGCCACGGGA TGGAAGGTGT TGCGCGCCAA CGACTCCGCG
GTGGCCATCA TCCCCGGTCT CGCCGACGCG TCGGCCGAGC TCACCGATCT GTTGGGCGAG
GAGGCCACCG CCGCGCTCTC GGCGGAAGCC GACGCGCTCA CCGAGGGCAA CGCGCGCCTG
ACGCTGACCA CCGGCACCGA GCGGATCCTC AACGTCAGCA TCTCCCCGAT CAGCGTCGAC
GGGGACAGCA GGACCCTCGC GCTGCAGTTC TACGACATCA CCGAGGCGAT GCGCGCCCGC
AGGCTGGAAC AGGAGGAACT CGAGCGCGCC GCCGAAGTGC AACGTGCCCT CCTGCCCGGG
ACGCTCCCAC CCACCCCCGG GTGGACTTCC GGTGCGGCTT CGGTGCCGGC CAGACAGGTC
GGCGGGGACT TCTACGACAT CCGGGTCCAG GTCCCGCACG TGGTCCTCAG CCTCGGCGAC
GTCATGGGTA AGGGCATGGG CGCGGGAATG TTGGCCGCCG CGACCAGAGC CGCGCTGCGC
GCCACCGACC CCGAGCTCAG TCCATCGGCC GCCGTGAGCC ACATGGCCGG GGTCGTCGAT
CACGACCTGC AACGCACCAG CGCCTTCATC ACGCTGACCT ACGTCCTCGT CGACCTCGTC
ACCGGCGACT TCCGCGTCGC CGACGCCGGG CACGGACTGC ACTTCGTCGT CCGGACCGGA
TCGGGTCTGG TGGAGCGCAC CGCCTCCAGC GATATGCCGG TGGGGCTCGA CAGCGGCTGG
GGCGAGAAGC GCGGAGCGCT CCAGCCCGGC GACGCGATCC TCCTCGTCAG CGACGGCGTG
ATGGACCTGT GGGGCGGCTC CGTCGAAGAG CTGTCGGATG CCGTGGCACA GTGCGCCCGG
CAGCACGGCA CGAGCCCGCA GGCGTTGGTC GACGCCCTGT GCGCGCGGGC GAACGGTGAT
CTGGACCGCG ATGATGTGAC GGCCGTCGTC CTGCGGCGGG AACCGGTGGA CGTGGCGGCA
CGCTGA
 
Protein sequence
MDPDQGALGD HRVRSRKPAV VFVVGCLFAV LVVGWLSLVT QPTALASTAW WPVAGIAVGL 
GIRFPRRQVW APAAAVAAIT LPLLLWAGRP AALATALAFA VALEMVIATL ILRGRHDRLP
SLSEPRDLGR LLVAVASAAI VYDVVGAGAT YLLADSTEAW IRFVTSAPKH AAGMLLLVPL
FMHLPRRPRP AGPVETVAQV VTTLGLVTFV FAFNPGMPLS FLPFMPLVWV AIRLTTRELI
LLMLAIAVIA SAGSAYGTGP FAFNLLAPEV GNLVLQVFEL SMVVVFLSLS LAVGHERTTA
RRLNESEELF RRIFETSVAG MLIATRAATG WKVLRANDSA VAIIPGLADA SAELTDLLGE
EATAALSAEA DALTEGNARL TLTTGTERIL NVSISPISVD GDSRTLALQF YDITEAMRAR
RLEQEELERA AEVQRALLPG TLPPTPGWTS GAASVPARQV GGDFYDIRVQ VPHVVLSLGD
VMGKGMGAGM LAAATRAALR ATDPELSPSA AVSHMAGVVD HDLQRTSAFI TLTYVLVDLV
TGDFRVADAG HGLHFVVRTG SGLVERTASS DMPVGLDSGW GEKRGALQPG DAILLVSDGV
MDLWGGSVEE LSDAVAQCAR QHGTSPQALV DALCARANGD LDRDDVTAVV LRREPVDVAA
R