Gene Mkms_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1901 
Symbol 
ID4613647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2017599 
End bp2019035 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content70% 
IMG OID639791566 
ProductGntR family transcriptional regulator 
Protein accessionYP_937891 
Protein GI119867939 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.775747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACCA GAGTGCTCGA CGTCGAGCGA CTGGCACGTG AGTTGGGCAA CTGGCGTACG 
GCCAGTTCGA GTGGCCCGGT CTACCAGGGT CTGGCCGACG GAATCCGGAT GCTGATCGTC
GACGGGCGGT TACCGGTCGG CGCCCGCCTA CCCAGTGAAC GCGCACTCGC GGAGTGCCTG
CGGGTTTCGC GCACCACCGT CACCAGCGCC TACGCCCAAC TCCGCGAGGA CGGCTATCTG
CTCGCCCGCC GCGGGGCGCG CAGCACCACG GCCCTGCCGG CCACACCGAA CGCCGACACA
CCCCCGGCCG TTTCCCGCGG TGCGGTCAAC CTGGCCAATG CCGCGCTGGC CGCCCCTGTA
CCCGCGGTGC TCGCCGCCCT CACCGCCGCC ACCGGCCAGA TGGCGCCCTA CCTGCACGAC
ACCGGCATCG AACTCACCGG TGTCCCCTCG CTTCGCCGGG CCGTCGCCGA AAGATATTGT
GAGCGTGGGC TTCCCACCGA ACCCGACGAC ATCATGATCA CCACCGGGGC GCTGCACGCC
ATCGGGTTGA TCCTGGCGAC GTACACCCAG CCGGGAGATC GTGTCCTCGT CGAGCAGCCG
ACCTACCACG GCGCACTCGC CGCGATCTCC ACCGCGGGCG CCCGGGCGGT CCCCGTCGCG
ATCGGTGAGG ACGGCTGGGA TCTCGGGGCC TTGCACAACG CGGTCCGCCA GCTCGCGCCC
AGCCTCGCCT ACGTGGTGCC CGACAACCAC AACCCCACCG GTCTGACGAT GCCGCAGCCC
CAGCGCGAGG AGTTGGCCGC GATGATCGCC GACACCCGCA CCCGCACCAT CGTCGACGAG
ACGATGACCG AGGTGTGGCT GGACGAGCCG GTCCCCCCTC CGGTCGCCAC GGCGATGACC
CGCCGCCGCG ATCTGCTGCT GACCATCGGG TCGATGTCCA AGTCGTTCTG GGGTGGTCTG
CGGATCGGCT GGATCCGCGC CGACCCGTCG ACGTTGGCCT CGATCGCCGC GCTCCGCCCG
TCCATCGACA TGGGCACGCC GATCCTCGAA CAGCTCGCCG CCGCGGAGTT GATACGCGTC
GCCGACGACG TGTTGCCGGA GCGACGCGAG ATCCTGCGCA CCCGCCGGGC ACTGATGTTG
TCGCTGCTCG ACGAACATCT GCCGGATTGG CAGCCCTGCC CGGGTGGTGG CGGTCTGGCG
CTGTGGGTGC GACTGCCGAC ACCGATGAGT TCGGCCCTCT CGGCCGCGGC GTCACGGATG
GGGCTGGACG TGCCGCCCGG CCCGCGCTTC GGTGTGGACG GCTCGCTGGA GCGGTTCATC
CGGATTCCCT ACACGCTGCC GCCCGAGCAG ATGACCGAGG CCGTCACCCT GCTGGCGCGC
GCCTGGCATG CGGTGACCGG TGCGACGGCT CCGCAGCAGC GTGTCGTAGT GGTATGA
 
Protein sequence
MPTRVLDVER LARELGNWRT ASSSGPVYQG LADGIRMLIV DGRLPVGARL PSERALAECL 
RVSRTTVTSA YAQLREDGYL LARRGARSTT ALPATPNADT PPAVSRGAVN LANAALAAPV
PAVLAALTAA TGQMAPYLHD TGIELTGVPS LRRAVAERYC ERGLPTEPDD IMITTGALHA
IGLILATYTQ PGDRVLVEQP TYHGALAAIS TAGARAVPVA IGEDGWDLGA LHNAVRQLAP
SLAYVVPDNH NPTGLTMPQP QREELAAMIA DTRTRTIVDE TMTEVWLDEP VPPPVATAMT
RRRDLLLTIG SMSKSFWGGL RIGWIRADPS TLASIAALRP SIDMGTPILE QLAAAELIRV
ADDVLPERRE ILRTRRALML SLLDEHLPDW QPCPGGGGLA LWVRLPTPMS SALSAAASRM
GLDVPPGPRF GVDGSLERFI RIPYTLPPEQ MTEAVTLLAR AWHAVTGATA PQQRVVVV