Gene Mkms_5086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5086 
Symbol 
ID4612769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5321038 
End bp5322396 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content73% 
IMG OID639794783 
ProductGntR family transcriptional regulator 
Protein accessionYP_941065 
Protein GI119871113 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.45366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGTC CCGGCGCCCG CGGAGTGCGC GAACTGCTGG TCAACGCGCT GCGCGAGGCC 
GTGCGCTCGG GGCGGCTCGC GGCGGGCACC GTGCTGCCGC CGTCGCGCAC ACTCGCGGCC
GACCTCGGCA TCGCCCGCAA CACGGTCGCC GAGTCGTACG CCGAACTCGT CGCCGAGGGC
TGGCTCGCGT CACGGCAGGG CGCGGGCACC TGGGTGGTCA ACTCCCAGGC GGGCCATCAG
ATCGCCCGGC CCCGGCGAGT GCGCTCGACG GCGCGACACA ACCTGATGCC CGGCTCCCCC
GACGTGTCGG AGTTCCCTCG CAGCGCCTGG CTCGCCTCCG CCCGCCGCGC GCTGACCAAC
GCCCCCAGCG AGGCGTTCTG GATGGGCGAT CCGCGCGGCC GACCGGAGCT TCGCGAGGCA
CTCGCCGACT ACCTCGCACG GGTGCGCGGC GTCCGCACCT CACCGGAGAC AATCGTGATC
TGCGCCGGGG TCCGGCACGC GGTGGAGCTG CTCACCCGGG CCCTCGGGGC GCAGCGCCCG
ATCGCCGTCG AGGCCTACGG CCTGTTCATC TTCCGCGACG CGATCCGCGC GCTCGGGGGG
TCGACGGTGC CAATCGGCCT CGACGGCCAC GGTGCGGTGA TCGGCGACCT CGACGGACAT
GACGTGCCCG CGGTCCTCCT CACACCGGCC CACCACAATC CGCACGGTAT GCCGCTGCAC
CCCACCCGCC GCACCGCCGC GGTGGAGTGG GCCCAGCGTC GCGGCGCCTA TGTCCTCGAG
GACGACTACG ACGGCGAGTT CCGCTACGAC CGTCAACCCG TCGGCGCGAT GCAGAGCCTG
GACCCCGAAC GGGTGGTGTA CCTGGGCTCG GCCAGCAAGA GCCTCGCCCC CGCGTTGCGG
CTGGGATGGA TGGCGCTGCC CGACGCCCTC GTCGAACCCG TCCTGGCCGC CGCGGGTGGC
AACCAGTTCT TCGTCGACGC ACTCGCCCAA CTGACCATGG CCGACTTCAT CACCGCCGGC
CATTACGACC GCCACATCCG CCGGATGCGG ATGCGCTACC GGCGGCGGCG CGACCGGCTC
GTCGACGCAC TCGCCCCGTT CGACGTCGAA ATCCGCGGTC TGGCAGCGGG ACTCAACGCG
CTGCTGACGC TGCCCGACGG CGCCGAACGC GAGGTGCTGC AGCGGGCGGG CAATGCGGGG
ATCGCGCTGG TGGGCCTGTC CGCGATGCGA CATCCGGCGG CCGGACCGGC CGTCGCGGAC
CCCGACGGTG TGATCGTCGG CTTCGGCGCA CCGGCCGATC ACGCCTTCGC CGCGGCGGTC
GAAGCGCTGT GCGACGTCCT GGAGGCCACG CTGCGCTGA
 
Protein sequence
MIRPGARGVR ELLVNALREA VRSGRLAAGT VLPPSRTLAA DLGIARNTVA ESYAELVAEG 
WLASRQGAGT WVVNSQAGHQ IARPRRVRST ARHNLMPGSP DVSEFPRSAW LASARRALTN
APSEAFWMGD PRGRPELREA LADYLARVRG VRTSPETIVI CAGVRHAVEL LTRALGAQRP
IAVEAYGLFI FRDAIRALGG STVPIGLDGH GAVIGDLDGH DVPAVLLTPA HHNPHGMPLH
PTRRTAAVEW AQRRGAYVLE DDYDGEFRYD RQPVGAMQSL DPERVVYLGS ASKSLAPALR
LGWMALPDAL VEPVLAAAGG NQFFVDALAQ LTMADFITAG HYDRHIRRMR MRYRRRRDRL
VDALAPFDVE IRGLAAGLNA LLTLPDGAER EVLQRAGNAG IALVGLSAMR HPAAGPAVAD
PDGVIVGFGA PADHAFAAAV EALCDVLEAT LR