Gene Mkms_5169 details

Gene Information

Locus tag: Mkms_5169
Symbol:
ID: 4612852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5417014
End bp: 5418030
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 70%
IMG OID: 639794866
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_941148
Protein GI: 119871196
COG category: [P] Inorganic ion transport and metabolism
              [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
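
The length fields in this record follow from the coordinates. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5417014
end_bp = 5418030

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1017 338
```

Both results agree with the Gene Length (1017 bp) and Protein Length (338 aa) reported here.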


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.787894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGT TCGCCGACAT CAAGGCCAAG TGGGCGAAAT CGTCACCGTT CCAGGTGCTT 
CCGCATATCG ACTGGGCAGA GCAGAAACCC ACCTACCAGG ATGCGCTGCC GGCGCTGATC
AACGATGCGC TGGCCCGCGC GAAGTCCCGT CCGAGCGGCA ACTGGTTCCC GTTCGCGGCC
AGCGACGCCA TCCGGCGTAA ACCGGTGGGC GCCTCGGTGG GCGGCGTCGA ACTCGTCGCG
TGGCGGGGCG CCTGCGGCGA ACTGCGTGTC GGCCCTGCGA GCTGTCCGCA TCTCGGGGCG
GACCTGTCCA CCGGCACCGT CGACTGCGGC ACGCTGATCT GCCCCTGGCA CGGCCTGCGG
CTGTCCGGGG AGCGCCGCGA ATTCGGGTGG AAACCGTTGC CCGCCTTTGA CGACGGGGTA
CTGGCCTGGG TCCGTCTCGA CCGGGTCGGC GGCGAGCAGC CGACGGACCG CCCGATCATC
CCGGTGCGTC CGGCGGAACC CAGGCTGCAC GCAGTGACCA GCCTGGTCGG TGTCTGCGAA
CCGGACGATG TGATCGCCAA CCGGCTCGAC CCGTGGCACG GCGCCTGGTT CCACCCGTAC
TCGTTCACCC GCCTCGAGGT GCTCAGCGCC CCGGCGGCCG GTGAGGTGCC CGAAGCGGAA
GACCGGTTCC TCGTGGCGGT CACGTTCCGC ATCGGCCGCC TGGGCGTGCC GGTGGTCGCC
GAGTTCATCG CGCCCGGACC GCGCACGATC GTCATGCGGA TCGTCGACGG TGAGGGCGCG
GGCAGCGTCG TGGAAACCCA CGCGACACCC GTCGGTCCGG GTCCGGACGG GCGTCCGCGC
ACCGCGGTGA TCGAAGCCGT TGTCGCACAC TCGGATCGGC GCCGGTTCGG CTACGGGAAG
AAGGTCGCGC CGTTGATCAC GCCGTTCATG CGGCATGCGG CGACGAAGCT GTGGCGCGAC
GACCTCGCGT ATGCGGAGCG CCGTTACGCA GTGCGCTCAC AGCTCAACCG ACGCTGA
 
Protein sequence
MSAFADIKAK WAKSSPFQVL PHIDWAEQKP TYQDALPALI NDALARAKSR PSGNWFPFAA 
SDAIRRKPVG ASVGGVELVA WRGACGELRV GPASCPHLGA DLSTGTVDCG TLICPWHGLR
LSGERREFGW KPLPAFDDGV LAWVRLDRVG GEQPTDRPII PVRPAEPRLH AVTSLVGVCE
PDDVIANRLD PWHGAWFHPY SFTRLEVLSA PAAGEVPEAE DRFLVAVTFR IGRLGVPVVA
EFIAPGPRTI VMRIVDGEGA GSVVETHATP VGPGPDGRPR TAVIEAVVAH SDRRRFGYGK
KVAPLITPFM RHAATKLWRD DLAYAERRYA VRSQLNRR
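
The gene and protein sequences can be cross-checked against each other. The sketch below is one way to do that in plain Python, assuming the standard amino acid assignments that translation table 11 shares with the standard code, and assuming the gene starts with ATG so that alternative start codons need no special handling. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tool.

```python
# Codon table in the conventional TCAG order (amino acids as in
# translation table 11, which match the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGAGCGCGT TCGCCGACAT CAAGGCCAAG TGGGCGAAAT CGTCACCGTT CCAGGTGCTT"
print(translate(first_line))  # MSAFADIKAKWAKSSPFQVL
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.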