Gene Mkms_5847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5847 
Symbol 
ID4610555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008704 
Strand
Start bp54861 
End bp56264 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content59% 
IMG OID639789501 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_935836 
Protein GI119855233 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value0.0153324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTT CAGGCGGTTC GGACACGACC GCTCGTAGTC TCGTCGATGT CGATCGCGGG 
GAGATCAGTC GCGAAATCTT CACCGATGCT TCGATTTTCA ACCGTGAGTT GGAGTATTTG
TTTCCGCGGA GCTGGTTGTT TGTGGGTCAT GCCTCGCAGG TGTCTGCGCC GGGTCAGTTC
TTCTCGTCGA GGATGGGCTC GGATCCGGTG TTGTTGACCC GTGACGCTCA GGGTGGTGTG
AATGTCTTGC TCAATTCGTG CCGACATCGG GGTATGGCGG TGTGTCGCTA TGACGAGGGC
CGTACGCTGC AGTTCACCTG TCCCTATCAT GGCTGGTCGT ACTCGATGGA CGGTTCGCTG
GTGTCCACTC CAGGGGATTT GCACGGTGTG CCGCAGCAGG GCATGGCCTA CGGCAACGGC
CTTGACAAAG CGGCCTGGGG ACTTGTCAGG GCTGCCAAGG TGCACAATTA CAAGGGCCTG
GTATTCGCGT GCTGGGATCC ATCAGCCCCG GAGTTCGATG AATACGTTGG GGACTTTCAT
CATTGGCTGG ATAACCTGTC TGATGCTTTT GATGGTACGG AAGGTGGTAC CGAGGTGTTC
CGTGGGGTGC AGAAGTGGCG CATCAAATCA AATTGGAAAT TCGTCTCAGA GAATTTCTTG
GGCGATACCT ATCACGGGGC GACGACCCAT GCCTCGGTTG AACAGGTGGG CATCGGGCCG
GGTGGCAGAA ATTCACGACG TCACGGTGAA CGACAGGATC AGGGTGGTTT TTCGAAGGGC
CGTGTGAAGA CGTCGTTTCG GATGGGCCAT GGCGCGTCGG ACAATCTGGC GTATGAGATT
CCCTATCCTG AGTTCGCCGA AGAACCGGCC TTGAGTGAGT ACTTCTCCCA GGCGTGGGCG
GTCCGCAAGG AGCGACTGCA GGCGCAGGGC AGACTGCTCG GTGGTCGTGG CCCGGCGACG
ATGTTCCCCA ATATGTCGTT TTCGGCCGGT TTTCCGCGGA CGATCCTGGT GTCACATCCG
ATCAGCCCGA CCGAAACCGA GGTGTGGCGC TGGTATCTCG TCGACAAGAA CGCACCCGAT
GATGTGCGTG ACTGGTTGCG CCGCTATTAC ATGCGCTACT CGGGTCCTGG AGGGATGACG
GAGCAAGACG ATATGGAGAA CTGGAATTAC GCGACGCAGG CCAGCCAGGG CGTGATAGCC
CGGCGCTACC CCTACAACTA TCAGCAGGGT CTCGGCAAGG AAACTCCCAG TGAGCTCGAC
CAGGCGGTGC ATTCTCACCA GCCCATCGCT GGCGAGGTGA ATGCACGCGC CTTTTACCGG
CGATGGGCCG AGTTCACCGA CAACCTCTCG TGGCCCCTAC TCATCGAACT CGCCAAATCC
GACGAGAGAG CCGCACGGTC GTGA
 
Protein sequence
MSTSGGSDTT ARSLVDVDRG EISREIFTDA SIFNRELEYL FPRSWLFVGH ASQVSAPGQF 
FSSRMGSDPV LLTRDAQGGV NVLLNSCRHR GMAVCRYDEG RTLQFTCPYH GWSYSMDGSL
VSTPGDLHGV PQQGMAYGNG LDKAAWGLVR AAKVHNYKGL VFACWDPSAP EFDEYVGDFH
HWLDNLSDAF DGTEGGTEVF RGVQKWRIKS NWKFVSENFL GDTYHGATTH ASVEQVGIGP
GGRNSRRHGE RQDQGGFSKG RVKTSFRMGH GASDNLAYEI PYPEFAEEPA LSEYFSQAWA
VRKERLQAQG RLLGGRGPAT MFPNMSFSAG FPRTILVSHP ISPTETEVWR WYLVDKNAPD
DVRDWLRRYY MRYSGPGGMT EQDDMENWNY ATQASQGVIA RRYPYNYQQG LGKETPSELD
QAVHSHQPIA GEVNARAFYR RWAEFTDNLS WPLLIELAKS DERAARS