Gene Mkms_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1668 
Symbol 
ID4613956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1783726 
End bp1785093 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content63% 
IMG OID639791335 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_937661 
Protein GI119867709 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.779225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG AAACAACCGG AACAGCTGAC GCGACCGATC CCTACCTGCG GCGCGCGTTG 
CGGGAGGTAG CGGACGGGCT CAAGGTCGGG CGCTTACCGG CCCGCGTCGT CAGCGATCCC
GCGCTACACA CGATCGAGAT GGAGCGGATC TTCGGGCGCG CCTGGGTGTT TCTCGGACAC
GAGTCGGAGT TGGCCAAGTC CGGCGACTTC GTCGTGCGGC ACATCGGGGC CGATTCGGTG
ATCGTTTGCC GGGACAACTC CGGCCGCATC CAGGCGCTGT CCAATTCTTG TCGCCACCGT
GGTGCGCTCG TGTGCCGGGC GGAGATGGGA AACACCGCGC ACTTCCAATG CCTGTACCAC
GGCTGGGTGT ACAGCAACAC CGGAGAGCTC GTCGGCGTGC CGGCGATGAC GGAGGCCTAT
CCCGGCGGCT TCGACAAGTC GCAGTGGGGA TTACGTCACA TCCCCCATGT CGACTCGTAC
GCCGGATTCA TCTTCGGCAG CGTCGATCCG AAGGCGCCGA GCCTGACCGA CTACCTCGGC
GACACGACGT TCTACCTCGA CCTCATTGCG AAGAAGACAG CGGGCGGTCT GGAGGTGATA
GGGGCACCGC ATCGATGGGT GATGTCAGCG AACTGGAAGA CAGCCGCCGA CAATTTTGTC
GGCGACTCCT ACCACACCCT CTTTGCTCAC CGCTCGATGG TCGAGCTAGG CATGGCGCCC
GGTGACCCAA ACTTCGCGAG CGCACCAGCG GAAATCTCGC TGCAGAACGG CCACGGCGTC
GGCGTACTCG GCTTTCCGCC CACGCTCGCC GATTTTCCCG AGTACGAGGG ATACCCCGAC
GAAGTCGTCG ACCAGATGGC GACGTCCTAC CCGTCGCCGG TACACAAGGA CCTGATGCGA
CGCTCATCCT TTATTCACGG CACCGTGTTC CCGAATTTGT CGTTCATCAA CGTGACCCTC
GCGCAGGACC ACATGTCGCC CCCTACCCCC TTCATCACGT TCCGGGTATG GCATCCGCTC
TCCCATGATC GGATGGAGAT CCTCTCCTGG TTCCTGGTCG AACGCGATGC TCCGGAATGG
TTGCGCGATG CGTCCCAGGC GTCCTACGTC AACAACTTCG GCCCAGGTGG GGTTTTCGAA
CAGGACGACG CCGAGGCATG GAAGGCCATC ACCGAATCTG TCCAGGGCCC GTTCGCCGGT
GAAGGCCTGC TGAACTACGA AATGGGCATG GACTTGACTC CGCTCACCGA CTGGCCAGGG
CCGGGAGAGG CCCTCCCGAG CGGGTACGCC GAGCAGAATC AGCGGCGGTT TTGGGGGAGA
TGGCTGGAAT ACATGGGTCA GCCTCCCGCA TTCGGCGGGC GTGCTTGA
 
Protein sequence
MTTETTGTAD ATDPYLRRAL REVADGLKVG RLPARVVSDP ALHTIEMERI FGRAWVFLGH 
ESELAKSGDF VVRHIGADSV IVCRDNSGRI QALSNSCRHR GALVCRAEMG NTAHFQCLYH
GWVYSNTGEL VGVPAMTEAY PGGFDKSQWG LRHIPHVDSY AGFIFGSVDP KAPSLTDYLG
DTTFYLDLIA KKTAGGLEVI GAPHRWVMSA NWKTAADNFV GDSYHTLFAH RSMVELGMAP
GDPNFASAPA EISLQNGHGV GVLGFPPTLA DFPEYEGYPD EVVDQMATSY PSPVHKDLMR
RSSFIHGTVF PNLSFINVTL AQDHMSPPTP FITFRVWHPL SHDRMEILSW FLVERDAPEW
LRDASQASYV NNFGPGGVFE QDDAEAWKAI TESVQGPFAG EGLLNYEMGM DLTPLTDWPG
PGEALPSGYA EQNQRRFWGR WLEYMGQPPA FGGRA