Gene Mkms_1690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1690 
Symbol 
ID4613978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1806851 
End bp1808242 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content59% 
IMG OID639791357 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_937683 
Protein GI119867731 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.688738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCTC ACGTTCTAGG GGCACAAATC GATAGGAAGG TGCGCCCGGT GGATTCGATG 
GCGCCTGATG CGACGACAAT GCGAACCTTA GAGAATGCGC GCGGCTCCAT CCTAAAGGGT
CGCCTCCCTG CGTCTCTCAT CGCTAATGCA GCGCTTTACG AGCTTGAATT GAAGCGAGTA
TTTGGTAGGA CCTGGCAGTT TCTCTGCCAC GAAGACGAGA TCCCCAATGC GGGTGACTAT
GTAGTGCGCT ACATCGCTGA TAACTCAATT ATTGTCGCGC GGCAGCAGGA TATGACGATT
CGGGCGATGT CGAACTCGTG TCGGCACCGC GGCACGCTGC TTTGCCGAAC CGAGTCTGGG
AATGAGTCGG CGTTCCAGTG TCCGTACCAC GGTTGGACCT ATCGAAACAA CGGTGATCTC
ATCGCGATAC CTGCGCAGCA GGCAGTGTAC GGTGCTGCGT TCGACAAGAG TCGGCTAGGG
TTGCGCGCTC TGCCGATGCT GGACTCGTAC GCGGGCCTTG TCTTCGGGTG TGTGTCGGAT
GAGGCGCCGG GACTGGATGA GTACCTCGGG GACATGCGCT GGTATCTCGA CTTGATGATG
AAGAAGAGCC CGACCGGCCT TGAGGCGTGG GGTGCCCCGC AGCGTTGGGT GATTGACGCG
AACTGGAAGA CCGGCGCCGA TAACTTTGTT GGGGACGGCT ATCACACGGT CATGACGCAC
CGTTCGATGT GTGAGCTGGG GTTGTTACCG CCCGATAATG TGGCCGTTTC GCCGGCCCAC
GTCAGCCTAT CGGGCGGGCA CGGGGCGGGC GTTCTAGGCG CACCACCCGG CATACCCGCA
CCGCCGTACA TGGGCTATCC GGAGGAAGTC GTCTCCGGTC TCAGCGAGGG TTACGGCGAT
GACGTCCATG GCGAGTTGCT GAAACGGACG ATGTTCATTC ATGGCAATGT GTTCCCGAAC
TTGTCCTTCT TGAACGCCTT CATCGCCAAG GACGGGGAGT CTATGCCGGT GCCCATTCTG
ACCTTGCGGC AATGGCGTCC CTTGGACGCA GCGCGTATGG AGGTGTGGTC GTGGTTCTTC
GTGGAGCGCA ACGCGCCCGA AGAGTTCAAG CAGCAGTCGT TTGAGACTTA TGTTCGGACG
TTCGGGGTCG GGGGTGTCTT CGAGCAGGAT GACGCCGAGA TATTCCAGGC TATTACCAAG
GGAACACGCG GCGAGTTGGC TGGTGGTGTG GAGCTGAACC TGGAGATGGG ACTGGACAAT
CTGGCTCCTG ATCCAACGTG GCTGGGCCCG GGACGACCGT TGGCCAGTGG CTACGCCGAA
CAGAATCAGC GCGAGTACTG GAAGCAGTAC TTCGACTATC TGGCCACACC GAGAAGGGAT
GAGAACGTAT GA
 
Protein sequence
MSAHVLGAQI DRKVRPVDSM APDATTMRTL ENARGSILKG RLPASLIANA ALYELELKRV 
FGRTWQFLCH EDEIPNAGDY VVRYIADNSI IVARQQDMTI RAMSNSCRHR GTLLCRTESG
NESAFQCPYH GWTYRNNGDL IAIPAQQAVY GAAFDKSRLG LRALPMLDSY AGLVFGCVSD
EAPGLDEYLG DMRWYLDLMM KKSPTGLEAW GAPQRWVIDA NWKTGADNFV GDGYHTVMTH
RSMCELGLLP PDNVAVSPAH VSLSGGHGAG VLGAPPGIPA PPYMGYPEEV VSGLSEGYGD
DVHGELLKRT MFIHGNVFPN LSFLNAFIAK DGESMPVPIL TLRQWRPLDA ARMEVWSWFF
VERNAPEEFK QQSFETYVRT FGVGGVFEQD DAEIFQAITK GTRGELAGGV ELNLEMGLDN
LAPDPTWLGP GRPLASGYAE QNQREYWKQY FDYLATPRRD ENV