Gene Mkms_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1647 
Symbol 
ID4613935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1763095 
End bp1764549 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content59% 
IMG OID639791318 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_937644 
Protein GI119867692 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGATC ACGGTGAGGT GTTAGCGGCT GTACGCACTG GCATGATCCC GGCGCACGTG 
TATAACGACA AGCAAATTTT CTCGCTCGAG AAGGAGCGCC TGTTCAGTCG GGCGTGGTTG
TTCGTGGCGC ACGAGTCGGA GATTCCGCAG CCGGGGGACT ACGTGGTCAG ACAAGTGTTG
CAGGATTCGT TCATCGTCGC TCGTGATTCT GCCGGCGAGA TCCGGGTGAT GTTCAATATG
TGCCTCCATC GCGGTATGCA GGTTTGCCGG GCGGAAATGG GGAACGCGTC GAACTTCAGA
TGCCCGTACC ACGGGTGGTC TTACCGCAAT GACGGTCGCA TTATTGGCCT GCCGTTTCAC
CAAGAGGCTT ACGGAGGAGA CGCCGGGTTT AACAAGACGG GGCAGACCTT GTTGCCCGCG
CCGAGTGTGG CCAGCTACAA CGGGTTGATC TTTCTGTCGA TGGATCCTGA CGCAGAATCG
CTTGAAGACT ATCTGGGTGA TTTCAGGTTC TATCTCGATT TCTACACCAA GCAAGGCCCC
AACGGTCTTG AGGTGCGAGG TCCGCAGCGT TGGCGGGTAA AAGCGAACTG GAAGATCGCA
GCTGAAAATT TCGCCGGGGA CATGTACCAC ACACCTCAGA CGCACACGTC GGTGGTCGAG
ATCGGCCTGT TCCGAGAGCC GAAGGCTAAC AAGCGCAAAG ACGGCGCCAC GTATTGGGCG
GGTAGAGGTG GGGGCACCAC ATACAAGCTG CCCGAGGGGA GTTTCGAGGA CCGGATGAGC
TACGTGGGCT ACCCGGCGGA CATGATTAGT CGAGCCAAGG CCACCTGGAC CGAGCAGCAG
CAACAAGTCG TCGGCACCGA CGGGTTCATG ATCTCGGCCG CGACGTGTTT TCCCAACATC
AGTTTCGTGC ACAACTGGCC GAAAGTGGAG GACGGGGAGC ACGTCTTGCC GTTCATTTCA
ATCCGGGTGT GGCAGCCAAT CAGCGAGAAC GAAACCGAGG TGCTGTCGTG GTTTGCGGTG
GATTCTGATG CCCCGGCAGA CTTTAAGGCG GACTCGTATA AGGCTTATTT GATGTGCTTC
GGCTCGACGG GAATGTTCGA GCAAGACGAT GTCGAGAACT GGGTGTCGCT GACCAACACC
GCGGGGGGTT CCATGGCCCG CCGACTGCGG CTGAACAGCC GGATGGGGCT GCTCGCAGAC
GATGCCCGGG TGGTCGACAC CCTAAGCAGC GCTCAATTCC ACGGGCCTGG ATACGCTCAG
CTCGGCTACA ACGAGAACAA TCAACGGGAA TTGTTGAGGC TCTGGGCCGA CTACCTCGAC
ATGCCGCCGC TGCGAGTCGA CCCGGCTACT GTGCTCACGG ACAACCCGCA AGGTATTGAG
CCGATGGTGC AGACCAACGG CGGGGCCGTC GCCGGTATCG ACTCGGAGTC GGCTACGACG
TCGGTGACGC TGTGA
 
Protein sequence
MQDHGEVLAA VRTGMIPAHV YNDKQIFSLE KERLFSRAWL FVAHESEIPQ PGDYVVRQVL 
QDSFIVARDS AGEIRVMFNM CLHRGMQVCR AEMGNASNFR CPYHGWSYRN DGRIIGLPFH
QEAYGGDAGF NKTGQTLLPA PSVASYNGLI FLSMDPDAES LEDYLGDFRF YLDFYTKQGP
NGLEVRGPQR WRVKANWKIA AENFAGDMYH TPQTHTSVVE IGLFREPKAN KRKDGATYWA
GRGGGTTYKL PEGSFEDRMS YVGYPADMIS RAKATWTEQQ QQVVGTDGFM ISAATCFPNI
SFVHNWPKVE DGEHVLPFIS IRVWQPISEN ETEVLSWFAV DSDAPADFKA DSYKAYLMCF
GSTGMFEQDD VENWVSLTNT AGGSMARRLR LNSRMGLLAD DARVVDTLSS AQFHGPGYAQ
LGYNENNQRE LLRLWADYLD MPPLRVDPAT VLTDNPQGIE PMVQTNGGAV AGIDSESATT
SVTL