Gene Mmcs_1622 details


Gene Information

Locus tag: Mmcs_1622
Symbol:
ID: 4110458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1758941
End bp: 1760395
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 59%
IMG OID: 638030743
Product: ring hydroxylating dioxygenase, alpha subunit
Protein accession: YP_638789
Protein GI: 108798592
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
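
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The sketch below assumes inclusive, 1-based coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop.

    Assumption: the CDS length is a whole number of codons and exactly
    one terminal stop codon is excluded from the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1758941, 1760395)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1455 484
```

Both results match the Gene Length and Protein Length fields reported in this record.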


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0299575
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGATC ACGGTGAGGT GTTAGCGGCT GTACGCACTG GCATGATCCC GGCGCACGTG 
TATAACGACA AGCAAATTTT CTCGCTCGAG AAGGAGCGCC TGTTCAGTCG GGCGTGGTTG
TTCGTGGCGC ACGAGTCGGA GATTCCGCAG CCGGGGGACT ACGTGGTCAG ACAAGTGTTG
CAGGATTCGT TCATCGTCGC TCGTGATTCT GCCGGCGAGA TCCGGGTGAT GTTCAATATG
TGCCTCCATC GCGGTATGCA GGTTTGCCGG GCGGAAATGG GGAACGCGTC GAACTTCAGA
TGCCCGTACC ACGGGTGGTC TTACCGCAAT GACGGTCGCA TTATTGGCCT GCCGTTTCAC
CAAGAGGCTT ACGGAGGAGA CGCCGGGTTT AACAAGACGG GGCAGACCTT GTTGCCCGCG
CCGAGTGTGG CCAGCTACAA CGGGTTGATC TTTCTGTCGA TGGATCCTGA CGCAGAATCG
CTTGAAGACT ATCTGGGTGA TTTCAGGTTC TATCTCGATT TCTACACCAA GCAAGGCCCC
AACGGTCTTG AGGTGCGAGG TCCGCAGCGT TGGCGGGTAA AAGCGAACTG GAAGATCGCA
GCTGAAAATT TCGCCGGGGA CATGTACCAC ACACCTCAGA CGCACACGTC GGTGGTCGAG
ATCGGCCTGT TCCGAGAGCC GAAGGCTAAC AAGCGCAAAG ACGGCGCCAC GTATTGGGCG
GGTAGAGGTG GGGGCACCAC ATACAAGCTG CCCGAGGGGA GTTTCGAGGA CCGGATGAGC
TACGTGGGCT ACCCGGCGGA CATGATTAGT CGAGCCAAGG CCACCTGGAC CGAGCAGCAG
CAACAAGTCG TCGGCACCGA CGGGTTCATG ATCTCGGCCG CGACGTGTTT TCCCAACATC
AGTTTCGTGC ACAACTGGCC GAAAGTGGAG GACGGGGAGC ACGTCTTGCC GTTCATTTCA
ATCCGGGTGT GGCAGCCAAT CAGCGAGAAC GAAACCGAGG TGCTGTCGTG GTTTGCGGTG
GATTCTGATG CCCCGGCAGA CTTTAAGGCG GACTCGTATA AGGCTTATTT GATGTGCTTC
GGCTCGACGG GAATGTTCGA GCAAGACGAT GTCGAGAACT GGGTGTCGCT GACCAACACC
GCGGGGGGTT CCATGGCCCG CCGACTGCGG CTGAACAGCC GGATGGGGCT GCTCGCAGAC
GATGCCCGGG TGGTCGACAC CCTAAGCAGC GCTCAATTCC ACGGGCCTGG ATACGCTCAG
CTCGGCTACA ACGAGAACAA TCAACGGGAA TTGTTGAGGC TCTGGGCCGA CTACCTCGAC
ATGCCGCCGC TGCGAGTCGA CCCGGCTACT GTGCTCACGG ACAACCCGCA AGGTATTGAG
CCGATGGTGC AGACCAACGG CGGGGCCGTC GCCGGTATCG ACTCGGAGTC GGCTACGACG
TCGGTGACGC TGTGA
 
Protein sequence
MQDHGEVLAA VRTGMIPAHV YNDKQIFSLE KERLFSRAWL FVAHESEIPQ PGDYVVRQVL 
QDSFIVARDS AGEIRVMFNM CLHRGMQVCR AEMGNASNFR CPYHGWSYRN DGRIIGLPFH
QEAYGGDAGF NKTGQTLLPA PSVASYNGLI FLSMDPDAES LEDYLGDFRF YLDFYTKQGP
NGLEVRGPQR WRVKANWKIA AENFAGDMYH TPQTHTSVVE IGLFREPKAN KRKDGATYWA
GRGGGTTYKL PEGSFEDRMS YVGYPADMIS RAKATWTEQQ QQVVGTDGFM ISAATCFPNI
SFVHNWPKVE DGEHVLPFIS IRVWQPISEN ETEVLSWFAV DSDAPADFKA DSYKAYLMCF
GSTGMFEQDD VENWVSLTNT AGGSMARRLR LNSRMGLLAD DARVVDTLSS AQFHGPGYAQ
LGYNENNQRE LLRLWADYLD MPPLRVDPAT VLTDNPQGIE PMVQTNGGAV AGIDSESATT
SVTL
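
The sequences above are printed in blocks of ten characters separated by spaces. As a minimal sketch, the following shows how such a layout might be collapsed into a plain string and how a GC percentage of the kind reported under Gene Information could be recomputed. The helper names are hypothetical, and the short demo string is invented for illustration rather than taken from this record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Made-up demo input in the same block layout as the record.
demo = clean_sequence("ATGC GGCC\nATAT")
print(demo, gc_percent(demo))  # ATGCGGCCATAT 50.0
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` would give a figure to compare against the rounded GC content field.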