Gene Mmcs_1646 details

Gene Information

Locus tag: Mmcs_1646
Symbol:
ID: 4110481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1783964
End bp: 1785145
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 63%
IMG OID: 638030766
Product: Rieske (2Fe-2S) region
Protein accession: YP_638812
Protein GI: 108798615
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
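
The Start bp, End bp, Gene Length and Protein Length fields can be checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon. Neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1783964
end_bp = 1785145

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1182 393
```

Under these assumptions the result agrees with the reported 1182 bp and 393 aa.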


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.719874
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCTTG GACCGACGAT GGAGTTCGAC CACATACAGC CGGCGATCCG GAAACGGTTC 
ACCTCTGCAG CGGACATCCC CAAGGAGGTG TTCAGCGACC CCGACGTCTA CCGGGAAGAG
CTCACCCGCA TCTTCTATGG CCCCTACTGG CATCCGATCG CGCACCGGGC CGAGCTGGCC
GAGCGCAACG CCTTCCGGAC GAGATGGCTG GCCGACGTGC CGCTGCTGAT GGTGCGAGAC
GGCGATGACC GCATCCGCGT GTTCGTCAAC TCCTGCGCCC ATCGGGGAAC GCTACTGGAA
CAGCGCCGGT GCGGGGTGGC GGAGCGATTC GAGTGTCCGT ATCACCGGTG GATCTTCAAC
AATGACGGCC GTTTCGCCGG CGCGCCCCGC CGCATGCAGT TTCGCCCGGA CTTTCGCGAG
GAGGACTACG GCCTCCGGGA GCTGCACGTA GTCGAGGCGT GGGGTTTGAT CTTCGTCAGC
ATGGCTGCGC AGCCGCCGCC GTTCGACGAT TATCTCGGCG ATAGCGCGGA TCCGTTGCGC
GACTGCATGG TCGATGACGG GAACTTGACG TTGCTGGGCT ACCAGACGGT GGTGTTTCAG
AGTAATTGGA AAACCTACAT CGACAACGAT CCCTATCACG CGCCGCTGCT GCACAGCGCA
TTCAAACTGC TCAACTGGCA GGGCGGCAGC GGAAACGTCT TGGTCAGCAA GCCCTATGGG
CACATGTCGA TTCTGTACGA TGCGCAACCC TACGTGGACA ACGGTTTCCT GGCTGACCCG
AGTGTGGTCA CGCGGATGGG GGATGACAGC CGAGCCCGCG TGATTGCGTT ACGGCCGGTT
ACCGGGATCG TGCGTCACGT CGACACGATC AACATCCGGT ACGCCCGCCC GCTGGGGGTT
GATCGTACCG AGGTGCGATA CACGTTCTTC GGCCATGCCA GTGACTCCGA GGACTTCGCA
CGCCACCGAG TCCGCCAGTC GTCAAATCTG CTGGGGCCGA GCGGCTTCAT CAGTATCGAG
GACGCCGCCG TCTACAACCG CGTGCAGGCG ACCGCGCGTG ACGGCGGCTA TCAGCGCTTT
GTCGCCGGCG TCGGCCGACC ATTGTCGGAG TCGTCGCAGA ACGACGAGGT CGCCAATACC
GGCTGGTGGG CGCACTACCA GGAGGTGATG GAGTTTTGCT GA
 
Protein sequence
MKLGPTMEFD HIQPAIRKRF TSAADIPKEV FSDPDVYREE LTRIFYGPYW HPIAHRAELA 
ERNAFRTRWL ADVPLLMVRD GDDRIRVFVN SCAHRGTLLE QRRCGVAERF ECPYHRWIFN
NDGRFAGAPR RMQFRPDFRE EDYGLRELHV VEAWGLIFVS MAAQPPPFDD YLGDSADPLR
DCMVDDGNLT LLGYQTVVFQ SNWKTYIDND PYHAPLLHSA FKLLNWQGGS GNVLVSKPYG
HMSILYDAQP YVDNGFLADP SVVTRMGDDS RARVIALRPV TGIVRHVDTI NIRYARPLGV
DRTEVRYTFF GHASDSEDFA RHRVRQSSNL LGPSGFISIE DAAVYNRVQA TARDGGYQRF
VAGVGRPLSE SSQNDEVANT GWWAHYQEVM EFC
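
The protein sequence can be related to the gene sequence through translation table 11, under which the GTG that opens this gene is read as an initiator methionine. The following is a minimal sketch, not the tool the database used. The helper names `clean`, `translate` and `gc_fraction` are hypothetical. The codon table is the standard genetic code written in the usual TCAG base order, and the start codon set is the list of table 11 initiators (an assumption worth checking against the NCBI genetic code tables).

```python
# Standard amino acid assignments in T, C, A, G base order.
# Table 11 uses these same assignments; it differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed table 11 initiator codons; each is read as M at position 0.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still yields methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "GTGAAGCTTG GACCGACGAT GGAGTTCGAC CACATACAGC CGGCGATCCG GAAACGGTTC"
print(translate(first_row))  # MKLGPTMEFDHIQPAIRKRF
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein and the reported GC content.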