Gene Mmcs_5458 details

Gene Information

Locus tag: Mmcs_5458
Symbol:
ID: 4114543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008147
Strand:
Start bp: 37792
End bp: 39135
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 59%
IMG OID: 638034613
Product: ring hydroxylating dioxygenase, alpha subunit
Protein accession: YP_642614
Protein GI: 108802418
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
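
The Start bp, End bp, Gene Length and Protein Length values above are consistent with each other. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either point, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 37792
end_bp = 39135

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, leftover, protein_length)  # 1344 0 447
```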


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATTCGA TGGCGCCTGA TGCGACGACA ATGCGAACAT TAGAGAATGC GCGCGGCTCC 
ATCCTAAAGG GTCGCCTCCC TGCGTCTCTC ATCGCTAATG CAGCGCTTTA CGAGCTTGAA
TTGAAGCGAG TATTTGGTAG GACCTGGCAG TTTCTCTGCC ACGAAGACGA GATCCCCAAT
GCGGGTGACT ATGTAGTGCG CTACATCGCT GATAACTCAA TTATTGTCGC GCGGCAGCAG
GATATGACGA TTCGGGCGAT GTCGAACTCG TGTCGGCACC GCGGCACGCT GCTGTGCCGA
ACCGAGTCTG GGAATGAGTC GGCGTTCCAG TGTCCGTACC ACGGTTGGAC CTATCGAAAC
AACGGTGATC TCATCGCGAT ACCTGCGCAG CAGGCAGTGT ACGGTGCTGC GTTCGACAAG
AGTCGGCTAG GGTTGCGCGC TCTGCCGATG CTGGACTCGT ACGCGGGCCT TGTCTTCGGG
TGTGTGTCGG ATGAGGCGCC GGGACTGGAT GAGTACCTCG GGGACATGCG CTGGTATCTC
GACTTGATGA TGAAGAAGAG CCCGGCCGGC CTTGAGGCGT GGGGTGCCCC GCAGCGTTGG
GTGATTGACG CGAACTGGAA GACCGGCGCC GATAACTTTG TTGGGGACGG CTATCACACG
GTCATGACGC ACCGTTCGAT GTGCGAGCTG GGGTTGTTAC CGCCCGATAA TGTGGCCGTT
TCGCCGGCCC ACGTCAGCCT ATCGGGCGGG CACGGGGCGG GCGTTCTTGG CGCACCACCC
GGCGTACCCG CACCGCCGTA TATGGGCTAT CCCGAGGAAG TCGTCGCCGG TCTCAGCGAG
GGTTACGGCG ATGACGTCCA TGGCGAGTTG CTGAAACGGA CGATGTTCAT TCATGGCAAT
GTGTTCCCGA ACTTGTCCTT CTTGAACGCC TTCATCGCCA AGGACGGGGA GTCTATGCCG
GTGCCCATTC TGACCTTGCG GCAATGGCGT CCCTTGGACG CAGCGCGTAT GGAGGTGTGG
TCGTGGTTCT TCGTGGAGCG CAACGCGCCC GAGGAGTTCA AGCAGCAGTC GTTTGAGACT
TATGTTCGGA CGTTCGGGGT CGGGGGTGTC TTCGAGCAGG ATGACGCCGA GATATTTCAG
GCTATTACCA AGGGAACACG CGGCGAGTTG GCTGGTGGTG TGGAGCTGAA CCTGGAGATG
GGACTGGACA ATCTGGCTCC TGATCCAACG TGGCTGGGCC CGGGACGACC GTTGGCCAGT
GGCTACGCCG AACAGAATCA GCGCGAGTAC TGGAAGCAGT ACTTCGACTA TCTGGCCACA
CCGAGAAGGG ATGAGAACGT ATGA
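
The GC content field can be recomputed from the gene sequence. The sketch below is one straightforward way to do so in Python. The function name `gc_percent` is hypothetical, and the example call uses only the first row of the sequence, so its result will generally differ from the whole-gene figure.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First row of the gene sequence, used only as sample input.
first_row = "GTGGATTCGA TGGCGCCTGA TGCGACGACA ATGCGAACAT TAGAGAATGC GCGCGGCTCC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence as a single string gives the value to compare with the GC content field.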
 
Protein sequence
MDSMAPDATT MRTLENARGS ILKGRLPASL IANAALYELE LKRVFGRTWQ FLCHEDEIPN 
AGDYVVRYIA DNSIIVARQQ DMTIRAMSNS CRHRGTLLCR TESGNESAFQ CPYHGWTYRN
NGDLIAIPAQ QAVYGAAFDK SRLGLRALPM LDSYAGLVFG CVSDEAPGLD EYLGDMRWYL
DLMMKKSPAG LEAWGAPQRW VIDANWKTGA DNFVGDGYHT VMTHRSMCEL GLLPPDNVAV
SPAHVSLSGG HGAGVLGAPP GVPAPPYMGY PEEVVAGLSE GYGDDVHGEL LKRTMFIHGN
VFPNLSFLNA FIAKDGESMP VPILTLRQWR PLDAARMEVW SWFFVERNAP EEFKQQSFET
YVRTFGVGGV FEQDDAEIFQ AITKGTRGEL AGGVELNLEM GLDNLAPDPT WLGPGRPLAS
GYAEQNQREY WKQYFDYLAT PRRDENV
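
The protein sequence follows from the gene sequence under translation table 11. That table uses the standard amino acid assignments and also allows alternative start codons such as GTG, which is read as methionine (M) at the first position. The Python sketch below illustrates this on the first and last blocks of the gene sequence. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical helpers, not part of any database tool.

```python
# Codon table built in the NCBI base order T, C, A, G.
# Table 11 shares the standard amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons allowed by table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon, e.g. GTG here
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":              # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGGATTCGA TGGCGCCTGA TGCGACGACA"))  # MDSMAPDATT
# Last 24 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("CCGAGAAGGG ATGAGAACGT ATGA"))        # PRRDENV
```

Both outputs match the start and end of the protein sequence listed above.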