Gene Mmcs_4640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4640 
Symbol 
ID4113469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4913229 
End bp4914386 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content65% 
IMG OID638033791 
ProductRieske (2Fe-2S) region 
Protein accessionYP_641800 
Protein GI108801603 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTACCG ACACCGCCCA CAGCGGCATT CGCGAGATCG ACACCGGAAC CCTGCCCGAC 
CGGTACGCCA GGGGCTGGCA CTGCCTCGGC CCGGTCAACG ACTACCTCGA CGGCGAACCG
CACTCCGTCG AGGCGTTCGG CACCAAACTC GTGGTGTTCG CCGATTCGAA GGGCGACGTC
AAGATCCTCG ACGGCTACTG CCGGCACATG GGCGGCGACC TGTCCCAGGG CACCATCAAG
GGTGACGAGG TCGCCTGCCC CTTCCACGAC TGGCGCTGGG GCGGCGACGG CAAGTGCAAG
CTCGTGCCCT ACGCCAAGCG GACGCCGCGG CTGGCCCGCA CCCGCGCCTG GACCACCGAC
GTGCGCAGCG GTCTGCTGTT CGTCTGGCAC GACCACGAGG GCAACCCGCC TCCCCCCGAG
GTGCGCATCC CCGAGATCCC GGAGTTCGCC AGCGACGAGT GGACCGACTG GCGGTGGAAC
TCGATCCTGA TCGAGGGCGC GAACTGCCGC GAGATCATCG ACAACGTCAC CGACATGGCG
CACTTCTTCT ACATCCACTT CGGGCTGCCC ACGTACTTCA AGAACGTGTT CGAGGGCCAC
ATCGCCAGCC AGTACCTGCA CAATGTGGGC CGCCCCGACG TCAACGACAT GGGCACCACC
TACGGCGAAG CGCACCTCGA CTCCGAGGCG TCGTATTTCG GGCCGTCGTT CATGATCAAT
TGGCTGCACA ACAACTACGG CGGCTACAAG GCCGAGTCCA TCCTGATCAA CTGCCACTAC
CCGGTGACCC AGGATTCGTT CGTGCTGCAG TGGGGCGTCA TCGTCGAGAA GCCCAAGGGC
ATGGACGAGA AGATGACCGA CAAGCTGGCG CGGACCTTCA CCGACGGCGT CAGCAAGGGC
TTCCTGCAGG ACGTCGAGAT CTGGAAGCAC AAGACGCGTA TCGACAATCC GCTGCTGGTC
GAAGAGGACG GCGCGGTCTA CCAGCTGCGC CGCTGGTATC AGCAGTTCTA CGTCGACGTC
GCCGACGTGA CCCCGGAGAT GACCGACCGT TTCGAGATCG AGGTCGACAC CACCGCGGCC
AACGAGTACT GGAACACCGA GGTTCAGGAG AATCTCGCGC GCCGCGAGGG CGAGAAAGCC
GAACAGCCGA CCCCATGA
 
Protein sequence
MSTDTAHSGI REIDTGTLPD RYARGWHCLG PVNDYLDGEP HSVEAFGTKL VVFADSKGDV 
KILDGYCRHM GGDLSQGTIK GDEVACPFHD WRWGGDGKCK LVPYAKRTPR LARTRAWTTD
VRSGLLFVWH DHEGNPPPPE VRIPEIPEFA SDEWTDWRWN SILIEGANCR EIIDNVTDMA
HFFYIHFGLP TYFKNVFEGH IASQYLHNVG RPDVNDMGTT YGEAHLDSEA SYFGPSFMIN
WLHNNYGGYK AESILINCHY PVTQDSFVLQ WGVIVEKPKG MDEKMTDKLA RTFTDGVSKG
FLQDVEIWKH KTRIDNPLLV EEDGAVYQLR RWYQQFYVDV ADVTPEMTDR FEIEVDTTAA
NEYWNTEVQE NLARREGEKA EQPTP