Gene Mmcs_5441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5441 
Symbol 
ID4114526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008147 
Strand
Start bp21884 
End bp23200 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content60% 
IMG OID638034596 
ProductRieske (2Fe-2S) region 
Protein accessionYP_642597 
Protein GI108802401 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.0352555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.90702 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACAA TCGGTTCCAA AACCGGCCCG GCGCTGCCGA TAGACTCCTC GGCCTTCTAC 
GACGAGGCGG TGTACCAGCG GGAACTCGAC TCCATCTTCA AGCGGTCGTG GCTTTTCGTC
GGCCACGAGT CCATGATCCC CAAGCCCGGT GACTTCCGCA CAACCTATAT GGCCGACGAC
GCCGTGATCG TATGCCGGGA CAAGGAATCT CGTGTTCGTG TGCTGCTGAA CAAATGTCGG
CACCGGGGCA ATAAAGTCTG TCAGTTCGAC ATGGGCAACG CGAACATCTT TCACTGCAGC
TATCACGGCT GGAGTTACGA CACTGCCGGA CGGCTGCGCA GTGTTCCGCT AGCCGAGAGC
GCCTACGGGC CGCAGTTCGA TAAGGCCAGC ATGGGCCTCG TCTCCCCCCG GGTAGCTACC
TACAAAGGAC TTATCTTTGC TTGCTGGGAT CAGTCGGCGC CGCCGCTGGA AGAATATCTG
GGGGAGGGCC TGCTGTGGTA CCTCGACAAC TTTCTACTCG ACTGTGACCC CAACGGCCTG
CAGGTCGTTC CCGGCCTGCA CCGCTACCTC ATGCCCGTCA ACTGGAAGTT GTTGGCCGAG
AACTTCGGCG GCGACCAGTA TCACTTCGCT GCCACTCACG GATCGGTTTC CGCACTGTCG
AAAGCTGGGC AGACTGCCCG CATCAACTTT TCGATAGACG AGGGGCAGCA CTACAGCGTG
GTTCTCGACG GCTGCGCGCC CCATGGGTTG CTGCAACTGG CAGTCGGTAA GAATTTCTAT
CAGGACGATC TCGCCCAAGC CGAAACGCTG GGTACCGAAG CAGTCGACTG GCTAACCGAG
CGGCAGCGCC TGCAAGACGA GCGGTTGGCG ACATCCCCCG TGCAGCCCTA CAGCTTTCAC
GTAGCCAATA TCTTCCCAAA CTTCAGCATG ATTGGCATGG GCACGGCTTT TTACGGTCGG
GGATTCATCA TGTGGCAGCC TCGGGGACCG CGGTTGACCG AGGTTTGGGA GTGGTGTTTG
GTGGAAAGCT CCGCGCCCCG TGCTGTCAAG GAACGTATGG TCTTCGTCTT GAGTCAGCGG
CAATCGGCGG CCGGTCTCGT GACGCCCGAT GATCACGAAA ACTTCGAACG ATTGTCTGAC
GCCCTTGACA CCGGAGTCGC CCGGGATGTG CCGTTCAACT ACTCACTCGG CGAGGACGTC
GAACCAATGG AGTCGCTGGT TGCGGAGTTA CCCGGCAATG TCAGGCCGCA GATCAGCGAA
GCCTACCAGC GCGAGTTCTA TCGGCACTGG CACCGAACTA TGACGGAGCC GGCCTAG
 
Protein sequence
METIGSKTGP ALPIDSSAFY DEAVYQRELD SIFKRSWLFV GHESMIPKPG DFRTTYMADD 
AVIVCRDKES RVRVLLNKCR HRGNKVCQFD MGNANIFHCS YHGWSYDTAG RLRSVPLAES
AYGPQFDKAS MGLVSPRVAT YKGLIFACWD QSAPPLEEYL GEGLLWYLDN FLLDCDPNGL
QVVPGLHRYL MPVNWKLLAE NFGGDQYHFA ATHGSVSALS KAGQTARINF SIDEGQHYSV
VLDGCAPHGL LQLAVGKNFY QDDLAQAETL GTEAVDWLTE RQRLQDERLA TSPVQPYSFH
VANIFPNFSM IGMGTAFYGR GFIMWQPRGP RLTEVWEWCL VESSAPRAVK ERMVFVLSQR
QSAAGLVTPD DHENFERLSD ALDTGVARDV PFNYSLGEDV EPMESLVAEL PGNVRPQISE
AYQREFYRHW HRTMTEPA