Gene Mmcs_5081 details


Gene Information

Locus tag: Mmcs_5081
Symbol:
ID: 4113910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 5379392
End bp: 5380408
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 70%
IMG OID: 638034239
Product: Rieske (2Fe-2S) region
Protein accession: YP_642241
Protein GI: 108802044
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
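The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5379392
end_bp = 5380408
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not counted as an amino acid.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions, the listed gene length and protein length agree with the start and end coordinates.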


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.040812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCGT TCGCCGACAT CAAGGCCAAG TGGGCGAAAT CGTCACCGTT CCAGGTGCTT 
CCGCATATCG ACTGGGCAGA GCAGAAACCC ACCTACCAGG ATGCGCTGCC GGCGCTGATC
AACGATGCGC TGGCCCGCGC GAAGTCCCGT CCGAGCGGCA ACTGGTTCCC GTTCGCGGCC
AGCGACGCCA TCCGGCGTAA ACCGGTGGGC GCCTCGGTGG GCGGCGTCGA ACTCGTCGCG
TGGCGGGGCG CCTGCGGCGA ACTGCGTGTC GGCCCTGCGA GCTGTCCGCA TCTCGGGGCG
GACCTGTCCA CCGGCACCGT CGACTGCGGC ACGCTGATCT GCCCCTGGCA CGGCCTGCGG
CTGTCCGGGG AGCGCCGCGA ATTCGGGTGG AAACCGTTGC CCGCCTTTGA CGACGGGGTA
CTGGCCTGGG TCCGTCTCGA CCGGGTCGGC GGCGAGCAGC CGACGGACCG CCCGATCATC
CCGGTGCGTC CGGCGGAACC CAGGCTGCAC GCAGTGACCA GCCTGGTCGG TGTCTGCGAA
CCGGACGATG TGATCGCCAA CCGGCTCGAC CCGTGGCACG GCGCCTGGTT CCACCCGTAC
TCGTTCACCC GCCTCGAGGT GCTCAGCGCC CCGGCGGCCG GTGAGGTGCC CGAAGCGGAA
GACCGGTTCC TCGTGGCGGT CACGTTCCGC ATCGGCCGCC TGGGCGTGCC GGTGGTCGCC
GAGTTCATCG CGCCCGGACC GCGCACGATC GTCATGCGGA TCGTCGACGG TGAGGGCGCG
GGCAGCGTCG TGGAAACCCA CGCGACACCC GTCGGTCCGG GTCCGGACGG GCGTCCGCGC
ACCGCGGTGA TCGAAGCCGT TGTCGCACAC TCGGATCGGC GCCGGTTCGG CTACGGGAAG
AAGGTCGCGC CGTTGATCAC GCCGTTCATG CGGCATGCGG CGACGAAGCT GTGGCGCGAC
GACCTCGCGT ATGCGGAGCG CCGTTACGCA GTGCGCTCAC AGCTCAACCG ACGCTGA
 
Protein sequence
MSAFADIKAK WAKSSPFQVL PHIDWAEQKP TYQDALPALI NDALARAKSR PSGNWFPFAA 
SDAIRRKPVG ASVGGVELVA WRGACGELRV GPASCPHLGA DLSTGTVDCG TLICPWHGLR
LSGERREFGW KPLPAFDDGV LAWVRLDRVG GEQPTDRPII PVRPAEPRLH AVTSLVGVCE
PDDVIANRLD PWHGAWFHPY SFTRLEVLSA PAAGEVPEAE DRFLVAVTFR IGRLGVPVVA
EFIAPGPRTI VMRIVDGEGA GSVVETHATP VGPGPDGRPR TAVIEAVVAH SDRRRFGYGK
KVAPLITPFM RHAATKLWRD DLAYAERRYA VRSQLNRR
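
The sequences in this section are printed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The sketch below shows one way to strip the layout whitespace and compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage is not the whole-gene figure reported above.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGAGCGCGT TCGCCGACAT CAAGGCCAAG TGGGCGAAAT CGTCACCGTT CCAGGTGCTT"

seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(seq.startswith("ATG"))       # True: the record begins with an ATG start codon
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base sample only
```

To apply this to the full record, pass the complete multi-line gene sequence to `clean_sequence`; line breaks are removed along with the block spacing.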