Gene Mmcs_1788 details

Gene Information

Locus tag: Mmcs_1788
Symbol:
ID: 4110622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1928983
End bp: 1929981
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 65%
IMG OID: 638030908
Product: Rieske (2Fe-2S) region
Protein accession: YP_638953
Protein GI: 108798756
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
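
The length fields in this record can be cross-checked against the coordinates. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 1928983
END_BP = 1929981


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded, assuming an unspliced CDS ending in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 999 332
```

Under those assumptions the result matches the reported gene length (999 bp) and protein length (332 aa).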


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.651662
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGTTC CGTTCACCTG GAAGGTCACC GGGTGGTTCA TGGTCGGGTG GTCGGCGGAA 
TTCGTCTCCG GCGAGACGCG GGCGCTGCAC TACTTCGGCG ACGATCTGGT CGCCTACCGG
GACGAGTCGG ACACCCTGCA TGTCCTGGAG GCGCACTGCA AACACCTCGG TGCGCATCTC
GGCCACGGCG GAAAGGTGGT CGGCGACTGC GTGGAGTGCC CGTTCCACGG CTGGCGCTGG
GGTCCACAGG GCGACAACAC CTACATCCCC TATCAACCCG ACCGGCCGAA CCGGGCACTG
AAGCTGAGGG TGTACCCCGT CGTCGAGCAG TACGGCTGCG TCTTCGTCTG GCACCATCCC
GACGGCGCGC CACCGCAGTG GCCGCTGCCG GACCTGTTCG AGAAGTTCCC CCAGTTCCCC
ACCGACCCGG ATGCGTACTA CCGGCCGTAT CCCGAGTTCT CCAGCCGCGC CGAGAACGAA
CCGGTGCACC CGCAGATCGT CGCCGAGAAC GGCCCGGACA GTTCACACTT CCGCTACGTC
CACGGCGCCT CGGTGACGCC GGTCTGCCTG AACTGGGAGG TGGTCGGTGA GGAGTGGCGC
TTCCTCACCG GCTGGCCGGA TCCGCGCAGC GACGATCCGG ACAAGATGGC GCTGTTCATC
CATAGCCACT TCTCCGGGCT GGGGTTCGCC GTGAGCGTCT TCGAGGGTTC GTCGAACCAT
CGGCTGATCT TCGCGTGCAC CCCGGTCGAC GACGGGCTCT CGGACATGTT CTATTCGATC
TGGTGGCCCA AGGTCGACGG GGAGACCTCC GACGTCCCAC CGGACGATGT CCGCGCCCGG
GTGGAGAGAC AGTTCCTGCG CACGGTCTGG GAGGATCTCG ACATCTGGCG CTATCAGCGC
TATGTCGAAC GGCCGCCGCT GGCCAAGATC GACGCGAAAC CGTATATGGC GATGCGGGAG
TGGGCCAAAC AGTTCTACGA CGTGTCCGCA TCGGTATGA
 
Protein sequence
MKVPFTWKVT GWFMVGWSAE FVSGETRALH YFGDDLVAYR DESDTLHVLE AHCKHLGAHL 
GHGGKVVGDC VECPFHGWRW GPQGDNTYIP YQPDRPNRAL KLRVYPVVEQ YGCVFVWHHP
DGAPPQWPLP DLFEKFPQFP TDPDAYYRPY PEFSSRAENE PVHPQIVAEN GPDSSHFRYV
HGASVTPVCL NWEVVGEEWR FLTGWPDPRS DDPDKMALFI HSHFSGLGFA VSVFEGSSNH
RLIFACTPVD DGLSDMFYSI WWPKVDGETS DVPPDDVRAR VERQFLRTVW EDLDIWRYQR
YVERPPLAKI DAKPYMAMRE WAKQFYDVSA SV
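
The sequences above are displayed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and only the first display line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, then upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence; paste all lines to analyse the whole gene.
first_line = "ATGAAGGTTC CGTTCACCTG GAAGGTCACC GGGTGGTTCA TGGTCGGGTG GTCGGCGGAA"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], f"{gc_fraction(seq):.0%}")
```

Running the same functions on the full gene sequence gives a value to compare with the GC content reported in the Gene Information section.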