Gene Mmcs_1698 details


Gene Information

Locus tag: Mmcs_1698
Symbol:
ID: 4110532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1836098
End bp: 1837285
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 65%
IMG OID: 638030817
Product: protocatechuate 3,4-dioxygenase, beta subunit
Protein accession: YP_638863
Protein GI: 108798666
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3485] Protocatechuate 3,4-dioxygenase beta subunit
TIGRFAM ID: [TIGR02422] protocatechuate 3,4-dioxygenase, beta subunit
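
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that check. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the record does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1836098, 1837285)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```

Under these assumptions, both results agree with the Gene Length and Protein Length values listed in this record.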


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCACGGTG GCAAGGGTCG CCGCGTATAC GTCATACAAG AACGGTTCAG CCACCTGGCG 
CAATTGGCCT TCCACCGGCG CCAGCAATCC CGCGTTCCAG ACCAGCCTGC CCACCACGTA
CTGACCGTCG GCGCGGCGCT GCAATGCCGA GCCGGCGACC AACTCGGCCG CAAGGCGGTG
TGTGGTGGCA ATCGGCAGTC CGGCCCGGCG GGCCAACTCG GACAGCGTCA GGCTGTTATG
CCGCTCGTCG AAGGCGCCGA GAATGTTCAG CAAACGCGAG GTGACAGTGG TTCCCGGCGT
CGAAGTGTTC CCTGCCAATT ATGGATGGCC TTTCCGCTCA GTGGAATTCT ACCGTTCGCC
AATCTCAACG TCGTCGGTAC CGTAGGCGCC GTGACAACGT CGATCGACAG CAATCCCGAC
GGCGCCGTAG CGAGCCAGTC GGAGATCAGC GCGGAGATCG GTGCGATCGA GTCCGCCTAC
CAGCGTGCAG GGGTCGAGGA GACGCAGCCG CGCCTGAGTT ATCCGCCTTA CCGGAGCAGC
CTGCTACGGC ATCCGACAAA GGACCTTCAC CACGCCGACC CGGAAGGGGT CGAGCTATGG
ACGCCGTGCT TCTCCGAACG CGACGTTCAC CCGCTGGAGG CCGACCTCAC CGTCCAGCAC
TCGGGTGAAC CCATCGGTGA ACGACTGGTG GTGACCGGCA GGGTCGTCGA CGGCGCAGGG
CGGCCGGTGC GGCGCCAGCT CGTCGAGATT TGGCAGGCCA ACGCCGGCGG ACGTTACATC
CACAAGGGGG ATCAGCACCC GTCCCCAATC GACCCCAACT TCACCGGCGC CGGCCGCTGT
TTGACCGACG AGGACGGCAT CTACCGGTTC ACCACGATCA AGCCGGGGCC GTATCCGTGG
AAGAACCACC GCAACGCGTG GCGGCCCGCA CACATCCACT TCTCGCTCTT CGGCACGGAA
TTCACGCAGC GAATGGTCAC CCAGATGTAC TTCCCGGGTG ACCCGCTCCT CTGCCTTGAT
CCGATCTTCC AGGCGATCCC GGATCAGAAG GCGCGCAGCC GGCTGGTGGC CAGCTACGAT
CACGAACTCA GCACCCACGA ATGGGCTACC GGCTACCGAT GGGACGTCGT CCTGACCGGG
TCGGCGCGCA CCCCAATCGA GAACCTCGGC CGCGGAGCCC ACCGCTGA
 
Protein sequence
MHGGKGRRVY VIQERFSHLA QLAFHRRQQS RVPDQPAHHV LTVGAALQCR AGDQLGRKAV 
CGGNRQSGPA GQLGQRQAVM PLVEGAENVQ QTRGDSGSRR RSVPCQLWMA FPLSGILPFA
NLNVVGTVGA VTTSIDSNPD GAVASQSEIS AEIGAIESAY QRAGVEETQP RLSYPPYRSS
LLRHPTKDLH HADPEGVELW TPCFSERDVH PLEADLTVQH SGEPIGERLV VTGRVVDGAG
RPVRRQLVEI WQANAGGRYI HKGDQHPSPI DPNFTGAGRC LTDEDGIYRF TTIKPGPYPW
KNHRNAWRPA HIHFSLFGTE FTQRMVTQMY FPGDPLLCLD PIFQAIPDQK ARSRLVASYD
HELSTHEWAT GYRWDVVLTG SARTPIENLG RGAHR
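
The protein sequence follows from the gene sequence under translation table 11, in which alternative initiation codons such as GTG are read as methionine at the first position. The sketch below shows one way to reproduce that translation and a GC fraction in plain Python. It is an illustration under stated assumptions, not the method the database used: the helper names (`translate_cds`, `gc_fraction`) are hypothetical, the codon-to-residue assignments are those of the standard genetic code (which table 11 shares), and the start-codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (standard code; table 11 uses the
# same codon-to-residue assignments and differs only in permitted start codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11 (assumption: NCBI table).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS; a recognized start codon at position 1 is read as M."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the terminal stop symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases of the gene sequence listed in this record.
first_line = "GTGCACGGTG GCAAGGGTCG CCGCGTATAC GTCATACAAG AACGGTTCAG CCACCTGGCG"
print(translate_cds(first_line))  # MHGGKGRRVYVIQERFSHLA
```

The printed residues match the first 20 residues of the protein sequence in this record. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the remaining residues and the listed GC content to be checked the same way.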