Gene Mmcs_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1628 
Symbol 
ID4110464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1765451 
End bp1766521 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID638030749 
Productcupin 2, barrel 
Protein accessionYP_638795 
Protein GI108798598 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3435] Gentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR02272] gentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACCG CCGAGAGTTC AGAGCTCCGC GAATTTGACG TCGAGCTCGA GGCTGCCAAC 
TTACGTGGGC AATGGATATA CGACGAAATG CTGGAAAGCG TCGTCGGCGG CCCCAAGCCC
GCCGGTGTTC CGTTTCTGTG GCGATGGCAC GATGTTTACG CGAAGCTTCT GAAGTCGTGC
GACGTGATGC CTGAAAGTTT GACGGCGCGA CGCAATCTCT CGTTCATCAA TCCGGATGCC
CGGGGAACCA CGCACACCAT GAACATGGGT ATGCAGATGC TCAAGCCCGG CGAGATTGCC
TATGCGCACC GCCATACCAT GGCAGCGCTG CGGTTCGCTA TTCAAGGCGG CCCCGGCCTG
GTGACTGTGG TGGATGGCGA GCCTTGTCAA ATGGATACCT ACGACCTGGT TCTGACCCCT
CGCTGGACGT GGCATGACCA TGAGAATGCC ACCTCGGAGA ACGTCGTTTG GCTCGACGTG
CTCGATATCG GCCTAGTGCT CGGGCTGAAT GTTCCCTTCT ATGAGTCCTA TGGCGAGAAG
CGCCAACCTC AACGCGAGGA CCCGGGGGAG CATCTCGCTG ACCGCGGTGG GATGCTGCGC
CCTGCGTGGG AGCAGGTCAA GGCGGCGAAC TTCCCGTACC GCTATCCTTG GCGTGACGTC
GAGCGGCAGC TCCAGCGGAT GGCGGGCCTT GCGGGCAGTC CTTACGACGG CGTAGTCCTG
CGTTATGCGA ACCCCGTTAC CGGCGGATCG ACTATGCCAA CGCTGGATTG CTGGGTGCAG
TTGCTGCGGC CGGGCCAGCG GACCGAGGCC CATCGCCACA CGTCGAGTGC CGTGTATTTC
GTCGTGCGCG GTGAGGGAAC TACGGTTGTC GACGGGGTCG AACTCGACTG GGGGCCCCAC
GACAGCTTCG TGGTGCCCAA CTGGAGCACC CATCACTTCG TCAACCGGTC GGCGGAAGAT
GCGTTGCTGT TCTCGGTCAA CGACATCCCT ACATTGAAGG CTCTCGATCT CTACTACGAA
GAGCCCGAGC TGTCTTTGGG GACGCAGCCA TTTCCGCCGG TCCCCGGCTA A
 
Protein sequence
MSTAESSELR EFDVELEAAN LRGQWIYDEM LESVVGGPKP AGVPFLWRWH DVYAKLLKSC 
DVMPESLTAR RNLSFINPDA RGTTHTMNMG MQMLKPGEIA YAHRHTMAAL RFAIQGGPGL
VTVVDGEPCQ MDTYDLVLTP RWTWHDHENA TSENVVWLDV LDIGLVLGLN VPFYESYGEK
RQPQREDPGE HLADRGGMLR PAWEQVKAAN FPYRYPWRDV ERQLQRMAGL AGSPYDGVVL
RYANPVTGGS TMPTLDCWVQ LLRPGQRTEA HRHTSSAVYF VVRGEGTTVV DGVELDWGPH
DSFVVPNWST HHFVNRSAED ALLFSVNDIP TLKALDLYYE EPELSLGTQP FPPVPG