Gene Mkms_1654 details

Gene Information

Locus tag: Mkms_1654
Symbol:
ID: 4613942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1769605
End bp: 1770690
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 62%
IMG OID: 639791325
Product: cupin 2 domain-containing protein
Protein accession: YP_937651
Protein GI: 119867699
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
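
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1769605, 1770690)
print(length_bp)                  # 1086
print(protein_length(length_bp))  # 361
```

Both values agree with the Gene Length and Protein Length fields of the record.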


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCG CCGAGAGTTC AGAGCTCCGC GAATTTGACG TCGAGCTCGA GGCTGCCAAC 
TTACGTGGGC AATGGATATA CGACGAAATG CTGGAAAGCG TCGTCGGCGG CCCCAAGCCC
GCCGGTGTTC CGTTTCTGTG GCGATGGCAC GATGTTTACG CGAAGCTTCT GAAGTCGTGC
GACGTGATGC CTGAAAGTTT GACGGCGCGA CGCAATCTCT CGTTCATCAA TCCGGATGCC
CGGGGAACCA CGCACACCAT GAACATGGGT ATGCAGATGC TCAAGCCCGG CGAGATTGCC
TATGCGCACC GCCATACCAT GGCAGCGCTG CGGTTCGCTA TTCAAGGCGG CCCCGGCCTG
GTGACTGTGG TGGATGGCGA GCCTTGTCAA ATGGATACCT ACGACCTGGT TCTGACCCCT
CGCTGGACGT GGCATGACCA TGAGAATGCC ACCTCGGAGA ACGTCGTTTG GCTCGACGTG
CTCGATATCG GCCTAGTGCT CGGGCTGAAT GTTCCCTTCT ATGAGTCCTA TGGCGAGAAG
CGCCAACCTC AACGCGAGGA CCCGGGGGAG CATCTCGCTG ACCGCGGTGG GATGCTGCGC
CCTGCGTGGG AGCAGGTCAA GGCGGCGAAC TTCCCGTACC GCTATCCTTG GCGTGACGTC
GAGCGGCAGC TCCAGCGGAT GGCGGGCCTT GCGGGCAGTC CTTACGACGG CGTAGTCCTG
CGTTATGCGA ACCCCGTTAC CGGCGGATCG ACTATGCCAA CGCTGGATTG CTGGGTGCAG
TTGCTGCGGC CGGGCCAGCG GACCGAGGCC CATCGCCACA CGTCGAGTGC CGTGTATTTC
GTCGTGCGCG GTGAGGGAAC TACGGTTGTC GACGGGGTCG AACTCGACTG GGGGCCCCAC
GACAGCTTCG TGGTGCCCAA CTGGAGCACC CATCACTTCG TCAACCGGTC GGCGGAAGAT
GCGTTGCTGT TCTCGGTCAA CGACATCCCT ACATTGAAGG CTCTCGATCT CTACTACGAA
GAGCCCGAGC TGTCTTTGGG GACGCAGCCA TTTCCGCCGG TCCCCGCTAA CCTCCGAGCC
CGCTGA
 
Protein sequence
MSTAESSELR EFDVELEAAN LRGQWIYDEM LESVVGGPKP AGVPFLWRWH DVYAKLLKSC 
DVMPESLTAR RNLSFINPDA RGTTHTMNMG MQMLKPGEIA YAHRHTMAAL RFAIQGGPGL
VTVVDGEPCQ MDTYDLVLTP RWTWHDHENA TSENVVWLDV LDIGLVLGLN VPFYESYGEK
RQPQREDPGE HLADRGGMLR PAWEQVKAAN FPYRYPWRDV ERQLQRMAGL AGSPYDGVVL
RYANPVTGGS TMPTLDCWVQ LLRPGQRTEA HRHTSSAVYF VVRGEGTTVV DGVELDWGPH
DSFVVPNWST HHFVNRSAED ALLFSVNDIP TLKALDLYYE EPELSLGTQP FPPVPANLRA
R
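
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own method: it builds the standard codon-to-amino-acid mapping (translation table 11 uses the same amino acid assignments as the standard code and differs mainly in which codons may act as start codons), strips the spaces and line breaks used in the listing, and stops at the first stop codon. The names `translate`, `gc_fraction`, and `clean` are hypothetical helpers.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to format the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGTCCACCG CCGAGAGTTC AGAGCTCCGC"))  # MSTAESSELR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be compared against the listing.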