Gene Mjls_1601 details

Gene Information

Locus tag: Mjls_1601
Symbol:
ID: 4881510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 1703768
End bp: 1704853
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 62%
IMG OID: 640138906
Product: cupin 2 domain-containing protein
Protein accession: YP_001069889
Protein GI: 126434198
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
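
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1703768, 1704853)
print(length, protein_length_aa(length))  # 1086 361
```

Under these assumptions the computed values agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields of the record.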


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACCG CTGAGATTTC AGGGCTTCAC GAATTCGACG CCGAGCTCGA GGCTGCCAAC 
CTACGTGGAC AATGGATCTA CGACGAAATG CTGGAGAGCG TCGTCGGCGG GCCCAAGCCT
GCGGGTGTTC CCTTTCTGTG GCGATGGCAG GACGTTCACG CGAAGCTTCT GAAGTCGTGC
GACGTGATGC CTGAAAGTTT GACGGCGCGA CGCAATCTCT CGTTCATCAA CCCGGATGCC
CGAGGAACCA CGCACACCAT CAACATGGGT ATGCAGATGC TCAAGCCCGG CGAGATTGCC
TATGCGCACC GCCACACCAT GGCGGCGCTC CGGTTCGCTA TTCAAGGCGG CCCCGGCCTG
GTGACTGTGG TGGATGGCGA GCCTTGTCAA ATGGATACCT ACGACCTGGT TCTGACCCCT
CGCTGGACGT GGCATGACCA TGAGAACGCC ACCTCGGAGA ACGTCGTTTG GCTCGACGTG
CTCGATATCG GCCTAGTGCT CGGGCTGAAT GTTCCCTTCT ATGAGCCCTA TGGCGAGAAG
CGCCAACCTC AACGCGAGGA CCCGGGGGAG CATCTCGCTG ACCGCGGTGG GATGCTGCGC
CCGGCGTGGG AGCAGGTCAA GGCGGCGAAC TTCCCGTACC GCTATCCTTG GCGTGACGTC
GAGCGGCAGC TCCAGCGGAT GGCGGGCCTT GCGGGCAGTC CCTACGACGG CGTAGTCCTG
CGTTATGCGA ACCCCGTTAC CGGCGGATCG ACTATGCCAA CGCTGGATTG CTGGGTGCAG
TTGCTGCGGC CGGGCCAGCA GACCGAGGCC CATCGCCACA CGTCGAGTGC CGTGTATTTC
GTCGTGCGCG GTGAGGGAAC TACGGTTGTC GACGGGGTCG AACTCGACTG GGGGCCCCAC
GACAGCTTCG TGGTGCCCAA CTGGAGCACC CATCACTTCG TCAACCGGTC GGCGGAAAAT
GCGTTGCTGT TCTCGGTCAA CGACATCCCT ACATTGAAGG CTCTCGATCT CTACTACGAA
GAGCCCGAGC TGTCTTTGGG GACGCAGCCA TTTCCGCCGG TCCCCGCTAA CCTCCGAGCC
CGCTGA
 
Protein sequence
MSTAEISGLH EFDAELEAAN LRGQWIYDEM LESVVGGPKP AGVPFLWRWQ DVHAKLLKSC 
DVMPESLTAR RNLSFINPDA RGTTHTINMG MQMLKPGEIA YAHRHTMAAL RFAIQGGPGL
VTVVDGEPCQ MDTYDLVLTP RWTWHDHENA TSENVVWLDV LDIGLVLGLN VPFYEPYGEK
RQPQREDPGE HLADRGGMLR PAWEQVKAAN FPYRYPWRDV ERQLQRMAGL AGSPYDGVVL
RYANPVTGGS TMPTLDCWVQ LLRPGQQTEA HRHTSSAVYF VVRGEGTTVV DGVELDWGPH
DSFVVPNWST HHFVNRSAEN ALLFSVNDIP TLKALDLYYE EPELSLGTQP FPPVPANLRA
R
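
The sequences above are printed in space-separated blocks of ten characters, sixty per row. A minimal sketch of how such a listing might be joined into a contiguous string and its GC percentage computed is given below. The helper names are hypothetical, and only the first row of the gene sequence is used as sample input, so the printed value describes that row and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGTCGACCG CTGAGATTTC AGGGCTTCAC GAATTCGACG CCGAGCTCGA GGCTGCCAAC"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this row only
```

Passing the full gene sequence through the same two functions would allow its length and GC content to be compared with the Gene Length and GC content fields.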