Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_1628 |
Symbol | |
ID | 4110464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 1765451 |
End bp | 1766521 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638030749 |
Product | cupin 2, barrel |
Protein accession | YP_638795 |
Protein GI | 108798598 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACCG CCGAGAGTTC AGAGCTCCGC GAATTTGACG TCGAGCTCGA GGCTGCCAAC TTACGTGGGC AATGGATATA CGACGAAATG CTGGAAAGCG TCGTCGGCGG CCCCAAGCCC GCCGGTGTTC CGTTTCTGTG GCGATGGCAC GATGTTTACG CGAAGCTTCT GAAGTCGTGC GACGTGATGC CTGAAAGTTT GACGGCGCGA CGCAATCTCT CGTTCATCAA TCCGGATGCC CGGGGAACCA CGCACACCAT GAACATGGGT ATGCAGATGC TCAAGCCCGG CGAGATTGCC TATGCGCACC GCCATACCAT GGCAGCGCTG CGGTTCGCTA TTCAAGGCGG CCCCGGCCTG GTGACTGTGG TGGATGGCGA GCCTTGTCAA ATGGATACCT ACGACCTGGT TCTGACCCCT CGCTGGACGT GGCATGACCA TGAGAATGCC ACCTCGGAGA ACGTCGTTTG GCTCGACGTG CTCGATATCG GCCTAGTGCT CGGGCTGAAT GTTCCCTTCT ATGAGTCCTA TGGCGAGAAG CGCCAACCTC AACGCGAGGA CCCGGGGGAG CATCTCGCTG ACCGCGGTGG GATGCTGCGC CCTGCGTGGG AGCAGGTCAA GGCGGCGAAC TTCCCGTACC GCTATCCTTG GCGTGACGTC GAGCGGCAGC TCCAGCGGAT GGCGGGCCTT GCGGGCAGTC CTTACGACGG CGTAGTCCTG CGTTATGCGA ACCCCGTTAC CGGCGGATCG ACTATGCCAA CGCTGGATTG CTGGGTGCAG TTGCTGCGGC CGGGCCAGCG GACCGAGGCC CATCGCCACA CGTCGAGTGC CGTGTATTTC GTCGTGCGCG GTGAGGGAAC TACGGTTGTC GACGGGGTCG AACTCGACTG GGGGCCCCAC GACAGCTTCG TGGTGCCCAA CTGGAGCACC CATCACTTCG TCAACCGGTC GGCGGAAGAT GCGTTGCTGT TCTCGGTCAA CGACATCCCT ACATTGAAGG CTCTCGATCT CTACTACGAA GAGCCCGAGC TGTCTTTGGG GACGCAGCCA TTTCCGCCGG TCCCCGGCTA A
|
Protein sequence | MSTAESSELR EFDVELEAAN LRGQWIYDEM LESVVGGPKP AGVPFLWRWH DVYAKLLKSC DVMPESLTAR RNLSFINPDA RGTTHTMNMG MQMLKPGEIA YAHRHTMAAL RFAIQGGPGL VTVVDGEPCQ MDTYDLVLTP RWTWHDHENA TSENVVWLDV LDIGLVLGLN VPFYESYGEK RQPQREDPGE HLADRGGMLR PAWEQVKAAN FPYRYPWRDV ERQLQRMAGL AGSPYDGVVL RYANPVTGGS TMPTLDCWVQ LLRPGQRTEA HRHTSSAVYF VVRGEGTTVV DGVELDWGPH DSFVVPNWST HHFVNRSAED ALLFSVNDIP TLKALDLYYE EPELSLGTQP FPPVPG
|
| |