Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1077 |
Symbol | thiG |
ID | 3103277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1132992 |
End bp | 1133972 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637170266 |
Product | bifunctional sulfur carrier protein/thiazole synthase protein |
Protein accession | YP_113552 |
Protein GI | 53804598 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis [COG2104] Sulfur transfer protein involved in thiamine biosynthesis |
TIGRFAM ID | [TIGR01683] thiamine biosynthesis protein ThiS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.129832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTTT TCGTCAACGG CGAAGAGCGT ACGGTACCGC CTGGAACGAC CCTGGAGGAT CTGATCGCCG CCATGGACCT CGCCGGGAAA CGTGTGGCCG TCGAGCTGAA CCTCGAAATC GTGCCACACG GCGACTATGG TTCCCGGGTC CTCGAGCCCG ACGACCGGGT GGAAATCGTC CACGCCATCG GGGGGGGGCA GGGCGATCCG CTGGTGATCG CCGGGAAGGC ATACACTTCA CGCCTGCTGG TCGGGACTGG CAAATACAAG GATCTGGCGG AAACCCGGGC CGCGGTGGAA ATGGCCGGTG CGGAGATCGT CACGGTGGCG ATCCGGCGCA CCAACATCGG CCAGGACCCG GGCCAGCCCA GTCTCCTGGA CGTGATCCCG CCCGACCGCT ACACCATTTT GCCCAATACC GCCGGCTGTT ACACGGTCGA AGACGCCGTG CGTACCTGCC GGCTGGCCCG CGAGCTGCTC GGGGGGCATC GTCTGGTCAA GCTGGAAGTG CTGGGCGACC CGACCACCCT CTTCCCGGAC GTGACCGCGA CGCTGGAGGC GGCGGAGATC CTGGTGCGCG ATGGATTCGA TGTCATGGTA TACACCAACG ACGATCCGAT CATCGCCAAG CGCCTGGAAG AGATCGGCTG CGTCGCCGTG ATGCCGTTGG CCGCCCCCAT CGGATCGGGG TTGGGGATTC GCAATCCCTA CAACATCCTG ACCATCGTGG AAAACGCCAA GGTCCCGGTC CTGGTCGACG CGGGCGTGGG TACGGCTTCC GACGCCGCCG TAGCGATGGA ACTCGGCTGC GATGGCGTGC TCATGAACAC GGCCATCGCC GAGGCTAAAA ATCCAGTACT GATGGCATCG GCGATGAAGA AGGCGATCGA GGCCGGACGC GAGGCCTTCC TGGCGGGCAG GATGCCTAGG CGCCGGTTTG CGTCGGCTTC GTCCCCGCTG GCGGGGTTGT TCTTCGATTG A
|
Protein sequence | MRVFVNGEER TVPPGTTLED LIAAMDLAGK RVAVELNLEI VPHGDYGSRV LEPDDRVEIV HAIGGGQGDP LVIAGKAYTS RLLVGTGKYK DLAETRAAVE MAGAEIVTVA IRRTNIGQDP GQPSLLDVIP PDRYTILPNT AGCYTVEDAV RTCRLARELL GGHRLVKLEV LGDPTTLFPD VTATLEAAEI LVRDGFDVMV YTNDDPIIAK RLEEIGCVAV MPLAAPIGSG LGIRNPYNIL TIVENAKVPV LVDAGVGTAS DAAVAMELGC DGVLMNTAIA EAKNPVLMAS AMKKAIEAGR EAFLAGRMPR RRFASASSPL AGLFFD
|
| |