Gene MCA1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1077 
SymbolthiG 
ID3103277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1132992 
End bp1133972 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content65% 
IMG OID637170266 
Productbifunctional sulfur carrier protein/thiazole synthase protein 
Protein accessionYP_113552 
Protein GI53804598 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2022] Uncharacterized enzyme of thiazole biosynthesis
[COG2104] Sulfur transfer protein involved in thiamine biosynthesis 
TIGRFAM ID[TIGR01683] thiamine biosynthesis protein ThiS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.129832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGTTT TCGTCAACGG CGAAGAGCGT ACGGTACCGC CTGGAACGAC CCTGGAGGAT 
CTGATCGCCG CCATGGACCT CGCCGGGAAA CGTGTGGCCG TCGAGCTGAA CCTCGAAATC
GTGCCACACG GCGACTATGG TTCCCGGGTC CTCGAGCCCG ACGACCGGGT GGAAATCGTC
CACGCCATCG GGGGGGGGCA GGGCGATCCG CTGGTGATCG CCGGGAAGGC ATACACTTCA
CGCCTGCTGG TCGGGACTGG CAAATACAAG GATCTGGCGG AAACCCGGGC CGCGGTGGAA
ATGGCCGGTG CGGAGATCGT CACGGTGGCG ATCCGGCGCA CCAACATCGG CCAGGACCCG
GGCCAGCCCA GTCTCCTGGA CGTGATCCCG CCCGACCGCT ACACCATTTT GCCCAATACC
GCCGGCTGTT ACACGGTCGA AGACGCCGTG CGTACCTGCC GGCTGGCCCG CGAGCTGCTC
GGGGGGCATC GTCTGGTCAA GCTGGAAGTG CTGGGCGACC CGACCACCCT CTTCCCGGAC
GTGACCGCGA CGCTGGAGGC GGCGGAGATC CTGGTGCGCG ATGGATTCGA TGTCATGGTA
TACACCAACG ACGATCCGAT CATCGCCAAG CGCCTGGAAG AGATCGGCTG CGTCGCCGTG
ATGCCGTTGG CCGCCCCCAT CGGATCGGGG TTGGGGATTC GCAATCCCTA CAACATCCTG
ACCATCGTGG AAAACGCCAA GGTCCCGGTC CTGGTCGACG CGGGCGTGGG TACGGCTTCC
GACGCCGCCG TAGCGATGGA ACTCGGCTGC GATGGCGTGC TCATGAACAC GGCCATCGCC
GAGGCTAAAA ATCCAGTACT GATGGCATCG GCGATGAAGA AGGCGATCGA GGCCGGACGC
GAGGCCTTCC TGGCGGGCAG GATGCCTAGG CGCCGGTTTG CGTCGGCTTC GTCCCCGCTG
GCGGGGTTGT TCTTCGATTG A
 
Protein sequence
MRVFVNGEER TVPPGTTLED LIAAMDLAGK RVAVELNLEI VPHGDYGSRV LEPDDRVEIV 
HAIGGGQGDP LVIAGKAYTS RLLVGTGKYK DLAETRAAVE MAGAEIVTVA IRRTNIGQDP
GQPSLLDVIP PDRYTILPNT AGCYTVEDAV RTCRLARELL GGHRLVKLEV LGDPTTLFPD
VTATLEAAEI LVRDGFDVMV YTNDDPIIAK RLEEIGCVAV MPLAAPIGSG LGIRNPYNIL
TIVENAKVPV LVDAGVGTAS DAAVAMELGC DGVLMNTAIA EAKNPVLMAS AMKKAIEAGR
EAFLAGRMPR RRFASASSPL AGLFFD