Gene MCA2710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2710 
SymbolthiC 
ID3103797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2895662 
End bp2897536 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content64% 
IMG OID637171841 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_115111 
Protein GI53803179 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGA TCCCCCAAGA AATCCTGCAC GAGGCCGTCA CCGCCAAGAA AGCCTCGATC 
CAGCCGTTCG CGGCCTCGGA AAAAGTCTAC CTCCAGGGCA GCCGCCCGGA CCTGCGCGTC
CCAATGCGCA AGATCAGCCA GTCGGACACG CCCACCAACA CGGGCCGGGA AAAGAACCCG
CCAGTCTATG TCTACGACAC CTCCGGCCCC TATACCGACC CCACCGTATC GGTCGACCTG
AGACTGGGCC TGCCGCCCTT GCGCGAGCCC TGGATCGAGG AACGCGGGGA CACCGAGCTG
CTGAAGGGGC CCTCGTCGTC CTACGGCCTT CAGCGCCAGC GCGATCCGGC TCTGGCGTCC
CTGCGCTTCG AGCATATCCG CGCGCCCCGC CGGGCGAAGG GCGGCGCCAA CGTCACCCAG
ATGCACTACG CCAGGCAGGG CATCATCACG CCGGAGATGG AGTTCGTCGC CATCCGCGAG
AACCAGAAAC TGGAAGCGCT GGCGGAGACA TACAAGTTCC AGCATCCGGG AGAAGCCTTC
GGCGCGGCCA TCCCCCAGGT CATCACCCCC GAATTCGTCC GCGACGAAGT CGCCCGCGGC
CGGGCCATCA TCCCCAGCAA CATCAACCAC CCGGAATCGG AGCCGATGAT CATCGGCCGC
AACTTTCTGG TGAAGATCAA CTGCAACCTC GGCAATTCCG CCGTCAGCTC TTCGATAGAA
GAGGAAGTGG AAAAGATGCT GTGGGCGATC CGCTGGGGCG GCGACACGGT GATGGACCTG
TCCACGGGCA AGAACATCCA CGAGACCCGC GAATGGATCA TCCGCAATTC CCCGGTGCCC
ATCGGCACCG TGCCGATCTA CCAGGCGCTG GAAAAGGTGG ACGGCAAGGC CGAGGAGCTG
ACCTGGGAAA TCTTCCGCGA CACGCTGATC GAGCAGGCCG AGCAGGGGGT GGACTATTTC
ACCATCCATG CCGGCATCCG CCTGCCCTTC ATCCCGCTGA CCGCCAAGCG CACCACCGGC
ATCGTGTCCC GGGGCGGCTC CATCATGGCC AAATGGTGCC TGGCCCACCA CAAGGAGAGC
TTCCTCTACA CCCATTTCGA GGACATCTGC GAGATCATGA AGGCCTACGA CGTGGCCTTC
TCCCTGGGTG ATGGCCTGCG CCCCGGTTCC ATCGCCGATG CCAACGACGA GGCTCAGTTC
GCGGAATTGC GCACCCTGGG CGAGCTGACC CGGATCGCCT GGAAGCACGA CGTGCAGGTG
ATGATCGAAG GTCCCGGCCA CGTGCCCATG CACATGATCA AGGCCAACAT GGAGGAGCAG
CTCAAGCATT GCCACGAGGC GCCCTTCTAC ACCCTGGGAC CCTTGACCAC CGACATCGCG
CCGGGCTACG ACCACATCAC CTCGGCCATC GGCGCCGCCA TGATCGGCTG GTACGGCACG
GCCATGCTGT GCTACGTCAC GCCCAAGGAG CATCTGGGCC TGCCGAACAA GCAGGACGTG
CGCGACGGCA TCATCGCCTA CAAGATCGCC GCCCACGCTG CGGATCTGGC CAAGGGCCAC
CCCGGCGCCC AGGCGCGCGA CAATGCCCTG TCCAAGGCAC GCTTCGAGTT CCGCTGGCAG
GATCAGTTCA ACCTGTCGCT GGACCCCGAG AAGGCGTTGG AATTCCATGA CGAGACACTG
CCCCAGGAAG GGGCGAAACA GGCGCATTTC TGCTCCATGT GCGGCCCGCA TTTCTGCTCG
ATGAAGATCA CCCAGGACGT GCGCGACTAT GCCCGCGAAC ACGGCCTGGA CGAGGCGCAA
GCCCTGGCGA AAGGCATGGA AGAAAAATCG GACGAGTTCG TGAGGTCCGG GGCCGAGGTG
TACCAGCGCA CCTGA
 
Protein sequence
MSAIPQEILH EAVTAKKASI QPFAASEKVY LQGSRPDLRV PMRKISQSDT PTNTGREKNP 
PVYVYDTSGP YTDPTVSVDL RLGLPPLREP WIEERGDTEL LKGPSSSYGL QRQRDPALAS
LRFEHIRAPR RAKGGANVTQ MHYARQGIIT PEMEFVAIRE NQKLEALAET YKFQHPGEAF
GAAIPQVITP EFVRDEVARG RAIIPSNINH PESEPMIIGR NFLVKINCNL GNSAVSSSIE
EEVEKMLWAI RWGGDTVMDL STGKNIHETR EWIIRNSPVP IGTVPIYQAL EKVDGKAEEL
TWEIFRDTLI EQAEQGVDYF TIHAGIRLPF IPLTAKRTTG IVSRGGSIMA KWCLAHHKES
FLYTHFEDIC EIMKAYDVAF SLGDGLRPGS IADANDEAQF AELRTLGELT RIAWKHDVQV
MIEGPGHVPM HMIKANMEEQ LKHCHEAPFY TLGPLTTDIA PGYDHITSAI GAAMIGWYGT
AMLCYVTPKE HLGLPNKQDV RDGIIAYKIA AHAADLAKGH PGAQARDNAL SKARFEFRWQ
DQFNLSLDPE KALEFHDETL PQEGAKQAHF CSMCGPHFCS MKITQDVRDY AREHGLDEAQ
ALAKGMEEKS DEFVRSGAEV YQRT