Gene Mpal_1206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1206 
Symbol 
ID7271484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1239764 
End bp1240849 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content52% 
IMG OID643569843 
Productthiamine biosynthesis/tRNA modification protein ThiI 
Protein accessionYP_002466267 
Protein GI219851835 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00578978 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTAT ATCTAGTTCG GTACTCTGAA ATTTTCTTAA AATCAGAACC GGTGAAACGG 
ATCTGGCAGG GCGTGCTGGT GCATGATATC TATTCCAGGA TGCCCGATGT CACCGTCAGT
GTTGAGCGAG GCAGAGTCTG GGTTGAGGGA GATGTCAAGC CCGAGTGTCT GCGCAGAATC
TTTGGGATCG TCTCATTCTC TGAGGTGACC GAATGTACGC TCGACACCCT TGCCGAGGTG
CTCCCGAACT ACCTTGAGGA ACATGGAATA AAGGACGCGA AGTCCTTTGC ATTCCGGGTC
AAGCGAGTCG GCAACCATCC GTTCAACTCT CGTGAAAAGG AGATTGAACT TGCAAATCTG
GTTTTAGAAC CGTACCCGAA CCTCAAGATC GACCTTGACC ACGCCGAACT GGAGGTCTCC
ATAGAGATCA GACAGGACCG GTGCTACCTG TTCACAACCG TCGAAGAGGG ACCAGGCGGT
CTGCCGGCCG GTGTCGAGGG AACGCTGGTC GCGCTGTTTT CAGGCGGGAT CGACTCACCG
GTCGCCTCAT ACATGATGAT GAAGCGCGGG TGCAAGATCA TCCCGATCTA TGTCGCACTG
GACTCATTTC TGACTGAGAA ACACATTGCA CGGGCTGAGC AGGTGATCGA ATGCCTCCGT
CAGTACCAAC CGGATATAGC ACTGCATGTG ATCTCAGATG ATTATCTGAT CAAGGCCAAA
GAGGCACTCG TTGCCAGAAA ACTCGAAAAA TATACCTGTG TTTTCTGCAA GCGAAGGATG
TACAGACTGG CCACCAAATT TGCAGAGGAG GTTGGAGCGA TCGGTATCGT AACCGGGGAG
TCACTCGGAC AGGTCGCCTC GCAGACCCTA GATAATATGA CCGTGCTGAC GAATGCCACG
ACAATGCCGA TCTATCGACC ACTGATCGGA CTTGACAAGA CCGAGATCAT CAGCATCGCC
CGAAAGATCG GGACATTTGA GGATTCGATT GCAAAGGCCG GAGGATGTGG AGCAGTCCCG
AAGATGCCCT GCACCAAGGC AAAACTTGAA CTGGTACAAG AGATCGAGGA AGAGATCGGG
ATCTGA
 
Protein sequence
MTLYLVRYSE IFLKSEPVKR IWQGVLVHDI YSRMPDVTVS VERGRVWVEG DVKPECLRRI 
FGIVSFSEVT ECTLDTLAEV LPNYLEEHGI KDAKSFAFRV KRVGNHPFNS REKEIELANL
VLEPYPNLKI DLDHAELEVS IEIRQDRCYL FTTVEEGPGG LPAGVEGTLV ALFSGGIDSP
VASYMMMKRG CKIIPIYVAL DSFLTEKHIA RAEQVIECLR QYQPDIALHV ISDDYLIKAK
EALVARKLEK YTCVFCKRRM YRLATKFAEE VGAIGIVTGE SLGQVASQTL DNMTVLTNAT
TMPIYRPLIG LDKTEIISIA RKIGTFEDSI AKAGGCGAVP KMPCTKAKLE LVQEIEEEIG
I