Gene Mpal_1977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1977 
Symbol 
ID7270783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2103219 
End bp2104493 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content60% 
IMG OID643570592 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002467003 
Protein GI219852571 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.540426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGA TCAAGGATGC ACAGCGCGGC CTCGTCACCG AGGAGATGAA GCTGGTCGCT 
GCACAGGAGG GCGTAACCGA GGAGTTCGTC AGAAAAGGGG TCGCTGGAGG GCATATCGTT
ATTCCGGTCT CCCCCTACCG AAAGGTTAAG ATCTGCGGGA TCGGCGAGGG GCTCCGCACC
AAGGTGAACG CCTCGATCGG AACATCCTCC GATATCAGCG ACGTTTCGGT GGAGATCGAG
AAGGTCAGGC AGGCAGAACT GGCAGGTGCC GATACGCTGA TGGAACTCTC CACCGGCGGG
GACCTGGCCG ACATCAGGCG CCAGGTCATC GCAGCCACCT CCCTCTCGGT CGGTTCCGTG
CCCCTGTACC AGGCTTTCAT CGAGGCTGCC CATAAGAAAG GCGCAGTTGT CGATATGGAG
GCGGACGACC TCTTCCGGAT CACTGCCGAA CAGGCAAAGG CAGGCACCAA CTTCATGGCG
ATCCACACGG GGATCAATTA CGAGACGATG AAGCGGCTGC AGAACCAGGG CAGGCACGCC
GGGCTCGTCT CCCGCGGCGG TGCGTTCATG ACTGCATGGA TGCTCCACAA CGAGAAGGAG
AACCCACTCT ACGCTGAGTT CGACTACCTC CTCGAGATCA TGAAGGAGCA TGAAGTGACC
CTCTCGATGG GGAACGGCAT GCGAGCCGGC GCCGTCCATG ACTCTACGGA CCGTGCAGCC
ATCCAGGAAT TGCTGATCAA CGCAGAACTG GCAGACAAAG CCTTCAACGA AGGGGTTCAG
ACGATCGTCG AGGGACCCGG GCATGTCCCG ATCGATGAGA TCCAGGCCAA TGTGATCCTG
CAGAAGCGGG TCACGAACCG CAAGCCCTTC TACATGCTCG GCCCACTGGT CACCGACATC
GCACCAGGCT ACGATGACCG GGTCGCCATG GTCGGTGCAG CTCTCTCCTC GTCGTACGGT
GCAGACTTCA TCTGCTATGT GACGCCGGCG GAGCATCTGG CCCTTCCGAC ACCTGAAGAG
GTCTTCGAGG GAGTCATCTC CTCCCGTATC GCAGCTCACA TCGGGGACAT GGTCAAGCTG
AACAAGCGCG ACGATGATCT GGAGATGGGA CATGCGAGAA AAGCCCTCGA CTGGGACCGG
CAGTACGCAG TTGCAATCAA CCCGAAGAGG GCGAAGGAGA TTCGGGACAG CCGGATGCCG
GCCGATACCG ACGGGTGCAC GATGTGCGGC GACTACTGTG CCATCAAGAT CGTCGCCAAG
CACTTCAACT TCTAA
 
Protein sequence
MSLIKDAQRG LVTEEMKLVA AQEGVTEEFV RKGVAGGHIV IPVSPYRKVK ICGIGEGLRT 
KVNASIGTSS DISDVSVEIE KVRQAELAGA DTLMELSTGG DLADIRRQVI AATSLSVGSV
PLYQAFIEAA HKKGAVVDME ADDLFRITAE QAKAGTNFMA IHTGINYETM KRLQNQGRHA
GLVSRGGAFM TAWMLHNEKE NPLYAEFDYL LEIMKEHEVT LSMGNGMRAG AVHDSTDRAA
IQELLINAEL ADKAFNEGVQ TIVEGPGHVP IDEIQANVIL QKRVTNRKPF YMLGPLVTDI
APGYDDRVAM VGAALSSSYG ADFICYVTPA EHLALPTPEE VFEGVISSRI AAHIGDMVKL
NKRDDDLEMG HARKALDWDR QYAVAINPKR AKEIRDSRMP ADTDGCTMCG DYCAIKIVAK
HFNF