Gene Mthe_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1413 
Symbol 
ID4463218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1514971 
End bp1516941 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content55% 
IMG OID639700432 
Productacetate--CoA ligase 
Protein accessionYP_843828 
Protein GI116754710 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.243729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATATG AAAAAGCAGA GGTATCCTCT CAGGAGAACG TCTACAGGCC GGCAAGCGAC 
CTGGTGGAGA ACTCCAATGT GATGCAGTGG ATGAAGAGAA AGGGATTCAG GAGTGAGAAA
GAGCTGCGGG CCTGGTGTTC CGAGAACTAC GTTGAGTTCT GGGACGAGAT GGCAAAGACA
TATGCAGATT GGTTTGTGCC GTACGAGAAG GTCCTGGAGT GGAACCCACC GCATGCGAGA
TGGTTTGTCG GAGGAAAGTG TAACGTCGCG CACAACGCCC TCGACAGGCA TGCAAGATCC
TGGCGAAGAA ACAAGGTCGC TTACTACTTC GTGGGTGAGC CGGTCGGAGA TACCAGGGCC
ATTACATACT ATCAGCTCTA CAGAGATGTG AATAAGCTCG CAAATGGTCT GAAGAGTCTT
GGCGTGAAGA AGGGCGACAG GGTTGGTATC TACCTGCCGA TGATCCCCGA GCTGCCGGTG
GCGATGCTGG CTTGTGCTAA GATCGGCGCG ATACATGTCG TAGTCTTCTC CGGATTCAGC
GCAGGCGCGC TCCGCGAGAG GATTAACGAT GCTGGGGCCA GAGTCCTGAT AACATGTGAT
GGATCATACA GAAGGGGCAA GCCTATCCCG ATAAAGGCCC AAGCGGATGA GGCCCTTCAG
GACGCGCCCT CTGTCGAACG CCAGATTGTT TACAGACGGA CTGGCCAGAG CATCGAATGG
AAGGACGGAT TCGATATCTG GTGGCATGAG CTCGTGAAGA ACCAGCCTGA TGAGTGTGAG
ACGCTTCAGA TGGACTCGGA GGATCCGCTC TTCATACTCT ACACAGCTGG AGCTGGAGGA
AAGCCGAGGG GTGTTGTCCA TGCGCACGGC GGCTTCTGCG TCGGGCCTGC GTACACGACG
AGCTGGGTTT TCGACATAAA GGATACTGAT GTGTACTGGT CGACTGCTGA CATAGGGTGG
ATCACAGGCC ACACATACAT AGTATACGGA CCGCTCTGCC TCGGCGCGAC GAGCGTTATG
TACGAGGGCT CTCCGGATTA CCCTGATTTC GGGAGATGGT TCCAGATCAT AGAGGATTAC
GGTGTCTCTG TGATATACAC AGCGCCCACC GCAATCAGGA TGTTCATGAA GGAGGGCGAG
GAGTGGCCTA GGAAGTACGA CCTGAGAAGC GTCCGGCTCA TGGGATCTGT GGGAGAGGCC
ATGAACCCCG ATGCTTTCCT GTGGTGGAGA AAGCATGTCG GCAACGACTG GGCTCCCATA
ATGGACACAT GGTTCCAGTC GGAGACAGGA TGCCATGTGA TAGCTCCACT GCCGATAACC
CCGCTCAAGC CGGGCTCGCC TGCATTCCCG CTTCCCGGAT ACAACGTTGA TCTCCTTGAT
GTGAACGGGA GAGCAGTTGG TCCTGGAGAG AGTGGGAACA TCGTGCTCAC AGCCCCATGG
CCGACGATGC TCAGAGGTAT ATACGGAGAG CCGGAGAAGC TCAGGGAGAT CTATTACGAC
TACTACTGGA GCATCAAACC TGGTATATAC CTCAGCGGCG ACAGGGCGAG GAGGGATGCT
GACGGCTACT GGTGGATACT CGGAAGGATA GATGATGTTC TGAAGGTCGC AGGCCACAGG
ATAAGCAATG CAGAGGTCGA GAGCGCAGCG CTCTCACATC CGAATGTCGC GGATGCAGCT
GTCATCGGCA GGCCGGACAA GGTCAAAGGA GAGAACATCA TTCTCTTCGT TGTGCTTAAA
GAGGGCATCA ATCCAAGCGA GGAACTCAAG AAGGATATCA GGAACCATGT CAGGGCGACC
ATGGGACCGA TAGCGATGCC CTCTGAGGTT TACTTCGTCT CCGCCATACC CAAGGACAGA
ACGGGAAAGC CTGTTAGGGC AGTGATCAAG GCGAAGGCAC TTGGAGCAGC TCTCGGCGAT
ACATCCTCTG TAATAAACAA AGATGCCATC GATGCCATAC CCGCGATTTA G
 
Protein sequence
MVYEKAEVSS QENVYRPASD LVENSNVMQW MKRKGFRSEK ELRAWCSENY VEFWDEMAKT 
YADWFVPYEK VLEWNPPHAR WFVGGKCNVA HNALDRHARS WRRNKVAYYF VGEPVGDTRA
ITYYQLYRDV NKLANGLKSL GVKKGDRVGI YLPMIPELPV AMLACAKIGA IHVVVFSGFS
AGALRERIND AGARVLITCD GSYRRGKPIP IKAQADEALQ DAPSVERQIV YRRTGQSIEW
KDGFDIWWHE LVKNQPDECE TLQMDSEDPL FILYTAGAGG KPRGVVHAHG GFCVGPAYTT
SWVFDIKDTD VYWSTADIGW ITGHTYIVYG PLCLGATSVM YEGSPDYPDF GRWFQIIEDY
GVSVIYTAPT AIRMFMKEGE EWPRKYDLRS VRLMGSVGEA MNPDAFLWWR KHVGNDWAPI
MDTWFQSETG CHVIAPLPIT PLKPGSPAFP LPGYNVDLLD VNGRAVGPGE SGNIVLTAPW
PTMLRGIYGE PEKLREIYYD YYWSIKPGIY LSGDRARRDA DGYWWILGRI DDVLKVAGHR
ISNAEVESAA LSHPNVADAA VIGRPDKVKG ENIILFVVLK EGINPSEELK KDIRNHVRAT
MGPIAMPSEV YFVSAIPKDR TGKPVRAVIK AKALGAALGD TSSVINKDAI DAIPAI