Gene Mthe_1533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1533 
SymbolaksA 
ID4461720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1660269 
End bp1661450 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID639700556 
Producttrans-homoaconitate synthase 
Protein accessionYP_843945 
Protein GI116754827 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.200545 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACT TCTCCGTCAA TCAGTTTCTT GAGATGGCCG GCACCCCTCC TCTGGATATA 
GAGATATGCG ATGTAACGCT TAGGGACGGC GAGCAGATGC CGGGGGTTGT GTTCAAGCCC
GAGGAGAAGC TTGAGATCGC TAGGAGGCTC GACGAGATCG GCGTCGAGAT CATAGAGGCC
GGATTTCCAG TTGTATCAAA GAGCGAAAAG AACGCGGTGA GAGATATCTG CAATCTCGGC
CTGAATGCGA AGATATCCGC TCTCTCCAGG TCCAGGCAGT CTGATGTCGA TGTGGCGATC
GATTGCGGTG TTGATATGGT GAGCGTATTC ATAGCGACCT CAGATCTCCA TCTGAAATAC
AAGCTGCATA TGACATGCGC AGAGGCGATA AGGTGTGCGC TGGAGACTGT TGAGTATGCA
AAGGAGCATG GTCTCATAGT CAGGTTCTCG GCTGAGGATG CGACGCGAAC GGATTTCAAC
ACGCTCAAGA AGCTCTACAA AAAAGCAGAG GAGTACCACG CAGATTACGT GAGCGTGGCC
GACACAGTCG GCATAATGAA CCCGAGGACG ATGTACTACA TGATCAGCGA GATCAAGAAG
ATTGTGAACA TTCCGATATG TGTTCACTGT CACGATGATC TCGGTCTCGC GCTGGCGAAC
ACCCTTGCCG GAGCAGAGGC GGGCGCGAAG CAGCTCCACA CCACCGTCAA CGGCATCGGC
GAGAGGAGCG GAAACACGCC GCTTGAGGAG CTGCTGGTCA ACCTTCGCCT ACACTACGGC
ATAGATCGCT ACGATCTGAG CAAACTCAAG TCGATCTCCT CTCTGGTGGA GAGATATTCG
GGCGTACCTG TTGCAAAGAA CAAGGCTGTT GTTGGAGATA ATGCCTTTGC GCACGAATCC
GGGATCCATG TCGCCGCGGT CCTCGAGGAG CCCAGGACCT ATGAGCTCTA CTCCCCTGAG
ATGGTGGGGG CTGAGAGGAG GATCATCATC GGGAAGCACA CAGGAGCCAA GGCGCTCAAG
TACATCACGA AGAAGATGGG CTATGACCTG GAGAAAAAGG ATCTCTGCCT CCTTGCTGAG
AAGGTGAAGA CCGCGAGCGA GTTCAAGAGA CCGATAACAT GCGATGAGTT GAGAAGACTG
ATCCTCGATC TCAAAATAGA GTTTGTGTAC AACGGTCCGT AG
 
Protein sequence
MSDFSVNQFL EMAGTPPLDI EICDVTLRDG EQMPGVVFKP EEKLEIARRL DEIGVEIIEA 
GFPVVSKSEK NAVRDICNLG LNAKISALSR SRQSDVDVAI DCGVDMVSVF IATSDLHLKY
KLHMTCAEAI RCALETVEYA KEHGLIVRFS AEDATRTDFN TLKKLYKKAE EYHADYVSVA
DTVGIMNPRT MYYMISEIKK IVNIPICVHC HDDLGLALAN TLAGAEAGAK QLHTTVNGIG
ERSGNTPLEE LLVNLRLHYG IDRYDLSKLK SISSLVERYS GVPVAKNKAV VGDNAFAHES
GIHVAAVLEE PRTYELYSPE MVGAERRIII GKHTGAKALK YITKKMGYDL EKKDLCLLAE
KVKTASEFKR PITCDELRRL ILDLKIEFVY NGP