Gene Mthe_0771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0771 
Symbol 
ID4462411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp817112 
End bp819481 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content55% 
IMG OID639699782 
Productvon Willebrand factor, type A 
Protein accessionYP_843201 
Protein GI116754083 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.65454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGAGG GAGCTGTGAG CGAGATATAC CTGCAGAGGC CTGAGCTGCT CTGGCTCGTT 
CCAGCTGTTC TCGCAGCAGG GCTTCTGTAT ACCAGGCGCA CACAGCAGAA GCTCCTGGTC
CTCACGAGAT CCGTGGTGGT ATGCCTGATC ATAATCGCGC TGGCGAATCC ATACACAGTG
GCAACCCACA CAAAGGACGT TAGCAGGCCG AGGATAACGA TACTCTCGGA TCAGACCGCA
TCCATGGAGA TCTTCGATAC GAATATCGCA GAGCGTTTGA GCTCGAGGAT ACCCGACTCA
CAGCTCAGGT ACTTCTCCGG AGAGAGCACG CCGCTCGGAG ACAGGATCAT ACAGTACATG
CCACAGGCTG ATTCAATTCT GCTTGTGAGT GATGGTTACA GCAACAGCGG TCGCCCGCTC
AGGGACGCTC TTCTCCTCGC CAGAAGCTCA AATGTATCCG TATTCGCGAT CGAGATGAGC
CCTGTGGCGA AAGAGGCTGG CGTCGAGATA TCGGGAAGCA ATGTTGCGGT TCTGGATGGT
GATTATCCAT TTAAGATAAT CGTAAGAAAA TCCGGAAGCG CCAGCGGAGA TGTTGTGGTA
TACGCGGATG ACCTGGAGAT ATTCAGGGGA TCTCTGGAGA GCATAAACGA TTCCCTCAGG
ATATCACACA GGTTCCGCAG CACAGGCACG CACATCCTGA GAGCGGAGAT ATACCCAGAT
GAGGATCATT TCGATATGAA CAACATGTAC ACGAAAGCCG TGTACGTTGT GCCGAGGCCA
CGAGTTCTGC TTCTGGGCAG CTCATCCCCT CTGGAAGATG TCCTTAAGGA TATCGTCGAG
TTAAACACCG CAGAGGACCT TCCTCAATCT CTGAGCGGTT ACAAGGCGGT CGTCCTGGAC
AATATCAAGT ACAATCCCGA TCTCGACCGG CTGAAGGACT ACGTGGCCAG CGGCGGCGGC
CTGGTCGTTG TTGGTGGAGC TGATGCATAC GAGCTGGGCG GCTATTACGG CACAGAGTTT
GAGAAGGCTC TCCCTGTCAT CTCGTCTCCG AGCCTCTTCG AAGGCGGGAA GGTTCTGATC
ATGGTCATAG ATATCTCAGG AAGCACGATG GCGCCGATGA GGATTGGGGA GAGCACGACG
TACCTTGATT ATGAGAAGTC ACTCGCGATA GAGCTTCTCC AATCACCGGA GCTCAGGGAT
GCCAGGGTCG GCATCGTCGT CTTCGGCACG AAACCTTACG TCGTATCTCA GCCGGTGCCT
ATCAGAAACA GGGCTGTCAT AGAGGAGAGT ATAAAGAGCC TTCAGACTCC CCTCGGAAGA
GATGAGACTA ACCTGGATGA GGGACTCCGC CTGGCGTGGA AGATCATCAA CGAGAGCAAG
GCTGAGGCCG ATCTTGTGAT CATATCAGAT GGCAGAATAG AGCCTGACAA GGTGAAGGGA
GATCAGGTAT TCCTGAACTC TGTTGATATT CTGAGGGAGA TGAACGCGAC GGTCACACTG
ATACAGGTCC AGAGCTACGC AGGGTCTGCC GGCAGGTTTG AGGAGCTGGC TGCGCTCACA
GGCGCCACGT TCCGTCCTGC AGTATACCCA AGCTCTCTGA CTGTGAGGAC TCCTGAGATC
GAAAGGGGGA TAGAGGTTGC GAAGAACGTC TCCGGATACA CACTTGTGGT GACGGATGAG
AATCATTACA TAACAAGCGA TATAGAGATC AATGCTACGA TCAGCGGATT CAATGATGTG
ACGCCGAAGC CAGGCGCCCA GAGGCTCGTG GCGCTGACAG ATGGAAAACC GATAGTGACA
GCAATGCGCT ACGGCCTCGG CAGGAGCGTC TCCCTGGCCA CAGATGATGG CAATGCATGG
GCTCAGTCGA TTTACGCCGA GGAGAACTCG ATGCTCATTT CCTCCATGGT TAACTGGGCC
GTCGGCGATC CGAGGCCCGA GAGAGAGAGG GTTGAGGCCG ACGATGGATG GGCGGGAAGC
CCTCTGGAGA TATACGTCTC AAGCGAGAGC CCGCCTGAGA TCGGAAAGGG AGCCAGGATC
GAGAGCACCG CCCCGGGAAG GTACACGGTG ACGATCGTCC CGGAGTCAAA GGGTGTGTAT
TACATCAACG ATTATGGAAT CGCCGTAAAC TACCCGCTGG AGTACAGGGA GTTCGGGTTC
AACCCGGAGC TGAAAGGAAT GATAGAGGCA GTGGGTGGAA AGATCTTCAC AGAGGACGAG
GCGGGAAGGA GCATCGTGGA GGAGGCGAGG AAGGCAAGCG CGCGCCTCGT TCAGGAGAGA
GCCAGCATCA GCTGGCAGCT GCTTCTGGCT GCGCTGGTTC TATTCCTGCT GGAGGTCACA
GTCAGGCGTC TGAGGGAGAT AAGGGGGTGA
 
Protein sequence
MEEGAVSEIY LQRPELLWLV PAVLAAGLLY TRRTQQKLLV LTRSVVVCLI IIALANPYTV 
ATHTKDVSRP RITILSDQTA SMEIFDTNIA ERLSSRIPDS QLRYFSGEST PLGDRIIQYM
PQADSILLVS DGYSNSGRPL RDALLLARSS NVSVFAIEMS PVAKEAGVEI SGSNVAVLDG
DYPFKIIVRK SGSASGDVVV YADDLEIFRG SLESINDSLR ISHRFRSTGT HILRAEIYPD
EDHFDMNNMY TKAVYVVPRP RVLLLGSSSP LEDVLKDIVE LNTAEDLPQS LSGYKAVVLD
NIKYNPDLDR LKDYVASGGG LVVVGGADAY ELGGYYGTEF EKALPVISSP SLFEGGKVLI
MVIDISGSTM APMRIGESTT YLDYEKSLAI ELLQSPELRD ARVGIVVFGT KPYVVSQPVP
IRNRAVIEES IKSLQTPLGR DETNLDEGLR LAWKIINESK AEADLVIISD GRIEPDKVKG
DQVFLNSVDI LREMNATVTL IQVQSYAGSA GRFEELAALT GATFRPAVYP SSLTVRTPEI
ERGIEVAKNV SGYTLVVTDE NHYITSDIEI NATISGFNDV TPKPGAQRLV ALTDGKPIVT
AMRYGLGRSV SLATDDGNAW AQSIYAEENS MLISSMVNWA VGDPRPERER VEADDGWAGS
PLEIYVSSES PPEIGKGARI ESTAPGRYTV TIVPESKGVY YINDYGIAVN YPLEYREFGF
NPELKGMIEA VGGKIFTEDE AGRSIVEEAR KASARLVQER ASISWQLLLA ALVLFLLEVT
VRRLREIRG