Gene Msed_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1217 
Symbol 
ID5103831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1189803 
End bp1191254 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content51% 
IMG OID640507109 
ProductAcetyl-CoA hydrolase 
Protein accessionYP_001191302 
Protein GI146303986 
COG category[C] Energy production and conversion 
COG ID[COG0427] Acetyl-CoA hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.872767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGGA TAGCTTGCCG TGAACTGAGG TCGTTAGTCA CGGACCCGAT GGACGCAGTG 
AAGAGTCACC TTCCCAGACA TGGGGTAATT GCCTTCGGTG GAATGTCCGG CTCCTCGGTT
CCCAAGGAGT TTCCCAGGGC CATGGCCCAG CATCTGGAGG GAGAGAGTGG ATACGCCTTA
ACCCTTTTCA CGGGAGGAGC CACATCGGAG GAGTTCGAGA GGAACATCTC TAGGATAGGT
CACGCATTTC GAAGAAGGTA CCCTTTTCTC AGTGGAGGTG TGCATAGGGA ACTTGTGAAC
AAAAACGAGC TCGAGTTCTT CGATTACGGT CTCCACAAGT TTAACAGAAA AATCAGATCA
GGAGAACTGG GACCAATAGA CCTCGCAGTA ATTGAGGCAA CTGCGATTCT CGAGGACTGC
TCCGTGATAC CCTCTCTCTC CCTGGACTCG TCTATCTCCT TTATTAGCGT CGCTAAGAAG
GTGATCCTGG AAATCAATGA GAGTAAACCT GAATTGGAGG GCCTTCATGA CATTTACTTG
GGGGATAACG TATCACTCGC GAAGGTTTCC CAAAGGATTG GGGACACTAG GCTGACTGTA
CCCCCCTCAA GGATAGGGGC AATACTGGTC ACTAGGGAGG AAGATGGATC TCCTGGAGCA
TATAAGTCTC CTGGAGACGT GGAGGCGAGG ATCTCGGAGA ACATCATGGA GTTCATGATG
AACGAGATCA GAGAGGGCTT CTCCAGAGAA CTATACGAAC GGGGGGATTT CGTGATCCAA
CCTGGGGCTG GATCCCTCTC TAGCAAGCTC GCAGACGTGT TACCCCAGTC GGGTTTGACC
CTAAGTGTGT GGGCTGAAAC CCTACCCGCA AAGTGGGCCC TCATTCCTGA GGATAACGTG
AAGTTCATTA GCTCCTCAGC CTTTTTTACC CTACGTGGAG AGGAGAACTT CCTTCGTAAA
TTCTTTGAAA CCGGGGACTT CAGACGGATA GTGCTACGCG GACAGAACGT CACGAACAAC
CCCGAGGTGA TTCAAAGGCT TAACCTGATG TCCATACTAC AGGCAATCGA ACTAGATATT
TACGGTAATG CCAATATATC TCACATACAC GGAAATATAT ATAATGGCGT CGGTGGGTCT
GTGGACTTTG CCCCAAACGC TTACATTACC GTGATCGCAC TACCCTCGGT CACGGGAGAC
GGTAAGATCT CTAGGATTGT CCCACTCACG ACTCACGTAG ACGTTCCAGA GCACTACGTG
GATGTGGTGG TAACGGAACA GGGATATGCG GACTTAAGGG GAAAATCCCC CCACGAGAGG
GCTGAGGAAA TAATAAACAA GTGCGCTCAC CCTAGGTTTA GGGACTCCCT TCTACAATAT
TATGGGAAGG TCAGGGAGAT GGGACATGAG GCGCATGACC TGAGGCTGGC CCATGAACTG
TTCGGTCTTT GA
 
Protein sequence
MERIACRELR SLVTDPMDAV KSHLPRHGVI AFGGMSGSSV PKEFPRAMAQ HLEGESGYAL 
TLFTGGATSE EFERNISRIG HAFRRRYPFL SGGVHRELVN KNELEFFDYG LHKFNRKIRS
GELGPIDLAV IEATAILEDC SVIPSLSLDS SISFISVAKK VILEINESKP ELEGLHDIYL
GDNVSLAKVS QRIGDTRLTV PPSRIGAILV TREEDGSPGA YKSPGDVEAR ISENIMEFMM
NEIREGFSRE LYERGDFVIQ PGAGSLSSKL ADVLPQSGLT LSVWAETLPA KWALIPEDNV
KFISSSAFFT LRGEENFLRK FFETGDFRRI VLRGQNVTNN PEVIQRLNLM SILQAIELDI
YGNANISHIH GNIYNGVGGS VDFAPNAYIT VIALPSVTGD GKISRIVPLT THVDVPEHYV
DVVVTEQGYA DLRGKSPHER AEEIINKCAH PRFRDSLLQY YGKVREMGHE AHDLRLAHEL
FGL