Gene Msed_0267 details


Gene Information

Locus tag: Msed_0267
Symbol:
ID: 5103887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 224544
End bp: 226370
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 51%
IMG OID: 640506173
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001190368
Protein GI: 146303052
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID:
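
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 224544
end_bp = 226370

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1827 bp) and Protein Length (608 aa) fields.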


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0200095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.25307
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATATGGA AGCCTGACAA GGAATGGAGG GAAGAGAGCA ATATAGGGAA ATGGTTAGCT 
GAAAGGAACC AGTCCCTGGA TCAGTTCCAG GAATTCACAT GGAAGGAACC TGAAACCTTT
TGGCCTTCCT TTCTAGAAAG AGTCGGGGTT AATTTCAGGA GGAAACCCGA GAAGGTTCTG
GACCTCTCTC GAGGAAGAGA ATGGTCCAAG TGGTTCGTAG GATCTAGGTT AAACGTTACA
GATCAGCTGG ATGATTCCCC TGAAACTCTG GTCTCTTCCA TGAATGAGGA GGGAGAAGTT
AAGGAGTTCA GTCGCTCCCA GGTCCTGAGC TGGGCTAAAT CCATATCCAG CTGGTTGAGG
AGAGCTGGGC TGTCCCCAGG GGATAGGGTG GCCGTGTACA TGCCCATGAC AGCTGAGATA
GTTCCCATTA TGCTGGGGAT AGCTAGGGCA GGGATGATTA TTGTCCCGCT ATTCTCAGGG
TATGGAGAGG AACCAATTCG AGTTAGGGTG GAGGATAGCG GAGCTAAGGC AATCTTCACT
GTAGATAGGT ACACAAGAAA GGGGAAACGG GTGGAACCGA CTAGGAACCT GGAGAGACTC
AATCTCGTAA AGATAGCCCT GAAAACTTCC CTAGAGCTAA AGGATTATCA CGACTTAAGG
GAGTTAACCA GGGAAGGAGG AGACGGATAC GAGGAAACTG AGGCGGAGAG TCCCCTAATG
ATAATTTACA CTTCAGGTAC CACCGGGAAA CCAAAGGGAT GTGTTCACGT TCACGGTGGC
TTTCCAGTGA AGGCGTCAGC GGACATGTAC TTCCATTTTG ACGTGAGGAA AGGAGAGGGT
GTTTCCTGGA TTTCAGACAT GGGATGGATG ATGGGACCGT GGTTAGTGTT TGGCTCTCTG
ATGGTGGGAG CGAGAATGGC TCTCCTCGAC GGTTACGCAA CCCCCGAAAC CCTGGAAAAC
TTCGTGAACA CCTTGAGGGT AAACGTCCTA GGTCTATCAG CCAGCCTAAT CAGGAGCTTG
AGGTCGTCTA AGCCGTCCAT GAAGCTGGAT GTGAGGGTCG TGGGGAACAC CGGTGAACCC
ATAGATCCTG AGAGCTGGAA CTGGATTGCC CAGGTTACGG AGTCCCCAGT GATTAATTAC
TCTGGTGGCA CGGAGATCTC CGGAGGAATA CTGGGGAACT ACGTTGTCAA GGAGATGAGG
CCCTCCTCCT TTAACGGGCA ATCTCCAGGA ATAAGGGCTG AGGTCTTCAA CGAGAGTGGC
GAACCTGCTA ATCCGGGCGA GGAGGGGGAG CTGGTGGTGC TGAGCGTTTG GCCCGGAATG
ACCAGGGGGT TCTGGAAGGA TCCAGGCAGG TACATTGAAA CCTACTGGTC TAGATGGAAA
AACGTTTGGG TTCACGGGGA TCTAGCCATA AAGGATGAGG ACGGTTACTT CTACATCGTG
GGAAGGAGCG ACGATACCAT AAAGGTCTCA GGGAAAAGGA TAGGTCCAGG GGAGATAGAG
GCAGTTCTCA ATGCCCATCG AGCTATCGTG GAGAGTGCAT GTGTTGGTGT CCCTGATCCT
ACGAAGGGAG AGAAGGTGAT ATGCCTAGCA GTACCTAAGG AGGTTAGGAC TGGACTCGAG
GAGGAGTTAC TGAAATACCT TGAGGAGAGG TTGGGGAAGG CAATAGCTCC CTCCATCGTG
AAACTAGTCC CTGAACTGCC AAAAACCAGG AACGCGAAGA TCATGAGGAG GCTCATAAGG
AACACGATAC TCAACAAAGA TCTAGGAGAT ATATCCTCCC TCGAAAATCC TCAGTCCCTA
GAGCTCATAA AAAAGGCGCT GTCTTGA
 
Protein sequence
MIWKPDKEWR EESNIGKWLA ERNQSLDQFQ EFTWKEPETF WPSFLERVGV NFRRKPEKVL 
DLSRGREWSK WFVGSRLNVT DQLDDSPETL VSSMNEEGEV KEFSRSQVLS WAKSISSWLR
RAGLSPGDRV AVYMPMTAEI VPIMLGIARA GMIIVPLFSG YGEEPIRVRV EDSGAKAIFT
VDRYTRKGKR VEPTRNLERL NLVKIALKTS LELKDYHDLR ELTREGGDGY EETEAESPLM
IIYTSGTTGK PKGCVHVHGG FPVKASADMY FHFDVRKGEG VSWISDMGWM MGPWLVFGSL
MVGARMALLD GYATPETLEN FVNTLRVNVL GLSASLIRSL RSSKPSMKLD VRVVGNTGEP
IDPESWNWIA QVTESPVINY SGGTEISGGI LGNYVVKEMR PSSFNGQSPG IRAEVFNESG
EPANPGEEGE LVVLSVWPGM TRGFWKDPGR YIETYWSRWK NVWVHGDLAI KDEDGYFYIV
GRSDDTIKVS GKRIGPGEIE AVLNAHRAIV ESACVGVPDP TKGEKVICLA VPKEVRTGLE
EELLKYLEER LGKAIAPSIV KLVPELPKTR NAKIMRRLIR NTILNKDLGD ISSLENPQSL
ELIKKALS
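
The relationship between the two sequences above can be illustrated with a short translation sketch. It builds the codon table from the standard NCBI amino-acid string (shared by translation table 11) and treats an alternative start codon in the first position as methionine, which is why the gene begins with TTG while the protein begins with M. The function names and the reduced set of start codons are assumptions made for this example, not something taken from the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplification: only the most common table 11 start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string, reading the first codon as an initiator if possible."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codons are read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "TTGATATGGA AGCCTGACAA GGAATGGAGG GAAGAGAGCA ATATAGGGAA ATGGTTAGCT"
print(translate(first_line))
```

Run on the first 60 bases, this gives the first 20 residues of the protein sequence (MIWKPDKEWR EESNIGKWLA). Passing the full gene sequence to `translate` would be the natural next check.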