Gene Msed_1353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1353 
Symbol 
ID5103412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1323783 
End bp1325777 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content50% 
IMG OID640507242 
Productacetyl-CoA synthetase 
Protein accessionYP_001191435 
Protein GI146304119 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000252998 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAAC CAGAAAATGA ATCAGGTCTA CCTTTTCAAG AGAAAGTTGT ACCTGAGTTA 
CTCAAACACA GGTTAGTGAC ACCTGAGGAA TACCTAAGAA TTCACAAGAA GACCGTAGAG
AACTACCAAG AGTACTGGGA ATCCGTAGCT AAGGAACTGG ACTGGTTCAA GCCATGGGAA
AAGGCGCTTG ACGATTCGCA TCCTCCTTTC TATAAGTGGT TTGTTGGAGG AGAGCTGAAC
GCGTCTTATC TTGCAGTGGA TAGGCACGCC AACTCATGGA GGAGAAATAA GGTAGCTATA
ATCTGGGAGG GAGAGCCGTG GGAAAACGGT CCCAAGGAAG TCAGGAAATT GACGTACCTT
GACCTATACC GTGAGGTAAA TAGGGCAGCC TATCTCCTCA AGGAAGTTTA CGGCCTAAAG
AAGGGCGATA CCATAGGGAT TTACCTTCCC ATGATTCCAG AGCTTCCAAT ATTCATGCTT
GCTGCCGCCA GACTAGGCGT AGCTTTCACT GTAGTCTTCT CGGGATTCAG TGCACAGGCG
GTAGCTGATA GGATGAACGA TGCCGACACC AAGTTGCTTA TCACAGCTGA TGGTGGTTGG
AGAAGGGGGA AAGTGATACC CCTCAAGGAG ATCGTGGATA AGGCGCTTGA GACAGCCACC
ACAGTGAAGA ACGTGTTGGT GGTCAGGAGA ACCGGAACAG AGATTAGCAT GAAACCCGGA
AGAGATGCCT ACTTACACGA CGTGATGAGT AAGGTTCCGA TTAAGGCCTA CGTGGAGCCG
GAGAGGGTGA AAAGTGAGGA TCCTCTTTAC ATTCTTTACA CCTCTGGCAC CACGGGGAAG
CCTAAGGGCA TAATTCACGA CACAGGCGGA TACATGACTC TCCTTCATAA CACCATGAAA
CTTGTGTTTG ACATTAGGGA TACAGACGTT TTTTGGTGCA CCGCAGACAT AGGATGGGTA
ACTGGTCACT CCTACATCGT GTTCGGGCCC CTTCAGGAAG GAGCGACTGA GGTCATGTAC
GAGGGAGCCT TGGACTTCCC TGAACCTGAC AGGTGGGTCT CGATCATAGA GAGACATCAG
GTCTCAATAC TTTACACTTC ACCCACGGCG ATAAGGACGT TCATGAAGCA GGGAGAGCAA
TGGATAAAGA AGCACGACGT TAGCAGCGTA AGGCTAATGC ACTCAGTGGG AGAGCCAATT
AACCCTGAGG CGTGGAGATG GTTCCACAAA CTAGTGGGGA GAGGACAAGT TCCCTTTGGT
AGCACTTGGT GGATGACTGA AACTGGTGGA ATAATGATAT CACACATGCC TGGTGGATAC
CTAGTGCCCA TGAAACCTGG AACCAATGGT CCTCCTCTTC TTGGCATAGA GACTAACGTA
TTTGACGAGG AAGGAAAACC CATGCCAGAG GAGCAGAAGG GTTACCTAGT GATTACCAAG
CCGTGGCCTG GCATGCCACT CACTATCAAC AAGGATCCAG AAAGGTACGT TAAGGTATAC
TGGAACAAGT TTCCCAACGT TTTCTACGCC GGAGATTACG CGATCAAGGA TAGGGATGGT
TACTTCTGGA TACTGGGGAG AGCCGACGAG GTAATGAAGA TTGCCGGACA CAGGATTGGA
ACGTATGAGC TGGAGTCGGC CCTCGTGCAA CATCCAGCGA TAGCCGAAGC AGCAGTAGTT
GGTGTTCCAG ATCCTGTCAG GGGAGAGGTG GCTGAGGCCT TCGTCATCCT CAGATCTGGA
GTGGAGCCTA GCGCTAAGCT AAGGGAGGAG ATTGTGAAGT TCGTGAGAGA AAACTTCGGT
CCCATAGCGG TGTTCAGGGA AATCCACTTC GTCTCGAAGC TACCCAAGAC CAGAAGCGGA
AAGATCATGA GGAGGGTAAT CAAGGCGGTT GCAACTAACT CTCCCGTGGG CGACGTCACC
ACGCTAGAGG ATGAGGCCTC TGTGGAGGAG GTAAAGAAAG CCTTCCAGGA GCTCAAGGAA
CAGGTCGGGA AGTGA
 
Protein sequence
MSQPENESGL PFQEKVVPEL LKHRLVTPEE YLRIHKKTVE NYQEYWESVA KELDWFKPWE 
KALDDSHPPF YKWFVGGELN ASYLAVDRHA NSWRRNKVAI IWEGEPWENG PKEVRKLTYL
DLYREVNRAA YLLKEVYGLK KGDTIGIYLP MIPELPIFML AAARLGVAFT VVFSGFSAQA
VADRMNDADT KLLITADGGW RRGKVIPLKE IVDKALETAT TVKNVLVVRR TGTEISMKPG
RDAYLHDVMS KVPIKAYVEP ERVKSEDPLY ILYTSGTTGK PKGIIHDTGG YMTLLHNTMK
LVFDIRDTDV FWCTADIGWV TGHSYIVFGP LQEGATEVMY EGALDFPEPD RWVSIIERHQ
VSILYTSPTA IRTFMKQGEQ WIKKHDVSSV RLMHSVGEPI NPEAWRWFHK LVGRGQVPFG
STWWMTETGG IMISHMPGGY LVPMKPGTNG PPLLGIETNV FDEEGKPMPE EQKGYLVITK
PWPGMPLTIN KDPERYVKVY WNKFPNVFYA GDYAIKDRDG YFWILGRADE VMKIAGHRIG
TYELESALVQ HPAIAEAAVV GVPDPVRGEV AEAFVILRSG VEPSAKLREE IVKFVRENFG
PIAVFREIHF VSKLPKTRSG KIMRRVIKAV ATNSPVGDVT TLEDEASVEE VKKAFQELKE
QVGK