Gene Information |
Locus tag | Msed_1353 |
Symbol | |
ID | 5103412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1323783 |
End bp | 1325777 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507242 |
Product | acetyl-CoA synthetase |
Protein accession | YP_001191435 |
Protein GI | 146304119 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02188] acetate--CoA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000252998 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAAC CAGAAAATGA ATCAGGTCTA CCTTTTCAAG AGAAAGTTGT ACCTGAGTTA CTCAAACACA GGTTAGTGAC ACCTGAGGAA TACCTAAGAA TTCACAAGAA GACCGTAGAG AACTACCAAG AGTACTGGGA ATCCGTAGCT AAGGAACTGG ACTGGTTCAA GCCATGGGAA AAGGCGCTTG ACGATTCGCA TCCTCCTTTC TATAAGTGGT TTGTTGGAGG AGAGCTGAAC GCGTCTTATC TTGCAGTGGA TAGGCACGCC AACTCATGGA GGAGAAATAA GGTAGCTATA ATCTGGGAGG GAGAGCCGTG GGAAAACGGT CCCAAGGAAG TCAGGAAATT GACGTACCTT GACCTATACC GTGAGGTAAA TAGGGCAGCC TATCTCCTCA AGGAAGTTTA CGGCCTAAAG AAGGGCGATA CCATAGGGAT TTACCTTCCC ATGATTCCAG AGCTTCCAAT ATTCATGCTT GCTGCCGCCA GACTAGGCGT AGCTTTCACT GTAGTCTTCT CGGGATTCAG TGCACAGGCG GTAGCTGATA GGATGAACGA TGCCGACACC AAGTTGCTTA TCACAGCTGA TGGTGGTTGG AGAAGGGGGA AAGTGATACC CCTCAAGGAG ATCGTGGATA AGGCGCTTGA GACAGCCACC ACAGTGAAGA ACGTGTTGGT GGTCAGGAGA ACCGGAACAG AGATTAGCAT GAAACCCGGA AGAGATGCCT ACTTACACGA CGTGATGAGT AAGGTTCCGA TTAAGGCCTA CGTGGAGCCG GAGAGGGTGA AAAGTGAGGA TCCTCTTTAC ATTCTTTACA CCTCTGGCAC CACGGGGAAG CCTAAGGGCA TAATTCACGA CACAGGCGGA TACATGACTC TCCTTCATAA CACCATGAAA CTTGTGTTTG ACATTAGGGA TACAGACGTT TTTTGGTGCA CCGCAGACAT AGGATGGGTA ACTGGTCACT CCTACATCGT GTTCGGGCCC CTTCAGGAAG GAGCGACTGA GGTCATGTAC GAGGGAGCCT TGGACTTCCC TGAACCTGAC AGGTGGGTCT CGATCATAGA GAGACATCAG GTCTCAATAC TTTACACTTC ACCCACGGCG ATAAGGACGT TCATGAAGCA GGGAGAGCAA TGGATAAAGA AGCACGACGT TAGCAGCGTA AGGCTAATGC ACTCAGTGGG AGAGCCAATT AACCCTGAGG CGTGGAGATG GTTCCACAAA CTAGTGGGGA GAGGACAAGT TCCCTTTGGT AGCACTTGGT GGATGACTGA AACTGGTGGA ATAATGATAT CACACATGCC TGGTGGATAC CTAGTGCCCA TGAAACCTGG AACCAATGGT CCTCCTCTTC TTGGCATAGA GACTAACGTA TTTGACGAGG AAGGAAAACC CATGCCAGAG GAGCAGAAGG GTTACCTAGT GATTACCAAG CCGTGGCCTG GCATGCCACT CACTATCAAC AAGGATCCAG AAAGGTACGT TAAGGTATAC TGGAACAAGT TTCCCAACGT TTTCTACGCC GGAGATTACG CGATCAAGGA TAGGGATGGT TACTTCTGGA TACTGGGGAG AGCCGACGAG GTAATGAAGA TTGCCGGACA CAGGATTGGA ACGTATGAGC TGGAGTCGGC CCTCGTGCAA CATCCAGCGA TAGCCGAAGC AGCAGTAGTT GGTGTTCCAG ATCCTGTCAG GGGAGAGGTG GCTGAGGCCT TCGTCATCCT CAGATCTGGA GTGGAGCCTA GCGCTAAGCT AAGGGAGGAG ATTGTGAAGT TCGTGAGAGA AAACTTCGGT 
CCCATAGCGG TGTTCAGGGA AATCCACTTC GTCTCGAAGC TACCCAAGAC CAGAAGCGGA AAGATCATGA GGAGGGTAAT CAAGGCGGTT GCAACTAACT CTCCCGTGGG CGACGTCACC ACGCTAGAGG ATGAGGCCTC TGTGGAGGAG GTAAAGAAAG CCTTCCAGGA GCTCAAGGAA CAGGTCGGGA AGTGA
|
Protein sequence | MSQPENESGL PFQEKVVPEL LKHRLVTPEE YLRIHKKTVE NYQEYWESVA KELDWFKPWE KALDDSHPPF YKWFVGGELN ASYLAVDRHA NSWRRNKVAI IWEGEPWENG PKEVRKLTYL DLYREVNRAA YLLKEVYGLK KGDTIGIYLP MIPELPIFML AAARLGVAFT VVFSGFSAQA VADRMNDADT KLLITADGGW RRGKVIPLKE IVDKALETAT TVKNVLVVRR TGTEISMKPG RDAYLHDVMS KVPIKAYVEP ERVKSEDPLY ILYTSGTTGK PKGIIHDTGG YMTLLHNTMK LVFDIRDTDV FWCTADIGWV TGHSYIVFGP LQEGATEVMY EGALDFPEPD RWVSIIERHQ VSILYTSPTA IRTFMKQGEQ WIKKHDVSSV RLMHSVGEPI NPEAWRWFHK LVGRGQVPFG STWWMTETGG IMISHMPGGY LVPMKPGTNG PPLLGIETNV FDEEGKPMPE EQKGYLVITK PWPGMPLTIN KDPERYVKVY WNKFPNVFYA GDYAIKDRDG YFWILGRADE VMKIAGHRIG TYELESALVQ HPAIAEAAVV GVPDPVRGEV AEAFVILRSG VEPSAKLREE IVKFVRENFG PIAVFREIHF VSKLPKTRSG KIMRRVIKAV ATNSPVGDVT TLEDEASVEE VKKAFQELKE QVGK
|
| |
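The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the terminal stop codon (the gene sequence above ends in TGA). Both are assumptions, though they are consistent with the values in this record. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Msed_1353 record.
length_bp = gene_length(1323783, 1325777)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

With the values from this record, the computed lengths match the listed Gene Length (1995 bp) and Protein Length (664 aa).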