Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1456 |
Symbol | |
ID | 5104826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1424552 |
End bp | 1426537 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507344 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001191537 |
Protein GI | 146304221 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02188] acetate--CoA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTATGC GATATATTAT GGTTGAGGAA CAGACCCTGA AGACCGGGTC ACAGGAACTA GAGGAGAAGG CAGACTATAA CATGAGATAT TACGCTCACC TCATGAAGTT GAGTAAGGAA AAACCTGCAG AGTTCTGGGG ATCTCTAGCA CAGGACCTGC TAGACTGGTA TGAGCCTTGG AAGGAGACCA TGAGACAGGA AGACCCGATG ACAAGGTGGT TCATAGGAGG TAAGATAAAT GCCTCGTACA ACGCTGTCGA CAGACACCTC AACGGCCCCA GAAAGTTCAA GGCTGCGGTC ATCTGGGAAA GTGAGTTAGG GGAAAGGAAG ATCGTGACGT ATCAGGACAT GTTCTATGAG GTTAATAGGT GGGCCAATGC GCTCAGATCC CTAGGAGTTG GTAAAGGGGA TAGGGTGACC ATATACATGC CCCTGACCCC AGAGGGAATA GCTGCAATGC TGGCCTCGGC CAGGATAGGT GCAATTCATA GCGTAATATT TGCCGGCTTT GGTTCGCAAG CCATAGCCGA CAGGGTTGAG GACGCCAAGG CGAAGGTAGT GATCACTGCT GACGCCTATC CCAGAAGGGG AAAGGTTGTG GAGTTAAAGA AGACTGTCGA CGAGGCCTTA AACTCCCTTG GAGAAAGGAG CCCAGTACAG CACGTGCTCG TGTATAGGAG GATGAAAACG GATGTAAACA TGAAGGAGGG AAGAGACGTT TTCTTCGACG AGGTCGGCAA GTACAGGTAC GTGGAGCCTG AAAGGATGGA CTCCAATGAT CCACTCTTCA TTCTCTACAC CTCTGGGACC ACCGGTAAAC CTAAGGGAAT TATGCACTCT ACCGGTGGTT ATCTGACCGG GACAGCCGTT ATGCTACTGT GGAGCTACGG CCTTAGCCAG GAGAACGACG TTCTCTTCAA CACCTCAGAT ATTGGTTGGA TAGTTGGCCA CTCCTACATT ACCTATTCCC CCCTTATCAT GGGGAGAACG GTTGTCATTT ACGAGAGCGC CCCAGACTAT CCCTACCCAG ACAAGTGGGC TGAGATTATT GAGAGATACA GGGCAACCAC TTTCGGCACC TCAGCTACAG CCTTGCGTTA CTTCATGAAG TATGGGGACG AATACGTGAA GAACCACGAT CTCTCGTCCA TCAGGATAAT TGTGACGAAC GGGGAAGTGC TTAACTACTC TCCGTGGAAG TGGGGGCTAG AAGTGTTAGG TGGAGGAAAG GTATTCATGT CCCATCAGTG GTGGCAAACT GAGACAGGCG CACCGAACCT GGGCTACCTT CCGGGTATAA TTTACATGCC AATGAAGTCG GGTCCAGCCT CAGGCTTCCC TCTACCCGGT AACTTCGTGG AGGTTCTGGA CGAGAACGGA AATCCCTCTG CCCCTAGAGT GAGAGGATAC CTTGTAATGA GGCCACCCTT CCCGCCTAAC ATGATGATGG GGATGTGGAA CGATAATGGG GAGAGGTTGA AGAAGACGTA CTTTAGCAAG TTCGGTTCCC TGTATTATCC AGGAGACTTC GCCATGGTGG ATGAGGATGG ATACATCTGG GTGTTGGGTA GGGCAGACGA GACTCTAAAA ATTGCAGCCC ACAGAATTGG AGCTGGGGAA GTGGAATCAG CAATCACTTC TCACCCATCG GTTGCCGAGG CAGCAGTCAT AGGCGTGCCA GACTCAGTGA AAGGAGAAGA GGTTCACGCG TTCGTTGTGC TAAAGCAAGG TTACGCTCCT TCCTCTGAAC TGGCTAAGGA CATACAGTCA CACGTTAGGA AGGTCATGGG GCCCATTGTT AGTCCGCAGA TTCATTTCGT GGATAAGTTG CCTAAGACAA GGTCTGGGAA GGTCATGAGA AGGGTGATAA AGGCAGTGAT GATGGGTTCG AGTGCTGGCG ACTTAACCAC CATAGAGGAC GAAGCATCAA TGGACGAAAT AAAGAAGGCT GTCGAGGAAC TAAAGAAGGA GTTAAAGACC TCCTAG
|
Protein sequence | MFMRYIMVEE QTLKTGSQEL EEKADYNMRY YAHLMKLSKE KPAEFWGSLA QDLLDWYEPW KETMRQEDPM TRWFIGGKIN ASYNAVDRHL NGPRKFKAAV IWESELGERK IVTYQDMFYE VNRWANALRS LGVGKGDRVT IYMPLTPEGI AAMLASARIG AIHSVIFAGF GSQAIADRVE DAKAKVVITA DAYPRRGKVV ELKKTVDEAL NSLGERSPVQ HVLVYRRMKT DVNMKEGRDV FFDEVGKYRY VEPERMDSND PLFILYTSGT TGKPKGIMHS TGGYLTGTAV MLLWSYGLSQ ENDVLFNTSD IGWIVGHSYI TYSPLIMGRT VVIYESAPDY PYPDKWAEII ERYRATTFGT SATALRYFMK YGDEYVKNHD LSSIRIIVTN GEVLNYSPWK WGLEVLGGGK VFMSHQWWQT ETGAPNLGYL PGIIYMPMKS GPASGFPLPG NFVEVLDENG NPSAPRVRGY LVMRPPFPPN MMMGMWNDNG ERLKKTYFSK FGSLYYPGDF AMVDEDGYIW VLGRADETLK IAAHRIGAGE VESAITSHPS VAEAAVIGVP DSVKGEEVHA FVVLKQGYAP SSELAKDIQS HVRKVMGPIV SPQIHFVDKL PKTRSGKVMR RVIKAVMMGS SAGDLTTIED EASMDEIKKA VEELKKELKT S
|
| |