Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1291 |
Symbol | |
ID | 5104703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1268035 |
End bp | 1269690 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640507181 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001191374 |
Protein GI | 146304058 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTATT CCTCGGCCTT TGCGACAGCT CTGGAAAGGC CAGATCTGTA CTGGTCGCCC TTTGCCTCAG CACTAAGATG GTTTAAGCCC TGGAATTCAG TCCTAGAGGG CGGGAAGTGG TTCGTGAACG GGGAGACCAA CATTGCGCAT AACGTCCTTT CACATGAGGG CACGGCACTA ATCTGGTATG GTGAAGACGA GAGGAGAGAA ATCACGTACT CCGAGCTGGC CAGGCTGACA GAGAGGGTGG TCAACGTCAT GAAGGACAAG GGAGTGACTC GGGGCGATAG GGTCGCAATC TACATGCCAA ACCTTCCTGA AACCATTGCC TCACTCCTCG CCTGCGCCAA GATGGGAGTC GTGTACTCCG TTATCTTTGC AGGTCTTGGG GAGCAGGCAG TCAAGGCCAG GATTCAGGAC CTCTCACCCA AGCTTGTGCT CACCACGAGG TACACCCAGA GAAGGGGTCA GAGGATTCCT CTCCTCGGTG GTGATGTGAC ACTAGAAAGG AACCTCACCC CCTGGGAAGA TGACTTCACT CTCCCTGAGA GGATTGAGGC AAATGATCCC CTCGTGATAA TGTACACCTC CGGTACCACA GGTAGACCTA AGGGGATAGT TCTACCCCAC GGCTCCTGGA TGGTGGGTCA CTACACCGTG TTCGATATCG TGTTTTCCCT GAGACCTGGT GACGTGGTAT TCACCTCAGC TGACGTGGGA TGGATAACGT TTTCCAGGAT CATGTATGGC ACTCTTCTTC ATGGCGGAAC ACTGGTTTTC ATGGAGGGAG CTCCTGATCA CCCTAGGGAT AGGGTTAGGA AGATCATGGA GAGGGAGAAC CCCAAGGTAT TCTTCACATC CCCAACGCTT CTACGCCTGT TGAGGAGTAT GGACCTATCC CTGCCTCGCG TTGAGTACAT TGCCACTGCA GGCGAGATAA TGGATGAGCC TTCCTGGGAT TACGCCATAA GGTTCGCCGA TAGGGTCACG GATATTTACG GCCAATCCGA GACAGGATAT GTGGTGGGGA CTCCGTTCTC CCTGGGTGTG GAGTCAAGGA AGGGATATGC CGGAGTTCCG TTCCCTGGAG CACTGCTGGA AACAGTTGAT GAGAACGGGA ATAGGGTTGA GGGTGAGGTC GGCCACCTAG TCCTGAAGAG TCCCTTCCCC ACGAAGTTCA TTGGGGTCTG GAGAAATGAA GAGAAGTTCA AGGAGTATCA GAGGTACGGC GGACATGACA CGGGAGACCT GGCAATCGTG GAAGGCGGAT ACGTCAAGAT TGTGGGCAGG AGTGATGACA TGATAAAGGT CGCAGGTCAC AGGATCACCA GCGGAGAGGT GGAGGACGTG GTTTCAAAGG TTCCAGGGGT TAAGGACGCG TCTGCGGTGG GAGTTCCCGA CCCCGTCAAG GGGGAGAAAC TTGTCCTCTT CATCGTGGGA GACGCAGACC CTGAGAGAGT TAAGGCGGAG GTTAGATCCA AACTAGGGCC AATATACGTG GTTGACAGGG TGGTGAGAGT GCCTAGGCTT CCTAAGTCCA GGAGCGGGAA AGTGGTGAGG AGGATCTTGA GAGACCTACT CACGGGAAAG GATGTAGACC CGACGATACT GGAGGACCCT GAGGTGGTGA ACGAGGTCAG GAGAAGTTTA GGCTGA
|
Protein sequence | MDYSSAFATA LERPDLYWSP FASALRWFKP WNSVLEGGKW FVNGETNIAH NVLSHEGTAL IWYGEDERRE ITYSELARLT ERVVNVMKDK GVTRGDRVAI YMPNLPETIA SLLACAKMGV VYSVIFAGLG EQAVKARIQD LSPKLVLTTR YTQRRGQRIP LLGGDVTLER NLTPWEDDFT LPERIEANDP LVIMYTSGTT GRPKGIVLPH GSWMVGHYTV FDIVFSLRPG DVVFTSADVG WITFSRIMYG TLLHGGTLVF MEGAPDHPRD RVRKIMEREN PKVFFTSPTL LRLLRSMDLS LPRVEYIATA GEIMDEPSWD YAIRFADRVT DIYGQSETGY VVGTPFSLGV ESRKGYAGVP FPGALLETVD ENGNRVEGEV GHLVLKSPFP TKFIGVWRNE EKFKEYQRYG GHDTGDLAIV EGGYVKIVGR SDDMIKVAGH RITSGEVEDV VSKVPGVKDA SAVGVPDPVK GEKLVLFIVG DADPERVKAE VRSKLGPIYV VDRVVRVPRL PKSRSGKVVR RILRDLLTGK DVDPTILEDP EVVNEVRRSL G
|
| |