Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1015 |
Symbol | |
ID | 5104318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 935268 |
End bp | 936764 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640506914 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001191107 |
Protein GI | 146303791 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00519801 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATTAGAG GCCCAATTTT ACCCGACCTC AGACCTAGAA CCCAGAGCGA TATCCTAGAG TCCTCTGGGG AAGGCGTGGC CATAAACTTC CTTGGCAACA GGATAAGTTA CCCCGAGCTG AGGGGAATGG TGGAGAGCGT TTCCTCCCAA CTGGAAATTG GCCGGGGAGA CGTGGTAATC CTCTCCACCC AGAACATACC GCAGTTCGTC ATTGCCGAGT ACGCGGTGTG GAGGAAGGGA GGGATTGTGT TGCCCGTTAA TCCCAGTTAC ACGCAAGCTG AACTGGACTA CTTGGCAAGG GACTCTGGGG CGAAGCTCGT GATCGCGTCA TGTGAGTCCA ACGTTCCCTC GAACTTGCCT GTGATCAGGA CCAATCCTCA CACCTTTCAC AAGGTGGAAG GGTGGAATAT CCCGGACTGC GAAGAGGAAC TCAACCTCAA GTCGGGGAGA GGGGACAGGG TGAACTACTC CCCCCAAGAG GTCGCGGTCC TGATGTACAC CTCGGGAACA ACGGGGAAGC CCAAGGGAGT TCCGATTACG CACTCCAACC TTTACGCCTC CTCTCTCATC TACGTGAGGT GGTTCCAGTT CACGGGACGG GACAAGGTCC TTGGGATCGC ACCATTCTTC CACGTGACTG GGCAGGTCTT CCACGTGACC ACGCCCGTGA TGGCTGGGTC GCAGATAGTA GCAACTTTCA GGTTCGATCC CAGGTCAGCA CTTAGGACAG TTCAAGAGGA GAGGACCACG GTAACCATGA GCGTTGCCAC AGCGTATAGG GCCATGCTCA ACTCCTACTC TGGGGAAGAC CTAACGTCGA TGAGGTTATG GTCCTCTGGC GGGATGCCAA TGCCTCGAGC CCTAGAGGAG GAGTGGAAAA GGTTGACAGG TTCCTGGATC TATATGGCCT GGGGCCTCAC GGAGACCACA TCACCGGCCA CGCTGTGGCC TTACCCCTAC TCAGGCGAGC TACCGGTTAA CGAAATGGGT GTAGTGAGCT CTGGGATGCC CGTGTACAAC ACAGAGATCG AGTTGGAGGA CGGCGAGCTC CTGGTGAGGG GTCCTCAAGT CGTGAAGGGT TACTGGAAAC AGGAGGAGTT CAAGGACGGA TGGCTTCACA CAGGGGACAT TGGCGAGATA AGAGATGGTT GGGTTTACGT AATAGACAGG AAGAAGGATG TCATAGTTAC CTCGGGCTTC AAGGTAATGC CGAGAGAGGT GGAGGAAGTT CTTCATCTTC ACCCTGGGGT TGACGAGGCA GTTGTCGTGG GTATACCGGA CGAGTACAGG GGCGAGCGGG TAGTAGCCTT CGTGAAACCT AGACCGGGAG CCAAACTGAA CCTCGAGGAA CTTAAAGAGT TCTGTAGGAC AAGGCTAGCC CCATACAAGG TCCCCAGAGA GATCAGACTT GTGGACGAGA TTCCGAAGAC AGGTTCAGGC AAGATTATGA GGAGAGCCTT CAAGGAGGAG AGGTCACCAA GTCATAGTAA CAGTTAA
|
Protein sequence | MIRGPILPDL RPRTQSDILE SSGEGVAINF LGNRISYPEL RGMVESVSSQ LEIGRGDVVI LSTQNIPQFV IAEYAVWRKG GIVLPVNPSY TQAELDYLAR DSGAKLVIAS CESNVPSNLP VIRTNPHTFH KVEGWNIPDC EEELNLKSGR GDRVNYSPQE VAVLMYTSGT TGKPKGVPIT HSNLYASSLI YVRWFQFTGR DKVLGIAPFF HVTGQVFHVT TPVMAGSQIV ATFRFDPRSA LRTVQEERTT VTMSVATAYR AMLNSYSGED LTSMRLWSSG GMPMPRALEE EWKRLTGSWI YMAWGLTETT SPATLWPYPY SGELPVNEMG VVSSGMPVYN TEIELEDGEL LVRGPQVVKG YWKQEEFKDG WLHTGDIGEI RDGWVYVIDR KKDVIVTSGF KVMPREVEEV LHLHPGVDEA VVVGIPDEYR GERVVAFVKP RPGAKLNLEE LKEFCRTRLA PYKVPREIRL VDEIPKTGSG KIMRRAFKEE RSPSHSNS
|
| |