Gene Information |
Locus tag | Msed_0401 |
Symbol | |
ID | 5105518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 351598 |
End bp | 353205 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506307 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001190502 |
Protein GI | 146303186 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
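The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the protein length counts every codon except a terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Msed_0401.
length = gene_length_bp(351598, 353205)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1608 535
```

Both results agree with the Gene Length (1608 bp) and Protein Length (535 aa) fields listed in the record.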
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATATT ATAACTACAA TTTAACGGTG GATAAGCTCC TCGACTACGG GTCCACCGTG TACTCAAGCA AGGAGATCGT GTATAGGGAT AGGAGGCGTT ACACATTTTC CAAGTTTAGG GAGAGAGTAG AAGCCTTCGC TCAATCGCTT CTTAAGATTG GCGTGAAGCC AGGTGACAAG GTGGCTGTAG TTGACTGGGA CACAGACGTT TACATGACAG CATATTACGC CGTGCCCATG ATAGGGGCTG TCCTGCACAC AGTTAACGTT AGATATCCCC CTGAGGTCAT GCTTAAGACG GTGCTTCACG CTGAGGATAA ATGGGCCATT GTCAGGGATG AGTTTACCCC GCTCCTCGAG AAGGGAAAGG CCTTTCTTGG CGGGCTGAAG GGAGTGATCA CATATTCAGA TGAGAGAGAG AAGGTTAAGA CTAGTTTCCC TACCCACGAT TTCTGGGAGC TCCAGGAGAG CGGTGAAAGG TTGCCTCCCC AGGAGATCAA GGAGGACATG CAGGCGACCG TTTTCTATAC CTCTGGTACC ACAGGTGAGC CGAAGGGTAT ATGGTTCACG CACAGGGACC TGGTTCTTCA TTCCATGAGC GTGGCGCTCA CCACATCTAA GCCTCCACTT AGGTCGTCAC AGGACGACAA CTTCATGATT CTCGTTCCCA TGTTCCACGT TCACCAGTGG GGGTTCCCCT ACGTCACGAT GCTCGTGGGG GCAAATTATG TTCTGCCTGG TAGGTATGAC CCAGCGATGG AGATAGAACT CATGAGAAAG GAACACGTTA CATTCTCCGC CATGGTTCCC ACAATCCTTT ACATGATTCT ATCTCACCCC AACGCGTCCA AATATCCAGA GGTGTTTAAG GGTTGGAAGG TTATGATAGG TGGGTCGGCT TTGCCCACTG AACTGGCATA TGCAGCTAGG AAGATGGGGA TAAACATTGC CGTAGGATAT GGGATGTCCG AGACAGCACC TGTGCTAACT GTGGCCTACT ACACACCCGA GGTGGAGAAG TTGCCAGAGG AGGATAGGTT CCTCCAACAG ATAAAGACAG GAGTACCCAT ACCCCTGTCT CAGGTAAGGG TAGTTGATGA GAAGGGAAAC GACGTTCCTA GGGACGAGAA GACTGTGGGC GAGATAGTTG CCAGGGCGCC TTGGCTAACG AGGTCGTATT ATAAGGATCC CGAGAGGACC GAAAAGCTCT GGAAGGACTC ATGGCTTCAT ACTGGAGACC TGGCGGTTGT GGACAAGTAT GGATACATCA GGATCGTGGA CAGGGATAAG GACGCCATAA AGAGCGGAGG CGAGTTCATC CCCTCGCTTA TACTTGAGGA CATCATCTCC ACCCATCCTA AGGTAGGAGA GGTAGCCATT GTGGGGATGA AGGACGAGAA GTGGGGAGAG AGACCTGTGG CATTCATTGT TCCCAAGGGA GATCTAAAGG AAGAGGAGAT AAGACAGTTC CTATTGACTA AGGTCGAGGA AGGGAAGTTG CAGAAGTGGT GGATTCCAGA CAGGTTCGTC TTCGTAAAGG AATTCCCGAA GACTTCCACT AATAAGATAG ATAAAAAGGC TCTACGTAAT CAACTAAGCT CTGGTTAA
|
Protein sequence | MGYYNYNLTV DKLLDYGSTV YSSKEIVYRD RRRYTFSKFR ERVEAFAQSL LKIGVKPGDK VAVVDWDTDV YMTAYYAVPM IGAVLHTVNV RYPPEVMLKT VLHAEDKWAI VRDEFTPLLE KGKAFLGGLK GVITYSDERE KVKTSFPTHD FWELQESGER LPPQEIKEDM QATVFYTSGT TGEPKGIWFT HRDLVLHSMS VALTTSKPPL RSSQDDNFMI LVPMFHVHQW GFPYVTMLVG ANYVLPGRYD PAMEIELMRK EHVTFSAMVP TILYMILSHP NASKYPEVFK GWKVMIGGSA LPTELAYAAR KMGINIAVGY GMSETAPVLT VAYYTPEVEK LPEEDRFLQQ IKTGVPIPLS QVRVVDEKGN DVPRDEKTVG EIVARAPWLT RSYYKDPERT EKLWKDSWLH TGDLAVVDKY GYIRIVDRDK DAIKSGGEFI PSLILEDIIS THPKVGEVAI VGMKDEKWGE RPVAFIVPKG DLKEEEIRQF LLTKVEEGKL QKWWIPDRFV FVKEFPKTST NKIDKKALRN QLSSG
|
| |
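The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, under stated assumptions, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It uses the standard codon assignments, which translation table 11 shares for elongation; alternative start codons, which table 11 also allows, are not handled. The helper names are hypothetical.

```python
BASES = "TCAG"
# Standard amino-acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean_sequence(raw: str) -> str:
    """Remove whitespace between display blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two display blocks of the gene sequence from the record.
start = clean_sequence("ATGGGATATT ATAACTACAA")
print(translate(start))  # MGYYNY
```

Passing the full Gene sequence field through `clean_sequence` and `translate` and comparing the result with the cleaned Protein sequence field is one way to confirm that the two fields are consistent.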