Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1040 |
Symbol | |
ID | 5104339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 965266 |
End bp | 966531 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506936 |
Product | AAA ATPase |
Protein accession | YP_001191129 |
Protein GI | 146303813 |
COG category | [R] General function prediction only |
COG ID | [COG1373] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATGA TATCGCAGAT GAGCTTTCAA AACCCCTGGT GGACTCAACC CTCATCCATT GATGATGACG ATCACGTGAG GAGGGCAAAG TATTACCTCC CACCCGTGAG GGAGAACCTG CTTATCCTAG GTCCGAGACA GGTGGGGAAA ACTACTTACA TGAAGACCGT GATCAGGGAT CTACTGAGGG AGGTGGAGCC CAGGAAGGTG TTCTATTTCT CCTGCGACTC ACTCTCCAGG AAGGACGAGT TAATCCAGCT ACTTAACGAG TATCGAACCC TTGTGAACGG AGATGAGGCC TTCATATTTC TCGACGAGAT CACGTCAGTA GATGCGTGGA ACATGGGCCT TCTTCACCTC TTTAACGCAG GTTATTTCAG AAACTCCTTG GTTTACGTGT CTGGATCCTC CTCTCTTAAC CTGAGTAGGG AAACTCTCCC GGGTAGACCG CTCAAGAAGG TCGTGTATTA TCCGCTCAAC TTTAGGGTTT ACTTTGACCT TTTTACACGG AAATTGGACG TCCCCACACT CCCCGTGACC AGTCCCCATG AGATCATGAA GGAGGCGAAA AAGCTACTAC CACACCTCTC GGCCCTCAAC AAGGCCCTAT TAAGTTACGT TGAAAGGGGA GGATTCTTCG CCACAAATCT AAGCTCTGCC TCGCTGTATG AAACGTATAG GGACACCGTT CTAAGCGAGA TCGCGAAGAC TGGGAGGAGT GAGGCCCTCT TCAAGCAGGT GATTTCCAGG ATAATCGAGA GTTATGGTAG CAGAATTTCA GACAACGGGA TATCCAAGGA GATTTCGGCA TCCCACACGA CGGTATCTGA ATACCTGGAG CTATTGGAGA GGTTGTTCAT TACGAGAACC TATAGGAAAT GGGAAAATGG GAGGGTGAAC TATAGGTCCT TAAAGAAGGT CTACATGATA GATCCCTTCC TTTTTAGGGT AATGAAGAGG TATTCCCTGG GGAAGGACCT GGAGACGGAG GACATACCCC ACGTGATCGA GGGAATAGTT GGGGAGCACC TATCTAGGGA GTACGCAGAG AGCCTCTTCA CCTTCTTCAA GGACGGTAGA GAGATCGACT TTCTAGTTAG GGGGATTGGG ATTGAGGTTA AATGGAGTGA ACGGGTGAGG TCTAGGCCTA AAGCACCAGA GTACGTTCTT ACCATGGACG AGTTTGATGA GGAAAGGAGG TTAATTCCCG TGTCCCTATT CCTTTACCTC ATTTCCTCGG ACAAGGTGTT TTACGACCTG GGTTAG
|
Protein sequence | MSMISQMSFQ NPWWTQPSSI DDDDHVRRAK YYLPPVRENL LILGPRQVGK TTYMKTVIRD LLREVEPRKV FYFSCDSLSR KDELIQLLNE YRTLVNGDEA FIFLDEITSV DAWNMGLLHL FNAGYFRNSL VYVSGSSSLN LSRETLPGRP LKKVVYYPLN FRVYFDLFTR KLDVPTLPVT SPHEIMKEAK KLLPHLSALN KALLSYVERG GFFATNLSSA SLYETYRDTV LSEIAKTGRS EALFKQVISR IIESYGSRIS DNGISKEISA SHTTVSEYLE LLERLFITRT YRKWENGRVN YRSLKKVYMI DPFLFRVMKR YSLGKDLETE DIPHVIEGIV GEHLSREYAE SLFTFFKDGR EIDFLVRGIG IEVKWSERVR SRPKAPEYVL TMDEFDEERR LIPVSLFLYL ISSDKVFYDL G
|
| |