Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1098 |
Symbol | |
ID | 5103572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1024786 |
End bp | 1026351 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640506993 |
Product | hypothetical protein |
Protein accession | YP_001191186 |
Protein GI | 146303870 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTTG AGGACGCTAA GAGTCGTGCG AGTAAGTACG GCGAGGTTGT GGGTCTCATT AGCAGGGTGA CCCCAATCTC CCACGGAAAG GACAATAACC AGATTAGGGC TGAGATCCCC TACGAGGTCT ATCTCAAGAG GAAGTTCCTC ATAGGGAGTT ATGTGGGGAT ATCTATACCC GTGTCAGGGA CTCTCATGCT GGGGAGGATT ACCTCGGTCG AGAGGGCAGA CATCCTTGCA ATCTCCAGGA TCCCAGCTCT CTCCCCTGTT GAAGACGTGT CCGCTATTAC CACCCCTCTC TCCCTGACCA TAGAGCTCCT GTCCGAGAAG GTGGAGAATG AGGTGGTTCC GCCCAGTTCA CCTGTGGACC CGCAGAGTCC TATCTTCGTT CCAAGCCAGG AGTTCATAAA GGAAATGTTG GGGTTACCGC AAGACGGAAT ACCCATAGGG AAGATCGTCG AGGGTTACAG GATCCTTGAC GTACCTGTCA ACCTCACCGA GGAGGCGTTG AGACACCACG TCCTCGTAGT GGGAACTACG GGTGCGGGGA AAACAAACCT TCTTCGCCTC CTGATCACCA GGAGCAGGAT CCCGGTCCTA GGCTTCGACA TTCAGGGAGA TTACGTGAAG ACCATGGCGA AGATTGGGGG AACCGTCCTT GTTCCTGTGA CGAGGGATAT GGGGAAGGTA ACTGAATTCG TTTCGCTCTT CTTGAAGAGG AGCAACCTGC AGGACTTCAG GATATCCCAG GTTGACGGCC AAAGGATCAC GTTGACCAAC GGTGAGAAAA CCTTTCACGT GGAGCTCTTG GGTTTCAGGT TGAGGGAGAC CTACAAGGAG ATTCCAGATG TGTCGCCCCT CTTCTCTGGG CAGGGAGCCT ATTTCTTCAA GTTGATCACA GAGCACTGCC TAACGGAAAT TGACAATTGG ATTGGGGAAT GTGAGGAACT CTTTTCTGAG TTTCATGTTC ATAAGACCAC AGAGGACAAC ATAAGGAGGT CAGTTATTAT GCTGAAGGAG ACTGGCATCC TGGACATACC CCTGGAAAAG GGGTTCCTTG GGGAACCCAA CTACGAGGAC CTAGTAAGAA AGAAGGCAAT AGTGGACTTG AGATGGGTAA TGGAGAAAGG AATTTCCACC GCAACCACCA CGGCCTTCCT CATTGTGGAC AGATTGTTCA GACTGATAGA CGCGAAATAC AAGAATGAGG GGGTTGAGAC TCCTTACCTC CTCGTATTCG ATGAGGCTCA CGAGTACTTC CCGCAGTCAA GGAGGGATGA GGAGAAGGAG GGGCTTGAGA GACTCATCAA CAGAATACTT AGGCTGGGGA GGGTAAGGGG AATGGGCACG GTGTTGGCTA CCCACAGGCC GACGGACCTA AACGACCTCA TCCTTACACT CACGAACACG AAGATCGCCA TGAGGGCTGA CGAGGATGCT CTCGAGAAGA TAGGGATGGA GGAGTACGCG AACATACTGC AGGCCTCACC ACCTGGGTAC GCAGTTATGA GGACTTTCTC CTTGAAGGTC CAGGACCTAG TGTTCAGGAC GGATAAGTAC GAGTAA
|
Protein sequence | MSLEDAKSRA SKYGEVVGLI SRVTPISHGK DNNQIRAEIP YEVYLKRKFL IGSYVGISIP VSGTLMLGRI TSVERADILA ISRIPALSPV EDVSAITTPL SLTIELLSEK VENEVVPPSS PVDPQSPIFV PSQEFIKEML GLPQDGIPIG KIVEGYRILD VPVNLTEEAL RHHVLVVGTT GAGKTNLLRL LITRSRIPVL GFDIQGDYVK TMAKIGGTVL VPVTRDMGKV TEFVSLFLKR SNLQDFRISQ VDGQRITLTN GEKTFHVELL GFRLRETYKE IPDVSPLFSG QGAYFFKLIT EHCLTEIDNW IGECEELFSE FHVHKTTEDN IRRSVIMLKE TGILDIPLEK GFLGEPNYED LVRKKAIVDL RWVMEKGIST ATTTAFLIVD RLFRLIDAKY KNEGVETPYL LVFDEAHEYF PQSRRDEEKE GLERLINRIL RLGRVRGMGT VLATHRPTDL NDLILTLTNT KIAMRADEDA LEKIGMEEYA NILQASPPGY AVMRTFSLKV QDLVFRTDKY E
|
| |