Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1110 |
Symbol | |
ID | 5104276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1037798 |
End bp | 1039021 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640507004 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001191197 |
Protein GI | 146303881 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.48551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGTTAA GAGTCAAGGT TAATTACGCC ACCTACTCAG CACTCAAAGA AGTTGAGAAA GAGTATAGAG AGGTTCTTGA GGAGGCAGTA AATTACGGGC TTGCAAAGAA CACAACGTCA TTCACTAGGA TTAAGGCTGG GATTTACAAG ACTGAGAGGG AGAAGCACAG GGACTTGCCA TCCCATTATA TTTACACCGC TTGTGAAGAT GCCAGCGAGA GGCTAGATAG CTTCAATAAG TTGAAGAAGA GAGGAAGGAG TTACACTGAC AGACCGTCAG TTAGGAGGGT TACTGTTCAT CTTGATGACC ACTTGTGGAA GTTCAGCCTC AGCGTGATTT CAATTGCTAC TAAGAGGGGT AGAGTTTTCA TTTCTCCAAT CTTTCCAAAG ATCTTTTGGA GGTATTATAA CGACAATTGG TTAATTGCTA GTGAGGCTAG GTTTAAACTG TTGAAGGGGA ATGTTGTAGA GTTCTTCATA GTTTTTAAGA AGGATGTTAA ATCTTACGAT CCAAGGGGTT TTATTCCAGT TGATTTGAAT GAGGGTTCAG TCTCTGTATT AATTAATGGG AAGCCGATAC TTTTAGAGAC TAACACTAGG ATGATTACTC TGGGTTATGA GTATAAAAGG AGGTCGATAA CTAATGGTAG GTCTACTAAG GATAGGGAGG TTAGGAGGAA GTTAAGGAAG TTGAGGGAGA GGGATAAGAA GCTTGATGTT AGGAGGAAGT TTGCTAAGTT GATTGTTAAG GAGGCTTTTG GGAGTAGGAG TGCTATAGTC TTGGAGGATT TGCCAAAGAG AGTTCCGGAG CATATGGTTA AGGGCGTGAA GGATAAACAG CTTAGGTTGA GGATTTATCG TTCGGCATTT TCTTCAATGA AAAATGCTAT TGTTGAGAAG GCTAGGGAGT TTGGTGTTCC CGTAATCTTG GTTGATCCTT CTTATACTTC TTCTATTTGC CTTGTTCACG GGTCGAAGAT TATTTATCAA CCCGATGGGG GCTCTGCCCC AAGGGTTGGT GTTTGTGGGA AGGGAGGAGA GAGGTGGCAT AGGGATGTTG TTGCGTTATA TAATTTGAGG AAAAGAGCTG GAGATGTGAG CCCCGTGCCG TTGGGCTCGA AGGAGTCCCA TGACCCACCT GTCGTTAAGG CTGGCAGGTG GTTGAGGGCT AAGTCCCTAC ACTTGATCAT GATTGAAGAT AAAATGAGTG AAATGAAAGT GTAG
|
Protein sequence | MKLRVKVNYA TYSALKEVEK EYREVLEEAV NYGLAKNTTS FTRIKAGIYK TEREKHRDLP SHYIYTACED ASERLDSFNK LKKRGRSYTD RPSVRRVTVH LDDHLWKFSL SVISIATKRG RVFISPIFPK IFWRYYNDNW LIASEARFKL LKGNVVEFFI VFKKDVKSYD PRGFIPVDLN EGSVSVLING KPILLETNTR MITLGYEYKR RSITNGRSTK DREVRRKLRK LRERDKKLDV RRKFAKLIVK EAFGSRSAIV LEDLPKRVPE HMVKGVKDKQ LRLRIYRSAF SSMKNAIVEK AREFGVPVIL VDPSYTSSIC LVHGSKIIYQ PDGGSAPRVG VCGKGGERWH RDVVALYNLR KRAGDVSPVP LGSKESHDPP VVKAGRWLRA KSLHLIMIED KMSEMKV
|
| |