Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1513 |
Symbol | |
ID | 5104042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1476530 |
End bp | 1477747 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507401 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001191594 |
Protein GI | 146304278 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0963227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.774713 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAATCAA AAGGCAACAA GGTAACCCTC TCATTCAAGT ACAGGGCATA CCCAACACCA GAGGTAGAGA GAAAACTCAT CAACACAATG GAAATAGAGG CGAAAGTGTA CAATAAACTA CTGAACTACA TCGCAGAGAG AAGAAAACAA GGGATTAAGG TAACACAGCT GGACACTCAA AAGCTACTCA AGGACATGGA CGAAAAACAC GAGGTGTACT CTAAAGCCCT ACAAATGGTC AACAACGTCC TGTGGTACAA CATCAACGCG TTAGCTAAAC TGAAGAGAAA TGGAAAGAAA ATCGGAAAAC TTAGACACAA GAAGGCATTC AAGATAGTCT GGTACAACCA ATCTGGGTTC AAACTACAAG GGGATAAACT CCATCTATCC AAGATAGGGG AGATCAAACT CCTTCTACAT AGGCCAATAC AAGGAGAAGT GAAAGGGGTC ATCCTGAAGA GAAGCAAGAC AAACAAGTGG TACGCTATCT TTCAAGTTGA GCAGGAGAAA CAACCCCTAG AGAGGACCGG TAGAGTCGTG GGGATTGATC TAGGCGTGGA GAAATTTGTT ACAACGAGTG ATGGTGACGT AATTGAGAAC CCGAAACTCC TAGATAGGAG GGAAGAGAGG ATCAAATTGT TACAGAGGAG ACTATCGAGA AAAAGAAGGG GTTCAAGGAA TTACGAAAAG GCCAGGGCAA AGCTCGCTAT GGCTTACGAA AGGCTTGAGA ACACTCTGAA TGATTACATA CACAAGATAA CAACGTGGTT AGTCAAGGAT CATGACGTAA TAGTTGTTGA GAAACTGAAC ACACGAGAAA TGGTACAGGA CTCCCTCGGC AGGTTGAGGA AGCACATCCT TTACTCCAGT TTCTCCACCT TCCTCCATCA CCTCTCCTAC AAGGCTGAAA GAGCTGGTAG GAGGGTGGTC GAGGTGGATC CGGCATATAC ATCGCAGACC TGTTCCAGGT GTGGGTACAG GGTAAAACTT AGCCTATCTG ATAGGGTGTT TCGATGCCCT AGCTGTGGCC TTGTAATTGA TCGCGATTAC AACGCGTCCC TAAACATCCT GAAACGCGGG GTTGGGACTG CCCCTCTGCC TGTGGAGGGG GAACCTCTAC TGTTCACCTT TCATGAGGTG GTGTACAGCA AGTTCCCTCA GAGAAGCAGG AAATCCTCAC CGCGAGGTGG GGATGCTCCG TCCGTAAGGG CGGAGTAG
|
Protein sequence | MKSKGNKVTL SFKYRAYPTP EVERKLINTM EIEAKVYNKL LNYIAERRKQ GIKVTQLDTQ KLLKDMDEKH EVYSKALQMV NNVLWYNINA LAKLKRNGKK IGKLRHKKAF KIVWYNQSGF KLQGDKLHLS KIGEIKLLLH RPIQGEVKGV ILKRSKTNKW YAIFQVEQEK QPLERTGRVV GIDLGVEKFV TTSDGDVIEN PKLLDRREER IKLLQRRLSR KRRGSRNYEK ARAKLAMAYE RLENTLNDYI HKITTWLVKD HDVIVVEKLN TREMVQDSLG RLRKHILYSS FSTFLHHLSY KAERAGRRVV EVDPAYTSQT CSRCGYRVKL SLSDRVFRCP SCGLVIDRDY NASLNILKRG VGTAPLPVEG EPLLFTFHEV VYSKFPQRSR KSSPRGGDAP SVRAE
|
| |