Gene Information |
Locus tag | Msed_2185 |
Symbol | |
ID | 5105406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2096903 |
End bp | 2098210 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640508079 |
Product | transposase |
Protein accession | YP_001192248 |
Protein GI | 146304932 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
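
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2096903, 2098210)
    print(length, protein_length(length))
```

With the values listed for Msed_2185, this gives 1308 bp and 435 aa, in agreement with the Gene Length and Protein Length fields.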
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0280977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000348884 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAGAGGG CCAACATAGT TAAACTAATC GTAGACAAGA AGACGCACGA GAAGCTCAAA GAACTCGCAA TCGCTACTGC AAAATGCTGG AACGAAGTGA ACTGGTTAAG AATGCAACAG TTTAAGAAAG GTGAGAGGGT CGATTTCTCT AAAACAGAAA AGGATGTATA CGAGAAGTAC AAGCAAATAT TAAAGGTTAA CACACAACAA GTTGCTAGGA AGAACGCTGA GGACTGGAGG AGTTTCTTCT CATTAATCGA GGAGAAGAAG GAGGGTAAAT TGCCAAGATG GTTTAAACCT AGACCTCCAG GGTACTGGAA GGATAAAATT GGAAAATACA AGCTGATAAT AATCATTAGA AAAGACCGTT ACGAGGTTAA TGAGGAAAAG AGGATCATCT ATTTGAAGGA CTTCAAACTC TCTCTGAGTT TTAAGGGAAA GTTGAAGTGG CACGGGGAAC AAGGTAGGCT GGAAATAATT TATAATGAGG CTAGGAGGAT TTGGTATGCA CATATACCGG TGGAAGTCCA GAACGATGTG AAAGCTGAAG GCAAACTAAA GGCTTCCATA GACCTAGGGA TAGTAAACTT AGCAACTGTC TACGTTGAGG ATGGTAGCTG GTATATTTTC AAAGGTGGTA GTGTTCTCTC TCAGTACGAG TATTATAGCA AAAGGATTAG CATAGTCCAG AAAACCTTGG CTAGGCATAA GCAGAGGAGG AGTAGGAAGA TGAAATTATT ATATGAGAAA AGGAAGAGGT TTCTGAAGCA CGCCCTTAAC AGTATGGTAA GGAAGATAAT GGAGGAGTTG AAGAAGAAGG GTGTGAGCAA GCTTATCATA GGCTATCCTA AAGAGATAAG TAAGGATCAT GGAAACAAAC TCACGGTTAA CTTCTGGAAC TACGGTTACA TCATTAGACG TTTTGAGGAG GTTGGGGAGG AGTTAGGTAT TGAAGTGGTT GAGGTGGACG AGGCGTGGAC TTCTAAGTCT TGCTCCCTAT GCGGGGAAGC CCACCATCGT GGGCGTATTA AGCGTGGTCT CTATAGGTGT CCCCGCATGG GGAAAGTAAT AAACGCAGAC TTGAATGGTG CGATAAATAT CCTACATATC CCCGAGTCCC TAGGAGCTGG GAGCGGAGGG CAACTCCCGG TAAGGGATAG GGGTAATGGG CTGAAGGCCC AGCCCGCGGT CTACCGCTGG TCGAATGGAG TGGGGTGGGT GTCATCACCC ACTAGCTATG AAGTGATGAA AATGAAGGCG GTAAACCGCA AACCAATGAA TCGCCCTAAG GGAACCCTCG CCCTTTAG
|
Protein sequence | MKRANIVKLI VDKKTHEKLK ELAIATAKCW NEVNWLRMQQ FKKGERVDFS KTEKDVYEKY KQILKVNTQQ VARKNAEDWR SFFSLIEEKK EGKLPRWFKP RPPGYWKDKI GKYKLIIIIR KDRYEVNEEK RIIYLKDFKL SLSFKGKLKW HGEQGRLEII YNEARRIWYA HIPVEVQNDV KAEGKLKASI DLGIVNLATV YVEDGSWYIF KGGSVLSQYE YYSKRISIVQ KTLARHKQRR SRKMKLLYEK RKRFLKHALN SMVRKIMEEL KKKGVSKLII GYPKEISKDH GNKLTVNFWN YGYIIRRFEE VGEELGIEVV EVDEAWTSKS CSLCGEAHHR GRIKRGLYRC PRMGKVINAD LNGAINILHI PESLGAGSGG QLPVRDRGNG LKAQPAVYRW SNGVGWVSSP TSYEVMKMKA VNRKPMNRPK GTLAL
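
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it assumes NCBI translation table 11 (same amino acid assignments as the standard code, with TTG, CTG, ATT, ATC, ATA, ATG and GTG accepted as start codons and read as M in the first position), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in T, C, A, G order,
# third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons of translation table 11 (assumed from the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("GTGAAGAGGG CCAACATAGT TAAACTAATC"))
```

Applied to the first three 10-base groups of the gene sequence, the routine returns MKRANIVKLI, which matches the first group of the listed protein sequence; the GTG start codon is read as M rather than V.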