Gene Msed_2185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2185 
Symbol 
ID5105406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2096903 
End bp2098210 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content45% 
IMG OID640508079 
Producttransposase 
Protein accessionYP_001192248 
Protein GI146304932 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0280977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000348884 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGAGGG CCAACATAGT TAAACTAATC GTAGACAAGA AGACGCACGA GAAGCTCAAA 
GAACTCGCAA TCGCTACTGC AAAATGCTGG AACGAAGTGA ACTGGTTAAG AATGCAACAG
TTTAAGAAAG GTGAGAGGGT CGATTTCTCT AAAACAGAAA AGGATGTATA CGAGAAGTAC
AAGCAAATAT TAAAGGTTAA CACACAACAA GTTGCTAGGA AGAACGCTGA GGACTGGAGG
AGTTTCTTCT CATTAATCGA GGAGAAGAAG GAGGGTAAAT TGCCAAGATG GTTTAAACCT
AGACCTCCAG GGTACTGGAA GGATAAAATT GGAAAATACA AGCTGATAAT AATCATTAGA
AAAGACCGTT ACGAGGTTAA TGAGGAAAAG AGGATCATCT ATTTGAAGGA CTTCAAACTC
TCTCTGAGTT TTAAGGGAAA GTTGAAGTGG CACGGGGAAC AAGGTAGGCT GGAAATAATT
TATAATGAGG CTAGGAGGAT TTGGTATGCA CATATACCGG TGGAAGTCCA GAACGATGTG
AAAGCTGAAG GCAAACTAAA GGCTTCCATA GACCTAGGGA TAGTAAACTT AGCAACTGTC
TACGTTGAGG ATGGTAGCTG GTATATTTTC AAAGGTGGTA GTGTTCTCTC TCAGTACGAG
TATTATAGCA AAAGGATTAG CATAGTCCAG AAAACCTTGG CTAGGCATAA GCAGAGGAGG
AGTAGGAAGA TGAAATTATT ATATGAGAAA AGGAAGAGGT TTCTGAAGCA CGCCCTTAAC
AGTATGGTAA GGAAGATAAT GGAGGAGTTG AAGAAGAAGG GTGTGAGCAA GCTTATCATA
GGCTATCCTA AAGAGATAAG TAAGGATCAT GGAAACAAAC TCACGGTTAA CTTCTGGAAC
TACGGTTACA TCATTAGACG TTTTGAGGAG GTTGGGGAGG AGTTAGGTAT TGAAGTGGTT
GAGGTGGACG AGGCGTGGAC TTCTAAGTCT TGCTCCCTAT GCGGGGAAGC CCACCATCGT
GGGCGTATTA AGCGTGGTCT CTATAGGTGT CCCCGCATGG GGAAAGTAAT AAACGCAGAC
TTGAATGGTG CGATAAATAT CCTACATATC CCCGAGTCCC TAGGAGCTGG GAGCGGAGGG
CAACTCCCGG TAAGGGATAG GGGTAATGGG CTGAAGGCCC AGCCCGCGGT CTACCGCTGG
TCGAATGGAG TGGGGTGGGT GTCATCACCC ACTAGCTATG AAGTGATGAA AATGAAGGCG
GTAAACCGCA AACCAATGAA TCGCCCTAAG GGAACCCTCG CCCTTTAG
 
Protein sequence
MKRANIVKLI VDKKTHEKLK ELAIATAKCW NEVNWLRMQQ FKKGERVDFS KTEKDVYEKY 
KQILKVNTQQ VARKNAEDWR SFFSLIEEKK EGKLPRWFKP RPPGYWKDKI GKYKLIIIIR
KDRYEVNEEK RIIYLKDFKL SLSFKGKLKW HGEQGRLEII YNEARRIWYA HIPVEVQNDV
KAEGKLKASI DLGIVNLATV YVEDGSWYIF KGGSVLSQYE YYSKRISIVQ KTLARHKQRR
SRKMKLLYEK RKRFLKHALN SMVRKIMEEL KKKGVSKLII GYPKEISKDH GNKLTVNFWN
YGYIIRRFEE VGEELGIEVV EVDEAWTSKS CSLCGEAHHR GRIKRGLYRC PRMGKVINAD
LNGAINILHI PESLGAGSGG QLPVRDRGNG LKAQPAVYRW SNGVGWVSSP TSYEVMKMKA
VNRKPMNRPK GTLAL