Gene Msed_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1110 
Symbol 
ID5104276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1037798 
End bp1039021 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content43% 
IMG OID640507004 
ProductIS605 family transposase OrfB 
Protein accessionYP_001191197 
Protein GI146303881 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.48551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTTAA GAGTCAAGGT TAATTACGCC ACCTACTCAG CACTCAAAGA AGTTGAGAAA 
GAGTATAGAG AGGTTCTTGA GGAGGCAGTA AATTACGGGC TTGCAAAGAA CACAACGTCA
TTCACTAGGA TTAAGGCTGG GATTTACAAG ACTGAGAGGG AGAAGCACAG GGACTTGCCA
TCCCATTATA TTTACACCGC TTGTGAAGAT GCCAGCGAGA GGCTAGATAG CTTCAATAAG
TTGAAGAAGA GAGGAAGGAG TTACACTGAC AGACCGTCAG TTAGGAGGGT TACTGTTCAT
CTTGATGACC ACTTGTGGAA GTTCAGCCTC AGCGTGATTT CAATTGCTAC TAAGAGGGGT
AGAGTTTTCA TTTCTCCAAT CTTTCCAAAG ATCTTTTGGA GGTATTATAA CGACAATTGG
TTAATTGCTA GTGAGGCTAG GTTTAAACTG TTGAAGGGGA ATGTTGTAGA GTTCTTCATA
GTTTTTAAGA AGGATGTTAA ATCTTACGAT CCAAGGGGTT TTATTCCAGT TGATTTGAAT
GAGGGTTCAG TCTCTGTATT AATTAATGGG AAGCCGATAC TTTTAGAGAC TAACACTAGG
ATGATTACTC TGGGTTATGA GTATAAAAGG AGGTCGATAA CTAATGGTAG GTCTACTAAG
GATAGGGAGG TTAGGAGGAA GTTAAGGAAG TTGAGGGAGA GGGATAAGAA GCTTGATGTT
AGGAGGAAGT TTGCTAAGTT GATTGTTAAG GAGGCTTTTG GGAGTAGGAG TGCTATAGTC
TTGGAGGATT TGCCAAAGAG AGTTCCGGAG CATATGGTTA AGGGCGTGAA GGATAAACAG
CTTAGGTTGA GGATTTATCG TTCGGCATTT TCTTCAATGA AAAATGCTAT TGTTGAGAAG
GCTAGGGAGT TTGGTGTTCC CGTAATCTTG GTTGATCCTT CTTATACTTC TTCTATTTGC
CTTGTTCACG GGTCGAAGAT TATTTATCAA CCCGATGGGG GCTCTGCCCC AAGGGTTGGT
GTTTGTGGGA AGGGAGGAGA GAGGTGGCAT AGGGATGTTG TTGCGTTATA TAATTTGAGG
AAAAGAGCTG GAGATGTGAG CCCCGTGCCG TTGGGCTCGA AGGAGTCCCA TGACCCACCT
GTCGTTAAGG CTGGCAGGTG GTTGAGGGCT AAGTCCCTAC ACTTGATCAT GATTGAAGAT
AAAATGAGTG AAATGAAAGT GTAG
 
Protein sequence
MKLRVKVNYA TYSALKEVEK EYREVLEEAV NYGLAKNTTS FTRIKAGIYK TEREKHRDLP 
SHYIYTACED ASERLDSFNK LKKRGRSYTD RPSVRRVTVH LDDHLWKFSL SVISIATKRG
RVFISPIFPK IFWRYYNDNW LIASEARFKL LKGNVVEFFI VFKKDVKSYD PRGFIPVDLN
EGSVSVLING KPILLETNTR MITLGYEYKR RSITNGRSTK DREVRRKLRK LRERDKKLDV
RRKFAKLIVK EAFGSRSAIV LEDLPKRVPE HMVKGVKDKQ LRLRIYRSAF SSMKNAIVEK
AREFGVPVIL VDPSYTSSIC LVHGSKIIYQ PDGGSAPRVG VCGKGGERWH RDVVALYNLR
KRAGDVSPVP LGSKESHDPP VVKAGRWLRA KSLHLIMIED KMSEMKV