Gene Msed_1513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1513 
Symbol 
ID5104042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1476530 
End bp1477747 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content47% 
IMG OID640507401 
ProductIS605 family transposase OrfB 
Protein accessionYP_001191594 
Protein GI146304278 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0963227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.774713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAATCAA AAGGCAACAA GGTAACCCTC TCATTCAAGT ACAGGGCATA CCCAACACCA 
GAGGTAGAGA GAAAACTCAT CAACACAATG GAAATAGAGG CGAAAGTGTA CAATAAACTA
CTGAACTACA TCGCAGAGAG AAGAAAACAA GGGATTAAGG TAACACAGCT GGACACTCAA
AAGCTACTCA AGGACATGGA CGAAAAACAC GAGGTGTACT CTAAAGCCCT ACAAATGGTC
AACAACGTCC TGTGGTACAA CATCAACGCG TTAGCTAAAC TGAAGAGAAA TGGAAAGAAA
ATCGGAAAAC TTAGACACAA GAAGGCATTC AAGATAGTCT GGTACAACCA ATCTGGGTTC
AAACTACAAG GGGATAAACT CCATCTATCC AAGATAGGGG AGATCAAACT CCTTCTACAT
AGGCCAATAC AAGGAGAAGT GAAAGGGGTC ATCCTGAAGA GAAGCAAGAC AAACAAGTGG
TACGCTATCT TTCAAGTTGA GCAGGAGAAA CAACCCCTAG AGAGGACCGG TAGAGTCGTG
GGGATTGATC TAGGCGTGGA GAAATTTGTT ACAACGAGTG ATGGTGACGT AATTGAGAAC
CCGAAACTCC TAGATAGGAG GGAAGAGAGG ATCAAATTGT TACAGAGGAG ACTATCGAGA
AAAAGAAGGG GTTCAAGGAA TTACGAAAAG GCCAGGGCAA AGCTCGCTAT GGCTTACGAA
AGGCTTGAGA ACACTCTGAA TGATTACATA CACAAGATAA CAACGTGGTT AGTCAAGGAT
CATGACGTAA TAGTTGTTGA GAAACTGAAC ACACGAGAAA TGGTACAGGA CTCCCTCGGC
AGGTTGAGGA AGCACATCCT TTACTCCAGT TTCTCCACCT TCCTCCATCA CCTCTCCTAC
AAGGCTGAAA GAGCTGGTAG GAGGGTGGTC GAGGTGGATC CGGCATATAC ATCGCAGACC
TGTTCCAGGT GTGGGTACAG GGTAAAACTT AGCCTATCTG ATAGGGTGTT TCGATGCCCT
AGCTGTGGCC TTGTAATTGA TCGCGATTAC AACGCGTCCC TAAACATCCT GAAACGCGGG
GTTGGGACTG CCCCTCTGCC TGTGGAGGGG GAACCTCTAC TGTTCACCTT TCATGAGGTG
GTGTACAGCA AGTTCCCTCA GAGAAGCAGG AAATCCTCAC CGCGAGGTGG GGATGCTCCG
TCCGTAAGGG CGGAGTAG
 
Protein sequence
MKSKGNKVTL SFKYRAYPTP EVERKLINTM EIEAKVYNKL LNYIAERRKQ GIKVTQLDTQ 
KLLKDMDEKH EVYSKALQMV NNVLWYNINA LAKLKRNGKK IGKLRHKKAF KIVWYNQSGF
KLQGDKLHLS KIGEIKLLLH RPIQGEVKGV ILKRSKTNKW YAIFQVEQEK QPLERTGRVV
GIDLGVEKFV TTSDGDVIEN PKLLDRREER IKLLQRRLSR KRRGSRNYEK ARAKLAMAYE
RLENTLNDYI HKITTWLVKD HDVIVVEKLN TREMVQDSLG RLRKHILYSS FSTFLHHLSY
KAERAGRRVV EVDPAYTSQT CSRCGYRVKL SLSDRVFRCP SCGLVIDRDY NASLNILKRG
VGTAPLPVEG EPLLFTFHEV VYSKFPQRSR KSSPRGGDAP SVRAE