Gene Ssol_2771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2771 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2535348 
End bp2536568 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content44% 
IMG OID 
Producttransposase, IS605 OrfB family 
Protein accessionACX92854 
Protein GI261603251 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.680266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGTTAA GGGTTAAGGT TGATTATTCT ACATACTCAG CACTTAAGGA GGTCGAGAAG 
GAGTACAGAG AGGTTCTAGA GGACGCAATA AATTATGGGC TGTCAAACAA AACTACCTCC
TTCACCAGGA TTAAAGCTGG AGTTTACAAG ACTGAGAGGG AGAAACATAA GGACTTACCC
TCCCATTATA TTTACACAGC TTGTGAGGAT GCAAGCGAGA GATTAGACAG TTTTGAGAAG
TTAAAGAAGA GAGGTAGGAG TTACACTGAG AAACCGTCAG TGAGGAGAGT TACTATTCAC
CTCGACGATC ATCTGTGGAA GTTCAACCTC GACACGATTT CAATTTCCAC AAAGAGGAGT
AGGATTCTCA TTTCACCAAC CTTCCCTAAG ATCTTCTGGA GATATTATAA CACGGAGTGG
AGGATTGCGA GTGAGGCCAG GTTTAGGCTG ATGAAGGGGA ATGTTGTAGA GTTCTACGTC
ATTTTTAAGA GAGATGAGCC TAAACCTTAT GAACCTAAAG CGTTTATTCC CGTCGACCTT
AACGAGAATT CGGTCTCGGT GCTCATCAAC AGTAAACCCT TATTGCTTGA GACTAACACT
AAGAAAATTA CTCTGGGCTA TGAGTATAGG AGGAAGGCAA TAACTACTGG TAAGTCAACT
AAGGATAGGG AAGTGAGGAG GAAGTTAAAG AGGCTGAGGG AGAGGAATAA GAAAGTAGAC
ATTAGGAGGA AATTAGCTAA GCTAATCGTT AAAGAGGCTT TTGAAAGTAG GAGTGCAATT
GTCTTGGAGG ACTTGCCAAG GAGAACTCCG GAGCATATGA TAAAGGACGT GAAGGATAAA
CAGCTTAGGT TGAGGATTTA TAGATCTGCA TTTTCCTCAA TGAAGAACGC TATTATTGAG
AAGGCTAGGG AGTTTGGTGT CCCCGTGGTC TTAGTTAACC CATCTTATAC TTCCACTGTT
TGCCCAATTC ATGGGGCGAA TATCGTTTAC CAACTCGATG GGGGCGATGC CCCAAGGGTT
GGTGTTTGTG AGAAGGGGAA GGAAAAGTGG CATAGGGATG TAGTTGCACT GTACAACTTA
GCGAGGAGAG CTGGAGATGT GAGCCCCGTG CCGTTGGGCT CGAAGGAGTC CCATGACCCA
CCTACCTTAA GTGGGTGGTT GAGGGCTAAG TCCCTACACT CGATCATGAA TGAACATAAA
ATGATTGAAA TGAAAGTGTA G
 
Protein sequence
MKLRVKVDYS TYSALKEVEK EYREVLEDAI NYGLSNKTTS FTRIKAGVYK TEREKHKDLP 
SHYIYTACED ASERLDSFEK LKKRGRSYTE KPSVRRVTIH LDDHLWKFNL DTISISTKRS
RILISPTFPK IFWRYYNTEW RIASEARFRL MKGNVVEFYV IFKRDEPKPY EPKAFIPVDL
NENSVSVLIN SKPLLLETNT KKITLGYEYR RKAITTGKST KDREVRRKLK RLRERNKKVD
IRRKLAKLIV KEAFESRSAI VLEDLPRRTP EHMIKDVKDK QLRLRIYRSA FSSMKNAIIE
KAREFGVPVV LVNPSYTSTV CPIHGANIVY QLDGGDAPRV GVCEKGKEKW HRDVVALYNL
ARRAGDVSPV PLGSKESHDP PTLSGWLRAK SLHSIMNEHK MIEMKV