Gene Ssol_2633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2633 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2414577 
End bp2415695 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content41% 
IMG OID 
Producttransposase, IS605 OrfB family 
Protein accessionACX92737 
Protein GI261603134 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.56671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGTTAG CGTGCAAAAT CTACAACACC TTAAGGTGGG CAGACATCTA TTTCTATCAG 
AGGGATGGGA AAGGACTAAC ACAAACTGAG TTAAGACAGT TGGCTCTAGA TCTGAGAAAA
CAAGATGATG AGTATAAGCA ACTCTACTCG CAAGTAGTTC AACAAATAGC TGACCGTTAT
TACGAAGCTA GACAGAGGTT TTTCGAAGGT CTAGCACGTT TCCCAAAAGA AAAGAAACCT
CATAAATACT ACTCCCTTGT CTATCCCCAG TATGGTTGGA AAATACTTCA GGTTAGAGAA
ATAAGAAAAG GAAGCAAGAA GAATAAGAAG AGACTAATAA CGCTTAAACT ATCAAATCTT
GGTACGTTCA AGGTAATTAT ACACAGGGAC TTTCCCCTTG ACAAAGTAAA GAGGGTAGTA
GTGAAGCTAA CAAGATCTGA GAGGATATAC ATCACTTTCG TAGTAGAAGA TTACGAATTC
CCCAAGTTAC CTAACACTGG TAAGGTAGTG GCGATAGATG TTGGCATAGA GAAGCTGATC
GTAACGTCAG ATGGTGAGTA TTTTCCTAAT TTGAGACCTT ACGAGAAAGC GTTATGGAAA
GTGAAGCATC TACACAGAGA ACTTTCAAGG AAGAAATTCC TCTCTAATAA TTGGTTTAAG
GCTAAGGTTA AGCTTGCTAG GGCTTATGAG CATTTGAAGA ATCTAAGAAC GGATCTTTAC
ATGAAGTTGG GTAAGTGGTT TGCTGAGCAT TATGATGTTG TGGTGATGGA GGACATTCAT
GTTAAGCAGT TGATAGGTAA GTCATTAAGG TCTCTGAGGA GGAGATTGAG TGATGTCGCG
TTCAGCGAGC TTAGAGATTT GATTAAGTAT CAGTTGGAGA AATACGGTAA GAAACTCATC
CTAGTTAATC CTGCATACAC TTCCAAAACT TGTGCTAAGT GCGGGTACGT AAAAGAAGAT
CTGTCTCTAT CTGATCGTGT TTTCGTTTGT TCCAACTGTG GTTGGATTGC AGATCGTGAC
TATAATGCTT CTCTTAACAT CTTACGTGGA TCGGGGTCGG AGCGACCCTT AGTGTGGAGC
TCCGCCCTCT ACCAGTACTC TGGCATGGCA GAGCTGTGA
 
Protein sequence
MKLACKIYNT LRWADIYFYQ RDGKGLTQTE LRQLALDLRK QDDEYKQLYS QVVQQIADRY 
YEARQRFFEG LARFPKEKKP HKYYSLVYPQ YGWKILQVRE IRKGSKKNKK RLITLKLSNL
GTFKVIIHRD FPLDKVKRVV VKLTRSERIY ITFVVEDYEF PKLPNTGKVV AIDVGIEKLI
VTSDGEYFPN LRPYEKALWK VKHLHRELSR KKFLSNNWFK AKVKLARAYE HLKNLRTDLY
MKLGKWFAEH YDVVVMEDIH VKQLIGKSLR SLRRRLSDVA FSELRDLIKY QLEKYGKKLI
LVNPAYTSKT CAKCGYVKED LSLSDRVFVC SNCGWIADRD YNASLNILRG SGSERPLVWS
SALYQYSGMA EL