Gene Ssol_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2454 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2259106 
End bp2260293 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content42% 
IMG OID 
Producttransposase, IS605 OrfB family 
Protein accessionACX92603 
Protein GI261603000 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.1066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACCT TAGGGTTTCG CTTCCGTGCT TACGCTGACG AACAAACCCT TAGGGCGTTA 
AAAGCCCAGT TGAAGTTAGC GTGCAAAATC TACAACACCT TAAGGTGGGC AGACATCTAC
TTTTACCAAA GGGATGGGAA AGGACTAACA CAAACTGAGT TAAGACAGTT GGCTCTAGAT
CTGAGAAAAC AAGATGATGA GTATAAGAAA CTCTACTCCC AAGTAGTTCA ACAAATAGCT
GACCGTTATT ACGAAGCTAG ACAGAGGTTT TTCGACGGTT TAGCACGTTT CCCGAAAGAA
AGGAAACCTC ACAAGTACTA CTCCCTTGTC TATACGCAAA GCGGTTGGAA AATACTTCAA
GTTAGAGAAA TAAGAAAAGG AAGCAAGAAG AAACTAATAA CGCTTAAACT ATCAAATCTT
GGTACGTTCA AGGTAATAGT TCACCGAGAC TTTCCCCTTG ACAAAGTAAA GAGAGTGATA
GTGAAGCTAA CAAGATCTGA GAGGATTTAC ATCACTTTCG TAGTTGATCA CGAATTCCCC
AAGTTACCTA ACACTGGTAA GGTAGTGGCG ATAGATGTTG GTGTAGAAAA GTTGTTAGTA
ACGTCAGATG GTGAGTATTT TCCTAATTTG AGACCTTACG AGAAAGCGTT ATGGAAAGTG
AAGCATCTAC ACAGAGAACT TTCAAGGAAG AAGTTCCTCT CTAATAATTG GTTTAAGGCT
AAGGTTAAGC TTGCTAGGGC TTATGAGTAT TTGAAGAATC TAAGAACGGA TCTTTACATG
AAGTTGGGTA AATGGTTTGC TGAGCATTAC GACGTTGTAG TCATGGAGGA CATTCATGTT
AAGCAGTTGA TAGGTAAGTC ATTAAGGTCT CTGAGGAGGA GATTGAGTGA CGTCGCGTTC
AGCGAGCTTA GAGATTTGAT TAAGTATCAG TTGGAGAAAT ACGGTAAGAA ACTCATCCTA
GTTAATCCTG CATACACTTC CAAAACTTGT GCTAAGTGCG GGTACGTAAA AGAAGATCTG
TCTCTATCTG ATCGTGTTTT CGTTTGTCCC AACTGTGGTT GGATTGCAGA TCGTGACTAT
AATGCTTCTC TTAACATCTT ACGTGGATCG GGGTCGGAGC GACCCTTAGT GTGGAGCTCC
GCCCTCTACC AGTACCAGCA CTTCGGTACT GGCATGGCAG AGCTGTGA
 
Protein sequence
MPTLGFRFRA YADEQTLRAL KAQLKLACKI YNTLRWADIY FYQRDGKGLT QTELRQLALD 
LRKQDDEYKK LYSQVVQQIA DRYYEARQRF FDGLARFPKE RKPHKYYSLV YTQSGWKILQ
VREIRKGSKK KLITLKLSNL GTFKVIVHRD FPLDKVKRVI VKLTRSERIY ITFVVDHEFP
KLPNTGKVVA IDVGVEKLLV TSDGEYFPNL RPYEKALWKV KHLHRELSRK KFLSNNWFKA
KVKLARAYEY LKNLRTDLYM KLGKWFAEHY DVVVMEDIHV KQLIGKSLRS LRRRLSDVAF
SELRDLIKYQ LEKYGKKLIL VNPAYTSKTC AKCGYVKEDL SLSDRVFVCP NCGWIADRDY
NASLNILRGS GSERPLVWSS ALYQYQHFGT GMAEL