Gene Ssol_0489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0489 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp436038 
End bp437984 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content35% 
IMG OID 
Productprotein of unknown function DUF608 
Protein accessionACX90768 
Protein GI261601165 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.234687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAGA TAGAGAAGAA TTTTGGTATA CCTTTAGGAG GAATAGGTAC TGGAAAGATA 
AACTTCTACC CAGATCTCAC AATTGGCGAG ATAACTATTC TAAACAACTG GTCAAATCCA
CTTAGAAAAG TAAGGGGTTT CCACCTATTA ACGTTTCTAG ATAAGAAACC TATATTTCTT
CAAACTAATC CTGGCAAGAA TGTTGAAACA CCTCCTCAAT ATACTTACGT AAAAGACATT
GAGACATTTG TTGAGTTCCC TGTTATCAGA TATTCGACAC CTCTAGCTGA AATAGAAGTG
TATTCAATCC TACGGAAAAA TGACGTTAAG AATTCATCCC TCCCAGTAAT AAAGTTTAGA
ATAAAAGGTA AAGGAAGATT TGCAATTTCT TTTCCGAATA TAGTTGGAAG TAAGAGAGCT
GGTAGAGCAA ATGAGAGCTA TAAGGGAAAG TTAAATGGTG TAGTGATGAA AAACGAGAAG
GCCTTAAATT CTGATCCAGC TTATGGAGAA ATTTTCCTTG GATGTATTGG CTGTAAGGTT
ATAACCAATT ACGCTTACTA TAAACCAGCT AAGGTCGGAA TGACTGAAGA TATCACATAT
TTTTATAATT TAGAAGAATA TGATGAAACA TATGTAATAA AACCATATGC CAGAGAAGAA
ATAGGGGGTA TTGTATATAA AGATGTGGAT GAAGAAGAAA CGTTTATCCT TTCATGGTAT
TTTAACGGAA GACCTTATCA CTATCCATAT GGACATTACT ACGAAAATTT CTTTAAAGAT
TCTATAGATG TTGCAGAATA CGCTCTCAAA TTAGAACCAG ATTTAGGAAT AGAAGAGAAA
GTGGAGTGGT TAAAAGATGC GTTAGTAAAT AGCTTGTACA TACTAACATC AAATACTTGG
TTAACTAAAG ATGGAAGATT CGCTGTATAT GAAGATCCAT TCGTTACTAA ATTAATGAAT
ACGATTGGTT CTATGACCTT TGATGGTCTC GGATTTACTC TCTTAGAACT TTACAGAGAT
TTAGTAATAT CAGCTGATAA TTACTTTATG AACTATGTAA ATTATGGGGA AACGCCACAT
GATGTTGGAG AGGAAAGTAT TGAAGATCCA ATTTACGGTG CGTCATATCC ATATTGGTGG
ACAGATTTGG GACCTACTTT AGTCCTAATG TTATACAGAG ACTACGTGTT TACCTCTAAT
AGGGAAATTT TGGAGAAGAA CTATAATAAA ATAAAGGAGA TAATTGATTG GCTCATAAGA
AAAGATATGG ATAACGATTG TATACCGGAC TCAAAGGGTG GTTACGATAA CTCCTATGAC
GGAACTCACA TGTACGGAGC CTCATCCTAT ATAGCATCGA TGTTCCTTTC AGCATTGACC
GCATTTATCA AAATGTCTGA AATCTTGGAT GTTAAGATTG ATGATAAGTA TTACAGATTT
TTAGAGTGTG GAAAGAAGAC ATTTAATTCA TTATGGAACG GCAAATATTT CATTTTATGG
AAGAAAAATG ACGAGGAAAA TACGTCTTGT CTAAACTCAC AATTGTTAGG CCAATTCTGG
TGTGATATTT TAGGATTACC ACCTATAACT GATCATGATA AAATAAATAC GGCCTTAAGA
AGTATTTATG AACTAAACTT CAAAGCTTCT AAATACTGTT TAACTAATGC AGTAAGGGAA
GATGGCAGTG TGGATTCTTC AACAGCCCAA CTTAGGTCAT GTTGGCCTAG AGTTTCCTTT
GCAGTAGCTG CACATATGAT ATTAAGGGGA ATGGTTAAGG AGGGAATTGA AGTAGCTAAA
AGGGAATGGG AGACGATAAA GGAGCTTAAC CCATTCGATC AATCCTCTAG AATAGATGCA
ATAGATGGAA AGTACGTGGG GCTAATGTCG TACATAGGAA GCACTTCGGT TTGGCTATTG
AAATTAGCGT TAGACAAAAT ACGTTAA
 
Protein sequence
MMKIEKNFGI PLGGIGTGKI NFYPDLTIGE ITILNNWSNP LRKVRGFHLL TFLDKKPIFL 
QTNPGKNVET PPQYTYVKDI ETFVEFPVIR YSTPLAEIEV YSILRKNDVK NSSLPVIKFR
IKGKGRFAIS FPNIVGSKRA GRANESYKGK LNGVVMKNEK ALNSDPAYGE IFLGCIGCKV
ITNYAYYKPA KVGMTEDITY FYNLEEYDET YVIKPYAREE IGGIVYKDVD EEETFILSWY
FNGRPYHYPY GHYYENFFKD SIDVAEYALK LEPDLGIEEK VEWLKDALVN SLYILTSNTW
LTKDGRFAVY EDPFVTKLMN TIGSMTFDGL GFTLLELYRD LVISADNYFM NYVNYGETPH
DVGEESIEDP IYGASYPYWW TDLGPTLVLM LYRDYVFTSN REILEKNYNK IKEIIDWLIR
KDMDNDCIPD SKGGYDNSYD GTHMYGASSY IASMFLSALT AFIKMSEILD VKIDDKYYRF
LECGKKTFNS LWNGKYFILW KKNDEENTSC LNSQLLGQFW CDILGLPPIT DHDKINTALR
SIYELNFKAS KYCLTNAVRE DGSVDSSTAQ LRSCWPRVSF AVAAHMILRG MVKEGIEVAK
REWETIKELN PFDQSSRIDA IDGKYVGLMS YIGSTSVWLL KLALDKIR