Gene Ssol_2704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2704 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2480344 
End bp2482194 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content37% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX92796 
Protein GI261603193 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGCG TCAGTAACTC CGATAAGTTG TTAAGGAAAG AGTTGGGCCT CTTGGATCTA 
ACCTTTTTAT CTTTAAGTGC AATGATAGGT TCTGGTTGGT TACTAGCTTC CTTAAGCGTG
GCATCCATAG CGGGGCCATC TGGAATTTTA TCTTGGATAA TAGCTGGGAT AATGGTAATG
TTTATAGGGT TAGCTTATGC CGAACTTGGA AGTTCAATAC CTAAAACTGG TGGTATAGTT
AGATATCCAG TTTATTCCCA CGGAAGTTAC ACTGGATTTG TCATAGCATT TCTATATCTT
CTCTCGGCAA TCTCTACACC TTCTATAGAA GCTTTAGCAA CAATCGAATA CTTAACTAAT
GTAAACCCTA CTTTGAGTAA ATTGCTTACT AACTCAACAG TTGTTAACGG CACACCAGTT
ACTATATTGA CGCTTCCCGG ATTACTGTTT TCCATTCTTT TACTATTTAT CTATTTCGTA
ATAAATTATT ATGGCATAAA GATCTTAGGG AAAACGAACT CGGCTATAAC AGTTTGGAAG
CTGATCATAC CGGTTGTTAC ATTTATATTA CTATTTTTTG CATTTAAATC AAATAACTTT
ACAAATTATG GAGGAATATT CCCACAAAGT GTTGCATCTA ATTACGTAGG TCCAGTTGGT
ATGCTGTATG CGATACCCTC GGCTGGGATA ATTTACTCAT ATTTAGGGTT TAGACAACCA
ATAGAGTACT CTGGAGAGGC TAAAAATCCA GAAAAGAACG TATGGAGAGC CATAATTCTC
TCAATAGTAA TTGCCATACT AATTTACACT TTCCTACAAA TAGCCTTTAT AGGTGCAATA
AATTGGGAAT CTGCTAGAAT AACCCCCGGC AATTGGTCAG CATTGCTAAA TAGTAAGTGG
GCTAACAGTC CATTCTACAG TGAGCTGGAA GCTGAGGGCA TAGCAGTGAC TGGAGCCTGG
GGATACGTAC TATTAATAGA TGCAATTCTA TCTCCCACTG GAACTGGATT AGTCTATACT
GGGACATCAG CTAGAACGCT TTATGGCCTA TCAGCAGAAG AATATTTTCC ATCAATATTT
AAGAAATTAA ATAGCTATAG GATTCCTATA TGGTCCTTGG TTGGATCCTT AATAGCAAGT
ATAATATTCC TACTTCCGTT TCCAAGTTGG TATTTGTTAG TTGGATTCAA CTCCTCAGCT
ACCGTACTTA CGTATATTAT GGGTGGAATA GGTTTACAAA CGTTGAGGAA GACTGCTCCC
GATTTAAAGA GAGGAATTAA GATACCATGG GCAAGTCTAG TAGCTCCAGT AGCTACAATA
GTTTCATTGT TAATAGTGTA TTGGGCAGGG TTTACAACAT TGTTTTATAT CGTATCAATT
TTATTTATTG GAATTCCAAT CTTCTGGGTA AATTATGCTG TAAAGGTTCT AGGACTAAAG
AAGTGGGTTG GTGTTAGCTT GGGAATAACA CAGCTAATTG TAAATATCGC ATTAACCATT
TTTGGTTATT ATAACCTAGT TTTGAATAAC GATAGTCCTA TGTTATACTT CGTTTTATTC
ATATTGATGG AATTGGTGCC CTTGGTAGTA GCTTATCTAA CTATAAACGA TAATGGGAAA
GTACACTTGA GGAGTGGATT TTGGCTTATG GGCTTAATTT TTACAATTTA CATTATTAGT
TATCTCAGTG AGTTTGGACC ATTGGGTAGC TCTGCTCCTA TAGAATTTCC TTATGATATT
GCAATAATTT CGATAGTAGG TCTAATATTC CATTATCTAG CAGTTTTAAG CGGTTTCAGA
ACCAAGGAAA TAGAGGAAAT TATTGAGGAT CAGAAGGAGA GAAGAGATTA A
 
Protein sequence
MTSVSNSDKL LRKELGLLDL TFLSLSAMIG SGWLLASLSV ASIAGPSGIL SWIIAGIMVM 
FIGLAYAELG SSIPKTGGIV RYPVYSHGSY TGFVIAFLYL LSAISTPSIE ALATIEYLTN
VNPTLSKLLT NSTVVNGTPV TILTLPGLLF SILLLFIYFV INYYGIKILG KTNSAITVWK
LIIPVVTFIL LFFAFKSNNF TNYGGIFPQS VASNYVGPVG MLYAIPSAGI IYSYLGFRQP
IEYSGEAKNP EKNVWRAIIL SIVIAILIYT FLQIAFIGAI NWESARITPG NWSALLNSKW
ANSPFYSELE AEGIAVTGAW GYVLLIDAIL SPTGTGLVYT GTSARTLYGL SAEEYFPSIF
KKLNSYRIPI WSLVGSLIAS IIFLLPFPSW YLLVGFNSSA TVLTYIMGGI GLQTLRKTAP
DLKRGIKIPW ASLVAPVATI VSLLIVYWAG FTTLFYIVSI LFIGIPIFWV NYAVKVLGLK
KWVGVSLGIT QLIVNIALTI FGYYNLVLNN DSPMLYFVLF ILMELVPLVV AYLTINDNGK
VHLRSGFWLM GLIFTIYIIS YLSEFGPLGS SAPIEFPYDI AIISIVGLIF HYLAVLSGFR
TKEIEEIIED QKERRD