Gene Ssol_2510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2510 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2308451 
End bp2309857 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content34% 
IMG OID 
Productpolysaccharide biosynthesis protein 
Protein accessionACX92642 
Protein GI261603039 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCCA TTAAGAATGC ATTGAAGAAC CTAAGCGTCA CAACAACTAA TGTAATAATT 
GCTTTAATTT TCTTCGTTAT TACAGCTAAG ATTTCTAGTC CAGCATTCTT CGGAAAAGTT
GTGATAGTTC AATTACTTGA GATAGTAACG TCGACTTTCT TTTACTTTAT TCCAGGCCAA
ATAATAACTA GGGAAATATC TTATCTTTAC GCTAAAAAAG AGATAACTAA AGAGGTAGTA
GGAAAATTCC TTTCATTTCC TTTTCTAGTT TTACCGATTT TCCTCATCCT TCTTATTTTT
CCAGATTACG TTAAGTTAGC AATACCTTAT CTTTTCCTTT ACCTCCTTAA TGGCGTAATG
ACAGCAGTGA TGATAGGAAT GGATATGTTT ACAGAATCTG CAATTACTGG AAACTTCTTT
CTAGTCATAA GATGGGGGAT AGCAATAATC GCTGTTCTCT ACCACAATAT ATATCTCTTC
GTTAAAATTT GGACTTTGGG AGGAATTCTC TCAGTATCTA TGAATTACGC ATTTATTAGC
AAAAAGGTTG GGTTAGTACT TCCTACGCCA GACTTTGCCT TTCTCTTTAG GCATTTTAGG
GAAGGTTTAC CTGTTTATTT ATCTTCTTTT GCTGGTTTTC TTTCCTCCCA AGGGGATAGG
GTAACTACTG CGTATTTGCT AGGTTCTTAT TATCTGGGCA TTTATCAGTT TTCAGCTTTA
GTTGCTGGTG TTCCCTCAAT GATTTTAGGT GCCTTAGGTG GGGTTTTGTT ACCTACCGCG
TCATTTTATA AGGCTTTAGG GAAGGATGAA AAGAAGATGT CGTCTCTTTC TTTTATATTC
CTCTCGCTTT TAACTTTTCT AACAGTAATA ATTTCTATTC CGATAGGTGA GATCATAATT
ATTCATTTCT TTCCTAATTA TAAAGAGGGA TTAGAAGTAT TCGTGTTACT CTTGATTTCG
GCTACTCTTC CGTTTCCTAT AGGTTCTCTT ACGAATTTTA TTGTAGCGTT CAAGAGAAAC
TTAAGACCTT TCCTTATCCT TTCAATTTTA AACGGAAGTT TAGTCTTACT TACTTCCTAT
TTATTAATTC CGAGGATAGG AATAATGGGT GGTGCTATAT CTCAAGTTAT AGTAGCTACA
ATTTCTTCTC TCTTTATCAT ATTTTACTCT ATAAGAACAT CGGTATTTTC AACTGGAAGG
AAAGAAATAA TTTTACTTTT CCTCATACCC GTAGTAGGAA TTTATGAGGC TATAGATCCT
CCGTTTCTAG ATTTTCTCCT AATTCTTCTT ATACTTTTAG TGTTTAAACT GTTTAAAATA
ATCACTGAAG AGGACGTTAA AATAATTGAA GGTTTCTTGC CACATGGGTT AAAATTCGTG
TCGAAAATTT TAAGTAAACT AACGTAA
 
Protein sequence
MNPIKNALKN LSVTTTNVII ALIFFVITAK ISSPAFFGKV VIVQLLEIVT STFFYFIPGQ 
IITREISYLY AKKEITKEVV GKFLSFPFLV LPIFLILLIF PDYVKLAIPY LFLYLLNGVM
TAVMIGMDMF TESAITGNFF LVIRWGIAII AVLYHNIYLF VKIWTLGGIL SVSMNYAFIS
KKVGLVLPTP DFAFLFRHFR EGLPVYLSSF AGFLSSQGDR VTTAYLLGSY YLGIYQFSAL
VAGVPSMILG ALGGVLLPTA SFYKALGKDE KKMSSLSFIF LSLLTFLTVI ISIPIGEIII
IHFFPNYKEG LEVFVLLLIS ATLPFPIGSL TNFIVAFKRN LRPFLILSIL NGSLVLLTSY
LLIPRIGIMG GAISQVIVAT ISSLFIIFYS IRTSVFSTGR KEIILLFLIP VVGIYEAIDP
PFLDFLLILL ILLVFKLFKI ITEEDVKIIE GFLPHGLKFV SKILSKLT