Gene Ssol_1526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1526 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1397359 
End bp1398867 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content41% 
IMG OID 
Productconserved hypothetical protein 
Protein accessionACX91753 
Protein GI261602150 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAA GAAAAGCTGC ATATATATTA CTCTTAATAA TCGCATTACC GAGCTTAGCA 
TTACCAGTCA CAGCTGCAGC CAATCCAGTA GCAACGTTCA TTAATGATCT AGAGATATTA
ATCCCAGCAG TATTGTTCAT TCTGTCATTG ATAGCACTAA GGAGCGGGGA TTATGAATAC
TCGTTCATGC TGTTATTAGC AGCCACAATA GTAACTATAG CGCTGGCATC AGTAACCGGG
GGGAATTTAG GGACTAACGG GGTTTCATTA ACACTAGTTC AGCTACAGGT GACTGTTAAC
GGTCCCACGT CAGCTTATAC AGGCAATACA GAAACATACA CAGTCTCGTG GAGCCCTTCC
ATGTCTGGAA CTGTAATATG GACTGTTCTA TATAATGGAA GTATAGTGTA CAACGCTACT
GGAGGTACTT CTTTTACCTA TACTTTTAAG TATCCAGGTA AATACATAGT TGCAGCAACT
GTAATTAATC AACAGAATTT TGCTGGAGGT TCTGGAGCAG TTCTGGTTAC AGTTACAAAT
CCTCCTTCAC CTCTTGGGTG GATAGAAGGA GCAATCACAG GCGCAGTTTC TGGATTAATT
AACTCTGTCG CTAATGCGTT TACAGGATTC CTAACAACAT TACTACAGAT TTTTGGAGCA
CCTCTGGAAT GGATGACTTA TTCACCAACT CCTTACGCTT CTACTTCTAC TCCTAATGCT
TCCCCAATAG TACCAACAAT TTACAACCAA ATGAAAGACT TTAGCGTTGG GCTCGCAATG
CTTTTCATAG CTTTCTCAAT CGCCTATAAC GCTATAAGGG GAGAATATGC CGACCTCGTT
GACCTTGCTG GAGACGTAAT GTATAAATTA TCTGTTTGGG GGTTGTTCTT CGCGGGCGGC
CTAACAATTT ATACTTACGC TGCGAATTTC ATTAATTCTA TAATATATTC TGTTGCAGGA
CCTTACTTAG GGATCGCGAC ACTTGAATAT ACAGGGGGAG CTACATTGTT TACTGCATTA
TTTGCCTTAA TGAATGGTAT CCCGTTTGGG TTTGGTGATG CGTTATCAAT GTTCTTGTCC
CTTGTTATGT TCTTATTAGC TATTACTTTA GCAATAGCAA CGATTAAGTA TGCTGTAATG
CTAGCGATAG TAGACACAAT TCCCTTATGG GCTACTCTAT GGATATTCGA ATGGACTAGA
AAAATTGCTA TGGTGGTCAT AGACTTATTG ATAGGACTTA TGGTTGTTGG GCTGATAGCT
GCAGTAACAT TCGCTATATT GGCAACACTG CCATTGGGAG CGTTAATGTT TGCTATCGAC
CCTATAGCTA TGGATGGGGA ATTTTTGTTC AGTCTGGCTT TCTTCGTCTT CGGACTAAGA
CCAGGAGAAC ATATGATGGG AGCATTCAGA AAGAAAAACG AAGGAGGATC CGGAAATACC
GTAGTAGTAG TAGAAAATAA TAGTGGCGGA TCTACGTCCT CAGAACCACC AGCTGGAAGA
TATATGTAA
 
Protein sequence
MAKRKAAYIL LLIIALPSLA LPVTAAANPV ATFINDLEIL IPAVLFILSL IALRSGDYEY 
SFMLLLAATI VTIALASVTG GNLGTNGVSL TLVQLQVTVN GPTSAYTGNT ETYTVSWSPS
MSGTVIWTVL YNGSIVYNAT GGTSFTYTFK YPGKYIVAAT VINQQNFAGG SGAVLVTVTN
PPSPLGWIEG AITGAVSGLI NSVANAFTGF LTTLLQIFGA PLEWMTYSPT PYASTSTPNA
SPIVPTIYNQ MKDFSVGLAM LFIAFSIAYN AIRGEYADLV DLAGDVMYKL SVWGLFFAGG
LTIYTYAANF INSIIYSVAG PYLGIATLEY TGGATLFTAL FALMNGIPFG FGDALSMFLS
LVMFLLAITL AIATIKYAVM LAIVDTIPLW ATLWIFEWTR KIAMVVIDLL IGLMVVGLIA
AVTFAILATL PLGALMFAID PIAMDGEFLF SLAFFVFGLR PGEHMMGAFR KKNEGGSGNT
VVVVENNSGG STSSEPPAGR YM