Gene Ssol_1553 details

Gene Information

Locus tag: Ssol_1553
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1410618
End bp: 1413614
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: conserved hypothetical protein
Protein accession: ACX91780
Protein GI: 261602177
COG category:
COG ID:
TIGRFAM ID:
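
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1410618
end_bp = 1413614

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 2997 bp
print(protein_length, "aa")   # 998 aa
```

Under these assumptions the computed values match the listed Gene Length (2997 bp) and Protein Length (998 aa).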


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGAGC TCTACAAAGT CAGGTTAAAA AGCGGTGTAG ATTATCAAAA ACTCAAAGCT 
ACGGAATTAT TTGGTAATGT ATTCGAGTTG ATTTTCATAA AGAAGAGTAG GAACGAAATG
GACTTTTATG TGAGGACTGC TGCAAAAGAG GAAATACTAA GACAATACTT CGTCCTTTTG
AAAGCTGATG AGTATCCTAC AAATCGATTT GTCGCAGTTT TGAAACTGAA GAAAGAGAGC
GATTTCTATG CAAATTTGGA ATATTCGAAT TTGTTAAACT TGATATCTTC TCTAGAAGAA
GGGGAACAGA TTAGGATATG GGTAGTTTTG GAGCCGAGAT TGAATGACCT CTTCATAAAG
AAAGCTGACA AATTAAAACT TCAAGCCCAA AAGGCAATGA TAGGAAAGAG GAAGAAAGAA
CTTCTAGCAA ATATTTTGGA GTCGTATGCA AAAGATAACT TATATCTTCT TGATATAAAG
ATCTTCAGCA ATCAAAAGAG TAGACTGAAG TTACTCTTCG ATTATGCCAA ACAGCTAATT
CATACAAAAA GCAGGAAGCT AAGAATGGAG GTAAAGAAGG CTAAGAAGTT CAAGGAAAAA
CAGCCGAAAA TAGGGGCATG GGAAGCCATT GCAAAGTATA AAGTAAGGCT ATGGATAGAC
GAAGATAAAC TAAACGAGAA CTTGCCTTTG CCCTCTCCAA CTCAGGTACC AATTCCGATG
AGTATCGGTG TTTCGCTACC GTACTTTAGG CTAGAAAGGA AAGATATCTA TTTGGGAGAC
GATATACTTT ATGCAGAGAA AGTATTTCTA GACTGGACAG ATTTCCAACG TCACGCCATA
ATTTACGGCT CAACTGGGTC AGGTAAATCA AACACGTTAG AAATCTTAGC ACAAGAATTA
GCTAAATATG GCATAGTGAT ATTCCTTGAC CCTAACAGCC AGAGTGCTAG AAAGTTATCA
CAAATAGCGA ACTACTATTT CACAATAACG AAGGACAGGC CGAACTACGG CATAAATATC
CTCCAGCTCC CCCACATTTT CCAGGATAGA GAGAAGGACA TTGATTATCA GATTTCGAAA
GTGTTACAAC TGTTCGATAA GTTGCTAAAC CTCGTTGACA CTGCAGTAAA CGTGAGGTAC
ATCTTACAGG TACTGTTAAG GCAAATGTAC AGGGTGTCGG ACAGGATAAC GTTCAGGGAC
GTATATGACG CTGTCATAGC GTTGCAACAG GGCACGCTTG ACCTAGACGT TAATGACGAA
ACGTTTGAGC ATGAGAAAGA GTTACTTCAG CAAATGCAGG CTCAAAGTTT CATGTCTATA
CTCTCTAGGC TGAAACTCCT TGTTGATAAC AACATCTTCA AAATCGTCAC GTCAGAGACT
ACTATTGACT GGGACAGGGT CATTAATGAA ACAAAGAGAG GGCTGATAAT CTTCGATGTA
GGGAAGAGCG CCGGGAACGA AGTGTCTGAG ATGATGCAGA TGATTATCAC GTTATCGCTT
TTCAACTACG TCTTCCTGAG GGACGCTTTA GGGAAGGAGA AAATACCGAT CTTCCTGGTC
ATTGATGAGG CCCAAAACGT TGCACACTTT GACTTCATAA ATGAGGTCTT GGCGGAGGCG
AGGAAGTATG GACTACATCT AGTTCTTGCT ACACAGTCCT TCGTGAGGCT GCAGGCATTA
GCGGGAGAGA ATAACGCCAG GGCGATAAAC GCTAACACAA ACGTAAAGCT GTTAATGAGG
CTAACGGAGG GTAGTGACAT ATCTCAGCTG GCAAAGTCGG TAGGGGCTAA CCAAGAGATC
GTTGAGGCTT TGCCGAAGCT GTCTATCGGA CAGGCTTTCC TCTTCCTTCT AGGAAAAACT
GGGGAGTTTA CTGTACCTAA ACTGGTACAA ATTAGACCTT CTGAGCTACA AGATAAAGAG
AAAGAACCTA CAAAGGGCTT CGAGCCTAAG GGGGTCAGTA AAGGACTAAC TAAGGAAACG
ATTAACCCGG CGTTGGCACT ACTCAAGGAG CCTCCAGACG TTTTAGGACA GCTTATACTA
TATACAGCCT TCGAGAAAGG CGAGTACGGT ATAACAATTA CAGACCTGAT AGCCCAATTA
GGGATAAAGA GGGAAGTAGC ATTAGCTAAA CTGGCGGAGC TGGAGAAACT CGGCGCTGTG
CAGATTGAAC AGAGGGGTAG GAGTAAGATA GTGAAATACG CGAAGGGGTT GTTTAGACTC
AGGGGTATTG TAGAGAATGA GGAGGGAAAG AAAGTAGCAT TAAGAGTCTT GAGGAAGTAC
CTGAAGGATG GGTACATTGT TGTTCCGGGG AGACAGGAGG GTGACATCAG ACCAGACTTT
ATAGCACTGA CTTACGACAA GACCACTCTG AGGCCTAATT ATTCCAACAT CGTGATAATT
GAAATAGAGT CTCCGAATGA GGTCGCTGTG CACGCTGAAC AAGTGAGAAA GAATATGCAG
AAATACCTAT CGTTAGATGA GAGGACTAAG AGCATTATCA AGGAAATTCA CATCTGGACC
TCTGAGGAAA AATTCGATAA ATTGAAAGAA ATCTACGACA ACTTCATTAA CGATAATTCT
ATTCCTCAAG AGTATAAAAC AAAAGTCAAG ATATTCCCAG TAGAAATTAA ACAAAAAGTA
AAACAAGGAG CTCTGAAAGA GAAGAAAACT AAGGCTGAAA CTGGGGAGTT TAACGGCAAA
AGAGAAGAGA AGGCTGAAAG TATAGCTCGA CAAGCAGCCC AGGGAGCTCC AAACAATGCT
AACAGTAAAC TGGGGAGTTT ACTCAAGATA GGGCATTTAG AGTTCCAAGT GTTGGACGAG
GTAAACGACA AGGTAATAGT AAAAACTGGA GATAAAGACT ACAAGATAAG CAAGAAGGAC
CTAATAGATT TAGAGGGGCT CAAGGACCTA ATAGTAGAAG CAAAAATTGA AAACGGTTAC
CTTAAGGTCA AGACAAGTTT AGGCCTCATT CAGAAGATTT CTTTGGAGCC CTTATGA
 
Protein sequence
MMELYKVRLK SGVDYQKLKA TELFGNVFEL IFIKKSRNEM DFYVRTAAKE EILRQYFVLL 
KADEYPTNRF VAVLKLKKES DFYANLEYSN LLNLISSLEE GEQIRIWVVL EPRLNDLFIK
KADKLKLQAQ KAMIGKRKKE LLANILESYA KDNLYLLDIK IFSNQKSRLK LLFDYAKQLI
HTKSRKLRME VKKAKKFKEK QPKIGAWEAI AKYKVRLWID EDKLNENLPL PSPTQVPIPM
SIGVSLPYFR LERKDIYLGD DILYAEKVFL DWTDFQRHAI IYGSTGSGKS NTLEILAQEL
AKYGIVIFLD PNSQSARKLS QIANYYFTIT KDRPNYGINI LQLPHIFQDR EKDIDYQISK
VLQLFDKLLN LVDTAVNVRY ILQVLLRQMY RVSDRITFRD VYDAVIALQQ GTLDLDVNDE
TFEHEKELLQ QMQAQSFMSI LSRLKLLVDN NIFKIVTSET TIDWDRVINE TKRGLIIFDV
GKSAGNEVSE MMQMIITLSL FNYVFLRDAL GKEKIPIFLV IDEAQNVAHF DFINEVLAEA
RKYGLHLVLA TQSFVRLQAL AGENNARAIN ANTNVKLLMR LTEGSDISQL AKSVGANQEI
VEALPKLSIG QAFLFLLGKT GEFTVPKLVQ IRPSELQDKE KEPTKGFEPK GVSKGLTKET
INPALALLKE PPDVLGQLIL YTAFEKGEYG ITITDLIAQL GIKREVALAK LAELEKLGAV
QIEQRGRSKI VKYAKGLFRL RGIVENEEGK KVALRVLRKY LKDGYIVVPG RQEGDIRPDF
IALTYDKTTL RPNYSNIVII EIESPNEVAV HAEQVRKNMQ KYLSLDERTK SIIKEIHIWT
SEEKFDKLKE IYDNFINDNS IPQEYKTKVK IFPVEIKQKV KQGALKEKKT KAETGEFNGK
REEKAESIAR QAAQGAPNNA NSKLGSLLKI GHLEFQVLDE VNDKVIVKTG DKDYKISKKD
LIDLEGLKDL IVEAKIENGY LKVKTSLGLI QKISLEPL
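
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the method used by the database, of how the blocks might be joined and the gene translated. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence, copied from above.
first_line = "ATGATGGAGC TCTACAAAGT CAGGTTAAAA AGCGGTGTAG ATTATCAAAA ACTCAAAGCT"
last_line = "CTTAAGGTCA AGACAAGTTT AGGCCTCATT CAGAAGATTT CTTTGGAGCC CTTATGA"

print(translate(first_line))  # MMELYKVRLKSGVDYQKLKA
print(translate(last_line))   # LKVKTSLGLIQKISLEPL
```

Each full display line holds 60 bases (20 codons), so every line starts in frame. The two translations agree with the start and end of the listed protein sequence. Passing the whole gene sequence to `translate` and `gc_fraction` would allow a fuller comparison with the Protein Length and GC content fields.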