Gene Ssol_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2056 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1846309 
End bp1847505 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content37% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionACX92262 
Protein GI261602659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.339925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATAC TGGATTTTAT ATCTTATGCT TTAAACTCAC TAAAAGAAAG AAAGGTTAGG 
GCAATACTTA CTATCCTTGG GATAGTTGTC GGACCTGCTA CAATAATTTC TATAAACTCC
ATGGTATTGG GCTACTCACA CACAATAATT TCGCAAATAT CCAACTTTCT CTCACCCTAT
GATATTATCG TGACCCCCAC TGGAAGGGGT CTGCCATTAT CCCAATACCT CATACTCCAA
CTGGAGACTA TCCTTGGAGT AAAAATGGTT ATTCCATTCT ATTCATTTCC GGCGTTAATT
AGAACACCTA ATGGCTATGA AGGGGCAACA GTATTTGCGG TTAATATAAA CCAGCTTAAG
ATAGCAGCTC CAGCTATAAG CTTATCATCA GGTTACTTTC CAGCTGCGGA GGTTAGCTAT
GAAGCTTCAA TAGGCTATCA GTTGGGAAAT CCGCAAGGTG GATATAGTCC AATAAGACCA
AATCAAGTGA TACAAACAAT TATATTCTAT AACGGGAATA ATTTCACTAA GACATTTCTA
GTAACTGGAG TTTTGAACGA ATATGGAAGT TTTCTCGGAG TTGATATAGA TAAGTCAATA
ATAGTACCTT TATCTTTTGG TCAGTCAATC TCAAGTTCTT ATAGTGGAGC CATAATAATA
GTGAGCTCTC TAGGAGAAGT GAATGAAGTT GTAAATGAAA TAAAACAAAA GTTTGGAAAT
TCTTTAGATA TTGTAGTGGC GGAGGAATTT ATACAATTAA TAGATAATAC TTTACAATCT
CTTAACGGAT TGCTAGTATC TGCAGGAGCT ACGTCATTCA TAGTTTCGTT TATGGGAGTA
ACTACAACAA TGTTCACAAC AGTGGTGGAA AGAACTAAGG AGATAGGGAT ATTAAGAGCA
TTAGGATTTA CTAGGTTTGA TGTACTCACA ATGTTTTTAG TTGAAGCTAG TGTGATGGGG
TTCATAGGTA GTATAACAGG GCTCGCATTA GGTTCAGTAG TTGCATTAAT ATTAACACAA
GAACATTTCG GATTGGGATT TAGTTTTCTA AAGGGTCTTT CAGTATCACC GGTCTATTCT
CCTACCTTTA TGTTGTTAGT GCTAATATTT TCTACAATTC TAAGCGTCAT TGCAGCACTA
GGACCTGCTT ACAATGCATC CAAACTAGAT CCAAATAAAG CTTTAAGATA CGAGTAG
 
Protein sequence
MKILDFISYA LNSLKERKVR AILTILGIVV GPATIISINS MVLGYSHTII SQISNFLSPY 
DIIVTPTGRG LPLSQYLILQ LETILGVKMV IPFYSFPALI RTPNGYEGAT VFAVNINQLK
IAAPAISLSS GYFPAAEVSY EASIGYQLGN PQGGYSPIRP NQVIQTIIFY NGNNFTKTFL
VTGVLNEYGS FLGVDIDKSI IVPLSFGQSI SSSYSGAIII VSSLGEVNEV VNEIKQKFGN
SLDIVVAEEF IQLIDNTLQS LNGLLVSAGA TSFIVSFMGV TTTMFTTVVE RTKEIGILRA
LGFTRFDVLT MFLVEASVMG FIGSITGLAL GSVVALILTQ EHFGLGFSFL KGLSVSPVYS
PTFMLLVLIF STILSVIAAL GPAYNASKLD PNKALRYE