Gene Ssol_1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1071 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1002919 
End bp1004118 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content32% 
IMG OID 
Productmetal dependent phosphohydrolase 
Protein accessionACX91314 
Protein GI261601711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0173536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAG TTTATGATGA GATCCATGCG TATATTGAAC TTGACGATAG AGAGGCCAAG 
ATAATTGATA TGCCAGAATT TCAGCGCCTA CGAAGAATAA AACAAACAAG TTTAGCATAT
CTGGTATACC CTGGGGCTAC TCATACCAGG TTCAGTCACT CTTTAGGGAC ATTTTATCTT
ACTACAATTT TAGGTGAGAA ATTTAGACAG CTAGGAATAA TAACTGACGA AGAGTCAACT
TACCTAAAAT TATCTGCACT GCTCCATGAT ATAGGTCAAT TTCCTTTTAG TCATAGCCTA
GAGCCTTTAT ATTTAGAAAA GGGATTATCA AATAAGGATT TAAGGTATAT GATAATTTCC
AAATCGCCTA ATTTTAGGGA ATTTTTTGAT AATGAATCAA TTGACTATAG TAAGATTATA
GAAATTTTGA ACGGAAACTC AATGATGTCA TCTATAGTAA ATAGTGACGT AGATGTTGAT
AGGATGGACT ATCTGGTAAG GGACTCTAGA CATACTGGAG TGCAACTAGG CAATATTGAT
TTATATAGAT TATTGGATAC CATCTTCTAT GGAAATAATA ACGAAATTGT TGTTCAAGAT
AAAGGTATAT ATAGTTTAGA GAACTTTTTC ATATCCAGGC TTCACATGTA TCAAGCTGTA
TATTATCATA AGACCATAAT AGGTTATGAA CTGATGCTGA GAGAAATTTT CAGAACTATT
TACGATTGCT GTGATTCGTC AATCTTAAGC GTAGAAAATA TAAGAGGTCT TGTCTATGAT
TCCTCAATAT CCTATTGGGA TGATGAATGG GTTTTCATGA TTCTTTACAC ATATCTCTAT
TCCTCTAACT CTCCCCTTTA TTTAAAGCAG AAAATAAGAA ATTTCTTGGA TAGAAGAGGT
CCTAAAGTGG TTTATGAAGA GATTTCCTAC GATAACGAGA TGAAAGGAGG AGATATTAAA
ATTAAGGAGA TAGTAGATCG TTTAGAGAGA AATCAGATTC CGAGGAGTTC AATATATCCC
ATTGAGGAAA AAATAAAAAT ACTGAATAAG GATAAAATAA ATATAATTTC AAAGAATAAT
GAGATGAATA TAATCCGGTA TAAGTCCACT TTAATTAACC ATATACCAGA GACTTTAACT
ATAAGAAGAA TTTATGTAGA TCATGAATAC GCTAAAAAAG CTAGAGATGT AGTTCCATGA
 
Protein sequence
MKKVYDEIHA YIELDDREAK IIDMPEFQRL RRIKQTSLAY LVYPGATHTR FSHSLGTFYL 
TTILGEKFRQ LGIITDEEST YLKLSALLHD IGQFPFSHSL EPLYLEKGLS NKDLRYMIIS
KSPNFREFFD NESIDYSKII EILNGNSMMS SIVNSDVDVD RMDYLVRDSR HTGVQLGNID
LYRLLDTIFY GNNNEIVVQD KGIYSLENFF ISRLHMYQAV YYHKTIIGYE LMLREIFRTI
YDCCDSSILS VENIRGLVYD SSISYWDDEW VFMILYTYLY SSNSPLYLKQ KIRNFLDRRG
PKVVYEEISY DNEMKGGDIK IKEIVDRLER NQIPRSSIYP IEEKIKILNK DKINIISKNN
EMNIIRYKST LINHIPETLT IRRIYVDHEY AKKARDVVP