Gene Ssol_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2040 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1829237 
End bp1830334 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content36% 
IMG OID 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionACX92248 
Protein GI261602645 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCTCAG TACTAGACTG GAAGCCTAAA ATTGGAATAT TAGGAGGAGG ACAGCTCGGC 
TGGATGATAG TATTAGAGGG TAGAAAATAC CCATTTACTT TTTACGTATT AGAGAACGAT
AAGAATGCTC CAGCTTGCAG AATTGCAGAT AGGTGTTTCT CTCCTCAAGA TTATAAGGAA
TTCGTTGATT CCTCAGACGT TATAACATTT GAGTTCGAAC ACGTGTATGA AAAGGCATTA
GAGTATGCTG AGTATAGTGG CAAGCTATTA CCTAGACTTA ACTCTGTAGA GTTGAAGAGA
GAGCGTTATA AGGAGAAGCT GTTCTATAGA CAACATAATT TACCAACTCC TAGATTCTAT
GTAGCAGAGG ATGGTGAGGA AGCATTAAAG ATATTAAGAG AGGAATTCAA TAATGTCGGA
GTTATTAAGG AATCTAAAGG AGGATATGAT GGTAAGGGGC AATATTTCAT CTTTAATGAC
GTTGAGAAAT ATCAATTTCT AAGGGAAAAG AAAGAGAAGA TGGTCGTTGA GGAGTATGTA
AAATTTGATT TTGAGGCCTC CATTATTATA GCAAGGGATA AGAGAGGTGT TTTTATTAGT
TACCCTCCAA CTTATAATTA TAATGAAAAA GGTATTTTAG TTTATAATTA TGGGCCGTAT
AATAATCAGA ATATAGTAGA GATTGCAAGA AGGTTAAGTG AGGAGTTGGA TTACGTAGGA
ATTATGGGCG TTGAGGTATT CGTAGTTAAC GGTAAAGTTT TAATTAATGA GTTTGCCCCA
AGAGTTCACA ATACTGGGCA CTATACTCTT GACGGCGCTC TAATCTCTCA ATTTGAACAA
CACCTAAGGG CAATAATCGG TATGGAGTTA GGTCCATCTA CCATCTTATC TCCTAGCGGG
ATGGTTAATA TTCTTGGTAC AGATAAAATA CCAGTTGAGG TATTAAAATA CGGTAAAGTT
TACTGGTACT CTAAGAGTGA AGTTAGGAAG AGGAGAAAAA TGGGTCATGT AAATGTAGTA
GGGAACAATC TTGAAGAAGT TAAGCAAAAA ATTGATAAAA TTATGCAACT AATCTATACT
AATGGGTTAG ATTTATGA
 
Protein sequence
MFSVLDWKPK IGILGGGQLG WMIVLEGRKY PFTFYVLEND KNAPACRIAD RCFSPQDYKE 
FVDSSDVITF EFEHVYEKAL EYAEYSGKLL PRLNSVELKR ERYKEKLFYR QHNLPTPRFY
VAEDGEEALK ILREEFNNVG VIKESKGGYD GKGQYFIFND VEKYQFLREK KEKMVVEEYV
KFDFEASIII ARDKRGVFIS YPPTYNYNEK GILVYNYGPY NNQNIVEIAR RLSEELDYVG
IMGVEVFVVN GKVLINEFAP RVHNTGHYTL DGALISQFEQ HLRAIIGMEL GPSTILSPSG
MVNILGTDKI PVEVLKYGKV YWYSKSEVRK RRKMGHVNVV GNNLEEVKQK IDKIMQLIYT
NGLDL