Gene Ssol_1701 details


Gene Information

Locus tag: Ssol_1701
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1525781
End bp: 1527115
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 33%
IMG OID:
Product: argininosuccinate lyase
Protein accession: ACX91918
Protein GI: 261602315
COG category:
COG ID:
TIGRFAM ID:
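
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The helper names span_length and protein_length are hypothetical and are not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1525781
END_BP = 1527115
GENE_LENGTH_BP = 1335
PROTEIN_LENGTH_AA = 444


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1527115 - 1525781 + 1 gives 1335 bp, and 1335 / 3 - 1 gives 444 aa, matching the record.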


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.273814
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTATATA GAAAATGGGG ATCTGAAAAA GACGAAGTAG TTAACTATAC CTCATCCGTG 
GATAGCGATA GAGAGATTAT TGAAGAAGTG AAATTAACTA TGAAGGCACA CGTAATTAGT
CTTTATTTGA CTGGATACCT TGGGAAGGAA ACCGCTAGAA AGATCCTTGT TGCATTAAAC
GAGTTTAAAG AAATTGGCCA AGGATATGAG GATATTCATG AAGCATTAGA GGATTTCTTA
ATAAAGAAAG TAGGGGAGGA TGCCGGATGG ATAGGGTTAG GTAGGAGTAG AAATGATCAC
GTTGCCACAG CTTTAAGACT GAGATTAAGA AATAAGCTAA TAGAACTTTT AACTGACATT
AATAGCTTGA GAAAAATACT ATTGGATAAA GCAAAAGAGC ATATAACAAC AATATTTCCC
TCATATACAC ATTTACAATT AGCACAACCC ACAACATTTG CCCACTACCT AACTTACATT
GAAGAGGAGT TAGCTTCGAG GTGGGAAATA ATATTTTCCA CGTTAAAACA AGTTAATAAA
TCGCCATTAG GCTCTGGAGC AATAGTAGGG ACAAACGTTA AGATTGATAG AGAAAAAGAA
GCCGAACTAT TAGGATTTGA CAGTATAATA TACAATACAT TATCAGCTAC TTCATCGAGA
GCGGATATTC TCAGCACGAT CTCGGAATTA ACTGTATTAA TGGTAGTACT AAGCAGAATA
GCTGAGGATT TAATTTTCTT TTCGTCAAAT AAATTAATTA AATTACCAGA CTCTCATGTT
AGCACTAGCA GTTTGATGCC CCAAAAGAGA AATCCAGTTA CAATGGAGAT ACTACGAGCA
AAGGCAGCGG AGTCTATAGG TATGCTAACT AGTTTGCTTT CCATTTACAA AGGTTTACCT
ACTGGATACA ATCTTGATTT ACAAGAGATG AATAAATATT ATTGGCTTGT AATAAACTAT
ACTAAATCTT CAATAGGAGT TTTAAGCTCA CTCTTTAGTC AAATACAAGT AAATAAAATA
AACATTGATG AATCTAGTTT GGCCACTGAT GACGCTGAAT TACTTTCCAT AAGTAAGAAA
GTACCTTATA GATCGACCTA CTTTGAGATA GCTAAAAAAG TTAGGGAAGG TTCTTATAAG
TCAACTTTAA AAATAGAGGA TTCTATTAAT ATGAAGGCAG TAATCGGGTC TCCTAATTTT
GATTTAATGG CTAATTTGAT AAAAATTAGA GAAACTAAAT TGAAAGAAGA TGAAAAGGAA
ATCGAGGAGT ATAAGTTAAA AATAATCTCC AAATTAGGAG AATTACAAGT GATCGAAAAT
GAAATTGGAG AATAA
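
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The sketch below shows one way to do that using only the first line of the sequence as sample input. The helper names clean_sequence and gc_fraction are hypothetical, and the GC value printed for one line will not necessarily match the 33% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full block to check the whole gene.
first_line = "ATGTTATATA GAAAATGGGG ATCTGAAAAA GACGAAGTAG TTAACTATAC CTCATCCGTG"

seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 3))
```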
 
Protein sequence
MLYRKWGSEK DEVVNYTSSV DSDREIIEEV KLTMKAHVIS LYLTGYLGKE TARKILVALN 
EFKEIGQGYE DIHEALEDFL IKKVGEDAGW IGLGRSRNDH VATALRLRLR NKLIELLTDI
NSLRKILLDK AKEHITTIFP SYTHLQLAQP TTFAHYLTYI EEELASRWEI IFSTLKQVNK
SPLGSGAIVG TNVKIDREKE AELLGFDSII YNTLSATSSR ADILSTISEL TVLMVVLSRI
AEDLIFFSSN KLIKLPDSHV STSSLMPQKR NPVTMEILRA KAAESIGMLT SLLSIYKGLP
TGYNLDLQEM NKYYWLVINY TKSSIGVLSS LFSQIQVNKI NIDESSLATD DAELLSISKK
VPYRSTYFEI AKKVREGSYK STLKIEDSIN MKAVIGSPNF DLMANLIKIR ETKLKEDEKE
IEEYKLKIIS KLGELQVIEN EIGE