Gene Ssol_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1696 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1521371 
End bp1522810 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content38% 
IMG OID 
Productphosphoribosylamine/glycine ligase 
Protein accessionACX91913 
Protein GI261602310 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAT TACTCGTTGG AGATGGAGCT AGAGAAAACG TTCTAGCCTA TTCGTTGGCA 
AGATCATCTA AAGGTTACAA GATTTACGCA CTATCGTCAT ATATAAATCC CGGGATTAAT
TCAATAGTGA AAACCACTGG TGCAGAGTAT TTTATAGGTA ACGTAAACTC CCCAGAAGTT
ATTAAGGAGG TAATTAAGAA AGTAAACCCA GATTTAGGCG TAATTGGACC AGAAGATCCC
TTATTCAACG GAATTGCGGA CATTTTTAGA AAAGAGGGAA TATCGGTATT CGGGGCTAGC
AAAAAGTGTG CAAGGATAGA GGAGTCTAAG GCATGGGCAA GAGAGTTAAT GTGGAAACAT
TCTATTCCAG GAAGATTAAG ATATAAGGTA TTTTACACAA TAGAAGATAC TGCAAAGTTC
ATATTAGAAT ATGGCGGATC AGTCGCAATA AAACCTGCTG GGCAAGCTGG AGGAAAAGGG
GTTAAGGTAA TAGCTGATCT AGAGGCTTAT TTAACCCATG ATAAGAGAGA GGCACTGACA
AAAAGCGTGA ATGAAATAGG GAGTCTATAC AATAAGGAAG GTGAGCCGAG AATTATAATA
GAGGAGAAAG TTGATGGACC AGAATACACA CTTCATGTTT TAAGTGATGG GAAAACAACT
ATCTCCTTAC CTTTGGCTCA AGATTATAAG AACGCGTATC AAGACGGAAT AGGTCCAGAG
ACTGGAGGAA TGGGATCAAT TTCTGGACCT AACGAATTGC TTCCATTTAT CAGCAATGAA
GAGTATCAAA CAACTTATGA TATAGTTAAA AGGACTATGG ATGCGATATA CAAGGAGACT
GGAGAGAGAT ACGTAGGAGT TATTGCAGGA CAAATGATGT TAACTGAACT TTGGGGACCT
ACAGTAATTG AGTATTATTC AAGATTTGGT GATCCAGAAG CTTCCGCCAT AATTCCAAGA
TTAGAATCTG ATTTTGGAGA GACAATTGAG CTCACAGCTA CTGGACATTT GAATAAAGCT
AGTATAAAAA TAAACGAGAA ACCTTCTATA GTCAGAGCTG TTGCTACATT AGGATACCCT
ATCTCAAAAC AAATGGCATC TGGGCATAAG ATTGTAGTAG ATTTAGAAAA GATGAAAGAG
CGCGGATGCG TGGTATTTTT TGGATCTGTA GCATTAGAGG GAATGCAACT TATAACTAAA
GGCTCTAGAG CTTTAGAAAT AGTTGCAATA GGAGATTTCG AAGAAGCTGC TGAGAACTTA
GACAGATGTA TGCAATATAT TAGCAGTGAT ACTAAATTGA TATATAGGAC TGATATTGGG
AGGACAGTTA AATCTCAAAT TGAAAAGGCT GAAATCATAA GATATTCTTA TAAAAATAGA
GAAAAAAGAG GGATTCTTGG AGTTTCTGCA GATTGGTCTC CTAATGGTGG GTTATGGTGA
 
Protein sequence
MKVLLVGDGA RENVLAYSLA RSSKGYKIYA LSSYINPGIN SIVKTTGAEY FIGNVNSPEV 
IKEVIKKVNP DLGVIGPEDP LFNGIADIFR KEGISVFGAS KKCARIEESK AWARELMWKH
SIPGRLRYKV FYTIEDTAKF ILEYGGSVAI KPAGQAGGKG VKVIADLEAY LTHDKREALT
KSVNEIGSLY NKEGEPRIII EEKVDGPEYT LHVLSDGKTT ISLPLAQDYK NAYQDGIGPE
TGGMGSISGP NELLPFISNE EYQTTYDIVK RTMDAIYKET GERYVGVIAG QMMLTELWGP
TVIEYYSRFG DPEASAIIPR LESDFGETIE LTATGHLNKA SIKINEKPSI VRAVATLGYP
ISKQMASGHK IVVDLEKMKE RGCVVFFGSV ALEGMQLITK GSRALEIVAI GDFEEAAENL
DRCMQYISSD TKLIYRTDIG RTVKSQIEKA EIIRYSYKNR EKRGILGVSA DWSPNGGLW