Gene Ssol_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1898 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1685489 
End bp1687015 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content38% 
IMG OID 
ProductGlycine dehydrogenase (decarboxylating) 
Protein accessionACX92110 
Protein GI261602507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGAGAC AAGCTAAATG GAATGAGCCT TTAATATTCG AGCTAAACAA AAGTGGTGCA 
GCTAGACAAG GTTTCTTGAT AAATAAGGAC GAGGAGATAA GAAGTCAGAT TAAGGAAATG
AAAATACCTA AAAATCTCCT AAGGGAAAAT GAGCCAGATC TACCAAGTTT AAGTGAGCTA
GAAGTAGTAA GACATTTCGT AAGACTATCT CAGATGAATT TTGGAGTAGA TATTGGAATT
ATGCCACTAG GCTCATGTAC AATGAAATAT AATCCAAAGA TTGAGGAGAA AGCTACAGCA
ATTACGGAAT TTCATCACCC CTTAGAGGAT GAAGACTATA TTCAAGGAAT ACTAGAGATG
ATTTACGAAC TTCAGAACTG GTTTAGCGAA ATAACCGGCA TGGACGAATG TAGTTTACAA
GTGCCGGCTG GATCTGCTGG TGAGTTCGCA GGAGTCCTAA TGATTAAAAA GTACCATGAG
GAGCACAATA GAAATTATAA AGATACAATA CTGGTCGCAG ATACTGCTCA TGGAACTAAT
CCTGCAAGTG CAGCAATGGC TGGATACAAA GTTATGTATG TAAAATCAAA TGCAGAAGGA
CTAGTAGACA TGGATATCTT AAGGGAGATC GTAAACGATA AAACTGCCGG CTTCATGCTA
ACTAATCCAA ATACGTTGGG ATTATTTGAG GAGAATATCC TAGAAATCTC CAAGATAATA
CATTCTACAA ATGCTGTATT GTACTATGAT GGAGCCAATT TAAATGGAGT GTTAGGTATT
GCGAGGCCAG GAGATATGGG ATTTGATATT GTACATTTAA ACCTCCATAA GACATTCGCT
GTTCCACATG GAGGAGGTGG ACCTGGGGCA GGCGCAATTT GTGCAAAGGG TGAACTCGTT
AATTACCTCC CATATCCGAT GGTAGAGAAG GTAAATGGAA AGTATAAGTT AAGCAAGATT
CCGAAGAATA GTATTGGTAA AATAGCTACA TTTTACGGCA ACGTTGGTAA TTTAGCGCGT
AGTTTTGCAT ACATATTAGG ATTAGGACCT CAAGGTATTC AAATGATAGG AAAAATGAGT
ACTTTAGCGA CTAATTATCT CATAGCTAAG TTAAGAGACG TTAAAGAACT GGAGTTAATT
GCGCCAAATA GGCATAGGAA GCACGAGGTA GTATTTAGCG TGAAACAGTT GATGGAAAAT
TATGGAGTAA GTGCTAACGA TGTTGCTAAA GCCTTACTCG ATAATGGATT TTACGCTCCA
ACCATATACT TCCCACCAAT TGTCGAAGAA GCACTAATGA TAGAGCCTAC GGAAACTGAG
ACTAAAGAAA CTTTAGATAT GTTTGCCGAA ACTCTTAAGA AGATTGTGAA TGATGCTAAA
ATAAACCCAG AACAAGTTAT GAAAAGCCCT AATAATACAA GTATTGCAAG ATTAGATCAA
GCTTATGCTA ATCATCCTTC AACTATAACG CCAACATATA GAGTGTTAAG GCTGAGGAGG
TTAGGTAAAA TAGATTACCT TAAATAA
 
Protein sequence
MWRQAKWNEP LIFELNKSGA ARQGFLINKD EEIRSQIKEM KIPKNLLREN EPDLPSLSEL 
EVVRHFVRLS QMNFGVDIGI MPLGSCTMKY NPKIEEKATA ITEFHHPLED EDYIQGILEM
IYELQNWFSE ITGMDECSLQ VPAGSAGEFA GVLMIKKYHE EHNRNYKDTI LVADTAHGTN
PASAAMAGYK VMYVKSNAEG LVDMDILREI VNDKTAGFML TNPNTLGLFE ENILEISKII
HSTNAVLYYD GANLNGVLGI ARPGDMGFDI VHLNLHKTFA VPHGGGGPGA GAICAKGELV
NYLPYPMVEK VNGKYKLSKI PKNSIGKIAT FYGNVGNLAR SFAYILGLGP QGIQMIGKMS
TLATNYLIAK LRDVKELELI APNRHRKHEV VFSVKQLMEN YGVSANDVAK ALLDNGFYAP
TIYFPPIVEE ALMIEPTETE TKETLDMFAE TLKKIVNDAK INPEQVMKSP NNTSIARLDQ
AYANHPSTIT PTYRVLRLRR LGKIDYLK