Gene Ssol_1957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1957 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1741603 
End bp1742988 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content37% 
IMG OID 
Producthomocitrate synthase 
Protein accessionACX92168 
Protein GI261602565 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAG TAGGTATTTT AGATTCGACG TTAAGAGAAG GAGAACAAAC TCCTGGAGTA 
ATATTTACTG TAGACCAAAG AGTAGAGATA GCTAAGGCTC TATCCGATTT AGGAGTATCT
ATGATAGAAG CCGGTCATCC GGCTGTATCT CCAGATATTT ACGAAGGGAT AAAAAGAATA
GTCAAATTGA AAAAAGAGGG TATTATAACA TCAGAAATTG TAGGACACAG TAGAGCCGTA
AAAAGAGATA TAGAAATTGC AGCAGAATTA GAGGTAGATA GGATAGCAAT ATTTTACGGC
GTAAGTGATA TACATCTAAA GGCGAAACAT AAAGCAACAA GAGAAGAGGC TTTAAGGGTA
ATAGCTGAGA CAATTAGTTA CGCTAGGAGT CACGGCGTAA AAGTCAGATT TACCGCAGAA
GATGGTTCAA GGACAGACTT TGACTTCTTA GTTACAGTAT CGAGAACGGC TAGAGATGCA
GGTGCGGATA GGGTTAGTAT AGCTGATACT GTAGGCATAT TATATCCATC AAAAACCAAG
GAATTATTTA GCGCGTTAAT AAGGGAAGTT CCAAACTTGG AGTATGATAT TCACGCTCAC
AATGACTTAG GTCTAGCAGT AGCAAATGCA TTGGCTGCAG TAGAAGGTGG AGCTACGATT
GTTCATGCAA CGGTTAATGG GCTTGGAGAG AGGGTTGGTA TAGTACCTTT GCAACAAATC
GTAGCAGCTA TTAAGTATCA TTTTGGTATA GAAGTAGTTA AACTAGATAA ATTACAGTAC
GTTTCCAGTT TAATTGAAAA GTACAGTGGA ATTCCGATGC CACCTAATTA TCCCATAACT
GGGGATTACG CTTTTTTGCA TAAGGCAGGA GTTCATGTTG CGGGTGTGTT GAGTGATCCT
AGAACATATG AATTTATGCC TCCAGAGACG TTTGGTAGAA CAAGAGATTA CACTATTGAT
AAATATACAG GAAAGCATGC GTTAAGAGAT AAATATGAAA AACTAGGTGT GAAAATCAGT
GAGGCTGAAA TGGATCAGAT TTTAGCTAAA ATTAAGTCAA ATACGACTAT AAGATTTTAC
AGAGATGTGG ATTTACTAGA GTTAGCTGAA GAAGTTACCG GAAGAGTTTT GAAGCCAAGA
CCACCTGAGC AAATAGAAGC GTTAATTTCA GTTAAGTGTG ATTCTAACGT TTATACCACA
TCAGTAACTC GTCGTTTATC AGTTATTAAT GGCGTTAAAG AGGTTATGGA AATTTCAGGA
GATTATGACA TACTGGTCAA GGTTCAAGCT AAGGACTCTA ATGAATTAAA CCAGATAATC
GAAAGTATAA GAGCAACTAA AGGTGTGAGA TCAACATTAA CATCATTAGT CCTTAAGAAA
ATGTAA
 
Protein sequence
MIKVGILDST LREGEQTPGV IFTVDQRVEI AKALSDLGVS MIEAGHPAVS PDIYEGIKRI 
VKLKKEGIIT SEIVGHSRAV KRDIEIAAEL EVDRIAIFYG VSDIHLKAKH KATREEALRV
IAETISYARS HGVKVRFTAE DGSRTDFDFL VTVSRTARDA GADRVSIADT VGILYPSKTK
ELFSALIREV PNLEYDIHAH NDLGLAVANA LAAVEGGATI VHATVNGLGE RVGIVPLQQI
VAAIKYHFGI EVVKLDKLQY VSSLIEKYSG IPMPPNYPIT GDYAFLHKAG VHVAGVLSDP
RTYEFMPPET FGRTRDYTID KYTGKHALRD KYEKLGVKIS EAEMDQILAK IKSNTTIRFY
RDVDLLELAE EVTGRVLKPR PPEQIEALIS VKCDSNVYTT SVTRRLSVIN GVKEVMEISG
DYDILVKVQA KDSNELNQII ESIRATKGVR STLTSLVLKK M