Gene Ssol_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1102 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1032372 
End bp1033952 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content36% 
IMG OID 
Product2-isopropylmalate synthase/homocitrate synthase family protein 
Protein accessionACX91345 
Protein GI261601742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCAAGA AATCTGTTGA GGTTCTAGAT ACAACATTAA GAGATGGTTC ACAAGGAGCA 
AATATATCTT TTACTCTAAA TGATAAGATT AAGATAGCAT TACTCTTAGA TGAGCTTGGA
GTAGACTATA TCGAAGGGGG ATGGCCAGGA TCTAACCCTA AGGATGAGGA GTTCTTTAGA
GAGATAAAGA AATACAGACT ATCTAAGGCT AAAATAGCTG CTTTTGGGAG TACCAAAAGA
AAGGATGTTA GTGTAAAGGA AGATATTAGC TTAAATAGTA TAGTGAAAGC GGATGTGGAC
GTTGCTGTGA TATTTGGTAA GTCTTGGTCA CTGCATGCTA CCGAAGTCCT TAAGGTAACT
AAACAAGACA ATTTAGATAT AGTATATGAT AGTATTAATT ATTTGAAATC GCATGGACTT
AAGGTAATAT TTGATGCAGA GCACTTCTAT CAAGGCTTTA AAGAGGATCC GGAATATGCA
CTAGAGGTAG TAAAAACTGC AGAGTCCGCT GGGGCAGATG TAATTGCTTT AGCTGATACA
AATGGTGGAA CTCCACCGTT TGAGGTTTAT GAGATAACCA AGAAAGTTAG GGAAGTTCTA
CAGGTTAAGT TAGGAATTCA CGCTCATAAT GACATAGGAT GTGCAGTTGC TAACTCTCTA
ATGGCTATAA AGGCGGGAGC TAGACATGTT CAAGGAACAA TCAACGGGAT TGGCGAGAGA
ACTGGCAATG CTGACCTAAT ACAGATAATA CCAACGCTAA TATTAAAAAT GGGATTAAAT
GCATTAAATG GTCAAGAGAG TTTAAGGAAA TTGAGAGAGG TTTCAAGGAT AGTATATGAA
ATATTAGGAT TACCTCCTAA TCCTTACCAA CCTTATGTTG GTGATAATGC TTTTGCGCAC
AAAGCTGGTG TTCATGTAGA TGCTGTAATG AAGGTACCTA GAGCATATGA ACACGTTGAT
CCTTCATTAG TGGGTAACGA TAGGAAGTTT GTAATTTCAG AATTGTCTGG AACTGCTAAT
CTGGTCTCAT ATTTACAAGG CTTAGGAATA GCGGTTGATA AGAAAGATGA AAGGCTTAAA
AAGGCTTTAA ATAAGATTAA GGAACTAGAG GCTAGAGGAT ATAGTTTTGA TGTTGGACCG
GCTTCCGCAA TACTAATAAC ATTAAAGGAG TTGAACATTT ATAAGAACTA TATAAATCTA
GAATATTGGA AGGTAATAAA TGAAAATAAT GGTCTTTCTA TAGGGATTGT TAAAGTTAAT
TCTCAGCTTG AAGTTGCTGA GGGCGTAGGT CCAGTTAATG CGATAGATAG AGCATTAAGA
ATGGCACTTC AGAGAGTATA TCCAGAAATA GGTGAAGTTA AGCTAATAGA TTATAGGGTA
ATATTGCCTA GTGAGATTAA GAATACTGAA AGCGTTGTTA GAGTTACCAT AGAATTTACT
GATAATAAAA TGAATTGGAG AACTGAAGGA GTATCTAAGA GTGTAGTTGA AGCTTCGGTT
ATGGCATTAG TAGATGGGCT TGATTATTAT CTACAATTAA AGAAGACATT AAAAACAGCT
GTAGACAATT ATATCGTGTG A
 
Protein sequence
MSKKSVEVLD TTLRDGSQGA NISFTLNDKI KIALLLDELG VDYIEGGWPG SNPKDEEFFR 
EIKKYRLSKA KIAAFGSTKR KDVSVKEDIS LNSIVKADVD VAVIFGKSWS LHATEVLKVT
KQDNLDIVYD SINYLKSHGL KVIFDAEHFY QGFKEDPEYA LEVVKTAESA GADVIALADT
NGGTPPFEVY EITKKVREVL QVKLGIHAHN DIGCAVANSL MAIKAGARHV QGTINGIGER
TGNADLIQII PTLILKMGLN ALNGQESLRK LREVSRIVYE ILGLPPNPYQ PYVGDNAFAH
KAGVHVDAVM KVPRAYEHVD PSLVGNDRKF VISELSGTAN LVSYLQGLGI AVDKKDERLK
KALNKIKELE ARGYSFDVGP ASAILITLKE LNIYKNYINL EYWKVINENN GLSIGIVKVN
SQLEVAEGVG PVNAIDRALR MALQRVYPEI GEVKLIDYRV ILPSEIKNTE SVVRVTIEFT
DNKMNWRTEG VSKSVVEASV MALVDGLDYY LQLKKTLKTA VDNYIV