Gene Ssol_1206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1206 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1124485 
End bp1126287 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content35% 
IMG OID 
Producttranslation initiation factor aIF-2 
Protein accessionACX91444 
Protein GI261601841 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.142011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA GCAACTCTGA GAGAAGGCTT AGGCAACCTA TAGTAGTAGT ATTAGGCCAT 
GTAGATCATG GAAAAACTAC ACTACTTGAT AAAATAAGGG GTACAACAGT AGTTAAAAAG
GAACCTGGAG AAATGACACA AGAGGTAGGA GCTAGTTTCG TTCCAAGTTA TATAATTGAA
AAATTAGCAG AACCTCTTAA GAAGGTAATA CCCATAAAAC TTCAGATACC AGGATTATTG
TTTATTGACA CGCCAGGTCA TGAATATTTT TCAAACTTGA GAAGAAGAGG TGGAAGCGTA
GCAGATATTG CGATCCTAGT TGTTGATATA ACTGAAGGCC TACAGAAGCA ATCAATAGAA
TCAATACAAA TACTAAGAGA AAGAAAAGTT CCATTTCTCA TAGCTGCTAA TAAAATAGAT
AAAATACCCG GATGGAAATC AAATAATGAC ATACCGTTTT TAGCATCAAT CGAGAAACAG
AGGAATGATG TGAAAGTTTA TCTAGACAAC TTAGTCTATA ATTTAGTTTC TCAATTAGCA
AACTTAGGCT TTAGCTCAGA ACGTTATGAT AGAATAAAGG ATTTCACTAA AACAGTAGCA
ATAGTTCCGG TTTCTGCAAA GACCGGTGAA GGCGTTGCAG ATCTTTTAGC ATTACTAGCT
GGATTAACCC AAAGGTACTT AGAGACTAGA TTAAAATTTG CAGAGGGTCC AGCAAAGGGA
GTTATATTAG AAGTAAAAGA AGATCCAGGG TTAGGACACA CCATAGATGT TATAATTTAT
GATGGAGTGC TTAAGAAAAA TGATACTATA ATATTAGGAG GCATTAACGG CATTATTATA
ACTAAAGTTA GAGGAATATT TGTACCTAGA CCATTACAAG ATATGAAATT AAGCAAGTAT
GATCTAACGC CAATAGATGA AGTATATGCA GCAGCTGGAG TGAAGATATC TGCACCCAAT
TTAGAGGAAG CATTAGCTGG ATCACCAATT TATGTGGTAG AAGACGAGTC TAAGGTTGAG
CGATATAAAC AACAGATAGA AGAGGAAATT AAAGAAGTTA GACTCTACAG CGATATTGAC
GGAATAATAC TCAAGGCAGA TAGTTTAGGA ACATTAGAAG CCTTAGTTAG TGCTCTGCAG
CGTGAAGGGA TTCCAATAAG GCTAGCGGAC ATAGGGCCGA TTTCGAAAAG AGATGTTATA
GAAGCGAGTA TAGTAGCTCA AAGATCAAAA GAATATGGAA TTATTGCTGC TTTTAGAGTA
AAGTTATTAC AAGGAATTGA TACTAGTGGA ATAAAAATAT TGTATAACGA AATAATTTAT
CAATTAATCG AAGATATCAA GAAGCATATT AATGATGTCA GGGAAGCGGA AAAAAGGCGC
ACGTTTGACA CATTAATATT GCCAGGGAAA ATAAAGATCT TGCCGGGTTA TGTATTTAGA
CGCAGTGACC CAGTAGTAGT AGGTATTGAG GTTATAGGAG GCATTATAAG ACCTAAGTAT
CCGTTAATTA AGGAAGATGG AAGGAGAGTC GGTGAGGTAC TACAAATCCA GGATAATAAG
AAAAGTCTAG AAAGAGCCAC TAAAGGAATG GAAGTTGCAA TATCAATTAA AGGCAATATA
ATGATTGGGA GACATGTAAA TGAAGGGGAT GTTTTATACA CAGACGTACC TAAAGAAGAC
CTCGAGATAT TAGTCAACAA GTATCCAAGT TCTATTACAG ATGATATGAG GGAAGTAATA
AAAGAAATAA TAAGAATAAA GAGAAAAGAA GATCCTTTAT ATGGATTAGG ATTACAGATC
TGA
 
Protein sequence
MKISNSERRL RQPIVVVLGH VDHGKTTLLD KIRGTTVVKK EPGEMTQEVG ASFVPSYIIE 
KLAEPLKKVI PIKLQIPGLL FIDTPGHEYF SNLRRRGGSV ADIAILVVDI TEGLQKQSIE
SIQILRERKV PFLIAANKID KIPGWKSNND IPFLASIEKQ RNDVKVYLDN LVYNLVSQLA
NLGFSSERYD RIKDFTKTVA IVPVSAKTGE GVADLLALLA GLTQRYLETR LKFAEGPAKG
VILEVKEDPG LGHTIDVIIY DGVLKKNDTI ILGGINGIII TKVRGIFVPR PLQDMKLSKY
DLTPIDEVYA AAGVKISAPN LEEALAGSPI YVVEDESKVE RYKQQIEEEI KEVRLYSDID
GIILKADSLG TLEALVSALQ REGIPIRLAD IGPISKRDVI EASIVAQRSK EYGIIAAFRV
KLLQGIDTSG IKILYNEIIY QLIEDIKKHI NDVREAEKRR TFDTLILPGK IKILPGYVFR
RSDPVVVGIE VIGGIIRPKY PLIKEDGRRV GEVLQIQDNK KSLERATKGM EVAISIKGNI
MIGRHVNEGD VLYTDVPKED LEILVNKYPS SITDDMREVI KEIIRIKRKE DPLYGLGLQI