Gene Ssol_0414 details


Gene Information

Locus tag: Ssol_0414
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 372055
End bp: 373545
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: ABC-1 domain protein
Protein accession: ACX90697
Protein GI: 261601094
COG category:
COG ID:
TIGRFAM ID:
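
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 372055
end_bp = 373545
gene_length_bp = 1491
protein_length_aa = 496

# Assumption: coordinates are 1-based and inclusive, so both end points count.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in one stop codon,
# which is not part of the translated protein.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a multiple of 3:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks print True for this record.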


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATTAGAA GATTACTGGA AGTGTTTTTC AAATTGGCCC CTAGAGTACT AGCGTATAGA 
GAGTTTAGAA ATAAAATCCT AAAGGAAATT CCAATCGATG AAAAAGAGAT GGCAGAAGAG
GCTAAGAAAT TTGTTGACAC GCTAATTGAA CTAGGTCCCA CATTCATCAA GTTCGGACAA
ATTCTTTCAG TTAGACCTGA CATTATGCCG GAAACGTATA TTAAAGAATT AGCCAGATTA
CAAGATGAAG TCCCACCTGC TCCATTTAGC CAAGTAAGTA AAATTATCGA TGAGGAACTC
GGCAATAGTG TAAAAATTTT AAAAGAACTG TCTTCAGCGT CTTTAGGTCA AGTCTATCTA
GGAGAATATA ATGGAAAGAT CGTTGCTATT AAGGTAAATA GACCTGGAAT AAAGGAAACT
GTAAATGAAG ATATTCAAGT AGTAAAGAAA TTGTTGCCAC TCCTTAGATT TGTATTTGAT
GAATCATTCA TCGAAATAAT AAAAGTATTC CTAGAAGAAT TTTCCAGAAG AATATTTGAA
GAAATGGATT ATACTAAGGA AGCCTTTTAT TTAAATAAGA TCAAAGAGGA ATTGTCAGAT
TATCCGTCCT TAAGAATACC TTCAATAATA AAGGCTACTA AAAGAGTTTT AGTGATGGAG
TATATTAAAG GGTATAAAGT TACAAGCGAA GAGGCAAAAA AGATAGTTGA TAACAGAATA
TTGGCATATA GAGTATTTAG ATTGTTCATG TATATGTTGC TTAATAAGGA CTATTTTCAT
GCAGATCCTC ACCCTGGTAA TATAGCGGTT GACGAGCAAG GGAATTTGGT ACTCTACGAC
TTTGGAATGT CTGGAAAAAT AGATGAAAAG ACTAGAAATC TATTGATTAG GGCATATGTG
GCAATGATTA GAATGGATGC AGATTCACTA GTAAGGGTAC TAGACGAGTT AGGGGCAATA
CAGCCTTTCG CGGATAGGAG GGTATTAGCG AAGGGATTGA GGCTTTTCAT GCAAGCAATG
CAAGGAATAG AAGTTAGTGA ATTAGAGCTG GAAGATTTTA TGAAACTGGC TGATCAAGTA
TTCTTTAAGT TTCCACTTAG AATGCCATCT AAACTTGTTC TGCCTTTTAG AATGGTTAAC
GTTTTAGATG GGACATGTAG AGAAATAGAT AAAGATTTTG ACTTTGTCAA ATCATCTATA
ACTTTTTTAG AAGAAGAAGG GTATACAACG AAAGTTGTTA TAGAACAAGT AAGAGAACTA
GTTAATGGAA TTTGGAATAG ATTTAGGAGC TTTCTATTAT CATATTCACA ACAGCAAGAG
CTGATAAACA TACAAAGCAG TAGGAAAGGT AGTGCAATAA CTAATTATAT TCCTCAAATG
ATATTGGTTA TAACCATAAT ATTTTATGCT ATAACAAAAG ATATTATCAT TACCCTCCTT
ATGGTTATTC TAGCATTCTC AATATCACTA AGTGGAAGAA AAAATACATA A
 
Protein sequence
MIRRLLEVFF KLAPRVLAYR EFRNKILKEI PIDEKEMAEE AKKFVDTLIE LGPTFIKFGQ 
ILSVRPDIMP ETYIKELARL QDEVPPAPFS QVSKIIDEEL GNSVKILKEL SSASLGQVYL
GEYNGKIVAI KVNRPGIKET VNEDIQVVKK LLPLLRFVFD ESFIEIIKVF LEEFSRRIFE
EMDYTKEAFY LNKIKEELSD YPSLRIPSII KATKRVLVME YIKGYKVTSE EAKKIVDNRI
LAYRVFRLFM YMLLNKDYFH ADPHPGNIAV DEQGNLVLYD FGMSGKIDEK TRNLLIRAYV
AMIRMDADSL VRVLDELGAI QPFADRRVLA KGLRLFMQAM QGIEVSELEL EDFMKLADQV
FFKFPLRMPS KLVLPFRMVN VLDGTCREID KDFDFVKSSI TFLEEEGYTT KVVIEQVREL
VNGIWNRFRS FLLSYSQQQE LINIQSSRKG SAITNYIPQM ILVITIIFYA ITKDIIITLL
MVILAFSISL SGRKNT
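
The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates the idea; it is not the database's own pipeline. The function name translate_cds is hypothetical, the codon table is built from the standard amino-acid ordering used for table 11, and the input is assumed to be a complete, in-frame CDS.

```python
# Codon table in the conventional T, C, A, G ordering (amino acids as in table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate an in-frame CDS; hypothetical helper for illustration."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 initiation codon")
    # Any valid initiation codon is translated as methionine.
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nucleotides of the gene sequence above.
print(translate_cds("GTGATTAGAA GATTACTGGA A"))  # prints MIRRLLE
```

The output matches the first seven residues of the protein sequence (MIRRLLE). Passing the full gene sequence to the same function should reproduce the full protein under the stated assumptions.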