Gene Ssol_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1094 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1023286 
End bp1024797 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content29% 
IMG OID 
Productconserved hypothetical protein 
Protein accessionACX91337 
Protein GI261601734 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00376614 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAACTAA AGCTATTTAA AAAAATGAAT TTATTTGAAT TAATGAATAC AAGAATAAAT 
GAAAATATCA AATATTATGG TGATGATATG GAAAGACTCC GTAAAGAATA TATTGTGATG
TTGTTTTTAA TGCCGCTAAT ATCTAGTATC TCTATTATCT TATATTTGAA AATTAGTCCA
TATTTTTTAC TGTTAAATAT AATGAATGTT TTTATTTATT TCTTTCCTAT ACTTATTACT
CAAATTAGAA AGGATGAACA AAGAAAGCTT ATAGAAAACG AGATGCCCGT TTTCTTACTA
TTCGCTTATG TTAATAGCCT TTTAGGAAGA AATCTATATA AAACGTTTGA AGAAATAAGG
AATTCTAAGG TTTTTAAAGG TTTGAGAAGA GAGGCCATGT TGCTTGTTAA AGAGGTCGAA
GTTTTAGGAA AATCATCTTT TTCGGCAATG GAAAGCAGAG CCAAGGCTCA TAGAGGAGAC
TTCTTAGGTA AGATTTATAC GGTATACACA AGTGGTGAGT CAATAGGTAT TTCAATGCCC
GAGAGATTGA AAGACCTTTT AAATGAAACT ATTGATGGCT TGAATTCAAA TTTTCAAGGT
TATGTTGAAA AAGTAAATGA ATTAGTTGAG ATTTTGTTCA TGCTATTTCT TATTACTCCG
ATGATTTTAT TAGCATTTCA GTATATTTCC TCATCCATAA ATATTTTCGA ACTTATATTT
CCTCTTTTGC TATTCCCTAT CATTTTCTTT TATATTTCAC TTATACAGCC TAATATAGGA
TATGATATAA AAATCGATAT AAAAGAGGTT AAAAACTCGC TCTACATATT ACCTATTCCC
TTTATACTCG TGTTCCTCTT TCACCTTAGC CTAGAGTATG AAATCTTAGT ATTTTATTCG
ATATTTATAA TGTTCTCCTT CTTTATCTAT CGTAAAATAT CAGTTGCTGA CGCAATCTTG
AATAATTTAC CATATATATT ATCAGATATT GCAGATTATT TAAAAATAGG TTATAGTATA
AAAAGTGCAA TATTGAAACT TAACGTAGAT AATACGCACT TTAGGATGTT CTTAGGAGAA
CTCGCTGCAA AGATCAGGAA AAACGAAGCG ATATCTAATG TAAGAACTAA TATCTGGATT
GTGAATGCTA TCTTAGAACT CATAGAAAAT ATTGACAAAA AAGGTTTTGC AGACACCTAT
ACTTTCAAAG ATTTATCCTT AATTCTCTAT AATTATACTT CTTTGCGAAA AAAAGTATTA
CAAAATCTTC GTCTATTCAG CATTCTAGCT ACTATAACAC CTATTGTATT CTATTTTGCA
CTAGGGATAA TGAGTAAGAT TAGGGCAGTT GGCAACTTAG ATCTAATTAT CGTGTTGTAT
ACAATGGCAC TTAGTATAAT TTATGCTAAG GTATCCCGAT TTACCGTTTT TAATTTTCCA
TTACTTGTTT TAGCTCTAAT GAACCTAATA ATAGTATTAC TATTCGGAAA CGTTATCTTA
ACCTTTATCT AG
 
Protein sequence
MKLKLFKKMN LFELMNTRIN ENIKYYGDDM ERLRKEYIVM LFLMPLISSI SIILYLKISP 
YFLLLNIMNV FIYFFPILIT QIRKDEQRKL IENEMPVFLL FAYVNSLLGR NLYKTFEEIR
NSKVFKGLRR EAMLLVKEVE VLGKSSFSAM ESRAKAHRGD FLGKIYTVYT SGESIGISMP
ERLKDLLNET IDGLNSNFQG YVEKVNELVE ILFMLFLITP MILLAFQYIS SSINIFELIF
PLLLFPIIFF YISLIQPNIG YDIKIDIKEV KNSLYILPIP FILVFLFHLS LEYEILVFYS
IFIMFSFFIY RKISVADAIL NNLPYILSDI ADYLKIGYSI KSAILKLNVD NTHFRMFLGE
LAAKIRKNEA ISNVRTNIWI VNAILELIEN IDKKGFADTY TFKDLSLILY NYTSLRKKVL
QNLRLFSILA TITPIVFYFA LGIMSKIRAV GNLDLIIVLY TMALSIIYAK VSRFTVFNFP
LLVLALMNLI IVLLFGNVIL TFI