Gene Ssol_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1124 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1050066 
End bp1051302 
Gene Length1237 bp 
Protein Length411 aa 
Translation table11 
GC content32% 
IMG OID 
ProductCitrate transporter 
Protein accessionACX91362 
Protein GI261601759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAATAA GTGCACTAAT AGTCATGGTA ATAACATACT GCCTTATAAT TTCTAGAAGT 
ATTACGAAAA TTCCACCTTG GGCTTCTATG TTTTTTGGTG GTATCCTTAT GGTAATTCTA
GGTATAATAT CTCCAGAAGA AGCTTTACAA TCAATAAATT TAGATGTAAT ATTATTTCTC
ATTACCCTTT TTACATTTGC ATCAGCGTTA GAGGTTTCTG GATTTTTGAA ATTCCTTGCA
TATAAGATTA TAGAAAAATT CAAGGAACCT AGGAAAGTTC TCTTCTATAT TCTTTTATAT
TCTGGTCTAT TATCAAATTT AGTTACCAAC GATGGAGTAT CAGCAAGCTG GACTCCAGTC
ATCTTAGAAT TAAGCAGGAT GATAGGCGTT TCTGAGGTTC CTTTTCTTTA TGCATTAGCT
GTTGGTGTTA CTATTGGGAG CGTTATAATG CCTACTGGCA ATCCTCAAAA TTTACTCATA
GCTTTAGAAT CTGGAATAAA AAACCCTTTC ATTACATTTA CAATATATTT GACCTTACCC
TCAATAATTA GCTTAATAAT TGCTTATTTT ATACTCTTTC GTCTATTCAG AAAATCCTTG
TCTTTACCAA GTGGAATTAA TATAAAAAAA AGAAGAAGAG GAAAAGGTTG ATTTCGATAG
AAGACTTGGA TATCTGACAT TAACCTTATT AGTAGTTACC ATAATATTAT TTTTTTCCTT
AAGTTTCTTT AAAATAGATA TTTTACTGGG TTCTTTAGTT ACTTCATCTA TCTTACTGCT
TTTAACGGAA AAGAGGAGGG ATATTGTAAG AAGAATGGAC TGGCCAACTA TACTATTTTT
TATCGGATTG TTCATATTTA CTGATGGAGT ATTAAAATCT GGGATTATAC AGTATTTATC
TAATTTTCTT CCCCCTCCAG ATAGTGTGGC TAGTATAATG ATTGTAAGTA TTTTACTGAG
CCAAGTATTA AGTAATGTAC CATTGGTTGC AATATACATA CCGATCATGA TCTCTCATAG
TGGTATTACA GTGGTGGATT GGCTAGCGTT AGCTGCGGGT AGTACTATAG CGGGTAACTT
CACCATATTA GGCGCAGCAA GTAACGTAAT AATTTCTGAA GCTTCTGAGA GCAGAGGTGG
AAAAGGATTT AATTTCGTAG AATTTATGAA ATATACTATT CCGATTCTAA TACCAAATGC
AATAATCATT TATTTGTTTT TAGTATTATT CAGGTAA
 
Protein sequence
MLISALIVMV ITYCLIISRS ITKIPPWASM FFGGILMVIL GIISPEEALQ SINLDVILFL 
ITLFTFASAL EVSGFLKFLA YKIIEKFKEP RKVLFYILLY SGLLSNLVTN DGVSASWTPV
ILELSRMIGV SEVPFLYALA VGVTIGSVIM PTGNPQNLLI ALESGIKNPF ITFTIYLTLP
SIISLIIAYF ILFRLFRKSL SLPSGINIKK EEEEKVDFDR RLGYLTLTLL VVTIILFFSL
SFFKIDILLG SLVTSSILLL LTEKRRDIVR RMDWPTILFF IGLFIFTDGV LKSGIIQYLS
NFLPPPDSVA SIMIVSILLS QVLSNVPLVA IYIPIMISHS GITVVDWLAL AAGSTIAGNF
TILGAASNVI ISEASESRGG KGFNFVEFMK YTIPILIPNA IIIYLFLVLF R