Gene Ssol_2703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2703 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2478470 
End bp2479948 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content37% 
IMG OID 
Productpermease for cytosine/purines uracil thiamine allantoin 
Protein accessionACX92795 
Protein GI261603192 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAC AAGAAATAAA CCACGTTAAT TTAGACTTAA CGGAATATAA TCAAGGAACG 
ACTGTAGTAC CAGACAGTTA CTATAATCCA AACATAGCAC CGCTTCCTAA AAACGCAAAA
ACATGGACAT GGATAAATTA TGCTACCATA TGGGCTGGAA TGATACATAA CGTCCCCGCA
TTTATGCTTG CGGGGCTATT AACATTTGAG TTCGGCCCAC TAATAGCATT AATGATCATC
GCAATAGCCT ACTTTACCTT GCTAATAGCG CTATACTTAA ATGGGCATAT AGGTACAAAA
TGGGGAATTC CATTCCCCTC ATCAATTAGA CCAATGTTCG GAATAAGGGG TGCTAGAGTA
CCAGTAATAA TGAGGGCAAT TTCAGCATTG TTTTGGTTCT CCGTTGAGAC CTATGCTGGC
GGTCTAATAT TAGATGCGCT AATCTCAATC TTTTATCCCT CATGGTCAAC AATCTCAGCA
GACCTCTTAG GAATGCCACT CCATCTGACA ATTTCGTTCT TCCTCTTTTG GTTCCTTAAT
GTATTAGTCT TATTTAAGGG AATGGATGAT ATAAAGAAAT TTGAACTAAT TGCTGGTCCC
TTGGTAATAA TAATCTTAGG AGGTTTGATG ATTCACGCAA TTACTCTTGC AAATGGTCTA
TCATCATTGT TTCAAATAAG GGGCAATAAC GTTTCATTAC CTAACATAGC CTTAGCAATA
TCCACAATGG CAGGTTTTTG GGCAACCCTA GTCCTAAACA TTCCGGACTT TACGAGATTT
TCTAGAAGCC AAAAGGACCA ACTAATAGGA CAAACTATTG GTCTACCTAT ACTTACGTTG
CTTTTCAGCT TCATAGCAGT TGGGTTAGCA TCGGCAGTAA TTTATATTTA CAATATTCCA
AGTAATGACA CAATTAATTA TGTAAACCCA GTAAATATAA TGTATCTCTT TACTGACAAT
CCTTACATAA CGTTAATCTT AGGAATCAGT CTAGTTATTG CAACAATCTC AGTTAACGTT
GCAGCAAATA TTGTATCACC CGTTTACGAC TTGATAAGTT TATTCCCAAA GAAGCTTAAC
ACGTGGTCTA AATCAGCTAT TGTATCTGCA ATTCTGGGTT TACTTTACGC CCCATGGTTA
TGGTACAATA ACGCTTCAAG TATAGAAAAT GTGATAAATT TGATTGGTGC CGGTCTAGGT
TCTGTCGCCG GAGTCATGAT AGCCCACTAC TGGATATTAG GAAAAACTGA AATTAAACTT
GCAGATCTAT TTAAGCCAAA TGGAAGATAT TGGTATGTGT CAGGCTATAA CGTTAATGCG
TTAGTTGCAA TGATCATAGG GTTCTCTGTA CCAGTAATAG GATTTCTAAT TCCTAAACTA
TCCTTGCTAT ATGACTATGG TTGGTATCTT GGATTATTTT TGAGTATAGC AATATATTTG
GGATTGGAGA GAAAAAGAGA AGTGAAAATG GAACCTTAA
 
Protein sequence
MSEQEINHVN LDLTEYNQGT TVVPDSYYNP NIAPLPKNAK TWTWINYATI WAGMIHNVPA 
FMLAGLLTFE FGPLIALMII AIAYFTLLIA LYLNGHIGTK WGIPFPSSIR PMFGIRGARV
PVIMRAISAL FWFSVETYAG GLILDALISI FYPSWSTISA DLLGMPLHLT ISFFLFWFLN
VLVLFKGMDD IKKFELIAGP LVIIILGGLM IHAITLANGL SSLFQIRGNN VSLPNIALAI
STMAGFWATL VLNIPDFTRF SRSQKDQLIG QTIGLPILTL LFSFIAVGLA SAVIYIYNIP
SNDTINYVNP VNIMYLFTDN PYITLILGIS LVIATISVNV AANIVSPVYD LISLFPKKLN
TWSKSAIVSA ILGLLYAPWL WYNNASSIEN VINLIGAGLG SVAGVMIAHY WILGKTEIKL
ADLFKPNGRY WYVSGYNVNA LVAMIIGFSV PVIGFLIPKL SLLYDYGWYL GLFLSIAIYL
GLERKREVKM EP