Gene Ssol_1364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1364 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1261832 
End bp1263061 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content38% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionACX91598 
Protein GI261601995 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACAC TGGGAGTTGT CTACGATAAA TTCCTCTCAC CGTATTTTGC TGGTGGTGGA 
GCCGTTCATG CTTATGAGGT TACGATTAGG CTTAAGGAGC ATTTCAAAAT TGTATATTAC
CCTTCTAGCC CAGTCCTTTC ATGGGATAAG GAGAACGTAG AGAAGAAGGC TAAGGAATTA
GAGAGCCAAG GCATAAAGGT TGCTGATGAA TTTTATGAGA TATTGGAGGA GAAGAGGAGA
ATTGGAAGGC TTAAGAGGTT TTTATTTGCC GATAAGATCG CTAGGGAGTT TTCCAAGGGT
TTTAAAGTTG ACGCCGATAT CTTGTACGAG CCAGACCACA CATCCCTTGA TATTTTCTAT
CTGGCTAGGG ATACTAAATA TGGCGTAACT TTCCATGAAC CCCCCTTTTA TAATAACTCC
CTTAGATACT TTAGGAGATT AGTCAAATTT TATGGTGTAA ATCCATATAC TGGAAAAGGT
TTTCACACTA GGTTTCTATA CAACGAGTAT ATAAAATATT TGTATAAGAG GTTGTTTAAA
AAAGTGAAAA AACCTACTTT TTTAGCTGGT GTCAGTGAAG CTCCTTTACT TGAGTCTGGT
TTAGGTGGTG AGGTTATTAA ACCCGGAAAT GCTTTTAATC CTTCTCTTCT GAAGTTTAGG
AATAGGGGGA AAGAGGATTA CGTTGTATTC TGGAGTAGGT TAAATCAAGA TAAGGGTTTT
CATGAGTTGC CAGACATTTT GCGCATTATG GAAAAGAGGG GTGGTAATAA GGTAAGGTTA
ATTCTAATGG GCAAATTTTT CGATAAATAC AACGAGAGGA GGTTTTGGTC TAAGGTCAGA
AAATACGATT TGAGGGTTGA CTATAAGGGC TTTGTTAAGA GGGAGGAGTT AGCAGATATT
GTTTCTAAGG CTAAGGTTCT AATTTATCCA TCTCATGTTG ATGGTTTCTC ATTAGTTGCT
CTAGAATCTC TAGCCCTAGG TACGCCGGTT GTTGCCTATG ACATTCCCGC AATTAAGAGT
GTTTATGGAG GATTAGAGTG TGTTAGGATT GTTAATGAAT TCGATAAGGA AAGTATGGCT
GAAAACGCTT TAAAGTTCTA CAAAATGAGT GAGAAAGAAA TTGAAGAGAT CATGAATGGA
GATAAGTTAA TGGAATTCTT AAAGCTGCAT TCGAATTGGG ATAATGTTGC CAATTCTGTC
TTGAAAATTT TAAAGAAGTA TCTTATTTGA
 
Protein sequence
MTTLGVVYDK FLSPYFAGGG AVHAYEVTIR LKEHFKIVYY PSSPVLSWDK ENVEKKAKEL 
ESQGIKVADE FYEILEEKRR IGRLKRFLFA DKIAREFSKG FKVDADILYE PDHTSLDIFY
LARDTKYGVT FHEPPFYNNS LRYFRRLVKF YGVNPYTGKG FHTRFLYNEY IKYLYKRLFK
KVKKPTFLAG VSEAPLLESG LGGEVIKPGN AFNPSLLKFR NRGKEDYVVF WSRLNQDKGF
HELPDILRIM EKRGGNKVRL ILMGKFFDKY NERRFWSKVR KYDLRVDYKG FVKREELADI
VSKAKVLIYP SHVDGFSLVA LESLALGTPV VAYDIPAIKS VYGGLECVRI VNEFDKESMA
ENALKFYKMS EKEIEEIMNG DKLMEFLKLH SNWDNVANSV LKILKKYLI