Gene Ssol_2449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2449 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2253677 
End bp2255038 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content38% 
IMG OID 
Productpermease for cytosine/purines uracil thiamine allantoin 
Protein accessionACX92598 
Protein GI261602995 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGAA AGGAGGAAAT TAGCTCAAAA TACGACGATT ATTCACTGAA GGAAGTTCCT 
AAAGATTCCA GATACGGCTT CTTTAACGTT TTTCTAGTAT TTTCATCTGT ATATGGTGCA
ATAGCTGTAA TATGGGCTGG AGGAGCACTA GGTTACGGTC TCACATTTTC TCAAGCTATA
ATTGCAGTAT TGTCGGGAAC AGTAGTATTA GGCATCTTAG GTTCATTGAC TGCAGCTGTG
GGAGCTTATA GTGGCCTTTC CACTTATGTT ATGTGGAGAC ATCCTTTAGG AAGATGGGGA
GGTAAAATTG CTGGATTGTT ACTGATAACT ATAACCACGG GAATAGGGTG GTATGCAGTA
GAAACATGGC TATTTGGTAT AGTAATGAGC GAGATATTCC CAAATAATCC ATTCTTTTCA
GTTGGGGTAG CTGCGATTTG GGGAGGAATT TTGATGACAA TAATGACATA TGTAGGGTAT
AGAATGCTGT CTTTCCTAAG TTACTTTACA ATTCCATTTC ATATATGGCT GATAGCAATA
GGAATAGCAA TAGTGTTAGC ACTAAAAGGG GGATTCCACA CAGTTATGGC TGCTGTCCCA
ACAAGCCATA TGAGCTTGCT TGACGGTATA TCTGCTACCA TAGGACTATA TAGCGCTGGG
ACTATAATTT CTCCCGATAT CTCCAGATTT GCCAAATCAG CTAAGGACGC TGGATATGCG
TGGTTTGCTC ACATTATTTT CCTATATCCA TTCTTAATAT TGGGGGGAGT TGCAATAGTG
TTAGCAACTG GTTCCTATTT AATAACTAAC GCAATGTTAG AGTTAGGTAT GGGAGTTGGT
GTTTTACTAA TTATAGTCTT TGGTCAGTTC ATAATAAACA CTGATAATCT ATATAGTGGT
TCCTTATCTT TAGTTAACCT AATTCCAATG AGGCGTGAAA TCGCCTCTGT GATCAACGGT
GTCATAGGTA CTGCTATTGC TGCATACGTC GGATTCTCAG CAGGTTCATC CATAACCCCC
TTTGAGAACT TTATCTCTTT ACTAGGAGAC TTTCTACCAG CAATGGGAGG AATTGTACTA
GCCGACTTCT ACATTGTGAA GAAATATGTT AATAAAATCC AAGATCCTCA TAAACGGTAT
GAATTCGTAC CAAATAATAA GTATTACAAT ATAAATATTG CAGGAATATT AGCTCTAGCA
TTAGGTTCAA TAATAGGTTA CTTCGTAAAT GCAGGTATAC CCGCCATAAA CTCCTTAGTT
ACTGGCTTCC TATCCTACAT AATAATATAT TACATTATCA AAGCAATGGG TAAGAGTCCA
GAAATATTGC CGTTTAACTA TGAAGGGGGG ATATTAAGAT GA
 
Protein sequence
MTGKEEISSK YDDYSLKEVP KDSRYGFFNV FLVFSSVYGA IAVIWAGGAL GYGLTFSQAI 
IAVLSGTVVL GILGSLTAAV GAYSGLSTYV MWRHPLGRWG GKIAGLLLIT ITTGIGWYAV
ETWLFGIVMS EIFPNNPFFS VGVAAIWGGI LMTIMTYVGY RMLSFLSYFT IPFHIWLIAI
GIAIVLALKG GFHTVMAAVP TSHMSLLDGI SATIGLYSAG TIISPDISRF AKSAKDAGYA
WFAHIIFLYP FLILGGVAIV LATGSYLITN AMLELGMGVG VLLIIVFGQF IINTDNLYSG
SLSLVNLIPM RREIASVING VIGTAIAAYV GFSAGSSITP FENFISLLGD FLPAMGGIVL
ADFYIVKKYV NKIQDPHKRY EFVPNNKYYN INIAGILALA LGSIIGYFVN AGIPAINSLV
TGFLSYIIIY YIIKAMGKSP EILPFNYEGG ILR