Gene Ssol_2375 details

Gene Information

Locus tag: Ssol_2375
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 2176707
End bp: 2177999
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: ACX92539
Protein GI: 261602936
COG category:
COG ID:
TIGRFAM ID:
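
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both assumptions are consistent with the values listed, but the record does not state them. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2176707
end_bp = 2177999
gene_length_bp = 1293
protein_length_aa = 430


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2177999 - 2176707 + 1 gives 1293 bp, and 1293 / 3 - 1 gives 430 aa, matching the listed lengths.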


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.529747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTATACG GCCTAAATAA ACAACAATGG CTAGCAGTGT TCTCTACATG GTTAGGATGG 
TTAATGGATG GTTATACTTC TATAGCTTAT GCTCTAGTTG CAGTTACTAT TTCGAAAATA
TTTTTCCCTT CAACCATAGG AATTCTAGGT TTAATAGCCA CTTTTGGAGG ATTCGCAGTT
GGTGCATTAG CTAGGCCCGT AGGATCTTTA GTGTTTGGAA ATTTCATAGG AGATAAGATA
GGTAGGAAAA ATATGTTAGT TCTAACGATT TTAGGTTTTT CCTTAATAGC CTCTTCTAAA
GCCCTATTAC CTTCATACGA AACCGCGGGA ATTTTAGCTC CACTATTTCT TTACATCATA
TTATTTGCTG AGGGCATGTT TGCAGGTGCA GAATATGGAG GAGGAACCAC ATTGGCGTTA
GAGTCTGTAC CTGTAGGCAA GAGAGGATTT ATTGGCTCTT TTGTGCAAAG TGGTTTTGGT
ACAGGTTATT TCGTAATATC GTTAGTATAC TCAGCTCTGT ATAGTATGTT TGGGAATGAA
GGATTCCAAA CTTTAGGATG GAGAGTCCTT TTTGCAACTT GCATATTGCC TGGATTAATT
ACGTTAATAA TTAGAAAAAT GACAGACGAA AGTCCAATCT TTAAGGATAT GAAAAGTGGG
AATGAAGTGG TCAAGATACC TATAAAGGAG TTGTTCAAAA TGTCTTATTC CTCAGTATTA
ATAGGGTTAA TGATAACAAG TGGATTGTTA TACATAAACA CTGCTACCTT TTCTTTCTAT
CCTACAGTGT TGACTATTCA AGGAATACCA GGGACAATTG TGGGATTAAG CGTTGCTATA
ATAAATTTGG TTTCCCTTTT TGGAGTTTGG TTTGGCGGAT TTCTAGCTGA TGTCATTAAA
AGGGGAAGAA AAGTTCCAAT GCTAATTTAT TCAATAATAT TCATTTTCAC CGTATACCCA
GTTTTGTATC TGGGATTGCT AAAGAACGTG TATTTATCCA CTATCGTATT TAGCTTACAA
GCATTTTTAG AAGCTATGAT ATTCTCCACT TTACCTGCAT TTCTTGCAGA ACAGTTTAGT
AAAAAATATA GAACTACGGG AGTAGGATTT ACATATAATG GGGGAGCAAT AGGAGGTGGT
TTTGCTATAT CTGCTACTTT AGCGTTATCA ACGTACTTAG GCTTACTTTA CTCATGGTCG
ATAAACATTA TTATAGCTGG GATAATAATG ATAATGGGTA TTGTCTTAGC AAAAGAAACT
TATACTGGAA AAGAAGATCC AATTTTGAGG TGA
 
Protein sequence
MVYGLNKQQW LAVFSTWLGW LMDGYTSIAY ALVAVTISKI FFPSTIGILG LIATFGGFAV 
GALARPVGSL VFGNFIGDKI GRKNMLVLTI LGFSLIASSK ALLPSYETAG ILAPLFLYII
LFAEGMFAGA EYGGGTTLAL ESVPVGKRGF IGSFVQSGFG TGYFVISLVY SALYSMFGNE
GFQTLGWRVL FATCILPGLI TLIIRKMTDE SPIFKDMKSG NEVVKIPIKE LFKMSYSSVL
IGLMITSGLL YINTATFSFY PTVLTIQGIP GTIVGLSVAI INLVSLFGVW FGGFLADVIK
RGRKVPMLIY SIIFIFTVYP VLYLGLLKNV YLSTIVFSLQ AFLEAMIFST LPAFLAEQFS
KKYRTTGVGF TYNGGAIGGG FAISATLALS TYLGLLYSWS INIIIAGIIM IMGIVLAKET
YTGKEDPILR
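
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch, not the database's own method: it assumes translation table 11 shares the standard amino-acid assignments and accepts TTG, CTG, ATT, ATC, ATA, ATG and GTG as start codons (translated as M in the first position). The function names `clean`, `translate_cds` and `gc_fraction` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (assumed shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; an initial start codon becomes M, a stop codon ends it."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases and last 33 bases of the gene sequence in this record.
    print(translate_cds("GTGGTATACG GCCTAAATAA ACAACAATGG"))
    print(translate_cds("TATACTGGAA AAGAAGATCC AATTTTGAGG TGA"))
```

The first call reproduces the opening residues MVYGLNKQQW (the GTG start codon is read as M), and the second reproduces the closing residues YTGKEDPILR before the TGA stop codon. Passing the full gene sequence to `gc_fraction` gives a value to compare against the listed GC content.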