Gene Ssol_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1984 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1771054 
End bp1772460 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content37% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX92195 
Protein GI261602592 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAG AAAAAAAGAA TGTAAGTGAA TTGAGAAAAG GTGTCCTTGG TACTTGGCTT 
GTTGCAAGTT ATGGAATTGC AGCTAATGCC CCAATAGCTG TTGCCACACT CTATTTTGTG
GGCCTTGCTG GATTGGTAGG AGGAGCTATG CCACTCACTG TGATACTTTC GTACTTGATC
TATGCTACTA CACTTATTGT TATTTATGAG TGGAGCAAGG AGATTGCAGC TTCATATGGC
TATGTTGCTA TGATAAAGAA GGGATTAGGC AGTAGTTTGG CTTCCTTTAC TGTAGGATAT
GGTTATATTT ATCAATATCT TGTTGCTGGA ACAGCTGGAT TTGGAATATT AGGAATTGCG
TCTTTCATCT ACTTGATCTC TCCCAGTATT GCTTCTTCAA TGCCTTGGTT ATGGGCAGCA
ATAGTGATTA TAGTTACAAT TGAGATTACA ACAATAATGT GGCTTGGAGT GAAGCCTGGA
GGTCTGTTAA ATCTCGTAAT AGGATTGATT TCAATAGGTT TTCTAATTAT AACTTCGATC
GTTTTAATTG CTGGAGCAAA GAATAGTATT TTACCGTTTA CGGCTGTTCC GGTTAACAAC
AATTGGGCGC TGGTACTTAC GGCAATGATT TTTGGTGTTA CTACTTTTGG TGGTGCCACA
ACTCCAATAG GAGTAGCGGA AGAAGCTAAG GTTCCAAAAA GTACTTTGCC AAAGGCACTT
CTCTTAACGT TTGGAATACT TGGAGTTGGA TTGATATTGA ATTCTTATGC GCAGACGATA
GTTTATGGAA TAAATAATAT GTTTAATTAT GCTAATCTTC CAGACCCAAT GATAGTGATT
TATAGTAAGT ATTTCAATCC CGCTATTGTA TATATGTTAA TAATACTTGT AGCGTTTATG
TTTAACTCTT CTGCATTAGC GTTTGCTACT AGTGGGAGTA GAATGATATT CGGTATGGCT
AGGGATGGTG TATTATATCC TAAAGTCTTT TCAAAAGTTA ATAAATACGG TGTGCCGGGT
AATGCAATAA TACTTACTGG TATTGTTACA GGTGCTCTTA GCCTTATAAG TGGTTACATT
CTAGGTCCGT TAGAGGCTAG TATATTTTTA ATAACATTTG GCTCATTCTA CGTCGCCTTA
GGTCATTTAT TTGCTGCCTT AGGGTTAATT GTACGTAAGG TTAAAATGCG TACGGCTAAC
ATAGCGAAAC ACGTAGTGAT ACCGATAATT TCAATACTAT TATATATTGC TGTAATATAT
TTTGGTACTT ACCCTGCGCC AGCTTTCCCA TTAAATATAG CAGTTTATGC AGCTTGGGCT
ATTCTTTTGA TTCACATAAT TACATATTAT GTGATAAAGA GCAGATTTCC AGACAGAATT
AAGAAGTTCG GTGATTATAG TCTTTGA
 
Protein sequence
MDEEKKNVSE LRKGVLGTWL VASYGIAANA PIAVATLYFV GLAGLVGGAM PLTVILSYLI 
YATTLIVIYE WSKEIAASYG YVAMIKKGLG SSLASFTVGY GYIYQYLVAG TAGFGILGIA
SFIYLISPSI ASSMPWLWAA IVIIVTIEIT TIMWLGVKPG GLLNLVIGLI SIGFLIITSI
VLIAGAKNSI LPFTAVPVNN NWALVLTAMI FGVTTFGGAT TPIGVAEEAK VPKSTLPKAL
LLTFGILGVG LILNSYAQTI VYGINNMFNY ANLPDPMIVI YSKYFNPAIV YMLIILVAFM
FNSSALAFAT SGSRMIFGMA RDGVLYPKVF SKVNKYGVPG NAIILTGIVT GALSLISGYI
LGPLEASIFL ITFGSFYVAL GHLFAALGLI VRKVKMRTAN IAKHVVIPII SILLYIAVIY
FGTYPAPAFP LNIAVYAAWA ILLIHIITYY VIKSRFPDRI KKFGDYSL