Gene Ssol_1945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1945 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1730198 
End bp1731646 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content41% 
IMG OID 
Productprotein of unknown function UPF0027 
Protein accessionACX92156 
Protein GI261602553 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0738266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATTA ACATTACTAG AGTTAGCACA TACGAGTGGC GTATTGATAA AGGCGCTCAA 
GAGTGTATGA AAGTTCCCGT TACAATATTT GCAGATGACG TTCTAATTGA AAAAATGAAA
CAAGATATGA CATTAAGACA AGCAACTAAC GTAGCTTGTT TACCCGGTGT CCAAGAGTCA
ATTTACGTTT TACCAGATGG TCATCAAGGT TACGGTTTTC CTATAGGCGG TATAGCAGCT
ACTGCGATCG AGGAAGGAGG AGTGGTAAGT CCTGGAGGTA TAGGATATGA CATAAATTGT
GGTGTCAGAT TACTCAGAAC TAATTTAGAC TATAAAGATG TAAAGCCAAA ATTAGCTCAG
TTGGTTGAGG AGCTGCATAG AAACGTGCCG AGCGGTGTAG GAAGTGAGGG TAAAGTAAAA
TTGACATATC AGCAATTAGA TCAAGTATTA GCGGAAGGAG TTGCGTGGGC GGTTGATAAG
GGCTTTGGAT GGAAAGAAGA CATGAATCAC ATGGAACAAC GTGGTAGCTG GGAGCTAGCC
GATCCTTCAA AAGTAAGTCC GATAGCAAAA CAAAGAGGTG CCTCTCAGTT AGGAACTTTA
GGAGCTGGTA ATCATTTCTT GGAAATTCAA GTTGTTGATA AGATATTTGA TCCCCAAATT
GCTAAAGCAA TAGGGGTAGA TCACGAAGGT CAAGTAATGG TTATGGTTCA TACGGGTTCA
AGAGGTTTAG GTCATCAAGT AGCTAGTGAT TATTTACAGA TAATGGAAAG AGCAATGAAG
AAGTATAACA TTCAATTGCC AGATAGAGAA CTAGCTGCAG TTCCCTTTGA GAGTAGAGAG
GGTCAGGATT ACTTTCATGC AATGGCATCT GGAGCTAATT TTGCGTGGAC GAATAGACAA
TTGATTACGC ATTGGACGAG AGAGAGTTTC GGTAGAGTAT TTGGTGTCGA CCCAGAAAAA
TTAGATCTTA GCATAGTTTA TGATGTAGCT CATAATATAG CTAAAATTGA GGAGTATGTA
ATTGGTGGAG AAAGGAAGAA GGTATTAGTA CATAGGAAAG GTGCTACTAG GGCTTTCCCG
CCTGGTAGTC CGGAGATTCC CGCTGATCAT AGAAATATTG GCCAGATTGT TTTAATCCCA
GGTAGTATGG GCACTGCTAG TTATGTTATG GCTGGAATAC CAGAAGGTAG AAGGACATGG
TTTACTGCGC CTCATGGTGC TGGTAGGTGG ATGTCTAGGG AAGCTGCAGT GAGAAATTAC
CCTGCTAACG TAGTAGTTGA AACTTTAGCT GAAAAGGGTA TAGTAGTAAG GGCTGCTACT
AGAAGGGTAG TAGCTGAAGA AGCACCGGGA GCCTACAAAG ATGTTGATAG GGTAGCTAAA
GTTGCTCATG AAGTTAAAAT TGCTAAATTA GTTATGCGAT TAAGACCCAT AGGGGTTACC
AAAGGATGA
 
Protein sequence
MQINITRVST YEWRIDKGAQ ECMKVPVTIF ADDVLIEKMK QDMTLRQATN VACLPGVQES 
IYVLPDGHQG YGFPIGGIAA TAIEEGGVVS PGGIGYDINC GVRLLRTNLD YKDVKPKLAQ
LVEELHRNVP SGVGSEGKVK LTYQQLDQVL AEGVAWAVDK GFGWKEDMNH MEQRGSWELA
DPSKVSPIAK QRGASQLGTL GAGNHFLEIQ VVDKIFDPQI AKAIGVDHEG QVMVMVHTGS
RGLGHQVASD YLQIMERAMK KYNIQLPDRE LAAVPFESRE GQDYFHAMAS GANFAWTNRQ
LITHWTRESF GRVFGVDPEK LDLSIVYDVA HNIAKIEEYV IGGERKKVLV HRKGATRAFP
PGSPEIPADH RNIGQIVLIP GSMGTASYVM AGIPEGRRTW FTAPHGAGRW MSREAAVRNY
PANVVVETLA EKGIVVRAAT RRVVAEEAPG AYKDVDRVAK VAHEVKIAKL VMRLRPIGVT
KG