Gene Ssol_2141 details

Gene Information

Locus tag: Ssol_2141
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1926413
End bp: 1928212
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: conserved hypothetical protein
Protein accession: ACX92344
Protein GI: 261602741
COG category:
COG ID:
TIGRFAM ID:
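
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1926413
END_BP = 1928212


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1800 599"
```

Both results agree with the Gene Length and Protein Length fields of the record.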


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCAA AGCATTTAAT TTCCTTAATA GTAATATTAA CTCCATTAGT TACTTTACTC 
ACTAGCGCCG TCTATACGTC TGGTGGTATA ACTTTTTACA GTCCAGCCTA TAACGGTGAG
TCCTATTACA CTGGGCAATC AATAACCATT GACGCATTAC TACCGCAACA ATTTGCAACA
GATGCAGCAA CCATAAACTT CTTTTTCCCC AATTCATCCT TAGCTGTGAC AATACCCGTT
CAAATTAATG GAAGTGGTGG AATATACGTA CCTAATGCCT ATGCTTTCCC CAATGTTCCC
GGGACATGGC AAATTACAAT AGAAGTTGCG GGCGGTGTGG CAGTAGGTAC CATTAACGTT
AACGTTATTC AAAGAACTCC ATTAGTTACA GTACATCTGG GTTACGGTGT TGTCGGTCAA
GCACTACCAC AAACGCCAAC CATAACCTTA ACTTTCCCTA ATGGTACAAC AATTACAGTT
CCTCTTCAAG GTACAGTTAA CGTTCCTTCC GGTACTTCAT ATCAAGTTGA GCAAGCAATA
ACTGAAAATA ATATCAGATG GGCTACCAAT TACACTAGTG GTACTATAAC CCCAGCGACT
ACATCCATAA CGCCTACATA TTATCAACAA TATCTAGTTA CCTTTAATTA CACAGTCCAA
GGAGGTACTG GCTACTCTCC ACCTACAGTT TACTATCGAA GTCTTGGAAT GAACGAAACA
GCAAAAGCAC CAGCATCAGT TTGGGTAGAT GCCAATTCAG CTTATATTTA CTCGCCAGAA
CTTCAATCTA ACGTCCAAGG AGAGAGATGG ATAGCGGTAA ACTTCACTGG GATCATTAAA
GCTCCTGGCG AAATCAATGA ATATTATATT AACCAATATC TAGTTACCGT ACAATCCCAA
ATCCCAGTTT ACGCAATAGT AAACGGAGCT AACGAGACCT TAAACTCTAC AAACTGGTTC
ACACAAGGCA CTACAATCAA ACTAGAAAAT ATAACGAAAT ACGTAAGCAG TGTTGAGAGA
TATGTAATAG CTAATTTCTC ACCCTCAGAG GTTATAACAG TAAATCAGCC TACTACGATA
AAAGTAAATA CTGTAACCCA ATATTTCATT AACGTTAACT CTCCAGTTCA ATTAAAAGCC
TTAATAAACG GCGCAAATGA AAGCCTTACA GCAGGTTGGT ATAATCAAGG AACATCAATC
AAAATAGAGA ACCTTACATA CTACGTGGGA AATGGAGAGA GATTAATCTT AGGTAAAGTT
CTTCCATCCT TAGAGATAAT TGTAAATGGC TCCTATACCA TAAGCACTAC AACCATAACT
CAATACTTCG TCAACGTCTC TTCTCCCATA CCAGTCCAAG TACTAATTAA CGGTTCTAAG
ACTATACTTA ACTCCTCCTG GATAAACGCT GGAACATCGA TACTAGTGTT AAACTACACT
TACAACATTA GTCCACAAGA GAGGGTTATA ATAGTTGGTA TATCACCCTC ACAGTCATTT
ACAGTGAACT CACCCGAAAC CCTAAAGCTA CTTACAGTCA CACAATATCT AGTCACAATT
AATGGTGTGT CTAAATTCTA TAACTCGGGA TCAAAGATAG TCCTTAATGC GAGTGTGCCA
TTCTACGAAA CTGCCACGTT TAAGGGAACG TATAATGTCT CTCCGGGAGC TACAATTACA
GTGAACCAAC CAATAACTGA AACATTAGTA GAATCTCCAA ATTACTTAAT TTTAGGAGCA
ATAGCAGCTG TTATAATAAT AGTAGTAGCT GTGGTGGTAA TAATCCTCTT AAGGCGTTAA
 
Protein sequence
MKAKHLISLI VILTPLVTLL TSAVYTSGGI TFYSPAYNGE SYYTGQSITI DALLPQQFAT 
DAATINFFFP NSSLAVTIPV QINGSGGIYV PNAYAFPNVP GTWQITIEVA GGVAVGTINV
NVIQRTPLVT VHLGYGVVGQ ALPQTPTITL TFPNGTTITV PLQGTVNVPS GTSYQVEQAI
TENNIRWATN YTSGTITPAT TSITPTYYQQ YLVTFNYTVQ GGTGYSPPTV YYRSLGMNET
AKAPASVWVD ANSAYIYSPE LQSNVQGERW IAVNFTGIIK APGEINEYYI NQYLVTVQSQ
IPVYAIVNGA NETLNSTNWF TQGTTIKLEN ITKYVSSVER YVIANFSPSE VITVNQPTTI
KVNTVTQYFI NVNSPVQLKA LINGANESLT AGWYNQGTSI KIENLTYYVG NGERLILGKV
LPSLEIIVNG SYTISTTTIT QYFVNVSSPI PVQVLINGSK TILNSSWINA GTSILVLNYT
YNISPQERVI IVGISPSQSF TVNSPETLKL LTVTQYLVTI NGVSKFYNSG SKIVLNASVP
FYETATFKGT YNVSPGATIT VNQPITETLV ESPNYLILGA IAAVIIIVVA VVVIILLRR
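
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup, plus a GC fraction calculation of the kind that underlies the GC content field. The helper names are hypothetical, and the sample is only the first line of the gene sequence, so its GC fraction will not equal the 39% reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as a short sample.
sample = "ATGAAGGCAA AGCATTTAAT TTCCTTAATA GTAATATTAA CTCCATTAGT TACTTTACTC"
seq = clean_sequence(sample)
print(len(seq))  # prints 60
print(f"{gc_fraction(seq):.2f}")
```

To reproduce the record-level figure, pass all lines of the gene sequence to `clean_sequence` and confirm that the cleaned length is 1800 before computing the fraction.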