Gene Ssol_2065 details


Gene Information

Locus tag: Ssol_2065
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1853456
End bp: 1854598
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: von Willebrand factor type A
Protein accession: ACX92271
Protein GI: 261602668
COG category:
COG ID:
TIGRFAM ID:
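
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1853456, 1854598)
print(length)                     # 1143
print(protein_length_aa(length))  # 380
```

Both values agree with the Gene Length and Protein Length fields of this record.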


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCTTT CGTTAAGAGT AGATACAAGC CATAAGTACT CATTTAATGC AGATGTTAGG 
TATATCTTTA AGGTATTATT AGTTCCAGAG AAATTAGGCT CTGCAACTGG TTTCCACTAC
ATAGTAGCCC TTGATACAAG CGGGTCTATG ACGGGATATA AAATAGAGTT AGCTAAACAA
GGTGCCATAG AATTATTCAA AAGAATACCT AATGGCAATA AGGTTTCCTT CATCACTTTT
TCATCAAATG TGAACGTGAT TAAGGAATTC GTTGATCCTT TAGACTTAAC GAATGAGATA
TTGCAGATAA CAGCAGGAGG TCAAACGGCA CTATATACAG CGATCTTAAC TGCAAATAGT
TTAGCTAAGA AGTATCAAAT GCCAACCTAT CTATTACTGT TAACTGACGG AAATCCCACA
GATGAGACGA ATATTGGGAA TTATCTAAAG TTACCCTATT ATGAAAAAAT ACAGGTCTAT
TCATTTGGAA TTGGTGACGA CTATAATGAA CAACTACTTC AAAGTGTTAG TGATAAGACG
GGAGGGGTAA TGTATCATAT TTCAGATGCT AACGAAATAC CGCAAAAGCT TCCTCAAAAG
GCTGTAACGC AAATAGCTGC AAAGAATGTT ACGGTTGATA TAACTGCTGA GGGTAACGTA
AAACTTTTGA ATTATGTAAC AACACCAGTA AAAGTAAATG GGATAGAGAA CGTTATTAAA
ATTTTTGGAG AAACCATTTT ACCAGCCAAT TATGAGGGTA ACTTCTTAAC TGTGAAAGTC
AATTATGAAG ATCCGGTAAC TAATAAGCCA GAATCACTTT TGCAAGTTAT TCAAGTTAGG
AAAGCACAAG ATCAAAATAC ATTTGTATCT GGCATAAACA ATGACGTGAT AAATGAATAT
AGATACTATG AACTATTGGA TAAATACGCG AAACAAGTTC AAGCCGAACA ATTGGTTGAA
GCTACGAAAA CTCTTAACCA ACTAAATGAA ATAGCCCAAC AGACCAGAAG AATAGACTTC
ATGGAGACTA CTAGAAGGTT GTCTGAAGGT TTAGAGACCA CTAAAAGGAT AGGTACAGTT
GAACAGACTA AGAGGTTATC AAAAGAGGTT ACTAGTGAGG TTACTAGAAA GCTTAGGGAA
TGA
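
The gene sequence is displayed in blocks of ten bases, so it has to be cleaned before any calculation. The sketch below shows one way to strip that grouping and compute a GC percentage like the one reported in the GC content field. It is an illustration only: the helper names are hypothetical, and how the database itself computes or rounds the figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGACCCTTT CGTTAAGAGT AGATACAAGC CATAAGTACT CATTTAATGC AGATGTTAGG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```

Passing the full gene sequence to `clean_sequence` and then `gc_percent` would give the value for the whole gene.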
 
Protein sequence
MTLSLRVDTS HKYSFNADVR YIFKVLLVPE KLGSATGFHY IVALDTSGSM TGYKIELAKQ 
GAIELFKRIP NGNKVSFITF SSNVNVIKEF VDPLDLTNEI LQITAGGQTA LYTAILTANS
LAKKYQMPTY LLLLTDGNPT DETNIGNYLK LPYYEKIQVY SFGIGDDYNE QLLQSVSDKT
GGVMYHISDA NEIPQKLPQK AVTQIAAKNV TVDITAEGNV KLLNYVTTPV KVNGIENVIK
IFGETILPAN YEGNFLTVKV NYEDPVTNKP ESLLQVIQVR KAQDQNTFVS GINNDVINEY
RYYELLDKYA KQVQAEQLVE ATKTLNQLNE IAQQTRRIDF METTRRLSEG LETTKRIGTV
EQTKRLSKEV TSEVTRKLRE
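
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the usual TCAG ordering and translates the first ten codons of the gene. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons it allows, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGACCCTTT CGTTAAGAGT AGATACAAGC"))  # MTLSLRVDTS
```

The output matches the first ten residues of the protein sequence above.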