Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67516 |
Symbol | SAR1 |
ID | 4838846 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 543584 |
End bp | 544707 |
Gene Length | 1124 bp |
Protein Length | 190 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390161 |
Product | GTP-binding protein |
Protein accession | XP_001384064 |
Protein GI | 126135080 |
COG category | [R] General function prediction only |
COG ID | [COG1100] GTPase SAR1 and related small G proteins |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCATGTGGTT ATTCGATTGG TGTAAGTATC GTTTGGGAGA GCATGGGCCA GTTTTTCTCA GAGATTTCTG TTGAAGGCGA GCTCACTGAG ACCGTTCAAT GTGAACAAGG ATCGGATAGG ATGAGACCAT GGAAAACATG ATATTCTTCA GTAAAGTAAA AGATACGGGG ATTTGAAACA TTTCAGAACT AGGAGATGCC TGGAAATGGT ATCCGATAGT AGAAGTAGAT ATACTAGAGG GATATCGGTA GAATCGAATA GTCATTGCTG GCGTTTCTCG AACTTTTCAC TCATAAAGTC AACTATCTGA AGATCAACAA CAATTTTCAC AAGATGAATG TTCGTAGTCG TCTAACCACC GTAACATCAC CATATATCAT GTTTGTCATC TCCATCGATT GTTTACTGTC TTAACTATAC AACATACTAA CATTTGCAGT CCAAGACGTG TTATCGTCGT TAGGTTTGTG GAACAAACAC GCAAAGTTGC TCTTCTTGGG TTTGGACAAC GCCGGAAAGA CCACCCTTTT GCACATGTTG AAGAACGACA GATTGGCCAC CTTACAGCCA ACCTTGCACC CTACCTCTGA AGAGTTGGCT ATTGGCTCTG TACGTTTCAC TACTTTCGAT TTGGGTGGAC ATCAACAGGC TCGTCGTTTG TGGAAGGACT ACTTCCCAGA AGTCAATGGT ATTGTCTTCT TGGTTGATGC TGCTGATCCA GAAAGATTCG CTGAGTCCAA GGCTGAATTG GAGTCTTTGT TCAAGATCGA AGAGTTGAGT CATGTGCCAT TCTTGATCTT GGGTAACAAG ATCGACGTTC CTACAGCCGT CGGAGAAATG GAGTTGAAGT CCGCTTTGGG CTTGTACAAT ACCACCGGTA AGGACACCGG CAAGTTACCA GAAGGCTCCA GACCCATTGA AGTGTACATG GTCTCCGTAG TGATGAGATC CGGCTATGGA GAGGGTTTCA AGTGGTTGTC GCAATACATC TAGACTGTGC GCTGCCCTTA ATCGTGTGAG CTTTAGTAAC ATTACCGTCG TATCCGTCGC ACCGTAGATT ACTACTTCAA TCTTATTGTT AGTACATATT ATTGTTTAAT ACTATATATA CATCATCTAG ATAG
|
Protein sequence | MWLFDWFQDV LSSLGLWNKH AKLLFLGLDN AGKTTLLHML KNDRLATLQP TLHPTSEELA IGSVRFTTFD LGGHQQARRL WKDYFPEVNG IVFLVDAADP ERFAESKAEL ESLFKIEELS HVPFLILGNK IDVPTAVGEM ELKSALGLYN TTGKDTGKLP EGSRPIEVYM VSVVMRSGYG EGFKWLSQYI
|
| |