Gene PICST_33678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33678 
Symbol 
ID4841052 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp192966 
End bp195149 
Gene Length2184 bp 
Protein Length673 aa 
Translation table12 
GC content39% 
IMG OID640392367 
Productpredicted protein 
Protein accessionXP_001386633 
Protein GI150866890 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGAG TGGTGAACCC CGCAATTTCG AAATCGACAG AAACTCGAAA ACTGAACTTC 
ATCGAAACTG GGGATGTCAC CAAACTCTCC AAATCCTGCG ATTTCTGCAA ATCCAAGAAG
GTCAAATGTG ACCAGGTCAA GCCCATCTGC TCTTACTGTG TGAGACATAG CCAGGAATGT
GTCTATAGTA GAGTCAGAAA GCCAGGCTTG AGGCCAGGAT ACGGCCAGCA GGTATTTGAC
AGAATCAACG GTCTTGAATC GTTTGTAGAA AACTTTCATA CCTCCAGTAG TACAGAAATC
GAAAGTCTCA AGAACAGAAT CGAGGAGTTC ACCTCTAGAT TCGAAAGCAT AGAAGACAAA
CTTATCAACA TTGGCAATGC AAACATCACT AAAATATCTG CTACTGGTGA AGCTACAAAT
AATGTAAATG CAAATGACAA TGTTATTGTT AATGGCACTG TCAATAATGG CTTTGCAAAT
GTAAACCCCA GCAGTAGTAT TTCCAATAAA GAAACAATAG ATATGAACAG CATTGACAAT
ATCATTTCGT CTACTAGTAC AGCTTCAGGG AACCAAGAGG GTAAGTACTT GTATGGTGAA
TTGCCTACTA TACACGAGGC AACAATCTTG TTAGACATTT TTCAAGAAAA GATTCATCCA
ATATTTCCAG TCGTTGAGCA TTCAAAATTC AACACTTTGC TTGAAGAGTA TGAAACAGCC
CCAAGATCAA TACTTTTAGG AGCCATATTA TGTTCATTGC GATTTGCTGA TACATCACTA
ATTACCCATA GACAAAAGAA GGTGTATCAC GAGTCCATTT TCTCAAGGTT GCTAAATTCT
TGCTTTGTGG TAGGCACTGT AGAGGAGTTG CAAGCCATGA GTTTACTAGC TTTTGACCTT
TATAGCTACT CCAACAACCC CAAGACATGG AGTGTGATAT CGTTAATAGC CTCTGGAGTT
GTCCATCTTA ATTTATCTAG AGGCAGGCTT CAAACTTCTA TTCTTGAGCT ATACACGAGT
CGTTCTGGTG TATCCAAGAA TGTAACTTCC AGAACAGTAG CAAGCCAAAA AGTGGTGGAA
GAGAGAAAGT TGTTATTTTG GGAAATCTTT CAGCTTGATA TTCTTTCCAG TGCTTCAAGC
TCGTTTCCCT TGAAGATACC CTCTTCTGAA ATCGACTGTT CGCTACCATT AAAAAGGGAG
TTGTTTGAAA GTGCACAAAC GAATGAGGAC TATGAAAGAC TCAAGAGCTT GCCCACAAGA
ACACTCAATA AATATGTCAG TAATGTCAAC TATGACCACT ATGACTCAAA TTGTTTTCTC
ATTGAAATCT TGAATATTCT TGGAAAGATT CACATGTTCA TGAGGAAACC GCTTGATCTT
ACGAATATAA AGGAAATGTT GAATTGGCAG ATTAAGTTTT CTGAGTTGGA TAACGAAATC
CAAGTATGGA AAGCAACTTT GCCACGTATG TTTAACGATT TATTGGATAA CGAAAAGCTA
CCGTATGATA AAATAAATTC GTACAAGGAT ATACTCTTCC ATTCGTTATA CTATACTACG
ATTGTCAGGC TAAATTCTAG TGTGGGTTAT CCTTATCTTC AGCTTTCTAG TGGGCCTCTA
TCGTTCAAGG ATGCTCGTTC CAGGTGTTTG GATGCTGCCC AGCATGTAGT CAATTTTGCC
AAGAAGCTTT CGCAAATCTT TGAGGATGAT GCAGCTTTCC ACCAACGGAT TGGTCCTTAT
TATGCCTTCT CTCTTTGGGT TTCAGCACGT TTGCTTTTGG TGAATGCTAT CAATAGTGAC
TTGGAAATCC CTGCAGATGT CCAATACTTA ATATCTCTAT TGACACGTAT GGGGGACTCT
TGGGAGAGTG CTTCGAAGTA TGCTAACATA TTGAACTTCT TGATAAGTGA GTTGGAAACA
GAATCTCAAG AGAACTTGAA TATCATAAAC CATTTCTCTC ATGGCGGTAG TGAAGAGTCA
ATGTACCGCT CAGAAGATGC AAGCATCATC TCGGATATGA GACTCAATGC TTACAACTTG
GATGTGATTC TCTCTGAAAA GGTTGAAAAG TTTACTAATA GAAAGGGGGG CAAGGTGTCC
CCTAATAACC AGGCTGATAT CTCTAATTTT TTTGAATGGT TCAAGTTACC GTTTACTGAG
ATCAATACTC CAACATTGCA GTAG
 
Protein sequence
MSRVVNPAIS KSTETRKSNF IETGDVTKLS KSCDFCKSKK VKCDQVKPIC SYCVRHSQEC 
VYSRVRKPGL RPGYGQQVFD RINGLESFVE NFHTSSSTEI ESLKNRIEEF TSRFESIEDK
LINIGNANIT KISATGNQEG KYLYGELPTI HEATILLDIF QEKIHPIFPV VEHSKFNTLL
EEYETAPRSI LLGAILCSLR FADTSLITHR QKKVYHESIF SRLLNSCFVV GTVEELQAMS
LLAFDLYSYS NNPKTWSVIS LIASGVVHLN LSRGRLQTSI LELYTSRSGV SKNVTSRTVA
SQKVVEERKL LFWEIFQLDI LSSASSSFPL KIPSSEIDCS LPLKRELFES AQTNEDYERL
KSLPTRTLNK YVSNVNYDHY DSNCFLIEIL NILGKIHMFM RKPLDLTNIK EMLNWQIKFS
ELDNEIQVWK ATLPRMFNDL LDNEKLPYDK INSYKDILFH SLYYTTIVRL NSSVGYPYLQ
LSSGPLSFKD ARSRCLDAAQ HVVNFAKKLS QIFEDDAAFH QRIGPYYAFS LWVSARLLLV
NAINSDLEIP ADVQYLISLL TRMGDSWESA SKYANILNFL ISELETESQE NLNIINHFSH
GGSEESMYRS EDASIISDMR LNAYNLDVIL SEKVEKFTNR KGGKVSPNNQ ADISNFFEWF
KLPFTEINTP TLQ