Gene PICST_37102 details


Gene Information

Locus tag: PICST_37102
Symbol:
ID: 4841028
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 25482
End bp: 28076
Gene Length: 2595 bp
Protein Length: 536 aa
Translation table: 12
GC content: 43%
IMG OID: 640392343
Product: predicted protein
Protein accession: XP_001386406
Protein GI: 150866721
COG category:
COG ID:
TIGRFAM ID:
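
As a quick consistency check on the coordinates and lengths listed above, the following Python sketch recomputes the gene span from the start and end positions and estimates how many base pairs are non-coding. It assumes the coordinates are 1-based and inclusive, and that the gene span contains exactly one stop codon and no untranslated regions. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 25482
end_bp = 28076
gene_length_bp = 2595
protein_length_aa = 536

# Assumption: 1-based inclusive coordinates, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene span includes one stop codon after the last residue.
coding_bp = (protein_length_aa + 1) * 3

# The remainder is non-coding sequence, which is plausible for a spliced gene.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 2595 1611 984
```

Under these assumptions the span agrees with the listed gene length, and roughly 984 bp of the gene would be non-coding, which fits the record's statement that the gene is spliced.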


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACCC TTAGTTCCCA TATTGAGAGC AACGTCAGAG AAGTGTGTGT CTGCTCTGGA 
GTTGCAAATT AGTTTCTAGT CGAATCAAAA CGACTTCACC CAAAATGCTA CTTTTTGCAA
TCAAACTGCC AAAAACTGGT AAACTCAACT GCCAAATAAA CCAACACAAA AGCACTAAAT
CTAACTGGCA ATTGATATCA ATGGATCCCT TGGGGATATA AGATCACCCC ACGGACGATA
CTGCCGCTAT CGCTCCACGC TGAGCCCAAA CCAATGCACA TTGCAACAGA TTATGCAAGT
GGGGAATCAT ACATGTGACT ATCGCGTGAC CGTTGCAATG TCTCCATCTT TTTGCCCTTA
TCAGCACATC GTTTTTTTGT CGCGTCCGCA TGCAGGGATC CAACCGTTAG TATCATTCCC
CCTTCTTTCT AACTCTATCC ACAGTTGCTC CCCTCAGTTG TTCTTCTGAC ACTTCAGTTA
CAAAGTGCCA TTTTGTTTAC CAATCCTTTC TGATAGCTCA AGATATACTC AGCAGGGGGT
ACATCAATTC TGTTACAAAT TCCCGCGACA TACTTTCCAA GGCTTCTTGT AATCGTATAA
CACCAGTCGT CCTTAGATCT CGTTCGTGTT TCATTTCCGC CAAACACTGG TTTGCTGCCA
AATTCGGGCA CTTAACCAAT CTGTCCCACA TCCCAACCCT GACCGAGGTA TATATACGTA
ATACTCGTAC TTGCAGTTTC TTTGTTGTGA GTTGTCCGAT CCCTCGATCC CGCCACTGAT
ACATTACTTA CCAGTTCTAC TAGTTCATAT TCCACAAACT TCGCATTACC AATGGACTTA
GCTAAGGTCC GCAATCTGGA CACAGGCTCG TTGAAAAGAG TCCGGTCACT TCCAGTCAGC
GCCGAGGATG AAGGCAACCT CGCGTATCAA CATCAACAGC ACCCTAGCTT CTCCAACGAG
GCTTCGTTCC GCAAGTTACT CTCTTCCCAT CTGTCCATCT ACAACGTGTT GAACAATCCC
GACTTAGACG TCTCCCCTAG CTTCAGCAGG ACACAAAGCC CGTTGTTATT AACGCCTGAG
TTGCCTGACC CGTCGTTCTC CATCACGCTG TGGCAGTCTA TAAGCAATGA CTCTGTGTTT
TTGGACAATC TCAATCTCAA GAATCTTTCG CCATCGCCAC ACAATGTACC TCAATTCCAG
AAGCGAAACG AAAACGATGA ATCCGGCAAA AGGAGACCCA CTTCTTCCAT CTTGTCGGTT
ACAACTGAAG AGGACGACAC AACGAAAGAC TCAACAGTGT TGATCTCGAC ACCAAAGCCG
TCTAACGTTA TGGTCTTCGA GACAGACGAG GAAGACAACG ACGATGACCG TAACCATGTT
AATCCTCCGA ATTCATTTAT CATGCCGAAG ATGAGCATTT CGGAAAGACC TAATATTCAT
GACTATAGTG GCAGAAGCTC GCGATTTCAG GTCACGCTTC TTAGCTCCTA TGGTTCGTAC
AAGGTGGATA CCAACTACTT GGTCAAGAGC ATAGAGCGTG AATTGAGTTG TGATTGCATC
GGCATTCGCC ATGTAAATTT AGATATCCAC AATGATTACA GATCCTCTAG TTTCCTGCGA
TTTGACAAAT CGTTGGTCAA GAATTCCGAT TTGATATTTG TGGTTAACGA TGGCTCTTCC
GTGTTTCTCG AATATTTGAC TAGCGTCTTT GGTGGAGACG TGCAATTAGA CGAGGACTCT
ATGGAAGCTT TGCCGAAACT CACAATCATC AACATGATGA CAGTCAACTA CTTTGTCAAC
TTGTTTGAGT TGATAAATTA CTTAAAACCC TACCAAATCT GGAAGACTTC ATCTTTAAAA
CAGGAGAAAT TGGTCAATAA AGTTAAAGAC TTCATCGAAA TCGAGTTGAA CCAACTGGAC
CATTTTGAAT CCAATAAGGT TGCTACAAAG GATACAAATT TGTCATTGGT TGTCTCGAAT
GTCGGCAGAT CACAAACTAT GTATTCCAAT TTAATTCTGC ACAAGAGAGC CGACTACAAG
GGCATAGAGA AGAAATTCAA AACTGATCTT CAGGGTTCAT CTAGTTTTAG CGATCCGTTG
CTGATTTCTT CCAACTTTGC CCATATTAAC ATCTTGTATT CAATCTTGAT GAAATTATTT
TCGACTTCAC AATTGAGTCA AAACATCGTT GCTGTTGATA AGTCAACTCC CAAATCCTCG
CGATTTTGGT TGATTTGTAG TTTCACAGTA GGTATTGGCT TTGGTATTGG TATCGCAAGC
GGTGCCACTT CCGTTGTTGG GTTATACATA TATGAGAAGT TTCTTCAGTT TAGCCCTGGC
CAAACACAGC AATGTATTCC AGTTTCATCT CCTGCAACTA AACCAATTGT GGATACCGTC
GTTGATTTAT CGAAGGAGTT TCAAGGTTCT ATGTTCCAGT TTTACAATGA AGTTTCGACT
GACTTAATTG GAGAGCTCAG ATCATTTTCA ACGTTATATG TTAGTTATTT AAGGTCCGCT
GGTGATATTG TTATTGATTG TATTAGAGGA GGATTAGAGA AAGTTGTTGG TCTTGTTGTG
TACACTAATT GCTGA
 
Protein sequence
MDTLSSHIES NVREASFRKL LSSHSSIYNV LNNPDLDVSP SFSRTQSPLL LTPELPDPSF 
SITSWQSISN DSKRNENDES GKRRPTSSIL SVTTEEDDTT KDSTVLISTP KPSNVMVFET
DEEDNDDDRN HVNPPNSFIM PKMSISERPN IHDYSGRSSR FQVTLLSSYG SYKVDTNYLV
KSIERELSCD CIGIRHVNLD IHNDYRSSSF SRFDKSLVKN SDLIFVVNDG SSVFLEYLTS
VFGGDVQLDE DSMEALPKLT IINMMTVNYF VNLFELINYL KPYQIWKTSS LKQEKLVNKV
KDFIEIELNQ SDHFESNKVA TKDTNLSLVV SNVGRSQTMY SNLISHKRAD YKGIEKKFKT
DLQGSSSFSD PLSISSNFAH INILYSILMK LFSTSQLSQN IVAVDKSTPK SSRFWLICSF
TVGIGFGIGI ASGATSVVGL YIYEKFLQFS PGQTQQCIPV SSPATKPIVD TVVDLSKEFQ
GSMFQFYNEV STDLIGELRS FSTLYVSYLR SAGDIVIDCI RGGLEKVVGL VVYTNC
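
The sequences above are displayed in blocks of ten characters, so the whitespace has to be removed before any analysis. The Python sketch below shows one way to do that, to compute GC content, and to translate in-frame codons with translation table 12, whose amino acid assignments differ from the standard code in reading CTG as serine. The function names are hypothetical. The example translates only the first 42 bases of the gene sequence, because the gene is spliced and the full genomic sequence is not expected to translate as a single reading frame.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 12, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# The only difference from the standard table is CTG -> S instead of L.
TABLE_12_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE_12 = dict(zip(CODONS, TABLE_12_AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate_table12(dna: str) -> str:
    """Translate complete codons from position 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE_12[dna[i:i + 3]] for i in range(0, usable, 3))


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGATACCC TTAGTTCCCA TATTGAGAGC AACGTCAGAG AAGTGTGTGT CTGCTCTGGA"
exon_prefix = clean_sequence(first_line)[:42]
print(translate_table12(exon_prefix))  # MDTLSSHIESNVRE
```

The printed peptide matches the first 14 residues of the protein sequence listed above. To recompute the GC content for the whole gene, pass the full cleaned gene sequence to `gc_percent`.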