Gene PICST_81171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81171 
Symbol 
ID4851745 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2724606 
End bp2727924 
Gene Length3319 bp 
Protein Length997 aa 
Translation table 
GC content43% 
IMG OID640393453 
Productpredicted protein 
Protein accessionXP_001387089 
Protein GI126275470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.150116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.378562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCAGTTAAAG TATTCACCAG TAACAACAAC CCCTATTCGT GTAAGCCGTT AAGCGTTATA 
GGTTCAGACT GGAGAAGATC TGATCTCCAA CTGTTGCAGA TATAATCGCA TTATCTGCAG
GATTCCAGAT AAGTAGTCGC ACTACGAAAA ATCCAAATTC CATACAGAAA TTAAACAATC
TACAGTATGA AGATGAAGAT TGGCTGGTTT CGAATACTTT CTCAATTGAC TCCCACTGTC
CACTACAATT CCTTCGCTAA ACCACATATG ACAATATAGA ATGAGTGGTT CAATTGTGCG
GGCTTTGAAC TGGGACTCGG GCTACGAGCA GCAGTTTCTT GCTGTGAATC CAATTGGAGA
CGAAGTCCTT CTCTACCAGA CTAATCATGA AGACCCAGGG ATCGAGTCCA ATGACTTGAT
CAAGTTGAAC AGCCGAACGG GGTTCGAAAA CATACAATGT TCGTCGTACT CGATGATTAA
CCGAGGGATA ACTGGAGTCG GTTCCATTTC CGGAAACATC TCTATCTTCG ACATCAATTC
TAACAACTCG TCGATTCTCA AATTGAGACC CAAACAGAAT CGTCCATGTA ATGCGATTTC
CTTCAACAGC AGTAACTTAA TAGTAGCAGG TTTTGACAAG GGCCGTCAGG ACAATTCGTT
GCAAATTTGG AACATCGAGC ATTACTCACG AAACAGTACT AATGAGCATA TAAAAAGACC
TGTGGCTACA TACTTACCCA ATGAGGCTAT CTTGCTGTCT ATATTCTATC CTGACAGAGA
AGGCAGTATA CTCTGTGGTT CATACAAATT TTTGCGTGAA ATAGACCTAC GTGTGGACCA
GCCGATTTTC CAGATGGCGA CAAAATGTAC GTTGGGTTTA GCAGTAGACC ATTTCCGAAC
CCATTTGTTT CTGTCGTTCA GTGAAGATGG TTCTTTAGCC ATTTGGGATA GACGGAAATT
GACATCGAAT ACGGCGGTTA AACCTAAAGG TCCTCTCACG TCAGGAAATG TCATCACCGA
AACTCCAGTA TTGCAATTCG TAAAATTACT CAATGATTCA ACTTCTCGAA AGAACCAGAA
TCCTTGTGTA CGATATTCTA CTATCAGAAA AGGTGAGTTT TCAGCTATAT TCAACGGAGA
TTTGATTAGA AGATGGAATA CAGGCATAGT TCCAGCAACC AGCTCAACCT CAATCTCGGA
GAAGAGTTCT AGGGAGAGTA ATCCTACTCT TGCCAGCTTG CAACAACAAT CGCAACAACT
ATACAAACCT ACTGATGAGT CTCTATTTGT GTCACTAGTA TTGGATGTAA AGACAGACTA
TGAACGTGTA GTTTCGTTTG ATTATTCGCC AGATATCACA TCTGCTACTT CAACTAATTT
CGTATGTATG CGCCAATCCG GTTCGGTGTT TAGAATGCCA GCTGTAGAAT GCATTGAGTC
CCTTGATTTC AACTCTCTTA ACGAGTTTAC AATCGCTGGA CCTGAGGGAA CATTGACCAA
GTTTTCGGAT CAGGAGGAAC TAGCTAAGGT GCGAGCAGAA GCTGCTGCTA TGGCTCAGAC
AGCATCGAAT CGTGGAAACT CCGTAGTTAA CAAACTCGCT GATCTAGGCA TTATTGACGA
AACAAGAAAG TACAGCGAGG CAGAGTTTTC CGAAGATATT GAGTCCACTG TAGATGATGA
AAGTGCCATT GCTCCTTATG AAGCTGATAA TGCCAATAGA TACAACTTTG GCGATCTTGA
AGTAGATATA CATTTAAACG ATATTCTTGA TGCATCGGCT GTTATACATA GCGACATCTG
CTCCACAATA AGAAAAAGAG CAATACTAGG CTATGGTGTC GATTGCGACA GAAACATTCG
CGTTTTGGAA GACTTGGACT CTCTCAACAG TCAACTTTTC TTGAGGAACA CCTGGAAGTG
GTTGGGGTTG GCTAAGAAGT CCTTGGAAAA GGGTACCATG ATCTCCGAAG GGATTGACTT
GGGGTATCAG GGAGTATTGG GAATCTGGGA AGGAGTAAAA GAAATGGATA ACCAGAAACG
GTCTGTGCCT GAAGCGGGTC TAATAACCGA TGGCTGGTTT TCCCATGCTG TCAAGTCTAT
TGTTTCATCT AAGGGAAAGA AGACAGCTGG TATCAATATC GCTAGTAACA GCGAAAAAAA
GGCGCAACGG AAACTTTGTT TAATTGTCTC TGGGTGGTAT TTGGCAGATA GTGAATTTGA
GGAGAAATTG AATATTTTGA TTTCTTTAGG ATACTCAGAA AAAGCTGCTG GTTGGGCAGT
TTTCCATGGC GATGTGCCTA AAGCTATTGA AATTCTTGCC AATGCGAAAA AGGAAAGATT
GCGATTGATG TCTACGGCTG TAGCTGGTTA TTTGGCATAC AAAGATTCCA ACGTTAATAG
TCCGTGGAAA GACCAATGCC GGAAGATGGC TTCAGAGTTG GACGATCCAT ATCTCAGAGC
CATTTTTGCG TTTATTGCAG ACAATGACTG GTGGGACGTG CTTGATGAAC ATTCGTTGCC
GTTGAGAGAA AGACTAGGTG TAGCCCTCAG GTTCCTTTCA GATAAGGACT TGAATGTTTA
CTTACACAGA ATCGCCGATA CTGTAGTCAA CAAAGGCGAA TTGGAAGGGC TCATTTTGAC
CGGAATTACA CCTCGAGGAA TCGACTTGTT GCAGAGCTAT GTAGATAGAA CCAGCGATGT
CCAGACTGCA GCATTGATTG CCGCGTTTGG GAGCCCCAGG TATTTCTCCG ACGAACGAGT
AAGGCATTGG ATTGATTGTT ACAGAAGCTT GTTGAACAGT TGGGGACTCT TTAGTGTACG
AGCCAAATTT GATGTAGCTC GTACTAAGCT TTCCAAGAAT GCCGCTGGCA CTTCGACCAT
TAAACCTTCG CCAAAGCAGG TTTACTTACA GTGCTCCAGA TGTAACAAGA ATTTATCAAA
ATCGAAGACA ACTAACTCCA ACAGTCTTCC TGGTTCGAAC CCTCAGGCAA TCATCAAACA
ATTCAACAAA ATGAACCACC ACAACAATAA TAGTAGCAAA CTGGCTACAA ATGACATTGC
TGCTTGTCCT CATTGTGGTG CTCCGCTTCC TCGTTGTTCT GTTTGTTTGC TTACCTTGGG
TACACCTCTT CCATTGGAGC CATCCGAGAA AATCCAGGAA GTCACATTGG CCAACAAAAT
CGAAAACAGA TTTAGAGAGT GGTTCAGTTT CTGCTCTAGC TGCAACCATG GCTGCCATGC
ACATCATGCT GAGGAGTGGT TTTCCAAGCA CTACGTTTGT CCTGTTCCGG ATTGTAATTG
TAGATGTAAC AGTAAATGA
 
Protein sequence
MSGSIVRALN WDSGYEQQFL AVNPIGDEVL LYQTNHEDPG IESNDLIKLN SRTGFENIQC 
SSYSMINRGI TGVGSISGNI SIFDINSNNS SILKLRPKQN RPCNAISFNS SNLIVAGFDK
GRQDNSLQIW NIEHYSRNST NEHIKRPVAT YLPNEAILLS IFYPDREGSI LCGSYKFLRE
IDLRVDQPIF QMATKCTLGL AVDHFRTHLF LSFSEDGSLA IWDRRKLTSN TAVKPKGPLT
SGNVITETPV LQFVKLLNDS TSRKNQNPCV RYSTIRKGEF SAIFNGDLIR RWNTGIVPAT
SSTSISEKSS RESNPTLASL QQQSQQLYKP TDESLFVSLV LDVKTDYERV VSFDYSPDIT
SATSTNFVCM RQSGSVFRMP AVECIESLDF NSLNEFTIAG PEGTLTKFSD QEELAKTASN
LNKLADLGII DETRKYSEAE FSEDIESTVD DESAIAPYEA DNANRYNFGD LEVDIHLNDI
LDASAVIHSD ICSTIRKRAI LGYGVDCDRN IRVLEDLDSL NSQLFLRNTW KWLGLAKKSL
EKGTMISEGI DLGYQGVLGI WEGVKEMDNQ KRSVPEAGLI TDGWFSHAVK SIVSSKGKKT
AGINIASNSE KKAQRKLCLI VSGWYLADSE FEEKLNILIS LGYSEKAAGW AVFHGDVPKA
IEILANAKKE RLRLMSTAVA GYLAYKDSNV NSPWKDQCRK MASELDDPYL RAIFAFIADN
DWWDVLDEHS LPLRERLGVA LRFLSDKDLN VYLHRIADTV VNKGELEGLI LTGITPRGID
LLQSYVDRTS DVQTAALIAA FGSPRYFSDE RVRHWIDCYR SLLNSWGLFS VRAKFDVART
KLSKNAAGTS TIKPSPKQVY LQCSRCNKNL SKSKTTNSNS LPGSNPQAII KQFNKMNHHN
NNSSKLATND IAACPHCGAP LPRCSVCLLT LGTPLPLEPS EKIQEVTLAN KIENRFREWF
SFCSSCNHGC HAHHAEEWFS KHYVCPVPDC NCRCNSK