Gene PICST_66897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66897 
Symbol 
ID4837503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp927483 
End bp931285 
Gene Length3803 bp 
Protein Length993 aa 
Translation table12 
GC content43% 
IMG OID640388818 
Productpredicted protein 
Protein accessionXP_001382949 
Protein GI150864215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.384478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACC TACGACCCAT CAATCAGCCA GAGGGACCTC TAGGGGTTTT CTCTGGTGAA 
AGCTCTTCAA ATCAAGCCTT AAACGAGCCA AAACCACCGG ATATTGACCG AAATCCACAC
GACCATGTCG CACCCCCTGA TCTGATGGAC CTCGATGGCG AGTCCACAGA TGCCGATGAA
AACGGCGATT TAGAGCCATC TTTCGAGACA GCATTGTCAA CTACTTCCGA GGAACCAGCC
CAACTACGGG ATAATCCCAT GGACGGACTG GTTAGTCAGA ACCAGACCCA ATTAAGCTCC
ACTATGGACA GTTTTGAAAC TTCGGTTTCG AAAAATTTTC ACGATCACGT GATCGGACAG
GTACACGACC AGGAAATGGC TGCAAATGAT GAAAATGAAA TGGTTTCCAC TTCTTTAGAA
TCTACCAGCA GATCAACTGA TTTACCAGAA CAAGAAGACT CTCTTCTTAT TCACCATCAA
CATGAAAACG CCAAAACCCA AAAAAACAAA GAAAATTTAC AAAACTCCCA AAAAAATACA
GGAAACTTAC AAAACTCAAA AAAAAACTCA AAAAACTCAA AAAATTCAAA AACAACTCAA
CCAAATGTTG ATCAGATCTT CCCTATCTTA GGAACTGCCA GCAAATCAGC CAAAACAGGT
TTACGTACCT TCAATATTGC CAAACAAGTA CCAATCCCAA TACTCAATCC AAAAAATGGC
CCATCTGCTA GCCAACTCCA AAAGGTAATT GTGGACAGAG ACTCCTCACC TATTCTTCAA
GACGCAAAAA CTAGACGTAA GCAATTGACC GAACTACACG AAACTACCGG TCACCTTTCA
GAAGAACAAT ACCGTCAATT AGCAAACACC TACTATCTCC AAAAATATGC CAACTCCCTT
TCAGATCTCA ATTGGGCGGG ACAAGAAGAA AGACTTAAGG CATGGAATGT GCCCCAAGAT
TTCTGCCTTA CCGCACTCGG TGAGATTGCT CGAAATTCAA ACGAGAAAAG ATACGTACGT
CTTCAAATAG ACGCTTTCTA TAACAAAAAT GACCATCTCG ACCAAAGGCA TTCAATGCGG
GCCGAGGAAA TGGCCAAAAT AATTGAAAAA CACCTTACAG AGACTATCCC AAACAAATGG
CCCATGGTCT CCCAAAACAA TAACAAAACT TTAAATGAAC TCCATAAATC CTTGGAGTTC
TTAAAAAGTC AATTCGACCC CGGCTCAAAT GACGAATTTG AAGCCACTAA GGACATAAAA
GACAAAATGC AGCAATTGTC TAGAGAAATA AGTTTCAGCG CTACAATGAG AGATGTTAAA
GACAAGTTCC ATACAATTGT AAGAGATAAC ACTCATGTGG ATTTTCGTTT CGGAAGCATA
GTAACTCAGG ATCGGATCCC AGAACTGGCT CAACAGAAAT CAACACCCAT GACAACTTGG
CTTGAAAGAT TTCAACAACT TTTCCACTCA CCTTACGTGG GCAGTGAAGA CACCTTTGAA
TTTCAGCTCA GTCTTGTAAG GCCAAAGCAA CTATCCAACA TGTATCTAGT TGGCATAAAA
GCTTCGTCGG ACTTCCCAGA TCCGCGAGAC ATCCTAGACA TCATGTTCCA CACAAAAGAT
TTTGAAATTC CTCAATACAT CGATACATCC TTACCTTCTC GTAAACCCTC GAGACGACAG
GACCGAATCC CGTACGAAAT TCTACATCAA TTCATAAGAA AGGCACCAAT CTTAAATTAC
ACTGACAAAA AAGCAGACTT TACACACGTC CTGTTCTTCT TGATTGGCAG CAACTCAGAC
AATATTCCCA ACAGAAAGAG TATCTTCTTG GAAAACGTCC AACTTGACAT CATCTCTTCA
TACCAATTCT GTTTCAGATG CCACAACAAC AAGCACACTA CCAAAAGATG TCCAGTTCCC
AAATCAACCA CATTGTTCCA AAGTCGGCCC TTAACACAAT GGCCCACTAA GGTACCATCC
CCAAACAAAC AGAACCATCC AATAACCCAC ACCACGGGTC CCAGAACTAC AAATCAAGAC
GGGGATGGCT TTAGTCGACC CACGAAAAGA TCTAGACAAA AGACAACCCC CAGTCCACCA
ACTATCCCAC AACAACAGAA TAGCTTCGAG GTGCTCCCGA TAGAGGACCT CACAACACAA
GAGGTTACAG CAGAGGAGAC CGAAGCCACA CGCAACCGGC CAAACACTGT TTCATCTACA
CCACAAGCAC CATCACGTCA CGAAACCCCG AAAGCCATTA ACAAGAACAA CGATAAAGCC
CCTGAAATTT CAACACAAGA CGATGAAATG GTGTACTACA CCGATGACGA AGAACCCTCG
ACTATAAATG ACGAGGACAC CCAACTCGCT CCATCTACAT TGATTGAGCA ACAACAAAAA
AATTCAAAAG TGGATACCAC CCCAAATACT CCACAATACA CTACTGACAT GCTCCCTCTG
AAGCATGCAC CTAAATCAAA TTCAAGATCT CAACCGGTAA CTCCCTCCCG GCCTACTAGC
ATGCTCCCTC TGAAGCATGC TCCCATAACT AGTTCCAAAT CTCAACCGGT AACCCCTGCC
CGGCTTGGAT CCAAGGTCCT CAAACCTGCG TCAACAGGTA AGAAACCCTG GACACCGACA
CCCACTCCCA GGACCACGAA TGGCCCTAGT GGGTCCCCCG CAAGGACTCC GACCACGTCA
ATACGACTAC CTTCTCTCCT AAACAACAGT AGAACAAACT ATAGTGAATT AAGGGAATCA
CAAATGGGAT CGACCATTAG TTTTCCTGAG TCTATGAGGA CATCCAACCT CAACATCCCC
GACTCTTCAC TAATTCTACA AACTCAAATT GAGGAAGCAA CTCAAGTAGA TTCTCCAGGC
CAACAACAGG CACCAGGTCA CAATCCATTA ACAGATGACA ACCTGGACCT AAGTATGGAC
ATAGAGGACA TCAACGTTCA TTCTGATAAT ACTAATTATT AAGATGTCTA GATTAGTAAA
TTTTCGGATA ACCAACCCGC TGTTAACGGA ACACCGGGTA GCAAATCCAA CTAATCTTCT
CAAGATCCGT ACTAAAAATA TTCAAAAAAA CACACAGATC AATAAGTTTC GAGAACTCGC
GACTTACTGT GATGTCTTAC TCATTCAAGA AACTGATTTT AATTCAAAAC AACCAGCCAA
CTCCCAGAAC AGGCGGGGAG CCAGGAGGGG AGCAAGACGG GCAATGCAAC AACAACCCCC
AAACTACTCT CAGGATCCCA CCCCAGATTG GATTACCTCA TTACAAAAAC AATTGAACCA
AGCAAATCAA GAGCTAATCT ATACAGATAC ATTAGCTCGC TCGGGAATCA TACTTAATTT
TCAACACCAA CACTTTGAAA AGATTTCGTC TAATAACCTC CAGCTCGACT CTGAAATCGC
GCGGTATGCG ACCGACGTAA TCATTCAACT CAAAGAAACA AAAGAATATA TCTTAGTTAT
CTCAGTGTAC GGACCGAGTG GGAATCATCG TTCTCAAGAG CAATTATTCC ACTCTCTTTA
TACTCTGATT AACACACTCA TCACAAATTT CGAAATTGAC AATAACAATC ACAAGCTCCA
CCTCTGTATA GGGGGAGACT TCAATATGAT ACAAAATCCA GAACTAGATT CTACTGCAAG
AGAAAGCTCC AGTCGAGAGC ACGCCTCCCG ACAAGCCTTC AACTCTCTTT GCAACGAATT
TCAACTACAT GACTCTCTTA GAGGATTAGA ACCGACTATC AAGGTACCCA CAAATACCAA
CACAAATAAT TGCAGAAGAC TCG
 
Protein sequence
MADLRPINQP EGPLGVFSGE SSSNQALNEP KPPDIDRNPH DHVAPPDSMD LDGESTDADE 
NGDLEPSFET ALSTTSEEPA QLRDNPMDGS VSQNQTQLSS TMDSFETSVS KNFHDHVIGQ
VHDQEMAAND ENEMVSTSLE STSRSTDLPE QEDSLLIHHQ HENAKTQKNK ENLQNSQKNT
GNLQNSKKNS KNSKNSKTTQ PNVDQIFPIL GTASKSAKTG LRTFNIAKQV PIPILNPKNG
PSASQLQKVI VDRDSSPILQ DAKTRRKQLT ELHETTGHLS EEQYRQLANT YYLQKYANSL
SDLNWAGQEE RLKAWNVPQD FCLTALGEIA RNSNEKRYVR LQIDAFYNKN DHLDQRHSMR
AEEMAKIIEK HLTETIPNKW PMVSQNNNKT LNELHKSLEF LKSQFDPGSN DEFEATKDIK
DKMQQLSREI SFSATMRDVK DKFHTIVRDN THVDFRFGSI VTQDRIPESA QQKSTPMTTW
LERFQQLFHS PYVGSEDTFE FQLSLVRPKQ LSNMYLVGIK ASSDFPDPRD ILDIMFHTKD
FEIPQYIDTS LPSRKPSRRQ DRIPYEILHQ FIRKAPILNY TDKKADFTHV SFFLIGSNSD
NIPNRKSIFL ENVQLDIISS YQFCFRCHNN KHTTKRCPVP KSTTLFQSRP LTQWPTKVPS
PNKQNHPITH TTGPRTTNQD GDGFSRPTKR SRQKTTPSPP TIPQQQNSFE VLPIEDLTTQ
EVTAEETEAT RNRPNTVSST PQAPSRHETP KAINKNNDKA PEISTQDDEM VYYTDDEEPS
TINDEDTQLA PSTLIEQQQK NSKVDTTPNT PQYTTDMLPS KHAPKSNSRS QPVTPSRPTS
MLPSKHAPIT SSKSQPVTPA RLGSKVLKPA STGKKPWTPT PTPRTTNGPS GSPARTPTTS
IRLPSLLNNS RTNYSELRES QMGSTISFPE SMRTSNLNIP DSSLILQTQI EEATQVDSPG
QQQAPGHNPL TDDNSDLSMD IEDINVHSDN TNY