Gene PICST_33080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33080 
Symbol 
ID4840253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1386486 
End bp1389467 
Gene Length2982 bp 
Protein Length993 aa 
Translation table12 
GC content44% 
IMG OID640391568 
Productpredicted protein 
Protein accessionXP_001385963 
Protein GI150866385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.343186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACC TACGACCCAT CAATCAGCCA GAGGGACCTC TAGGGGTTTT CTCTGGTGAA 
AGCTCTTCAA ATCAAGCCTT AAACGAGCCA AAACCACCGG ATATTGACCG AAATCCACAC
GACCATGTCG CACCCCCTGA TCTGATGGAC CTCGATGGCG AGTCCACAGA TGCCGATGAA
AACGGCGATT TAGAGCCATC TTTCGAGACA GCATTGTCAA CTACTTCCGA GGAACCAGCC
CAACTACGGG ATAATCCCAT GGACGGACTG GTTAGTCAGA ACCAGACCCA ATTAAGCTCC
ACTATGGACA GTTTTGAAAC TTCGGTTTCG AAAAATTTTC ACGATCACGT GATCGGACAG
GTACACGACC AGGAAATGGC TGCAAATGAT GAAAATGAAA TGGTTTCCAC TTCTTTAGAA
TCTACCAGCA GATCAACTGA TTTACCAGAA CAAGAAGACT CTCTTCTTAT TCACCATCAA
CATGAAAACG CCAAAACCCA AAAAAACAAA GAAAATTTAC AAAACTCCCA AAAAAATACA
GGAAACTTAC AAAACTCAAA AAAAAACTCA AAAAACTCAA AAAATTCAAA AACAACTCAA
CCAAATGTTG ATCAGATCTT CCCTATCTTA GGAACTGCCA GCAAATCAGC CAAAACAGGT
TTACGTACCT TCAATATTGC CAAACAAGTA CCAATCCCAA TACTCAATCC AAAAAATGGC
CCATCTGCTA GCCAACTCCA AAAGGTAATT GTGGACAGAG ACTCCTCACC TATTCTTCAA
GACGCAAAAA CTAGACGTAA GCAATTGACC GAACTACACG AAACTACCGG TCACCTTTCA
GAAGAACAAT ACCGTCAATT AGCAAACACC TACTATCTCC AAAAATATGC CAACTCCCTT
TCAGATCTCA ATTGGGCGGG ACAAGAAGAA AGACTTAAGG CATGGAATGT GCCCCAAGAT
TTCTGCCTTA CCGCACTCGG TGAGATTGCT CGAAATTCAA ACGAGAAAAG ATACGTACGT
CTTCAAATAG ACGCTTTCTA TAACAAAAAT GACCATCTCG ACCAAAGGCA TTCAATGCGG
GCCGAGGAAA TGGCCAAAAT AATTGAAAAA CACCTTACAG AGACTATCCC AAACAAATGG
CCCATGGTCT CCCAAAACAA TAACAAAACT TTAAATGAAC TCCATAAATC CTTGGAGTTC
TTAAAAAGTC AATTCGACCC CGGCTCAAAT GACGAATTTG AAGCCACTAA GGACATAAAA
GACAAAATGC AGCAATTGTC TAGAGAAATA AGTTTCAGCG CTACAATGAG AGATGTTAAA
GACAAGTTCC ATACAATTGT AAGAGATAAC ACTCATGTGG ATTTTCGTTT CGGAAGCATA
GTAACTCAGG ATCGGATCCC AGAACTGGCT CAACAGAAAT CAACACCCAT GACAACTTGG
CTTGAAAGAT TTCAACAACT TTTCCACTCA CCTTACGTGG GCAGTGAAGA CACCTTTGAA
TTTCAGCTCA GTCTTGTAAG GCCAAAGCAA CTATCCAACA TGTATCTAGT TGGCATAAAA
GCTTCGTCGG ACTTCCCAGA TCCGCGAGAC ATCCTAGACA TCATGTTCCA CACAAAAGAT
TTTGAAATTC CTCAATACAT CGATACATCC TTACCTTCTC GTAAACCCTC GAGACGACAG
GACCGAATCC CGTACGAAAT TCTACATCAA TTCATAAGAA AGGCACCAAT CTTAAATTAC
ACTGACAAAA AAGCAGACTT TACACACGTC CTGTTCTTCT TGATTGGCAG CAACTCAGAC
AATATTCCCA ACAGAAAGAG TATCTTCTTG GAAAACGTCC AACTTGACAT CATCTCTTCA
TACCAATTCT GTTTCAGATG CCACAACAAC AAGCACACTA CCAAAAGATG TCCAGTTCCC
AAATCAACCA CATTGTTCCA AAGTCGGCCC TTAACACAAT GGCCCACTAA GGTACCATCC
CCAAACAAAC AGAACCATCC AATAACCCTC ACCACGGGTC CCAGAACTAC AAATCAAGAC
GGGGATGGCT TTAGTCGACC CACGAAAAGA TCTAGACAAA AGACAACCCC CAGTCCACCA
ACTATCCCAC AACAACAGAA TAGCTTCGAG GTGCTCCCGA TAGAGGACCT CACAACACAA
GAGGTTACAG CAGAGGAGAC CGAAGCCACA CGCAACCGGC CAAACACTGT TTCATCTACA
CCACAAGCAC CATCACGTCA CGAAACCCCG AAAGCCATTA ACAAGAACAA CGATAAAGCC
CCTGAAATTT CAACACAAGA CGATGAAATG GTGTACTACA CCGATGACGA AGAACCCTCG
ACTATAAATG ACGAGGACAC CCAACTCGCT CCATCTACAT TGATTGAGCA ACAACAAAAA
AATTCAAAAG TGGATACCAC CCCAAATACT CCACAATACA CTACTGACAT GCTCCCTCTG
AAGCATGCAC CTAAATCAAA TTCAAGATCT CAACCGGTAA CTCCCTCCCG GCCTACTAGC
ATGCTCCCTC TGAAGCATGC TCCCATAACT AGTTCCAAAT CTCAACCGGT AACCCCTGCC
CGGCTTGGAT CCAAGGTCCT CAAACCTGCG TCAACAGGTA AGAAACCCTG GACACCGACA
CCCACTCCCA GGACCACGAA TGGCCCTAGT GGGTCCCCCG CAAGGACTCC GACCACGTCA
ATACGACTAC CTTCTCTCCT AAACAACAGT AGAACAAACT ATAGTGAATT AAGGGAATCA
CAAATGGGAT CGACCATTAG TTTTCCTGAG TCTATGAGGA CATCCAACCT CAACATCCCC
GACTCTTCAC TAATTCTACA AACTCAAATT GAGGAAGCAA CTCAAGTAGA TTCTCCAGGC
CAACAACAGG CACCAGGTCA CAATCCATTA ACAGATGACA ACCTGGACCT AAGTATGGAC
ATAGAGGACA TCAACGTTCA TTCTGATAAT ACTAATTATT AA
 
Protein sequence
MADLRPINQP EGPLGVFSGE SSSNQALNEP KPPDIDRNPH DHVAPPDSMD LDGESTDADE 
NGDLEPSFET ALSTTSEEPA QLRDNPMDGS VSQNQTQLSS TMDSFETSVS KNFHDHVIGQ
VHDQEMAAND ENEMVSTSLE STSRSTDLPE QEDSLLIHHQ HENAKTQKNK ENLQNSQKNT
GNLQNSKKNS KNSKNSKTTQ PNVDQIFPIL GTASKSAKTG LRTFNIAKQV PIPILNPKNG
PSASQLQKVI VDRDSSPILQ DAKTRRKQLT ELHETTGHLS EEQYRQLANT YYLQKYANSL
SDLNWAGQEE RLKAWNVPQD FCLTALGEIA RNSNEKRYVR LQIDAFYNKN DHLDQRHSMR
AEEMAKIIEK HLTETIPNKW PMVSQNNNKT LNELHKSLEF LKSQFDPGSN DEFEATKDIK
DKMQQLSREI SFSATMRDVK DKFHTIVRDN THVDFRFGSI VTQDRIPESA QQKSTPMTTW
LERFQQLFHS PYVGSEDTFE FQLSLVRPKQ LSNMYLVGIK ASSDFPDPRD ILDIMFHTKD
FEIPQYIDTS LPSRKPSRRQ DRIPYEILHQ FIRKAPILNY TDKKADFTHV SFFLIGSNSD
NIPNRKSIFL ENVQLDIISS YQFCFRCHNN KHTTKRCPVP KSTTLFQSRP LTQWPTKVPS
PNKQNHPITL TTGPRTTNQD GDGFSRPTKR SRQKTTPSPP TIPQQQNSFE VLPIEDLTTQ
EVTAEETEAT RNRPNTVSST PQAPSRHETP KAINKNNDKA PEISTQDDEM VYYTDDEEPS
TINDEDTQLA PSTLIEQQQK NSKVDTTPNT PQYTTDMLPS KHAPKSNSRS QPVTPSRPTS
MLPSKHAPIT SSKSQPVTPA RLGSKVLKPA STGKKPWTPT PTPRTTNGPS GSPARTPTTS
IRLPSLLNNS RTNYSELRES QMGSTISFPE SMRTSNLNIP DSSLILQTQI EEATQVDSPG
QQQAPGHNPL TDDNSDLSMD IEDINVHSDN TNY