Gene PICST_33722 details

Gene Information

Locus tag: PICST_33722
Symbol:
ID: 4840890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 329990
End bp: 332212
Gene Length: 2223 bp
Protein Length: 740 aa
Translation table: 12
GC content: 39%
IMG OID: 640392205
Product: predicted protein
Protein accession: XP_001386650
Protein GI: 150866902
COG category:
COG ID:
TIGRFAM ID:
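
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive positions and that the final codon of the unspliced CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 329990
END_BP = 332212


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "2223 740"
```

Both results agree with the Gene Length (2223 bp) and Protein Length (740 aa) fields.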


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.900153
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACGA CAAACAATCA AGTGATTTCT GTGAAGGTAA CACACAACAG ACTTGCACTT 
GCAAGCCTCC CTTCAGTAGA GAAGACTCTT TCAATCAACC GTACCACTTT TGACAAAGTC
AAGTCGAAGG ATGAGTTCAC AGATTTATTG TCGAAGACTG ATTTGAGTGT TTCTAGTGAG
AACAATTCAT TTATCAGCTA CCTTCGTAAG TCGAAAAAGC GCAAGGAGTA TGTTCCTTTG
GAATCAACAG AAGACTTCAA GTCCTTATCA AGAAGTTTAA AGGTGAAGAA TCATGTCAAG
TTGATTATTA ATGACAAATC CCCAATTATT GAAGGGAACT ACAAGAAGAG AAAGACTTCT
TCCATCGACT TCGCAGGCTT AGGTGATGCA TTACTTGAAG CAGCCTTGGA GCACTTCAAG
GAAATCTTCG CTGATCTACC TGTCCATGAG ACTCCTTCTT CTGGCGTGGA AAAGAATTTG
AATATCGAGA AGGAAATTAC TCAAACAAAG GTATCAGGAG AAGACTTGAT TGTGCACGAA
AATGTTGCTT GTGATTCATG CAGTAGAAAT GTTTTTATTC CAATTAAGGG CATAAGATAC
TGCTGTCTTG TTTGCCCAGA TTTTGATTTG TGTGAAGCGT GTGCCACTAG GTCTATAAAC
AATAAGCAGG ATATTGGCGA TCATAGTTAT TCTCATCCAG TGGCAATGCT TCAGAAACAG
GACAAGTATT TTAACCAAAA GAAATTCAAC TTCAGTGGTC ATTTACCTGC TCACGGCGGT
CCTTCAAACT TGCCAAGTGG TGAATCCAAT GTAGTTGTCT ATGATATTCC ATTAGAAAAC
TGCAATTCTC AAACAAAAGA AAGATTAGAG AACTTGCTTC AAAGAGATGG TATTGATCCA
TTTATTAGCA ATGTCCTGAA GTATATCGAG AACTCTGACA GATATGAAGA GTTGTTGTCT
TTGTCACAGG TTGGTTCCTC AGTTGAAAAT GATGATGAAA TGAATTTTGT CATATTGAAG
AGCATGATTG ACAAATGTAG GAACGATAAT ACGGAGTCCT GCGTGGGTCC AGTTCAGATT
CTTGAACCTT CGGTGTCTCC TTTCGAATCA ATTCCAGTAG ACTCTGTCCC CAAGGTACTT
GTAAAACCAA AGAAAATAAG TAACCATGCT CGTGTGATCT CATTAATGCT TACAAACACT
TCTGCAGTGA CAATCGATGG GGGTGACTTC AAGTTTGAAT TTTTCAATGA TACACAGAAG
GAAATTGTAG TAGTGAAGAA TGCTAGCAGA GTCAAACCTG GACAGCAGAG ATTCTACAAC
TTGGGTAGAT TAAATGACAG TTTTGACAAG TTATCAGGAA TGAAATTGAG ACTTTCAGCG
CCAAATGCAG TTTTAGAGGG TGACTACAAT GAAACGTATG ACTCAGAATT ATTGTTTGTT
AGCATGACTG GTAGTTTGTT GGAACAACCA CTGAACGAGC ATATTATTGA CAACCTGTAC
CTTGATAACA ACGACGAGGT CATTGTCACT GTAGTTCCAA AATCTAGCTC GACAGCGCAG
ATAATGATTT CCAATAAGTC TGAAAAGATG ATTGATTGTT CATATTTGAA GTTGGAGATC
GTAAACTGTT TCAATAGATC AGTTGTAACT GTGATTGTTC GCAAGAAGCA TGGTATTAAC
CCAGGTAAGG TTGGCAAGTT CAATATTGGA TTAATCGATG CTCATATGAA GTACCCGTTC
AAACTTGTAA TGAAAAACGA CTTTAATATC GGCTATTGTG ATTTGAGTAT CAACAATTTG
AGTGGAAGAC TCACCTTCGA GAAAGAGTCC TCTATTGAAA TTGATGACCA AGCTGGATTT
GTTTCTGATG ACGAAACTCA CACTTCCGGA ACAGAGTCAA TTCTTGAAGA TCAAATTGAG
ATGGAAGGGG TAGATGATCA CATCAGGAAT GATGTAGAAG TTAAAATTAC GACACCGCAC
GAATCTTTGG AATCGTCTCT TGTGGGTTCA ATTCACTCGA TTGTATTACC TTCGTTGCCC
AAAGAAGCAC TCGTTGAAAG CTCTAATTCT GAATACGTTG ATGCTCGTCT GGCTTCCATT
GAGCATGCTG ATAAAGCACT AACCGAAAAC GTGGAGGATG ACTACGATAT TATCAGTGTA
GAAGGTGATG CTGAAACTGA CTCTGATTTT GAATTATTGT CACCATCTAT CTCTAACCAA
TGA
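
A GC content figure such as the one in the Gene Information table can be recomputed from a sequence laid out in space-separated blocks like the one above. The sketch below is only an illustration: the helper names `clean` and `gc_fraction` are hypothetical, and the record does not say how the database itself computes or rounds its value.

```python
def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAATACGA CAAACAATCA AGTGATTTCT GTGAAGGTAA CACACAACAG ACTTGCACTT"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full gene sequence instead of `first_line` gives the figure for the whole gene.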
 
Protein sequence
MNTTNNQVIS VKVTHNRLAL ASLPSVEKTL SINRTTFDKV KSKDEFTDLL SKTDLSVSSE 
NNSFISYLRK SKKRKEYVPL ESTEDFKSLS RSLKVKNHVK LIINDKSPII EGNYKKRKTS
SIDFAGLGDA LLEAALEHFK EIFADLPVHE TPSSGVEKNL NIEKEITQTK VSGEDLIVHE
NVACDSCSRN VFIPIKGIRY CCLVCPDFDL CEACATRSIN NKQDIGDHSY SHPVAMLQKQ
DKYFNQKKFN FSGHLPAHGG PSNLPSGESN VVVYDIPLEN CNSQTKERLE NLLQRDGIDP
FISNVSKYIE NSDRYEELLS LSQVGSSVEN DDEMNFVILK SMIDKCRNDN TESCVGPVQI
LEPSVSPFES IPVDSVPKVL VKPKKISNHA RVISLMLTNT SAVTIDGGDF KFEFFNDTQK
EIVVVKNASR VKPGQQRFYN LGRLNDSFDK LSGMKLRLSA PNAVLEGDYN ETYDSELLFV
SMTGSLLEQP SNEHIIDNSY LDNNDEVIVT VVPKSSSTAQ IMISNKSEKM IDCSYLKLEI
VNCFNRSVVT VIVRKKHGIN PGKVGKFNIG LIDAHMKYPF KLVMKNDFNI GYCDLSINNL
SGRLTFEKES SIEIDDQAGF VSDDETHTSG TESILEDQIE MEGVDDHIRN DVEVKITTPH
ESLESSLVGS IHSIVLPSLP KEALVESSNS EYVDARSASI EHADKALTEN VEDDYDIISV
EGDAETDSDF ELLSPSISNQ
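
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how the protein sequence relates to the gene sequence under that table. It is self-contained and illustrative only: the names `build_table12` and `translate` are hypothetical, and the table is built here from the standard code with the single CTG reassignment.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order with the first base varying slowest.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table12() -> dict:
    """Codon table for translation table 12: the standard code with CTG -> Ser."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AA))
    table["CTG"] = "S"
    return table


def translate(seq_text: str, table: dict) -> str:
    """Translate from the first base and stop at the first stop codon.

    Codons containing anything other than A, C, G or T raise KeyError.
    """
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = table[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


table12 = build_table12()
# First line of the gene sequence, copied from the record above.
first_line = "ATGAATACGA CAAACAATCA AGTGATTTCT GTGAAGGTAA CACACAACAG ACTTGCACTT"
print(translate(first_line, table12))  # prints "MNTTNNQVISVKVTHNRLAL"
```

The output matches the first 20 residues of the protein sequence. The CTG reassignment is visible further along the gene: the codons AAT GTC CTG AAG translate to NVSK under table 12, matching the FISNVSKYIE stretch of the protein, where the standard code would give NVLK.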