Gene PICST_31200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31200 
Symbol 
ID4838636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp188491 
End bp190617 
Gene Length2127 bp 
Protein Length708 aa 
Translation table12 
GC content48% 
IMG OID640389951 
Productpredicted protein 
Protein accessionXP_001384335 
Protein GI150865212 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.645088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCT ACGAGAAGAT CGTCAAGGGC GCCACCAAGA TCAAGGTGGC TGCGCCCAAG 
CCCAAATACA TCGAGCCCAT TCTCATGGCC ACCTCTACCG AGCTCTCGTT AGAATCCGAC
AATTTTTCAA CAATCATGAA AACACTCCAG CATAGGCTTC AGGATCTGGC GTGGTCGGTT
GTTTATAAGG CTTTGATAGT GATCCACATA ATGATCCGCG AAGGTGACAA AGACGTCACG
CTCAAATACT TGGCCCACAA GAATCCCAAC ATGCTCTCCT TAGCCCTGGC TCCCGTAGTC
AAGAACCAGG CTGCTAACGC CGACGTCCGG TTCATCGTCA AGTACAGCAA GTATTTAGCG
ACAAGGGTCC GTCAATTCGA TACTACAGGG ATAGACTATG TTCGTGATGA ACGCTCCAAC
AACTCGACGT TGCAATCGGG AGGTAGACTC AGAACCCTCA CTGTAGAAAA GGGATTACTC
AGAGAGCTGG AGCTGGTGCA GAAACAGATA GATGCACTTT TGAAAAACAG CTTTATGGAA
AATGAAATTA ACAACGATAT CGTAGTTACA GCCTTCCGCT TACTTGTAAA TGACTTGCTT
GCACTTTTCC AGGAGCTCAA CGAAGGTGTC ATCAACATTT TGGAGCACTA CTTTGAGATG
TCGAAAATCG ACGCTGAACG GGCCCTCAAA ATCTATAAAA AGTTCGTAGA CCAGACGAAA
TATGTCATTG ATTATTTGCG GGTAGCCAAA CACCTAGAAT ACGCAACCCG TTTGCATGTT
CCTACGATCA AGCACGCTCC TACAGCCTTG ACTTCATCGC TAGAGGAATA CTTGGACGAT
CCAAACTTTG AAGCCAATAG AAAACAGTAC TTGCTGGAAA AGAAGGGAGA AACACCATTA
GAAGCAAAGC CTCAAAATTC ACAACAGCTT CAAAGCCAAC AATCCCAACA ATCCCAACAG
CAACAGCAGC CTGAGTTGCA GAGAAATAAT ACCTTGATTG TTCAGCAATC AACATACAAC
CCCTGGGGCG CAGTTATCCA ACAGCCCCAA CTTGCAAATG GCACAGGCTA CCAGATCGCA
GCATCCAATC TGATCGACGC CATGCTGCCT CAATTACAAC AGCAGAACGC TCAGCAACAA
CAGATGTTCG CTTCTGGCTT TTCTGGTATG CCTGTAATCG TTCAGGGACA GCAATTTCTG
CCGCTGCCTG TTGGCATCTC CACTGCCTTC ACAGGTGCGG GCTTTGGAGG CTATGGTCCC
CAGAATCAAA ATCAGATTCA ACATGTTCAG ATTCCACAGC AGGCAACGGG CCATAATCCA
TTTTTGCAAG GATTTTCCCA GCAGCAGCAG CCTCCTGCAG TACAGCAGCC TCTTGTAGCT
CCGCAGACTC TTGCTCTGCA GCAGCAGCCA TTCCAGCCCC AGGGGGCAGC TCAGCAGTCG
CAGACACAGC CAGATTTAAG AAGAGCAAGT ACGAATCCAT TCTCTACTTT GACTTCTTCT
GTAACTGGTC ACCAAGATGG TGGCGAGTAC TCGAATCCGT TTGCCAATTC GAGGTTTGCT
CCCAAGACCA CTACCACAGC TTTGAGCTTC AATAATGGTG TCACTACAAA TTCTGCATCT
CCTGCCGTCG ATCCAACTGC TACGGGTAGC AATCCATTCA AGGTGAGTCA AGCAACAACT
GCCTTGTTCA ACAATGCATC CAGCAAGGTT CAGCATCCAC AGCAACCTTT GAAGTCTCAA
CCAACAGCAG GAGGATTGGA GCACTTGCCT GTCATTCCTG TTTTCCCTGA AACCCAATTG
GAGAGTCAGA GAGAAAACTT CTTGACGGCT GCCAGAACTG GAATTGAAAA CCAACTCCAC
CAACAGCAAT TTCAGCAACA GCAATTTCAG CAACAGCAAC TTCAACAACA GCAGCAACTT
CAACAACAGC AGCAACTTCA ACAACAACAG CAACTTCAGC AACAGCAACA GCAACAATTT
CAACAACCAC AACAACCACA ACAGTTTCCT AATCTGTTTC AACAATCACA AACCCAGCAG
TTTCCTCAGC AGTTTTCGCA GCAGCAGTTT CCCTTTGCTC AACCACAGGT GACAGGCACG
TCGTACCAGG GAGCCAACTT GATTTAG
 
Protein sequence
MTTYEKIVKG ATKIKVAAPK PKYIEPILMA TSTELSLESD NFSTIMKTLQ HRLQDSAWSV 
VYKALIVIHI MIREGDKDVT LKYLAHKNPN MLSLASAPVV KNQAANADVR FIVKYSKYLA
TRVRQFDTTG IDYVRDERSN NSTLQSGGRL RTLTVEKGLL RESESVQKQI DALLKNSFME
NEINNDIVVT AFRLLVNDLL ALFQELNEGV INILEHYFEM SKIDAERALK IYKKFVDQTK
YVIDYLRVAK HLEYATRLHV PTIKHAPTAL TSSLEEYLDD PNFEANRKQY LSEKKGETPL
EAKPQNSQQL QSQQSQQSQQ QQQPELQRNN TLIVQQSTYN PWGAVIQQPQ LANGTGYQIA
ASNSIDAMSP QLQQQNAQQQ QMFASGFSGM PVIVQGQQFS PSPVGISTAF TGAGFGGYGP
QNQNQIQHVQ IPQQATGHNP FLQGFSQQQQ PPAVQQPLVA PQTLASQQQP FQPQGAAQQS
QTQPDLRRAS TNPFSTLTSS VTGHQDGGEY SNPFANSRFA PKTTTTALSF NNGVTTNSAS
PAVDPTATGS NPFKVSQATT ALFNNASSKV QHPQQPLKSQ PTAGGLEHLP VIPVFPETQL
ESQRENFLTA ARTGIENQLH QQQFQQQQFQ QQQLQQQQQL QQQQQLQQQQ QLQQQQQQQF
QQPQQPQQFP NSFQQSQTQQ FPQQFSQQQF PFAQPQVTGT SYQGANLI