Gene PICST_85406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85406 
Symbol 
ID4840459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp219647 
End bp222034 
Gene Length2388 bp 
Protein Length780 aa 
Translation table12 
GC content42% 
IMG OID640391774 
Productpredicted protein 
Protein accessionXP_001386060 
Protein GI150866451 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.920544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTCA AACTCAGTGC CACATTAAGG GGGCACGAGC AGGATGTCCG TGGAGTGGTT 
GCACCTTCAG ACGAACTTGT AGTAACTTGC CTGAGAGATT CTACTACCAG AATCTGGTTG
CCACCATCAG ACAGCAAGCA ATCACGTTTT GTTTCTGATC GCACCGAGCC ATTGATCGTG
TTCCATTCGC CAAACAATAG TTTTATCAAT AGTGTGACCT ATATAGACTC TAAACAGCAC
GAACCACTCA TTGCCAGTGG AAGCCAGGAT GCCATTATTT ATTTGTCAGA AGTGGCTGTT
TCAGACCGAA AGCCCGGAGA CGACACGGGA AAATACCAAT TGATAGGCCA CGCCGGCAAT
GTCTGTGCTT TGGAATATAA AAACAACCAG ATCATTTCGG GCTCATGGGA TTGCACTGCC
AAGGTGTGGG ATTTGGATAC CTTATTGGTG AAGTACGATC TTGTTGGTCA CGAGTCCTCA
GTGTGGGATG TCAAGATTTT GGACAACGAC ACGTTTTTAA CATGTTCTGC TGATAAGAGC
ATCCGTCTCT GGAACGGAAA GAAAGAAGTG CAACGTTTCA GCGGACATAC AGATGTAATC
AGAAAATTGT TAGTGTTTCC AGATGGGTCA AGATTCGCAT CTGCTTCGAA TGACGGAACT
GTTAAACTCT GGGACTTGAA ATCTGGCCGT GTCTTGCAGA CATTGCACGG CCACGAATCG
TTTGTATATG ATTTGACTCT TTTGCCTAAT GGAGATCTTG TTTCTGTAGG TGAAGATCGT
ACCATTAGAG TTTGGAGAGA TGGCTCTATT TTGCAAGTCA TCACTTTGCC CTGTATCTCA
GTATGGTGTG TTGCAGCTCT CCCTAATGGC GATATAGTTG TGGGAGGCTC GGATAACATA
GTGAGAGTTT TCACGAGAGA CCTGTCTCGA ATTGCTTCAG ATGAAGAAAT TGCAGAGCTC
GTGGAAGCAG TACAACAATC AAGCATTGCT GAACAATCAT TGGACAATTT GAAGAAGACG
GATATACCTA GCTATGAAGC CTTGGAACGT CCAGGGAAAC AAGAGGGTGC AACCATTATG
GTGAAGAACC CATCTGGTGT GATAGAAGCC CATCAATGGT CGGGAGGGGA GTGGGTCAAG
ATTGGTGACG TGGTTGGATC AGCCGGTTCT GGACAGAAAA AGACTTACAA TGGCAAGGAA
TACGACTATG TTTTTGATGT TGACATCGAG GATGGTGCTC CGCCTTTGAA ATTGCCTTAC
AATGTGAACG AAAACGCCTA TACAGCAGCA CAGAGGTTCT TAGCCGAAAA TGATTTACCA
AGTTCATACA CTGACGAAGT GGTTAAATTC ATTAATAAGA ATACTGAAGG TTTCAGTATC
CAAGAAGCGG ATGAAGCTCC AACTTATGAC GCTTCGTTGA ATCCATATTC TGATAGTTAT
CAGCGGGAAC ATAACCAAGA TACCATTGCC TCATCTAAGC CAAGCTTGAA AGTCATTCCT
GAAACCACAT ATATTTCCTT CAAAGACTAC AAGGAAGCTC AATTGATTGC TGGATTAAAG
AAACTCAATT CAGAACAAGA TGAAAGTAAT CAACTATCTG AAAGCGATAT TAGCACCATC
TCTAGAAATT TGAAATTATT AACTTCTAAG GAGTCTTTGC AATTGATTGT TGAATATATT
CCAAGAATCA TAACTACGTG GTCTCCAGCA ACTAGATTGA TTGGTTATGA TCTTTTAAGA
ATTAGCATAC CTCGCGTGAC GACCGTTGAT TTATTGAGAT CAACTGAAGG TGCTGAGGCA
GTATTGAAAG CTATATCGTC TGGCTTAGAT GTAGCGGATG CCTCTACGAT TCCTTTGTTG
ATGATGATTT TGAAGACTTT GAATAACTTG ATTGGGAATA CTTTGTTTGT TCAATTGTAC
ATTGATCCTA ACGAGGACGG AACGTATACC TACAACAAGT TCTTCTTGGA CTTGCTATCA
ACATTGACCG CTAAGGTTAT AGACATAACT GAGAATGGCA AGTTCCACAA GCTCTATAAC
ACTACAGTTA CTACTATTGC AACTTTGTTG TACAATCTTT CCGTGTATCA TCTTCAAACT
TCTGGCTTGA AGAAGAATCC AAGGTCATCA GAATCGATTG TTCAATTCAC TAACGAAGTT
GGAGACCTTA TTGTTGAATC CAACAGCGAA GCTGCCTATA GACTAGCAAT AGCTTATGGC
AACTTGAGGT ATGCCAAGGC ATTTGAACCT GTACCAGAAT GGTTAAATAA GGCTGGAGAG
CTATATATCA GCAAGGGCGA GCAGAGATTT ATCGACTTGG CTCGCGATTT GCAAAGCTTG
TAAGGCTTAT TAAAAGTTGG TTTCTATTTT AATACACAAA CATTATGT
 
Protein sequence
MPFKLSATLR GHEQDVRGVV APSDELVVTC SRDSTTRIWL PPSDSKQSRF VSDRTEPLIV 
FHSPNNSFIN SVTYIDSKQH EPLIASGSQD AIIYLSEVAV SDRKPGDDTG KYQLIGHAGN
VCALEYKNNQ IISGSWDCTA KVWDLDTLLV KYDLVGHESS VWDVKILDND TFLTCSADKS
IRLWNGKKEV QRFSGHTDVI RKLLVFPDGS RFASASNDGT VKLWDLKSGR VLQTLHGHES
FVYDLTLLPN GDLVSVGEDR TIRVWRDGSI LQVITLPCIS VWCVAALPNG DIVVGGSDNI
VRVFTRDSSR IASDEEIAEL VEAVQQSSIA EQSLDNLKKT DIPSYEALER PGKQEGATIM
VKNPSGVIEA HQWSGGEWVK IGDVVGSAGS GQKKTYNGKE YDYVFDVDIE DGAPPLKLPY
NVNENAYTAA QRFLAENDLP SSYTDEVVKF INKNTEGFSI QEADEAPTYD ASLNPYSDSY
QREHNQDTIA SSKPSLKVIP ETTYISFKDY KEAQLIAGLK KLNSEQDESN QLSESDISTI
SRNLKLLTSK ESLQLIVEYI PRIITTWSPA TRLIGYDLLR ISIPRVTTVD LLRSTEGAEA
VLKAISSGLD VADASTIPLL MMILKTLNNL IGNTLFVQLY IDPNEDGTYT YNKFFLDLLS
TLTAKVIDIT ENGKFHKLYN TTVTTIATLL YNLSVYHLQT SGLKKNPRSS ESIVQFTNEV
GDLIVESNSE AAYRLAIAYG NLRYAKAFEP VPEWLNKAGE LYISKGEQRF IDLARDLQSL