Gene PICST_30681 details

Gene Information

Locus tag: PICST_30681
Symbol:
ID: 4837674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 665254
End bp: 667623
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 12
GC content: 38%
IMG OID: 640388989
Product: predicted protein
Protein accession: XP_001383410
Protein GI: 150864550
COG category:
COG ID:
TIGRFAM ID:
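
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 665254
end_bp = 667623

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.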


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0209542
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAAG AGAACAGTAC ATCCGATACA GTTAATCTCG ATGAGCTCGC GGCCAATATC 
AAGACATCCA TAGCTCAATT GGTAGATGTC TCTTTTTCCA ATGATCCTCT ACCAGTATTG
TTAGATCTCC AGGACCCTGT AGCAAAATGT GGACTAGACG AGTCTGATCT CAACAATCTC
TTCGATTTGA TCTTTGACTT GAAGAAGGCT GCCGTCCTCA ACAATACTGA TAAGAAGTTT
GTTATTAAAA GTCTACTACT TCCTCGTCCT GGAAGTTACC CAGCTATGGA TATAGTCTAC
AGAATCCTAA GCAATATTGG AACTCCGCAG AGTATCTACA AAGATGGGTC TACCGCTAAA
CTACGATTAC TCCCTACAGC AATTCAACTT CAACTACTTG ATTGGTTGGT TTGTTCACTT
CATTTGTTTG GCCCTCAGGT ATACACTTCG TTGCGTAGAA TGCTTCCTAT CTTATTCAAA
TTGATAACAT ATGAGTACCT GCGTCCCTAC ATAGCAAACA TCATATTTCT TACTATACTG
AGCAGTTCAA CTTCAGTTGA GACATACCTA AACAGACGAA ATATCGGGCG TATCAATTCG
TGGAAACCGT GGCATGTCCA ATTGGTAGTT GACCTTTTCG GAAGATTTCC GTTGGATGAG
TACCTCAAAA GTTTACTAAT TCTATTCAAG ACATTGGATC CAACTATTGA TTTCAACAAG
TTTGGAAAAG ACATTGCTTA TGATATGAAT AAGTTAGTAC TAACGTCGTC ACGACCATTT
GTTTATCCCA ACTTTGATTA CCTCAGTCGA TTGAAAGAAT TGAAATACAT CAATGATATT
TCCAGCCATA ACAGCCTGGC AACAATCCTT CAAGATAATC TAGACAAATA CCGAAATTTC
AGCCGTGCTA TGAACAGAAA CAAGAGACAA AAATTTAACA ACAATACTGT TGCAATAGAC
ATCGATACCC TTGATTTTCA TAACTCTAGT AATGATACTG CCATAACTGA AGTGCACTCA
TTTGAAGCAT TGGTGCAATC TTTAGATAAA ATCAAACACA TAAACATTCG ACTGATATTC
ACTAGCGACT GGGACCAAGG TTATCTTGCA ATGGCAAAGA GGTACTACTG TATCCTATCC
AATTTGACAG TAAATGACTC CAGTGAAGCT TTAAAGAAAT TAGACTTTTT CATCCGTCTC
AGTATACTTG ATGACAATAT AAACATCAAA GAACTTTCGT CATTTTGCGA TAGATTATGT
CAGTTCTTGC TACTTGGGGC GGATACCATC ATTTTACCAC TGGTAAAAGA TTACGTTATG
TTCAACTATA GTGCCCAGCC CATAGGTTCA GATGAAACTA ATGTCGAGTA CTATAATCTC
AAGGAAAGGC TTAAACTTTT GAAATTTTTG CCAATGGTTG CTGAACAGGA ATTTCAAAAC
TCAGTACTTA ATCCTATTCT TGCATTATTG TCAAGAGCTA CTAGACACAG ATTACTCAAA
TTGAGAAAGG AATGTACTCT GCAATTCATT ACAGAGTTGA TGTTCATATT TTCCAAGTGG
TACGACCAAG TCAAGGATTT GAACACTTAC AAAGAACAGA AGTTCAATTA TTTTGCCATT
ATTAACCAGT CGCTTCCTAA GTTATATGCA TTTCTTCTGG ACATCTACAA AACAGAAAGT
GATTTTTCAA ATTTCTTGTT GATACAAGTG CTAAGATTTG TGAGGAGCAT TGAAGAAGAT
GATGTGAACG AGCTCTTTAA TGACAGTAGT ATCATGCTTC CTCAAATACT AGTACATACT
TTATTGTTCA AGAGTAACCC CTTCTTGTTT TCCGAGTTGT GTGGTTTCAT AGCTTGGACC
AAGAGGTATA GTCACAGGGA TTTGAACAAT AGATCCATCC AGAATTCTTA CATTATGGAC
ACGCTCAATT TCATATGGAG AGACAAGGCA TTTCATCTTG AAAAATCAGC TTCTTCTCCT
AGCAAAGCTT TTCAATTGAA TTCAGATTTC GTCACCGCAA TCTCCAGTTC GCATATGTTC
AATTCTGTAA GTACTGGCTC ATTAAGCAAA GTAGGCAACC TCTTTGTGAA CCCAGCCTGG
TCATATATTG TGGCTCAGCT CGTCTGGGGC TTTGAAGATG GAAATGACAG CATTACAACG
AGACACCAGG GTCCCATTAG TAAAGAAAGC ATCGATCAAC TCAATGGAGA CTCCGATGTG
AGATGGTTAT CGGTGAACTA CGACGAGTTA AAGCTAAAGA TTCTTCAAGA ATTAGACGCT
CTTGGATTTA CAGGGTTCTG TGATTTGCTT TTCGGTTCGT TGAAACCACT TCTGGGAAAG
AGAAAGCACA GTATCGAAAC AATATACTGA
 
Protein sequence
MDEENSTSDT VNLDELAANI KTSIAQLVDV SFSNDPLPVL LDLQDPVAKC GLDESDLNNL 
FDLIFDLKKA AVLNNTDKKF VIKSLLLPRP GSYPAMDIVY RILSNIGTPQ SIYKDGSTAK
LRLLPTAIQL QLLDWLVCSL HLFGPQVYTS LRRMLPILFK LITYEYSRPY IANIIFLTIS
SSSTSVETYL NRRNIGRINS WKPWHVQLVV DLFGRFPLDE YLKSLLILFK TLDPTIDFNK
FGKDIAYDMN KLVLTSSRPF VYPNFDYLSR LKELKYINDI SSHNSSATIL QDNLDKYRNF
SRAMNRNKRQ KFNNNTVAID IDTLDFHNSS NDTAITEVHS FEALVQSLDK IKHINIRSIF
TSDWDQGYLA MAKRYYCILS NLTVNDSSEA LKKLDFFIRL SILDDNINIK ELSSFCDRLC
QFLLLGADTI ILPSVKDYVM FNYSAQPIGS DETNVEYYNL KERLKLLKFL PMVAEQEFQN
SVLNPILALL SRATRHRLLK LRKECTSQFI TELMFIFSKW YDQVKDLNTY KEQKFNYFAI
INQSLPKLYA FLSDIYKTES DFSNFLLIQV LRFVRSIEED DVNELFNDSS IMLPQILVHT
LLFKSNPFLF SELCGFIAWT KRYSHRDLNN RSIQNSYIMD TLNFIWRDKA FHLEKSASSP
SKAFQLNSDF VTAISSSHMF NSVSTGSLSK VGNLFVNPAW SYIVAQLVWG FEDGNDSITT
RHQGPISKES IDQLNGDSDV RWLSVNYDEL KLKILQELDA LGFTGFCDLL FGSLKPLSGK
RKHSIETIY
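
The record lists translation table 12, which in the NCBI numbering is the alternative yeast nuclear code: it matches the standard code except that CTG is read as serine rather than leucine. The sketch below illustrates how the gene sequence relates to the protein sequence under that table. It is a hedged illustration, not the database's own pipeline; the names `AAS_TABLE_12`, `CODON_TABLE`, and `translate` are hypothetical helpers introduced here.

```python
# Codon order used by the NCBI genetic-code listings: T, C, A, G at each position.
BASES = "TCAG"

# Amino acids for translation table 12 in that order.
# Assumption: identical to the standard code except CTG -> S (serine).
AAS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLSPPPPHHQQRRRR"  # first base C (CTG is S here, L in the standard code)
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)

# Map every codon to its amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AAS_TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGGACGAAG AGAACAGTAC ATCCGATACA"))  # MDEENSTSDT

# A fragment from the fourth line of the gene sequence that contains a CTG codon.
print(translate("TTGATAACAT ATGAGTACCT GCGTCCCTAC"))  # LITYEYSRPY
```

The second fragment shows why the table matters: its CTG codon appears as S in the protein sequence above (LITYEYSRPY), whereas the standard code would give L at that position.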