Gene PICST_85013 details


Gene Information

Locus tag: PICST_85013
Symbol:
ID: 4840705
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 754133
End bp: 755437
Gene Length: 1305 bp
Protein Length: 407 aa
Translation table: 12
GC content: 44%
IMG OID: 640392020
Product: predicted protein
Protein accession: XP_001386342
Protein GI: 150866674
COG category:
COG ID:
TIGRFAM ID:
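
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive; that is an assumption, although it agrees with the reported gene length. The function names are illustrative and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int) -> int:
    """Coding bases needed for the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(754133, 755437)
cds_len = min_cds_length(407)

print(gene_len)            # 1305, matches "Gene Length"
print(cds_len)             # 1224
print(gene_len - cds_len)  # 81 bp of the span not needed for coding
```

The 81 bp surplus is consistent with the record flagging the gene as spliced, though the record itself does not list exon boundaries.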


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.5193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGTC AAAGACCGAA ACTTCTTACG TTTGACGCGT TTGCCAAAAC GGTGGAAGAT 
GCCAGAATCA GAACCACTTC CGGGGGTATC ATTACGCTTT TCTGCATTTT CGTTGTAATG
TTTCTCATTA GAAACGAATA CAGCGACTAT ACCTCGGTGA TTACGAGACC GGAATTGGTC
GTAGACAGAG ATATCAACAA GCCGTTGGAC ATCTACCTCG ATGTGTCGTT CCACAACTTA
CCGTGCGACT TGTTATCGTT GGATATCATG GACGAAGCAG GCGACTTACA GTTGGATATC
TTGAAGTCAG GATTTGAAAA GTTTAGGATT GTCAAAGACA GCGAGGGTAA AATCAAAGAA
GAGATTATTG ACAGGGAATC AACGCCAATT AACGCCGATT TGCTGATAGA GGAAATGGCT
AAAGGCTTGA AAGAGGGCGA AGATGGCGAA TGTGGCTCAT GTTACGGTGC CTTGCCCCAG
GACAAAAAGC AATACTGTTG TAATGACTGT GAAACAGTCA AGCTTGCCTA CGCTGAAAAG
TTGTGGGGTT TCTATGACGG TGAGAACATT GAACAGTGTG AAAACGAAGG CTATGTCCAG
AGAGTTCAAA GTAGAATCAA CGGCAAAGAG GGCTGTAGAA TCAAAGGTAA TGCCAGAATC
AACAGAATCT CTGGAACTAT GGACTTTGCT CCGGGAGCTT CGTTCACCAG TTCTGGTCAT
CATGTCCACG ATCTTTCCCT CTACGATAAG CACCCTCACC TTAATTTTGA CCATATTGTA
AATAAGTTGA CGTTTGGCCC TATTCCTGAT GAGCTGGTTC CTACAGCTGA ATCAACTCAT
CCATTGGATA ACTACGGTGT GGCACTCAAC GACAAGAATC ACGTCTTCAC CTACTACTTG
AAGGTGGTAG CTACACGATT CGAGTTCTTG AACGGTGCCA GTAAGGCTTT GGATGCTAAC
CAGTTCTCCG TCATCACCCA CGATAGACCT ATCAGCGGTG GTAAAGACAA CGACCATCAA
CATACCTTGC ATGCTAAAGG AGGCATACCG GGTGTTGTTT TCCACTTTGA TATTTCACCA
TTAAAAATCA TTAACCGCGA ACAGTACGCC AAGAGTTGGT CTGGATTCGT TCTTGGAGTC
GTCAGTTCAG TAGCTGGTGT TCTCATTGTT GGTTCTCTTT TGGACCGATC CGTGTATGCG
GCCGAGAGTG CCATCAAGGG GAAGAAGAAC ATGTAATGTC TTATAGATGT ATAATAGCTA
GAAATGTATT AGAAGAATTT AGTACTATGC ATGAAACAAT AATCC
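
A GC percentage like the one in the Gene Information block can be recomputed from a sequence laid out this way. The following is a minimal sketch, not the database's own method: it assumes the value is the share of G and C among all bases, and the helper name `gc_percent` is hypothetical. Whitespace between the 10-base groups is ignored.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, used only as a short demonstration.
first_row = "ATGTCTGGTC AAAGACCGAA ACTTCTTACG TTTGACGCGT TTGCCAAAAC GGTGGAAGAT"
print(gc_percent(first_row))
```

Passing the full gene sequence instead of a single row gives the whole-gene figure to compare against the reported value.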
 
Protein sequence
MSGQRPKLLT FDAFAKTVED ARIRTTSGGI ITLFCIFVVM FLIRNEYSDY TSVITRPELV 
VDRDINKPLD IYLDVSFHNL PCDLLSLDIM DEAGDLQLDI LKSGFEKFRI VKDSEEEIID
RESTPINADL SIEEMAKGLK EGEDGECGSC YGALPQDKKQ YCCNDCETVK LAYAEKLWGF
YDGENIEQCE NEGYVQRVQS RINGKEGCRI KGNARINRIS GTMDFAPGAS FTSSGHHVHD
LSLYDKHPHL NFDHIVNKLT FGPIPDESVP TAESTHPLDN YGVALNDKNH VFTYYLKVVA
TRFEFLNGAS KALDANQFSV ITHDRPISGG KDNDHQHTLH AKGGIPGVVF HFDISPLKII
NREQYAKSWS GFVLGVVSSV AGVLIVGSLL DRSVYAAESA IKGKKNM
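
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below illustrates that difference on two short fragments copied from the gene sequence above. It builds the codon table by hand instead of using a library, and the names `codon_table` and `translate` are hypothetical. Because the gene is marked as spliced, the full gene sequence is not expected to translate straight through to the protein shown.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def codon_table(ctg_as_serine: bool) -> dict:
    """Standard codon table, optionally with the table 12 change CTG -> S."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(seq: str, table: dict) -> str:
    """Translate frame 1, ignoring whitespace and any trailing partial codon."""
    bases = "".join(seq.split()).upper()
    return "".join(table[bases[i:i + 3]] for i in range(0, len(bases) - 2, 3))


table_12 = codon_table(ctg_as_serine=True)
standard = codon_table(ctg_as_serine=False)

# First row of the gene sequence: no CTG codon, so both tables agree.
first_row = "ATGTCTGGTC AAAGACCGAA ACTTCTTACG TTTGACGCGT TTGCCAAAAC GGTGGAAGAT"
print(translate(first_row, table_12))  # MSGQRPKLLTFDAFAKTVED

# In-frame fragment from the gene sequence that contains a CTG codon.
fragment = "GATGAGCTGGTTCCT"
print(translate(fragment, table_12))   # DESVP
print(translate(fragment, standard))   # DELVP
```

The table 12 reading of the fragment, DESVP, is the one that appears in the protein sequence above.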