Gene PICST_31436 details

Gene Information

Locus tag: PICST_31436
Symbol:
ID: 4838548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 801898
End bp: 803037
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 12
GC content: 41%
IMG OID: 640389863
Product: predicted protein
Protein accession: XP_001384452
Protein GI: 150865298
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.120567
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATCT CCAAGATAAC CAGCCCAGAC GATTGGTCGT ACTTCGCGAA GGGCGCAGCC 
AACATCCTCT TCAAATACAA CGGGCCCAAC GACTACTTGA GACACAAACT ACTTAGAGTA
AGACTTCTCA AACAGGAAGA CCAGTACATT TCCACCTGCG AGCTCTACGA CTTCATCGAG
CTCAAATGCA AGCACCTATT TCCACAGCAG ATCATTGACA TTCAGCTTAT AGTGCTAACC
ACAGACTTTG TCAACCAATT GGACTCCAAA GGAAACCAGT TGATGCTAAA AGAAAGATAT
GGCTTGCTTA TTCCGAACAT ACTTGACGGA GACTATGAGA AACAGGTCCT ACTGAAAAAT
TGCACTTTGT ACTACGATTC AAATTCAAGT ACAACAAATA TCAGCACGAA TTCAGATACA
AACAAGATCG ACTCTGTGAT CTTCGAAATA AAACCCAAAT GGCTCTATGA CAACGTTTCG
TCCAACTACT GCCGAACATG CCTGTATAAC CAGCTTCGGC AATACCCGAG ACACTACTGT
CCATTAGATT TCTTGTACAA ACGAACTATC GACACTGGCT TGGACGATTT GTTCAAACCA
ATTGCTCCCA ATATTCTCGA GAATATCGAG AATTCCAACA AAATCCCTTT AAAGAAGTTG
TTCCGCAACT TTCTCAATAA TCCGGAAAAT GTGTTTCAAA AATTGAAACA GTACCAAAGA
ATTAATTCCA AAAACGACCT CATCAAGAAC CTCTCTTCGC CGCTGGATGT GCTGCTGAAC
CTCTCTTTGG TGATGACTTT ACGAGATGTT GGGCTCTTTA TTAAGTTTGA AAAATACAAC
CCCAACAATA ACGTTCACAA TTCACAGAAC AACGTCAACA ACTTGATTGT ACTAGAAGAC
GGCAAGTTCG TCGTTTCATG TAATATCTAC GACTTGGACT TGAAGTCGAA GTTGAAGTAC
AAACACTGGT TGGACGTTGA ACTGAAGCTC CAGGGCATTT ATAACTCCTC TAATGATGAT
TGGAAGTACT GCGTAAGTTA TAATGAAGCC ACAGACGCAG ACTTCAGCAG AGACTTGGCT
AACGAAACTG TCGATCCTAG TTCAAACAAT GAAGAAATGG AAGTAGACAT AGGAATATGA
 
Protein sequence
MEISKITSPD DWSYFAKGAA NILFKYNGPN DYLRHKLLRV RLLKQEDQYI STCELYDFIE 
LKCKHLFPQQ IIDIQLIVLT TDFVNQLDSK GNQLMLKERY GLLIPNILDG DYEKQVLSKN
CTLYYDSNSS TTNISTNSDT NKIDSVIFEI KPKWLYDNVS SNYCRTCSYN QLRQYPRHYC
PLDFLYKRTI DTGLDDLFKP IAPNILENIE NSNKIPLKKL FRNFLNNPEN VFQKLKQYQR
INSKNDLIKN LSSPSDVSSN LSLVMTLRDV GLFIKFEKYN PNNNVHNSQN NVNNLIVLED
GKFVVSCNIY DLDLKSKLKY KHWLDVESKL QGIYNSSNDD WKYCVSYNEA TDADFSRDLA
NETVDPSSNN EEMEVDIGI
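
Note on translation table 12

The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal Python sketch, not the database's own pipeline, showing how the gene sequence above maps to the protein sequence under that table. The function names (build_codon_table, translate) are hypothetical helpers introduced for illustration; the two amino-acid strings follow the NCBI genetic code layout with bases ordered T, C, A, G.

```python
from itertools import product

# NCBI genetic-code layout: codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code at one position: CTG -> S instead of L.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(aas):
    """Map each of the 64 codons to its one-letter amino acid ('*' = stop)."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


def translate(dna, aas=TABLE_12_AAS):
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    table = build_codon_table(aas)
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (spaces in the listing are ignored).
print(translate("ATGGAAATCT CCAAGATAAC CAGCCCAGAC"))   # MEISKITSPD

# Bases 343-360 of the gene contain a CTG codon.
print(translate("CAGGTCCTACTGAAAAAT"))                 # QVLSKN (table 12)
print(translate("CAGGTCCTACTGAAAAAT", STANDARD_AAS))   # QVLLKN (standard code)
```

The listed lengths are consistent with this reading: 1140 bp is 380 codons, that is 379 amino acids plus the terminal stop codon (TGA).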