Gene PICST_36080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36080 
Symbol 
ID4838702 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1575908 
End bp1577053 
Gene Length1146 bp 
Protein Length332 aa 
Translation table12 
GC content46% 
IMG OID640390017 
Productpredicted protein 
Protein accessionXP_001384607 
Protein GI150865404 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.494982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATACG TAAAAGATCT CAGTGATATA CCGTTGAAAC CTCTCCAGGA GACTTTCGTC 
GACGACCCAA CCTCACTTGA AGAAATCTAC ATTGACGAAT TAATCTTCGA TCTTGAGCAC
AAGCTCAAGA ACATCAACAA GCTCTCCATT TTCCATGCCA TCTATATCCT TTCTCAGCTG
GTTCAAAACA TCATCAAGCT CCAGTCGGAT CCGGTTTTGT TCCAGCAGTT CAAGAATGAG
CAGTTGGCCA AGTACAATAT CGACTTCTCC AGTAGTAGTT GTAGCAGCAG TAGTACAGTC
ACTGACAGCC ACGAATCTGA CAGCCACGAA TTCACGCGTG TTTCGCTGCA TTCCTTGTTG
CGGTCTCATA CGCCTCCGTT ATCGCCTCCA TTGAAGTTTG CCAAGTTGTC TCAGCCAATT
TACCCTCAGT ATTCGTTTAA GGAATCGACA CCAGACTCTT TGGCTAATGA AGAAGTAACG
CCAGACTCTA TTGAAGAACG TAAGGAATTG GAAGCTGAGT CTGAAGCCCA GCGAAGTCCT
TTCACAGAGC AAGAAGATGA TGACGAGCAA GTAGAAGAAC CGAAGGAGCC TCCGTATATC
CCCATCAAGC AGTTGGTGAA GGAACTCAAG CTTGACCCGG TTTCAGATCC TGTCACTAAC
TTGAATCTCG ACAGCTTCAA GAAAGAAGTT CTATTCAACA GAGACTCTAA GCGTATCGAG
CAGAATCAGC ACCTTCTCAA AATCTTCAAT CTTGTCAAGG TGCCACCTCT TACCATCGAT
GAGTTCTTGC TCCGAATCAA GACGTACTCA TCTAGCATTT CGGTGCTGGC CTACATCCAC
ACGGCATCAA TGATGTTCAA ACTCTGCATT CTTCTTGACA TCATCCCCCT CAGTCCGGTC
AACGTGTACC GGTTCATTTT GGCTTCCTTG CGCTGCTCCA CTAAGAAGTT GGAGGATGTG
TACCAAAAAC AGAAATCGTT TGCTACCGTC GGTGGAGTGT CCACACGGGA CTTGTACCGT
TTGGAAGTGG GCTTTCTTTA TCTATGCAAC TTCAAGTTGG TTCTTGGTGA GGCAACGCTC
AACAAGTTCT TGAACCAGGA CTTTGTCGAC TTGCACACCT TCGTCAAGGA AAACTACCAA
AGCTAG
 
Protein sequence
MAYVKDLSDI PLKPLQETFV DDPTSLEEIY IDELIFDLEH KLKNINKLSI FHAIYILSQS 
VQNIIKLQSD PVLFQQFKNE QLANHEFTRV SSHSLLRSHT PPLSPPLKFA KLSQPIYPQY
SFKESTPDSL ANEEVTPDSI EEQEPKEPPY IPIKQLVKEL KLDPVSDPVT NLNLDSFKKE
VLFNRDSKRI EQNQHLLKIF NLVKVPPLTI DEFLLRIKTY SSSISVSAYI HTASMMFKLC
ILLDIIPLSP VNVYRFILAS LRCSTKKLED VYQKQKSFAT VGGVSTRDLY RLEVGFLYLC
NFKLVLGEAT LNKFLNQDFV DLHTFVKENY QS