Gene PICST_30719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30719 
Symbol 
ID4838207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp767354 
End bp768889 
Gene Length1536 bp 
Protein Length511 aa 
Translation table12 
GC content48% 
IMG OID640389522 
Productpredicted protein 
Protein accessionXP_001383780 
Protein GI150864804 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.119081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.810962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCAC CTCCGGCTTC TGCCGCAACC ACGGCTCCGG AATTGGCCAC AACCCCTGCT 
CCAACCTCGG TTTTGTCCCA GGAAAGAGCC TCCTCTATAG GCTGGTACTT CATCGTATCC
TACTACGACT TCTACAACAC CAACATCGAG AATATCCACA AGATATACCA CCAAAACGCT
CTGATTTCTC ACGACTCCTT CCCAGTGGAT TCTGCTAATA CTGCTGAAGA CGAAGTTAAA
ACCATCCATG CTGCTCATGG CACCGAAGCT ATCAGAACTC GTTTCAAGAA CGATCCAGAG
TTGAAAGCAA ACAACCGTAT CGTAGTCACT TCGGCCGCGT TTGAAGTTTC GTTGGAGAAG
AACATCTTGA TTGTCGTATT TGGTGAATGG GCCAAAGAAG ACTCTGTCTA TCACCAATTC
ACCCAGACCT TCGTCTTGAC TCCAGGTAAG AAGGAAAACT CATTTGATGT AGCCAATGAT
GTTTTGAGAT TTATCGACTT TGGTGAATTC AAGGCCGTTA AGAAGGAAGA AAAGAAACCT
GTTCGTAACG GAGAAACCAT TAATGCTTCT GCTACTGCAG CAACTACTAC TGAAGCTTCT
ACTAGTACTC CCAAGGCTGC TTCTAGTGCC ACTAGTATTT CTGCTAATTC TACGTCAACG
TCTACTTCGA CTTCTACTGC TGCTCCAACG TCTACTGCTG CTACAACTGT TCCAGTTGCC
GCTGCAACTC CTACTGTTGC TACTGTTGCT GCTTCTGCTG CTTCTGCCAA CACGACCGCT
GTTTCCTCTG CTGCTTCTGC CGAGTCTGAA ACCACTTCCG CTTCTTCGGT CCCATCTGAA
GAGAAACAAA AACCTGAAGC TCCTGTGACT CCAGAACCAG TTGAAAAGTC TGAAACTAAG
GAATCTACAC AAGAACCAGT GAAGGAATTG TCTCCTACTG ACTCTGTGGC CTCCAAGAAC
GAAGAAGAGT CTTCTACTGC TGTGACTGCC AAGTCTGCTC CAGGCCAACC TTTGTCTTGG
GCAGCATTGG CCTCACAGGC CGCTCCTCCT AAGAATAAGC CAGTTGCTGT AGCAAAGTTG
TCTCCAGCCC CAGCCAAGAA AGCTGCTACA ACCCCTCCAG CTAATGGTAT AGGCAGCAAA
AAGAAAGAAG AATGGTACCC CATCTACATT CGGGGGATCA GAGAACTCGA TGAGAAATTG
TTGAGAGATC ACATTTCAAA ACACTTTGGT GAGCTCAAGT ACTTCAAGAC CAATCTGAAC
ATTGCGCTCT GTGACTTTGT CACCTATGAC GCCCAACACA AGGCCCTTGA AGCTGGTGAA
ACCATTGTAG ATGGTATCGT CATCCTGTTG GAACCTCGTG AGTCAAAGAC AGGCAACAGT
TACCACAGCA TCAACAAGAA AAAGGACAAG CCCGCGGGCT CAGCTGTCAC AAATAGCAAA
CAGTCGCCAC AACAGACTCC ACAACAGAAG GCTGCCAAGG GCGAGAAGAA AGTTGTTGGG
AAGAAGAGCA ACCGCACTGC TACGCGTAGT GATTAA
 
Protein sequence
MSPPPASAAT TAPELATTPA PTSVLSQERA SSIGWYFIVS YYDFYNTNIE NIHKIYHQNA 
SISHDSFPVD SANTAEDEVK TIHAAHGTEA IRTRFKNDPE LKANNRIVVT SAAFEVSLEK
NILIVVFGEW AKEDSVYHQF TQTFVLTPGK KENSFDVAND VLRFIDFGEF KAVKKEEKKP
VRNGETINAS ATAATTTEAS TSTPKAASSA TSISANSTST STSTSTAAPT STAATTVPVA
AATPTVATVA ASAASANTTA VSSAASAESE TTSASSVPSE EKQKPEAPVT PEPVEKSETK
ESTQEPVKEL SPTDSVASKN EEESSTAVTA KSAPGQPLSW AALASQAAPP KNKPVAVAKL
SPAPAKKAAT TPPANGIGSK KKEEWYPIYI RGIRELDEKL LRDHISKHFG ELKYFKTNSN
IALCDFVTYD AQHKALEAGE TIVDGIVISL EPRESKTGNS YHSINKKKDK PAGSAVTNSK
QSPQQTPQQK AAKGEKKVVG KKSNRTATRS D