Gene PICST_30689 details


Gene Information

Locus tag: PICST_30689
Symbol:
ID: 4838273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 683021
End bp: 685348
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 12
GC content: 38%
IMG OID: 640389588
Product: predicted protein
Protein accession: XP_001383766
Protein GI: 150864792
COG category:
COG ID:
TIGRFAM ID: [TIGR00756] pentatricopeptide repeat domain (PPR motif)
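
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; neither convention is stated in the record itself. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(683021, 685348)
print(length)                     # 2328
print(protein_length_aa(length))  # 775
```

Both values agree with the Gene Length and Protein Length fields of the record.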


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00144758
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.461418
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGAT ATGCCCGTAC AGTACGATTT CTGCAAGTTG GTAACTCTGT CCGTACTTCT 
AGTGGTAGTT TTGGGCCCAT ACGAACCATT TTCATAAGCT CACACGATAG AAAATCACCT
TCTCATTCTC TTTCGCCCAT ATCAAATCTT CCAAATCATA ACGATTCCTC AACCGAAAGA
GCTAGAAAGA CTTTGGACGA CGATTTTCAA CGTAAAATCA AATATGATGA GCTTAAGACA
CGCAAAAGAA TCGAAGATCT ACGTGCTCTC ACTAAGAAAG TGTCTCAATT AGTAAAGCAG
AAACAGGAAC TATCCAAGAT AGCCAAAATT CCTGTTCCCT CAGATGAAAT CAAAAAACTC
AAAAGTGTTG AAGATGTAGT AAAAGTCACT AAGTCTCAAC ATGTAGCAAC AGTTCGAACT
AATATTCCAG AACCTGAAAC TGTAACTACT CATATAGAGA AAGAAGAATC CGAATTCGTT
ATGGAACAGA ATGCGTTTAT TGTTCCTGCA ACTCCTATTC CAGAAGAAAT AGCAAAGAGA
TTGGGCTTGG CTCTCAGATA TTTGGTTTCG GAAACTAATC AAAACTGGAC CCTTGTACTT
GACCAACTCA AGGCTGACAG AGGTTTCAAA GATTTACCCT ACACAACAGT AGTAGACTTT
CTTACAAAGA TTCCAGCTTC GGAGTTGCAA AAGGTTATTC CTAAAGTCGA CCAGCTTTTG
AAGGAGGCAA AGATACCAAA GACTGCCAAA ATATTGAACT TGTACATAGC CAGTTTGGCA
CTGGGATCAG CTGTGCCCAA CCAAGTAATT CAAATCTTGG AAAACTACTG CAAGAGAATC
AGAAAATTGA AGAAAGGAAA GTTGCCTAAG AGGACCTGCG AAGTTATGGT GCAAGCATAT
GGAAAAAACG GAAATATAAA TAGAATTCAA GATCTCTTAT CAGAGATGAA ACTACACAAA
ATCGAGATAT CAGGTATGGC TCTCACAAAT ATACTAGCCA CTTGTGTCTA CAAGGCTAGA
GATCACAAAC AAGCAGTAGA AATATTCGAT ACAATGAGAT TTCAAAAAGA AGTATATAAA
CCTGGAACAC GAGCATATCA AGATATCATT GTTTCATATG TGAACAATGA CGATATCGAA
AAAGCGATAG ATATATACCG AGAAATGATA ACCGAAAAAA TCGAACCTAA TCAACAAATC
ATGGTGGCTT TAGCTCGAGG TTGTGCTTCA AGAGAGGCCT TTAAATTCAA AAGTTGGGAT
TTCATATTTG AAATTAATAG AAACAACTGG ACCCCGACAT TGCCTACTTA TGAGTATATG
CTCTATTTGT CGTCTAGGGA CGGTGATCTT GCTTTAACCA GGGCATTGTA TTCAAGACTT
TTAAAAGATA ATACTGTTTC ATTGAGGTCG TTCAATTTCT TATTATTGGC GTATTCTAAA
GCCCGTTTGA GTGATGATTT AGGGGAACCA TTTTTGATTA ATGCAGATGA AAAAGGAAGA
AAATTTAGAT TTAACGTAAT TGATAGATCA GGAATTTCTG ACCCCACTAA TCAGTTCCCT
TTTCTTCCCT TTAACGAGCT CACAACTAAA GAGCAAATTA TGGCGGAATC CAGTGCAATT
TGGGCACATG CTTGTTTGAA TAACTCGGAG CTTATCAATA GTGAAAGCAC GACTTCATAC
TTAAACATCG CCAGTGAAAG AGGAACATTG AGTGATTTCA TCGACAGGAT GGAGGGGTCA
ACATTCCTTG ATGAAAAAAT TAACAATGTC TTGTCAGGAG TTGTTATAGA AGAACCAGAC
GTAGTTGTAG AAACTACTGA CTTTTTGACA CAGAAATCAT CAGCTCATGA GAAATACGAC
GAAACTTCCA TAGTAAAATC ACCTATTTTG AAGTCCATTC AGAGCAAAAG AACTCCTAGA
GTCTCATTAA CATATGTGGT TGCGTTGAAG GCAGCTGGAA AATTCAATAA CTACAACTTT
GCCAATAGAA TCTGGCAGGA AAGAGGAAAA TTTAGAAAAA CTGAAACCTT TAAGAAATTA
CCAAGAACCG AAAAAGATAA GCTTGATTTC CAATTTGCTA CTCAAATGGT TCGAACCTTG
ACAGAACTTA ATTTACTTGA AGATGCTTTG GCAGTGTTGA AGAGTACGGA ATATCAATTT
AGATGGACCT GGAAAGAGCT TGATGTCTTG AAGTCAGCTG CTATTAGACA AGGAAATACC
AATGTGGCCC AGACTGTGCG CAGTATCGCA AGAAGGGCAC AATTGACTTA TGAAGGTAAA
ATCAGAAGAA AGGATTACAA GAGGTACGTG ATGCAAAGGG GATACTAA
 
Protein sequence
MLRYARTVRF SQVGNSVRTS SGSFGPIRTI FISSHDRKSP SHSLSPISNL PNHNDSSTER 
ARKTLDDDFQ RKIKYDELKT RKRIEDLRAL TKKVSQLVKQ KQELSKIAKI PVPSDEIKKL
KSVEDVVKVT KSQHVATVRT NIPEPETVTT HIEKEESEFV MEQNAFIVPA TPIPEEIAKR
LGLALRYLVS ETNQNWTLVL DQLKADRGFK DLPYTTVVDF LTKIPASELQ KVIPKVDQLL
KEAKIPKTAK ILNLYIASLA SGSAVPNQVI QILENYCKRI RKLKKGKLPK RTCEVMVQAY
GKNGNINRIQ DLLSEMKLHK IEISGMALTN ILATCVYKAR DHKQAVEIFD TMRFQKEVYK
PGTRAYQDII VSYVNNDDIE KAIDIYREMI TEKIEPNQQI MVALARGCAS REAFKFKSWD
FIFEINRNNW TPTLPTYEYM LYLSSRDGDL ALTRALYSRL LKDNTVSLRS FNFLLLAYSK
ARLSDDLGEP FLINADEKGR KFRFNVIDRS GISDPTNQFP FLPFNELTTK EQIMAESSAI
WAHACLNNSE LINSESTTSY LNIASERGTL SDFIDRMEGS TFLDEKINNV LSGVVIEEPD
VVVETTDFLT QKSSAHEKYD ETSIVKSPIL KSIQSKRTPR VSLTYVVALK AAGKFNNYNF
ANRIWQERGK FRKTETFKKL PRTEKDKLDF QFATQMVRTL TELNLLEDAL AVLKSTEYQF
RWTWKELDVL KSAAIRQGNT NVAQTVRSIA RRAQLTYEGK IRRKDYKRYV MQRGY
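
The record specifies translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the protein sequence relates to the gene sequence under that table. It builds the standard code from the usual TCAG-ordered amino acid string and overrides CTG; the names `TABLE_STANDARD`, `TABLE_12` and `translate_table12` are illustrative, not taken from any library.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_STANDARD = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(TABLE_STANDARD)
TABLE_12["CTG"] = "S"


def translate_table12(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored;
    unrecognized codons are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = TABLE_12.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, as grouped in the record.
first_line = ("ATGCTCCGAT ATGCCCGTAC AGTACGATTT "
              "CTGCAAGTTG GTAACTCTGT CCGTACTTCT")
print(translate_table12(first_line))  # MLRYARTVRFSQVGNSVRTS
```

The eleventh codon of the gene is CTG, and the eleventh residue of the listed protein is S, which is what table 12 predicts; the standard code would give L at that position.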