Gene PICST_77798 details

Gene Information

Locus tag: PICST_77798
Symbol:
ID: 4838798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 82273
End bp: 84582
Gene Length: 2310 bp
Protein Length: 605 aa
Translation table: 12
GC content: 42%
IMG OID: 640390113
Product: predicted protein
Protein accession: XP_001384314
Protein GI: 150865197
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
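
Note on the coordinates: the listed gene length matches the start and end positions if the coordinates are read as 1-based and inclusive (an assumption; the record does not state its convention). The minimal Python sketch below checks that arithmetic and compares it with the nucleotides needed to encode the 605 aa protein plus one stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 82273
END_BP = 84582
PROTEIN_LENGTH_AA = 605


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 2310, matches "Gene Length"
print(cds_len)             # 1818
print(gene_len - cds_len)  # 492 bp of the listed span is not coding
```

The 492 bp difference is consistent with the gene sequence listed further down, which begins upstream of the first ATG codon.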


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATATTTATTA TCATTAGAGC CAACTACCAG TTATCGAAGT TTCTCGTAGT TGCCGCATAA 
CAGCATTTGT GAATTATCAT TCTTTTGCTG TATTTTACTC AAATATCTCC GGATTTTTAT
ACTCTTGTGA GATTTTTCAG TGCAGAAAGC GAACCGGTTC TTGTTATTTG TAAGACAATA
AGGTCGCTAG AATCGTTACC ACGGCTCAAA GTTCTTCGGA AAGGATTCAG TTTTAGTTGA
TTTGGTATTT TACTCAATTT TGACACAATT CAAGAGAATC AACAGTGTAT CATATTCAAC
TGGAATCATA GAAGAATACA CACATCGAAA CATAACGACT ACTATCAAGG GTCTATTTAT
CATAATCTTC GTCTATTTTA AATAGTATAC AACTTTTGTA TACTTTCCTA ATCTAAGAAC
CTATAATTAA TATGTCTTTT GTTGGAGGTG GTGCCGACTG TTCTGTCAAC GGCAATGCAA
TTGCTCAGTT CAACAAACAT ACACAACAGG ACAGATCTCT CCAAAGACAA GCGGCAAACC
AGCCAGCAAT CTTGCAGAGT CAGGGCTTCA AACAAGAGAA TCTTATGAAT GCTCGTGATC
GCCAGAATTT GGACAACTTC ATGAATGCCA ACAATGGCCA GAACAGCTTC CAGTTCCAAC
CAATGAGACA CGAGTTGAAC ATGATTCACA ACCACCAACA GCCTCAACAC CAGCATAACT
GGCTGGCCGA GTTCAAAACT CATTCTCCTT CGCCAATTCC GCAGGTAGCC GCTCCTATTG
CTCCTATTGC CAAGACTGGT TCTCCATTAA ACGCCCAATG GGCGAACGAG TTTCAACCCA
TGGATCAAGC AGTGGCTCGT CAGAATCCAC AGCAAATGAA CGCCATGCCC TCTATGATGA
TGGGAGCCTA TAGGCCTATG ATGAGCATGC CTATGATGAC AAACAATGTT GCACAGCAGC
AACAGCAAAC ACCTCAGAAC CAGGAAGTAC AGGTTGACTG GGATGACCAT TTCAAGCAGA
TGGAAGAACT CGATAGTAAG GTGGAAGAAA AGATTGAAGA AGCTACAGCT GATCCCGAAG
AGATCGCTCG AGAAGCCTCG CCAGATTTCG TCATTGACGA CAAATACCAG GCTACTTTCC
AAGAAGTATG GGATGGGTTA AACAGCGAAG CAATAGAAGC GGATTTCATC AGCCAGCAGT
ATGAAGACTT CAAGAACACA CAGAAAGAGA CTTTCCCACC AGATATGAGT CAATGGGAAA
AAGACTTTTC TCGTTACGTT TCCACCAGAG CTCATTTTGG AGACTATCAA TTTGAAGATA
GGCAGAATAA CCAATTCTTG GATTTGCCAG CTGAGAATGA CCCTTATGAA ATTGGCTTAC
AGCTTATGGA AAATGGTGCC AAGTTGTCAG AGGCTGCGTT GGCATTTGAA GCTGCTATTC
AGAGAGACGA GGGGCACATC AATGCCTGGC TCAAATTGGG AGAGGTGCAA ACCCAAAATG
AAAAGGAGAT TGCAGGTATT TCAGCATTGG AAAAGTGTTT GGAATTGAAT CCTGAAAACT
CCGAGGCTTT GATGACGTTG GCTATATCCT ACATTAACGA AGGCTACGAC AATGCTGCAT
TTGCCACCTT GGAAAGATGG ATTTCTACTA AATACCCTCA AATTGTAGAC AAAGCTCGTG
CGCAAAACCC AGAAATTACC GACGAAGACA GATTTTCGTT GAACAAACGT GTCACTGAAC
TTTTCCTCAA GGCTGCTCAG TTATCGCCTA GTGCAGCTAA CATGGACGCA GATGTTCAGA
TGGGTTTAGG TGTCTTGTTC TACGCAAACG AAGAGTTTGA CAAGACTATC GACTGTTTCA
AAGCAGCATT AAGTATCAGA CCTGACGATC CTGTGTTGTG GAATAGATTG GGTGCCTCGC
TTGCTAATTC GAACCGTTCT GAAGAAGCCG TAGATGCATA CTTCAAAGCT TTGCAGTTGA
AGCCTACCTT TGTCAGAGCC AGATACAATT TGGGGGTGTC TTGTATCAAC ATCAGATGCT
ACAAAGAGGC AGCTGAGCAT CTCTTGAGTG GTTTGTCCAT GCACCAAGTG GAAGGCGTAG
AGAATGACAG CACACTCAAC CACAACCAAT CTACAGCTTT GACAGAGACC TTGAAGAGAG
CATTCATTGC TATGGATAGA AGAGACTTGT TAGAGTTGGT CAAGCCTGGA ATGGACTTGA
CTCCGTTCAG AAAGGAGTTC AACTTCTGAG GATACGATAC ATAGTAATGC TTATATGGGT
ATGTAGGTTA AATATTTATA CTTACAAAAT
 
Protein sequence
MSFVGGGADC SVNGNAIAQF NKHTQQDRSL QRQAANQPAI LQSQGFKQEN LMNARDRQNL 
DNFMNANNGQ NSFQFQPMRH ELNMIHNHQQ PQHQHNWSAE FKTHSPSPIP QVAAPIAPIA
KTGSPLNAQW ANEFQPMDQA VARQNPQQMN AMPSMMMGAY RPMMSMPMMT NNVAQQQQQT
PQNQEVQVDW DDHFKQMEEL DSKVEEKIEE ATADPEEIAR EASPDFVIDD KYQATFQEVW
DGLNSEAIEA DFISQQYEDF KNTQKETFPP DMSQWEKDFS RYVSTRAHFG DYQFEDRQNN
QFLDLPAEND PYEIGLQLME NGAKLSEAAL AFEAAIQRDE GHINAWLKLG EVQTQNEKEI
AGISALEKCL ELNPENSEAL MTLAISYINE GYDNAAFATL ERWISTKYPQ IVDKARAQNP
EITDEDRFSL NKRVTELFLK AAQLSPSAAN MDADVQMGLG VLFYANEEFD KTIDCFKAAL
SIRPDDPVLW NRLGASLANS NRSEEAVDAY FKALQLKPTF VRARYNLGVS CINIRCYKEA
AEHLLSGLSM HQVEGVENDS TLNHNQSTAL TETLKRAFIA MDRRDLLELV KPGMDLTPFR
KEFNF
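
Note on translation: the record specifies translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below is a minimal, dependency-free illustration of that table. The amino-acid string is the NCBI table 12 listing in TCAG codon order, and the helper name `translate_table12` is hypothetical. It is meant only to show why a standard-code translation of the gene sequence would disagree with the listed protein at CTG codons.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12 in TCAG codon order; CTG maps to S here
# (it would be L in the standard code).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate DNA in frame 0 with table 12, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored;
    codons containing unexpected characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the coding region as it appears in the gene sequence above.
print(translate_table12("ATGTCTTTTG TTGGAGGTGG T"))  # MSFVGGG
# A stretch containing CTG, read as serine under table 12.
print(translate_table12("CATAACTGGCTGGCCGAG"))       # HNWSAE
```

Both fragments are taken from the gene sequence above, and their translations agree with the listed protein sequence (`MSFVGGG...` at the start, `...HNWSAE...` in the second row).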