Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_77798 |
Symbol | |
ID | 4838798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 82273 |
End bp | 84582 |
Gene Length | 2310 bp |
Protein Length | 605 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390113 |
Product | predicted protein |
Protein accession | XP_001384314 |
Protein GI | 150865197 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATATTTATTA TCATTAGAGC CAACTACCAG TTATCGAAGT TTCTCGTAGT TGCCGCATAA CAGCATTTGT GAATTATCAT TCTTTTGCTG TATTTTACTC AAATATCTCC GGATTTTTAT ACTCTTGTGA GATTTTTCAG TGCAGAAAGC GAACCGGTTC TTGTTATTTG TAAGACAATA AGGTCGCTAG AATCGTTACC ACGGCTCAAA GTTCTTCGGA AAGGATTCAG TTTTAGTTGA TTTGGTATTT TACTCAATTT TGACACAATT CAAGAGAATC AACAGTGTAT CATATTCAAC TGGAATCATA GAAGAATACA CACATCGAAA CATAACGACT ACTATCAAGG GTCTATTTAT CATAATCTTC GTCTATTTTA AATAGTATAC AACTTTTGTA TACTTTCCTA ATCTAAGAAC CTATAATTAA TATGTCTTTT GTTGGAGGTG GTGCCGACTG TTCTGTCAAC GGCAATGCAA TTGCTCAGTT CAACAAACAT ACACAACAGG ACAGATCTCT CCAAAGACAA GCGGCAAACC AGCCAGCAAT CTTGCAGAGT CAGGGCTTCA AACAAGAGAA TCTTATGAAT GCTCGTGATC GCCAGAATTT GGACAACTTC ATGAATGCCA ACAATGGCCA GAACAGCTTC CAGTTCCAAC CAATGAGACA CGAGTTGAAC ATGATTCACA ACCACCAACA GCCTCAACAC CAGCATAACT GGCTGGCCGA GTTCAAAACT CATTCTCCTT CGCCAATTCC GCAGGTAGCC GCTCCTATTG CTCCTATTGC CAAGACTGGT TCTCCATTAA ACGCCCAATG GGCGAACGAG TTTCAACCCA TGGATCAAGC AGTGGCTCGT CAGAATCCAC AGCAAATGAA CGCCATGCCC TCTATGATGA TGGGAGCCTA TAGGCCTATG ATGAGCATGC CTATGATGAC AAACAATGTT GCACAGCAGC AACAGCAAAC ACCTCAGAAC CAGGAAGTAC AGGTTGACTG GGATGACCAT TTCAAGCAGA TGGAAGAACT CGATAGTAAG GTGGAAGAAA AGATTGAAGA AGCTACAGCT GATCCCGAAG AGATCGCTCG AGAAGCCTCG CCAGATTTCG TCATTGACGA CAAATACCAG GCTACTTTCC AAGAAGTATG GGATGGGTTA AACAGCGAAG CAATAGAAGC GGATTTCATC AGCCAGCAGT ATGAAGACTT CAAGAACACA CAGAAAGAGA CTTTCCCACC AGATATGAGT CAATGGGAAA AAGACTTTTC TCGTTACGTT TCCACCAGAG CTCATTTTGG AGACTATCAA TTTGAAGATA GGCAGAATAA CCAATTCTTG GATTTGCCAG CTGAGAATGA CCCTTATGAA ATTGGCTTAC AGCTTATGGA AAATGGTGCC AAGTTGTCAG AGGCTGCGTT GGCATTTGAA GCTGCTATTC AGAGAGACGA GGGGCACATC AATGCCTGGC TCAAATTGGG AGAGGTGCAA ACCCAAAATG AAAAGGAGAT TGCAGGTATT TCAGCATTGG AAAAGTGTTT GGAATTGAAT CCTGAAAACT CCGAGGCTTT GATGACGTTG GCTATATCCT ACATTAACGA AGGCTACGAC AATGCTGCAT TTGCCACCTT GGAAAGATGG ATTTCTACTA AATACCCTCA AATTGTAGAC AAAGCTCGTG CGCAAAACCC AGAAATTACC GACGAAGACA GATTTTCGTT GAACAAACGT GTCACTGAAC TTTTCCTCAA GGCTGCTCAG TTATCGCCTA GTGCAGCTAA CATGGACGCA GATGTTCAGA TGGGTTTAGG TGTCTTGTTC TACGCAAACG AAGAGTTTGA CAAGACTATC GACTGTTTCA AAGCAGCATT AAGTATCAGA CCTGACGATC CTGTGTTGTG GAATAGATTG GGTGCCTCGC TTGCTAATTC GAACCGTTCT GAAGAAGCCG TAGATGCATA CTTCAAAGCT TTGCAGTTGA AGCCTACCTT TGTCAGAGCC AGATACAATT TGGGGGTGTC TTGTATCAAC ATCAGATGCT ACAAAGAGGC AGCTGAGCAT CTCTTGAGTG GTTTGTCCAT GCACCAAGTG GAAGGCGTAG AGAATGACAG CACACTCAAC CACAACCAAT CTACAGCTTT GACAGAGACC TTGAAGAGAG CATTCATTGC TATGGATAGA AGAGACTTGT TAGAGTTGGT CAAGCCTGGA ATGGACTTGA CTCCGTTCAG AAAGGAGTTC AACTTCTGAG GATACGATAC ATAGTAATGC TTATATGGGT ATGTAGGTTA AATATTTATA CTTACAAAAT
|
Protein sequence | MSFVGGGADC SVNGNAIAQF NKHTQQDRSL QRQAANQPAI LQSQGFKQEN LMNARDRQNL DNFMNANNGQ NSFQFQPMRH ELNMIHNHQQ PQHQHNWSAE FKTHSPSPIP QVAAPIAPIA KTGSPLNAQW ANEFQPMDQA VARQNPQQMN AMPSMMMGAY RPMMSMPMMT NNVAQQQQQT PQNQEVQVDW DDHFKQMEEL DSKVEEKIEE ATADPEEIAR EASPDFVIDD KYQATFQEVW DGLNSEAIEA DFISQQYEDF KNTQKETFPP DMSQWEKDFS RYVSTRAHFG DYQFEDRQNN QFLDLPAEND PYEIGLQLME NGAKLSEAAL AFEAAIQRDE GHINAWLKLG EVQTQNEKEI AGISALEKCL ELNPENSEAL MTLAISYINE GYDNAAFATL ERWISTKYPQ IVDKARAQNP EITDEDRFSL NKRVTELFLK AAQLSPSAAN MDADVQMGLG VLFYANEEFD KTIDCFKAAL SIRPDDPVLW NRLGASLANS NRSEEAVDAY FKALQLKPTF VRARYNLGVS CINIRCYKEA AEHLLSGLSM HQVEGVENDS TLNHNQSTAL TETLKRAFIA MDRRDLLELV KPGMDLTPFR KEFNF
|
| |