Gene Information |
Locus tag | PICST_28602 |
Symbol | |
ID | 4851370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1626298 |
End bp | 1627851 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393078 |
Product | predicted protein |
Protein accession | XP_001387953 |
Protein GI | 126274411 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
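
The coordinate and length fields above agree with one another, and a short sketch can confirm that. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed for PICST_28602.
length_bp = gene_length(1626298, 1627851)
print(length_bp, protein_length(length_bp))  # prints: 1554 517
```

Both results match the Gene Length and Protein Length fields listed for this locus.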
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.478127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACC CCTTGGTAGA AATGTTCGAT GGATTTCCTA TTTTTCAGCA ATTTACCCTT GGAAAGCTCC AAGATGCTGA AGTCATGGCA TTTACATCCG TGTACCCCAA TCTCTACTTC ATGATCAAGC ATTTTGGTAT CGCAGAGGAC GATTCACATA TCGCTACCTA TAGTGGATAT TTGGGTGCCG CCTATTTACT TGGTGAATAC ATAAGCTCGC TGTATTGGGT CAAAGCTTCG AATAAATACG GCAGAAAAAC CATACTCTTG TATGGGCTCG CAAGCACAGC ATTCTCCTTG CTCATTTTTG GATTCAGTAC AAACTTTTAT ATGGCTCTTC TTGCAAGGTT CTTCATGGGG TTATGCGGTG GTAAGAGTCA AGTCTATAGA AACACAATGG AAGAAATCGC CCTTGAAGGT AGACATAAAC ATCACGCCTT AACATCACTC TCGCAAAACT GGACTTCTGG GATATTGATG GGTTACTTTT TTGGAGGATT ATCAAGTCTT TCATATAAGT CTGACATAAA GTATGATGGG CTGTTACTTT CGAAGTATCC ATTTCTTCTT TCAAACCTTA TAATTATCAG CGTTATTGTA GCTGAAATTA TCATGGGCTG GCTCTTTTTG GAGGAAACAC ATGAACAGAT AAAGTATGTG CGAGACATCG GTTTAGAAAA GGGAGATTCC ATTAGGCGTA TGTTGGGATT CCAAGTGCCA GAGAGACCTT GGCAGCTAAG AGAACAAGAT CCCAAAGTAG ACCAACAACC CTTTGACGAC AATATGAAGT TGACAGAAAG GCACGTCGTT CCTTACCAAA TAAGAAACGA CCTGGTGTAT ACCCCCGAAG AACTGTCGAC TGATACAGAA ACATATGAAG AATTTGAGCT GGTGAGATCA TTGGCTACAT GGAATCGCAT AATTAACAAT TATATGTTAT GCTTTCAGAA TACATTCTTC TTCGAGTTTT TTCCAATTTT TCTTGCTAGT CCCCTCAGAG AAGGGGATCT AAAGTTCCCA TTTCAAATTA AAGGGGGATT CAGTTACAAT GCATATGGTA TTGGGATGCT TACATTTCTT GCGGGATATA TTGGGTCAGT TTTTGAAGTT CCGCTTTCTA TTATTAGAGT GTACTTTGGG AGAAAGTGTG TGGCTGGAAT TGCCCTTCTC GTATACCCCA TCACTTACTT CTTGTTGCCT TTATATCTTT TTACACTGCA CGAGTACAAT AAGGGAATAT CCAAGTCGTT GGCAAATTTA TTACTTGTGG TGAACATTTC TGTTGTTTGG TTATTTAAAT CCTTTACATT CCCCCTGTAT CAAAGTTATT TTGATATTTC GTCTTCCAAA GAGCAAAGGC GGCCAACTAA TAGTTATTCG ATTAGATTCA TCACGTTGGC TAAGTGTGCC ACCCCGATTA TTGGAGGCTG GATGATATCA ATTTTCGATG CGCAAGGATA CGGAGGTACT CCTTGGTGGA TCCTTTCAGT TTGGTCAACC ATGACATTAT TGCACTCTAT TTACATCGAT AGAAGAAGTG TAGCATTAGC ATAG
Protein sequence | MTNPLVEMFD GFPIFQQFTL GKLQDAEVMA FTSVYPNLYF MIKHFGIAED DSHIATYSGY LGAAYLLGEY ISSLYWVKAS NKYGRKTILL YGLASTAFSL LIFGFSTNFY MALLARFFMG LCGGKSQVYR NTMEEIALEG RHKHHALTSL SQNWTSGILM GYFFGGLSSL SYKSDIKYDG LLLSKYPFLL SNLIIISVIV AEIIMGWLFL EETHEQIKYV RDIGLEKGDS IRRMLGFQVP ERPWQLREQD PKVDQQPFDD NMKLTERHVV PYQIRNDLVY TPEELSTDTE TYEEFELVRS LATWNRIINN YMLCFQNTFF FEFFPIFLAS PLREGDLKFP FQIKGGFSYN AYGIGMLTFL AGYIGSVFEV PLSIIRVYFG RKCVAGIALL VYPITYFLLP LYLFTLHEYN KGISKSLANL LLVVNISVVW LFKSFTFPLY QSYFDISSSK EQRRPTNSYS IRFITLAKCA TPIIGGWMIS IFDAQGYGGT PWWILSVWST MTLLHSIYID RRSVALA
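
The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first three blocks of the gene sequence as sample input, and the helper names are hypothetical.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence, used as a small sample.
fragment = clean_sequence("ATGACCAACC CCTTGGTAGA AATGTTCGAT")
print(len(fragment), fragment.startswith("ATG"), round(gc_fraction(fragment), 3))
```

Passing the full gene sequence through the same two helpers would give a value to compare against the listed GC content field.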