Gene PICST_28602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28602 
Symbol 
ID4851370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1626298 
End bp1627851 
Gene Length1554 bp 
Protein Length517 aa 
Translation table 
GC content40% 
IMG OID640393078 
Productpredicted protein 
Protein accessionXP_001387953 
Protein GI126274411 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.478127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACC CCTTGGTAGA AATGTTCGAT GGATTTCCTA TTTTTCAGCA ATTTACCCTT 
GGAAAGCTCC AAGATGCTGA AGTCATGGCA TTTACATCCG TGTACCCCAA TCTCTACTTC
ATGATCAAGC ATTTTGGTAT CGCAGAGGAC GATTCACATA TCGCTACCTA TAGTGGATAT
TTGGGTGCCG CCTATTTACT TGGTGAATAC ATAAGCTCGC TGTATTGGGT CAAAGCTTCG
AATAAATACG GCAGAAAAAC CATACTCTTG TATGGGCTCG CAAGCACAGC ATTCTCCTTG
CTCATTTTTG GATTCAGTAC AAACTTTTAT ATGGCTCTTC TTGCAAGGTT CTTCATGGGG
TTATGCGGTG GTAAGAGTCA AGTCTATAGA AACACAATGG AAGAAATCGC CCTTGAAGGT
AGACATAAAC ATCACGCCTT AACATCACTC TCGCAAAACT GGACTTCTGG GATATTGATG
GGTTACTTTT TTGGAGGATT ATCAAGTCTT TCATATAAGT CTGACATAAA GTATGATGGG
CTGTTACTTT CGAAGTATCC ATTTCTTCTT TCAAACCTTA TAATTATCAG CGTTATTGTA
GCTGAAATTA TCATGGGCTG GCTCTTTTTG GAGGAAACAC ATGAACAGAT AAAGTATGTG
CGAGACATCG GTTTAGAAAA GGGAGATTCC ATTAGGCGTA TGTTGGGATT CCAAGTGCCA
GAGAGACCTT GGCAGCTAAG AGAACAAGAT CCCAAAGTAG ACCAACAACC CTTTGACGAC
AATATGAAGT TGACAGAAAG GCACGTCGTT CCTTACCAAA TAAGAAACGA CCTGGTGTAT
ACCCCCGAAG AACTGTCGAC TGATACAGAA ACATATGAAG AATTTGAGCT GGTGAGATCA
TTGGCTACAT GGAATCGCAT AATTAACAAT TATATGTTAT GCTTTCAGAA TACATTCTTC
TTCGAGTTTT TTCCAATTTT TCTTGCTAGT CCCCTCAGAG AAGGGGATCT AAAGTTCCCA
TTTCAAATTA AAGGGGGATT CAGTTACAAT GCATATGGTA TTGGGATGCT TACATTTCTT
GCGGGATATA TTGGGTCAGT TTTTGAAGTT CCGCTTTCTA TTATTAGAGT GTACTTTGGG
AGAAAGTGTG TGGCTGGAAT TGCCCTTCTC GTATACCCCA TCACTTACTT CTTGTTGCCT
TTATATCTTT TTACACTGCA CGAGTACAAT AAGGGAATAT CCAAGTCGTT GGCAAATTTA
TTACTTGTGG TGAACATTTC TGTTGTTTGG TTATTTAAAT CCTTTACATT CCCCCTGTAT
CAAAGTTATT TTGATATTTC GTCTTCCAAA GAGCAAAGGC GGCCAACTAA TAGTTATTCG
ATTAGATTCA TCACGTTGGC TAAGTGTGCC ACCCCGATTA TTGGAGGCTG GATGATATCA
ATTTTCGATG CGCAAGGATA CGGAGGTACT CCTTGGTGGA TCCTTTCAGT TTGGTCAACC
ATGACATTAT TGCACTCTAT TTACATCGAT AGAAGAAGTG TAGCATTAGC ATAG
 
Protein sequence
MTNPLVEMFD GFPIFQQFTL GKLQDAEVMA FTSVYPNLYF MIKHFGIAED DSHIATYSGY 
LGAAYLLGEY ISSLYWVKAS NKYGRKTILL YGLASTAFSL LIFGFSTNFY MALLARFFMG
LCGGKSQVYR NTMEEIALEG RHKHHALTSL SQNWTSGILM GYFFGGLSSL SYKSDIKYDG
LLLSKYPFLL SNLIIISVIV AEIIMGWLFL EETHEQIKYV RDIGLEKGDS IRRMLGFQVP
ERPWQLREQD PKVDQQPFDD NMKLTERHVV PYQIRNDLVY TPEELSTDTE TYEEFELVRS
LATWNRIINN YMLCFQNTFF FEFFPIFLAS PLREGDLKFP FQIKGGFSYN AYGIGMLTFL
AGYIGSVFEV PLSIIRVYFG RKCVAGIALL VYPITYFLLP LYLFTLHEYN KGISKSLANL
LLVVNISVVW LFKSFTFPLY QSYFDISSSK EQRRPTNSYS IRFITLAKCA TPIIGGWMIS
IFDAQGYGGT PWWILSVWST MTLLHSIYID RRSVALA