Gene PICST_81023 details


Gene Information

Locus tag: PICST_81023
Symbol:
ID: 4851805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2875490
End bp: 2878080
Gene Length: 2591 bp
Protein Length: 670 aa
Translation table:
GC content: 41%
IMG OID: 640393513
Product: predicted protein
Protein accession: XP_001387115
Protein GI: 126275654
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID:
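
The gene length listed above is consistent with the start and end coordinates if both are treated as 1-based and inclusive. The following is a minimal sketch of that arithmetic; the function name `gene_length` is hypothetical and the inclusive-coordinate convention is an assumption, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(2875490, 2878080))  # 2591
```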


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TAGCTTGTTG GATATTCTCA TTACTAATTG CATAATTGCT TTCTTCTGCT AGCGTTGAAA 
TATCTTCCGA CAAGAGAGCA ATTTACCGAG TTATTTGACA TAGATACAAA CACCTGTTGA
TCCACATTTC ACATTTTCGT ATATTCATAT TTTCATATCC TTGATCTAGA CCAGGTTGCC
AAGGTGATTA CTAAAAGTTG ACATTACAAA AGTTGATATT ACCAAAAGTT GATATCAAAA
GTTCACAAGA CTGATAAGAG TTGTAAAGTG GATAGATCCC ATAGAGTTGA TTTAAGTCAT
AAAGTCGATA AAAGTTCATA CGGTGTAAAG TCATCAACTA GATAAAAGTC ATACAGGTCG
TATTTCATTC AGTTTTTTCA GTCATCGATC TGTACATCCA TAAAATACAA AACATAAAGA
GCTATTGTAT GCTAACAAAC TACTGTCTCA CAGTCATCTA ATCACATATA ATTCATTTAC
AGTGGACATG TCTGAGCCCC AGGATATTTC CCATTCTACC AGAAGAAGGT CTATAGCCCT
TCTCACCAGA CCACCGATGT CTCAATCGCC AGTTCCTGTG CCAGTTGGAT CGCCAGGAAA
CAGAATACAG TCAGCTTCGA ACTCGTATGT AGCCAACGCC CCTTTTCTCA GGCCAGATCT
GACTTCGCGA GATGAACTTC ATCATAACGA AGAAGTTTTA GAGTCACCAG AGTCATCTAA
AGAGTCTTCC ACTCTCAATT TGCACCTGCA AGTCCGTCTG GCACATATTC TGTCCGAAAA
CGTCGATAGC AACGACTTTT CAGTCGGAAT CGAAGATCCC CGTTTGCTTC TGCTGGTAGC
TAGAATTGTT CCTCAGGACC AGGAGATGGA TTCAGAGGGA GCAGATATAA CCAGGGACTT
GTATAAACTC ACTCAGGAAG TATCTCGCCC ACCTAGCTTG CGTAGAGTTA AGCTGTATTC
GTCTTCCATT GAGCTGACAT CGCGCGAAAG ACGTGAAACT GCCAGCTCCA TTAACGTTCC
TGGTGGATTC AGAAGAGAAT TCATGGTCCA GAAGGCTTTC CACAGCGACC AACTCAGAAC
AAAGACACCA AACTTCTTGA CTAGAAATTT TGTAGAATTC CTCAGCATCT ATGGCCACTT
TGCCGGAGAA GAATTGGAAG ACGAGGAAGA CATTGCTTGT CATTATAAAG CATTTAGTCC
TGTAAAAGTG GATGAAGAAG CTCCGTTGTT GCTCGGGGCA GCAGAACCTG GTCCAGAATA
CAACTCTATC AATACTCGTG GAACTGCTAC TGATACCAAA GCTTATTTCT TGCTACTCAA
GGCTTTTGTA GGAACAGGAG TCTTATTTCT TCCCAAGGCT TTCTCCAATG GTGGTTTGCT
CTTCTCGGTC TTGGTCTTAC TCTTCTTTGG TGTGTTATCG TTGTGGTGTT ACTTGACTTT
GGTTTACTCG AAGATTGCCG CAAAGGTGCT GAGTTTCGCC GAATTGGGTC TCAAACTTTA
CGGAAACTGG CTTCAACGTC TCATTTTGTT TCTGATTGTT ATTTCACAAA TCGGATTCGT
AGCAGCATAT ATAGTGTTCA CACTGGAGAA CTTGAGAGCT TTTGTTTCCA CTGTTTCTGG
TTATGATGTT GGCGACTTTG ACATTGTGTG GTTCATCATT TTCCAAGTGA TTGTTCTTGT
TCCACTTTCA CTCATTCGGG ACATCACCAA ATTGTCGCTT CTGGCTGTGC TTGCTAATTT
CTTCATCCTT ATAGGGCTTG TTACCATCTT GTACTTCATC TTTTACGAAT TGTTGGTAGA
AAACCACGGT TCCATGGGTC CCAACATCGA GTTCTTCTTC AACAAGAACG AATTTTCGTT
GTTTATTGGT GTCGCTATCT TTGCCTTTGA AGGTATAGGT CTCATCATCC CGATCCAGGA
GTCAATGGTG TACCCCAATC ACTTCCCCAA AGTGTTATGT CAAGTGATTG CTACCATTTC
TCTCATCTTT GTCAGTATGG GAGTTCTTGG TTATACGACT TTTGGCTCTG ATATCAAGAC
AGTAATCATC TTGAACTTGC CCCAGAAGCT GCCTTTGATT GTTCTCATTC AATTGCTCTA
CTCCTTTGCT ATATTGTTGC TGACACCATT GCAACTCTTC CCAGCCATCA GATTACTTGA
ACTGAAACTC TTCTTCAGGA AAACGGGAAA GAACTCACTT ACGGTGAAAT GGTTGAAAAA
CATCTTTAGA CTTATCTTTG TCTTGCTTGT TGCCTATGTG GCTTTTGTGG GTGGCCAGAA
CTTAGATAAG TTTGTCTCTT TTGTTGGATG TTTTGCTTGT ATTCCGTTGG TGTATATGTA
TCCGCCAATC TTACATTTGA AGAGTTGTTG TAATATTGAT GACAACATGC TGGAGAAAGA
GAAGAGAAAA AGGTTCTGGT TGGGGGTCGC AGACTATGTG CTTGTGGTCA TTGGTGCTAT
AGCAATGGTA TACACCACCT ACGACATTTT AATCAATTAG CCCTCATCAA CTTGAATCAT
TCCATTAAAG CTTATGATAA TCATAATAGT AAAAGTCATT ATAATCATAT TAATATATTG
CATTTTAAAT T
 
Protein sequence
MSEPQDISHS TRRRSIALLT RPPMSQSPVP VPVGSPGNRI QSASNSYVAN APFLRPDLTS 
RDELHHNEEV LESPESSKES STLNLHLQVR LAHILSENVD SNDFSVGIED PRLLLLVARI
VPQDQEMDSE GADITRDLYK LTQEVSRPPS LRRVKLYSSS IELTSRERRE TASSINVPGG
FRREFMVQKA FHSDQLRTKT PNFLTRNFVE FLSIYGHFAG EELEDEEDIA CHYKAFSPVK
VDEEAPLLLG AAEPGPEYNS INTRGTATDT KAYFLLLKAF VGTGVLFLPK AFSNGGLLFS
VLVLLFFGVL SLWCYLTLVY SKIAAKVLSF AELGLKLYGN WLQRLILFLI VISQIGFVAA
YIVFTLENLR AFVSTVSGYD VGDFDIVWFI IFQVIVLVPL SLIRDITKLS LLAVLANFFI
LIGLVTILYF IFYELLVENH GSMGPNIEFF FNKNEFSLFI GVAIFAFEGI GLIIPIQESM
VYPNHFPKVL CQVIATISLI FVSMGVLGYT TFGSDIKTVI ILNLPQKLPL IVLIQLLYSF
AILLLTPLQL FPAIRLLELK LFFRKTGKNS LTVKWLKNIF RLIFVLLVAY VAFVGGQNLD
KFVSFVGCFA CIPLVYMYPP ILHLKSCCNI DDNMLEKEKR KRFWLGVADY VLVVIGAIAM
VYTTYDILIN
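
Both sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks back into a continuous string and to compute a length and a GC fraction from the result. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the record does not say how its GC content figure was computed, so a value obtained this way may differ from the percentage listed under Gene Information.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (blocks allowed)."""
    seq = join_blocks(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# The first two blocks of the protein and gene sequences shown above.
protein_start = join_blocks("MSEPQDISHS TRRRSIALLT")
print(protein_start, len(protein_start))

print(gc_fraction("TAGCTTGTTG GATATTCTCA"))
```

Pasting the full protein text into `join_blocks` and taking `len` of the result gives a residue count that can be compared with the Protein Length field.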