Gene PICST_30629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30629 
Symbol 
ID4837686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp548729 
End bp550309 
Gene Length1581 bp 
Protein Length526 aa 
Translation table12 
GC content44% 
IMG OID640389001 
Productpredicted protein 
Protein accessionXP_001383740 
Protein GI150864769 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.555507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.773072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCCC CAGAAAAGGA TGCCTCGCAA GTGGCTGTAG ACCAAAAGTC TGTAGGCTCA 
GTAGTCGAGT TCAAAGACCC AGCTCAGCTT GAGAAACGTT TCAGTCTCTG GAGCACTTGC
GCTTTGCAAT ACTCATTGAT CTGCTCTCCT TTGGCATTAG GAACTTTCTT AAGTACTGTC
ATTGGGGTCG GTGGGTCCCC GGTGTTGATA TACGGATTCA TTCTCTCAGC TACTATGAGT
TTGATCATCT GCTGTTCATT GGCAGAATTG GCTTCCGCAT ATCCTCATTC TTCAGCTCAG
GTCCATTGGA CATACTGTTT GGCTCCCAAA GCGTATAAGA GAGTGTTGTC GTTTGCTTCA
GGTATTCTCT CGTGTGCTGG TTGGATCTTT GCTTGCATCA GCTCTACATA TGTTGCATCC
ATGTTTATTC TTGCTTTGGC TCAGATATAC CATCCGGATT ACGTTCCTAA AAACTGGCAT
TACTACCTTG TATATGTTGC AATTATTGTA GCAGGGTACC TCATCAACGT ATTCTTCGTC
GTCATCTTGC CATACATGAT TGACGTGTTG GTTGCCATCA TCAACTTTGC TACTCTTTTT
GTCATCATCA CTTTACTTAT AAAGTCTGAT CCTAAGCAGT CCGCGAAATT CGTGTTCAAG
AATATCATCA ACGAAACAGG CTGGTCCTCT AATGGGGTGG TATTCTTCTT GGGACTTCTT
CCAAGTATTG CCAGTGTATG TTTGTTTGAT GGGGCTGCTC ATATGACTGA CGAAATTGCC
GAACCAGAAA GAAATATTCC TTTGGTCATG GTGATTTCCA ACAGTGTTTC TGCAGTGGTT
GCACTTTTCG CAGCCATTGT CTACATGTTC TGCATTGTTA GTATGGACAA CCTCAGTGTC
CCATTAGGTG GGCAACCAAT TGTTCAGTTG ATGTATGATT CTTTCAGTTC TAAGACTCTC
ACTACCATTG GTGTCTTGTG CTTCATTTTT ACCTTTGTAG GTTCCTCCTT CACTTACTAC
ACTTCCACTT CTAGACTTAT CTGGTCATTT TCAAAGTCCA AGGGTCTTCC TTTTGGAACT
TACTTTGGAA GAGTATCGCA AACTCTCAAG ACTCCAGTAT ATGCCTTGAC GTTTGTAACT
CTAATCTGTG CCATCCTAGG TACTATGATC ATGGGATCTT CTACGGCATT GTATGCTGTG
CTTGGTTCAG CCATGGTCTG TGTCAATCTC TCCTATGTTC CACCAATAAT GTGCTTATTG
ACAAGATCCA AGTTCTCGAT GTCTCCCTAC GTTAGGTTTG ACAACCAGGA TAGTCTGGTC
GCAGCTGTCT TGAAAGAAGG AAAGAGTTTG CCCTACTTCT CCTTGGGTAA GCTCGGTATG
CCTTTAAACA TCATCTCTGT GTTGTGGATC TGCTTTATCA TGATATGGTT GAACTTTCCC
ATCTACTATC CTGTGAGCAC TGCCAGTATG AACTACGCCT GCGTCGTCTT GGGCTGCACT
GGTATCTTTG GTCTTGCCGT GTGGTTATTC TATTCAGCCA AGCACTTTGA TCATGACGTG
GACTCCAAAC ATATACTTTA G
 
Protein sequence
MSSPEKDASQ VAVDQKSVGS VVEFKDPAQL EKRFSLWSTC ALQYSLICSP LALGTFLSTV 
IGVGGSPVLI YGFILSATMS LIICCSLAEL ASAYPHSSAQ VHWTYCLAPK AYKRVLSFAS
GILSCAGWIF ACISSTYVAS MFILALAQIY HPDYVPKNWH YYLVYVAIIV AGYLINVFFV
VILPYMIDVL VAIINFATLF VIITLLIKSD PKQSAKFVFK NIINETGWSS NGVVFFLGLL
PSIASVCLFD GAAHMTDEIA EPERNIPLVM VISNSVSAVV ALFAAIVYMF CIVSMDNLSV
PLGGQPIVQL MYDSFSSKTL TTIGVLCFIF TFVGSSFTYY TSTSRLIWSF SKSKGLPFGT
YFGRVSQTLK TPVYALTFVT LICAILGTMI MGSSTALYAV LGSAMVCVNL SYVPPIMCLL
TRSKFSMSPY VRFDNQDSSV AAVLKEGKSL PYFSLGKLGM PLNIISVLWI CFIMIWLNFP
IYYPVSTASM NYACVVLGCT GIFGLAVWLF YSAKHFDHDV DSKHIL