Gene PICST_3849 details


Gene Information

Locus tag: PICST_3849
Symbol: SUC1.4
ID: 4839768
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 26659
End bp: 28221
Gene Length: 1563 bp
Protein Length: 487 aa
Translation table: 12
GC content: 37%
IMG OID: 640391083
Product: Probable sucrose utilization protein
Protein accession: XP_001385339
Protein GI: 150865926
COG category:
COG ID:
TIGRFAM ID:
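
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the stated protein length excludes any stop codon; the variable names are illustrative and not part of the record. Under those assumptions, the nucleotides left over after accounting for the protein would be non-coding, which is consistent with the gene being marked as spliced.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 26659
end_bp = 28221
protein_length_aa = 487

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Each amino acid is encoded by one codon of three nucleotides.
coding_nt = protein_length_aa * 3

# Nucleotides in the gene span that do not encode protein residues
# (plausibly intronic, since the record says the gene is spliced).
noncoding_nt = gene_length - coding_nt

print(gene_length)    # 1563
print(coding_nt)      # 1461
print(noncoding_nt)   # 102
```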


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.411737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGATCCAAAT ATTCTCGTCC ATGTGATGCT TGTGCTCATA GAAAGGTACG ATGTACCGAC 
AATCGACCCT GTAGTCGATG TATTGACTAT GGTATTTCAT GCACCAATAC AAGAATCAGA
AAGCGTAGTG GACCAAAACC CAAAATAATA AGTGAATTAC CATTCAGCAA TAATATTTCA
ACTTCACCTA AAGGTACTTC TCAATATTTG ACAATAGTTA CCACTGTAAG TGGAAGCAAG
AAGGGTCAAG ATTCATTAGA TGAAGTTGAA TCATCTATGG ATAGTAATGT TTCAACATTC
CTGAGATATG GCGGTGAGTT CATTCCAATA GATGACCTAT ATCCTTTTCT TCAATTCTTT
CAAACATGGT ACTATGGCTT ATGGCCTGTT CTTTCAGTTG GATCACTTAT CACAAATTTG
ACGACAATAA CAGATAGAAG ATCCCTAAAC AAACAAACAT CACCATACTA CTCTTTGGGG
CTCGCTCTTT GTGCTGCTAT CTCGGGACAA TTTGCTTTCT TGAGCAAGAG TTCGGATTTG
CCACATATGA CCACAACAGT AAGTGCCAGT CAACTTGCTC GAGCAGCATT GAATGTCAGA
GAAACATTTG ATCACAGAAT GAAGCCAACC GAAGAAACAT TGTTGACTTC GTTCTTTTTA
TATAATTATT ACATTAATGT CAAAAATGGA ACCGAAGCAG CAATAATGTA CTTGAGGGAA
GCAATTTCGA TGGTCCATAT ATTAGGATTA CATGATCCTC AGGCTTATTT GCAAATGTCT
AGTGAGTCAC TGCATCGTTT GAAGAAGACT TATTACCTAC TTCTTGCAAG TGAAAGGTTC
ATGTGCATTG AAGATAACGT TCCAGTTATC CTCGATTCTA GTGTACCTTA TCCATTGGTT
GAGGATGATG AATATCCAGA ATTATTAGCT GGATTTGTTG AAGTTGTCAA AATCTTTGCA
ATCCCTGATC GAAACTTTTT CGAAAAAATG AGTATCGCTA ATAGGAGAAC CGAAAATGTA
AAGGATTACG AGATCTTCCA AAACTTCTTC CACCTCTCTG CAACTTCATT AAGCGAGGCA
TGGGTAAGAG AAGTGGACAA AAAGATTAAA GGAATAACTA TCGTTGATTC AATGTCAGAT
ATACTGAAGG TTAACTTATT GCTTTCTAAA GATTGGATGA GGTCGCTAAT ATGGAGAATT
GCTTATCAAA ATGCTCTCAC ATCAGCACTC AGAGAGAAAG ATGATTGTCT AAGCTTGAAT
TATCCTATGA CGATCGCTTA TCAATTTCTT TCATCCACAA GCAATTTACC TGCTTTTGCA
TTTGAATGCA ATGGGGCAGG TGTAGTTGTA AAATTATTAG ATATTGCGAA TGGATTGGGC
GACTCAATGA ACGATTTAAG TTCAATGGAA AGGACATATG ACTTGTCATT ATTTGAAAAC
GCATTAACGT CAGTTTTTTG TTTGATATCA AAGTTTAAGA CAAAGGTAAC TTTACCTGTC
AGAATGTACC GCAAGATTGA GAATATGGTT AGTCGTTGGT CCATTCCAAG ACAATTTCTG
TCT
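
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence in this layout. The following is a minimal sketch, assuming the percentage is simply the share of G and C bases among all bases after whitespace is removed; the helper name `gc_percent` is hypothetical, and the example runs on only the first line of the gene sequence, so its result is not expected to match the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in this
    record) is ignored. Assumes the sequence contains at least one base.
    """
    bases = "".join(sequence.split()).upper()
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only, for illustration.
first_line = "CGATCCAAAT ATTCTCGTCC ATGTGATGCT TGTGCTCATA GAAAGGTACG ATGTACCGAC"
print(round(gc_percent(first_line), 1))
```

To check the record's figure, the full gene sequence would be passed as one string.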
 
Protein sequence
RSKYSRPCDA CAHRKVRCTD NRPCSRCIDY GISCTNTRIR KRSGPKPKII SELPFSNNIS 
TSPKGTSQYL TIFIPIDDLY PFLQFFQTWY YGLWPVLSVG SLITNLTTIT DRRSLNKQTS
PYYSLGLALC AAISGQFAFL SKSSDLPHMT TTVSASQLAR AALNVRETFD HRMKPTEETL
LTSFFLYNYY INVKNGTEAA IMYLREAISM VHILGLHDPQ AYLQMSSESS HRLKKTYYLL
LASERFMCIE DNVPVILDSS VPYPLVEDDE YPELLAGFVE VVKIFAIPDR NFFEKMSIAN
RRTENVKDYE IFQNFFHLSA TSLSEAWVRE VDKKIKGITI VDSMSDISKV NLLLSKDWMR
SLIWRIAYQN ALTSALREKD DCLSLNYPMT IAYQFLSSTS NLPAFAFECN GAGVVVKLLD
IANGLGDSMN DLSSMERTYD LSLFENALTS VFCLISKFKT KVTLPVRMYR KIENMVSRWS
IPRQFSS