Gene PICST_5340 details


Gene Information

Locus tag: PICST_5340
Symbol:
ID: 4851397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1708486
End bp: 1709589
Gene Length: 1104 bp
Protein Length: 368 aa
Translation table:
GC content: 42%
IMG OID: 640393105
Product: hypothetical protein
Protein accession: XP_001387555
Protein GI: 126274487
COG category: [S] Function unknown
COG ID: [COG2106] Uncharacterized conserved protein
TIGRFAM ID:
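
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and both inclusive (the record does not state this); the function names are illustrative and not part of any database API.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons; raises if the length is not a multiple of 3."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError(f"{length_bp} bp is not a whole number of codons")
    return codons


# Values copied from the record; inclusive coordinates are an assumption.
gene_len = inclusive_length(1708486, 1709589)
print(gene_len, codon_count(gene_len))  # prints: 1104 368
```

Under that assumption the result agrees with the listed Gene Length (1104 bp) and Protein Length (368 aa).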


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAGAAACCAG TCAAGACTTC CCCAAAAGTG CAAGTTTCCG TATGCATACC CTCCACAGTG 
ATCTCATCGA AGAATGCTAG GAATCTTGAA CAGATAACCT CCATAGTAAA TCAGATAGCG
AGAGCTGCCA CCATATACAA TGTGGTTGAG ATTATAATAC TCGATATTCC GGAATCGGCT
GCCGAGGATG AGAACCAAAA GGTGGTTGAG CTAGGTGGTT CCAAGGGTGG AAAGAAGTTA
AAGTTCAACT TTAGTGATGA AGAAATCCTA AAAGATAAGC AAGAAACAGT AGCGCAGACT
TCGGAAGATG TACAAGACTC TTCCGCCAAT GCATTCTTAT TGGCCGGTTT ATTGCAATTT
TTTGTGACTC CTCCATACTT GGTGAAAACC ATCTTTTCCC CAGCTATCAA TCCTAGTCCT
CTCAACAAAG ACATGTTAAA GAAGTTCAAG TATGCCTACA AATTGCCCAA GATCACCACC
TTGCCATTCA TGTCTAACAA CGAAGTGTTC CGAGACTTCA AAGAAGGTTT CACAGTCCCC
AAAGAAACAC CCAAGGTGGT TTCTAAGAAA GACAAGTCGA AAAAGGTCAA GGCAGAAAAG
AAGATCTCGG TGACAAAATA TGTCAATATT GGAGAAGCTG AGTTGTTGGA ATTGAATATT
AAGAGGGAAA TTCCTGCCTA TTCGCGAGTC ACTGTAGATT TGAAGAACAA GACTATTGTG
TCGCCGCTAC AGGCATATGG AGTGATGGGC AACAAGTCTT CGTTTGGCTA CTACGTCAGA
TTCTGTAAAA AGTTCAGTTC CATTTTTACT GAGAGTTCGG CTCCAGAGGG GTACTCATCC
AGTATCTGTG TTCAAAGCGA TGACTTCTAT AGCTCTGCAG ACAAGATTGA AGACTTGAAC
AAGATCAACA AACTTGACAA GGTGGAAGCT AGCGAGTCTG GAAATAATAA CATTTTGTTG
GTGGTGGGAA GTTTCAAAGA CTTGCAGAGA AGCTTTAAGA GCGACTCCAT CGAAGGGGTC
GACTCCGTGG GACAGATGTT TGATGGCCAA TTGGAAGTTC CAAATGGAGT TAGAATCGAA
GATGCCTTGA TGATTGCATT GACT
 
Protein sequence
KKPVKTSPKV QVSVCIPSTV ISSKNARNLE QITSIVNQIA RAATIYNVVE IIILDIPESA 
AEDENQKVVE LGGSKGGKKL KFNFSDEEIL KDKQETVAQT SEDVQDSSAN AFLLAGLLQF
FVTPPYLVKT IFSPAINPSP LNKDMLKKFK YAYKLPKITT LPFMSNNEVF RDFKEGFTVP
KETPKVVSKK DKSKKVKAEK KISVTKYVNI GEAELLELNI KREIPAYSRV TVDLKNKTIV
SPLQAYGVMG NKSSFGYYVR FCKKFSSIFT ESSAPEGYSS SICVQSDDFY SSADKIEDLN
KINKLDKVEA SESGNNNILL VVGSFKDLQR SFKSDSIEGV DSVGQMFDGQ LEVPNGVRIE
DALMIALT
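
A quick consistency check between the two sequences is to translate the start of the gene sequence and compare it with the start of the protein sequence. The sketch below is a minimal example, not the database's own method. The record leaves the translation table blank, so it assumes the standard genetic code (NCBI table 1) and offers an optional switch that reads CTG as serine (NCBI table 12); both choices are assumptions here. It also assumes the gene sequence is listed 5' to 3' in reading frame. The function and variable names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(ctg_as_serine: bool = False) -> dict:
    """Map each codon to a one-letter amino acid; '*' marks a stop codon."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        # Assumption: alternative yeast nuclear code (NCBI table 12).
        table["CTG"] = "S"
    return table


def translate(dna: str, ctg_as_serine: bool = False) -> str:
    """Translate DNA in frame 1, ignoring the spaces used to group bases."""
    seq = "".join(dna.split()).upper()
    table = build_codon_table(ctg_as_serine)
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and first 20 residues of the protein,
# both copied from the record.
first_line = "AAGAAACCAG TCAAGACTTC CCCAAAAGTG CAAGTTTCCG TATGCATACC CTCCACAGTG"
expected = "KKPVKTSPKVQVSVCIPSTV"
print(translate(first_line) == expected)  # prints: True
```

These first 20 codons contain no in-frame CTG, so the two tables give the same result for this fragment.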