Gene PICST_53424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_53424 
Symbol 
ID4851621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2354464 
End bp2355669 
Gene Length1206 bp 
Protein Length401 aa 
Translation table 
GC content36% 
IMG OID640393329 
Productpredicted protein 
Protein accessionXP_001387024 
Protein GI126275066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.509507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.160647 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTA ATACTTCTTC AAATACTAAT TCATATACTC ATTTAACAAT GACAAGTATA 
TCCAAAGAGA AGATAGATAA TTTGATTCTC AACTATTTCA TTCAGGAGGG CTATCAAGAG
GCGGCTATAA GTTTTTCCAA AGAGCTTAAC ATTGATATAA CTAAAACTTC AACCCACTAC
CATCATGATG GAAGGGCTCG CTTATTCTCC AGTAGTTTGG GATCCATCAA TTCTTTGAAC
GGCCAGGAGT TTTCCAATTT AGTTGAGGAT TATTTCGATC AGCCTGATGT CGCTACCAAT
TCCAGAAATC AAGCCAGTCA TCGCGAGAAC ACCAACTCCA AGCTTGTTTC TGCGTATTCT
ACTATCAATC AGAGAAAGGA GATTAAGTTG TTAATATTGA AGGGTTCGAT TACTGAAGCT
ATCAAGAAGA TCAGTGAGTA CTTTCCTTCT ATTTTAGATT CGAACAACTT ATTGCACTTT
AAATTGTTAA GATCAAACTT GATAGAAATG ATACGGAAGC ACAAGTTGAA TGCTGATATG
GATAAAGACG AAAAACAGTT TCTTGATAGT ATTCTCACTT TTGTGAGAGA AAACTTGATA
AACAAAGTTT CTAACTCTTC AAAGTTACTT AAGGAATTGG AAATAACCAT GAGTTTATTG
TGTTTCAACT TTGACCCTAA TATTAAGGAC ATTGAGGATC AGAAGGATTT GCCTCTGGAA
TTACGGTCTC TTTTCAAACT TTCGTTGAGA AACCAATGCT ACAGATTAGT AAACAAGGCT
ATCCTTAATC TCAACGACTA TGACGATGAT GATGATAAAT TTATAGTTGG AGATGTGGAT
AAAGGTAGCA TTGAAAAACT GTACAAGGGA CCTAAATTTG TTGAGTTTGA CTTGAGTAGT
TTGGACAAAT ATGAGCCTAC CGAAGGAGAT CAATCTGATA GAGACTTTGA AATGTTCAAT
GATGATGAGA TTGAATACAG TATGGACCAC AGTAAGAGTT TGAGTCAGCT TATTACTTCT
GAAGAGGCGG ATGACTCAGT AAACGAAAAG GAAACAGTAG AAGACGAGTT GAATAAACTA
CAGAGTTTAA CTTTGGAGTC CAAATTAGAG AGAATCGTCA AGTTATGGAC CATTACTGAA
CAGAGATTAG TGGATCTCAA CATCATTAAG GAGAAGCGAT ACACATTGAA TGATGAGTAC
TTGTAA
 
Protein sequence
MRFNTSSNTN SYTHLTMTSI SKEKIDNLIL NYFIQEGYQE AAISFSKELN IDITKTSTHY 
HHDGRARLFS SSLGSINSLN GQEFSNLVED YFDQPDVATN SRNQASHREN TNSKLVSAYS
TINQRKEIKL LILKGSITEA IKKISEYFPS ILDSNNLLHF KLLRSNLIEM IRKHKLNADM
DKDEKQFLDS ILTFVRENLI NKVSNSSKLL KELEITMSLL CFNFDPNIKD IEDQKDLPLE
LRSLFKLSLR NQCYRLVNKA ILNLNDYDDD DDKFIVGDVD KGSIEKLYKG PKFVEFDLSS
LDKYEPTEGD QSDRDFEMFN DDEIEYSMDH SKSLSQLITS EEADDSVNEK ETVEDELNKL
QSLTLESKLE RIVKLWTITE QRLVDLNIIK EKRYTLNDEY L