Gene PICST_3402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_3402 
SymbolHYR6.2 
ID4840202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1703837 
End bp1705063 
Gene Length1227 bp 
Protein Length409 aa 
Translation table12 
GC content41% 
IMG OID640391517 
Producthyphally regulated cell wall protein 
Protein accessionXP_001385683 
Protein GI150866179 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.25351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCTTT TCAGAAGCTT AGTCAGAGTT TTTCTCTTTG CAACTGTTGC TTTGGCTTTT 
ACGGTCCAAA AACCTAAGGT TGACAGAGGT TCTATCAACC TCTCTATTGG TGATATTACA
ATTCAGTCTG GCTCTTTCTG GTCCATTTTT GACAATACTG TTTCCATATT TAAAGGTGAT
CTTTGGGTGC AAAAAAATGC TGGTTTTTTT ATCACATCCA CCAACAAGTT GATCGGTTTG
AAAGTTGAAC TTGCATCAGG ATTTGGATCT ATTAGAAACG ATGGCTTGAT CGTCTTCAAC
TCCCTTGTCT CTATAACTCC ATCATTTTAC AAACTTATTG GCAAGAGTTT TCTTAATTCT
GGAGAAATAT TTTTGGTGTC TAGTGGTTAT GGAGTACCAA CAGCTGCACT TTTGGCTCCA
ATTTGGAAAA ATACTGGTTC CTTGACTTTC TTTCAGAACA AAAGAAACAA CGGGGTCGTC
AGTCTTGGTG CACCAGGATT AAAAATTGAA AATTGGGGTC AGATTTGTTT GTTCAATGAA
CTTTATAAAC AAACGACCCA TATCTTTGGT GATGGTTGTA TTACCGCAGA TCAAGACTCT
AGTATTTTCT TTTCAAACTG TTTATTGGAT ATTGATAGTA GACAAACCGT CTACTTAGCA
GACTCGAGGT CCTCGGTAAG GGCTGTGGCT CTCGCTAAGC CAAAGACTTT CAAAGTGGCT
GGTTTTGGAA ATGGAAACAA AATTGGATTG GATTTACCAC TTATCAGCCC ATTCCTGAAA
TCAGTAATCT ACAATGCTAA AACGGGAATC TTATCGCTTA GAGTTAAGGG CTTTTGGGGG
CAAGACTTCA ATATTGGTTT AGGTTACAAC TCGAACAAAT TTAAGATTAC AACTGACAAT
AGTCTCGGGT TGTTGAGTGT TCCATGGGGA GCTGTCTATT ATGACGGTCC AGTACCTAAT
AAGCAGATTC CAAGCAACTG TCAACCATGC AAGCCCTATC CATCACCTCC TACAACTACT
ACAACGAAGA CAAACGCTCA AACTACCAAA ACATCTACTT GGACTGGTAC TTTCACCACC
ACCGTCACCG AAACTGATAC CCCAGGTGGT ACCGACACTG TCATCGTTGA AGTTCCTTCT
ACTCCAAACT CTCAGACTAC TCTTACCTCA ACTTGGACCG GTACTTTCAC CACCACCGTC
ACCGAAACTG ATACCCCAGG TGGTACC
 
Protein sequence
MLLFRSLVRV FLFATVALAF TVQKPKVDRG SINLSIGDIT IQSGSFWSIF DNTVSIFKGD 
LWVQKNAGFF ITSTNKLIGL KVELASGFGS IRNDGLIVFN SLVSITPSFY KLIGKSFLNS
GEIFLVSSGY GVPTAALLAP IWKNTGSLTF FQNKRNNGVV SLGAPGLKIE NWGQICLFNE
LYKQTTHIFG DGCITADQDS SIFFSNCLLD IDSRQTVYLA DSRSSVRAVA LAKPKTFKVA
GFGNGNKIGL DLPLISPFSK SVIYNAKTGI LSLRVKGFWG QDFNIGLGYN SNKFKITTDN
SLGLLSVPWG AVYYDGPVPN KQIPSNCQPC KPYPSPPTTT TTKTNAQTTK TSTWTGTFTT
TVTETDTPGG TDTVIVEVPS TPNSQTTLTS TWTGTFTTTV TETDTPGGT