Gene PICST_72172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_72172 
Symbol 
ID4838370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1007798 
End bp1008978 
Gene Length1181 bp 
Protein Length275 aa 
Translation table12 
GC content40% 
IMG OID640389685 
Productpredicted protein 
Protein accessionXP_001384501 
Protein GI126135954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.143035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GGAGAATGGT AGACCGTTAA ATAATTAACT AATCATGCTT GCGTGTAACT TTAGCACTTG 
GACAAGGAGG GCTACTTGAT TTTTGGAATT ATCGCCACCA TATCTTTCAA ATCCTATATA
AGGAGATCAG TTTCCCGCTT GCAAGTAATT GTTTCTAACA TTTTCTCAAC ATCAGAACAT
TGATTTTTCA ACCGGCTTAA GCACTCATAT CTCCACCCCA CATAAAATTT CTATACATAC
TAGCAAAATG GTCTACCCTT CAAATTTCAA AGTTGTCAGC AGAAAATTGA ACCCTTCCAC
TGTGGTTGCA TCTGCACCTT TCACCAGGGC CAACAAGTTT AACTTCGGTG CTCGTATGGC
TGTGTTCAAC TACGATAACC AAGTCATCGT CTGGTCCGCT TTACCGTATG GTACGGAGGT
CAAGAAGACG TTGGAATTGT TGACCGGAAG TGATGCTAAG CCAAATATTA CTCACTTGAT
AATTCCAGAT CGTGAACACA CAATTGCGGC AAAGTCTTTC AAGGAAGAGT ATCCAGAGTT
GAAGATTATT GCCATGGATA CCGTATCTAT TCCTGGTGTT GAAATCGACT ACAAATTTTC
TGCCAAAGAG GGTAACAAAT TGATTGACAG AAAGGTTCTT GAGGAGGAAA TTGGCATCAA
GGAATCAGTT ATCTTAGACA ATTTCGAGTT CGTCTATTTA CCATTGCATG CTAATCAAGA
ACTAGTAACC TTTGACAAGA AGGCCAAGGT TGTGTATGAA GCCGATTTGC TTTTCAACTT
GGGTGTTCCA GGTACTACTA GCGGTAAGGT CACGTTAGAG CAGTATTCTC CAGAAACTGG
ATACAAGCAA GGATTCAACC CTCACTCTGG TTGGTCTTTC TTAACTAGAT ACATGCAACC
ACACAGTAAG GTTGGTACAT TTCTTTTCAA TAAACTTGTC CAGACTGCCA AGAGTAAGGA
TGGTCTTAAA GCTATCTACA ACTGGGACTT CGACACCATT GTGCCTTGTC ATGGTAACTT
GATTGAAAAA GATGCCAAGG CCTCCTTCAA GAGTGCATTC CACGGAGCAT TTTAGAGGAG
AAAATAACTA CAGAGCGGCG GTGGGCAATT AATTGTCCTA ATTTTGTATT TCTTACTTTA
ATATGTTATG CTTAATTCAA ATTTAATGTT TTATTACTTT G
 
Protein sequence
MVYPSNFKVV SRKLNPSTVV ASAPFTRANK FNFGARMAVF NYDNQVIVWS ALPYGTEVKK 
TLELLTGSDA KPNITHLIIP DREHTIAAKS FKEEYPELKI IAMDTVSIPG VEIDYKFSAK
EGNKLIDRKV LEEEIGIKES VILDNFEFVY LPLHANQELV TFDKKAKVVY EADLLFNLGV
PGTTSGKVTL EQYSPETGYK QGFNPHSGWS FLTRYMQPHS KVGTFLFNKL VQTAKSKDGL
KAIYNWDFDT IVPCHGNLIE KDAKASFKSA FHGAF