Gene PICST_41267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41267 
SymbolGPH2 
ID4836995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp761990 
End bp763210 
Gene Length1221 bp 
Protein Length406 aa 
Translation table12 
GC content44% 
IMG OID640388310 
Productglycerol-3-phospate dehydrogenase 
Protein accessionXP_001382385 
Protein GI150863788 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.847058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0947149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTAT CCCAATTCAG CCCGATTTTC AGGCGAAGCT TTTGTTCTTC TTCTCGGTAT 
TGCTCAGACT TTTCTCATGT AGTGATTGGA GGAGGGGTGG TAGGAACAGC AATTGCTGCT
GAGCTACTGG AAGTTGCTGG CAACAGCGTT CTCCTTGTTG AAAAGAATGA AGATTTGGGC
ATGGAAACCA CCTCAAGGAA TTCGGAAGTG ATACATGCTG GGTTGTACTA CCCTCAACTC
AGTCTCAAGG GACAACTCTG TATCCGAGGA AAGAACAAAA TCTACGAAGC CAATGACAAG
GGACTCTTTC AAGTGGCACT ACAGAAGTGC GGAAAGTGGG TAGTTGCACA GAATGAACTG
GAGGAAGCAT ATTTGGAAAA GCTTTATCAG AATAGTCGGG ATCTTGGAGT TCCAGTGTCC
ATGATTTCTG CTTCTGAAGC TAAGCGCAAG TATCCGTTGA TAAGGGCTGA AGCTGGGGCT
CTAAATAGTC CTACAACGGG TATCATTTCA GCACATGAGT TGACAACCTT CTATCAGAGT
AAAGTAGAAA ATAACGATGG AACAATTGCC CTTAACACCA GAGTAGTTGA CATTGGCCCT
AATTTGGCCA CACCCAACTA TACCTTAAGA TTAGTTGATA TAGAAGGTTC AGATATGGAA
GTCACCACTG ACAATGTCGT CAATTCTGCA GGTCTCTATG CTCAGAAAAT AGCCAATCTA
GTGCTACCTC CAGATAGACA GTACCAAAGT TACTTTGCTA AAGGTAGCTA TTTCAGTTTC
CAGCCAGAAG TAGCCTTAAG CCACAGCAAG ATCACGGACA AGTTAATCTA TCCATGTCCA
AACCCCAATG CTTCATCTCT AGGTACACAT TTGACACTAG ATTTGGGTGG ACAAATCAGA
TTTGGCCCTG ACCTCGAATG GCTTGATATA GAGGATGCTT CTGAGATAGA CTACCGGGCA
AGCACAAACA ATTTGGATGC CGCATACAAA GCAATTCAGA CATATTTTCC TAGCGTGACA
CCAGGCTCAC TTCAACCATC TTACTCTGGA GTGAGACCAA AGTTATTGTC GGCAGCAGAC
AGCAAAAAGC ACTTTGCCGA TTTTGTTATC AAAGAAGAAG ATGGATTCCC TGGATTTGTC
AATTTGTTGG GTATTGAGAG TCCGGGATTG ACTGCCTCTT GGGCTATTGC TGACTATGTA
AAAGAAATAT ACCATGGATA G
 
Protein sequence
MRLSQFSPIF RRSFCSSSRY CSDFSHVVIG GGVVGTAIAA ELSEVAGNSV LLVEKNEDLG 
METTSRNSEV IHAGLYYPQL SLKGQLCIRG KNKIYEANDK GLFQVALQKC GKWVVAQNES
EEAYLEKLYQ NSRDLGVPVS MISASEAKRK YPLIRAEAGA LNSPTTGIIS AHELTTFYQS
KVENNDGTIA LNTRVVDIGP NLATPNYTLR LVDIEGSDME VTTDNVVNSA GLYAQKIANL
VLPPDRQYQS YFAKGSYFSF QPEVALSHSK ITDKLIYPCP NPNASSLGTH LTLDLGGQIR
FGPDLEWLDI EDASEIDYRA STNNLDAAYK AIQTYFPSVT PGSLQPSYSG VRPKLLSAAD
SKKHFADFVI KEEDGFPGFV NLLGIESPGL TASWAIADYV KEIYHG