Gene PICST_47071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47071 
Symbol 
ID4839441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1353935 
End bp1355053 
Gene Length1119 bp 
Protein Length372 aa 
Translation table12 
GC content47% 
IMG OID640390756 
Productpredicted protein 
Protein accessionXP_001385264 
Protein GI150865873 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.454401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00245237 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCCACC ACACCATGAC CTTCGAATCT AATATCATTA ACGGTAATGC TAGGAACAAC 
CCTGCCACCA ATCACATCCA AATTGGAGGC AGAAAGTCCA CGCTTGCAGT GGTGCAATCC
GAAATAGTCA AGAAGGAGAT CGAAGAAGCA TTTCCACACA TTAACTGCTC GATTTTGGCG
CTTTCCACTT TGGGAGACAA GATCCAGAAC AAGCCTCTCT ATTCATTCGG CGGAAAATCG
CTCTGGACAA AAGAACTCGA GATCTTGTTG TTGGACCTGA TTGACGAGTT TCCCCAGCTT
GACTTGATAG TGCATTCCTT GAAGGACATG CCAACGAACT TGCCTGATGA GTTTGAGTTG
GGTTGTATTT TGAAGAGAGA AGACCCTCGT GACGCTTTAG TTATGAGAGC CGGATCACCT
TACAAGACGT TGGACGATTT GCCAGCAGGC TCTGTAGTGG GAACATCTTC CATCAGAAGA
TCATCGCAAC TCGTGAAGAA CTATCCCCAC TTGAAGTTCG ACTCTGTTCG TGGTAACCTT
CAGACTCGTT TGAGCAAATT AGACGACGAC TCCCAACCAT TCGAGTGTAT CATTTTGGCA
CTGGCTGGCT TAATCAGAGT TGGTTTGGGC CACAGAGTTA CAGACTATTT GAACGCTCCA
CACATGTACT ACGCCGTTGG CCAGGGAGCT TTGGGTGTAG AAATCAGAAA AAACGACACC
AAGATGAAGA ACATCTTGGC TAAGATACTG CACATCCCAA CATCCCTCTG TTGTTACGCA
GAGAGATCGT TGATGAGATA CTTGGAAGGA GGGTGTTCCG TGCCGATAGG TGTTCACACG
AACTACGATG AAGACTCGAA GGTGTTGAAG TTTGAGGCTA TCATAGTCAG TCCCGACGGA
ACTCAGTTTG TGGAAGACGA ACTTGAAGCT CAAGTCGAGA CCTTGCAACA GGCCGAAGCC
TTGGGTATAC AACTAGGCGA CAGACTCATC GCTAAGGGTG CAAAGGATAT CTTGGACAAG
ATCGACTTCA ATAGAATCAA CCAGGCTCCT AGCACCATCA ACACACCCAC TCCATCCATA
GCTACCTCCA TAGAGGCCGT CGTTTCTACG GCTAACTAA
 
Protein sequence
MPHHTMTFES NIINGNARNN PATNHIQIGG RKSTLAVVQS EIVKKEIEEA FPHINCSILA 
LSTLGDKIQN KPLYSFGGKS LWTKELEILL LDSIDEFPQL DLIVHSLKDM PTNLPDEFEL
GCILKREDPR DALVMRAGSP YKTLDDLPAG SVVGTSSIRR SSQLVKNYPH LKFDSVRGNL
QTRLSKLDDD SQPFECIILA SAGLIRVGLG HRVTDYLNAP HMYYAVGQGA LGVEIRKNDT
KMKNILAKIS HIPTSLCCYA ERSLMRYLEG GCSVPIGVHT NYDEDSKVLK FEAIIVSPDG
TQFVEDELEA QVETLQQAEA LGIQLGDRLI AKGAKDILDK IDFNRINQAP STINTPTPSI
ATSIEAVVST AN