Gene PICST_73792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_73792 
Symbol 
ID4840574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp123590 
End bp124789 
Gene Length1200 bp 
Protein Length325 aa 
Translation table12 
GC content41% 
IMG OID640391889 
Productpredicted protein 
Protein accessionXP_001386040 
Protein GI150866436 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0190] 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTTTTCAACC CTCTATATCA ATTTTCAACA ATATATAGAA CACTCCTCAC TAAAATGGCT 
TCGACTGAAA CTGTTACTCC CAAGCCAGCT GGAAGAACCA TTTTGGCTTC AACTATAGCC
AAGCCGTTTG TGGAAGAGGT TACTGGAGGT TTAACCAAGC TCGATTTCAA GCCTAAGTTG
GTGGGTTTCT TGGCCAACGA TGATCCTGCG GCTAAAATGT ATGCCAACTG GACTGGCAAG
ACTTGTGAGT CTTTGGGCTT CCACTACGAA TTGATCGACG TCAACAAGAA TGAATTAGAG
AACGAGTTGA TCAAGGCCAA CAACGACGAT GCAGTCAATG GCATTATGGT GTATTTCCCC
GTTTTCGGTG ATAGCCAGGA CCAGTACTTA CAGCAATTGA TATCACCTGA AAAGGACGTC
GAAGGCTTGA ACTTCTTGTA CTACCACAAT TTGTACCACA ATGTGAGATT CTTAGATGCT
CCTACCAATG AGCAGAAGTC CATTTTGCCT TGCACACCTT TGGCAATGGT AAAAATCTTG
GAGTATTTGG GGGTATACAA CAAGATCTTG CACTATGGAA ACAGACTCTA TGGCAAAAAG
ATCTTGGTGG TGAACCGGTC AGAAATCGTT GGCCGTCCCT TAGCAGCCTT GTTGGCTAAC
GATGGTGCTA CAGTTTATTC TGTGGATATC CATAATGTGC AGCAATTCAC TAGAGGAGAC
GACTTGCTGG CTCAGAGACA CAAGGTGATC GACTTGGACC AGAACGAATA CTCCATCGAA
AAGATTGCAC CTCTTTGTGA TGTAATTATC ACTGGTGTTC CTTCCGACAA CTACAAGTTC
CCAACAGAAC ACGTTAAGTA TGGGACAGTA GTGATAAACT TCTCTAGCTC GAAGAACTTC
AACGATGACA TCAAGTTGAA AGCTGGCTTG TACGTACCTT CTATCGGTAA GGTCACCATT
TCAATGCTCT TGAGAAACTT GTTGAGACTT ATTGAGAACA AGCAAATCAG ATTGGCCAAA
GAGAAGAAGT AATTTGCCCG GAGAAAATCC CATTGCAACT ACTGCAATGT TAATGTAGAT
ACTACTACTA CAATGACAAC ATACATAACG ATAACATAAC AAAACAAAAC ATAGCAAAAG
AATTGCAATA ACCATACAAT ATCATTAGTA TAAACAAGAA TGCAAGAATA TTACGAATAG
 
Protein sequence
MASTETVTPK PAGRTILAST IAKPFVEEVT GGLTKLDFKP KLVGFLANDD PAAKMYANWT 
GKTCESLGFH YELIDVNKNE LENELIKANN DDAVNGIMVY FPVFGDSQDQ YLQQLISPEK
DVEGLNFLYY HNLYHNVRFL DAPTNEQKSI LPCTPLAMVK ILEYLGVYNK ILHYGNRLYG
KKILVVNRSE IVGRPLAALL ANDGATVYSV DIHNVQQFTR GDDLSAQRHK VIDLDQNEYS
IEKIAPLCDV IITGVPSDNY KFPTEHVKYG TVVINFSSSK NFNDDIKLKA GLYVPSIGKV
TISMLLRNLL RLIENKQIRL AKEKK