Gene PICST_47526 details

Gene Information

Locus tag: PICST_47526
Symbol: FOC1
ID: 4839680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 397952
End bp: 399283
Gene Length: 1332 bp
Protein Length: 444 aa
Translation table: 12
GC content: 44%
IMG OID: 640390995
Product: Probable formate transporter 1 (Formate channel 1)
Protein accession: XP_001384735
Protein GI: 150865496
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2116] Formate/nitrite family of transporters
TIGRFAM ID: [TIGR00790] formate/nitrite transporter
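
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (a common convention for such records, but not stated on this page); the function name `gene_length` is illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1

# Values copied from the record above.
length = gene_length(397952, 399283)
codons = length // 3

print(length, codons)  # 1332 444
```

Under that assumption, the 1332 bp divide evenly into 444 codons, which equals the reported protein length, so the listed gene sequence appears to end without a stop codon.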


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGA CTTACTACTT GACCACTCAC GAGACGGCTT TGGCCGTCGT GGCGACAGCG 
ATGAAGAAGG CCAGACTCCG TATAGATACA CTAGCTCTCA GCTCGCTCAT GGGAGGTATC
TTATTCAGTA CTGGAGGTAT GCTCTATGTG CTAGTGGAGT CATACTGTCC AGGATTAAGA
GAATCAAACC CCGGCATCAT ATACCTTATA CAAGGAGCTT TGTTTCCCAT TGGACTTTTC
TACGTGGTGA TAATGGGTGT TGATCTTTTC AACTCCAACA TACTCTTCTT CAGTACAGCA
TTGTGCCGGG GAGCAGTTTC CTTTTTAGAC TTATTCATCA GCTGGTTCGT TAGCTACTGG
TTGAATCTTG TAGGGAACAT TTTTGTTTGC TATATCATCT GCCATTATTC GCATGTAAGC
CAGGAGCCCA GTTTCATAGA GGGCTCTATT ACCGTCGTGG AGACAAAAGT AAAGTTTTCA
TTTGTAGCAA ATCTCATCAA GGCTATCGCG GGAAACTTCT TTGTCTCCTT AGCTATATAC
TTGCAGTTGA TGGCGAAGCC ACTCCATGTC AAATTGATCA TGATGGTGCT TCCAATTTTC
AGTTTTGTAG CCATGGGCTT CACGCATGCT GTGGCAGATA TGTATTTACT TATCATGGGC
ACTATCAATG GAGCACCCGT TTCTGTGGGT GAACTTGCTT GGAAATTGTT TTTGCCTGGT
GCCTTGGGAA ATATCATTGG AGGTTCCTTC TTTGGCGTAG TGATTACTTG GTACTTACAC
TTGGTTGTAG TAGAAAGGGA CCAGAGACAA TTGCATTTGC CTCAGTATGA GGTCAGAGAC
GAACAACCCG AATTGGGTAT GGATTCTAGG GTCGTGAGAC AACAGCCTTC AGTAGAAATC
ACAGAAGAGT TCCCTACTTT GGAAGAGAAG CTTGGCGAAG AACGTTTGTC GTCTGACTCC
AGCAATTTAG AAGTCTATAG ACTGAGACTG AGACTCATAC TGAGCGATTT GGCCAGGATA
TCTTCCAGAG CAACGGGAAT CTCACGTATA ACCACAAGAA CAATTCGTAC TGCTAAGAAA
TCTCCGAAGA ATGTCTTCCC AGTGTACGGA ATGGGTCCAG CATCCCTGAA AGACCAAAAA
ATCGCATCAG GTAGAGACGA CTACGACGAT AATGACACTA ATTCGATGTA CTCAGCAAGT
CAAGAGTTGG GTCCTAATGA GGTGCCTTCG GCTGATTACA TAGGTGAACA ATTAAGAAAA
GTAATCTCAA GAAAAGGATC CAGCGCTGGA AGTAATTTGA GACGGAAAGT CAGCGATTTG
GAGTCTCAAC GC
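
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; the helper `gc_content` is a hypothetical name, and how the database rounds its reported 44% is not known from this page.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)

# First 30 bases of the gene sequence, in the 10-base grouping used above.
first_block = "ATGGACGAGA CTTACTACTT GACCACTCAC"
print(round(gc_content(first_block), 1))
```

Passing the full 1332 bp sequence gives a value that can be compared against the GC content reported in the gene information.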
 
Protein sequence
MDETYYLTTH ETALAVVATA MKKARLRIDT LALSSLMGGI LFSTGGMLYV LVESYCPGLR 
ESNPGIIYLI QGALFPIGLF YVVIMGVDLF NSNILFFSTA LCRGAVSFLD LFISWFVSYW
LNLVGNIFVC YIICHYSHVS QEPSFIEGSI TVVETKVKFS FVANLIKAIA GNFFVSLAIY
LQLMAKPLHV KLIMMVLPIF SFVAMGFTHA VADMYLLIMG TINGAPVSVG ELAWKLFLPG
ALGNIIGGSF FGVVITWYLH LVVVERDQRQ LHLPQYEVRD EQPELGMDSR VVRQQPSVEI
TEEFPTLEEK LGEERLSSDS SNLEVYRSRS RLISSDLARI SSRATGISRI TTRTIRTAKK
SPKNVFPVYG MGPASSKDQK IASGRDDYDD NDTNSMYSAS QELGPNEVPS ADYIGEQLRK
VISRKGSSAG SNLRRKVSDL ESQR
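
The protein sequence follows from the gene sequence only under translation table 12, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this with a hand-built codon table instead of a library; the names `TABLE_12` and `translate` are illustrative, and unknown codons are mapped to `X` as an assumption of this example.

```python
BASES = "TCAG"
# Standard genetic code with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]

TABLE_12 = dict(zip(CODONS, STANDARD))
TABLE_12["CTG"] = "S"  # table 12: CTG encodes serine instead of leucine

def translate(seq: str, table=TABLE_12) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    return "".join(table.get(bases[i:i + 3], "X") for i in range(0, usable, 3))

print(translate("ATGGACGAGA CT"))  # MDET
print(translate("AGACTGAGAC TC"))  # RSRL
```

The first fragment is the start of the gene sequence and yields the first four residues of the protein. The second is an in-frame fragment from the gene that contains CTG; with the standard code it would give RLRL, whereas table 12 gives RSRL, which agrees with the listed protein sequence.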