Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_47526 |
Symbol | FOC1 |
ID | 4839680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 397952 |
End bp | 399283 |
Gene Length | 1332 bp |
Protein Length | 444 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390995 |
Product | Probable formate transporter 1 (Formate channel 1) |
Protein accession | XP_001384735 |
Protein GI | 150865496 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2116] Formate/nitrite family of transporters |
TIGRFAM ID | [TIGR00790] formate/nitrite transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAGA CTTACTACTT GACCACTCAC GAGACGGCTT TGGCCGTCGT GGCGACAGCG ATGAAGAAGG CCAGACTCCG TATAGATACA CTAGCTCTCA GCTCGCTCAT GGGAGGTATC TTATTCAGTA CTGGAGGTAT GCTCTATGTG CTAGTGGAGT CATACTGTCC AGGATTAAGA GAATCAAACC CCGGCATCAT ATACCTTATA CAAGGAGCTT TGTTTCCCAT TGGACTTTTC TACGTGGTGA TAATGGGTGT TGATCTTTTC AACTCCAACA TACTCTTCTT CAGTACAGCA TTGTGCCGGG GAGCAGTTTC CTTTTTAGAC TTATTCATCA GCTGGTTCGT TAGCTACTGG TTGAATCTTG TAGGGAACAT TTTTGTTTGC TATATCATCT GCCATTATTC GCATGTAAGC CAGGAGCCCA GTTTCATAGA GGGCTCTATT ACCGTCGTGG AGACAAAAGT AAAGTTTTCA TTTGTAGCAA ATCTCATCAA GGCTATCGCG GGAAACTTCT TTGTCTCCTT AGCTATATAC TTGCAGTTGA TGGCGAAGCC ACTCCATGTC AAATTGATCA TGATGGTGCT TCCAATTTTC AGTTTTGTAG CCATGGGCTT CACGCATGCT GTGGCAGATA TGTATTTACT TATCATGGGC ACTATCAATG GAGCACCCGT TTCTGTGGGT GAACTTGCTT GGAAATTGTT TTTGCCTGGT GCCTTGGGAA ATATCATTGG AGGTTCCTTC TTTGGCGTAG TGATTACTTG GTACTTACAC TTGGTTGTAG TAGAAAGGGA CCAGAGACAA TTGCATTTGC CTCAGTATGA GGTCAGAGAC GAACAACCCG AATTGGGTAT GGATTCTAGG GTCGTGAGAC AACAGCCTTC AGTAGAAATC ACAGAAGAGT TCCCTACTTT GGAAGAGAAG CTTGGCGAAG AACGTTTGTC GTCTGACTCC AGCAATTTAG AAGTCTATAG ACTGAGACTG AGACTCATAC TGAGCGATTT GGCCAGGATA TCTTCCAGAG CAACGGGAAT CTCACGTATA ACCACAAGAA CAATTCGTAC TGCTAAGAAA TCTCCGAAGA ATGTCTTCCC AGTGTACGGA ATGGGTCCAG CATCCCTGAA AGACCAAAAA ATCGCATCAG GTAGAGACGA CTACGACGAT AATGACACTA ATTCGATGTA CTCAGCAAGT CAAGAGTTGG GTCCTAATGA GGTGCCTTCG GCTGATTACA TAGGTGAACA ATTAAGAAAA GTAATCTCAA GAAAAGGATC CAGCGCTGGA AGTAATTTGA GACGGAAAGT CAGCGATTTG GAGTCTCAAC GC
|
Protein sequence | MDETYYLTTH ETALAVVATA MKKARLRIDT LALSSLMGGI LFSTGGMLYV LVESYCPGLR ESNPGIIYLI QGALFPIGLF YVVIMGVDLF NSNILFFSTA LCRGAVSFLD LFISWFVSYW LNLVGNIFVC YIICHYSHVS QEPSFIEGSI TVVETKVKFS FVANLIKAIA GNFFVSLAIY LQLMAKPLHV KLIMMVLPIF SFVAMGFTHA VADMYLLIMG TINGAPVSVG ELAWKLFLPG ALGNIIGGSF FGVVITWYLH LVVVERDQRQ LHLPQYEVRD EQPELGMDSR VVRQQPSVEI TEEFPTLEEK LGEERLSSDS SNLEVYRSRS RLISSDLARI SSRATGISRI TTRTIRTAKK SPKNVFPVYG MGPASSKDQK IASGRDDYDD NDTNSMYSAS QELGPNEVPS ADYIGEQLRK VISRKGSSAG SNLRRKVSDL ESQR
|
| |