Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28914 |
Symbol | FRM1 |
ID | 4851654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2462524 |
End bp | 2463804 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393362 |
Product | formamidase |
Protein accession | XP_001386805 |
Protein GI | 126275167 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0986635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATC CAATTCGTTA TGTTACCAAG GTCGACTTGA ACAAGCCAGG TGACGAACAA CCTCAAGTAC ACAACAGATG GCACCCAGAT GTTCCCTTTA CTGAAACCAT CAAGCAGGGT GAAACTGTCA AAATTGAGTG TTTGGACTGG ACAGGAAACC AGATAGTCAA TTCGGACGAC GCAGACGATA TCAAGAACGT GGACTTATCT AGAATCCACT ATTTATCTGG GCCATTCAAC ATTGAAGGAG CCAAACCTGG AGATGTCCTT GTCGTAGAAA TCAAAGATGT GCAACCTTTA GATAGACAGC CATGGGGTTA CTGTGGAATA TTCCATAAAA CTAACGGTGG AGGGTTCCTT GACAAATTCT ATCCTGAAGC AGCCAAGGCT ATATTCGATT TGGAAGGTAT CCATGCTACA TCTAGACATA TCCCCAATGT CAAATTCACT GGTCTTATTC ATCCTGGCAT TATGGGGACA GCACCTTCTG CTGAAGTATT GGCTGAATGG ACTGCACGTG AAGGAAAACT AGTTGCTGCA GAGCAGCATG CGCACGATAC CGCGGTTGCA AATCTTCCCC TTTCAGTAAA TGCCCATGGA GGCCTGGCAA CTGGCGATTT GGCTGAAAAG ATAGCCACTG AAGGAGCACG CACCATTCCT GGAAGACCCG AACATGGAGG TAACTGCGAT ATCAAGAATC TCTCCAGAGG CTCCAAGTGC TATTTCCCAA TCTACGTTGA TGGCGCCAAG CTTTCTGTAG GAGACTTGCA CTTCTCACAA GGTGATGGTG AAATATCCTT CTGTGGAGCA ATTGAGATGC CTGGAGTACT CACCATCAAC TGTAAAGTCA TCCCTGGAGG AATGGAAAAG TTGTCTTTGA AGTCTCCCAT GTTCATTCCA GGAGATGTCC CCAACCAGTA CGGCCCTTCC AGATACTTGA CTTTTGAAGG TTTCTCAGTT GACGAGGAAG GTGAACAGAA GTTCTTATGT GCTACTACAG CCTATAGACA AGCCTGTATC AGAGCAATTG AATACTTGAG AAGATTTGGT TACAACGATT ACCAGATTTA CTTATTCTTG TCAACTGCTC CAGTAGAAGG ACATATTGCA GGTATTGTTG ATGTTCCAAA TGCATGTACT ACTCTTGGAA TTCCAATTGA TATCTTTGAA TTCGATATCA GACCTGAAGC TGAGCCAGTC AAAATCGATC AGGGAAATTG CGCTTTCGTA AAGGGGACAG ATCATCCTGT GACGTATGAC TTCAAGGATT TCATAAAGTG A
|
Protein sequence | MSNPIRYVTK VDLNKPGDEQ PQVHNRWHPD VPFTETIKQG ETVKIECLDW TGNQIVNSDD ADDIKNVDLS RIHYLSGPFN IEGAKPGDVL VVEIKDVQPL DRQPWGYCGI FHKTNGGGFL DKFYPEAAKA IFDLEGIHAT SRHIPNVKFT GLIHPGIMGT APSAEVLAEW TAREGKLVAA EQHAHDTAVA NLPLSVNAHG GLATGDLAEK IATEGARTIP GRPEHGGNCD IKNLSRGSKC YFPIYVDGAK LSVGDLHFSQ GDGEISFCGA IEMPGVLTIN CKVIPGGMEK LSLKSPMFIP GDVPNQYGPS RYLTFEGFSV DEEGEQKFLC ATTAYRQACI RAIEYLRRFG YNDYQIYLFL STAPVEGHIA GIVDVPNACT TLGIPIDIFE FDIRPEAEPV KIDQGNCAFV KGTDHPVTYD FKDFIK
|
| |