Gene Information |
Locus tag | PICST_53486 |
Symbol | SMF2 |
ID | 4851936 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3237211 |
End bp | 3239364 |
Gene Length | 2154 bp |
Protein Length | 698 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393644 |
Product | manganese ion transporter |
Protein accession | XP_001386943 |
Protein GI | 126276072 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1914] Mn2+ and Fe2+ transporters of the NRAMP family |
TIGRFAM ID | [TIGR01197] NRAMP (natural resistance-associated macrophage protein) metal ion transporters |
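
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon; the function name `gene_length` is illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 3237211, 3239364
protein_aa = 698

span = gene_length(start_bp, end_bp)

# Coding length implied by the protein: one codon per residue plus a stop codon
# (assumption: the stop codon is counted inside the gene span).
coding_bp = (protein_aa + 1) * 3

# For a spliced gene, the remainder would be non-coding (intronic) sequence
# under the assumptions stated above.
non_coding_bp = span - coding_bp

print(span, coding_bp, non_coding_bp)
```

Under these assumptions the span matches the listed Gene Length of 2154 bp, and the gap between the span and the implied coding length is consistent with the gene being flagged as spliced.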
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.392071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATAC GAGAAACGTT CACCTTCAAG AAAGCTCTTC TGGTGCTAAA GACATATTCG AGGTTCATTG GCCCTGGCCT AATGGTCAGT GTAGCGTACA TGGATCCCGG AAACTATTCC ACTGCTGTAG CTGCTGGCTC GGCTTATAAA TATGACTTAC TTTTCCTGAT TTTTTTATCA AACTGTCTAG CCATTTTCCT TCAGATATTA GCTGCTAAGC TTGGAGCCGT AACAGGACTC GATTTGGCTG CAAATTGTAA AGCCAATTTC TCATTTAAAA CGAATTTATT TATCTACATT TTGGCCGAAG TTGCCATCAT TGCTACCGAC TTGGCAGAAG TAGTTGGTAC GGCCATAGCA TTGAACATCT TGTTCAATTT ACCCCTTTTT GCTGGGGTCA TCCTCACAGT CGTAGACGTA CTAATTGTGT TGATGGCATA TAGGCCGAAA GGTCCGTTGC TTTTCATCCG GATTTTTGAG TCGTTTGTAA CTGTCTTGGT GGCGGCAACT GTTGTCTGTT TTGCCATAGA ATTGCATCAA GTCAGTCATG ATCCACTGAT TAATTTTTCA GTGGTAGAAG TCATGAAGGG CTTTTTGCCA AATGAAAGCG TCGTAGATCT GTCTGATAGA AAAGGCGGAA ACGGACTTTT CTTGAGTTTG GCTATTTTAG GAGCAACCGT GATGCCCCAC TCCTTGTACT TGGGATCAGG TTTGGTTCAG GCCAGATTGA AGGATTTTGA CATCAAGCAT GGTTTTTACA GACCCTTTCA AAGAAGATCT CTCAATGCTG AATCACCCCA AACTCAGACG GAAGAAGAAG AATTACATGG CACTGAATCT TTAGCTGAAT CTGACTCAAT TACTCCATTG GCTACACCTT TGGTGAACCC ATTTGGTGGC ATACTAAGAT TTGAAAAGGA TAATGATGTT GCAACTGTAG ACGATGATGA AGACGACAAC TACCGTCCTT CCATTCATGC GATCGACGAC ACAATGAGCT ATACAATTGT GGAATTGGTA ATTTCGTTAT TCACAGTCGC ACTTTTCGTC AATGCTGCTA TTTTGATTGT CGCTGGTGCC ACTTTAAATG CCAATAAAAA ACATGAAAGA AGCTCCTTTG GGGGTTTTTC TAGGAGAGAT GATGACGATG ATGGCACAGA TTCAGACTCA GACGACGACT ACGAGAACGC AGATTTATTT ACCATCTACC ATTTGCTTTC AAAGCATTTG TCGCCTACGG CTGGGTTTGT GTTTGCATTA GCGTTGTTGT GTTCCGGCCA AAGTGCTGGT GTAGTTTGTA CTTTGGCAGG TCAGATGGTC TCTGAAGGTT TCTTGTCGTG GAGCTTACCT CCAGTGACGA GAAGATTGAT TACCAGAGGC TTGGCCATTG CTCCATGTCT TGTTGTCGTA AGCATTGCCG GTCGCGAAGG TTTAGCCAAA ACATTGAATG CCAGTCAAGT AGTTTTGTCG ATTCTTTTGC CAGTAGTCAG TGCCCCTCTT ATTATCTTTA CTTGTTCAAA AGAAATCATG AAGGTACCTA TCTTTGTCAG AGACGGGGAT GAAAATGTCG ATATTGTCTA CGAACAGCCG GAAGCTTCTA CTATCTTTGA CGATGACGAA ATAACTCTAC CTGCCCCTGC TCCTGGCAAA CCGCGTAGAT TCAAAAGTAC AGAGCCTGCT ATCCGCTTAC AAGATTTACG TAACTCACAT CCTTGCGCTG GTTGGGAAGA TGATGAGGAC GAACATATGA ATCACGGAAG TGAAGAACAA CTGCACCTTC TCAGTCTGAA CAGCGCTGAT 
GGAACTAATG CCAACGCTCC ATACCCTCGC CGTTTATCAG CTGTTGCTAT TCTGTCTACC ATAAATAATG CTAGTATTAG AGAAGCAACT CCAGTCAAGA AATCGATGAG TACAAGTGAC TCGAACAGTT TCAGCTCCTC TAGCGCTAAC ACCTCGACAA ATCCCACACC AAACCCTTTA CCAGTGGAAA TCTACGAAGA TGCCGACCAC GAGCTCTTCA GAATCCAAGG CTACAAAGAC TTCAGCAACA GCCCCTTGAC AGCTTTTGTT GCAATCTTGA TCTGGGGGTT TGTCACCTGC TTAAACTTGT ACTTGATAGT CAGCATGATG TTGGGCTACG ATGTTCCTTT GTGA
|
Protein sequence | MSIRETFTFK KALLVLKTYS RFIGPGLMVS VAYMDPGNYS TAVAAGSAYK YDLLFLIFLS NCLAIFLQIL AAKLGAVTGL DLAANCKANF SFKTNLFIYI LAEVAIIATD LAEVVGTAIA LNILFNLPLF AGVILTVVDV LIVLMAYRPK GPLLFIRIFE SFVTVLVAAT VVCFAIELHQ VSHDPLINFS VVEVMKGFLP NESVVDLSDR KGGNGLFLSL AILGATVMPH SLYLGSGLVQ ARLKDFDIKH GFYRPFQRRS LNAESPQTQT EEEELHGTES LAESDSITPL ATPLVNPFGG ILRFEKDNDV ATVDDDEDDN YRPSIHAIDD TMSYTIVELV ISLFTVALFV NAAILIVAGA TLNANKKHER SSFGGFSRRD DDDDGTDSDS DDDYENADLF TIYHLLSKHL SPTAGFVFAL ALLCSGQSAG VVCTLAGQMV SEGFLSWSLP PVTRRLITRG LAIAPCLVVV SIAGREGLAK TLNASQVVLS ILLPVVSAPL IIFTCSKEIM KVPIFVRDGD ENVDIVYEQP EASTIFDDDE ITLPAPAPGK PRRFKSTEPA IRLQDLRNSH PCAGWEDDED EHMNHGSEEQ LHLLSLNSAD GTNANAPYPR RLSAKSMSTS DSNSFSSSSA NTSTNPTPNP LPVEIYEDAD HELFRIQGYK DFSNSPLTAF VAILIWGFVT CLNLYLIVSM MLGYDVPL
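
The GC content field can be recomputed from the gene sequence shown above. This is a minimal sketch, assuming the sequence is pasted as space-separated 10-base blocks exactly as displayed; `gc_percent` is a hypothetical helper name, and the record does not state how its own GC value is rounded.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (spaces, newlines) between blocks is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used only as a short illustration.
sample = "ATGAGCATAC GAGAAACGTT"
print(f"{gc_percent(sample):.1f}%")
```

Passing the full gene sequence instead of `sample` gives a value that can be compared with the GC content field.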