Gene PICST_84655 details


Gene Information

Locus tag: PICST_84655
Symbol: SMF1
ID: 4840158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1509028
End bp: 1510982
Gene Length: 1955 bp
Protein Length: 531 aa
Translation table: 12
GC content: 42%
IMG OID: 640391473
Product: manganese transporter
Protein accession: XP_001385640
Protein GI: 150866146
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1914] Mn2+ and Fe2+ transporters of the NRAMP family
TIGRFAM ID: [TIGR01197] NRAMP (natural resistance-associated macrophage protein) metal ion transporters
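
The gene length listed above is consistent with the start and end coordinates when both are read as inclusive, 1-based positions on the replicon. A minimal Python sketch of that arithmetic follows; the function name `gene_length` is illustrative and not part of the database, and the inclusive-coordinate convention is an assumption that the record's own numbers happen to satisfy.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(1509028, 1510982)
print(length)  # 1955
```

A 531 aa protein needs 531 * 3 + 3 = 1596 bp including its stop codon, so the 1955 bp span appears to include flanking sequence around the coding region.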


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.047618
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.19959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAAAGAATAA CAACAGTGTG ATTGCATGCT TAAAGAACTG CTGGAAACTA GAAGTGCTAT 
TTTCTCAAAT CAAAGCTGGA ATTCGATACT CTGAATCGCC AAATAATCAG CCCATATAGG
ATCAAGTATT GAGATACTAA GAACAAGGGC TTCATCTGAG GTTATCAACA TCCTATCCCA
TTCACATTCA CACATTCGCT ATCGAGTTCA ATAGTCTTGA CTTCCTTCTG AACATTTTTG
CATTCATATT CACTCATCAT ATTAGTATTT GAAAATGGTA GAGGGTCTGT CTAACAATAT
CATGGAGAAC TCTCCCCAGA CTCCTAACGA AAAGTCGCCA GACTATAACT GTTCCATCAA
GAATCAACCG CAACCGCATG CAAACGTTGG AGTGCGTATG AGAACTTCTC TCCGCAAGTA
TATGTCTTTT GTTGGCCCCG GGTTGCTTAT TTCAGTTGCT TACATGGATC CTGGTAACTA
TGCCACAGGC ATTACTGCTG GTGCTTCAAA CAGATACTCG TTACTTTTCT TCGTGTTTCT
CTCAAATATC ATTGCTATTT TCCTCCAATC ACTTTGTATC AAGTTGGGCT CGGTGACAGG
TTACGATTTG GCTAGATGCT GCCGTGAATA CTTGCCTCGC TGGTTAAACA TCATCCTTTG
GATTCTTGCG GAGTGTGCTA TTATTGCTAC AGATGTAGCC GAGGTTATTG GTTCCGCTAT
CGCCCTTAAT ATCTTATTGA AAGTGCCGTT ACCAGCCGGT GTAGTCATCA CTATTGTCGA
CGTTTTGTTC GTTCTTGCTG CTTACAGAAA CGACACTTCT TCGACAAAGT TCGTAAAAAT
GTTTGAATAT GCCGTAGGTT TGTTGGTCAT GGCAGTTGTT ATCTGTTTTG CTATTGAGCT
TTCGCAGATT CATGCCAATG CTGCACAAGT CTTCAGAGGC TTCGTCCCTT CGAAAGAGAT
GTTCGATGAC GGAGGCATGA CTATCGCCAC TTCTATCATT GGTTCCACCG TCATGATCCA
CAGTTTGTTC TTGGGTTCGG GATTAGTACA GCCAAGGTTG AGAGAATATG ACGTCGTTCA
TGGCCACGTC AACTTGAACG ACTTGTATGA CAATAGCAGT GACGAAGACG AGAAAGCAGC
AGCAAAATCT AAGCGCGATG TAGAGGCAGA CTACTTTTAC CATAAGTACA AACCTTCTTA
CCAGTCGATC AAGTACTCGT TAAAATATTC CATCATTGAG TTGACCTTGA CTTTGTTGAC
TTTGGCACTT TTCGTGAACC TGGCCATCTT AATTGTAGCT GGTTCCTCTT TGTATGACAC
TCCAGAAGCT GTGGATGCTG ACTTGTATAC TATCCACTAC TTACTTTCCA AGAACTTGGC
CCCTGTGGTG GGTACCATTT TCATGTTGGC ATTGTTGTTC AGTGGTCAGA GCGCAGGTAT
AGTATGTACC ATAGCTGGGC AAATAGTCAG TGAAGGTCAC ATCAATTGGA CTGTCAAGCC
ATGGATGAGA AGATTAATCA CTAGAGCTAT TTCTATCATT CCATGCTTAA TTATCTCGCT
TTGCATAGGA CGTAATGGTT TAAGTACGGC TTTGAACGTG TCCCAGATTG TCATTTCAAT
TCTATTGCCT CCTTTGACCG CTCCTTTAAT CTACTTTACT TGTAAGAAGA CGATCATGAA
GGTAGAACTC CCAGAGGAAA TGAACACAGA AGGTATCAAG GGCATTGTAG AGGATAAGGA
TACAAAAAAG AGATACAAAT ACTTGGGAAA CAACTGGATC ACTGCAGTGA TTGCAGTTTT
GATCTGGCTC TTCATCTCGG TCTTGAACGT GTATGCCATA GTGGATATGG CCATGCACGG
CGTACAATAG AGAGTTGTCA TTTACATTTT ATATTTATTG GGTAATACAA GGAATACGAG
AGTATTTCAT ACAGATCAAT ACATAGAATC TAACG
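
The GC content field in the record can in principle be recomputed from a sequence laid out in 10-base blocks like the one above. The sketch below is one simple way to do that in Python, assuming GC content means the fraction of G and C bases over all bases; the helper name `gc_content` is hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, with its block spacing left in place.
first_line = "CAAAGAATAA CAACAGTGTG ATTGCATGCT TAAAGAACTG CTGGAAACTA GAAGTGCTAT"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence as a single string would give the figure to compare against the GC content field.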
 
Protein sequence
MVEGSSNNIM ENSPQTPNEK SPDYNCSIKN QPQPHANVGV RMRTSLRKYM SFVGPGLLIS 
VAYMDPGNYA TGITAGASNR YSLLFFVFLS NIIAIFLQSL CIKLGSVTGY DLARCCREYL
PRWLNIILWI LAECAIIATD VAEVIGSAIA LNILLKVPLP AGVVITIVDV LFVLAAYRND
TSSTKFVKMF EYAVGLLVMA VVICFAIELS QIHANAAQVF RGFVPSKEMF DDGGMTIATS
IIGSTVMIHS LFLGSGLVQP RLREYDVVHG HVNLNDLYDN SSDEDEKAAA KSKRDVEADY
FYHKYKPSYQ SIKYSLKYSI IELTLTLLTL ALFVNSAILI VAGSSLYDTP EAVDADLYTI
HYLLSKNLAP VVGTIFMLAL LFSGQSAGIV CTIAGQIVSE GHINWTVKPW MRRLITRAIS
IIPCLIISLC IGRNGLSTAL NVSQIVISIL LPPLTAPLIY FTCKKTIMKV ELPEEMNTEG
IKGIVEDKDT KKRYKYLGNN WITAVIAVLI WLFISVLNVY AIVDMAMHGV Q
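
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. This matters here: the fifth codon of the coding region is CTG, and the protein sequence shows S at that position. The Python sketch below illustrates the effect on the first twelve codons. The codon table is built from the NCBI table 12 amino-acid string, the function name `translate_table12` is hypothetical, and the excerpt is copied from the gene sequence above starting at the ATG inside `GAAAATGGTA`, on the assumption that this is the translation start.

```python
from itertools import product

# NCBI translation table 12 (alternative yeast nuclear code), codons in
# TCAG order. It differs from the standard code only at CTG (Ser, not Leu).
BASES = "TCAG"
TABLE_12_AA = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AA)
}


def translate_table12(dna: str) -> str:
    """Translate DNA in frame 1 with table 12, stopping at the first stop codon.

    Whitespace is ignored; codons with non-ACGT characters raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the coding region, as they appear in the gene sequence.
cds_start = "ATGGTAGAGG GTCTGTCTAA CAATATCATG GAGAAC"
print(translate_table12(cds_start))  # MVEGSSNNIMEN
```

With the standard code the same excerpt would give MVEGLSNNIMEN, which does not match the protein sequence listed above.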