Gene PICST_39239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39239 
SymbolMNN4.2 
ID4850869 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp281957 
End bp285064 
Gene Length3108 bp 
Protein Length911 aa 
Translation table 
GC content40% 
IMG OID640392577 
Productprotein involved in mannose metabolism and cell wall synthesis 
Protein accessionXP_001387292 
Protein GI126273792 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTTTC TGATACGAAA AAAACACGTC CATCTATTTA TGATCATCGC ATTGCTTCTC 
GTGGTGACGG TGATGATGAT AATTACGTCT CACCTCATCT CCGAAGAAAC CAACCAGATC
ATCCGCAACA AGCTTAGAAT CAACTACTTC GACACTTTGG CCAGATCCGT GTATAAACCC
GGAAGTGAAG AAGATGAAAA AAGTCTTGAA GTCGATACTT CTGACCCCGA GTCTTATTTC
AACAGAGAAG TGGAAAAGTT GTACGACGCC AAAAAGAGCT CCGACGTTGA AAACAAGCTC
TGGTTGCTCA ACACGGACAT CCAGGACTCA GAAGTGCAGA TTCCATTTTA CTACTATATG
GATGAGAGCA CTGCTGGTGC TGATCAACTG GTGCAATTCC ACCAATACCA CAAGGAAAAT
AAGCCAGACA TACAGCCCTT TGAGCCTCGT TTCACTTTGG CGATGTACTA CCATTACATC
AAAGCGAGTT TGGAACAGAA TCCCAAGGAG CCTGTTCAAG TACCTTTCAA TTGGTACGAC
TGGGTAGATT TGTCACGTTT AGACAAGTAT TTGTTAGCAC CTTCCAACTT GAAACCCAAT
TGCTCACATC TTGATGCCAG ACCTGAAGAG GAAAAGTTGA ACAAGCAAAA GGAGGAGCAA
AAGCGTATCG AAGACGAAAA GAAGCGTATA GAGGAAGAAA AGAAGCGTAT AGAGGAAGAA
AAGAAGCGTG TTGAAGAAGA AAAGAAAAGA GCCAAGGAAG AAGCCCAGAG AAAGGAAGAA
GAAGAGAAGA AGAGAAAGGA AGAAGAGAAG AAAAGAAAAG AAGAAGAAGA AAAGAAGAAA
GCTGAGGAGC AGCAAAGAAA ATTGCTCGAA GCGAATGTAC CTCAGGAACA GAAATTAGAC
AACATCGTAG ACAATGATAC TAAACAGGTA CCAGTGGAAG CACTTCCTGA TGTCAATAGA
CCGGATGCAG CAAAACAACC ACAGGAAGTT AAACAACCTC AGGAAGTTGA ACAACCACAA
GAAGTCAAAC AGCCACAAGA AGTTAATCAA CCACAAGAAG TTAAACAGCC ACAAGAAGTT
AATCAACCAC AAGAAGTTAA ACAACTTCAG GAAGTTAAAC AACCTCAGGA AGTTAAACAA
CCACAAGAAG TCAAACAACC ACAAGAAATT AATCAACCTC AGGAAGTTAA ACAACCACAA
GAAGTTAATC AAGTACAAGA AATTAAACAA CCACAAGAAG CTGCTCGTCA GGGTGAATCT
GAAAAATCTG AAGAACAGCA GAAAAATGAA CAAAATAATG TTCAGCAGAA GCAGAATTTG
AACAAAAGAC AGGACGATGC TGAAGAAGAA AGACCAAGAC TCCAAAAACG TTTATCTGCT
AATGCTCCTG ATTTTTTCTG TGAGGATAAT AGTGACTTTC TTCGTGAACA TGATAACGGG
CATACTGTCC ATCCTGGTTT CAATGTGTTC AGGAGTTCTG GAAAGACCAC ACCAGAAAAG
GCTATTCTTT CTGGAAAATC GTATCTCTAC TCGTTTGCAC CAGCTCCGAC TATGATTCTC
TTTTTGACTA GTGATGGTTC ATATAATGTA TCTGTCAACT CTAACCAAAA GCTTCTTAAC
AATGGTTTGG TAGAACAATA CATTGCTGAT ACTAAGTCGC ATACCATTGA TATTATCCAT
GAATTAAGGT CGTTACGGAA GGCTTTTCCT CCAAACAAGC ATCAAGTAAT GACAGACTAT
AAAGTCAATA TTCCAGAAGA AAACTTTGAG CTCAAGACAG ATGAGTTAAT TAAGGGTTAC
GAATCTATAT TGCTTGACGG AAAGCAACTT CCTAGCAACG AATTAAAGTA TTACCACAGT
TTGAAATACT CCGACCATGA AGTCAAGAAT GGCGGTCCGC CAAAGTACTT CACTGAGGCT
CGTTTATTGA CCAACTTGTT GGGTGACCAT TATGATTGGA GATTCTTCAA CGGTGTAATC
TATGGCTCAT ACGAACAAAC GTTGATTTTA CATAGAATGG TCCGTGCCTG GTTGTCGTTT
ACCCGTAAAA ATGGAATTGT AACATGGGTA GCCCATGGAT CCTTATTATC CTGGTACTGG
AATGGTATTG CATTCCCATG GGATAACGAT ATCGATGTCC AGGTTCCAGT TATGGATTTG
CACAAGTTAT CGCTTCATTT CAACCAGACT TTAGTTGTGG AAGATGCAGA AGATGGATTC
GGTCGTTACT TCTTGGATTG TGGTACGTTT ATCACTTTAA GAGCAAAGGG TAATGGCAAT
AACAATATCG ATGCCCGTTT CATTGATGTT GATTCTGGTC TTTACATCGA TATTACCGGA
TTGGCATTGT CTCTGACTCT GCCCCCAGAC CGTTATAAGA AGAACTTACC TGCAAACTGG
AAAATCGATG GGAATGATTA CGTTCCAACT AATAGACAAT TGAAAATCTA CAACTGCCGA
AACAATCACT TTTCCAGCTT GAGTGAATTG AGCCCTCTTA TCAAAACTTC TATTGAAGGT
GAAATTGGTT ATGTTCCTCA GAAATACACT GACATTTTGA CTGTAGAATA TTCTAAGGGT
ATGTTGAACA AGAAATTCCA GGGGCATGTC TTTTTACCTC AGGTTAGATT GTGGCTCCGT
GAAGAAGACT TGTACTATTT CATCTACCAT CGTGAAAAAT GGAACAAGTA CCACAGTTTC
ACGATGAAAT ATGCCAACAG CGATGATGGT GAAGACCAGG AGTTTTTCAC GGACTTACAG
TATGAGTTGA CGGAGGATGA AAAGAAGCAA TTGAGAGCAA AGTCACACAG CCCTCTCATG
TTGAAGGATG ATGAGATGTC CACCATCTTC AAATTTACGG AAGATGAGTT ATTGCAACTT
CTCCACAAGG ATGAAATCTT CATGGCCTAT TACGGTTCCA AAGATTTTAC TTCTTTCCAC
GAGGAAGAAA TCATGCATTT ATTATTTGGC AAATCTACAG CTCAATTAAT CAACGATGCC
CCAGACTTCA AGCCAATGAA GTATGATCCA TTCCTCTTCA AGATGCATAA CGAATACATT
ACCTACGAGG AGGAGGTCAA CCGCTACTTG GCCCTCTTAA CAGCATAC
 
Protein sequence
MFFLIRKKHV HLFMIIALLL VVTVMMIITS HLISEETNQI IRNKLRINYF DTLARSVYKP 
GSEEDEKSLE VDTSDPESYF NREVEKLYDA KKSSDVENKL WLLNTDIQDS EVQIPFYYYM
DESTAGADQL VQFHQYHKEN KPDIQPFEPR FTLAMYYHYI KASLEQNPKE PVQVPFNWYD
WVDLSRLDKY LLAPSNLKPN CSHLDARPEE EKLNKQKEEQ KRIEDEKKRI EEEKKRIEEE
KKRVEEEKKR AKEEAQRKEE EEKKRKEEEK KRKEEEEKKK AEEQQRKLLE ANGESEKSEE
QQKNEQNNVQ QKQNLNKRQD DAEEERPRLQ KRLSANAPDF FCEDNSDFLR EHDNGHTVHP
GFNVFRSSGK TTPEKAILSG KSYLYSFAPA PTMILFLTSD GSYNVSVNSN QKLLNNGLVE
QYIADTKSHT IDIIHELRSL RKAFPPNKHQ VMTDYKVNIP EENFELKTDE LIKGYESILL
DGKQLPSNEL KYYHSLKYSD HEVKNGGPPK YFTEARLLTN LLGDHYDWRF FNGVIYGSYE
QTLILHRMVR AWLSFTRKNG IVTWVAHGSL LSWYWNGIAF PWDNDIDVQV PVMDLHKLSL
HFNQTLVVED AEDGFGRYFL DCGTFITLRA KGNGNNNIDA RFIDVDSGLY IDITGLALSL
TLPPDRYKKN LPANWKIDGN DYVPTNRQLK IYNCRNNHFS SLSELSPLIK TSIEGEIGYV
PQKYTDILTV EYSKGMLNKK FQGHVFLPQV RLWLREEDLY YFIYHREKWN KYHSFTMKYA
NSDDGEDQEF FTDLQYELTE DEKKQLRAKS HSPLMLKDDE MSTIFKFTED ELLQLLHKDE
IFMAYYGSKD FTSFHEEEIM HLLFGKSTAQ LINDAPDFKP MKYDPFLFKM HNEYITYEEE
VNRYLALLTA Y