Gene PICST_30762 details


Gene Information

Locus tag: PICST_30762
Symbol: MNN41
ID: 4837783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 875618
End bp: 878017
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 12
GC content: 40%
IMG OID: 640389098
Product: regulator of cell wall mannosyl phosphorylation
Protein accession: XP_001383801
Protein GI: 150864821
COG category:
COG ID:
TIGRFAM ID:
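
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(875618, 878017)
print(length, protein_length(length))  # prints: 2400 799
```

Both values agree with the Gene Length and Protein Length fields of the record.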


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.799608
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATGA AAATGAGAAG GTCACGAGGG ACCATTCTCA TAGCCACAGT GCTTCTAATT 
CACATAATAG CATTCTCCCT TTTCAAGTTC GGCGACAATA CCCACACCTT GGATCGTTTC
AAAGAAAGCA TTCTAGACTT GAAAAAGGCT CTTCTCGAAA ATTCATACAC GAAAAAGCCT
AGTGATGGCT TCGACCAAAA GGTTGACAGC ATATTACAAA CAATAATCAG AAACCAGGAT
GAAGAAACTG AGAAGTTGCT TTATTCTTCT GACCTTAAGA ATAGAAAAGA CTTCAACGAA
ATCATATTGC CAGACTATCT AATAGACACC ACCAAGAAAC AACCCCAAGT CCAACACTTC
GACCCAAGAT TCACTGTTGG GTTGATAACA AACTATCTTA CCCGAAAAAT CAGTCTGGGA
GAAATCAAAG ACAAAGATGA ACTTGTTCTT CCATACTTCC ACTGGAGTGA TTTCGTAGAT
CTTTCTGAGT TGAACCAATA CTTCCATTCG AAAGACAAGG CTGACTGTAA ATTCTTTAGT
TTTGTTCCGC CTACACGTTC AAAACTTGCA AGGTTGGAAG TCCTTCCAAT TGAACTGTAT
TGCTTGAATG AAGACACTAT CGACGATCTT CTCACAACAT CCACAGACTC CACGTTGGTG
GACAATTTGA GAGTCATTAA AGAACAACAA TACTCCAGTG GTTTCCACAT CTATACGGCT
CCAGGAAGAT CTACTTTTAA ATTAAGACCT ATCTTAGCAA AATCCTATTT GCACGATTTC
ATGTTGGCTC CATACTCCAT TCTCATGCTT TTACCTGAAC AAAAGTCAAT TAACATTCAA
ATAGACAGAG ACATCACTAA AGCAAAGAAG AATTTGCTCA AATCTGGAAT TGCCACTTCC
TATATTGAAA TGTTTAAAGT AGAAAATGGA GGCGATGGGC CGGTAACTCT TAATATCCAG
AAGGAATTGA GTGCTTTCAT TGACGCAACA AACAAACATC CTAACTTGCT TCAACCAGTC
AATACTCTTA AGTATGAAAA GGCACTTGAC CATGCCCAGT TTATAGATGA ATCAAAGGCT
ATCTTGCGTG AAATGAAACG AGAGGATGTT TCGAAGTTCA CTCTTCATCA AAAGTCCTAT
TTTGAAGCTC TCGAATATTC TATTGGTTGT CAGGATCCTC CCAAGTATTT CCACGAAGCC
AAGATCTTCC AGTCAGAAAG GAATTTTGCT ATTGGGGCCC ATTATGACTG GAGGTTCTTT
AACGGAGTTG TCAACAATAA ACCGAAGCAT CAGCCAGCAT TACATCAGTT GATCCAAGCA
TGGTTCAGGT TCACAAACTC TCAAGACATC ACTACTTGGA TTGCACACGG GACGTTGCTT
TCTTGGTACT GGGATGGAAT GTCGTTTCCA TGGGATAATG ATAGCGATGT TCAAATGCCA
ATTGACCAAT TGCATAAACT TTCACGTAAC TTCAACCAAA CCTTGATTAT AGACATAGGT
AACGATCCAG TAACACAGGA AATTAGATAT GGGCGTTATT TCATAGACTG TGGCAGTTTC
TTGAGTTCTA GAGAACGTGG CAATTCCAAT AATTTCATAG ATGCCCGTTT CATAGATGTT
GACACAGGAT TGTATGTGGA CATCACAGGT CTTGCTGTAT CTCGAACCCC ATCTCCTGGT
AGGTACGACG GTTTCTTGAC AAGAGAGTTG GCCAGAACAC CTGGCAATAG CGACGTTAGT
GAGTTTATGA GAAACGACTT TTTGCAAGTA TACAATTGTC GGAATAATCA CTTCCTGAGG
CTTGCTGACT TATCGCCATT GAAGCTTCTG ATGCATGAAG GCGAATATGC CTACCTCCCT
AATCAGGTTG AATCAGCTTT GTCAACTGAA TACGGAGAAA AAAGTATCAA GTTGCAGTCT
TTCAACGTTT ACACGTTCAT TCCTAGGATC AGAAACTGGA TTCCTGTGAA GAAGTTGAAA
GCAAGCCTCC AGAACAAAGG GGTTAGACAG AAAGGAACTG GGGTTCTGGT GTTAGAATTT
GCAGAAGAAG AATGCTTGAA GATTCTCAGT GAGAACAGTG ACCTTCTTAT GGAATACTTA
GTCACAAGGG AAGTTACTGA AAGACACGAA TTGGAGTTAA TCAGCATGCG AGATGGAAAT
GATCCAAAAC GCTTCTTCAC ATCTTCTGGC AAATTGAAGC AAGGAAATCC ATTGAGACAT
GACAGCTTTT CGCTATCTGC GTTCCATGAT AAGTACAAGT TCGAAGAGTC CATTGATCAG
ACTGTCAATT TTATATCCTC CCTTGAGATG CAAACCCCAC CAGAAAAGGT AGAACCAGCA
GAACAAACAC AAACCAAGAA AATATCAACC AGGAAGGATC TTCCCAAGTT GGTTGAGTGA
 
Protein sequence
MTMKMRRSRG TILIATVLLI HIIAFSLFKF GDNTHTLDRF KESILDLKKA LLENSYTKKP 
SDGFDQKVDS ILQTIIRNQD EETEKLLYSS DLKNRKDFNE IILPDYLIDT TKKQPQVQHF
DPRFTVGLIT NYLTRKISSG EIKDKDELVL PYFHWSDFVD LSELNQYFHS KDKADCKFFS
FVPPTRSKLA RLEVLPIESY CLNEDTIDDL LTTSTDSTLV DNLRVIKEQQ YSSGFHIYTA
PGRSTFKLRP ILAKSYLHDF MLAPYSILML LPEQKSINIQ IDRDITKAKK NLLKSGIATS
YIEMFKVENG GDGPVTLNIQ KELSAFIDAT NKHPNLLQPV NTLKYEKALD HAQFIDESKA
ILREMKREDV SKFTLHQKSY FEALEYSIGC QDPPKYFHEA KIFQSERNFA IGAHYDWRFF
NGVVNNKPKH QPALHQLIQA WFRFTNSQDI TTWIAHGTLL SWYWDGMSFP WDNDSDVQMP
IDQLHKLSRN FNQTLIIDIG NDPVTQEIRY GRYFIDCGSF LSSRERGNSN NFIDARFIDV
DTGLYVDITG LAVSRTPSPG RYDGFLTREL ARTPGNSDVS EFMRNDFLQV YNCRNNHFSR
LADLSPLKLS MHEGEYAYLP NQVESALSTE YGEKSIKLQS FNVYTFIPRI RNWIPVKKLK
ASLQNKGVRQ KGTGVSVLEF AEEECLKILS ENSDLLMEYL VTREVTERHE LELISMRDGN
DPKRFFTSSG KLKQGNPLRH DSFSLSAFHD KYKFEESIDQ TVNFISSLEM QTPPEKVEPA
EQTQTKKIST RKDLPKLVE
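
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following sketch shows the effect on one 60 bp row of the gene sequence above (bases 361 to 420, which start in frame). It is a minimal pure-Python illustration, not the pipeline that produced this record: `codon_table` and `translate` are hypothetical helper names, and only tables 1 and 12 are handled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id: int = 1) -> dict:
    """Build a codon-to-amino-acid map for table 1 or table 12 only."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AAS))
    if table_id == 12:
        table["CTG"] = "S"  # the only sense-codon change in table 12
    elif table_id != 1:
        raise ValueError("only tables 1 and 12 are supported in this sketch")
    return table


def translate(dna: str, table_id: int = 1) -> str:
    """Translate frame 1 of a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    table = codon_table(table_id)
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


fragment = "GACCCAAGAT TCACTGTTGG GTTGATAACA AACTATCTTA CCCGAAAAAT CAGTCTGGGA"
print(translate(fragment, 12))  # prints: DPRFTVGLITNYLTRKISSG
print(translate(fragment, 1))   # prints: DPRFTVGLITNYLTRKISLG
```

The table 12 result matches the corresponding stretch of the protein sequence in this record (DPRFTVGLIT NYLTRKISSG), whereas the standard code would put a leucine at the CTG codon.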