Gene PICST_71553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_71553 
Symbol 
ID4838917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp247796 
End bp262786 
Gene Length14991 bp 
Protein Length4979 aa 
Translation table12 
GC content40% 
IMG OID640390232 
Productpredicted protein 
Protein accessionXP_001384347 
Protein GI150865220 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.473575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACA ATGTCGTTGT AAACTTGCCG CATTCGCGGA CGTTGTACCG TCAATATGAG 
GAATTCAATG TGCTCAAGAA TACTCCCAAA GCCTACCAGT TTGATACTCT GAAGACCATA
AAAGAGAACT TGAACTCTTT GGCACTTTTT GCATTGCAAG GCTCGACTAC CTTGCCAATT
TTCTACTCGT TTAAGGATAT TTTCCTCGAC TTGATATCGA GATGGGTTTC ACACCCCGAA
GAGTTTGAGA CCGAGTACCG TGCTGAATAC TCCAACCGGA ATAATGAACC AGCAGGTGTG
ATTCGAGGTT CCACCATATT ACGTGCGTTA TCGAGATTGG TCAATATTTC GAATGAATCA
TTGAATCTAA TTGAATTTTT CATCAACAAA ATCGATTTTT TCACCAGTAT TGATTCTCAT
CTCAGCACTA TTACTGCAGC AGAATTAGAA CATATTCTTC TTTCCTTTTA TAGACTTATT
ACCTACGACA ACCACAGATT CAAGAACTTT ATTCTTCCAC ACGTGTTGTA CACCATTATC
CGCCAAGACA ACCCTGAATA TGATGTTGCG AAGTACTTAG CTATCAGAAT CTTATCACAG
TTCCTAAGTG CTTCTGAACA AGCACAATCC AAAATGATAG ATACCCACAT TGGCCGAGCT
GACAACCTCC GATCTCGCTA TGAAACAGAC CCAGATTTGG ACTATGCATT CTTGTCACTC
TTAGAAGCTA AAAGATTGTC CAATTTCAAC TCCTTGCCAT CTGCATCGGA CACCGTTGTT
GCCAAAACTG AAAACATAGT CATAGCCCCA GGCGATCTCA GCGAGTTAGT GATTTCCATT
TGTGGAGTCT TGGTGCCTAA TATTGTTTCA GAAAAGGAGC CTAAGTCAGT TGCAGACCCT
ACTTTTGTAG CTACCGAGAA CTCTGTGAAT GTGTTGAGAC AATTGGCTCA CAACATTCAG
AATAACAAGC CAGTTATGCT CTACGGAAAG GCTGGTTCCG GAAAGTCTTT CCTTATCAAC
CAACTTGCAA ACTATATGTC ATATGACAAT TCCATAGTCA AGATCCATTT GGGCGAACAA
ACTGATGCTA AGCTCTTGTT GGGTACGTAT TCGTCAGGTG AAAAGCCAGG CACTTTTGAA
TGGAGGTCTG GTGTCTTGAC ATCCGCTGTT AGAGAAGGCA AGTGGGTATT GATTGAAGAT
ATTGACAAAG CACCTACAGA GGTGTTATCC ATCTTATTGA CATTATTGGA GAAGAGGGAA
TTGACTATCC CATCTAGAGG TGAAGTTATC AGAGCCAAAA ATGGATTTCA ATTGATTTCC
ACTATTAGAA TCTCAGACGA TTCAAAGTTC ACCGTTCCTG ACTTGATTGG TTTGAGACTA
TGGGAACTTA TTAAGGTTGA AGTTCCCAAT GAAGTAGACT TAAAGAACAT TTTGATCACG
AGATTCCCAT TACTTCAGAA TTTGATCATT CCTTTCATTA AGTGCTACAA GGAAATTACA
CGTATCTACT CGTTGACTTC ATTCATCTCT TTGAACAAGG GTTCTCATCC CAGAGTGATC
TCTTTCAGAG ATTTAATGAA ATTTTGTTCT AGATGCAATT CCATGTTGAT CAACTACGGT
GTGTCTACGC CCAATCAATT GCTTGAATCA ACTATATATG ATAACATCTT TGCAGAAGCT
GTAGACTGTT TTGGAAGTGC CATTACCGAA CAGACAGCTT TGACACCTCT TGTAAATGCA
ATTGGTGAAA TCTTGGAAGT TCCAGAGTCC AGAATCAACT TATTCTTAAC GAAGCATATC
CCTAACTTTA TTAATGATGA TGAAAAATTC CAAATAGGCA GAGCTACTTT ACGCAAGTCT
GCAACCGACA AAGCTTTATA CACTCTGAAG CAAACTGGTA ACAATACTTC GTTTGCCAGA
ACAAATCATT CCTTAAGATT GATGGAACAG ATCGGTGTTT CTATCCAGAT GGCAGAACCA
GTTCTTTTAG TGGGTGAAAC TGGTACTGGT AAAACCACAG TGGTTCAACA AATTGCCAAG
CAGATGAATA AGAAACTTAC AGTCATCAAC GTTTCTCAAC AAACGGAAGC TGGTGACTTG
TTGGGTGGTT ACAAGCCTGT CAACACCAAG ACAGTAGCTG TTCCCATTCA GGAAGTATTT
GAGAATCTCT TCATAGCCAC TTTTTCAGGC AAGAAGAACG AGAAGTTTTC TCGCGTGTTG
TCCAAGTGTT TCAACAAAAA CCAATGGAAG AACGTCATCA GATTATGGCG TGAAGCCATT
AAGATGTCGA AGGATATTCT TGCTCAGACT ACAGAATCAG ACGAAGACGG TGCTCCGAAG
AAGAAAAGAA AGTTGGGCTC TCTTGAGAAA TCCATCCTTC TTGAGAAGTG GGTGGAATTT
GAGATTAAAG TCAAAGACTT TGAAATCCAA ACCGCCTCTC TCGAGAACTC GTTTGTCTTT
AACTTTTTAG AAGGCTCTTT GGTTAAGGCA GTCAGAGAAG GTGAATGGTT GTTACTTGAT
GAAATTAACT TGGCTTCTGC TGACACTTTA GAAAGCATCG CCGACTTGTT ATCTGAGAGC
CTTAATCAAA GATCGGTACT TTTGACTGAA AAGGGAGACG TTGAATCGAT TAAGGCTCAC
CCAGAGTTCA GAATCTTCGG CTGTATGAAT CCTTCTACTG ATGTGGGTAA ACGAGACTTG
CCATTGAGTA TTCGTTCCAG ATTCAGTGAG ATTTATGTTC ATTCTCCAGA CAGGGACATC
AATGATCTTC TTTCAATCAT CGATAAGTAC GTTGGGAGAT ACGCTATTGG TGATGAATGG
GTTGGTAATG ATATTGCCGA GTTGTATTTG TCGGCCAAAG ATCTTTCGGA ATCCAACAAG
ATCGTTGATG GTGCCAACCA GAGACCTCAC TTCAGTATTC GTACTTTAAC CAGGACTCTA
ATCTATGTTT GTGACATTGT TTCAATCTAT GGGTTGAGAA GGTCTCTTTA CGAAGGTTTT
TGTATGGCAT TCTTAACATT GTTGGATGTA AAGTCAGAAG AAATTTTGAG ACCCGTAATA
GAAAAGCATA CTATCGGAAG ATTAAAGAAT GCAAAATCTG TAATGAGTCA AATTCCACCA
GCGCCCAGTT CGAAAGCTGA AGAATTTGTT CAATTTCGTC ACTACTGGAT GAAGCATGGA
CCCGAGGAGG TCATTCCTCA ACCACATTAT ATCATTACTC CATTCGTGGA AAAGAATATG
CTTAATTTGG TTCGTGCTAC AGCTGGTAGA AGATTCCCTG TCTTGGTTCA AGGTCCAACT
TCAGCCGGTA AAACTTCCAT GATTCACTAT CTTGCTAACA TTACGGGCCA TAAGTTCGTT
CGTATTAACA ATCACGAGCA TACAGATTTG CAAGAATACT TGGGTACATA TGTATCTGAT
TCTACTGGTA AGTTGTCGTT CAGGGAAGGT ATCTTGGTTG AAGCTTTGAG AAAGGGTCAT
TGGATTGTCT TGGATGAGTT GAACTTGGCT CCAACTGACG TGTTGGAGGC TTTAAATCGT
TTGCTTGATG ACAACAGAGA ATTGTTTATT CCCGAGACTC AAGAAGTAGT CCACCCACAC
CCCGACTTCA TGCTCTTTGC TACGCAAAAT CCTCCTGGTA TTTATGGTGG TAGAAAGGTG
TTGTCGCGAG CTTTCCGTAA TAGATTCTTG GAACTCCACT TCGATGATAT TCCACAGGAT
GAATTGGAGA CGATCTTGAA GGAAAGATGT CAGATCGCTC CTACCTATTG TAAAAAGATT
GTTGAAGTGT ACAAGAATTT GTCTATTCAA CGTCAATCTA CAAGGTTGTT TGAACAAAAG
AACTCGTTTG CTACATTGAG AGATTTGTTC AGATGGGCCC TTCGTGATGC TGTCGGATAC
GAGGAGCTTG CTGCCAATGG GTACATGTTG TTAGCTGAAC GTGTCAGAAA GGAAGACGAA
AAGAGAGTCG TCAAGGAAGT CATAGAGAAA GTTATGAGAG TGAAACTTGA TATGAATGCC
TACTACAAGA GCTTAGAAGT CGAAGCCTTG ATGAAGGTTG GCACTTCCGT GGTTTGGACT
AAGGCTATGC GTAGACTTGC TGTATTGGTT TTCACTTCTA TGAAGTACAA AGAGCCACTC
TTATTGGTAG GTGAAACCGG ATGTGGAAAA ACTACTGTCT GTCAGATCAT TGCTCAGTAC
CTTAGAAAAG AACTCATCAC TGTCAATGCC CATCAGAACA CAGAAACCGG TGACATCCTC
GGTGCACAAA GACCAGTACG TAATAGGTAC GAAAACCAAA GAATGCTTTT CACAGATCTT
GTTGCTCTCT TTGGCATCCT CAACATTGAT ATTTCTATGG AAGATAGCAC CCTTGAACAC
TTGCTTGCCA AATTCGATTC ATTACAAGAT TATGGTGAAG CGGATCCGGA GTTGATTTCT
AAAATCAAAG AAGGTAGAAG AAACTCCGCT GTTTTGTTCG AATGGAATGA TGGTCCTTTG
GTTCGTGCCA TGAAGAATGG AAGTTTCTTT TTGTTGGATG AGATTTCCTT GGCCGACGAT
TCTGTCTTAG AAAGATTGAA TTCAGTATTA GAGCCAGAAA GAAGTTTGTT ATTGGCTGAG
AAGGGTACTG ATGATGTCTT TGTCACTGGT GCCGAAGGTT TTGAGTTTTT GGCTACCATG
AACCCAGGTG GGGATTACGG GAAGAAAGAG CTTTCTCCGG CTTTGAGAAA CCGTTTCACG
GAAATCTGGG TGCCTTCAAT GGAAGACTTC AATGACGTTA GAGAAATCGT AGCTGACAGA
TTAGTTGACA AAGTTCACGT TGAGGCTATT GTCCAATTTT CTGAATGGTA CGGAAAGAAG
TTTGGAGGTG GCCATGCTGA CAGTGGTGTG ATTTCTCTTA GAGATATCCT TGCCTGGGTT
CAGTTTATCA ACTCGTGTGG AAATAGAGCA ACCAGCGAAG AGGCTTTATT ACATGGTGCT
TCTATGGTTT TCATCGATGC ATTAGGAACA AATAATACTG CCTTCTTAGC TGAGAATGAA
GTCAAGTTAC GCGAGCAAAA GGTTGAATGT GTCAGCGCCT TGTCCGAGTT TAGCGGAAAA
GATCTCTTAA GCATTTATTC TAATAGCTTT GAAGTTTCTT TGACCGACTC CAATGTCTCT
GCTGGCCCCT TTTCTATTGC CAGAGTTGAA GGTTCTGTTT CTAGTGATTC CTTTAACTTG
CATGCTCCAA CTACGTCAGC CAATACAATG CGTGTCATTA GAGCAATGCA AGTGCATAAG
CCAATTTTGT TGGAAGGTAG TCCTGGTGTC GGTAAGACTA GTTTAGTTTC AGCTATCGCC
AAAGCTACAG GCAACTATTT GGTTCGAATT AACTTGTCCG AACAGACTGA TTTGATTGAT
TTGTTTGGTT CTGATGCGCC AGCAGAAGGA GGAAAGACTG GTGAATTCGT CTGGCGTGAC
GCTCCTTTCC TCAGAGCAAT GCAAAGAGGT GAGTGGGTAT TGCTTGATGA AATGAATTTG
GCTTCCCAAT CTGTCTTAGA AGGTTTGAAT GCATGTTTAG ATCACCGTGG AGAAGCATAC
ATTCCAGAAT TAGACAAATC CTTTAGTTGC CATCCGGACT TCAAAGTTTT CGCTGCTCAG
AATCCACAGT ACCAAGGTGG TGGCAGAAAG GGTTTGCCCA AATCGTTTGT CAATCGTTTT
TCTGTCGTCT ATGTGGATAC ATTAAAATCA GAAGATTTGA ATTTGATTTC GCAACACTTG
TATCCTGAAA TTCCAAGTGA TGTTTGTTCT AAAATAATTG ATTTCATGTC CAAACTTGAA
GAACAAGTAG TGATCAAGAA GAATTGGGGG TCATCTGGTG GTCCTTGGGA ATTCAATTTG
CGTGATACTT TGAGATGGTT AGAATTGTTG AACGCTAAAT CCATCTGCGA AGATATCAGT
CCTGCTGATT TCCTTAACAT GATTGTCATC CAAAGATTCA GAACCGAGGA AGACCGAGCT
AAGGCAGTTT CTTTATTCAA GTCGATTTTT GGAGAACCAC TTCACCGTGA CAATTTTTAC
TTCGCCACAG AATCATACTT GCAATCTGGA GAAGCACTTA TTAAGAAGAC GAGCTCGGTC
CTGAGCTCTA GTGGATCGAA GGCTATTCCC TTGCAATGTA ACTTTTCTCT TATCGAGACC
TGTCTCAGAT GTGTCAATAG GAACATTCCC CTAATCTTAA CCGGTCCTTC TAATTCGGGA
AAGACTGAAT TAGTCAGACT TGTTGCCAAT GCTGTTGGAG CTAAGGTGGA TGAATTTGCT
ATGAATAGTG ACGTTGACTC CATGGATATA CTTGGTGGTT ACGAACAAGT TGATTTAACT
AGAGCAATTG TTGATGTCGT GGCTAAGGTT TACGATACCT TAATTGACTT AGTGTTGTCT
AATTTGTATG TCAATGATTC AGAATCCAGT ACTTTGTCTC AATCGTTGCA ATTAATCAGC
TACATTTCCA ACAAAGAGAT CTCTGTATCT AACTACAGCC AATTCCACTC ATTCTTGAAA
TCGTTTTTGT CATTCTTCTC TAACGAAGCA CTTATTTTGT TACTTGAAGC CTCTGAAAAG
CTCGAAAAGA GAATTCAAGA AGACAGGAGC GTCAAATTTG AATGGTTTGA TGGTTTGTTG
GTGAAGGCAG TTGAAAAAGG TAACTGGTTA GTTCTTGATA ATGCAAACTT ATGCAGTCCT
TCTGTTCTTG ACAGGTTGAA TTCCTTGTTG GAAACAAACG GTTCCTTGAT AATTAACGAA
TGCAGCAATG CCGACGGTCA ACCTCGTACA CTCAAGCCAC ATCCAAACTT CAGATTATTT
TTGACAGTAA ACCCTAAATA TGGAGAGTTG TCTAGAGCTA TGAGAAACAG AGGCATTGAA
ATCTACTTGG AATCGTTGTC AGAAAGAATC ACCTCTTTTG ATGGAAGTAT CCTAGGCTTA
ACCAAGACTA AACATGACAA GGATATCACC GAGAGATTAA ATGAATTGAA GGTAGTCGAA
TCGTTTATCC CTACAGTCGG TTTTGTACCT GCCAATGATA CACCGACCAG AACATTTGCT
CTCATTGATG ACGTTATGAA TTTAAATTCT AAAATTACGT CCGAAAGCAT TAACGCTTCT
TTAACCTTGT TGCCATTTGT GTTAACCAAC AAAGTGGAAT CTTGGGCTTC ATTGGTAGAA
GTGTCCAGCG AGTTTAATTC CAAAACTAAG GAGATAGCTC AGGTTATGCT AAGAAGATGG
AAGTTCCTTT TGGAAGAAGG CATTGTATCA AAGTTGGCAT CATTATTCTC TAAATCTGCT
GTTGCTGCCA ATCAAATCGT TGGTATTAAT ATCAACTTTT CAGAATCTCA ATATTTGCAT
CCGCAGCTCA ACAATTATTT GACAGCATTA GTCAAGAATA TCGCACCTGA ATTTGACTCT
TCTGAACCAA CTATGTTCTT TGAAGTTGTT TCTAATATCA TTGAATCTAT TGAAAGGAGA
AAGACAATTG AAGAGAGGGC CAATAATGCC CGAATCAACA CTTTGTCGTA CGTGGAAAAG
TCAGCTGCTT TTGAACTTGG AAGAGAAATC AAGTCTCCTC CAAACTTAAG AATCTTCAAA
TTTATCGTGG AAGTAGAAGA GTTCGTTAGA AATGTCTTCA CTTCAAGCAT AGAATCTCGT
CTTTTCTCTT CGGACTCCAC ATACAAGAGC CTTTACGATT TGCAGATTTT AGTGAATGGT
ATTGCTCGTT CTTCGGAAGA AAAAAGTGAA TCCAAAATCA GAATTTACCA AGAGTTAATT
AACAAATGGA TCGACAATTA CTCCGAAGCC GAATTTATAG CTCCATTTAT ATCTGATTTA
TCATCTTCGT TAACAGCCTT TGGAGATAAG TTGGTGTTAA CTTCTGGATT TTCAATGGAT
GCCCTATGGG AGCAATTTAG ACGCACTTAT CCTCGAGATG AAAACAGCTG GAATAATGCC
CAAAATTTGT TAAGGTTAGC AAATGAATTT GATCTCATTG CAAGAGAACA ATTTGTTGAC
GTCACAGAGT ACATTGTTCA ATTAAGATTG CTTTTCATCC AATTGTACAG CTATCTTTTG
TCTGATTCAT TCAACGCCGA GGAATATGCT CATATTTTGT CAACCCTAGA ATTAGGTATC
AGTAAATTAA GTGATGCTTC CTCCAGTTTC CTTTACAAAC GCCAGAACAC CTTGGAGTTG
GAATTTGCCT ACCTTCATAA TTTTGTTAAT TCACCTTCTG CCAGTAGCAA CTGGAACTTA
AACGATAGCA TTTATTTGGC TTCACTTGCC AATAGAAGTA CTGCTGCTTT GTTACAAAAT
CCGGACTGCA AGGAATTTTA TCCTTACCCA GCCGTGCTCG ACCTGCTTTG GCACGAAGTT
GAAGGTTCTA GTGTCTCTTT GAGTGAGGGT TTGTTCTCTA ATGATTTTGC CAGTAACTTG
TTGAAGAAAT CTGGAACATT TGGATCTGTT GCCGGTAAGT ACTTGCAGCA AGACTTGGAA
GATTTTAAAT TGATGGGAAG AGCTATTGTT AAACACTCCG CTGAGATTTT GATTGATCAA
AGACAATCAT TTAGGACAAT GCTACAGTCA TGGTTCTTCA GAATTTTGGA ATCTCATTCG
GAGACATTCG ATGAATGCAC ATATACTGAA ATTGTGGCAA AATTGAACCT AGGTGATATG
AATGCTTTTG ATTTTGAAGC CGCCTGCAAT TTGATTCAGA GATCGACTGA TGAGTATTTC
GTAACCGTTG TTAATGATTT CTTTATTCCA TCGATGATTG TTTCGAGAGA TTCTTCTAAC
TTGGGCTCGT TGGGTAAAGC ATGGGTTCTT TTTTCGTGTG GTTGTATTCA ACTTTTCGTT
CCAAGCTCAC CATATGATCC AGCCATTAGA GAGTACATCA TCTATGACAG ATTTGAAGAA
CAGAAGCAGG CCTCAGAGAG GCTTATCTCT TCGTGGCGCA CTATTAGAGC AGCTGTCAGT
GGTGATGAGA AGATTCACAT CGAAGACTGC TTGTTGCCAA TTTCGGATGA CGCTGCCCCT
CAAAAGCCTA TGGTGTATAG GCCAACAGAG TCCTTTGACG GATTATTTGA AGAGTGGAAT
TCTTTTATGG ACTCAACTAT CAGCATTGCT CCTGTTCAAG CTTTGTTGAA AAGTGCTGAA
AACTTGTCTG CTTCAAGCGA AAAGATGTTA GATATTTTTC AAAAAAACTC TTCTAAGTTC
TTGTTCCGCT TAAGACTGAA TTATTTGGTA TACTCCGATT TGAATGATAT ATTGAAAGGC
TACATTTATG GCCTCAAATT GGGGTTCGAA TTGTTGTCGA TTGAAGGCAA GAGACTGACT
GCTGGTTTCA AGTATGTTGA TCTTTGGTCT GTAGATGCTT CTGAACTTGC TATTGAATCA
AGTGTCATAT CGATGTTTGA GAAGGCTAAG AACTTCAACA AGAAAATTAG CGTAGATTCA
CAGGCTTCAG AACAATTGAT GCTTTTCTTC ATCAAGTTGT GCTACTCTCA TAAGGCAGCT
AACACAAACT CTGTCTTGTC AGATGTTCAC ATTCAGTCTT TCCAATCGTT GTACTACAGG
TGGACTTTAC GTCGAATGAA AGAAGAAGAA GAAAATTTGC AAAAAGGTAA CTTGTACAAG
TACTCTGATC CAGATTCGGA TATTGAGGGT GATTTCAGAA AGTTATTCCC AGACTACGAA
GAGGTGTTTG ATTCTGATGT CACCACCAGC AAGAATAATA ACAGTTTTGA TACATTGTAC
TACGACATTA CTCAAGCATA CATCGACTAC TACACTCATA ACAAGACCAA TGACTTGAGT
TCGTTGGTGA ACGAAGGCAA TGAATTGAAC AAGACATTGT CTCAGTACAG ATCTGAAATC
ATGAAGGATA AAATCAATTC TTCAGTCTTG TCTTCGTTGA TTACTTCTAT GGCCAGCATC
TACAAGAAGT ACAACAGCAA CGACGGTCCA TTTGATTTCT ACCACGACAG CTCCCCAAGT
GAAGCAAAAA AGTCCATTGT GATAATTTCC AGAGTACATG CTTCTGTTTC CAAGTTCCTT
GAACAGTGGC CTGAACACGC GACGTTGCAA AACATAACAA AGGCTTCAGA TGAATTTTTG
ATGCATCCAT TGGACTCACC AGTTGCTAAG TTGTTGCAGA AGGTGGAACA GATCTATACA
TTGATGGCTG AATGGCAGAA GTATGCCTCC AGCCAAGTTT CATTGAAGGA AAATTTTGAC
GAGTTGACCA CCTTGATTGT GAACTGGAGA AGATTGGAGT TGTCAACTTG GAACACTATA
TTCTTGAGCG AAGAAAAAGC TCTTGAAAAA GGTATTGGTA AGTGGTGGTT CCACCTTTTC
GAATCTATCA TTGTTCCGAT TCTTGCTGAT AATGAACAGG CAGAATTCTC GCCAACGAAG
TTGTTATCAG CCCTAAATGT TTTCATGAGT AAAACTACTT ATGGGGAGTT CGTTCCAAGA
TTAAACCTCC TCAAGGCATT TAAGAACCAT GTTTCTCAAT TGTTAGGAGA AAGCAATGTC
TATCATGCGC TTTCCAACTT CATCTTGTTC TACGAACAAT TCGTTCCTTT GGTCGCTGAT
GCTATTGCGG CAACTAGAAA GACACTCGAA AAGAATATTT CTGAAGTTAT CTTATTGGCC
AGTTGGAAGG ATGTGAACAT TGATGCTCTC AAGCAAAGTG CTAGAAAGTC TCACAACAGT
CTTTACAAGG TTGTGAGAAA ATACCGTGCT ATTCTTGCTA CTGAGGTGTC TCCGTTAATT
GAAGCTGGTA TCGCTGTAGA AACAAAATCC GAAGTTAACA TGAAAAGTTT GTCGAGAATT
GGAGGCATTG TCGTAGATAA TGATTTGGAA ATTTGTTCAG CAGTCAGTTC CTGGAAAGAG
AGACCTGCCA GATTGCAAAA CTTGGACCTT GTAAATAGGA ATATGGGAGT CTACATAAAC
AGAATCACAG ACGAGAGTCT TCCTAACCTA ATGGATTACG CCAAGGAATT GTTTGAAGAA
ATGGAAAGAC TTAGAAAAGA AACTCCCAAG GAATTAAAAG AAGCCAACAA GAAGTTGATC
ACTGCATTGA AGACACAAAA GAAGAAGTTA TTGAGTGACA CTTTAAAGGA ACTTAGACGA
ATTGGATTGA AGACTGCTTT GAGAGCTGAT ATAGTCAAGG GTCAAGGTGG TGTTAATGTA
ATTGTCGGTA ATAGTTCAAG TTTCGAAGGG ACTGAATTAG AATCGTCTGA TGTCTACTTC
TACAGAATAT TGGATATTCT TCCTAAATTA AGAGCATCCA TTTCCAACGT TGCTGAAGAT
GTTCCGCAGG TTGATGCTGA CAAGGGTCTT GCTGCAGCTG AGAATTTGGT GAACACTATT
GTAGCTACTA GAGACCCATT ACGAGTTTTC TCTGAGTACA TTCCTAAGTT GGCTAAAGTC
TACAAGTCGT TAGAGATTAT TTCCCAATCA AGAAGCGAGA ATATAGATCT TGCTAGAGCA
TCTGAATATT CTTCAATTGA GATGAACTTG CATGGAATCA AATATGTTCT TCATTGGCTA
CCAAAATTGA TTGACTTTGC GTTACTCACT GCTCAAGCAG TTGACACATT CGAACCAGGC
ACTCAATCAA AGTTGTTCTA CGACATTAAA CTCGAACTTT CGTCTTTGAA AAACGAATTT
GATAGTATCG ATCCATACGT CCGTACATCA GCTACTTCTA TGTTCATCCG GAAGTTTGCC
GAAGAGTATA TTGGGTTCTC AAAGAGGCTC AATAGCTGGA AGCTCCAGTA CAAGAATGCT
GGCTTCATTG CCGAAGTGGT TTTGTCTTGG ATGAATAGTG TCAACTTCTC AGGCTTAGTT
CAGTCTTCTA CGTCGTTGAA TAGCTTGGGT TCATTGGAAA ATGTGGAAAT TTCTCTTAGA
AAGCTCTCCA ACTCTATTAT TGTTGCTGTT CAAAGAGTTA CTGAATGTCA AGAAGGTGAA
ATCACTGAGG AGAATGATAA TTGGTTGGTT TTAACACAGA CCAGAATCAT GAAGTACATT
AAATCATTAC ATTCCAAGCG TATACATGAT GAGCTTGCTG TCTGTCTTGA TACCATTTCT
TCTGTGGAAC AGAATACAGA GACTTCTAAA GTTACTTCAG CGTTGGTTTC TTTCACTCTT
CCTTTGATAA ACCATTATTT CCGATTAAGT TCAAATGTGT TTGAAACAGC CAGAGATAAC
TACTACAATA CTTCTAGGGC TACTTTCATG ATGACTGTTG CATTATACAA TTTGGCTACT
AAGGGTTTCT GCTCTCCAGA ACCACCTACG GAACAAAAGG AAGATAACAA CTTGCACGAT
GGGACTGGTT TAGGTGATGG AGAGGGTGCG ACCAACAATT CGAATGATGT TGAGGATGAT
GAAGATTTAT CTGAACAAGC TCAACAACCC AACGAAGAAA ATAATGACGA CGACAATGAA
GAAGAAAACG ATGATGCTGT AGATATCGAA GGTGATATGG CCGGCAATCT AGAGAATGCT
TCTGATCAGG AAGATGACGA GAATGACGGT GATGAGGATG ACCAAGAAGA TTTGGATGAA
GAAATTGACG ACCTTGATGA CCTCGATCCC AACGCCATTG ATGAGAAGAT GTGGGACGAA
GAAGTTAAGG AAGACAAGAA TGAAAAGGAA AGCGATGAAG TTCCTGAAAA TGCCAATAAG
GATGACGATA ATATGGAAGC CAACGAAGAT GAAGACAGCG AACAGAGTAA AGACAAAGAA
AAAAAGCAAT CCGAACAAGG CGAAGAAGAT AAAGAAAGCG AAGAGAAAGA AAAAGAAGAA
GAAAATGGCA CAGATGAAGA AGGCGAAGAA AATGATGTTG GTGAGCAAGA AGACGAAGTC
AATCATCAAG ACAACGAGCA AGTGGAAGAC CATGTTCCTC AGACTGAGGC ACTTGATCTT
CCTGACGATA TCAATTTAGA TGACGACGAA GAGGCTGATG GTGATGATAA GGAAGAGGAA
TTTGACGATA AGATGGATGT TGATGATGAA GAAGAAGAAG AAGCCCAAGG TAAAGAAGAA
GAAAACCAAT CTGATGCTAA GGAAGATGGC GAAACCGAAG AAGGTGTCGA CGAAGAAGGA
GAAGAAGACG AAGAAGGAGA AGATCATGAC CAAGAAAATG GACAAGAAAA TGACAACAAT
GAAATTGGTG AAGAGCCTCA AGAAGAAAAT GAAAACGACG GAGAACAATC TGACGAAGAG
ACTCTTGGTA ACACAGATGA AAAAGAAGAA GAAAATGAAC ATGGCAATGA TGCAATGGAT
CAGGATGCTG AAGGTGTTGA TGGAGCGAAT CAAGATATTC CCAACGAAGA CATTGATGCG
GAGTCAGCTG TCAAACAAGA GGCTGGTGAT AAGGGTGAGG GTGCTGATAA TCAAGTTTCC
GAAGAGAATG ATAATATTGG TTCTACTGGA AATGCCTCAT CAGATCCAAA CCAACAGGAT
AAGGAGGAAG ATAATTCTGT TAAAGACGAT GCTCGTGATA TGGCCAAGGA ATCTTTAAAA
CAATTGGGAG ATACCTTGAA GGAATTCCAC CGTCGTCGTC AAGAGATTAA GGAAGCTGCC
AAGGAAGAAA AGGAAAAGGT AGAAGAAAAA GCCAACGAAA GACCAGATGA GTTTCAACAT
GTTCAAGGTG AGAATACTGA TTTCGACACC CAAGCATTGG GAGCAGCCGA TAAGGACCAA
GTCCAATCGT TGGATGAAGA CATGGCAATT GATGATGATA TCGAAGAGGA TAGGGAAGAG
AAGCCAGAAA TCAAACAAGA GGCCGAGGAA GAAGTCGGCG AAGAAGACAT GATGGACGTT
GATGAAGACT TGGATGAAGT GAAAAACAAC GCAGAAGATG ATAACGAAGG AAAGACAAAG
GGTGCCTTCA TTGGAGAAAG AGATCATAAG AATGATAGAG ACGGAGAGAT TAGTATCAAC
AATGAAATCA ATATTGACGA TGATCAGGAG TCAGAAGACG AAGAAAATAA CCTTGTGCCT
GATGCTATTG ATGGTGAAGA AATACCTACT ATTGAAATCG ATGAAGCTCG TCAATTGTGG
AAACACAGTG ATCTTGCTAC TCAAGAATTG GCCTCGGGTC TCTGTGAACA ATTAAGATTG
ATCTTGGAGC CTACTCTAGC CACCAAGTTG AGAGGTGATT TCAAGACCGG TAAGAGGTTG
AACATGAAGA GAATCATTCC CTACATAGCC TCTGAGTTCC GAAAGGACAA GATCTGGTTG
AGAAGAACGA AACCTTCTAA GAGACAGTAC CAGATAATGA TCGCTGTCGA CGACTCCAAG
TCCATGAGTG AGAGTAAATC TACCCAGTTG GCTTTCCACA GTATTGCCTT GGTTGCTAAG
GCATTGACCC AGTTGGAAAG CGGTGGTTTA TCGATTGTCA GATTTGGGGA GGACGTCAAG
GTAGTTCATC CATTTGACAA GCCATTCAAT GGCCAAGAGA GTGGAGCCAG AATCTTCCAA
TGGTTCGACT TCCAGCAGAC CAAGACGGAT ATTAAACAGT TGTGCAACAA GTCGTTGAAG
ATATTTGAAG ATGCGAGATC GTCTTCCAAC TCTGACTTGT GGCAGTTGCA AATCATCATA
TCGGACGGTG TCTGTGAAGA CCACGACACC ATTGAGAGAT TGGTTAGAAG AGCCAGAGAA
GAGAAAATTA TGTTGGTGTT TGTCGTCATC GATGGCATTA ACTCCAACGA GTCTATTTTG
GATATGAGCC AAGTACAATA CGAAGCCGAT CCTAAGACAG GAGCCATGAA CTTGAAGGTT
GTGAAATACT TGGACTCATT CCCATTCGAA TTCTTCGTTG TTGTCAGAAA CATCAACGAA
CTTCCAGAGA TGCTTGCCTT GATCTTGAGA CAATACTTTA CCGAAGTTGC CTCAATTTAA
CATTACTGTA TAAACAAAAC ATTGCATAAT AGACAAAATT AGGATTATGG A
 
Protein sequence
MNNNVVVNLP HSRTLYRQYE EFNVLKNTPK AYQFDTSKTI KENLNSLALF ALQGSTTLPI 
FYSFKDIFLD LISRWVSHPE EFETEYRAEY SNRNNEPAGV IRGSTILRAL SRLVNISNES
LNLIEFFINK IDFFTSIDSH LSTITAAELE HILLSFYRLI TYDNHRFKNF ILPHVLYTII
RQDNPEYDVA KYLAIRILSQ FLSASEQAQS KMIDTHIGRA DNLRSRYETD PDLDYAFLSL
LEAKRLSNFN SLPSASDTVV AKTENIVIAP GDLSELVISI CGVLVPNIVS EKEPKSVADP
TFVATENSVN VLRQLAHNIQ NNKPVMLYGK AGSGKSFLIN QLANYMSYDN SIVKIHLGEQ
TDAKLLLGTY SSGEKPGTFE WRSGVLTSAV REGKWVLIED IDKAPTEVLS ILLTLLEKRE
LTIPSRGEVI RAKNGFQLIS TIRISDDSKF TVPDLIGLRL WELIKVEVPN EVDLKNILIT
RFPLLQNLII PFIKCYKEIT RIYSLTSFIS LNKGSHPRVI SFRDLMKFCS RCNSMLINYG
VSTPNQLLES TIYDNIFAEA VDCFGSAITE QTALTPLVNA IGEILEVPES RINLFLTKHI
PNFINDDEKF QIGRATLRKS ATDKALYTSK QTGNNTSFAR TNHSLRLMEQ IGVSIQMAEP
VLLVGETGTG KTTVVQQIAK QMNKKLTVIN VSQQTEAGDL LGGYKPVNTK TVAVPIQEVF
ENLFIATFSG KKNEKFSRVL SKCFNKNQWK NVIRLWREAI KMSKDILAQT TESDEDGAPK
KKRKLGSLEK SILLEKWVEF EIKVKDFEIQ TASLENSFVF NFLEGSLVKA VREGEWLLLD
EINLASADTL ESIADLLSES LNQRSVLLTE KGDVESIKAH PEFRIFGCMN PSTDVGKRDL
PLSIRSRFSE IYVHSPDRDI NDLLSIIDKY VGRYAIGDEW VGNDIAELYL SAKDLSESNK
IVDGANQRPH FSIRTLTRTL IYVCDIVSIY GLRRSLYEGF CMAFLTLLDV KSEEILRPVI
EKHTIGRLKN AKSVMSQIPP APSSKAEEFV QFRHYWMKHG PEEVIPQPHY IITPFVEKNM
LNLVRATAGR RFPVLVQGPT SAGKTSMIHY LANITGHKFV RINNHEHTDL QEYLGTYVSD
STGKLSFREG ILVEALRKGH WIVLDELNLA PTDVLEALNR LLDDNRELFI PETQEVVHPH
PDFMLFATQN PPGIYGGRKV LSRAFRNRFL ELHFDDIPQD ELETILKERC QIAPTYCKKI
VEVYKNLSIQ RQSTRLFEQK NSFATLRDLF RWALRDAVGY EELAANGYML LAERVRKEDE
KRVVKEVIEK VMRVKLDMNA YYKSLEVEAL MKVGTSVVWT KAMRRLAVLV FTSMKYKEPL
LLVGETGCGK TTVCQIIAQY LRKELITVNA HQNTETGDIL GAQRPVRNRY ENQRMLFTDL
VALFGILNID ISMEDSTLEH LLAKFDSLQD YGEADPELIS KIKEGRRNSA VLFEWNDGPL
VRAMKNGSFF LLDEISLADD SVLERLNSVL EPERSLLLAE KGTDDVFVTG AEGFEFLATM
NPGGDYGKKE LSPALRNRFT EIWVPSMEDF NDVREIVADR LVDKVHVEAI VQFSEWYGKK
FGGGHADSGV ISLRDILAWV QFINSCGNRA TSEEALLHGA SMVFIDALGT NNTAFLAENE
VKLREQKVEC VSALSEFSGK DLLSIYSNSF EVSLTDSNVS AGPFSIARVE GSVSSDSFNL
HAPTTSANTM RVIRAMQVHK PILLEGSPGV GKTSLVSAIA KATGNYLVRI NLSEQTDLID
LFGSDAPAEG GKTGEFVWRD APFLRAMQRG EWVLLDEMNL ASQSVLEGLN ACLDHRGEAY
IPELDKSFSC HPDFKVFAAQ NPQYQGGGRK GLPKSFVNRF SVVYVDTLKS EDLNLISQHL
YPEIPSDVCS KIIDFMSKLE EQVVIKKNWG SSGGPWEFNL RDTLRWLELL NAKSICEDIS
PADFLNMIVI QRFRTEEDRA KAVSLFKSIF GEPLHRDNFY FATESYLQSG EALIKKTSSV
SSSSGSKAIP LQCNFSLIET CLRCVNRNIP LILTGPSNSG KTELVRLVAN AVGAKVDEFA
MNSDVDSMDI LGGYEQVDLT RAIVDVVAKV YDTLIDLVLS NLYVNDSESS TLSQSLQLIS
YISNKEISVS NYSQFHSFLK SFLSFFSNEA LILLLEASEK LEKRIQEDRS VKFEWFDGLL
VKAVEKGNWL VLDNANLCSP SVLDRLNSLL ETNGSLIINE CSNADGQPRT LKPHPNFRLF
LTVNPKYGEL SRAMRNRGIE IYLESLSERI TSFDGSILGL TKTKHDKDIT ERLNELKVVE
SFIPTVGFVP ANDTPTRTFA LIDDVMNLNS KITSESINAS LTLLPFVLTN KVESWASLVE
VSSEFNSKTK EIAQVMLRRW KFLLEEGIVS KLASLFSKSA VAANQIVGIN INFSESQYLH
PQLNNYLTAL VKNIAPEFDS SEPTMFFEVV SNIIESIERR KTIEERANNA RINTLSYVEK
SAAFELGREI KSPPNLRIFK FIVEVEEFVR NVFTSSIESR LFSSDSTYKS LYDLQILVNG
IARSSEEKSE SKIRIYQELI NKWIDNYSEA EFIAPFISDL SSSLTAFGDK LVLTSGFSMD
ALWEQFRRTY PRDENSWNNA QNLLRLANEF DLIAREQFVD VTEYIVQLRL LFIQLYSYLL
SDSFNAEEYA HILSTLELGI SKLSDASSSF LYKRQNTLEL EFAYLHNFVN SPSASSNWNL
NDSIYLASLA NRSTAALLQN PDCKEFYPYP AVLDSLWHEV EGSSVSLSEG LFSNDFASNL
LKKSGTFGSV AGKYLQQDLE DFKLMGRAIV KHSAEILIDQ RQSFRTMLQS WFFRILESHS
ETFDECTYTE IVAKLNLGDM NAFDFEAACN LIQRSTDEYF VTVVNDFFIP SMIVSRDSSN
LGSLGKAWVL FSCGCIQLFV PSSPYDPAIR EYIIYDRFEE QKQASERLIS SWRTIRAAVS
GDEKIHIEDC LLPISDDAAP QKPMVYRPTE SFDGLFEEWN SFMDSTISIA PVQALLKSAE
NLSASSEKML DIFQKNSSKF LFRLRSNYLV YSDLNDILKG YIYGLKLGFE LLSIEGKRST
AGFKYVDLWS VDASELAIES SVISMFEKAK NFNKKISVDS QASEQLMLFF IKLCYSHKAA
NTNSVLSDVH IQSFQSLYYR WTLRRMKEEE ENLQKGNLYK YSDPDSDIEG DFRKLFPDYE
EVFDSDVTTS KNNNSFDTLY YDITQAYIDY YTHNKTNDLS SLVNEGNELN KTLSQYRSEI
MKDKINSSVL SSLITSMASI YKKYNSNDGP FDFYHDSSPS EAKKSIVIIS RVHASVSKFL
EQWPEHATLQ NITKASDEFL MHPLDSPVAK LLQKVEQIYT LMAEWQKYAS SQVSLKENFD
ELTTLIVNWR RLELSTWNTI FLSEEKALEK GIGKWWFHLF ESIIVPILAD NEQAEFSPTK
LLSALNVFMS KTTYGEFVPR LNLLKAFKNH VSQLLGESNV YHALSNFILF YEQFVPLVAD
AIAATRKTLE KNISEVILLA SWKDVNIDAL KQSARKSHNS LYKVVRKYRA ILATEVSPLI
EAGIAVETKS EVNMKSLSRI GGIVVDNDLE ICSAVSSWKE RPARLQNLDL VNRNMGVYIN
RITDESLPNL MDYAKELFEE MERLRKETPK ELKEANKKLI TALKTQKKKL LSDTLKELRR
IGLKTALRAD IVKGQGGVNV IVGNSSSFEG TELESSDVYF YRILDILPKL RASISNVAED
VPQVDADKGL AAAENLVNTI VATRDPLRVF SEYIPKLAKV YKSLEIISQS RSENIDLARA
SEYSSIEMNL HGIKYVLHWL PKLIDFALLT AQAVDTFEPG TQSKLFYDIK LELSSLKNEF
DSIDPYVRTS ATSMFIRKFA EEYIGFSKRL NSWKLQYKNA GFIAEVVLSW MNSVNFSGLV
QSSTSLNSLG SLENVEISLR KLSNSIIVAV QRVTECQEGE ITEENDNWLV LTQTRIMKYI
KSLHSKRIHD ELAVCLDTIS SVEQNTETSK VTSALVSFTL PLINHYFRLS SNVFETARDN
YYNTSRATFM MTVALYNLAT KGFCSPEPPT EQKEDNNLHD GTGLGDGEGA TNNSNDVEDD
EDLSEQAQQP NEENNDDDNE EENDDAVDIE GDMAGNLENA SDQEDDENDG DEDDQEDLDE
EIDDLDDLDP NAIDEKMWDE EVKEDKNEKE SDEVPENANK DDDNMEANED EDSEQSKDKE
KKQSEQGEED KESEEKEKEE ENGTDEEGEE NDVGEQEDEV NHQDNEQVED HVPQTEALDL
PDDINLDDDE EADGDDKEEE FDDKMDVDDE EEEEAQGKEE ENQSDAKEDG ETEEGVDEEG
EEDEEGEDHD QENGQENDNN EIGEEPQEEN ENDGEQSDEE TLGNTDEKEE ENEHGNDAMD
QDAEGVDGAN QDIPNEDIDA ESAVKQEAGD KGEGADNQVS EENDNIGSTG NASSDPNQQD
KEEDNSVKDD ARDMAKESLK QLGDTLKEFH RRRQEIKEAA KEEKEKVEEK ANERPDEFQH
VQGENTDFDT QALGAADKDQ VQSLDEDMAI DDDIEEDREE KPEIKQEAEE EVGEEDMMDV
DEDLDEVKNN AEDDNEGKTK GAFIGERDHK NDRDGEISIN NEINIDDDQE SEDEENNLVP
DAIDGEEIPT IEIDEARQLW KHSDLATQEL ASGLCEQLRL ILEPTLATKL RGDFKTGKRL
NMKRIIPYIA SEFRKDKIWL RRTKPSKRQY QIMIAVDDSK SMSESKSTQL AFHSIALVAK
ALTQLESGGL SIVRFGEDVK VVHPFDKPFN GQESGARIFQ WFDFQQTKTD IKQLCNKSLK
IFEDARSSSN SDLWQLQIII SDGVCEDHDT IERLVRRARE EKIMLVFVVI DGINSNESIL
DMSQVQYEAD PKTGAMNLKV VKYLDSFPFE FFVVVRNINE LPEMLALILR QYFTEVASI