Gene PHATRDRAFT_48676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48676 
Symbol 
ID7194909 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp564451 
End bp580072 
Gene Length15622 bp 
Protein Length4825 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183246 
Protein GI219125980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCAGAGCAG CACCTCGTTG CATAGTTAGA AATTGAATAT ACAAAAAGGG ATAAAGGAAA 
CAGCACCGAT ACAAAGGTAT CGTTGACGAG GAAGGACTAG TGACAAATTC CAGCCGAGTG
GCCCTTCATC ATAAACACAA TGGGAAGATC GTTCATGATT ATCGTACCAA CATGTGTGCT
GGTAGGCGTC GTTGGCAGCC TCCTGCTACT GTCCGAAACT GAAGCCTTGG CAGGGCCCGC
CTCATTCCGC ATCAACTTGC TGACAACTGC TTCCCCACTG CGCATGGCTC CGGAAAATAG
CCAGGATTTC CCTTCCTCCG TTTCGCTCAA GACACCACGC CCACCGAAGC GTGCACCGTC
GATTGATCGT CTCCGATCGC TGCACGAAGA ATACGAGCCC TTCTTTCTAC GCAATGGTGG
GACTGCGCTT TTGGACGAAG AAACGGACGA GGAGTCTCGT CTTTCACCCC GTCAAGGTTT
CTTACAGTCA CGCTACGACC ATTGGCAAGA GTGCACCGTC AAGGCACTGA AAGCCGAGCT
TTCCGTTCGT AAACTACAGG TATCTGGTAA AAAGGCCGCC CTTGTGGAGC GCCTCGCCTT
AGACGACCTC GACAATACAC CGGATCGTCT AGTCCAAAAG GCCGCCAAAA CGCTGGCTCA
AGAAGTCTTC GTCCATCCCT TCACTACTAC CGAAGTAGAA AAGGGTGTAC CGGCCGACGC
TATCAAAGGC GCCGCGTTAA CGGCCGCAAC ACTCAGCTAT TTGGCCGGAA ACAGCATTGT
TCTTTCCGGG GCGGCGGCTC TCGGCGCCGC CTATCTCGCT ATTTCACCCG GCAGCGCCGG
AGATGCCGTC CGAGCCATTG GTACCTCGGC CTGGTCTTCC ACCGAAGTCT TTGTCGACGT
GGTGAAAAAG ATTGGCCCCG AACACATTGG GGAAACCACC GTGGGATTGC TCCACCGCCT
TTCCGCGGCG GCGCAACAAA CCCAATTTCT TCTGCAACAA AAATCTTACC AGAGCGGAAG
CAGTGCCAGC AACAACGCGA TCGCACGGGA CAAGGCGGAC GCGATCGACG TGACGATTAC
TAACGATACT GGTTCAACCG AGAGTGACGA GGCTTCGCCC ACATTTGCTT TTGCCGAGAA
AGACCAAATT GTTCCGAAAG CGGACACACC GGTGGTTGCT CCCGCAAAGA CAAAGGACGA
CCGCGTCAAT CGCGCGTTGC TATCGTACCG TATCGAGTTG GAACAGACCG CGTCGCAAAA
ACGACTCAAG CAGCGAAAGG AACAAGCTTC CCGGGGATTG CTAGCGGCGC GATTGTCGTT
GGAGACAGCC CTTAAGGAAC GTATTATTGT AGAGCAAGCA CGTCTTGCGG AGGAAGCTCG
GTTGGCGGAA GAAGCTCGGC TCGCAGAAGA AGTCAAACTT GCGGAGGAAG CTCGGCTTGC
GGAAGAAGCA CGCATCGCCG AAGAAACCCG GCGAATGGCA GAAGCCAAAG TTGCGGAAGA
AGCACGTATC GCGGAAGAAG CTCGTCTAGC GGAGGAAGCC GAACAATCCC GACTGGCAGA
AGAAGCACGT GTCGCTAAAA TCAAGGCCAG GGTCGCTGCT CAGGAAGAAG AATCTCGACT
GGCCCGAGAA GCCCAGGTAA CGGCCGAAGC TCAACGGGTC GCGCTCGAAG CACAGCAAGC
CACAGTGCTG GAGATACCTG AGGCCGAAAC ACCGACGGAA GCCGTGTTTG AAACCGAAGC
CTCTTCAGTC TTTGACGAAG TTGGTTTTTC CGAGGAAGAT TGGGCAGCTT CAATTCTTGC
CGCTCAGAAA AGCATCGATG GTACTATTGT TGGCTCGGAC GATGAGGAGC AAGACACTGA
TGAGACTGAA TCGAAAGCGA GTTGGGAGGC CGCCAAGCTA TTGGCTGAAG AACTTAGTCC
AAGTGAACGA GAGGACCTGG GCAAAGCGGC GCGCGAAGCG GTCGAAGCTA TGGAACTAAA
CATGAATGCC AAGATCCAAG AGAAAGCCGT CGAGCGCGAA ACGTGGGCCC AGGAAGTTGT
TGAGGACGAA GGAGCTGAGG ATGACGACGA AAACAACCTG GATATGTTCT TCGACAACGA
GGGTTTCGAT ATGGAAGCGT TGGCTCAGGC AGCACGCCAG GCCGTGGAAC GATACGACGC
TGAATCTACT GGCGAGACGG AAAGAGAATC TTGGTCGGAA TCATCTTTGC GAGACTGGGC
CAGTTACAGA GTTGCCGATC TACGAAACGA ATTGAGCACG CGTGGATTGC CTGCTATTGG
CAAAAAGATG GAACTAGTGG CCGCCTTGGA GGCGGCAGAT TTGGCTTTGT CGAACGGCGA
AATTTCTTCG TCAGCGGGTG TCCAAAGTGA AGGGTCTGGT CCAGTCATGG CAGAAGAGAA
GGAATTGCTC GAGGTGGAAG ACGAGTATGG CATAATGGAG TTCGAAGACG ATACAGCGTT
CACGTTCGAA GAGGATGACG ACGACGACGA AAACTTGGAT GATATCCTGC CTCCCATGGA
GGACTTGGCT GCTTTGGCCG CTGCTGCCCG CGCCGCTGTA CGAGACCAGG AAGATCTGTT
CAACACAGAT GTTATTCATG GGTCTACCAC GGATTGGTCT CAATTTAAAA TGGTCGATTT
GCGTAACGAG CTTACGATGC GTGGTTTGCC CACGGTTGGA AAGAAGACCG ACTTGATTGC
CGCACTCGCC CAGTCTGATC TTGACCAAGA GTCCGCCGCT GTAATGGACG ATGACGAAGA
AGAGGAAGAA TACGGCTCGT TCGAATACGA TGCAACAACG TTGGTTGGAG CGGCGGACGA
CAAGGAAGCA GACTTGGATA GTTTATTTGG TGGCAGAGGG TCGGATCTGG AGACTCTGGC
TGCCACTGCC GAGGCTGTGC TTGAAATGGA AAAGCCAGTC ACGTCTCTAG GTCGGGATTG
GTCTAAATTG ACGGTTGCCC AACTGAGGAC TGAGTTGGAC AAGAGAGGAC TACCAACGGT
TGGAAAGAAG GCGGATCTCG TGGTCGCACT GGAATCAGCG GACCGTGAGC TAGATGGGAA
CGCGGAGGAA GAGAAAGACG TGGACAACTT GTCCAGTAAC GAACACGTGT TTACTGTAAA
CCATGACTAC GACGATGAGC TTTCCGAAGA CGATTTGTTG AATGATTTGC TCCATGCGAA
CGGTGCTGAT AGGGAGGCGT TGGCGGCGGC GGCTCGTGCC TCTGTGGATC GCGAAGGCGT
TCTGCCGGAA CCCAGCACCG ACTGGTCGCG ACTGAGTCCA ACAGAACTGC GCATCGAGTT
GGACAACCGC GGATTACCCA CGGTGGGGAG AAAAGGGGAT CTGGTGGCGT CCTTACAGGC
CTCGGATCGT GACTTGGAAC GGGAGATTGC GCAACTGGAC CGCGAGGACC GCGTGGGTGG
TCTGGGAGAT TTGGACATGG CCGCAGTGGC GCGGGCTGCG CGCGAGGCGG TGAAGCGATT
CGAGTCGGTG GAAGAGCCGA GCGACGAAGA TCTCTTGGAG ATTGAGAAGG AACCGCTGCT
TTCGTCCGCC ACAGACTACG GCAGCCTGAC TCTGGCGGAA CTAAAGGACG AACTCCGACA
ACGAGGGCTA CCGTTGAGTG GAAACAAGGC CGATTTGATT GCCAAGTTGA CGGCTTCCGA
CCAAGTGTAG ACTGCGTGTT CTATCTATAG TATATAGGTT TCTCTGGTCG CGCATAGGGA
AAGTAACGAA GGTGTGTGCG TCACCTCGAC CAAAAAGGTC TTTTGGACTG TGAGAAATCG
GTAGAAAACC TTCGGGGAGT CTCTCCCGGT CTCTGTTTCG GGAACGGATC CATCCGGGAC
TCGCCGACGG GCGCGGACAC ACGCCGACCG GTTTGTCGTC CCGTGCGGCG CACCCTCCTA
GTGAGACGTG ATTCTGGGCG CGAACGTCGC GCCGACGGAT TTGAATGGAT ACTCAATGCA
TCATCCCCGG GGTGTGTGTC CCGACAATTT CGTACCGTAA ATCCTTGGTA AGGAATTGCG
TGCCGACGGC CGCGCCCGTT TGCCTTTCCC GCCACTCCAA AATGTGTATG GTTCGTGTGG
GTGTGTAGGT TTGTCGTTCC CGCTAGGTTT GGTCGCACGG AGAGGGGTGT GTGTGCAGGT
GCATTGGGTG AGAATTCACT CGTCGTCCCG AAAGAGATCT TCCTTTTCCC AATCCAAGCT
CCGTGACGAG TCCGACGTTG TTCGGGTGTA CAGTGAGAGA ATCGTACCTT CGGGATGAAA
CGGTGCGTCG TGACGAATCT GGCACCCTTG TGGTGTTGCT ATTGCTGGTA CTGTTGTTGG
TGTGGCTACT ACTACTACTA CTACAACGGT TGTCGGTCGT ACGAGAATGG TTCCCACACT
GTCTCGTTCG GAACAGTCAC CGCCTTGGTG CCGGAATCCC TGTCGTCGAT ACGATCACGA
TCCCACCGTC CGTCGCGTCG GTACCACGCG ATCCCACCCG ACGATCCCGG ATGGACTCGA
GCACCTCCCG ACGCCAACCC TTGGTGGGAA CGTTCCTCCG TGGCGGAATC TCGGTCGACT
CCCGCCGACC CCCTCACCAC AACGGTCCCG GGTGCGTGGA CAATCGTCGC ACCCCAGACC
TTTGACGATC GCCCGTCCGA TGCGACGACG TGGGTTGCCA CCGACACGGT GCGCCGCACG
GGCGTAACAC CACGGCTCCC CACCACCCCA CTCCCGCCGG CGGCAGTCTC CGCCAATCCT
CCCTTGGACG ACGACGACGT TGCTTCCGTG GACGCGTCCC CGCAAACTCC CGTGCGGGCT
GAAACAGAAC CAAGTCCCGT TGTTCCAAGT CGGCAACGGC CAATCAACAA CAACAACAAC
AACGAGGCGG CACTCGTGCT CAAGCCGGTT TCCATGGAGA ATGCACTGGC ACGTCCCAAG
TCACCGGAAC AAGAACTGAA AGGTAAGAGT ATAGCGCCAG AACAAGTTCA AAAGGTGACG
GAGCGTCTGT CGGAAGAGAT GGAGTCGCAG AGTATTGGTG ATAGTCTCCG CCGAATCCAC
AGTTTGTCGG AAAAAGTTGG TAGTCGTGCC GCGCAAGCCG CCAAGGATCT ACGAGAATCT
CCGCAGCTCC CCGCATTGGC CAGTCGTCTG TCGGACGCAT GGACCAATGT GGCAAAGAGT
AGACAGGAGG TCTGGGACCA AAAGATGACG AACCGGCAGG AAGCTGTCCG TGCGCCAGTA
TCCAGCGAGG ATGGTAGTGT TACAGAGGTA CCCAGCAACA ATGACGACGA GCGTCCATTT
TTCCTGTTAC CCAGTATCGA CGTTTCTTCT CTGTACGCAT CCAAACAACA AAATATGGAC
CATGCTATGC CCGACGGATC CGTGGAGGAG GCTAGTGGTC TAGCCAAGAG CGGTCGTTTT
CCGAAGCAGA GCCACACAGA AGCGGCCCTC GTGCCCAGTC TGGTCTCGGC TGAGAATACT
GAGAATACAC TGACAGTGAG CTTAAAATCC CGGTATTCGG GGGAAGTGAT TGCAGGGTTG
GGTAAAATGC AAGAACAAGT TCAAAAGGTG ACGGAGCGTC TGTCGGAAGA GATGGAGTCG
CAGAGTATTG GTGATAGTCT CCGCCAAATT CACAGCGTGT CGGAAAAAGT TGGTAGTCGT
GCCGCGCAAA TCGCCAAGGA TCTACGAGAG TCCCCGCAGC TCCCCGCATT GGCCAGTCGT
CTGTCGGACG CATGGACCAA TGTGGCAAAG AGTAGACAGG AGGTCTGGGA CCAAAAGATG
ACGAACCGGC AGGAAGCTGT CCGTGCGCCA GTATCCAGCG AGGATGGTAG TGTTACGGAG
GTACCCAGCA ACAATGACGA CGAGCGCCCA TTTTTCCTGG TCCCGAGTAT CGACGTTTCT
TCTCTGTACG CATCCAAACA ACAAGATATG GACCATGCTG TGCCCGACGG ATCCGTGGAG
GAGGCTAGTC TAGCCAAGAG CGATCGCTCT GCGAAGCAGA GCCACACAGA AGCGGCCCTC
GTGCCCAGTC TGGTCTCGGC TGAGAATACT GAGAATACAC TGACAGTGAG CTTAAAATCC
CGGTATTCGG GGGAAGTGAT TGCAGGGTTG GGTAAAATGC AAGAACAAGT TCAAAAGGTG
ACGGAGCGTC TGTCGGAAGA GATGGAGTCG CAGAGTATTG GTGATAGTCT CCGCCAAATT
CACAGCGTGT CGGAAAAAGT TGGTAGTCGT GCCGCGCAAA TCGCCAAGGA TCTACGAGAG
TCCCCGCAGC TCCCCGCATT GGCCAGTCGT CTGTCGGACG CATGGACCAA TGTGGCAAAG
AGTAGACAGG AAGTGTGGGA CCAAAAGATG ACGAACCGGC AGGAAGCTGT CCGTGCGCCA
GTATCCAGCG AGGATGGTAG TGTTACGGAG GTACCCAGCA ACAATGACGA CGAGCGCCCA
TTTTTCCTGG TCCCGAGTAT CGACGTTTCT TCTCTGTATG CATCCAAACA ACAAAATATG
GACCATGCTA TGCCCGACGG ATCCGTGGAG GAGGCTAGTC TAGCCAAGAG CGATCGCTCT
GCGAAGCAGA GCCACACAGA AGCGGCCCTC GTGCCCAGTC TGGTCTCGGC TGAGAATACT
GAGAATACAC TGACAGTGAG CGCAAAATCC CGGTATTCGG GGGAAGTGAT TGCAGGGTTG
GGTAAAATGC AAGAACAAGT TCAAAAGGTG ACGGAGCGTC TGTCGGAAGA GATGGAGTCG
CAGAGTATTG GTGATAGTCT CCGCCGAATC CACAGCGTGT CGGAAAAAGT TGGTAGTCGT
GCCGCGCAAG CCGCCAAGGA TCTACGAGAA TCTCCGCAGC TCCCCGCCTT GGCCAGTCGT
CTGTCGGACG CATGGACCAA TGTGGCAAAG AGTAGACAGG AAGTGTGGGA CCAAAAGATG
ACGAACCGGC AGGAAGCTGT CCGTGCGCCA GTATCCAGCG AGGATGGTAG TGTTACGGAG
GTACCCAGCA ACAATGACGA CGAGCGCCCA TTTTTCCTGG TCCCGAGTAT CGACGTTTCT
TCTCTGTACG CATCCAAACA ACAAGATATG GACCATGCTA TGCCCGACGG ATCCGTGGAG
GAGGCTAGTC TAGCCAAGAG CGATCGCTCT CCGAAACAGA GCCACACAGA AGCGGCCCTC
GTGCCCAGTC TGGTCTCGGC TGAGAATACT GAGAATACAC TGACAGTGAG CTTAAAATCC
CGGTATTCGG GGGAAGTGAT TGCAGGGTTG GGTAAAATGC AAGAACAAGT TCAAAAGGTG
ACGGAGCGTC TGTCGGAAGA GATGGAGTCG CAGAGTATTG GTGATAGTCT CCGCCAAATT
CACAGCGTGT CGGAGGAAGT TGGTAGTCGA GCAGCGCAAG CCGCTAAGGA TCTACGAGAA
TCCCCGCAGC TCCCTGCCTT GGCCAGTCGT CTGTCGGACG CATGGACCAA TGTTGCAAAG
AGTGGGCAGG AATTGTGGGG CCAAAAGATG ACGAATCGGC AGCGAGCTGT CGGTGCTCAA
GCATCGAGCG AGGATAGAAT TATTCCGGGT TTGCTTAACA GCGGCAGCGA CGAGCGCCCT
TTTTTCCTTG TCCCAAGCAT CGATGTTTCC CCCTTGTATG CCCGCCACCA ACGTTCAGTC
CTAGATATAT CCGCCGGAGT AGCGGTTGAG CCCGTTTCCA CTAAGAATGG ATGCCGCGAA
GAGGATGCCA GCAGCGAGGC TGGCCTACAG TTAGAACAGC TTTCCGATGA AATCGCGCAC
CGATCAACTA GTGGCAAGCT GCAGTTTCGG AGCTACCGAC AGCGCGCTTT GAATTTTTCC
AACCGATTTG ACATGCTCTC TGATCAAGTT GGATATCGTG AGGCGCAAGT TTCAAAAAGT
TTGCAACAAT CGCTTTACTC TCAGACGAAG GCCAGCGGTC TCTCTGGGGC CTTTCCTAGC
ACCACGCAAG GAGATGATCG AAGCGTGAGT GAGCCGGAGG AGATGATCAT CTTGGAGTCT
TTGAACGAAA GTCAAGGAGG TGCTGGGTCG CCTGACGACG ATGACGGCGA GGCAATCGCA
GATGACATTC CGGGTATCAA CATCTCGTCC ATATTGGTAT CCGAGCCGAG TGCGATACAA
GTCCAACAAT TGTCTAAGTT TCAGACGAAG TCCAGCGGGA TCTGTGAGGT TCTCACCAAT
GCTACGCAAG GAGGGGATAG AAATGTGACT GATCCGCAGG GGCCGTCATT TTTGGACGAG
GACCGAGGAG GTGCTGGGTC GCCGAACTAT GATGACGACG AACCAAACAT AGATGAGATT
CCGAGTATCA ACGTCTCGTC CATATTCGAA TCTGAGTCGA GTGCGATACG AGAAGTTTTA
CGACAATCGC CTGAAAACCA GCCGGAATGC ATCTCTACCC GTAAGGTCTC CACCAACACT
AGGAAAGATT GGGTACGAAG TGTGACTGAT CCGCAGAGGC CGCTGGTCTC AGTCTCCTGG
AACGAGAACC TAGAAGGTGC TGGGTTACCG GACGACGAAA CAACCATGGA TGATATTCTG
AGTATAAACG TCTCATCTGT ATATGTCTCC GAACCGAATG CAGTCGGAGA CGTGTCCGTC
GAACTGACGG ATGGATTATT GCTGCCCAAG CAGGTCCGAC AGAGGCCGAG CTTAAATGAT
TCCGCACCGC TCTCGTGGCA GGGCACTTCC CGAGACGCAA AGGTTAGCAC AACAACGCTC
TCCAAATGGA AGTGGGGTGC CTTACTGCGG GTCGGTAATT TTCGTTGTCA TCTCGAAGCC
GTCACGGGTC GACTGGCTGA AAGTTACGAG CGCACCCAGA AATTCTTGAT CGATCTCGTT
TCATCTGTGG AATACGACAA GGCCTGGACC CACGCGTTCT TGCAAAAAAG TCAAGGACTT
ACTAGGGAGA ATGTGACACT CATGTCCGAT GATAAATCTG ATGCTGTTGG AGAGTCATTG
GGAAACGAAA AGAGAGCCAG TTCAAATTCG GCTAGCGTGT GGAAGGCCGA AGCTTCGACG
CTGGTTTCCG TGAATAGGTC GATTCTTGAG GACGAAAAAT CCGCAGCCTC AATAAGCCAG
GAGGATAGTG GGGCGCAAGT CGCTTTTCCG GTTCGGCGCA ATACTCAACG CAACCAATAT
TTCGAGGCGC AACCAATTGC TCAGCGGTTG TTCGAACAAA TCCTGCCTTC AGTTGCCGGA
GAGCCGCTAC GGAAGGACGT CCCGGGCTAT ATCATTCAAA GCGCTGTTTT TTCTACATTT
ACCTGGTCTT TCGTCCTTCA ACGTAATGAT CTGTGGACGT CCATGTGGTT GGCTACAGGG
GCATCATACC TTTCTGTTAC CACAGGCTGG CAGGGCGATC TAGTTCGTGG CTGGAGTATT
GCCGTGTACG AGCTCATCGA CTTCGGACGA AGTGATTTAG CCGTTTGGGC ACGGGACTCG
GCTAACGAGT TTGCTGCACT CGCTCCATTC CAGCGCCGCA TCCCAACCCC TCCGAAAAAT
CCTCCACCAC GTGTGTTGCT CGTTTGGGAC CACTCTGAGC TCTTCTTTCT TGTCCCAAAA
TCGGTGAAGG CCGAACGACG CACCCGCTCG TTGATGGAAT ACCGTTTTGA GTTGGAAGCT
GCTGACCGAG AACGCCGGCG CGAACGCGTT GCGGAACGAA ACGCTCGCTG TTTGCTGGCT
GCCCGCCTCG AGTTGCGAGC CCGAAGGCAG ACTATCAAGG CTCCAATCCT ACTGCCCGAG
GCTCAATCAT TCACGACAGT CCCTAAGACA CTACCAGCGA TAGATTTGAT GGAACAGCTT
TTCTTTTTGG TACCCAAATC GGTGAAGGCT GAACAACGCT CCCGCGCACT ATTAGAATTC
CGGATCAGGT TGGAAAGCGA AGAACGAAAA AGGCGGCGCA AGTCCGTTGC GGAACGTAAC
CCCCGTAGCT TGCTTGTTGC CCGTCTCCAG CAGCGAGAGT GGCAGGAGGC ACTGTCTCCA
CTAGACGACT TGCCATCGTT GCCCAATTCC ACAGCCCATA TTGAATCAGA GGTCGGAGAG
ATTGTGGTCG TCCAGAAGAC CCTCGGCTCG GAGCGAAAGA AGCGGGTCAG AGAGGTGAAT
AATACAGCCA AGCTTTTTGA GTACTATCGA GAGCAACTAA AGGCACTTGA AGGCAGGGTG
CTAGCTGGGA TACGTCAACA ATCTATTTGC AAACAAGAAC GGACCCAGCG CGCTCTACTC
GAGAGTCGTC TCAAGTTTCA TGCGACACAA CGGAAAATGG CAGGGTCAGG ACGTTCACAA
CGACGGCTGA TACAGGAACA GCAGGCTCGC GCCGACCAAG CACGGGCGGC CAACATGGCA
TGGTGGGCGG AAGAGAATCG TGCTTGGCAA AATGCCGGAC CAGAATGGCA CGCTCAGTTG
GCCGCCGATG CACGGCAGGA CCGTCGAATA CAGGAGGCAC AGGCTGTCGA TAGAGTAAGG
CTAACGGAGA AAGTAACCAG TCAATTCGAA TCAAATCGAA TCGATCAAGA GAAACGCCTG
GCGCGTGAAG CTTGCAATAC AGACGAATGG ACGGTTACGG ACGAGGCCCG CGTTGCTGTA
GCTGACAGAC CCGCACGCGA AGACCGCGCA GCGAAAAGGA AGAAACAAAG CAAAGCACAG
CTGAACACTA AAATTGCAGC AGAAGCGCAG CGTGTAAAGG AAACGAGATC AGAGCAAAAG
GCGAAGCAAC AAGAGGCAGA GCTGCCACAG AAAATCAGTC GGGAACTTCC ATCCCAGCAA
AATTCGAAGC AGCAGAATGC GCAGAAGGCA GAAGAAGCAA AACGTGCGAG GAAACTTGTA
TCTGAGCAAA AGGGAAAGCA GCAAGAAGCG CAGCTGGCCC ACAAGGCAGA AGAAGCAAAG
CGTGTGCAGG AAATTCTCTC GGGGCAAAAG GCCAAGCAAC ATAAACTCGA GCTGGTCGAG
AAAGCGGTAG AAGCAAAGCA CGTGCACGAT ATTCGGTCGG AGCAACAGGT CAAGCAACGT
AAATTCGAGC TGGCCCAAAA AGCGGCAGAA GAGCAAGACC AGCGAAATCT GGACAAGAGC
GGGATGGAGG ACAAGCGTAG AGAGAAACTT TCATCCGAGC AAAAGGAAAA GCAACGGAAA
GGACAGAAAG CAGAAAAAGC AAAGCTTGTG CAGGATGTTC TATCCGAGCA AGAGGCAAAG
CAAGAAGCGT GGCTGGCCCA GAAAGCAGAC GAAGCAAAGC GTGTGCAGGA AATTTTATCG
GGGCAAAAGG CCAAGCAACG TAAACTCGAG CTGGCCCAAA AAGTGGCAGA AGCACAGCGT
GAGCAAGACA TTAGGTCGGA GCAAAAGGCC AAACAACGCC AGTTGGAGCT CGACAAGAGA
GCGATAGAGG CCAAGCACGT ACGGGATATT ATATCCAATC AAAAGACAAA GCAACGCCAG
CTAGAGCTGG CACAGAAGGA GGCGGATGCA AGACGAGAGC AAGGCATTCG TTCTGAACAA
AAGGCGAAAC AACTCCAGCA GGAATTGGAC CAGAAAGCGG CAGACGCAAA GCGTGTGCAG
GCAGCTCTGT CCGAACAGAA GACGAAGCAA CGGGAAGCGC AGCGGGGAGG CGCTTCTCGC
CAGCGGAAGA AGCAACAAAG AGGTGAAGTA AAGCGAATAG AGGAGAACAG ATCCATTGTT
TGGGCAACAC TATTGGAAAA GGCTCGCGCT GGACAGGAGG GAACTCGTAA AAATGCGGAG
ACGAAGCTGA AGGTTGAAGC TACCAGGGTG CCACCGGAAG AAGTCTTGAC CGATAACCAA
CGCAAGATCC TTAAAGCCAA AAAGGTGGAG GAGGAACGGA TTGTAAAGCA GGCTTTATTG
GCGGAAGAGA AGAGGATTGC CGAAAGGGAG CGCAAAAAGA AGCAAGCGCG ATTGATCGAA
GAGCAACGGA ACGCAGAACT CACCGCGGAA AGGGAGCGCA AAGCGGAGCA AGCACGCCTG
GCAGAGGAGA AGCGCAACGA CAAAGTCGCT GCCGAACAAC AGCTATTGTT GGAACGGGCC
GCCAAAGCTG AAGAGCTGCG CCGGAAAGAA GAACACGAAG CCCAGGCTAC CCAAGCGGCC
GAAAGGCGGA AACGCGAAGA AGTGCGTGTC GATAATGTGA AGGCGCATGT CGCTGCCGGG
CAAGCGAAGG AAGAGTCTCT ACGTGTCGCG AGGAAACAAC GTATTGCAAC GGAACATGCC
GAAGAGCAAA GGGTGCTGAT TGCGGCTCTG GAGGAAGAAA CGCGCTTGCA AGAGGAAGCG
AAAAGGAAGC GCATTGCTGA CGAAGAGGAG AGGAGACAGC TCGTACAAGC ACCCCGCCAG
CAGCGCATGG ACGGGTCGGG TGAGAAACCA CTGGAGGCTG ACCACCCTCG CGTAATATCG
GACACATTCG CGGTGCAATC AGTTCCTAGC AAAGCAGACA AAGATTCTTT GGCCGCAGAA
CTGCCTGCGG ACGAGGTGAT AGATCTGGTG GCCGGTAGTG TGCCAGCGGG AGTCAAGGGC
GTCAATCCGG ATGGTACAGA AGAAGCGTCT CGAGATAATG AGAACGGAAT TGTCCGTGGC
TTTGAGGAAG ACAGTCGCCA AGCGATGCAT CACTTCAAAG ACAACGAGAC TCGCTTGGCC
TTGGAGGAAC CGGAATCTAC CAAAAGCCGT GTCGTGTCGG AAGACGCGTC GCCGGCCGTG
TGGGTTCCGA AAGCGACCCG ATCCCAGGAC GAGACGTTTG TGGACATTTT CACGTACTCC
AAACCGCGCG AGGGAGGGCG AAACTCGGCC ATTCTCGAAA TGAAGGACAC TCGGGCAGCG
CCCCAACGCG CCCGCCAGGC GGCGCAATTG TTCGTGACTC TGGCTCTGTC TCGCCCCGCC
ATTTTCAGGG CGGCCGCCGC CGCGCGTCAA TCGTCGGGTC AAGCCTAGAG AATGCGATGG
ACAACCGTGC GGGGCAAGTA TCTATATGTA AGATATGTGT GTGTATATTA TTCTTAGTAT
TGGCGTTGGT GAATGTATTG GATGCGGGTT ATTTTTCTTT CGTACCTACC GCTGGACCAG
TCTACTTTAG TTCGATGCGT GGTGCCGTGG GTAGGTAAGT AACTTGGGTT CCGGTAGGGT
TTTGATTCGT CGGAGCACGA TCGGGTTGTT TGGACATGTT TGGAAGGCGC GTTCGCGACG
AGTGACACTT TTCCGCGGTT TGAATCGAGA TCTTTCAATA GTGATAATGA CAAAAAGGGA
TACACAACGC CAAAGGAATG TGGGAAAGAC ACACCTTTTC TCTACTTCTG CTCGTTTTCG
AAGACCATCC GGCATGCATG AAACACAGGG GAGGGGGATT GGAGCTTTCG TGTCTCGGTC
TCGTCGGTCC TACTAGCTTC CTGTAACCAC CTACCTATTC GCCCAACGCA TCCTTCGTCG
TCGCCTACAC CAAAATCTTC ATTGAGAGCA TATTCTCATT CTCACTCTTC CCCAGTTTCG
CTAGTCAGCA TGCGTTGGAA AGTACGTGAG GTCCTTCCCA GAGCAGCGGC TCTGCAAGCC
TGCAGTCTGT GTCTATCGAT CCTTGCTGCT CCTGCCGCGT CGTTTTCCGT CGCGACGTCC
CACCGTGCGG TGACGGCGTT GGCCTCCCGT CCTCCGCGCC TTCCGGATCC GCTGCCCTGG
TCCTCGCCTC CCCCGTGGAG TCGCTCACCG GTACACTTGG AGGAACCAGT CGCCACGTGG
GTCGATGCGG TAGCTCGACC CGTCTCAGAA CGTCGTGGTA CGTTTGCTTG TGTTTAGATG
CGTGCGTTCA TTGGTAGAAA GGTCGGTGAG CTCAACTAAC ACTCATTCTT CCGTTTGTTT
TGGTGCTATC GCTGTATCCA AAGAAGTGGA TCTCGACTGG GCAGACGACA CTGATACAAT
CAGCGAGGAC GATAGCGTCA AGACGGACCG CCGTCCTTTG ACCGATCAGG CCACTTGGGT
GGATCCCGAC GCCGGTCTCT GGTTCGTTCG AGACGAACCG AAATCCATCC CCACCTCCAA
AAATCTCCCC CGGTGGGACC CGAGCCTCCC GACGGAAGAA TCCATGGCGG GTGCCTTCCT
AGTGGTGACA TCACTCGGCG CATCAGTGGG TGGGATCGAT CTCCCGACGT CCTGTGTACT
GGGCTGCCTC GCGGCCTACC TGACCACGCG CCCCGGGGCG GCGGGGACGA TGGCTCGACG
CGGTGGCACA CTCTGCTATT ATCTTACGGC CCAAGCCATA CAGACGATTC AGGATCTGCA
CGCCTCAGGG CGCGTCCAAA CGACGATCCG TACATTGGCC GAATCCGTGC GGACGTCGGG
AGAGTCCGTT ACGATGGAAT CCAGAGCCGA TTTCGATACC GACACTGATA CCAACGGTAC
GGTCGCGTCC AATCCAGAAA GCAGCGACGA CACCGTGGTC TCATCATTGC ACCAGGACGA
CGACAAAACG GAATTGGTCT TTCCTGGCAA TGAGAGTAGC TACGGTGACA CAATCTCGTC
GGACCAGGAC ATGAAAGATG AGGACACTGT CTCTTTCCAT GAGGACGTCG ATACCAACGA
CAAACTTGCA TCATCTAACC AAGACAGCGG CAACGTAGCC TTGGTCTCCT CCTCGTACCA
GGACGACGAA AAGACGGAGC TGGTCTTGCC TGATAAGGAG AGTGACTACG GCGACACGAT
CTCGTCGGAC CTGGACATGA AAGATGACGG CACTGTCTTT TCGCATGAGG AGATTGATTC
CAACGACAAA ATTGCATCTT CTTACCAAGA TAGCGGTGAT GTGGCCTTGG TCTCGTCCCC
GCACCAGGAC AACAATAGGA AGGACCACAC GGTCTCGTCC TCGCATTTGA ACGACGAAAG
CACAGAGATG GTTTGGTCCG ACGAGCATAC TAAAGACGAC GACACGGTCT CCTCGTCCCA
GGACAATAAA GGTGACAAAA GGGTCTCCTC GTCACATATG GCGATAGATA CCGACGACAC
AATCGAGTCT TCTTCCCGAG ACAGTGGCAG TATGGCCTCG GTCCCGTCGC ACCAGGAAAA
CGACAACATC AACATGGACG ATACGATTTC GCCGTACTCC AAGCATAATG TCGATGACTC
GGACTCTATT CCTAGCTTCG ACATTTCGTC TTTGCAGCCA ATCCCCAGCA TTGATATTTC
TTCCATGTAC GCGCCTGGGG CAAGCCAAAC CCTGGACCAA CAACAGCTTG GTCTGCCCCG
ACCGTCCAGC CCCGAAGACC TCGATCTGAT CGCGGTGAGC GCACCGACCC GTACCCCGAC
TCCCAGGACC GACAGCGAAG GGACACATCC GGTAAGGGAA GGTCCGGTGT TTGACAACAC
TCCAAACCGG CAATTGCTCT CTGAATACCA GTCGGTGACA GACCAAATCG ATCCTAGTCC
GGTGGATGAC GTACGGAGAG AGAAGATTCC AAATGCTGAA ACAGACGCCT TGCTCTCTGT
TGATCGCGAG GTACCTCAGG CCAAAAGCAA GACCTCGGTT CCTTTAGAGA TGGCGGATGA
AATGATGGCG GAACCCAGCA ACAATATTCC CCGTGAGCCC GTCTGGACAA TCTGGCCCGT
GGACCAACCG TTCCGGTAGC TCGTTCCATT GTAGAATCCC ATAGTGGTTG CTAAGGACAC
AAATATGTAA AACATGTAGG TT
 
Protein sequence
MGRSFMIIVP TCVLVGVVGS LLLLSETEAL AGPASFRINL LTTASPLRMA PENSQDFPSS 
VSLKTPRPPK RAPSIDRLRS LHEEYEPFFL RNGGTALLDE ETDEESRLSP RQGFLQSRYD
HWQECTVKAL KAELSVRKLQ VSGKKAALVE RLALDDLDNT PDRLVQKAAK TLAQEVFVHP
FTTTEVEKGV PADAIKGAAL TAATLSYLAG NSIVLSGAAA LGAAYLAISP GSAGDAVRAI
GTSAWSSTEV FVDVVKKIGP EHIGETTVGL LHRLSAAAQQ TQFLLQQKSY QSGSSASNNA
IARDKADAID VTITNDTGST ESDEASPTFA FAEKDQIVPK ADTPVVAPAK TKDDRVNRAL
LSYRIELEQT ASQKRLKQRK EQASRGLLAA RLSLETALKE RIIVEQARLA EEARLAEEAR
LAEEVKLAEE ARLAEEARIA EETRRMAEAK VAEEARIAEE ARLAEEAEQS RLAEEARVAK
IKARVAAQEE ESRLAREAQV TAEAQRVALE AQQATVLEIP EAETPTEAVF ETEASSVFDE
VGFSEEDWAA SILAAQKSID GTIVGSDDEE QDTDETESKA SWEAAKLLAE ELSPSEREDL
GKAAREAVEA MELNMNAKIQ EKAVERETWA QEVVEDEGAE DDDENNLDMF FDNEGFDMEA
LAQAARQAVE RYDAESTGET ERESWSESSL RDWASYRVAD LRNELSTRGL PAIGKKMELV
AALEAADLAL SNGEISSSAG VQSEGSGPVM AEEKELLEVE DEYGIMEFED DTAFTFEEDD
DDDENLDDIL PPMEDLAALA AAARAAVRDQ EDLFNTDVIH GSTTDWSQFK MVDLRNELTM
RGLPTVGKKT DLIAALAQSD LDQESAAVMD DDEEEEEYGS FEYDATTLVG AADDKEADLD
SLFGGRGSDL ETLAATAEAV LEMEKPVTSL GRDWSKLTVA QLRTELDKRG LPTVGKKADL
VVALESADRE LDGNAEEEKD VDNLSSNEHV FTVNHDYDDE LSEDDLLNDL LHANGADREA
LAAAARASVD REGVLPEPST DWSRLSPTEL RIELDNRGLP TVGRKGDLVA SLQASDRDLE
REIAQLDRED RVGGLGDLDM AAVARAAREA VKRFESVEEP SDEDLLEIEK EPLLSSATDY
GSLTLAELKD ELRQRGLPLS GNKADLIAKL TASDQVSLVA HRESNEENLR GVSPGLCFGN
GSIRDSPTGA DTRRPVCRPV RRTLLVRRDS GRERRADGFE WILNASSPGC VSRQFRTVNP
WFVVPARFGR TERVTALVPE SLSSIRSRSH RPSRRYHAIP PDDPGWTRAP PDANPWWERS
SVAESRSTPA DPLTTTVPGA WTIVAPQTFD DRPSDATTWV ATDTVRRTGV TPRLPTTPLP
PAAVSANPPL DDDDVASVDA SPQTPVRAET EPSPVVPSRQ RPINNNNNNE AALVLKPVSM
ENALARPKSP EQELKGKSIA PEQVQKVTER LSEEMESQSI GDSLRRIHSL SEKVGSRAAQ
AAKDLRESPQ LPALASRLSD AWTNVAKSRQ EVWDQKMTNR QEAVRAPVSS EDGSVTEVPS
NNDDERPFFL LPSIDVSSLY ASKQQNMDHA MPDGSVEEAS GLAKSGRFPK QSHTEAALVP
SLVSAENTEN TLTVSLKSRY SGEVIAGLGK MQEQVQKVTE RLSEEMESQS IGDSLRQIHS
VSEKVGSRAA QIAKDLRESP QLPALASRLS DAWTNVAKSR QEVWDQKMTN RQEAVRAPVS
SEDGSVTEVP SNNDDERPFF LVPSIDVSSL YASKQQDMDH AVPDGSVEEA SLAKSDRSAK
QSHTEAALVP SLVSAENTEN TLTVSLKSRY SGEVIAGLGK MQEQVQKVTE RLSEEMESQS
IGDSLRQIHS VSEKVGSRAA QIAKDLRESP QLPALASRLS DAWTNVAKSR QEVWDQKMTN
RQEAVRAPVS SEDGSVTEVP SNNDDERPFF LVPSIDVSSL YASKQQNMDH AMPDGSVEEA
SLAKSDRSAK QSHTEAALVP SLVSAENTEN TLTVSAKSRY SGEVIAGLGK MQEQVQKVTE
RLSEEMESQS IGDSLRRIHS VSEKVGSRAA QAAKDLRESP QLPALASRLS DAWTNVAKSR
QEVWDQKMTN RQEAVRAPVS SEDGSVTEVP SNNDDERPFF LVPSIDVSSL YASKQQDMDH
AMPDGSVEEA SLAKSDRSPK QSHTEAALVP SLVSAENTEN TLTVSLKSRY SGEVIAGLGK
MQEQVQKVTE RLSEEMESQS IGDSLRQIHS VSEEVGSRAA QAAKDLRESP QLPALASRLS
DAWTNVAKSG QELWGQKMTN RQRAVGAQAS SEDRIIPGLL NSGSDERPFF LVPSIDVSPL
YARHQRSVLD ISAGVAVEPV STKNGCREED ASSEAGLQLE QLSDEIAHRS TSGKLQFRSY
RQRALNFSNR FDMLSDQVGY REAQVSKSLQ QSLYSQTKAS GLSGAFPSTT QGDDRSVSEP
EEMIILESLN ESQGGAGSPD DDDGEAIADD IPGINISSIL VSEPSAIQVQ QLSKFQTKSS
GICEVLTNAT QGGDRNVTDP QGPSFLDEDR GGAGSPNYDD DEPNIDEIPS INVSSIFESE
SSAIREVLRQ SPENQPECIS TRKVSTNTRK DWVRSVTDPQ RPLVSVSWNE NLEGAGLPDD
ETTMDDILSI NVSSVYVSEP NAVGDVSVEL TDGLLLPKQV RQRPSLNDSA PLSWQGTSRD
AKVSTTTLSK WKWGALLRVG NFRCHLEAVT GRLAESYERT QKFLIDLVSS VEYDKAWTHA
FLQKSQGLTR ENVTLMSDDK SDAVGESLGN EKRASSNSAS VWKAEASTLV SVNRSILEDE
KSAASISQED SGAQVAFPVR RNTQRNQYFE AQPIAQRLFE QILPSVAGEP LRKDVPGYII
QSAVFSTFTW SFVLQRNDLW TSMWLATGAS YLSVTTGWQG DLVRGWSIAV YELIDFGRSD
LAVWARDSAN EFAALAPFQR RIPTPPKNPP PRVLLVWDHS ELFFLVPKSV KAERRTRSLM
EYRFELEAAD RERRRERVAE RNARCLLAAR LELRARRQTI KAPILLPEAQ SFTTVPKTLP
AIDLMEQLFF LVPKSVKAEQ RSRALLEFRI RLESEERKRR RKSVAERNPR SLLVARLQQR
EWQEALSPLD DLPSLPNSTA HIESEVGEIV VVQKTLGSER KKRVREVNNT AKLFEYYREQ
LKALEGRVLA GIRQQSICKQ ERTQRALLES RLKFHATQRK MAGSGRSQRR LIQEQQARAD
QARAANMAWW AEENRAWQNA GPEWHAQLAA DARQDRRIQE AQAVDRVRLT EKVTSQFESN
RIDQEKRLAR EACNTDEWTV TDEARVAVAD RPAREDRAAK RKKQSKAQLN TKIAAEAQRV
KETRSEQKAK QQEAELPQKI SRELPSQQNS KQQNAQKAEE AKRARKLVSE QKGKQQEAQL
AHKAEEAKRV QEILSGQKAK QHKLELVEKA VEAKHVHDIR SEQQVKQRKF ELAQKAAEEQ
DQRNLDKSGM EDKRREKLSS EQKEKQRKGQ KAEKAKLVQD VLSEQEAKQE AWLAQKADEA
KRVQEILSGQ KAKQRKLELA QKVAEAQREQ DIRSEQKAKQ RQLELDKRAI EAKHVRDIIS
NQKTKQRQLE LAQKEADARR EQGIRSEQKA KQLQQELDQK AADAKRVQAA LSEQKTKQRE
AQRGGASRQR KKQQRGEVKR IEENRSIVWA TLLEKARAGQ EGTRKNAETK LKVEATRVPP
EEVLTDNQRK ILKAKKVEEE RIVKQALLAE EKRIAERERK KKQARLIEEQ RNAELTAERE
RKAEQARLAE EKRNDKVAAE QQLLLERAAK AEELRRKEEH EAQATQAAER RKREEVRVDN
VKAHVAAGQA KEESLRVARK QRIATEHAEE QRVLIAALEE ETRLQEEAKR KRIADEEERR
QLVQAPRQQR MDGSGEKPLE ADHPRVISDT FAVQSVPSKA DKDSLAAELP ADEVIDLVAG
SVPAGVKGVN PDGTEEASRD NENGIVRGFE EDSRQAMHHF KDNETRLALE EPESTKSRVV
SEDASPAVWV PKATRSQDET FVDIFTYSKP REGGRNSAIL EMKDTRAAPQ RARQAAQLFV
TLALSRPAIF RAAAAARQSS VLALVNVLDA GYFSFVPTAG PVYFSSMRGA VVSLVSMRWK
VREVLPRAAA LQACSLCLSI LAAPAASFSV ATSHRAVTAL ASRPPRLPDP LPWSSPPPWS
RSPVHLEEPV ATWVDAVARP VSERRERSVS STNTHSSVCF GAIAVSKEVD LDWADDTDTI
SEDDSVKTDR RPLTDQATWV DPDAGLWFVR DEPKSIPTSK NLPRWDPSLP TEESMAGAFL
VVTSLGASVG GIDLPTSCVL GCLAAYLTTR PGAAGTMARR GGTLCYYLTA QAIQTIQDLH
ASGRVQTTIR TLAESVRTSG ESVTMESRAD FDTDTDTNGT VASNPESSDD TVVSSLHQDD
DKTELVFPGN ESSYGDTISS DQDMKDEDTV SFHEDVDTND KLASSNQDSG NVALVSSSYQ
DDEKTELVLP DKESDYGDTI SSDLDMKDDG TVFSHEEIDS NDKIASSYQD SGDVALVSSP
HQDNNRKDHT VSSSHLNDES TEMVWSDEHT KDDDTVSSSQ DNKGDKRVSS SHMAIDTDDT
IESSSRDSGS MASVPSHQEN DNINMDDTIS PYSKHNVDDS DSIPSFDISS LQPIPSIDIS
SMYAPGASQT LDQQQLGLPR PSSPEDLDLI AVSAPTRTPT PRTDSEGTHP VREGPVFDNT
PNRQLLSEYQ SVTDQIDPSP VDDVRREKIP NAETDALLSV DREVPQAKSK TSVPLEMADE
MMAEPSNNIP REPVWTIWPV DQPFR