Gene PHATRDRAFT_31156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31156 
SymbolDHC1 
ID7199205 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp217409 
End bp229898 
Gene Length12490 bp 
Protein Length4020 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185293 
Protein GI219130273 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GACGAGTGGA CCACATCCTT GCAGGACAAG ATTCGTCAAC TCGACGTGGC CTTGTCGCAA 
ACCTCGCGGA GTGCGCGATT GCCCCACGTC GTACTCAACA CCCACGAAGT TTTGGTGCAG
GCCGCCAGCC GCAAAACCGG CGACAAGATG GATCTGGAAG CACTCGGTTT GGCCAACTAT
TTGCAGGATG ACGAGTTTCT CAACAGCTTG CAAGCGACCG TCAACGGATG GATTGTACAA
ATTCGCAAGA TTACGACACT GCCCAAGTCG ACCCCCTTTT TGCCGGACGC AGCTGCCGAA
GAACTCACGT TCTGGACCCA GCTGCAGGCC GAACTCTTGC ATATTCAAAC GGAGCTTTCT
TCCGCCCCGG TAGAACTTAC GGTTGCACTC TTGCGCGAAG CCAAGCGATT CGTGGTGGTA
CTGGCATTGG AGAACAACTC TGGACTGGAA CAGGCTTTAT CCTACACGCA AGATGTGGCC
CACTTTATCA AGCCCTATCC CTTGCCGTCG CTGCAAGCGG CCCGCAGTGT GGAAGGCGTC
GTCGAAGCGG TGCAGAGTCT TTTTGATCAT TACGGCAAGT TGCGCCAGTC GCGTTTCTAC
GGCTTGCCCC GAGCGATTGA ATTACTCCAG GCGACTACGT TAGTCTTACG GGACACGCTA
TTGGCCATTT TGCAGGAGCA GTTTTCGAAT CTGCTCTTTA TGGATTATAA AGAGTATGAA
AGCAAAGTGA GATTTTCGTT CCTCGATGTT TTCGTTCAAT TTGACGATCG TTTCGAGGAA
TGGAAGGACT TTATCCTCGA GCAAGGACGT CGCCGCAAAG TGACGGGGTT GAACAAGTTA
ATTGAAGGTA TGTCAATCCA CCACTTGCAG CTTAAAGACC GGCTGGAGAC CATTCATCAA
TTCCGTTCAT CCCAAGAGCG TTTGCGTGAC GTCGTCCACA AGGTTTTGCG GGAAGAAGAG
CCAGCAGCTA TGCAGCAAGT TGAGTCAGTC CCGCGTCAGA TCTTTGCCTC GCTCAATGTC
TTGGACTTGA CGCCTAGCGG GACCAAGGCA CTGGAAAGTG CTTTAGAAGA ATACGATTTA
CAAATGGATG CCATGGAAGA GCGTCTAGCA AAGTTGTTGC GGGACAAATT GACGGCCTGT
CAAGACGCGG AAGATATGTT TCGGGTCTTT GCTCGCTTTA ATTTGCTCCT GACTCGAAGA
CGTGTACGCG CGTCTGTCAA GGAATTCCAA ATCCAGCTGA TTGCGACCGT TGCCCTGGCT
GTTGAAAAGC TACAGTCCAA ATTTACTCTC AAATACGAGT CATCCGCAGC CGCCCGCATT
TCTCGACTTC GAGGCATTCC ACCCGTGGCT GGTAAAATTC TTTGGGCGAA GCAAATGGAA
CGCCAAGTTC AAACTCTGAT GGAGCGAATG GGTGATGTAT TAGGCCCTGA TTGGGGGCAA
CAACTGGAAG GACGCCAGCT CCGCAAGAGT GGCGATGAGC TCTTGTCCAA ATTGGACGCC
CGTTCCTTTT TTCATGGCTG GGTGACGGAA TGGGAGAAGG GTTTATCATC AGCGGCCGCA
TCCAAGATGA ACTCGTACCC AATTATAGTG GAACCAGAAG GCCGGGACGG TGTTTTGACT
GCTAAAGTCA ATTTTGACGA AAAGAGTGAG CTCTTGTTCA AAGAAATTCG TCATCTGAAA
TGGCTGGGAT TCGGCAAGGA AATTCCGCGG ACTTTGAGTA TGGTCTCGGA CGAGGCCATG
TCTCGATATC CGTACGCAAT AGCGGTAAAA ACCGCTCTAC GTTCGTACAA ATCAGTGCGT
ATCCTGGTCA CTCCGGAGTT GGAACGGTTG GTAATGCCAC AATTGCTCGA AATTCGGGAA
ACTATCTCGG AGGCCTTTGA TGTCAAGTTG TCCAATTCTA AGGTAGCCAA GAAGCGTCGC
GTACGCTGGG AGGGTAAGGA AATATCGGAA TGGGTGGCTC AGTTGACTAC ATCTATCTCC
AAATTTGAAG ATCGTGTAGA GCAATTGCTT CGAGCTTGTG ACAAGGTGGA CATTGCATTA
AATTTACTGG AAGAGGTTGA TTACAACACT GACAAGTTTC GTGCAGTGTT AGCGAGCATA
CAAAAGACGA TAGATGACAT GTCTTTGAGT GGATACAACG ATTTGGATTC TTGGGTTCGG
GTGATTTCGT CCGAGATGGG CAAGGTACTC TCCAAGCGAC TGGAATTTGC CTTGAAGGGA
TGGAACCAGA CGTTGCAGGT GACAAAAGAG AGCAACAATG AAGACCATGT GGAGCTGATC
GAGACAGCTG TTGTCGAAGA TATTCCTTTT CCTGAATCGA TTACCACCAA GGTTTCTCTG
GAGATTGTCC TCCGCAACCA GGAAATCTCT GCTGTGCCAG CGCTACCAAT GGCTCGAGCC
ATGTTCTTGA ACAAGCTACA TGAGTACTTT TCTGTGGTTT GTAGCTTGCC GCGACCGAAA
AGTGGTCGCT ACGAGGTCTT CGACAGCGTC GCTGTCAAGA ATTCTCAAAC GTCCACTGGT
TTGGAGGAGA CTTTCGACAA CTTGGTCTAC CTTCTTCCGC CTAAACTTGT TGCGGATGCT
TACGGAGCAA TCGAGGCTCA CATTAGGCAA GCATCTGCGT TTGTAGAAAG CTGGCTGGCG
TATCAAACTT TGTGGGATAC CCAAGTATCG GATGTGGCAT CCTCTGTGGG GTCGGACATT
GAAAAATGGC AAGCTCTCCT CTTAGAGGCT TCCGAAGCAC GCGCAACCTT GGATACATCG
GCGACAGTTG CAGAATTTGG ACCAATCCTC GTCGATTACG GAAAAGTGCA ATCGCAAATC
AATCTAAAAT ACGATTCTTG GCAAAAAGAG CTCCAGTCAA ACTTCGCATC GATTCTTGCG
CAGCGTATTT CTGAAACTCG CGAAAAGATT GATGGCGCCA AGACCAGATT GGAAGAAACA
ACGCTTACCA ATGCATCCAC TGAAAGCATC GTGCTAGGTG TCACCTTCAT TCAAGAAATT
AATCAGAAGT CTCAGCTTTG GCGGAAAGAT GTGGACATCT TACATTCCTG CGAGCGGCTT
CTCAAGCGTC AACGGTATGC ATTTCACAGT GAGTGGGTCG AGACCACTGT GGTAAAAGGT
CTTTATGATA GTCTCCTTCA AATTCTCGAG CGCCGAACAC GCACAATGGA ACAGCAGGTC
CCGTTGCTTC AAACTCGGGT GTCCGCTGAA GACAAAAAAT CTACGAAGGA GCTTGCCGGA
TTGATTGGGA CCTGGAACCA GGACAAGCCT CTACGTGGTA ATGTGACACC TCCGCAGGCT
ATCGAAGTCC TCTCAAAGTT TGAAATCAAT TTGACCAAGG CCCATGTTCA TCAGGAGAAT
CTTGGACGAG CGAAGGATGC CCTTGGTTTG GAACATACGA CTGAGAGCCA TGAGGTCGTT
GAAAGTCTCA AAGAGTTGAC AGACCTGAAA GAAGTATGGG ATGCAATGAT GGAACCTTAT
CGCTCACTCG AAGAAATTAA GGATACGCTA TGGTCGACAG CTGTCATGCG AAAAATTCGC
CGTGCGCTAG ATGATATACT CGCATCCATG AGATCACTTC CGAATCGAAT ACGTCAGTAC
GACGCCTATA CCCAGCTCCA CGCAGTAGTC AAAGGCTACA TCGAGGGGCA TAGTCTTTTG
TCTGATTTGA AAAGCGAAGC GTTAAAGGAG AGACACTGGA AGACGATTTT TCAACGCCTT
GGTATTCGAG TTCCCTTTAT AGACTTGACT GTCGGTATAC TGTGGGAAAA TGGTATTCTG
ACCCGAAAGA AAGATGTCAG CGAGATCTTG ACCGTGGCAC AAGGTGAAAT GGCACTGGAA
GTCTTCCTTA GCGAAGTGCG CGACCGTTGG CTTAAGCAGG AACTTGAACT TGTCTTGTTT
CAGAACCGTA CTCGATTGAT ACGAGGCTGG GATGACCTTT TCGCAACTTT GGACGATCAC
ATCGGGGGCC TTGCGTTGAT GAAAAGCTCT CCATATTATA GAGCAGTCCG TGAATTTCAG
GAAGAGGGAA AGCTCTGGGA GGATCGGCTG ACTCAGCTGC GCGCTGCCTT CGATGCATGG
GTGGATGTTC AACGTCGATG GGTCTATCTT GAAGGAATTC TCTTTGGAAG CTCTGATATC
AAAGCGCAGC TTCCTGCCGA GTGGTCTCGG TTCAAGAGCG TTGATTCTGA ATTTGTGTCT
CTGATGCGTC GAATCGTAAG CAAACCATTC GCCATGGAAG TTCTTAACAT TGACAATTTA
CAGCGAACTC TTGAAAGGCT TGGCAATCTC ATGACCGTCA TTCAAAAGGC TTTAGGCGAA
TACTTGGCTA AACAGCGTAG CGATTTCAGT CGCTTCTATT TCCTTGGAGA CGACGATTTG
CTGGAGATCA TGGGAAATTC AGGCGAGCCC GGTAAAGTAC TCGCGCATAT TGGCAAAATG
TTTGCTGGTA TAGTCGGTGC CCGTCGCGCT TCTGGTGATG TGCCGGAAAA TGTAAAGACG
CGTTTTGACG CTATGGTCAG CAAAGACGGC GAAATAGTTG AGTTGCACAA ACCAATCGAT
ATCACTGCAG AAACGACGGT GAAGGGTTGG TTGAAGGAAC TTGAAATTAG TATGCAGACA
ACATTGGCTT TGCTACTGCA ACAGGCTGTC GGAGAAGATG TCTACTCGAC AAGTGCTCAG
CTAGACGAAG ATACAGAAGT CAGCTTCGTA GGGTGGTCGA CGAAATTTCC AGCGCAAGTC
ATGATTCTTG CTGCACAAAT AAATTGGAGT ATGGGTGTCG ACAGTGCTCT TGGTACTACC
GAGCCGAATG CTGCTCTAGC TGATGTTCTG CGTGTACTGG AGTGGAAGCT TGAAGTGATG
GCCAATACCG TATTGGAGGA ACTGCCCGCC GATTCTCGTA AGAAGTTTGA GCAACTCATT
ACCGAGCTTG TTCGACAAAG AGATGTCGTT CGTCAGCTTA TGAAGGACAG CGTCTCCGAC
CCTATGGATT TCCGTTGGTT GTATCATCTC CGATACAATT ATGACCCCAA AGCGGAGAAA
GTGACCGAGA AGCTATCTGT CTCGCTCTCG AACGCAAAAT TCTACTATGG GTTTGAGTAT
CTTGGTATCG GAGAGCGACT AGTCCAAACG CCTTTGACAG ACAAGTGCTA CCTTACACTA
ACTCAAGCAC TGCACTTTCG GATGGGTGGT AGCCCGTTCG GACCTGCTGG GACTGGAAAG
ACGGAGAGCG TCAAAGCTTT GGGTGCTGCA CTTGGACGGT TCGTTCTCGT TTTCAATTGC
GATGAGACGT TTGACTTTAG CGCGATGGGT CGTCTCCTTG CGGGGCTTTC TCAGGTCGGG
GCGTGGGGGT GTTTCGACGA GTTCAATCGC CTCGAAGAAA GAATTCTGAG TGCTGTAAGC
CAACAGATTT TGACAATTCA GCGTGGCCTT CTCGAACGGA AAAGTCAGAT CGAGCTACTC
GGTCGACCAG TGAGTCTACA CAATAATGTT GGCATTTTTA TCACAATGAA TCCCGGCTAT
GAAGGACGTA GTAATCTTCC CGATAATTTG AAGAACCTCT TCCGATCGTT TGCTATGGTT
GTTCCGGATC GAAAACTGAT TGCACAGGTC ATGCTCTATA GCCAAGGCAT TGTCACTGCA
GAACATCTCG CCGGAAAGAT AGTCGACCTC TTTATGCTTT GCGATTGCCG TCTGTCAAAA
CAGCGCCACT ACGATTTCGG TCTTCGCGCT TTGAAGACTT TATTAGTAAG CGCTGGAGCA
TTGAAACGAC AGGCCATCGA AGGCAGGAAT CTTCACGGCG AGGACCTAGC ACTTGCGGAG
AAAAATGCCT TAATCGTTGG TGCATGTAAC AATGTTCTCC CTAAGCTTGT TGCTGAAGAC
ATGGTAGTCT TCAAAGAAGT CCTCGAAAGC ACTTTCCCAG GGTCCGATGT TGCCAAAATG
GAAGACACAA AGGCGAGAGA GGAGATTGTC TCCGTTTGCA AAAGAAGCTG TTTTGTACCC
GGTGAAGGCT TTCTACAGAA AATCCTCCAA TTGAAACAGG TCATTGAGAT GCGCCATGGA
GTGATGGTCG TCGGTCCAGT TGGTGTCGGA AAATCCACCG CACTTAAGGT CCTTTTGGAA
GTGTTGGAAA AGCTTGATGG GACGAAAGGA GAAATGAGCA TTATTGATCC GAAAGCCATT
AGCAAAGAGC GTCTTTACGG GTCGCTGGAT GGGACAACGT TAGAGTGGAC AGATGGGGTA
TTTACAAGTT TGTTGAGACG TATCGTCGAC AATCAAAAGG GCGAATCAGA TCGTCGCCAT
TGGATTGTCT TCGACGGTGA CGTTGATCGT AAGTCTCGTT TACCCATCAC ATAAAGAAGG
AAGCATTCGG TCACTGTGCC TCACTTTCTT TGCTTTTCGT CTCAAGCTAA TTGGGTCGAG
AATCTTAATT CAGTTCTCGA CGACAATAAA ATGCTAACTT TACCCTCAGG TGAAAGGCTG
AGTATTCCTG ACAACGTACG CATTATCCTG GAAGTTGACA GTCTAGCGCA TGCCACCCCT
GCGACGGTTT CACGGTGTGG GATGGTCTGG TTCAGTGATG ACAATGTCAC GAACGAAATG
TCGCTTCAGC ATATGCTTCA ACGACTGGCA ACAGAGGATT TGATGGGCGA CAGAGTAGCT
GGACAGCAAG TTCCATCAGC GCAAATTGAG TTCCTAAACG AAATCACGGG CCAAGTTATA
TCCGAGAGAA CTTCGTCACT GGTGATCGAT GCCCTCGAAT TTGCCCTGAA GCAACGCCAC
ATAATGGAAC CCACCAGTGA TCGACTTTTG CATACTTTCC GGGCATTGTT GATTCAGGGG
ATAGTACTCG CAATCGAGTA CGACGAGAAT CATCCTGATT TCCCAATAAC GGGGGAGCAT
ATGGAAAAGT TTGCCAAGCG GTGGTTACTA CACTCACTTA TGTGGTCTTT TTGTGGAAGC
GCCTCTTGGG ACGTGCGAAA GAGTTTTAGC GACATGCTTC TCCGTACTAG CGGTATCATC
ATCCCGTTCG GAACAGACAA CACTCTCTAC GACTATAGGG TCAGAGTTGA CGATGGAGAA
TATGAACTTT GGAGCGATAA TGTGCCTCGG ATGGAGATTG AGAGCCATCG TGCAGCCGCG
AGTGATGTCG TCATTACGAC GACAGACACG GTTCGACATT CTGATGTGCT TGGTGCTTGG
CTAACGCGTC GCATTCCTCT TATTCTGGTG AGTTAATTGC GTGCAAAGCA AAATAATCAA
AGGCCTGTTC TCCTTTGAGT ATGATCTTAC TGACACTTAT CATCAATGTT GAAACGTAGT
GCGGTCCACC TGGCTCGGGT AAGACAATGA CACTGGTAAA CGTCCTGCAA TCAATACAAG
GCGTGATCCT TGCCAACTTG AACTTTTCAT CGAGAACGAC TCCAGAGATC ATTCTAAAGA
CATTCTCACA GTACTGCTCC TATGTTCGTC GAGGAAAGGA CATTTTTCTC GAACCAGGCG
AAAGCTTCGG TGCTACCAGC TGGCTGGTCG TATTTGCAGA CGAAGGTAAG TGTACAGCTA
GGACGTCGTG ATTTTTTAAA AAACGATTCT CATCGACGCA AATATCTTCA GTCAATCTAC
CGGAGGAGGA TAATTACGGG ACGCAACGAG TTATCATGTT CATGCGTCAG CTCGTTGAGC
ATGGAGGATT TTGGAGGAAC GATAATGTTT GGGTAAAAAC TAACAGAATC CAGTTCGTGG
GTGCCTGCAA CCCTCCAACA GATGCTGGTA GAGTTGAAAT GTCGCGTAGA TTCATGAGAC
ACGTTCCATT ACTTCTAGTG GACTTTCCAG CAAAGGACTC GTTAATGCAA ATCTACCGGA
CGTTTAACGG GGGAATGATG AAGCTCTTCC CAAACCTCAA GGGCGAAACA GAGGCAATGA
CCGAGGCAAT GGTGGAGCTG TATACGGAAA ACCAGAGAAA GTTTACTCCG GCTATGCAGC
CTCAATATTT TTATTCCCCT CGGGAACTGA GCAGATGGGT CAGAGCAATC TACGAATCTG
TCGTCAATAG TGACCAGGGT TTAACACGGG AGGAGCTTGT TCGAGTGTGG ACCCATGAAG
GATTGCGACT TTTCGCGGAC CGACTAGTAG ATGTCGAAGA CAAAGAGTGG TGCAGCAGCA
AGCTTGATGA AGTTGCGCGA AAGTGGTTTG CTGGAGTAGA TTTTGAAATT GCGCTTGCCC
GTCCCATGTT TTATACTACT TGGCTCAGTA AGGATACACG CCGCGTTGAA CGCGGCGAAC
TCAAGGACTT TTTGAGTGCT AGACTCCGTG TCTTTTACGA AGAAGAGCTC GATGTGCCTC
TTGTTGTTTT TGATGAGGTG TTAGAGCATG TTCTAAGAAT CGACCGTGTC TTGCGCCAGC
CAATGGGTCA CCTTCTCCTT GTGGGCGACT CAGGTGCAGG AAAGACAGTG CTTTCGAAGT
TTGTTTCTTG GATGAACGGT CTCAGTATAT TCCAGATTAA GGCCCATTCT CGCTATGGGA
TGGAAGATTT CAATGAAGAT TTACGAGGTG TCATGAGACG CGTCGGCGTA GACGGCGAGA
GGGTTTGCTT CATTTTTGAC GAATCGAACG TGCTGTCATC CGGTTTTATA GAGGCGATTA
ATGCCCTCCT GGCTAGCGGT GAAATCCCAG GGCTTTTCGA TGGTGATGAT TACACTGCAC
TCATGAGTGC CGTCCGCGAC ACTGCCGCCC GGGATGGCGT GATTCTAGAT AGCGACGAAG
AATTGTGGCG CCATTTCACA AGCATTGTGC AACGCAACCT ACACGTTGTC TTTACGGTGA
ATCCAAGTGG TGGAGATTGG AAAAACCGCT CGACTACTAG TCCCGCTTTG TTCAATCGTT
GCGTCGTAGA TTGGTTTGGC ACATGGGGCA GCAAAGCGAT GGGCGAAGTC GGCAGAGAAT
TCACTACGCG ACTTGATATG GGCGATTCGG AAACGGAAGG AGGTGCGTGG GGTATTGGCG
AAGGTGAAGA GCTAATGAAG CGTGTCGAAG ATGCTTTCGA CGATAATTCT ACCGGGGGGC
TTCGTCAGGC AGTTGTTGCT GCTCTCGTGA ATATGCATCA GATTGCTAGG GAGATGGCAG
AAGAAATCGC CTCATCGCCC AGCAGCATAA CTCGCACCTT TCTTTCCCCT CGAGACTACC
TCGCCTTGAT TCAGAATTTC GTGTCTTGTT TCAACGAAAG ACGCGAGAAA GTTGAGGACG
AGCAACTGCA CGTGAATGCT GGTCTCAGCA AGTTGAAACA AACGCAAGAG AATGTTGCTG
AGCTAAAACA AGGACTAGGC ACCAAAACCG CAGAGCTTCG AAAGAAAGAG ACTCTGGCCA
ACGAGAAACT GCAGCAGATG GTTGCCGACC AAAATATTGC CGAAAAGCGG AAGGAGGAAG
CTGAAAGAAT GAGTGTCGAA GTCGAAAAGC AGCAGAAGCA AATCAATGAA CGCAAGGATC
GAGCTCAAAA GGATCTTGAT GAAGCTGAGC CTGCTCTACG TAGCGCGCAA GCAAGTGTAC
GTGGCATAAA GAAACGGGAT CTAGATGAGA TCCGAAATTT ATCCCGACCT CCAAATAACG
TCAAGCTTAC TTTGGAATGC GTGGCGATCA TGTTAGGAGA AACCAGCGTT GACTGGACCG
ACGTCCGAAA ACTGCTTGCT AAGGCCGATT TCATTCCGAG TATTCTCAAC TTCGATGTTG
ACAATTTGTC CGCGAAACAA ATCAAGTTGG TTAAGGAAAA ATACCTAGAC GGGAATTCTG
AACTCAACGA GGAAAGTGTG TTGCGAAGTA GCAAGGCGTG CGGACCATTG TACAAATGGG
CGGAGTCGCA AATAAAGTAC AGCACTATTT ATAACAATAT CCAACCTCTA CGAGAAGAGG
TCGCTCAGCT TGAAGAGGAA GCTGATATCA TCAAAACTGA AAAGTACAAG ATCGAAGACG
AGGTCAAGGA ACTAGAAGCT TCGATCGCAA GTTATAAGGC CGACTACGCA AGCCTCATTC
GCGACGTGGA GGCTCTTAAG TCCGAGATGG AAGCCGTCAC AATGAAGATT GATCGAGCGG
AAAGCCTGAT GACAAGCTTG AGTCACGAAA GCGAACGATG GTCAAAGAGC TCGGAAACTT
TTCAAACGAT CATGCGAAGC CTTGTAGGTG ACGGGCTTTT GATGGCAGCT ACACTCACGT
ACCAAGGATT CTTTGACTTT AAAGTCCGCT CCATGATGAT GTCGCGGTGG AAGAAGAGTT
TGGAATGTCT CGACATTGAG TTCCGCGAGG AACTTGGTAT TGTTGAATCT TTGAGCACGG
GAGCCCAGCG TTTGACGTGG CAGGCACAAG GCTTGCCAGG TGACCAGCTT AGTCTTGAAA
ATGGTGTTAT TTTGGATCAC GGAATCCGTT TTCCCTTGGT CATAGATCCC TCGGGGAATG
CAATTGACTT TTTGATGAAC AAACATAAGG ACGAGAAAAT CCAAACAACT AGCTTCTTGG
ACAAATCATT CACGAAAACG TTGGCTGGAG CTGTTCGTTT TGGAACTACG CTTCTAGTCG
AGAACGTAGA AAGAATTGAT CCTATTTTGA ATCCTATTCT CAACAAGGAA ATCCAGCGAA
CCGGTGGACG TACTCTCGTC CGCATTGGCA CGGAAGAGGT GGATTACAGC CCACAGTTCA
AAATTATTTT AAGTACAAAG AATCCGGGTG TGCAGCTAAC GCCTGATCTT TGCTCTCGCG
TGACACTGGT GAACTTTACT GTAACTCCAG ACAGTCTTCA AAGTCAAAGT TTAAGACATC
TTGTGAAATC CCTTAAACCT GAACTGGAAG AGCAGCGAGC GACGTTGCTG AAACTCCAGA
GTGAACAAAA TGTCAAGCTC CGCGAACTCG AGGACCAAAT GCTTGCCAAG ATCAGCGCCT
GTGAAGGCAG TATTTTGGAC GACGACCGTG TGGTCGAGGG TATGGAAATT CTCATGAAAG
AAGGCTCTCA GGTCGAGGAA CAAATTTCTC ACAGTGCCGA AGTCATGAAA CAAGTGCATC
AGGCTGTGGC GCGCTTCGAG CCCTTTGCGG CTGTCTGTCG AAAAATGTTT GTTTTGTTGG
AGGCCTTGCG CGAACTGAGT TTTCTATACG AGTTCCCGGC GAAGGCGTTC ATGACGATTT
TAGAACACAT TCTCGAACAC GAGAGTGCAT CGAGAGAGGC CGACGAAGCT GCTCGGATTG
GAGCTCTAAA GATGGCTCTT TTTCGGGAAA CTGCTGCTCG CATTGGCCGA AGTCTGCAAG
TAGACGACAA ACTTGTTTTT TCCATTTTGT TGGCTCGGTT TTATCAAAAC GATGAAACCA
TGGGCTCCGC TCAGTCTGAA TCGTCCGAAG ACTTGATTGC CGTCATCACT GGTGTTTTCG
GTGAAGACTT TCCTTGGCAA GGTCGCGCAC TGAACGATCT CAACGAGGTG ACGGAATCGG
AGATTAACTC AACGGTACCT CTGTTACTGT GCAGTGCTCC AGGCCACGAC GTGAGTGGCC
GAGTCGAAGC CATGGCCCGT GATTTAGGCA AGGAGCTGTT CAGTGTCGCC ATGGGAAGTA
CGGAGGGCTA CGCAACAGCC GAAAGTATGG TGGCAATGGC GTCGAAACGC GGCACTTGGG
TGATGCTGAA GAACGTACAT CTTTGTATTG AATGGTTGCG TGATTCATTC GTCAAACGAC
TACAGACCCT CGGTTCCCAA ACCCACAAGG ACTTTCGAAT TTTCATTACT AGCGAAATGA
ACGATCGGTT GCCTGCCGCC TTGTTGCAAA TTTCGGATTT GATGGTGGCC GAAGCACCAA
CAGGCGTCAA GGCGTCGCTG ACGAGGTTCT TTTCCAGTCT TTCGAAAGAC CGTCTGGGCT
CGTCGAGCAT GATTCACAAT CGCATGTACT TGTTGTTGGG GTGGACGCAC GCCGTCATCC
AGGAGCGATT GCGGTACGTG CCCAACGGAT GGACGGAAAG GTACGAGTTC ACCGAAGCCG
ATGCGTGGCA TGGTCTGGAC GTGATTGACT CGTTGGTATC CACGTCCGGT TCATTGGACG
AGAGCAACAA CCGCACACCT TCGGATCCCG AACATCTTCC CTGGGACGCG ATTCGTTCCA
CTTTGTGCAA GGGTGTCTTT GGTGGACGCG TAACTTCGGA GGTGGACCAA CGGGTGCTGG
ACGAGTTGGT GCACGCTGTT TTTGTGCCGG CGTCTTTCAA CGTGGACTTT CGCTTGGTCG
AGGGATCGAC GACCAGTCCG ACCCTACCGG ACGGGAGCCG TCGGGAGGAC ATTTTGTCTT
GGATTGCGTC GTTGCCGACC CACACTCCGC CCGCCTGGAT CGGGCTGGAT CGTTCCGCCG
AAGAAGAACG GGAACAGCGT GTGGTCCAGT CGGTCCAAGA CAAGGTGAGC AAGATGCAGG
TCCAGTGCGA GAGTGACAAG GAATAAGAGA ACAAAAAAAA ACGAACAGTA AACGATCCAC
CTACGCCGTA
 
Protein sequence
MDLEALGLAN YLQDDEFLNS LQATVNGWIV QIRKITTLPK STPFLPDAAA EELTFWTQLQ 
AELLHIQTEL SSAPVELTVA LLREAKRFVV VLALENNSGL EQALSYTQDV AHFIKPYPLP
SLQAARSVEG VVEAVQSLFD HYGKLRQSRF YGLPRAIELL QATTLVLRDT LLAILQEQFS
NLLFMDYKEY ESKVRFSFLD VFVQFDDRFE EWKDFILEQG RRRKVTGLNK LIEGMSIHHL
QLKDRLETIH QFRSSQERLR DVVHKVLREE EPAAMQQVES VPRQIFASLN VLDLTPSGTK
ALESALEEYD LQMDAMEERL AKLLRDKLTA CQDAEDMFRV FARFNLLLTR RRVRASVKEF
QIQLIATVAL AVEKLQSKFT LKYESSAAAR ISRLRGIPPV AGKILWAKQM ERQVQTLMER
MGDVLGPDWG QQLEGRQLRK SGDELLSKLD ARSFFHGWVT EWEKGLSSAA ASKMNSYPII
VEPEGRDGVL TAKVNFDEKS ELLFKEIRHL KWLGFGKEIP RTLSMVSDEA MSRYPYAIAV
KTALRSYKSV RILVTPELER LVMPQLLEIR ETISEAFDVK LSNSKVAKKR RVRWEGKEIS
EWVAQLTTSI SKFEDRVEQL LRACDKVDIA LNLLEEVDYN TDKFRAVLAS IQKTIDDMSL
SGYNDLDSWV RVISSEMGKV LSKRLEFALK GWNQTLQVTK ESNNEDHVEL IETAVVEDIP
FPESITTKVS LEIVLRNQEI SAVPALPMAR AMFLNKLHEY FSVVCSLPRP KSGRYEVFDS
VAVKNSQTST GLEETFDNLV YLLPPKLVAD AYGAIEAHIR QASAFVESWL AYQTLWDTQV
SDVASSVGSD IEKWQALLLE ASEARATLDT SATVAEFGPI LVDYGKVQSQ INLKYDSWQK
ELQSNFASIL AQRISETREK IDGAKTRLEE TTLTNASTES IVLGVTFIQE INQKSQLWRK
DVDILHSCER LLKRQRYAFH SEWVETTVVK GLYDSLLQIL ERRTRTMEQQ VPLLQTRVSA
EDKKSTKELA GLIGTWNQDK PLRGNVTPPQ AIEVLSKFEI NLTKAHVHQE NLGRAKDALG
LEHTTESHEV VESLKELTDL KEVWDAMMEP YRSLEEIKDT LWSTAVMRKI RRALDDILAS
MRSLPNRIRQ YDAYTQLHAV VKGYIEGHSL LSDLKSEALK ERHWKTIFQR LGIRVPFIDL
TVGILWENGI LTRKKDVSEI LTVAQGEMAL EVFLSEVRDR WLKQELELVL FQNRTRLIRG
WDDLFATLDD HIGGLALMKS SPYYRAVREF QEEGKLWEDR LTQLRAAFDA WVDVQRRWVY
LEGILFGSSD IKAQLPAEWS RFKSVDSEFV SLMRRIVSKP FAMEVLNIDN LQRTLERLGN
LMTVIQKALG EYLAKQRSDF SRFYFLGDDD LLEIMGNSGE PGKVLAHIGK MFAGIVGARR
ASGDVPENVK TRFDAMVSKD GEIVELHKPI DITAETTVKG WLKELEISMQ TTLALLLQQA
VGEDVYSTSA QLDEDTEVSF VGWSTKFPAQ VMILAAQINW SMGVDSALGT TEPNAALADV
LRVLEWKLEV MANTVLEELP ADSRKKFEQL ITELVRQRDV VRQLMKDSVS DPMDFRWLYH
LRYNYDPKAE KVTEKLSVSL SNAKFYYGFE YLGIGERLVQ TPLTDKCYLT LTQALHFRMG
GSPFGPAGTG KTESVKALGA ALGRFVLVFN CDETFDFSAM GRLLAGLSQV GAWGCFDEFN
RLEERILSAV SQQILTIQRG LLERKSQIEL LGRPVSLHNN VGIFITMNPG YEGRSNLPDN
LKNLFRSFAM VVPDRKLIAQ VMLYSQGIVT AEHLAGKIVD LFMLCDCRLS KQRHYDFGLR
ALKTLLVSAG ALKRQAIEGR NLHGEDLALA EKNALIVGAC NNVLPKLVAE DMVVFKEVLE
STFPGSDVAK MEDTKAREEI VSVCKRSCFV PGEGFLQKIL QLKQVIEMRH GVMVVGPVGV
GKSTALKVLL EVLEKLDGTK GEMSIIDPKA ISKERLYGSL DGTTLEWTDG VFTSLLRRIV
DNQKGESDRR HWIVFDGDVD PNWVENLNSV LDDNKMLTLP SGERLSIPDN VRIILEVDSL
AHATPATVSR CGMVWFSDDN VTNEMSLQHM LQRLATEDLM GDRVAGQQVP SAQIEFLNEI
TGQVISERTS SLVIDALEFA LKQRHIMEPT SDRLLHTFRA LLIQGIVLAI EYDENHPDFP
ITGEHMEKFA KRWLLHSLMW SFCGSASWDV RKSFSDMLLR TSGIIIPFGT DNTLYDYRVR
VDDGEYELWS DNVPRMEIES HRAAASDVVI TTTDTVRHSD VLGAWLTRRI PLILCGPPGS
GKTMTLVNVL QSIQGVILAN LNFSSRTTPE IILKTFSQYC SYVRRGKDIF LEPGESFGAT
SWLVVFADEV NLPEEDNYGT QRVIMFMRQL VEHGGFWRND NVWVKTNRIQ FVGACNPPTD
AGRVEMSRRF MRHVPLLLVD FPAKDSLMQI YRTFNGGMMK LFPNLKGETE AMTEAMVELY
TENQRKFTPA MQPQYFYSPR ELSRWVRAIY ESVVNSDQGL TREELVRVWT HEGLRLFADR
LVDVEDKEWC SSKLDEVARK WFAGVDFEIA LARPMFYTTW LSKDTRRVER GELKDFLSAR
LRVFYEEELD VPLVVFDEVL EHVLRIDRVL RQPMGHLLLV GDSGAGKTVL SKFVSWMNGL
SIFQIKAHSR YGMEDFNEDL RGVMRRVGVD GERVCFIFDE SNVLSSGFIE AINALLASGE
IPGLFDGDDY TALMSAVRDT AARDGVILDS DEELWRHFTS IVQRNLHVVF TVNPSGGDWK
NRSTTSPALF NRCVVDWFGT WGSKAMGEVG REFTTRLDMG DSETEGGAWG IGEGEELMKR
VEDAFDDNST GGLRQAVVAA LVNMHQIARE MAEEIASSPS SITRTFLSPR DYLALIQNFV
SCFNERREKV EDEQLHVNAG LSKLKQTQEN VAELKQGLGT KTAELRKKET LANEKLQQMV
ADQNIAEKRK EEAERMSVEV EKQQKQINER KDRAQKDLDE AEPALRSAQA SVRGIKKRDL
DEIRNLSRPP NNVKLTLECV AIMLGETSVD WTDVRKLLAK ADFIPSILNF DVDNLSAKQI
KLVKEKYLDG NSELNEESVL RSSKACGPLY KWAESQIKYS TIYNNIQPLR EEVAQLEEEA
DIIKTEKYKI EDEVKELEAS IASYKADYAS LIRDVEALKS EMEAVTMKID RAESLMTSLS
HESERWSKSS ETFQTIMRSL VGDGLLMAAT LTYQGFFDFK VRSMMMSRWK KSLECLDIEF
REELGIVESL STGAQRLTWQ AQGLPGDQLS LENGVILDHG IRFPLVIDPS GNAIDFLMNK
HKDEKIQTTS FLDKSFTKTL AGAVRFGTTL LVENVERIDP ILNPILNKEI QRTGGRTLVR
IGTEEVDYSP QFKIILSTKN PGVQLTPDLC SRVTLVNFTV TPDSLQSQSL RHLVKSLKPE
LEEQRATLLK LQSEQNVKLR ELEDQMLAKI SACEGSILDD DRVVEGMEIL MKEGSQVEEQ
ISHSAEVMKQ VHQAVARFEP FAAVCRKMFV LLEALRELSF LYEFPAKAFM TILEHILEHE
SASREADEAA RIGALKMALF RETAARIGRS LQVDDKLVFS ILLARFYQND ETMGSAQSES
SEDLIAVITG VFGEDFPWQG RALNDLNEVT ESEINSTVPL LLCSAPGHDV SGRVEAMARD
LGKELFSVAM GSTEGYATAE SMVAMASKRG TWVMLKNVHL CIEWLRDSFV KRLQTLGSQT
HKDFRIFITS EMNDRLPAAL LQISDLMVAE APTGVKASLT RFFSSLSKDR LGSSSMIHNR
MYLLLGWTHA VIQERLRYVP NGWTERYEFT EADAWHGLDV IDSLVSTSGS LDESNNRTPS
DPEHLPWDAI RSTLCKGVFG GRVTSEVDQR VLDELVHAVF VPASFNVDFR LVEGSTTSPT
LPDGSRREDI LSWIASLPTH TPPAWIGLDR SAEEEREQRV VQSVQDKVSK MQVQCESDKE