Gene PHATRDRAFT_44551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44551 
Symbol 
ID7197797 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp874228 
End bp886117 
Gene Length11890 bp 
Protein Length3905 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178595 
Protein GI219115599 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGAAC GTCTAGCCGG GAAGATACTG CAACGGTTTC TTGCCAAGTA CTTCGACGTC 
GAAAATAATG AAACCCTTAC TATGGGAGTC TGGTCTGGTT TGGTCAGTCT TCAGGATCTT
CACGTCAATA TCGACCAGAT AAATCCTTTG CTAGCGCAGA AAGGTATTCC CGTTAGAATA
CAGCAGCTTC ACATTTCACG TCTAGAAATT ACAATTCCAT GGAGTCAGCT CAGTTTTAGT
AGCGGTACGA TGCGAAGCTC GGGCCAACGA AACGAAGCCG CCGAGGTTGT AGTATTAATA
GACGGAGTCC ATTGTTTGGC TAAATCCTTT TTTGATTTTG ACGACATGGC AATCGCTGCA
GACCGAATTG CTCAGCGCCA AAAAATACTC AAGGCGGGCT TACAAAGAGA CACGGACAAG
AGCACCTTTG CCGAAACATT GAAACGGCGC TTGCGAGAGG GGCTTCTTCA ACAGCTCGCG
GAAAGTCTCC AGGTCCATAT TCGAAATGTG CATATTCGCT ACGAAGATTC CAATCATGCA
TTTTCTTGTG GCATGGTTTG CGAAAGTTTG CATTTGCAGC AAGATGTTAC GCCATCAAGC
GAGGAGGCAA TTCGGAAAGT TGTTCAGGTG AATCACGTCG GAATCTACTG GAATCCTGTT
GAAAAGCCTA GGAATGGATT GCCAGTTGAG CAGACGTTTC TAGGCCACCT GCCAACTGAT
CAAATATGCC GAGCTTTGGA TCGCTCCGTT GCCCGACGAA TGCCAACGCT CCAGGCGAGC
AACCCTCTGA ACGCTCCGAA ACACACGTAC CTGATAGTTC CCATTGACGC GTCTGTACAT
TTATCATTTT CCACTGACCC CAGAGATTTT GAAACGCGCC CCGCACTGGA AGTGATCATT
GAAGTTCCGG AACTAACGTT GGGCTTACGC GACTTTCAAA TCTATCAAAT AATCCGTCTT
ATCCACGACC TCAAAGAGCA CAATTACCGG AAGAAGCACT ATCGGCGTTT CCGTCCGTCA
AGCCCAGTCA CAGAAGATCC AAAAGCCTGG TGGCAGTACG CGGTTGCGGT CGTTCGGTAT
GAAATACGAG AAAATAGACT GCGTTGGTCT TGGGGACGTT TCCGTCGAAG GTTTGAACAG
CGGAAGAGAT ACTGCTACCT CTATGAACGG ATGTTGCGAG TCGAAATAGC AATGCCATAC
ACTCCTATAG AGCGTAGCTC ACTTGCTACT GAACTTTCTC TAGCACCTCT CAGTGCCACG
GAGTCTCAGG AACTAGAGGA CCTCGACAAC GGGGTTCTTG GTGATTTGGA TGTGCACGAT
ATTTTGCTTT TCCGTCTGCT TGTCAGCAAG CGGATGGGAT TGAAGAACAG CGGACAAGCT
ACAAATGGAA GAACAAATTG GTTTAGACGG ACAATGTCTA CTTTAGTCAG TGGAGATGAC
GAAGCGGAGG AGGAATACGA ACGATTGCTT TCGTATTGGC AACAAAGGAG CACTTCGGAA
AAAAATTGGT CAGATGCTTC CAGTGGCAAA TTTTGTGTCG CTCTATCTTC TGAGATAAGA
ATTGAAAACG GCCGAACGCT GTTGTATTCA CCAGTGAGTT CAACAATTGA CCTTGAACCA
TTGCGTAGAA TACAGGAATG TTTTCTTGAA ATTTCTTTTA CAGTGTTGGA CGGCCGACTT
TCTCTACTGC AGGACTTCGA GACTATCATA TCTAATATGT CGCTCCTTGA CTTTATCGTC
TCAGAAGTGA GCTCGACGCG GCGCGAAAAT ATTTTGGTAA AACGAATACA CAAACCAGGA
ATTGGCTTGT TTGATTCAAA ACGTCTCGAC GGCTCTGTTG GACCAGTATT TTCCTTGGTA
GTCAGGAAGA ACGCCGAAGA CTCTGATGAA TCTCATATCG AAGTTCAAGC AAGGATAGAG
TGTACCGAGG CTCACTTTTC GCCCGAGGAT ATTTGGTTAC CTTGCTTCAA GCGCCTGATG
CAGCCTGTCC CTCAGCTTCA GCAAATGGCA ACGTTCTGGT CCGAGCTAAG CATGGCTAAC
ATCAATTCTT GGGCCTCAGA TCGGCTAAGC CTCGTGGCAA AAGCACGGAC AGCAGTGAAT
CAGCATCGAA ACATTGATAT AAATCTTGAA ATTGAATGTC CGATCATCCG GGTTTCCGAT
GGAAAAGGCT GTGAACTTGT GTTTGATTTA GGACAAGCAT ACGTAAGAAC TGAGAGGCTC
GCCTCTTTTG GATCCAGTAA CTTGTCCACT TTTGCTGAGA TGGATGACCG AATGCTTTCG
AAGGATCGCC GATTTTCCTA TCAGCAGAGA AATCCGCTCA TTTCACGCGA TGTCGCTTCC
TTCAAAACTC CTCCACGGCT TTCTACAGCA GCCATAGAGC TTCCGACATC GCCAACGAGG
ACAGACTACG TACGTTTAGG ATTCACGGAA AATATGGATC GCGCAGACGA CAATCTGAAC
CTTATCAACC CGACAAGGTT GCCATTTTCA ACGAAAGTCA CTTCAATCGC TAGCGCCGAC
AGCGATCGGT CCTACCGGCA CCAATATCGG GGTCAAATGG AAGCAATATT TTACGATGTT
TTCGAAATGC GTCTCGTCAC TGGGACTGTT GTCTTAGGTG GGACTTCGCG AGATGTGCAC
AGGATATGTC ATCCGGTCGA CATGCGTGCG GCAATCCACA AATCAATCAT TCCCGCCGAC
CATACATTAT GCCGCCTAAA GCTATTCTGC GATATGGAGG ATATTTCTGT AGATCTTTCA
GAGCCGATTG TGTTTGCTCT AGGACGCTTT TTCAATGTCT GGAAATTGGT TTTGTCCACT
AAGGTATCAA AAACTCTATT GCTGACATCG AACTTGGGAG TAGAGTCCAT TCTACCCAAC
CTTCTGTCAG AAGTAGATGT TTCTGATGAT GACTTTGATG AAAATGGTTT CTTCGATGCA
TTGGAGAGCG AGTCATTTAA TTCCCCCGAT GCAGATGCTC TCTTCAAAGA GCATTGGGTG
TCAGAAACTA CGAACTTACT GGATGAAGGT GTCAAGTCTG TTACAAAAAG TGTTCGAAGC
CGACGTCGAC GCCCGAGATC TGTTTCTGAC GTATCTTCAC TATCCGAAAG TTCGTTGGCG
AGGCGTCGCC TCACGAACCA AGGAAACGGG AACATTTACT TAAGTGCTGA GAACCTTGCT
ATGTTAGAAG AGGCTGATGA GTCAGGTGGT CTCGATGAAG AAGAGGATGA TTACTCCTTT
CATTCTGCCT ATTCATTGAG CCACTTGTTG ACACTTTCCG AAAACGTACA AACAGATATA
ATCCAGGCAG TTTCCAATGT CCATCGTTTG AAAGACAAAA TATATGGCAC GGGTGTCTCA
TCTAGCGTTG GCAGTTTTGT GTCAGTTGAT GAAGTGAAAC GGAGCTTGAC GATGCGCAAG
GCGCTCCGTC TCGAACTTGG GAGAGCTACA GCAGAGTTGC AAGCTCTGCG TGATGCACAT
AAAGATCTTT TGTACCAAAT TAAAATCTCA TCCAGCGACG AGAACGAAGG AACGCCACTG
AAGAGCAATC GTTCCGGGCG CGTGTCAAGA TCGATTGAGG TAGCATCAGC CTTACTACAA
ACTCGGAAGC AGCCTACTTT CGCCGATACT GATGCAAACC TTGAACACAC TCTTGTTCGT
GGACTAAATC GTGATCTTCT TCAGATCTGG TTGACGTTGC GTTGTGTTTC AATTCGGTTC
GTTGTTAATT CAGGTACTGA CGGTTCGCTT GAATCGTCCA ATACCTTTGC TGTCAAGATC
GCCAACAGTA ACCTCGTTCG GAGGCAAAGC GCTATTGCTT CGAGGTATTC TTTTAGCCTC
GACAATGTAA GTGCAATTCA TGAAAACGAT GTGGACCCAG CTCTGGACCG GTGCTTCATC
GTTGGCGGAT CCAGCAGTGA GACTTCTTCG AAGCTTCTTC CGTCGCTATT CCCTCAGTTC
ATCTCTTCAC TATCGATGGA AAACAAATTT TTAAGCGGGG CGGTCTGTCT GCAAAGAAAG
AGACAGGCGA TGTCTGGAAA GTCGTCAAAG ATGGACAAGA TTCGGATAGC TTTCGGCGAT
ATTGAGCTGA AAGGTCACAC CAACTCCTTG CGTTTGATCC AGCAACAGAT GTCATCGTTA
TCTTCAGTTT TTCGAGTTAC AAGTACTGAA GTATACCGTA AAATGCCCAA GACTCAAGAA
CGAATGCTTG ATGGATATTT TGATGTAGGC CTTCGGATTT CTTCGCTGAG GCTCGTCTTT
GAGGACGAGG AAGGGCATTT TGCTGCTGCC ACTGCATTGA CCGAAGTGGA TATAAACTTT
AGCGGTGCTA GCTTCGGGCG GCAGTTTCAA GATCGGAGTC AGCTAGACAT ACGCCTTGGC
AACTTTCAGG TTTTAAGCAT AGAGGATCTA GATAGTGGAG TCGGGAATGA ATTGGTTGCG
AAACAAGATC TCTACAATCC TTTACTGAGA TTGCGGCTTC GACAGCAAAT CGTTCCTAGA
GGAGAACAGG GTGGTTGGGT GGTAGGTCAG TCATCCACCA TGGCCCTTCA TTCTAGCCTC
GACGAAATCA ACATTCACCT TGGTTTACGA ATGGATTCCA TGGAGTCCCA TCTGTCCTTC
TCCACCCTTT TTCGACTGTC GAAATCAATC TCCTTCGCGA CAAAGGCGTT TGGAAAGTGT
CCCACCGTGG GACGTCCCTC GCTTCACACA GTGGACTCCG GTGATTCTAG TTTGCTCTCT
TCGGTACCGG TGCGTTGGCG CGTTGATGTC TCGATGAAGA AATTTTTGAT GACTCTCGAA
AGCTCGGGGG CTTTTCTTGA AACTGGCAGA GAAAACCAGA TGGTATTGAT GGCTACAGCG
AAGTTAAACT GTTCGCTTCA ACCTAGTACC ACAAAATGTC GTGAAGGGAA GTTTGGAGTT
GAAGTTAAAT TCACTATCGC AGACTTCTCC ATTTCTGTCG TAACTGAGGA ACTTCCCGTC
CTTGAACCCA TATCGATACA CATAGACCTG GTTGCTTTCC CCAAAGCTAT GCAGTTCGCT
CCCTTCACGC TACCGAAAGA TTCTCCTTGG AGCGACCAGA CTACCCTTAG TACACCCACT
GGGCGAATTA CGTGGAAGGT CGACTACTTT CAAGACAAAC TGCCTATTTC TATCGCAGTC
GCGCTGTCGT CCACCCGGAT AAACATCTCG CCTTCGAGAC TTGGACTCCT ACATTCGGCA
TTGGAAGATT TAGCTAGATG TTCAGCAACT ATTCGACGGT CACAGCAATT GGTGTCTAGG
CCTGGATCGG AAGATCAAAT CGGACGCCTT TACCTTGATC TGACTTGCGA AACTTTGATC
ATATCCTTGT ATCCAGAGAC TCGATTACTT AGCTCTCATG CCGATTCCAG AATTCTCGCT
CTTTCGCTAG ACTACCTTTG TGCTAGTTGG CAGAAGAGAG GCAAGGCCAA ACGCTTTGCG
TGTGGTTTAA AGACACTTTG CATTGACGAT ATGTCCTCTC CACCTGGTAT GAGAGCTCTG
TACGGAAGTC GAGCCGAAAG ACAGCTTAAA TCTCGAATTG AAAGTACGAT TAATCACTTT
CCCAGCTTCT TTATGATGAG AATCGAGAAC TCTGAAGCAG ATGGCAAAAA GTACTTTGAG
ATCGATTTTA GTATTGGAAC AGTTCATTTG CTTTTTCTTC CGAGCCTCGT CCAATCGATC
AAATCATTTG TGATTTCGAT TCCTTCGCTG AGCGAGAGGA AACTACGATC GCCTGAATCT
AGTTCCTCAA GACATGGATT GAGATCGCTT TCAACAGCAG TGGGAATCTA TTCGGTGACG
GTTGTATCGG ATATTTTTGA ATGCTTTCTG TCCTCACAAG ACATACTCAA GGCTATTCGA
GAGACGCCCA GAACAAGCAT CGGCGTTGTC GTGTTGAGGC AGAGGGCTGA CATAGAGCTT
CAACTGGAGA TTGTGTCCTC CGATCATTTG GTTGAAAGGT TTCTCGATGA AATTGAACTG
AAACAGCGTG TTGAAGGTTT CTTGACTGCT GGAAAATCAA GTACGAAGCA AGTTTCGAGC
GTTGTTACCG CAAATTTGGA AGCTGAGGTG CACCAGTTCC AGGTTCTCCG GACGACAATA
AAAAGTATAG ATAAAGGTGA AGAGACTGAC GTTCTCCAGT TCTCTGTTAC ACCGCCTCAA
TGGGGTGAGC AACGCATTTC AAATGAGTTC AGTTTTCGCC TTCTACATCA ATTAACCGGT
GCGTGGTCAC GTGCTGGTCG ATTGTCTGTT CAGACGTTTT CATTTTCGAA CGCATTCGAA
ATGAAATTTG AGTTCGTTGA CATTTTGCTT TACATCGCGC AAAGTTCTGG CGGCATGACT
GATGCATTTC AATTCACCGT TTGGCCCATA ATTGAAGCTT TGAGCAAAAA TTCTGGGCCC
AAAAGGCTCT CTGAGGAAAA TTCAAATGAT TCTGGCTGCA ATGCGATTGT CGATGCGCTA
CTGAAAGGTC TATCGGTTGC TTCGATTCGT GGAGAAGGGA TACAACTCAC GTGCGTCCCA
GGTGGTGCAA CGCGATTGAC AGAGTCCCCG ATCATCAAAC TCGAAGTGAT GCGTGTCGCG
TTTGGTTGCG CTGGGGTGTC GAGTGCAACA GAGCTAGTAA ATCTCCCTGT AAGTGGCACT
TTCCAGGATG TCGAACAGAC AGATAAAATA GGACAGAACT TCGTTTTCGG AGCATGGTCG
ATGTTTGAAG TTTCAGCTCA TTACCATAAC CGCCGCTTGG TAGCTTGGGA ACCATTTATT
GAGCCTTGGA AAGTCAGTAT TTTCGGGGGA ATGGATTTAT GTTGCCCGAC AACTTTATTG
CCTAGTAAGC ACGCTGCTAC AGCATCCGAC TACACAACTG AAGACGGCTT TGTCACTAGC
GTCTATAGTC CTGGTGGAGG GAGACTTCAA GAAATTGGAA GACTCCTGCG GTCACCGTTT
CGTGGGGGGA TCTCAGCTTC TGGACAAGGA GTTACTAATT CGTCTGAGAT GCTACAAAGT
GATATCGACT TTTGTTATCT GCTCTTGCTT CACTCCTCCG AAAGTTCTGT TCAGGGCGCA
TATTTGCCAA CAAACGAGCT CCACTGCACC GATGCGAGAA AATGCTTGCC GTCAGATCAA
CCTCGGAAAT GGGTAAGTCA CTTTGGCTAT CCTCAAGACA AGATATCGCA AGCGAAAAAG
ATAAAGAAGC ACGCGTCTAT AACCTTGCTG GGCTCCGACA TCACTCCGCT GAACATCAAT
ATAACTGGTG CCCTGATTGA AAACGTATGG GACTACATAC GAAAGGACCA AGTAGATGCA
TCCAGGGGAG TGGCACCCCA TTGGGTACGG AATGAGTCAG GCCTGGTACG TAGTTTTTAA
ATTTGAGATT GCCACACTTT CTTTTGCACC TGCAACTGAT GCATTCCACC CTCTTAGACC
ATCCGTTTCC ATGAGGTTCT TGACCCTGAG CGCGTGTTCC GTGGAGAGAA AGCACCAAAA
TGTATTTTAC CTGACGGCTC GGAAGCACCG TTGTCGCTCA AGAGAACTCG GTCACAATCC
TGCGATCCTC ATAGAGCTTT TATTTTTCTT GAGCTTGGAT GCGATGAAGA TGTCTGCGGG
AGAGAACATC AGAACACATC CCATACCAGT GGTGTTTCCA AAACTTCGAG TTTCTATTTC
AAATCTACAA CAAAGATTCC AGTAGACACT GTCGGCCTCA ACAAATACCA TCTCGACCGC
AGAATCGAAT CGCGGGAAGC CGATGGAAGG CTGGATCGGT CGAGGCCTCT TGGATGCGTT
ATTGTGCGCG TTGCTCTCCA GGGTGGAGTG AAAGTAGTGT CTGTTGAATC TCCCCTTGTT
GTAAAAAATT TGTCGATAAC TGATATCATC TGTGAGGCCC GAGATCGTGA CAGTTCGACG
CTGCTGTGGC GGTCTCTAGT GCCAGGCTTA CAAAGCAAAA TATTTGGTTC TGCTGCGGGA
AGGACAGTAC CTGTCCCTGT TGACCTTGTC CCGTATGTTC ATGAGAGCTC CTGCTGTTTC
TCAGTTCTTT CCGTGAGCGG CTGTCTCGAG AGTGACTTAA GCACGGTTCC TAGCAGCCGA
CCATACGAAG GCCTAATGCG ACTGCCAAAG CCGTATACAC GTTCTTCACT TGAGAAGGGG
GTCGTCGACG AGATCCATCT CCGCGTGGCA TCCTTGATGC TTCCAAGCAA AGTGTTCAAT
GATGTCCCTC AGTACTTAAA TGGGTGCTCT CTTAGAATCG GATCGATTTC ACTTTCAGCG
ACGGCGTACC AGCGAAGAAA AACTGTGATG GACGTTCCAG AGCAAAGAAT GGTCTTAATA
AGACCAGAAG TTGTGTTCCG GAACCATCTG CCCGTGCCAA TTTGTGTCCA GGCCCGACCG
AGACCTCAAT CGCTAGAGGA AGGACCGCCA CCATCAAATT TATGGGTAGA CCTTGGTATT
TTGAAATGTG GCGAATTCGC TGGTTGGACT GGCGTTGGTC CTTACGACTT CTTTGATATC
CGGATTCTGA TCATCGAAAA AGACGGAGGA CCGAGCAAGC AGTTTCCGCA ATGGAGCCAT
TCTGTACTTG TTTTTTCAGC CGTGCCAGAA CAGTCCTCGG GGACTAATCG AGTAGGAAAA
CAGATCAAAG CAATGTACAA GCTGAGACTC GAGGACTCAG TGGGAACTCC ACTCACTTTG
TCGGTACATT CAAGCCAAGG CGATAACAAG TTAGTGCCTT TAAACGAATC AAATGTTCGC
CCTTTATCGG AGCGATTGCA GCCAGGTAAT CGAGTCCTGA GTATCTTCGC ACCATATTGG
ATTGTGGACA GTTCTGGTCT CGATCTTGAG TTTAAGCTGA CAAAGCCTAT TGCAGGACAA
ATTGGTCCTT CCGGGTCTCT ATCGTGCATG GAGAACGACT CCTTTTCCAC ACATGGACTT
GGGGAGCTTC TAGATGACAG CGACCTCATG TATCTGCCGT CAAGAGGTTT GTTCGAAATT
TTGATGGCTG GCGAAGAAGA GTCACGCCAG ATGAATGTTC GCCGACGTGC ATCTCGAGCC
ACTGGATTCA ACCCAAACAC AGCTCCATGG TCGGACACAA TTTCCTTGTC CCGAAGATTC
GGCTCGTATC ACGACACATA CGTCCAACCA GCAAGGCGAA GCTCGGTCCA TGATCACTCA
AATACGGATC ACGATTCTTT CGAACCCTTC GCCCTTCGTT CTCGTCTTGT TCGAGCCCCT
GAATCCCTCG GAGGTCTCTT GGGAACAAAG ATTATGCACT TTTTTTGTCG GTACTCAATA
TTGAACGAGC TCGGACGCGA CATCGAGATC AAAACCTGTG GTCTAAGTCG AGGTGCTCCA
TGTGTCGTGA AATCCGACTG TTGGCAAAAG CCGTTCCATA TCGAAGACTC ACGTTTCGTG
TCATTTCGAC CAAAAGAATA TGGGTGGGAA TGGTCAGGAC GTTTTCACGT CACATCCAAA
AGAAGAGTGG AGATGACTTT CCATTTAAGA CATACCATTA GAGACGAGTC TATTGGCGTA
ACAGTTGAGT GTGTCTCCCG CGAAGCATCT GGCACATGTA CAGTCATCTT TCGACCTGCA
CTCCACCTGC CCTGTCGTAT CGAAAACAAG AGCATGTTTC CCATAAAAAT TTTCCAAAAT
CCTTCGATAA TGTGCGTATG TGGCTTTGGC AGAGAAGGTT CCAAAGACAC CGTGATTTTG
CCATTTCATA GCTTAGAGTT TGCTTGGGAC GAGCCTGAGT CACGTCGCAA ATCTGTTCTC
GTGAAGGCTG TCAATTTCAG CGCCAATCGC GGGGACGGCA ATGCAAGAAA CCTGGGAATC
TTTGCTCTTG ATAGCTTAAC ACCGGGCACT GTGCGAAAGC TTGAACATAA CCTATCAGCC
CAAGTTCTTG CCGATGGTCC AACCAAAGTC CTTCGAATCG TCGAAACCTC TGATGACGGG
GGGCCCATAC ATGGAGAGTG TGAAGACGAT CAACTTCCAA GTTATCAAAG GCCCTCTGCC
AGTTTCCCCT ATTCAATTAC CGTGAAGCTG GCACACGGAA TTGGTCTAAG CGTTGTCGAC
TGGTCACCAC AGGAGCTAAT CTATCTCAAA TTGGAAGATA TCCTTTTTGA GCAAGCTCGC
AACAGCTTGG CGGAAAACAC TGACGTGAGT GTTGGAAGTG TAGTTGTAGA CAACTGTTTG
TGGGTATCCC CCTATCCAGT TGCTGTTCGA CTTGGGTCAC GTTCCCGTAA GCGACGGCAT
AGAAGGCATA ACGGGATTGC GATATCTTGG AGCCGTCCCT TGGTTCAGCG TGCGGCGTTT
GGCGATCTGA CCATGATTGA ACGCATTGAA ATATCTACTG AACCGTCAAT AATAAGTGTG
GACGGCAAGT TGGCCGAGTT TGCGATTGCG ATGACAAGAC AAGTTAAGAA AATGGGGTAT
AGTCTAAACG ACCTCGATCG AATCGTTGTA TCGCGTAACA GTGAGCTGCG AAAGCTTCTC
ACGATTGCTC TGCCCGTGGA CGGAAACGCT TCAGCGATAG CCACGGACAA ATCGCACGAA
TCACGATTAT CAGACGACTT GTATGCAGCC GTCGACTGCA TGGCAACACC AGCAATTGCT
TCTAAACTTC GATCACGATT TCGCCCTTTG ACGGCGAAAG GAGTTATTTC GAAGCGCATG
GAATCAGCCT TAAGTCCCGC CCCACCACAG CACAAATACT ATATCGAAAA ATTACGAATA
TCAGCTACGG CTGCGGAAGT GAGCTGGTCA GGTTCATTAC CAGTCGCGTT TTCGTTGCCC
CGCTGGATGC GACCAGCGTT GACCTTTGAA GGCTTTCCAC TTTTTTTACG ACCCTACTCG
GTGTCTCATT CATACGGAAC GGCCGAAGAA CACTTACGGG CCTTGAAATC TCACTACATT
AGTATTTGGA GGGTATTTGA TTTGGTTGTT GGCCTGGCAA AACCGACATT TTTGATACGG
GCGTGGTACT TTACCACTCG GGACATTCTG GCGACAGCAC TAATGAGTCT ATCAAACGGA
GTCTACAATG TTGGTGCAAA GCTGTTCCCG TTGTCGTCTG CAGAGCACAA TGTAACGGAT
GCGACTACAC AAGAAAGTTG GCTTAGGCAT CCTTATTGGG GCTTGCACTC ATGGCGGCAT
CCCATTGTAC AACGAATGTT CTATGTTTGC GCTGGAGGAA TGGCAACATC CGCCGCGTGG
TTGCGATATA ACGCTGCTAG ACATGTAGGT GGCTTGGTCA GGGCACGTAA TCCGCGACTG
TTCGCCTCCA CCGGGGACGG CAACGACTTG TTGGTAGAAT ACGTGGAAAA AGGTGAAAAC
GCCGGCAAGG CCTTACTATC GCGAGTCCGA ATGGGGTCGC ACCTGGGTGA AGGCTACGTC
TATCACGTGG AAGATGCTCA CCGACAGGGA TTGAGCAAAA ACACAGAGAT GGGACACGCA
ACAATGATTT TAATGTTGAC GTTCGATCGC ATCGTACTGT TGAACGGTGA ATTAGACAGC
AATTTTTGTC AAGTGGTTTG GGAAGTGCTC TTTTCCGATT TGGTGCATTT ACAGTTAGTC
ATGGTAGCTA TCCCTGCAAA CGGCAATGAT ATAGATTCTA CGAGCTCAAG ATATCAAGCC
ATTCGTCTTT GGTACCTTGC AAACAAGCCA CGTCAATCAC CAAAGCTCGA CGAACAGCTG
CAAGCTTTGG CTGGCTTGGA TGCAATGGAA TGTCACTTGG TGTTTGTGCC CCGTCGAGAT
GTAGGTATGT TGTTGGGCAA AGTCAGATTG GTTAAGGCAC ACGTATTAGA CTAGAGCGTC
TGCTGGACAC ATGACTATCA TGCTAACTTC ACTATCAAGC ATAGCTAGTA CGCTTGAATG
TCCAAGGTAG
 
Protein sequence
MLERLAGKIL QRFLAKYFDV ENNETLTMGV WSGLVSLQDL HVNIDQINPL LAQKGIPVRI 
QQLHISRLEI TIPWSQLSFS SGTMRSSGQR NEAAEVVVLI DGVHCLAKSF FDFDDMAIAA
DRIAQRQKIL KAGLQRDTDK STFAETLKRR LREGLLQQLA ESLQVHIRNV HIRYEDSNHA
FSCGMVCESL HLQQDVTPSS EEAIRKVVQV NHVGIYWNPV EKPRNGLPVE QTFLGHLPTD
QICRALDRSV ARRMPTLQAS NPLNAPKHTY LIVPIDASVH LSFSTDPRDF ETRPALEVII
EVPELTLGLR DFQIYQIIRL IHDLKEHNYR KKHYRRFRPS SPVTEDPKAW WQYAVAVVRY
EIRENRLRWS WGRFRRRFEQ RKRYCYLYER MLRVEIAMPY TPIERSSLAT ELSLAPLSAT
ESQELEDLDN GVLGDLDVHD ILLFRLLVSK RMGLKNSGQA TNGRTNWFRR TMSTLVSGDD
EAEEEYERLL SYWQQRSTSE KNWSDASSGK FCVALSSEIR IENGRTLLYS PVSSTIDLEP
LRRIQECFLE ISFTVLDGRL SLLQDFETII SNMSLLDFIV SEVSSTRREN ILVKRIHKPG
IGLFDSKRLD GSVGPVFSLV VRKNAEDSDE SHIEVQARIE CTEAHFSPED IWLPCFKRLM
QPVPQLQQMA TFWSELSMAN INSWASDRLS LVAKARTAVN QHRNIDINLE IECPIIRVSD
GKGCELVFDL GQAYVRTERL ASFGSSNLST FAEMDDRMLS KDRRFSYQQR NPLISRDVAS
FKTPPRLSTA AIELPTSPTR TDYVRLGFTE NMDRADDNLN LINPTRLPFS TKVTSIASAD
SDRSYRHQYR GQMEAIFYDV FEMRLVTGTV VLGGTSRDVH RICHPVDMRA AIHKSIIPAD
HTLCRLKLFC DMEDISVDLS EPIVFALGRF FNVWKLVLST KVSKTLLLTS NLGVESILPN
LLSEVDVSDD DFDENGFFDA LESESFNSPD ADALFKEHWV SETTNLLDEG VKSVTKSVRS
RRRRPRSVSD VSSLSESSLA RRRLTNQGNG NIYLSAENLA MLEEADESGG LDEEEDDYSF
HSAYSLSHLL TLSENVQTDI IQAVSNVHRL KDKIYGTGVS SSVGSFVSVD EVKRSLTMRK
ALRLELGRAT AELQALRDAH KDLLYQIKIS SSDENEGTPL KSNRSGRVSR SIEVASALLQ
TRKQPTFADT DANLEHTLVR GLNRDLLQIW LTLRCVSIRF VVNSGTDGSL ESSNTFAVKI
ANSNLVRRQS AIASRYSFSL DNVSAIHEND VDPALDRCFI VGGSSSETSS KLLPSLFPQF
ISSLSMENKF LSGAVCLQRK RQAMSGKSSK MDKIRIAFGD IELKGHTNSL RLIQQQMSSL
SSVFRVTSTE VYRKMPKTQE RMLDGYFDVG LRISSLRLVF EDEEGHFAAA TALTEVDINF
SGASFGRQFQ DRSQLDIRLG NFQVLSIEDL DSGVGNELVA KQDLYNPLLR LRLRQQIVPR
GEQGGWVVGQ SSTMALHSSL DEINIHLGLR MDSMESHLSF STLFRLSKSI SFATKAFGKC
PTVGRPSLHT VDSGDSSLLS SVPVRWRVDV SMKKFLMTLE SSGAFLETGR ENQMVLMATA
KLNCSLQPST TKCREGKFGV EVKFTIADFS ISVVTEELPV LEPISIHIDL VAFPKAMQFA
PFTLPKDSPW SDQTTLSTPT GRITWKVDYF QDKLPISIAV ALSSTRINIS PSRLGLLHSA
LEDLARCSAT IRRSQQLVSR PGSEDQIGRL YLDLTCETLI ISLYPETRLL SSHADSRILA
LSLDYLCASW QKRGKAKRFA CGLKTLCIDD MSSPPGMRAL YGSRAERQLK SRIESTINHF
PSFFMMRIEN SEADGKKYFE IDFSIGTVHL LFLPSLVQSI KSFVISIPSL SERKLRSPES
SSSRHGLRSL STAVGIYSVT VVSDIFECFL SSQDILKAIR ETPRTSIGVV VLRQRADIEL
QLEIVSSDHL VERFLDEIEL KQRVEGFLTA GKSSTKQVSS VVTANLEAEV HQFQVLRTTI
KSIDKGEETD VLQFSVTPPQ WGEQRISNEF SFRLLHQLTG AWSRAGRLSV QTFSFSNAFE
MKFEFVDILL YIAQSSGGMT DAFQFTVWPI IEALSKNSGP KRLSEENSND SGCNAIVDAL
LKGLSVASIR GEGIQLTCVP GGATRLTESP IIKLEVMRVA FGCAGVSSAT ELVNLPVSGT
FQDVEQTDKI GQNFVFGAWS MFEVSAHYHN RRLVAWEPFI EPWKVSIFGG MDLCCPTTLL
PSKHAATASD YTTEDGFVTS VYSPGGGRLQ EIGRLLRSPF RGGISASGQG VTNSSEMLQS
DIDFCYLLLL HSSESSVQGA YLPTNELHCT DARKCLPSDQ PRKWVSHFGY PQDKISQAKK
IKKHASITLL GSDITPLNIN ITGALIENVW DYIRKDQVDA SRGVAPHWVR NESGLTIRFH
EVLDPERVFR GEKAPKCILP DGSEAPLSLK RTRSQSCDPH RAFIFLELGC DEDVCGREHQ
NTSHTSGVSK TSSFYFKSTT KIPVDTVGLN KYHLDRRIES READGRLDRS RPLGCVIVRV
ALQGGVKVVS VESPLVVKNL SITDIICEAR DRDSSTLLWR SLVPGLQSKI FGSAAGRTVP
VPVDLVPYVH ESSCCFSVLS VSGCLESDLS TVPSSRPYEG LMRLPKPYTR SSLEKGVVDE
IHLRVASLML PSKVFNDVPQ YLNGCSLRIG SISLSATAYQ RRKTVMDVPE QRMVLIRPEV
VFRNHLPVPI CVQARPRPQS LEEGPPPSNL WVDLGILKCG EFAGWTGVGP YDFFDIRILI
IEKDGGPSKQ FPQWSHSVLV FSAVPEQSSG TNRVGKQIKA MYKLRLEDSV GTPLTLSVHS
SQGDNKLVPL NESNVRPLSE RLQPGNRVLS IFAPYWIVDS SGLDLEFKLT KPIAGQIGPS
GSLSCMENDS FSTHGLGELL DDSDLMYLPS RGLFEILMAG EEESRQMNVR RRASRATGFN
PNTAPWSDTI SLSRRFGSYH DTYVQPARRS SVHDHSNTDH DSFEPFALRS RLVRAPESLG
GLLGTKIMHF FCRYSILNEL GRDIEIKTCG LSRGAPCVVK SDCWQKPFHI EDSRFVSFRP
KEYGWEWSGR FHVTSKRRVE MTFHLRHTIR DESIGVTVEC VSREASGTCT VIFRPALHLP
CRIENKSMFP IKIFQNPSIM CVCGFGREGS KDTVILPFHS LEFAWDEPES RRKSVLVKAV
NFSANRGDGN ARNLGIFALD SLTPGTVRKL EHNLSAQVLA DGPTKVLRIV ETSDDGGPIH
GECEDDQLPS YQRPSASFPY SITVKLAHGI GLSVVDWSPQ ELIYLKLEDI LFEQARNSLA
ENTDVSVGSV VVDNCLWVSP YPVAVRLGSR SRKRRHRRHN GIAISWSRPL VQRAAFGDLT
MIERIEISTE PSIISVDGKL AEFAIAMTRQ VKKMGYSLND LDRIVVSRNS ELRKLLTIAL
PVDGNASAIA TDKSHESRLS DDLYAAVDCM ATPAIASKLR SRFRPLTAKG VISKRMESAL
SPAPPQHKYY IEKLRISATA AEVSWSGSLP VAFSLPRWMR PALTFEGFPL FLRPYSVSHS
YGTAEEHLRA LKSHYISIWR VFDLVVGLAK PTFLIRAWYF TTRDILATAL MSLSNGVYNV
GAKLFPLSSA EHNVTDATTQ ESWLRHPYWG LHSWRHPIVQ RMFYVCAGGM ATSAAWLRYN
AARHVGGLVR ARNPRLFAST GDGNDLLVEY VEKGENAGKA LLSRVRMGSH LGEGYVYHVE
DAHRQGLSKN TEMGHATMIL MLTFDRIVLL NGELDSNFCQ VVWEVLFSDL VHLQLVMVAI
PANGNDIDST SSRYQAIRLW YLANKPRQSP KLDEQLQALA GLDAMECHLV FVPRRDVAST
LECPR