Gene PHATRDRAFT_42892 details

Gene Information

Locus tag: PHATRDRAFT_42892
Symbol:
ID: 7196470
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1432123
End bp: 1444698
Gene Length: 12576 bp
Protein Length: 4067 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177293
Protein GI: 219111083
COG category:
COG ID:
TIGRFAM ID:
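
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon; neither is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1432123
end_bp = 1444698
protein_length_aa = 4067

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: three bases per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that the coding sequence does not account for
# (introns and any flanking untranslated sequence).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 12576
print(coding_bp)       # 12204
print(non_coding_bp)   # 372
```

The computed span equals the listed gene length of 12576 bp. Under these assumptions the span is longer than the coding sequence, which is consistent with the gene being marked as spliced.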


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGATTGGGTG TAGACCATGG CTTCTGAACC TGATTACTCA GCAAATACCA ACGCGAGCAG 
TAGCAGTTCA GCGCCAGCGG GTAGTATGAA CTGGGAACCT GTCCAGCAAC GCCTGCTGGA
TCCGGATCTG AACGTGGCCT GGCAGGCTGC CAAAGAATTA CGGGACAATG TTGAGGTTGC
TCATTCTACC GAGTACCCTC TACTCCTGTC GGCCCTCTCG CCCGCCTTTT CGTCGATTCT
CACGCAGCGA ACCAAGCCCA ATAGAGATAC GAATAGTGTG GAGCATAAGT TGCGTCACGC
GATATTGGAC GCCGTGAGCA AGTTTCCTTC GAACGAAGTC TTACGCCCTC ACGCTCCCCA
TTTGGTCGCC ATTGCCCTTG AGGTTCTGAA TCGTGACTAC GAGGCCAATG CGCTGCTCAG
TTTACGGATC ATCTTTGACC TGTACAAAAC CTACCGATCG CTACCACAAG ATTATGTTCA
GCCGTTTCTA GATTTTGTGG TTTCAACCTA TAGGGCCTTG CCCCGGGCAG TACAGCATAA
CTTTGCCTGG GAGCATTTGA ATTTCAAGGC GACTGAGACC AGTAAAACAG CGACAACGAC
TATAATTTCC ACGCCGATAA CCTCAACTGC CACGCCGACG ACGACAGCAA CTACCGCCAC
TATTGCAGCT GCCGAACCAG TCGAAGAAAA AGACCAAGAT GGCGACCTTA CTATGCAGGA
CATAGAAGTT CCAGCTACCG GTCCGGAAGT AAAATCTTCA ACACCAGCAT CGTCAGTACC
CTCTGGGTCC GCTTCAGTAG ACTCTTTGGG AAGCGCCATC CCTGGCCATC CGAACAGCCC
GGAAGCACGG CTGCCCGTGC GTTCCAACCT TTCCTTTCGT GTCCTTACCG AATGTCCCCT
GATTGTTATG CTCATGTTAC AATTGTATCC AAAATTTCTC AAAACCAACA TTGCCGCCTT
GATCGTTGTC ATGATGGAAG CCTTGGCCAT ACGTTCTCCG AGCTTGTCCT CCATTACTCC
GCCCGAAATA GCGTCTACGG ACAGCCCAGT CAAACGTTCC TACCACACGC GCGTCCGTGA
ACTTGCCGCT GCACAGGCCA AAACGCTGTC GTTTCTGACC TTTCTACTCC GGTCGTTCCA
AGCCGAACTC AAACCGTATG AAGATCGTCT CGCCTCACAC GTTGTCGCTC TGATCAAATC
ATGCCCCCGC GAATCCACTA GTACGCGCAA AGAACTTTTG GTCGCTACGC GTCATTTGCT
GAGTAGCGAC TTTCGGAAGG GATTTTTTCG ACACGCCGAT GTATTAATGG ACGAACGTCT
TCTGTTGGGT TCACACTACC GATCAGCCGA CCAAGCAAGC TTGCGCCCGT TGGGTTACCA
AACTGTGTCA GAACTTGTAC TGAATGTTCG CAGCTCCTTG ACGATGCTAC AAATGAGCAA
GGTGGTCAGC TTGTTTTCTC GAGTGCTGCA CGATGAAGGA TCGACATGTC CCATGCCCAC
GCAGTACTTA GCAGTGCGTA CTTTGCTGAA TTTGGGAGAT GTTATTTATC ACAACACGGA
TCTAAATCCG CAACTGGGAC GGGACTTGTT GGTGCGGATT TTGAACACTT TGACGGAGAA
GTTGACTGCT TTAAACGAGT ACTATCCAGA AGTGCAAAGG GCAGAACTTA AAAGGGGAGA
AATTGTGTCA CCTACCGTAC AGACCACCTC ATGTCATGAT TCAGTCCGCG ATTTGCAGAG
TATGATTCGG GCAATCATTG TGGGAAACAA AAACATTGTG TTTTTCTTAA GTAATTATCG
AAATCAACGG GACAAAGAAA AAGTGCGAGA AACCTTGGTA CCGCCACCTG GTTCAAATGA
AGAAGTGTCG TCTGCCTATC ACAAGCTGAC CCATACGGAA GTGGCCATTC TTGATCGTTA
CATTATTGCG TCGTTGCCTG CACTCAAACT GCTGAAAATG ACCAGTACGG GGCAGAGCCG
AGTGGGGGGT GAAAAGACCT TGGCAGATCA TCACCGTGAT ACATTAACGT ACTTCGCCGC
AACATTTGCA GCCTTGGATG GTTACAACTT TCGGCGGACG ATTGGCCGCC GTCTCAACCT
TCTAGTGGAC GCTATTGTAG AAGATCCACT CGTGATGATC GTGCCCCGGC ATCTTCTTGC
TGTAAATGCG GGGACGAGTT ACGAATTCTG TAGCATGTTG ACGTGTTACC TTGTTGAACG
ACTTGACGAT CTAGCTTTGC CTCATCGAAA CAATATTGTG TTTCTTAAGC CATCCTGTGG
ACAAGCCGAG AATGGCAAGG ACGTTGTACT GGAGCAGTTG CGCGAGATTT CTCAGAATCC
CCGAGACAGT GAGAAACATC AGCGCCAGCG AAGTTCCACG TACTTACAAC TCTTCGAGCG
TGCTCTGAAG TCCTTGGCTC CATATCCAGA GAACGAATCT ACCATTCGTC GATATTTGCG
TTTTGTCGTC TCGGAATGTT TACGGTCCTC CTTAGAAACC TCGGAACTAT GGCCAGACAA
CTACTGTATT CTGCTCCGAT ACATATTCCG ATCGATTTCA GCGGGCAAGT TTGAAGAGTC
TTACAAGGAG CTGTTGCCTT TAATTCCTAC GGTTCTAAAT GGTTTGTATA GGGTCATATG
CACTGCAGAT GATACCATGC TACGACGCAC CGCCATGGAA CTTTGCCTGA CGATACCTGC
AAGACTATCC TCTCTCCTTC CTCACATGAA TCTACTGGTG AGGGTAATTA TTCCAGCTCT
TGATTCAAAT TCTGGAGATC TGGTAAATTT AGGGTAAGCA AATACCGGAA ATCAAGACGA
TTGCTTGCCT CGCTTATCTC TCACTATCAA TTGGTGTGTA TACTGCTTGC AGACTGCGAA
CACTTGAATT TTGGGTCGAT AATTTGAATC CATTGTTCTT GTACCCCGAG ATGTCAAAAG
ACATTCCTTT GCTCTCTGCT ATTATGCGGT CGTTGTCTCG TCACTTGCGC CCAGCGCCCT
ATCCTTACGG CCTGCTAACC CTGCGATTGT TGGGGAAGCT GGGCGGTAAG AATCGTCATT
TCTTACGTGA ACCGCTTCAC TTGACAAATA CATCCAACTT TAATACAGAA GCCGTAGAGG
TCGACTGTTC GTGGATCGTT GGAGACGAGA ATTCCGTTAA ACCTCTCAAA ACTACAACTA
CAATTGCGCT TCCTCTTGAC CGCTGTATCG AAATGCTTAA GACTATTGCT ACTTCACAAG
AGTTTGTTGA GCTCAAGATA TGCGTAGAAG ATGAGCCTAC AGCTCTCAAA CAAACTGTGA
TTCCATGGGG AGAATATGAA GTATTGTGGT CAACACATTT GGAAAATGTG GATTTCGAAG
CATACTCAAA AGGCGTGAGC AATGAGACAC GACGTTCTCA GGCCCACGCT TGTCTCTCAA
TCATTAAGAC AGCGTTGGGA GTGCTCGAAA CCTCTCGAAA ACAGCATGAG CAAAAAGATG
CTGCAAAAGC TCGAGATCTT CTTTCCAGCT TCGAATCTTA TGACGAAAAT GTAACTCCCG
GTAGACTTGT CGCGCTGGCT CTCATGTTTG CTCGCATGAT TGATAGTACC AAGCAAGAAT
CCCAAAATCT CCTAATTGGC GCTGTTGGAA AGCTGCCTCC TGCCGATTTG AGTGACGCCC
TGGCTGACTT CCTCTCTGAA CCTATTTTGG GGACAAATAC GATTGCGATT GAGATTTCAA
CATATTTTTT GAAAAGCGAG AGGAGTTTGG AAAATTCAGA CGTTGTGACA TTTGTAGAGC
ATCTAGTCAG ACGGTTGTGT GGGACCTGCT GTTCAGCAAC CTGGAGCCGT CAACGCGGTG
CTCAGCTTAT TTTGTTATTT TTGGTAACCG AGCTGGGTCA TGAATGGGCA CTCGAGCACG
AATTAACTCT CATAAATGCA GCAATTATAT CAGTGAAAAG TGTTCCTCGG GAGCTCAGCA
CTGCTTCAAT CAAGGCTGTA GAGTTCTTTG TTCGCATTTG TGAAGGCGTT TACGGCAAGC
TGAACAGAGC ACAAATACTT CAACAAGGGC TGATGTGGGA TATCCTATCT GACGACGATA
ACAGAATTCT GCAATACACG CGTGATAGTG TTGTTACTTT GGGAGATGAT GATGCGATCG
ACCAGATTGT TAAGTCCGAC AACCTTTCAC AAACGACAGA AGCACCGTCA GATTCCACTG
AGATTCCGAG TACCAGAAAA AAGGGCAGCG AGGAAAAAAC GCGATTGACA AGACCATCGA
AGGAGGTTTT CCGAGTCATG ATTCGCGAAT TGATATCAGC TCAGCACGCT GTAAGGTGAG
CGGACTGGAA AACATTCTCA CCATTTGTTG TGTGGATTTT GTCTTGCAAA GGTAGACGTA
AGCTTCGACA AGCCAATGCC TCAATTCTCT GTTAATTTTC ACCTCTTCCA TTCAGATTCG
TGGCACGATT TATAACATCT TTATACATCG TGCCGCACTG GCGTGAAGGA AACGACCAGA
GGGACGAGGC GGAACACCCG ACATTTATTC GACGAGCCGT ACTATCTAAA TCAATGAAGC
TATTGCCCCT TCCTTACCAG GTTGGTGCGA TCGAGGGTTT GGCATCTGTC GTCAAGCTGT
TTCCTGGTGT TCTCCCTCTT GACGACCAGC ATTTCCTAGG GTTTCTGTCG GAGCTCTTGA
AGATGACTTC TGTTCCTGAA GGCGATATGC AAGACCCAAG TTGGGCTGAT TCTGTGGTGG
ATAAAAATGG ATTTGCCGGC GTTTCTAGTG AAAGATTGAA CATTGGCAAT CCGACACATG
CCTCGGCACT ATTCTGTCGT CGCGAATGCA TTCTGAATGT TGACGGCATG ACATTGGTGG
TCCCGGCTGA GCTTCCGATG GGTGTGCAAC TCAGAGTTTC TGCGATAAAA TTATTTCACG
CGGTAATCGT GAGCTACGCA GACGCATTCT TCAATGCGGA TAAGTCAACG CCAATAGGTC
AGTAGTATTG AAGATCTCAC GACCGTAAAA TTTCTTGGAT GTGCAAAAGC AACATTAACT
TATCTTTTTC CTTGCGGTCA ACGTAGGAAA GATACGGTCG CATGCTGTTA GCCTACTCTT
CCGGTCGCTT GTTTCCCAGC CTGATAGGGC TGCGCAAGCC GCCTATGTTG CTCTGCGGGA
TGCGCGTATT CCGAAAACGG CTGAGGTGGA ACAGTCTCGA GGGCACTCAT GCCTATCGAA
AGACCTTATT CAAACATGCA TTCGTCCGGT ACTTCTCAAC CTGCGAGACT TCAAACGTTT
GTCGATTCCC CTTCTTCGCG GTCTTTCTCG ATTATTATCT CTGCTCAGCT CGTGGTTTAA
CAAAACACTT GGTGAGAAGC TGTTAGACCA TCTACAGAAA TGGACTGATC CTGGAAATAT
TATGTCCTCG AATATTTGGA GTGAGGGCCA AGAGCCACAT GTTGCTGCTG CAATAGTTGA
TATCTTTGCA CTTCTACCCC ATGCATCTAA TTTTGTTGAA CCATTGGTCA AAACATGCAT
GAAATTAGAT GCCACACTAC CTGCCTTTAA GGCCAGATAC GTCGAATCAC CGTACCGCGG
ACCCCTTGTA CGTTATCTGA ACAAGCATCC GGGATTTACG GTTAGCTTCT TTTTCCAGCG
CTTGCGAACT CCGATATATA GTGAGCTATT TCATTGGCTC ATCAATGTGG ATGGTAGCAC
TGATCTTCGG AGCTACTTGT CGAATAAGCA ATGCTCTGTA ATGATCCTCA ATGTATGCTT
TGAGACCCCT CTTGCAATTA TGCGATCCGA GAGGACTTCT CCCGCTTCTG GAAGTAGGAT
ATCCCTTGCT TTGCATGGAA TTGGTCAACA CAGTACAGCG TCCCAGAACC CTGGAGGTAC
ATCTGCACGA ATGATGAGTA CGGAGGCACT TGAGCTACAG TTCCAGGGCT TCCGAATCGT
GGAATCACTC ATGGCAAACG ATATTTCATA CGTAAAAGAT CATAATGACA TTGTAAGAGC
CTTTCGATGG CTTTGGCGCA GCAAGGGGCG TGCTTTACGC CTTCAACACG AAGAATCTTT
GTCACCACGA TTTCACGATG AATCCAAGAT GCTTGCTTCG TTCTTGCTCA GCTATGCCAA
ATCATTCCCG AATGAGGATC TGGATATCCT TTTTGAGTTA GTCCGCGTAT ACATTCAACC
TTCAGGAGTT GATCTTACCT TCATAAGTCG ATACCTCGAG GAAATGGTGT CCAACGTCCT
GACAGCCGAA CAGAAGAAAC GTGTACTCGA ACGATTTTTC GACCTTGTTT CGCGTGAAAA
AAACGAAGAG ATTAAGGTAC TCAGCATACA CTTTCTACTG TACCCGATGA TTTTGGCCAC
TCACAAAGAA GAGTCTCGGG ACTCCAGTCT TAGACTAGTC GACTCAACGA TAGTTGAGAG
GTTTACAAAA GAGGTTTTGT TCTCTCAAGG CGCTCCATTC GCTTGCGGCG AAAGACTGAA
GGTTGAGCTT CTTCGTCTTT TGGACTTGAT AGTTGAGCAA ACAAAGTCGG CCATCGAACC
CTTCCGAAAA GATGTCGTTC GATTTTGTTG GGCTCTTATA AAAAATGAAG ATGCTTCCTG
CAAGGGATGG GCGTACGTTT TAATTTGTCG GCTCGTCGAG TCGTTCGAGA CTCCGAAGAA
GATTGTCAAT CAAATCTACT CTGCTCTACT TCGATCACAC CAGCAAGATG GGAAGGATCT
TATCAGAAGG GCAATTGATT TGCTGATCCC AGTATTGCCC AAGCGACTGG ACGAAGATGA
TATGAGACGA GCTATAGACC AGGCTAGTCA CCTGCTGTTT GAAGATAACA GTTCTACTCC
CCAGCTCGCA CATATTTGTA CCATGATCGT CAGAAGTCCG GATGTTTTCA GGCCATTTAG
AATTCGATTT ATTGGCCAGA TGATTAATTG CTTGAGTCGC CTTGGACTAC CTCCCAATTC
TTCAGCTGAG AACAGACCAC TAACCATTGA TATGGTCGAT CTTCTTTGGG AGTGGAATAG
CAATGTCGCC GAGTCCTTGC CACTCATAAG TCACGAACAG ATGGACGTAG TGGGAAACTT
TCTTGTACGC CTGAAAATCG TTTCATCCGA AGAAAAACCA GACGGCAGGA CGGCTAAGTT
TGATGCCGGT TTACCATCCT TTGACGAGCG GCTTTCTTGT CTACTGGTGA AGGTCTTGAA
ACAGAAAGGC GTCGATATAC GGAAGCAACC CTTCGAGAAA GTTATGGAAA AAAATCTTGG
AGACAGCGGA CCTGTGCTAG CGTCCTTGGA TCTATTCGTT TTGATGCTTG AATCCGGTCT
ACATGAATTT TTCACGGCCA ATGAAGTGTT GGTGGAGAAA CTTGTTTTGT CGGCCTTCTC
TCATGCGAAA ACTTACGAGC CTCTTCGAAA GAAGCTGCTT TTCTTCACGC ATGCGTCTTG
TTCTTCTACA AAGCTGACTT CCATCGTGAT ATTGTGTGTT GAGCAGATAA TCATAGAGTC
TACAGATACG CAAAAAAAGC GATCGCCGGG TAAACAGCCA GAGGCGAGCC GCCAGGTGCA
TGGTCGCAGC AGCAAGGACA AAGAGAAAGA TGATGCTACC AAGATTGCTT TGGCCGAGTT
TTATCGCTTC GGACTAGAGC TAGCATCAGA GCTGTGTAGA CAAAGCGATA GCGGTCTAAG
AAGACTCACG AACGTCCTTC TTAACCTCTT GGCAGTTCTG ACAAAAACCC ATGTATTGGA
AGCGGCAGCA AAGCAACGTC AGGGGGGCTC AAGTGGTCCA CCGACCAATA CCCCAGGCGT
CCTGCACCAC ACTCCAACTA AAGGTATTTT GGAGGAGACC TGTGATCAGT TTTACGGAGA
CCAATCGCGT ATTCTGGGAG GAGGAAAAAC AAAGGGAGAA AGGGAAAATT CAGAAAGAGA
TGATGCAGTT CGATCGTTGG TAGTCACTCT AGGAATTTTT GAACGAAGCG ACGTTGCATT
CACTTTTACG CAGAGTCGGA AAAACTTGTT CCAGATTTTG AGTAGTATCC TCGATTCCAG
TGACAACGTA CAACTACTGA TGGTGGCCAC TAGAGTAATA GGAAAGTGGC TCCTCTCAGA
CGACTCTGGA GGGCCTCTGA CTTCGAAAGA GCGGAACAGT TTTCTCTGGA AAATTGCTTC
ATTCGATTCC AACGGTCTGT CAGACGACTT GACCTCCCAA CCACTGGCCG ATCTCGTGGC
TTACTTCGTC CAGAGGGTAT GTTTTGGCGT TGAGGGCAAG TTGAAGGATG GCGAAGGTTT
GGTCATTGGT AGATGTATGG TTGCATGTCT CTTACATGCT CAAGAGGATG TAAGAATCCG
TCTTATCACT TCTTATGTTT GCGGTTTTCC TTCCGAGCTG TCGAGCAGAG GGGCAGCAGC
TATGAATCGA TCCGTTGTCG ACGTCTTTTG GAGGCTCTTA AACAGCGATA TCGAAGGTCT
GGCATCACGC TATTGGACTG TCTTTTACGT CGACTGCTTG ATATCCTGTT TGTCGACGAA
AGACTACCAC TGTCTAAATG GTCTGCGAGT TCTTGCCCAT GGGGATACAA CAACGTGTCA
AAAACTATTT GAGTGGCTTC TCCCTGCTAT TTGGGATATC ATCCCAAGCG ACGCTATACG
GCTTCGTATT TCGACTGGAA TGGAGTTTCT TCTCTCGCGT CCTTTTCACG CCCAGTTTCT
GAAGGCTGCA ACTTTCACCC CTGGCGCCGA ACGTCGCTGT TCCAACGCTG TTCGGTCGTT
TTTGAGTGGC GTGGCGGCTT TGGCTCCGGC TCCCGTGCTA GAACCTTACC TTTTGGTTTC
TCTAGCCGAA AACTACAACT GTTGGTTTGA AAGTATCAGC CTGTTGGAGA AGCACTTCTC
GTTGCTAGGG TCGACTCCTA AAGGTCTACT CAGCTTGGAT GCAATGGGCC ACTGCTACAG
GCGGCTTGGC GAAGACGCGC TGTGCTTGAC ACTTGCCAAA GAAGCGTGCA CCTTACCCGA
AACAGCTATT GCAATAGGCT TAAATGTCTA CGGTATGGTT TCTGAGGCAG CCAAACAGTA
CGAAGGCTTG GTAGACAGTG CTGGAGGGAG GGAGTTTAAG CACGTACCAA CCGATTTTGA
AATGGATCTT TGGGAGGAGC ATTGGATTGG GCTGCAAAGG GAACTGTGCC AGCTTGACAT
CGTTTCAGAG TTTGCCAACT CTGCTGGAAG CCTACGGTTG AGGCTAGAAT GTGCGTGGAA
AGTCCAAGAC TGGAACACAG TTCGTTCACT TTGCACTTCC ACATCGCTTC TTGCAGCTGT
TGAAAGTGGA GACCCTCTTG TCAAAATCAG CGAGACTCTG TTGGCCGTCG CAGATGGAAA
ACTCAGCGAG GTGGAAAATC TACATGCACA AACAGCGCAA CTCTGTTTGC ACAAATGGCA
GCTACTTCCT CTACTAAGTA GTGGGAGCAC TTCACACACG TCCTTGTTGC ACTTTTTTCA
TCGGCTGGTA GAGATTAGAG AATCGGGGCA AATTATGGTT GAGACGAGCA CCCACTCCTC
CGAGAAAACG CTACCAGATC TCAAGAACCT CTTGAATGCC TGGAGACACC GTCTGCCAAA
TGACTTCGAT CCTATTGCAT CGTGGGATGA AGTTTTCTCT TGGAGAGCGC ACATGTTTAG
CGCCATAACA TCCAATTTCC ACTGGAGCGA GCCCAATACA TTGGCGACAC TTCACGATAG
GCCCTGGACC GCTATCCGAA TGGCGAAAAC CGCCCGCAAA CACGGAATGC GAGATGTTTC
TCTACTCACT TTGAATAAAA CTGTCGATGA ACGTGCAATG AATGTGTCCG ATGCATTTTT
AAAGTTGCGG GAGCAGATTC TTGCTTACTA CAACCCAGGA TCGGACGCTG AGAGACACGG
CGGACTGAAC CTTATAAATA CTACGAATCT CTCTTTTTTT GACCAGCCGC AAAAAAGTGA
GATCTTCAGG CTCAAGGCAT CTTTTCTTGC ATCTTTGGGG TTCAGGTCAA AAGCAAATCA
AGCTTTCTGT CACGCACTTC AAATCTCTCC AACTCATGCT CGGGCTTGGG AAAGCTGGGG
TGGCTTGTGT TCAATGCTGG GTGCAGTTGC TGAAAAACAG GTGGATCTAT CAACGTCAAA
GGGTGGGCAC GAAGGCAGCA AAGAAGTGGG TACTGATTCG TCCAAGCGGG TAGCGCAGTA
CCTTGCACAA GCGATGGGTT GCTATCTCGA GAGTATCCAG CTCGACACAA ATGAATGGAC
GCGCATTCAT CTTGCGAAAT GCATTTGGAT GCTCACCAAA GATGGCGGAA CACCAGGTGT
TTTATGCTCG ACGTTTGAGA ATCGAGCAGC TCGGCTGCCC CCGTGGGTTT GGCTGCCGTG
GCTTCCACAG CTTTTCACTT GTCTATATCG GCCAGAGGGC CGTGCAATTA GAACAATTTT
CTCTCGCGTT TTGAAATCGT ATCCTCAAGC TGCATACTAT CCGTTGAGAG CATTCTTCCT
GGAACGACGT GATGTTGAGC GAACAAAAAG TTCGTCATCC ACCGAGCCCG GACAACACAA
CGGTTCTGCT TCCTACTCCG AGGAGATGGT GTCTCAGCTG AGGCGCGCTC ATACCTCTCT
TTGGAGTTCG TTGGAGGCTA TACTGGAGGA ACTCCTGATG AAATTTCGAC CTTCTCACGA
GGAGGAATTT CTTGCAACTA TCGTAGCTTT ACTCGAACGA GCCGAGACGC AGACCGCCAC
AATCGGGACA AAGGACGAAG AAACAGTGAC TGCATCTGTT TGGAAAACAT TGGGAAGGAT
TGCGGTCAAG TACTTTCGTC ACTCGGATTC ATCCTCAGAC CGCAAGGATA CACGATCTAT
AAAAACCGCG GAATTTAAAT CACAGCACAG GAAATCGTTT GAAGATGACT TTCACGTCTC
TTCGAGCGAA ACGGTAGAAA GCGATTCATC ACCACCACCT ATGGAGCTTA TCGAAATCCT
TTTGCTGATA AAGAGCTGGA AGGCAAAGGT GGAAAGTCAC GTTATGTCAA CGCCTCGGTC
TATTCCATTG ATCACGTCCT CGCCATCACT GGCGATGTAT TGTATCGGTG ACGCACCAGA
TCTCTGGCCA GGGTCCTGTG ATTCTCGCTA TACTTCCCAA ATCGCCAATG ATTGTAAAAC
TGGTTCTGCT GCCGATGAAG GAAATCTCCT GACATCAACA ACATCATCCG CTGCAGCTGC
AAAAAAGGCA GCGTTAGCTG CAGCCAAAGC TGTCTCGATT TGTGCAACGA GGGAAGGAGT
AGGCAGTGAC TACGGGGGAG GGTCTGCGCA CATCGAGATC CCGGGACTCT ACATGCCGAA
CACAACTTGT TGGACCGACA CAAGACCAAG CCCTGAGCTA CACCCGAAAC TGGCAAGGTT
CGAGCCCTTC GTCAAAGTTA TTTGTCGCAA CGATCAACTC GTCCGACGAA TTGGAATGAT
AGGCAGCGAC GGTAAAACGT ACCGCTACCT TTTGCAATGT GCGTTGCCGT ACTGGACTCG
GACGGACGAA AGAACAGCGC AGACATATTA CGCACTGGAC AAAGTGCTTC GAAAATCAAT
GCCGAGTGCT CGAGCTCATC TCTCGGCTCA GCCGCATCCC GTCGTTCCCG TCGCGCAAAG
GCTGCGACTC GTACACGAGC CGGAATCGAG ATTTTCACTG GACGAAGTGA TGGGAAAGTC
ATTGCGAGAG AAGATCAGCG GAGAGGCTCC GGCATCTTGG CGGTTCAATG AAGCTCTCAA
GGTCGCACTG TCCAACAAGG ATCTAGCTAC TCAAAAGGAC GAAGAAAGAA GTGCTTTCGA
GCGATCAACG CGGCTCGAAG TTTTTGAATC TTTTTCGAAG GAATGCGACG TTGATTCTCA
AATTCTGTCG AATTACATGT CCCGTAAATT ACAAAGTCCT GAGCCATTCT TCCAGTTTCG
ACGCACTTTT TCCAATCAAT GGGCCACCAA CTGTTTATTA CAGTTTGTCT TCCATATATC
GGAAAGGTCC CCTGGTAAGG TTGTTGGTAT TGAAACTGAT GGTCGTGTTC TTTCTCCAGA
CTTTCGGATT CGATACAATA ACCAAGGGGC CTTGGAAGGA CAACCAGTCC CGTATCGAAT
GACTCCAAAT ATTGAGAACC TCATTGGGTT TCCACTGATG GATGGGCGAT TCATTACTTC
TATGGCGATG ATGGCGGGGG CCATCCGTGA GTACAAGGAA GATATGGATC CAATTTTTCG
TCTTTTGATG AGAGATGACC TTGTTGCTTT CTTCACAAAG TCCATGGCCA AGTCGGACAA
TAAGACACAG GAGATGGAAC GGCAACTGGT CGATCGCGTC TCGCAGAATG TGGCTACCGT
GCAGAGTCGC TTTTCGGAAT GCTCGCCGAA GCTCCCCAAA GACAAAAAAG AGGGTCTAGT
CGATCAGCGC GTGCGTGAGC TGTTGGATGC GGCTCGAAGC AAAGACAATA TCTGTATGAT
GAACGGAACA TTTCAAGGGT GGGTGTGAAA GATAAAAAAC ACCCTCAAAA TTGGTTTTGA
TAAGCTTACA TTAGCTATAG AAGTAAATTT GCATTC
 
Protein sequence
MASEPDYSAN TNASSSSSAP AGSMNWEPVQ QRLLDPDLNV AWQAAKELRD NVEVAHSTEY 
PLLLSALSPA FSSILTQRTK PNRDTNSVEH KLRHAILDAV SKFPSNEVLR PHAPHLVAIA
LEVLNRDYEA NALLSLRIIF DLYKTYRSLP QDYVQPFLDF VVSTYRALPR AVQHNFAWEH
LNFKATETSK TATTTIISTP ITSTATPTTT ATTATIAAAE PVEEKDQDGD LTMQDIEVPA
TGPEVKSSTP ASSVPSGSAS VDSLGSAIPG HPNSPEARLP VRSNLSFRVL TECPLIVMLM
LQLYPKFLKT NIAALIVVMM EALAIRSPSL SSITPPEIAS TDSPVKRSYH TRVRELAAAQ
AKTLSFLTFL LRSFQAELKP YEDRLASHVV ALIKSCPRES TSTRKELLVA TRHLLSSDFR
KGFFRHADVL MDERLLLGSH YRSADQASLR PLGYQTVSEL VLNVRSSLTM LQMSKVVSLF
SRVLHDEGST CPMPTQYLAV RTLLNLGDVI YHNTDLNPQL GRDLLVRILN TLTEKLTALN
EYYPEVQRAE LKRGEIVSPT VQTTSCHDSV RDLQSMIRAI IVGNKNIVFF LSNYRNQRDK
EKVRETLVPP PGSNEEVSSA YHKLTHTEVA ILDRYIIASL PALKLLKMTS TGQSRVGGEK
TLADHHRDTL TYFAATFAAL DGYNFRRTIG RRLNLLVDAI VEDPLVMIVP RHLLAVNAGT
SYEFCSMLTC YLVERLDDLA LPHRNNIVFL KPSCGQAENG KDVVLEQLRE ISQNPRDSEK
HQRQRSSTYL QLFERALKSL APYPENESTI RRYLRFVVSE CLRSSLETSE LWPDNYCILL
RYIFRSISAG KFEESYKELL PLIPTVLNGL YRVICTADDT MLRRTAMELC LTIPARLSSL
LPHMNLLVRV IIPALDSNSG DLVNLGLRTL EFWVDNLNPL FLYPEMSKDI PLLSAIMRSL
SRHLRPAPYP YGLLTLRLLG KLGGKNRHFL REPLHLTNTS NFNTEAVEVD CSWIVGDENS
VKPLKTTTTI ALPLDRCIEM LKTIATSQEF VELKICVEDE PTALKQTVIP WGEYEVLWST
HLENVDFEAY SKGVSNETRR SQAHACLSII KTALGVLETS RKQHEQKDAA KARDLLSSFE
SYDENVTPGR LVALALMFAR MIDSTKQESQ NLLIGAVGKL PPADLSDALA DFLSEPILGT
NTIAIEISTY FLKSERSLEN SDVVTFVEHL VRRLCGTCCS ATWSRQRGAQ LILLFLVTEL
GHEWALEHEL TLINAAIISV KSVPRELSTA SIKAVEFFVR ICEGVYGKLN RAQILQQGLM
WDILSDDDNR ILQYTRDSVV TLGDDDAIDQ IVKSDNLSQT TEAPSDSTEI PSTRKKGSEE
KTRLTRPSKE VFRVMIRELI SAQHAVRFVA RFITSLYIVP HWREGNDQRD EAEHPTFIRR
AVLSKSMKLL PLPYQVGAIE GLASVVKLFP GVLPLDDQHF LGFLSELLKM TSVPEGDMQD
PSWADSVVDK NGFAGVSSER LNIGNPTHAS ALFCRRECIL NVDGMTLVVP AELPMGVQLR
VSAIKLFHAV IVSYADAFFN ADKSTPIGKI RSHAVSLLFR SLVSQPDRAA QAAYVALRDA
RIPKTAEVEQ SRGHSCLSKD LIQTCIRPVL LNLRDFKRLS IPLLRGLSRL LSLLSSWFNK
TLGEKLLDHL QKWTDPGNIM SSNIWSEGQE PHVAAAIVDI FALLPHASNF VEPLVKTCMK
LDATLPAFKA RYVESPYRGP LVRYLNKHPG FTVSFFFQRL RTPIYSELFH WLINVDGSTD
LRSYLSNKQC SVMILNVCFE TPLAIMRSER TSPASGSRIS LALHGIGQHS TASQNPGGTS
ARMMSTEALE LQFQGFRIVE SLMANDISYV KDHNDIVRAF RWLWRSKGRA LRLQHEESLS
PRFHDESKML ASFLLSYAKS FPNEDLDILF ELVRVYIQPS GVDLTFISRY LEEMVSNVLT
AEQKKRVLER FFDLVSREKN EEIKVLSIHF LLYPMILATH KEESRDSSLR LVDSTIVERF
TKEVLFSQGA PFACGERLKV ELLRLLDLIV EQTKSAIEPF RKDVVRFCWA LIKNEDASCK
GWAYVLICRL VESFETPKKI VNQIYSALLR SHQQDGKDLI RRAIDLLIPV LPKRLDEDDM
RRAIDQASHL LFEDNSSTPQ LAHICTMIVR SPDVFRPFRI RFIGQMINCL SRLGLPPNSS
AENRPLTIDM VDLLWEWNSN VAESLPLISH EQMDVVGNFL VRLKIVSSEE KPDGRTAKFD
AGLPSFDERL SCLLVKVLKQ KGVDIRKQPF EKVMEKNLGD SGPVLASLDL FVLMLESGLH
EFFTANEVLV EKLVLSAFSH AKTYEPLRKK LLFFTHASCS STKLTSIVIL CVEQIIIEST
DTQKKRSPGK QPEASRQVHG RSSKDKEKDD ATKIALAEFY RFGLELASEL CRQSDSGLRR
LTNVLLNLLA VLTKTHVLEA AAKQRQGGSS GPPTNTPGVL HHTPTKGILE ETCDQFYGDQ
SRILGGGKTK GERENSERDD AVRSLVVTLG IFERSDVAFT FTQSRKNLFQ ILSSILDSSD
NVQLLMVATR VIGKWLLSDD SGGPLTSKER NSFLWKIASF DSNGLSDDLT SQPLADLVAY
FVQRVCFGVE GKLKDGEGLV IGRCMVACLL HAQEDVRIRL ITSYVCGFPS ELSSRGAAAM
NRSVVDVFWR LLNSDIEGLA SRYWTVFYVD CLISCLSTKD YHCLNGLRVL AHGDTTTCQK
LFEWLLPAIW DIIPSDAIRL RISTGMEFLL SRPFHAQFLK AATFTPGAER RCSNAVRSFL
SGVAALAPAP VLEPYLLVSL AENYNCWFES ISLLEKHFSL LGSTPKGLLS LDAMGHCYRR
LGEDALCLTL AKEACTLPET AIAIGLNVYG MVSEAAKQYE GLVDSAGGRE FKHVPTDFEM
DLWEEHWIGL QRELCQLDIV SEFANSAGSL RLRLECAWKV QDWNTVRSLC TSTSLLAAVE
SGDPLVKISE TLLAVADGKL SEVENLHAQT AQLCLHKWQL LPLLSSGSTS HTSLLHFFHR
LVEIRESGQI MVETSTHSSE KTLPDLKNLL NAWRHRLPND FDPIASWDEV FSWRAHMFSA
ITSNFHWSEP NTLATLHDRP WTAIRMAKTA RKHGMRDVSL LTLNKTVDER AMNVSDAFLK
LREQILAYYN PGSDAERHGG LNLINTTNLS FFDQPQKSEI FRLKASFLAS LGFRSKANQA
FCHALQISPT HARAWESWGG LCSMLGAVAE KQVDLSTSKG GHEGSKEVGT DSSKRVAQYL
AQAMGCYLES IQLDTNEWTR IHLAKCIWML TKDGGTPGVL CSTFENRAAR LPPWVWLPWL
PQLFTCLYRP EGRAIRTIFS RVLKSYPQAA YYPLRAFFLE RRDVERTKSS SSTEPGQHNG
SASYSEEMVS QLRRAHTSLW SSLEAILEEL LMKFRPSHEE EFLATIVALL ERAETQTATI
GTKDEETVTA SVWKTLGRIA VKYFRHSDSS SDRKDTRSIK TAEFKSQHRK SFEDDFHVSS
SETVESDSSP PPMELIEILL LIKSWKAKVE SHVMSTPRSI PLITSSPSLA MYCIGDAPDL
WPGSCDSRYT SQIANDCKTG SAADEGNLLT STTSSAAAAK KAALAAAKAV SICATREGVG
SDYGGGSAHI EIPGLYMPNT TCWTDTRPSP ELHPKLARFE PFVKVICRND QLVRRIGMIG
SDGKTYRYLL QCALPYWTRT DERTAQTYYA LDKVLRKSMP SARAHLSAQP HPVVPVAQRL
RLVHEPESRF SLDEVMGKSL REKISGEAPA SWRFNEALKV ALSNKDLATQ KDEERSAFER
STRLEVFESF SKECDVDSQI LSNYMSRKLQ SPEPFFQFRR TFSNQWATNC LLQFVFHISE
RSPGKVVGIE TDGRVLSPDF RIRYNNQGAL EGQPVPYRMT PNIENLIGFP LMDGRFITSM
AMMAGAIREY KEDMDPIFRL LMRDDLVAFF TKSMAKSDNK TQEMERQLVD RVSQNVATVQ
SRFSECSPKL PKDKKEGLVD QRVRELLDAA RSKDNICMMN GTFQGWV
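
Both sequences above are printed in blocks of ten characters, six blocks per line. The following minimal sketch shows one way to turn that display layout back into a contiguous string and compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical and are not part of any database tool; only the first line of the gene sequence is used as input here.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display blocking."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "CGATTGGGTG TAGACCATGG CTTCTGAACC TGATTACTCA GCAAATACCA ACGCGAGCAG"

seq = join_blocks(first_line)
print(len(seq))                       # 60 bases on a full line
print(round(100 * gc_fraction(seq)))  # GC percentage of this line only
```

Passing the whole gene sequence block through `join_blocks` in the same way would give a string whose length can be compared with the listed gene length, and whose GC fraction can be compared with the listed GC content.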