Gene PHATRDRAFT_36043 details


Gene Information

Locus tag: PHATRDRAFT_36043
Symbol:
ID: 7201366
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 412633
End bp: 424440
Gene Length: 11808 bp
Protein Length: 3911 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002180633
Protein GI: 219119760
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACCGA CTTTGGCTAC GGCGACGGGC ATTACGCCGC AGGTGTTGGA CGATTTGCGA 
CTGCAGAAGT TGATCGAGAA TATGCATGCA ATTGATTCAG GACAGATAGC CAACAGTCTG
AATTCATTAA GCGATTTGCT TCGCCCCTAT CAGACCTGGG TTTTTGAAGA ACAGGTATTT
ACGAAAAGAA AACGGGGGTT CGAAGACTCA CCGAGGTGCA TCTCTCACAA ATGGCTTTCC
CCGCATAGGT TGACTTGCAC GACTGGATTT ATCCGCTTAA CGTGATCGAC GCCGCCCTGT
GTCATTACAT GAAAACATAT CCGTCGATGC TACTTGTCAA TCCTGTGGAG CGCAGGCGCG
GGCAACCGAA AGCGCTTTCA TCTTCGGCTA GATCTCTCAC CGATGTAGAC TCTGTACCGG
CCTCTTGTTA CCCCCATCTG TACATCCTTC TCCGTTTCCT TGAAGGTTTG TTGCGGAACT
CCACCAGCAA ATCAATTTTT AATTCGGCCG AGGAACTCGT TGATTTGCTG GCTGCGGCGG
ACGATGAAAC CTCCGATTTG GCCTTGACGG CGCTGCTGGC ACTGTCGCTG CCACCCGCCT
TACATAAGCA GCAAGCGCCC GAAGTACAGC AGCACACGAC CACCTTGCAC AACGCAAAGA
TGCCTTCGCA CTCGCGGTTG ACCGCCATAG CCAGAGGCTG GGGAACTCGT GCTATGGGAC
TCGGCTTGTG GAGCGTTATA AAAGCCGATG ACTCAGTCCA TGGTCAGGGT TCTTTACCGG
GAGAAGCAGG CCGACTAAGT GTTTGCTACT ACAGCAGAGA GTCTCAAAAC GAAGTGAATG
AGGAGTCGCA AATGGTTCGT ATATATTTAA ACGAATCGGA TATTATGGTA GGAGGAAGCG
CTGAAATAAG CATGGATAGT ACCAGCGATG CCTTGACTGC GGAAAGTGAT CAAGTTTCCA
AGAAACGACG GATCGGAAAG ATTCCGCGCA AAGAAACCAA ATCTACAGCA GAACTCTACT
ACCTCGTTCT GGAAAGGGCG GGGGGAATGG ACAAGATTCC ATCTGATCGT CTTTTTCCTT
TGTTAGCCGA CATTCGTCTG GCTCGCTCCT TTCATTCCCA GGCCGCTCGA ATCGGCGCCG
TTGAGCGGCG ACTCAGAGCT CTCACGACCA TTTTACACGC GCATCCTTCA CAAGAAGTCA
TGAATGGGTA TTTTCAAGCT CAACCGGAAC TCTGTGTGGA GATTGTTGAT CTACTTCGAC
CGACTGTTAG TTCCGCCAAC ATCGCCGCTG CGTCGGGAAG GGCGACAGAG ACCAACTCTC
CGTTGCAAAG AGATGCTATC ACTTGTCTGT CATCATGTCA GCCTGTACCA TATACTATAC
GAATGCTAGC GGTGGAATCT TTGGCGGCTT TGGTGGCTCG TAGGGACGGA TCGACTGGGG
CGCTTTCCGG AAGTGCTCGA CTTTCGACCG TGCTGGCAGA ACTTGGCATT GGGAAAGGCT
ATTACTTGGG GTTGTTGCCG ACACTTCTTC GATACTCGCT TGCTTCGCTA GACTCTCCAG
GGCATGAATC TATTCTTCCA GAGATTGAAC CAGAAGACTT GGACGATGCT ACGATGCTTG
ATGTAGGTAT GGCTTTTGTG GAAGCGACGA TGGAACCCCT GCCACCAAGA ATTGTGCAAG
TGGAGCGAAC GTTGGAGTTT GTCGACAATC TTTTGACATT GACGTCAACA ATCGTTTCGA
CACCGACCGG TACTGCTGCA CTCACTGACT GCGGCCTAAT CCCGGCTCTC CTGGCGACCG
TGGCTCATGA CTTGGATCGC ACTATACAAC GGCTGCTGCC AGACTCTTCG AATTGCTCCA
CTCAAGAGAT TCTCCGCGTT TACTCGTTGC TTCGATTTGT GCTTGCACAA GCTGTGCAGA
TCTTAGAGGG AGCGATTGTT ACGCACGGAA ATGCGTTGAC AGCTTTCCAC GATCTGCGTG
GTGTCGAAGT TTTGTCAGGT CGGTTGTCGG GAGAAATCGT TACGACCGGG CCGGACATGC
TCTCGAGAGG CCAGCAAAAA AGCGAAAGGG AGGAATTGGA GCATACCCAG ACGTTTGTAT
CCGAGTTTGA TACCAAGATT CGGTCTTCTC AGCGAGGACT GCTTTTTAGT ATCATCACTT
ACTTGACCGT TGTGTTTCAT CAAGAGTCAA CAGCATCGAC TTCTGCCACA AATCCATCTG
GTGGTGCACA AATTCGGCAG GAAGGACTCA CTCGGGCACT GATCCATGTC ATGGAGCATG
TTAGCTCGTA TGGCGGACAC GTGGCATCTC TAACTGCTAC CCTGCTTTCG GACGTCATGA
ATAGTGACCC TCACGTAGTC AGCTACATCC ATGAATCCGG TATTGCGAAA TCGTTCCTCA
GTATGGTAGT CGGGCGAAGG GTGAAAGCGG GAGATAGTAC GGATACTTAC GAGCCAGTTT
TACCTCCTGT CCCTGAATTG GTTATGGCTG TTCCCAATGT CTTGTCAGCG TTGTCGCTGA
CAGAAGACGG GGCCAAAGTC ATTTACGAAA TAAATCCGTT TCCGTCTCTT ATGCGCATAT
TTTACCATCC AAATTATGCG ATGCCGAAAA GCCGATGCCT GTTAAATGAG ATCACTGGAA
TAATCGGAAC TGGACTGGAT GAGCTTATGC GACACGTTGA ACGGTTGAAG CCACTTGTTA
TGGAAGCGGT TGCGAACGCA ATGAAAGACG TTGTTATGTA CGCCGAGGAT CTGAAGAAAC
GAGAGAACCA GTTTTTTTTC TTGTCGACAT CGCCGCTTCC AGACAGGGGA TTAGAAGATG
AACGTTCATG CCTTATGCAG TACGTGCTCA ATTTTGGACA GCTCCTCGAA CAGCTTCTTC
ACAACGATGG GCACTGTGAA CCATTCGTCA ACTCGGGAGG TCTAGTCGCA TTGATGCAAT
TATTTCCTGC TTCTTTGCCT GGAGGATTTC AGTTCCTCAC TCATGCTAGT GGCCTAAGCT
CTCCATCAGT TAGCACCTTA CACCATTCTA CAATCGAAGA CTCTTTGTCT GTTGCGTTCA
AGTGCATTGA ATTTCGATAC GACTCTTTGC AACTTTTTCG TGCTTTAATG GAGGCAATAA
ACGATGCACT CGACAGTGCC GAGCAAAATG AGCGCAACTT ACTATCCCAA GAGGATACTG
TGTTTTGTCT TGATTGTTTT CCGCAAGTTC CCGTATACCA ATTCGATGAC TCTCCTGTCT
CGATCCGTCA AGCTCTTCTC GTTTCAAATT ATTTGATCGA CGTTACTAAT ATCCAATGGC
TCACTAGCCT TCTTGCCACT GCTCTCAAAT CGGCATGCCA CAAAAGCCAG GAATCTGGTA
CAGGCTGGGG CAAACGAGAA AAGGAATGGA AAACAGAAAT TTCTTCCGAG AACTTTACGA
AAATCTTCGG TCGGTTGTCA GATATGCACC AGTCCTGTTT CTATGACGTC TGTCGTGTTA
GGACAGTACC TGGATTCGAA GAAAGGGAAA GGATGCGACA AATCACAAAA TCGCCTCGAT
TACGCTTCCG ACTGCGAATT GTTTGTCCTG AAGGTGCCGT TGTTCGTGAT GGTATCGAGA
TCGATTCCTG TGCTACAGTT GGAAATATGG AAATGGGAGA AGTAGTGGAA TCGTTCGATC
GATGTGTCAA CTCGTCGGGC ATACTTCGTT ATAGAACGAC TAGAGGGTGG GTTTCAGAGA
TGACTCGAGG TCATGGTCGT GAGCCTATTG CTGAGATTGT TGGCATGTGG CTAAGCACAA
ATGAAAACAC CATGGACAAT GAAGCGGGAC CGACAAAGCG ATTGGAAGCA GCCGTTCCCG
ATGTCAGGAC CGTCGCGACA GGCATACTCG CGAGGCTTCA AATCAGCTTT TCGGATCTGT
ACTTAGCTGT TTCGAGGGTG ATCGTCCAGA GTATTCGAAC ACTAACGCCG GGAGCGATTT
CATTTGACAG CGGTACGCAA GGATCTCATG TTTCGACGAC TACGAAGCTA TTGTCGTTAC
ATCTGAAAAG GTCACTTAAT ATTACTCCCG TTCGTGAAGC GGTCCATCAA TTCAGAGTCG
CTGGTGATGG CTACAGGCCT GACCCCTCGC ATGCAGCCGC AGCAATGTAT CTTGGATGTA
TGCTCTCTCA AGCTCAAAGT TGCCTGTTTG ACGAAAAGAG AGAGCGAAGA GCACTTAATT
TTTCTCTTCT CGTCAATCTG CTCCATTCAG ATTCTATAAT GCATGCCACA TCACGCCGGA
TGGCATCTGC GGTTGGATCG GCAGATATGA TGGAAATCGA AAGTTCTTTT GGATTATTGG
AGGCTGCTTC TTTTGTCCTC CGCTTTAGTT TGACGGACTT GGATTCTCGC TCGTTGGGAG
AATCGCTCTG TAATAGAATG AACGAGAGCA GCGCCCCACT TTGCGCTCCG CCACAGAAAG
TATCGCGATG GGTAGCGGCC AGTTTTCCGC CGGCGATCTC GTTGCTTCGG AGGCTCCTCG
TCACTCCGGT ATCGACCTCT CCAGTCGTGT CGGTGCTGAG TCGAATTAAA TTGAAAGACA
TCTCGACTCT TGTTGGATTA GATGATAGTC ACATTGCCTT TTCAAAAAGT GGAAAGTTGG
ACAACTTCTT TCAACCCGAA GATATTTGGC GAGTCTTGCA AACAACAGTC TCCTACGTCG
TCCGTGAGCT CTGGTCTGAT GCGCTGTTTC CCAAGACTCC ACCATATTTA GTGCATCCCG
TAACCAATTT GATCGGAGAA ATACTGGTCG CACTGGAAGA TATTTCGAAG CTTTCGAAGA
ACGTTTCCAG TCATAGTAGT TCAGGAGATT CGCGAAGTGG AGGAACTCGC CTCCGAGACT
GGTTGAGAGA AAGGGCACGC CCGTCTGTAG ATGGGCAAAA CCTTGATGAA GACAGTTTTG
AAGCTAGCGA TGAGGCCATA TTGAGATTGA CAGAAATGGG TTTCACAGGT GAGCACGCGG
TGGATGCACT GGAATCCGTG AGAAGCAATC GTGTGGAGGT CGCAATGGAA TATGCGTTGT
CGCATCCACC TCCGAGTCCA TCAACTGTTG AAAGGCGGCG AGTCGGACGT GAGGAGCGCG
AGAGGCAAAG AGCTGCACTC CAGCAAGCGA CGACTTCCCA GCTTGAAGTT GACCAAAATG
GCTCCATGAA TGCTGAGAAT AGATCTACTA TACCGTCTCA AAACAGTTGC TACCCTGAGG
ATAGCTTGGA AATGGAACAG AATTCTTCTG AAGACGTTGG TACCGCATTT GTCAGCCATG
AATTGAAGAC TTGGAAGAAG TCGATTTTAA ATGTGTGTTG CGATATTCTT ACCAAAATGC
CCATAAGTAA GATACCGAAG CATGGCGAAG GTGATGGCGA TCTTGAAGCT CAGACTGTTG
TGGTAGCATC CTTTCTCCTT GATGCAGCAA ATCGGTTTCC TGAGGACCGT GCTTGTATCG
TATCGCGGAT TCTTGATGAA ATTGTTTGCC GCATTACATG CATCGAAATC GATCTGATCA
AAAAGTACAG TATTGACGAT ACCAACGAAT CCAGTATCGC TGTCCTTTGC CATGCAACAG
TTCTTTTTGC TCGAGCTTTA CCTAGAGCCA GGGTATTGAT CCTCGAAAAG AGTTTAATTG
GGCCTTTGCT CTCTTGCACA CTTGGAGTTG CAGAAGCTAA TGCGGCCAAC AGTAGGAGTG
ATGCTACTTG GCCGATGTGG CTGGCTCCTG CTGTCTTACT TCTTGACATC ATGGCACAGC
CTATTGTTGC TTTTGCTGAC CTTAACTCTT CCGAGCCAAA CGAGCTAGAT CATGGAAATG
AACTCCAACG TGTCCGTCAT GAACACCGGT CTCAGTCCGA GTTCATGTCC ATCGTAGCTA
GAAGAATTTT TTCAGATACT CCGTTTCAAA GCCAATCAAC CGACCCCAAA ACGAGTTCAA
CGGATAAAAG GGGGCAAGAT ATTCCCTCCT TGGTCCCTGC ATATCTTCCT TTGCTACCGA
GCGAGGCCAT AGATGATTGC CTCAATTTGT GCAACTTACT GATTGGATCA CGACGGTGTA
CCGCTTTGGA CCGTGGCGCG CCCCCTGGTG TCACACATGC CTCTCTTTTG TTACTTCTTC
GCATGGTTCG ATCTCCCAAA AATTCTTCAA CGTGTTTGCG TCTGGGCTTT GCGGAATCAG
TTCTCAACCT TCCAAAAGAA TGCAGATTTA CTGGAAGCGC AGGTCTAGTT ACAATTTTGT
TGCGTCGGTT GCTTGAGGAT GAGTTTACAC TACAGGCATC GATGGAAACT GAGATACGGA
GCACAGTGAC TAAATTGCAT TCCATGCATC AAGGCGGTTC TCGGGAGAGT CAAAGAGTCT
CAGCACCACT CTCTTCCTTT ATCGACTCGA TCACACCGCT TCTCTGCCGC GAGCCTATTT
CATTTTTGCG AGCAATGGCA ATTTCTGTGA AGGTCGAAAA CAAATCTTCC GAGGCAGGTG
TTGTTGTGCT TCTTTCTGCT AAAGAGAAAG ATATGTACAA GGCATTATTA AATGACATCG
TGAGGCCTTT GAGTATTGAA ACCGAAGCCG GGGCAACCAG ATTGAGTCAT CCGAGCACTC
GCCCAAGAAA GTCCTCAGGC GGAATCTCGA AGGAAAAGAC TACACACAAA CTAACAAAAC
GAGGAAACGT GTCGAAACAA TCGAGAAGAG GAAAGCATGA GCCTCTTAAA CCGACAATCT
CGTCCGCCAG CTCATCACCA GCAAATCGTA TTACTTTCCT GCTGATCACG GAGGTGATGG
TGTTGACGGA AGGTGGGGCT TTATCTCATG ATCCAATTCG GCAGACTCGG TTTCTCTGGG
CCGGCGACTT GCTTGAAATT TTAGCGGACT TGATTCTGGC AGTCCCCGCA TGTGCTTCTT
CAGTACACAA TTATCGACCG CACAAGGCGA AACACAGAGT TTATAGGAAC GGACTACCGG
GGTCCTTCGC CCATGCCTTA TCGGGTTTGC CTTCACCGGC GAGAACTTTC GTGAGTTTTC
TGCTCCACAA ACTTCTTCCT CAAGACCGAT GGTCAATCAA GAATGATCCA ACCATCTGGG
AGCGTCGACA AGACGGTGAC TATTTTGAAA TGGAATCCAT TGTTGACAAA AAACAAAAGG
CCTATCAGGT AGCGAAGTTA TCGCAGACGT CTGCAAGAGT CCTTGTTGCT CTTGTTGCGC
GCCCTGGAGA AGGAAGGAAA CGCGTTGTTT CTGAATTAGT GTTTGCCTTA AGTGGGGGTC
GGCTCGGGCA TGGAGCCGAG GGTGTGGAAG ATGAACTGAA ATCTGAATGC CCCCAAAAGG
CTCCTATTCC AAAAACTGAG CTCCACGCCC TTACCGTATG GGGCGAACTC TGCCAAGGAC
TAGCTGCGCC CAGAAGTAAT GGAAAAATGG TTGACACTAT AAATTCACTG AACATTGACA
ATGTGAAGCT AATGCTGGAA CATGGTGTAG CACACGGGTT GATGTATTCA ATGCACCGTG
TAAATCTTTC AAGTCCCATG GCTTCGGATG TATGCGCCAG TCTCTTGCTT CCCTTGGAGC
TTTTGACAAG ACCAGGAGTT ACAGATGCTT TGAAGGCAGC GAAAGATAGT GAGGAGCGAC
CCGATGGTGG AAAAGGAGAC GGAGCCGGCA GTTCGTCTTT TGAACGGCAA GGAGACCTTA
TGTCATCAGA TGGGGAAAAT AATGGCGCCT CATTTCCGGT CATCGAGGCT GTTCAAGGAA
ATGCTCGTGA TGACGGAACC GAAGAAGACC CGGATGACAT GGAAATTGAG GAAGGTGATG
TGACGGAGAA TGATGGGTCC AACGGTAGTG AAGAATCCGA TTCCATTTCT TCCGACAGCG
ACGAAAGCGA GGATGATTCA GGTCACGACG ATGAGTTTTC AACAGATATG GACGACGACG
AAGAAGTGAG CAGCGACGAA GTGGAGGAGG AAGGTGAGTG GAACGTCGAC TTCGGAGAGC
CCTATGAAAA CAGTCAACGT GGGGATCAAA ACGATTTTGA TGAGGTCGAA GATGACGGGA
CCGAGCGTGG AGATCCTGGC GGAGACGAAG GGTGGACTCG AGTAGAGTCA TCCGGCTTCG
GAGGCATGGT CCTCGGTGGA CTTGGTCGTC GACTCAACGG AGGACCACAT GCAGTTGATC
CAAACGGACG CGCGAGGGGG TTTATTGATG CTGCAGAAAC AATGATTGGG TCGTTGCTAA
GAACTGGTGA AATAAGTTCG GAAGCGATTG CAGAAATCGA AGGCACTCTC GGTATCAATA
TTACAACAGG TTCCCGACGT GGTCGGACGG CAGCCATGAC TGTTAACGGA ACAGGAATCG
ATGATTCTTT TGCAAACAGA ATATGGGGAA GTGGGAATGT CGGTATTGAA GGCAACTCCA
GGCCTGAGGT ACTCGGGACG CTACCTCACG TGCATCAGCG AAGTCAACCT GATGTTGGAT
ACTCCGCTTT TGGAACTACG CTAAGAATGA CTGAAGGAAG CTCGATGGAG TTTGTTTTCG
GTGGTCCTTC AGTCACGTCA GGAAACCGGA GCTATGATAC TGTCTCTCCT CTGCGCGCAA
ATACAGATGA TCGTTTTCCA GCCATCTCAC AGCTCGATCT GCAACTATTT CCGGGCGGTC
CTTCATCTGC TGCAAGTGCT AGGACGCAGC AAGCTTTACA TCCGCTTCTC TGTGGCATTG
ATCTTCCTCC AATGAATTCC CTTGTGTCAG ATTTGCTTCC CCACGGAATC AGAGCCACAA
GAAGAGGTGA GATGACAACC CGAAGGCCTG GTAATTGGAG CAACTCGTCT TTTGCTCAAG
GAAACTACCT CATGTCCACG TCAAACGGAA ACATTGTCAG ATCGAGTCCC CCCCTGCCCG
GGAACCACAC AGGCCTTTCA CAGACTTCTG GCAATAGTAG CGGCCCCGTC GGTTGGACTG
ACGATGGCTT GCCTTTTGAT GTAACAGTGG AGCAATTCAG CTCAGCATTT CAAGAAGCTA
TTTTATTTTC TTCGACTGCT TCTACCACTT CTCCTGGAGA AGAAGGCAAT CAAGAAGCCT
CTCGGTTGCC CCAACCGCTT ACATCCGAAA TTAGCACTTT GTACCCACAT GTAGAAGGCA
ATCCAAGTTT TTCTCATGTC GAAACAAATG TGGATCGGTC CATCACTGGG ACTCCCGCAG
TAAGCGAAGC TCATATCCAC GAGGAAATTC GGACGAATAG TCCAATTACC AGTATTGACG
GAGACAGGGT TGCATCATCT TTGGCCCGTG GTCTCCGTCT CTCCTCAGGA AGTGGGGGTA
CTGCTACTGC GGAGGGTTCA GTGACTGGCG AAGATCCGCC TTCTGATTCT GAGGGAGACG
CAGGGGAGAA TACGGACGAG AATCTTACTG TTCCAATGAA TCAGAATGGA GCGAGCGCTA
CTGAAAGGAT TTCAAACGAT AACGTACCTC AAGCTCTAGA CCAAATGATG TTGGATGAGC
GGTCAGTAGT TTTGCGTCCT CAAGCATCCA ACCATGATGC ATCGCAAGGT GTAACGCCAA
ATACAAACGA ACTCGGACTT TCGTGCCCAC CGGATGTTGA CATGGAAGTG TTCAATAGCC
TACCCGTGGA TATGCAGCAA GATTGCATAG ATCACTACAA CGGCGCACAG GAGCTGGTCG
CTCAACTAGA CGGTTCCACA TTGGACCCAG AAGTTCTTGC TGCCCTTCCA GAGGACATGA
GAAGGGAAGT TATTGAGCAG GAACGCCTTG CTAGGGGGCA ACAAGATCCG CCTGCCGATC
CTGCCAGGGC TGAGGATATG GATACGGCAA GTTTCCTTGC TTCTGTAGGG CCCGATCTCC
GAGAAGAAAT TTTAGTGTCG GCAGACAACG ACTTTCTTGC TAGCCTCCCG CCAAGTATCG
CGGCGGAGGC CCAAATTCTG AGGGAAAGGG CATCCGCTCA ACATAGAAGA CTGTACGAAA
ATCCGATTGC AGACGTTCCG AACCAGGGCC TTCATCAAGA GGTAGGAGCT GTGCATCACC
GATCTTCCGC AGCGAATACG TCGTCAGTCA GACGCCACCG TGTCGGAAGA TTACGAGTTG
AAGTCGATCG CCAAGCCCCT TTGTATCTCC CTGAGCATCT ACCGTATCCC GTGTCTGCCG
CAGATCTCAC GCTATTTTTC AAGCTGATGT ATTTGCTGTC TCCTATTCGA CCCCAGCGCC
TTCTACAAAG GCTCTTTCAG AATCTTTCGA CAAACCATGA CCTCCGGGTT GTGCTGTCCT
CAATCCTCGT TAATCTCCTA CACGAAGATA GCGTTGGGGC ACAGGCAGTA GCAGAGTCTT
ATATGAAAGA ATACTCATCC GAGGACTGTT GGCGAAAATC GATGGATGTA TTGTTCGATC
AAATAGAGAA CTTTCCTCCG ACTGTTCTTC TTGGAGCCGC CGCTCAAGAT TCTGCAGTAG
ACGCGTTCTC TATGAGCGTC TCTGCTTCGC TTCTGCGACG GAGACAAGGG CTCGGGACCG
CCGCAGCTGT TGCTGCCAAC CTACCACGTG GCTCAGCAGG TTCCCACTCA GCCTCACGCT
TGCCCCCCGT AGTGGCTACA CGACTAATTG AAACCGCTCT TCAATTGTGC AAAAATTCGC
CCCGCTTCAG TTTACATTGC ATAACGGAAC CAATTGCTGA TGTGTCCTTA GGAGGCAGGA
CCACCTGCTT TGAGAAGTTC CTAGATCTTC TGGACAAACC GATGTTCGCA AAGAGCTCGA
CAAACCTAGA CCAGCTTCTA TCACTTCTTG AAGCTGCTAT TTCACCTCTT TCGCATCTGT
CCCGTACTCC CGACGAAGAG ATTGAGATCC CACAGTCAGA CGTTGAAGCA GCGGCATTGA
GTGGAAAAAT TTGGGTCCAA GCACCTGGCA TTGAGATCTC ACCAGAACGG CTGAAGCGTT
TGTGCTCTAT TCTTCGTATG GAGACATTTC GAGACAACGC TTTTTCCAAA GTCAACACAA
TTGTTCGGCG GCTATGCCGA GTTGAATCTA ATAGAGGATT AGTGTTGGCT GAGCTTGCCG
GGGTTGCTCA TTCACTCGCA AGCGATTCAA TTCATGATCT TAGAGCGTTG AGGATCCGTA
TAGATGAAGC TGTGTCTGCC CATGCAAAAC AACAATTCAT CGTCCGCGAA AACCAAACCG
GAAAGGAAAG CCACCGAACC TTGGGAGCAG TCTCGAGTTC GGTGACTTTG TCAACGAGCA
CAAGCGAGCT GAAATTGTTA CGTGTTCTAC AGACGCTACA AGCGTTGTGC GCTGACATTC
CCGACGGATC TTCCAGCAAA AAGGACTCCA GCGTTGTCGT CACTGACGAA TTAGTCCACA
TACTCCGCCA GATAGAATTT GATGATCTCT GGAACGAGCT GAACTTGTGT TTAAAGGTGG
TTCAAGTTTT GGAAGGGGTG AAGGATGCCA ATGGGAAGGA GAACGAAACA GGATTGGAAA
ATACCGATCG CAACGATGAC GAAGATGGTA GTATTGATGA AAATGCAGCG CAGACGAAGA
AGCTTCGGAA TAGCGCTGCT GGTCTCTTGA CACGATTTCT ACCGTCAATT GAAGCATTCT
TTGTAGCAAA TGCAAGTGCA ACTCGAGTTG TGGGCGAAGA TTTTGCAGAT GGAGCTGCGA
TTGCCCTAGA GAATCTTGTT GGTGGCAAGA GGCTGCTTGA TTTTGTTTCA GGCAATAGAG
TTTTGCTGAA TGCTTTAATT CGCAATAACT CTGGGCTGCT TGACAAAGGG CTCCGAGCGC
TCGTTCAAAC ACCAAGATGT AGAGCCTTTC TGGATTTCGA TGTGAAAC
 
Protein sequence
MAPTLATATG ITPQVLDDLR LQKLIENMHA IDSGQIANSL NSLSDLLRPY QTWVFEEQVD 
LHDWIYPLNV IDAALCHYMK TYPSMLLVNP VERRRGQPKA LSSSARSLTD VDSVPASCYP
HLYILLRFLE GLLRNSTSKS IFNSAEELVD LLAAADDETS DLALTALLAL SLPPALHKQQ
APEVQQHTTT LHNAKMPSHS RLTAIARGWG TRAMGLGLWS VIKADDSVHG QGSLPGEAGR
LSVCYYSRES QNEVNEESQM VRIYLNESDI MVGGSAEISM DSTSDALTAE SDQVSKKRRI
GKIPRKETKS TAELYYLVLE RAGGMDKIPS DRLFPLLADI RLARSFHSQA ARIGAVERRL
RALTTILHAH PSQEVMNGYF QAQPELCVEI VDLLRPTVSS ANIAAASGRA TETNSPLQRD
AITCLSSCQP VPYTIRMLAV ESLAALVARR DGSTGALSGS ARLSTVLAEL GIGKGYYLGL
LPTLLRYSLA SLDSPGHESI LPEIEPEDLD DATMLDVGMA FVEATMEPLP PRIVQVERTL
EFVDNLLTLT STIVSTPTGT AALTDCGLIP ALLATVAHDL DRTIQRLLPD SSNCSTQEIL
RVYSLLRFVL AQAVQILEGA IVTHGNALTA FHDLRGVEVL SGRLSGEIVT TGPDMLSRGQ
QKSEREELEH TQTFVSEFDT KIRSSQRGLL FSIITYLTVV FHQESTASTS ATNPSGGAQI
RQEGLTRALI HVMEHVSSYG GHVASLTATL LSDVMNSDPH VVSYIHESGI AKSFLSMVVG
RRVKAGDSTD TYEPVLPPVP ELVMAVPNVL SALSLTEDGA KVIYEINPFP SLMRIFYHPN
YAMPKSRCLL NEITGIIGTG LDELMRHVER LKPLVMEAVA NAMKDVVMYA EDLKKRENQF
FFLSTSPLPD RGLEDERSCL MQYVLNFGQL LEQLLHNDGH CEPFVNSGGL VALMQLFPAS
LPGGFQFLTH ASGLSSPSVS TLHHSTIEDS LSVAFKCIEF RYDSLQLFRA LMEAINDALD
SAEQNERNLL SQEDTVFCLD CFPQVPVYQF DDSPVSIRQA LLVSNYLIDV TNIQWLTSLL
ATALKSACHK SQESGTGWGK REKEWKTEIS SENFTKIFGR LSDMHQSCFY DVCRVRTVPG
FEERERMRQI TKSPRLRFRL RIVCPEGAVV RDGIEIDSCA TVGNMEMGEV VESFDRCVNS
SGILRYRTTR GWVSEMTRGH GREPIAEIVG MWLSTNENTM DNEAGPTKRL EAAVPDVRTV
ATGILARLQI SFSDLYLAVS RVIVQSIRTL TPGAISFDSG TQGSHVSTTT KLLSLHLKRS
LNITPVREAV HQFRVAGDGY RPDPSHAAAA MYLGCMLSQA QSCLFDEKRE RRALNFSLLV
NLLHSDSIMH ATSRRMASAV GSADMMEIES SFGLLEAASF VLRFSLTDLD SRSLGESLCN
RMNESSAPLC APPQKVSRWV AASFPPAISL LRRLLVTPVS TSPVVSVLSR IKLKDISTLV
GLDDSHIAFS KSGKLDNFFQ PEDIWRVLQT TVSYVVRELW SDALFPKTPP YLVHPVTNLI
GEILVALEDI SKLSKNVSSH SSSGDSRSGG TRLRDWLRER ARPSVDGQNL DEDSFEASDE
AILRLTEMGF TGEHAVDALE SVRSNRVEVA MEYALSHPPP SPSTVERRRV GREERERQRA
ALQQATTSQL EVDQNGSMNA ENRSTIPSQN SCYPEDSLEM EQNSSEDVGT AFVSHELKTW
KKSILNVCCD ILTKMPISKI PKHGEGDGDL EAQTVVVASF LLDAANRFPE DRACIVSRIL
DEIVCRITCI EIDLIKKYSI DDTNESSIAV LCHATVLFAR ALPRARVLIL EKSLIGPLLS
CTLGVAEANA ANSRSDATWP MWLAPAVLLL DIMAQPIVAF ADLNSSEPNE LDHGNELQRV
RHEHRSQSEF MSIVARRIFS DTPFQSQSTD PKTSSTDKRG QDIPSLVPAY LPLLPSEAID
DCLNLCNLLI GSRRCTALDR GAPPGVTHAS LLLLLRMVRS PKNSSTCLRL GFAESVLNLP
KECRFTGSAG LVTILLRRLL EDEFTLQASM ETEIRSTVTK LHSMHQGGSR ESQRVSAPLS
SFIDSITPLL CREPISFLRA MAISVKVENK SSEAGVVVLL SAKEKDMYKA LLNDIVRPLS
IETEAGATRL SHPSTRPRKS SGGISKEKTT HKLTKRGNVS KQSRRGKHEP LKPTISSASS
SPANRITFLL ITEVMVLTEG GALSHDPIRQ TRFLWAGDLL EILADLILAV PACASSVHNY
RPHKAKHRVY RNGLPGSFAH ALSGLPSPAR TFVSFLLHKL LPQDRWSIKN DPTIWERRQD
GDYFEMESIV DKKQKAYQVA KLSQTSARVL VALVARPGEG RKRVVSELVF ALSGGRLGHG
AEGVEDELKS ECPQKAPIPK TELHALTVWG ELCQGLAAPR SNGKMVDTIN SLNIDNVKLM
LEHGVAHGLM YSMHRVNLSS PMASDVCASL LLPLELLTRP GVTDALKAAK DSEERPDGGK
GDGAGSSSFE RQGDLMSSDG ENNGASFPVI EAVQGNARDD GTEEDPDDME IEEGDVTEND
GSNGSEESDS ISSDSDESED DSGHDDEFST DMDDDEEVSS DEVEEEGEWN VDFGEPYENS
QRGDQNDFDE VEDDGTERGD PGGDEGWTRV ESSGFGGMVL GGLGRRLNGG PHAVDPNGRA
RGFIDAAETM IGSLLRTGEI SSEAIAEIEG TLGINITTGS RRGRTAAMTV NGTGIDDSFA
NRIWGSGNVG IEGNSRPEVL GTLPHVHQRS QPDVGYSAFG TTLRMTEGSS MEFVFGGPSV
TSGNRSYDTV SPLRANTDDR FPAISQLDLQ LFPGGPSSAA SARTQQALHP LLCGIDLPPM
NSLVSDLLPH GIRATRRGEM TTRRPGNWSN SSFAQGNYLM STSNGNIVRS SPPLPGNHTG
LSQTSGNSSG PVGWTDDGLP FDVTVEQFSS AFQEAILFSS TASTTSPGEE GNQEASRLPQ
PLTSEISTLY PHVEGNPSFS HVETNVDRSI TGTPAVSEAH IHEEIRTNSP ITSIDGDRVA
SSLARGLRLS SGSGGTATAE GSVTGEDPPS DSEGDAGENT DENLTVPMNQ NGASATERIS
NDNVPQALDQ MMLDERSVVL RPQASNHDAS QGVTPNTNEL GLSCPPDVDM EVFNSLPVDM
QQDCIDHYNG AQELVAQLDG STLDPEVLAA LPEDMRREVI EQERLARGQQ DPPADPARAE
DMDTASFLAS VGPDLREEIL VSADNDFLAS LPPSIAAEAQ ILRERASAQH RRLYENPIAD
VPNQGLHQEV GAVHHRSSAA NTSSVRRHRV GRLRVEVDRQ APLYLPEHLP YPVSAADLTL
FFKLMYLLSP IRPQRLLQRL FQNLSTNHDL RVVLSSILVN LLHEDSVGAQ AVAESYMKEY
SSEDCWRKSM DVLFDQIENF PPTVLLGAAA QDSAVDAFSM SVSASLLRRR QGLGTAAAVA
ANLPRGSAGS HSASRLPPVV ATRLIETALQ LCKNSPRFSL HCITEPIADV SLGGRTTCFE
KFLDLLDKPM FAKSSTNLDQ LLSLLEAAIS PLSHLSRTPD EEIEIPQSDV EAAALSGKIW
VQAPGIEISP ERLKRLCSIL RMETFRDNAF SKVNTIVRRL CRVESNRGLV LAELAGVAHS
LASDSIHDLR ALRIRIDEAV SAHAKQQFIV RENQTGKESH RTLGAVSSSV TLSTSTSELK
LLRVLQTLQA LCADIPDGSS SKKDSSVVVT DELVHILRQI EFDDLWNELN LCLKVVQVLE
GVKDANGKEN ETGLENTDRN DDEDGSIDEN AAQTKKLRNS AAGLLTRFLP SIEAFFVANA
SATRVVGEDF ADGAAIALEN LVGGKRLLDF VSGNRVLLNA LIRNNSGLLD KGLRALVQTP
RCRAFLDFDV K