Gene PHATRDRAFT_54960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54960 
Symbol 
ID7195088 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp430804 
End bp441507 
Gene Length10704 bp 
Protein Length269 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183357 
Protein GI219126214 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCCATATCCA CCATCACATC AGTATAGGCG TCCAACGATA CCTTGTTCCA GCCCACACAT 
ACCTCCCAGC GACCATCACA TCAAGCCACT TCGATGGAAG CAGCTTTGAA TCGCTTGGGG
CGGGCAACGG GAGCTTTGGC TGTTGGAACC TTCACCGTTT CGCAATGTTT GTACACGGTT
GACGGCGGTG AACGCGCCGT CATGTTCGAT ACGCTTCGTG GAGGTATTTT ACCGGATGTC
CGTAAAGAAG GAACACACTT CATCGTACCT ATCATCCAGC GTCCGGTAAT TATGGACATT
CGTACCAAGC CACGAGAGGT GCCGTCCGTG ACTGGTACCA AGGATTTACA GATGGTGAAT
ATCAAGCTTC GCGTATTGTG GCGACCCATA GAAGAGGAAC TTCCCACTCT GTACCGTGAA
CTGGGCACAG ATTTCGACGA GCGCGTTCTT CCTTCTATCG GAAACGAAGT CTTAAAATCT
GTGGTTGCGC AGTACAATGC GGAAGAGCTT CTTTCCAAAC GTGCCGAAGT GTCCGAGCGC
ATCAAGAACG AAATGATGAA ACGTGCCAAA CACTTTCATC TTACGCTGGA CGACGTTTCC
ATTACGCATC TGACATTTGG TCGAGAATTT ATGAAGGCTA TCGAAGCCAA GCAGGTGGCC
AGTCAGGAAG CGGAACGCCA GCAGTGGGTT GTGAAGAAAG CCGAACAGGA ACGGCAGGCG
ATGGTGACTC GAGCGGAGGG TGAAGCCGAG TCGGCCCGTA TCATCACAAA GGCGATGGAG
AAGACGGGAA ACGCGATTAT TGAGGTGCGA CGTATTGATG CAGCCAAGGA GATTGCGGGC
AAGCTGGCGA ACTCTCGCAA CATTGTTTAC TTGCCAAACA CTGGCGGTGG TAGCGGCGGA
GGGGGTAGTA GCTTGTTGCT CGGAATCGAT TCTAAATAAT TGGGGGAGGG CGAGAATCGT
CTCCGTTCAA TAAGCGTCTG AATGCGTCTA CTTTATACCA TTTGTACTGC CTTAAAAAGA
ATCTACGTTC GCGAAAGTAC CGTACCTTGG AGTTATAGTA ATTCTCGAAA GACTTCGTGG
TAGTCTTCGG TATCCTATGT AGATAGTAGA TACCAAATTT GGCTGCAGCT GGAACCAATC
ATTTGCAATC ACGTTGCCAA ACATTGGAAG AAAGTCTCGC CTATCCGCGA ACGGCGGACA
TGCTGCCGAG TCCGGAGAAA TTGGTGAAAC GAGGAAAGGC TTTTGTCGAC GACAGACAGA
CGGATTTGCG TTTCTATTCC GAGGACGCTG GTGGTAGACT TAATCTCAGA AATTCTCTGG
CCGTTCGTTG CAAACGCGAA TCCGACCATG GCCGTCGCAG ATCACTAGCA TTGGGGGGTC
CACTGCCCCT TCCAAGGCCA TCAGGGAGCA TATCGAACCT CCCTGATCTT ATACGATGAA
TTATAGGGTC GAAAGACGGT CGCGTACTGC TCCATCGACG GATGCGAACC TTCGATTGCC
GCGTTGTTCG TTGGTTTGGC AAGAGCCTTC TCGTTGCGAC TTTTTCATCA AGGCTGAAGT
TAGTTTTTCC GCCGGTTTTT CGAACCAAGA CTCTGGCTCA CAATAAAAAA AATCCACACT
TTTGACATTG ATATTTACAA TGGCCAGTCC AACCTCTGTT TATCTCTGCT ATGATGCTCG
CATGGCTCTG CACCGTCCTC CTCCCGGCTT TGCCGACGAC GATGACGCCG TAGTTGTTGC
TCACGGCGAC CCCAGTGTCC CGGAAGCAGC AATTCCTCCC TGCTTCGAGC GAGCCTCGCG
AATCGTCAGT CTCCACGACA AAATGTTATC ACTCGAACGC CGTCTACTCG CTCGTAAAGA
GTATTGTAAC AGCGCCTACG ATCGTTTCCA CCGACAGCGA TTCGTTCCCA TCGCTCCCAC
TCCCTGCCAG CGAGGTACCA TCGAACTCGC ACATTCCTCC GATCACTACG ACAACATGTT
GCGTAGTAGA TTTCTTTCCG ACCAAGGCCT GCGCGACATG TGCGTCGATA ATGATCTCTA
CTTTTGCAAG GATACTTTCG ACGCCGCCTC GCTTTCCGTT GGTGGCGTCG TCAACTGCGT
CGACGCCGTT CTCACCAACG AAGAAACAGG CTTTGGCCCC ACCCGCGCCA TTGCTTTGGT
TCGCCCTCCG GGTCACCACG CTACTTGTGA TGCCGCCATG GGCTTTTGCT ACTTCAACAA
CATTGCCGTT GCCGCGAAAC ATGCCATACA CGAGGGCCGG GCCGAACGCG TATTCGTGCT
AGACTGGGAT ATTCACCACG GCAACGGTAT CCAAGATATC ACGTACGACG ATCCGAACAT
TTTTTACATG TCTATTCATC GTGCGGCGTT TGGGACCAGT GCAGCGTCAC AACGAGACTG
GTTTTATCCG GGAACGGGGC ACCCCGAAGA AGTGGGTTCC GGATTTGCAG CCGGAACCAA
CCTAAATATT GCCTGGGAAA CCCGAGGAAT GGGCAACGTG GAGTACGCTG CGGCCTTTCA
AGAAGCCGTC TTACCGGTCG TTTCGGCTTT CCGACCCGAC CTGATTCTGA TTGCTTGTGG
TCTAGACGCG GCAAGGGGCG ATTTGTTGGG TGACTGTGGG CTTTCGCCCG ATATGTTCTA
CATCATGACA AGGAGTGTCC TGGAAGCTGG CGGAATTCAT ACCCCGTTTG TGGTGGCCTT
GGAAGGTGGG TACGATTTGG AAACCATGTC GACGTGTATG GAAGCGACGG CGTTGGCGCT
CTTGGACGAG CCCTTTACCG AACACCCCTA CGCCGCCACG GCCATGCGTT CGGATTCGGC
GCGACACAAG GAACAATTTG ATTTATTGCG CTATTGGGAC CGAGAGGCCT TTGAAGCGGC
CAAAAAGAAA AAGCGCAAAA CGAAACGAGC TTTGGCGTCG ATTCGAAAGT CCCTGCGAGC
ATTGACCAAT ACCGTCAAGG GACGGTGTCT CTTGGAGGGC CGCCGTTGGA TTACGCCGCA
TCCGGTTCGA AATTTGATCA CGCCGACGAG TAGCGTTGCA CCTATTGTGC CGCTCTGTCA
TCGTCCCGTC CAGCCACGGC AAGACTACGC ATTTTCTACT ATCCTGGAAG GAAGCAGTAT
GGATGACGAC GACGATAGCG AACCAGAATT GTGGTCGGTA TACGGATACT CGTCTGCATG
TCAGCGGCCG CTGTCCAACG CAAGCTACCC TTTCAAGAAA AGGAAGCTTA TCGTTTGAGA
TACAATGCAG TACAAAAATC ACTTAAAGAT TACTTTGCTT TGCAATAGTT CGGCTCGGTG
ATGCACTCCT GGCAAAAAAA CGTGGTGGTA TCCATTGCCA CGTTTGCATC CAAAGCCAGA
ATTGAAGATA GCACGCATGA AACCGGTTTC GACAGACCAA AAATTGAAGC TCGTGTCAGC
GGAGATCGCA AAAGTCAAAC GAACGTATCG CATTAACTTT TGACTGTGAC ATGCGAGTTT
CAATTCACGA CATGTAGTTG GAAGCAGACG TTTCCAGAGG CAAATTGGTA TAAAACCTTT
TTCGTGTTTC TTGGAAATGA GTATTGAAAG AAGGAACCCT GCGAGTATTC TGCTTTACAG
TTATTTAGAC CCACGAAATT GCAATCCGAA TGATGTTGAG GTCGCTTCAA AAATGTCTTT
TCTCCAAAGT GAAAACCTAT AAATGACTCT GAAAATTGGG GTCATACTTT AGGTCACATG
AGAGATCAAT CAAAGGACAT AAACACAGCT TTTCAAATAT AAAAAACATG GTTTCGACAA
GGATTCCGTC AGATCATCCC TGTAAAAACC TGTCTGTCGC GGCGAATTAG ACATATTTCA
AAAACATTTA ATAGGCCTAC TAGAGGGCAC TTGAGTAGCT TCTACTAGAG GTATCAATGC
GGTAATAAAT TATTTCGCAT TGCTGTTTGT CGATAATACA TGATCAGAAA CGACATGCTC
CATTTAGATG GTTTAATGAC CAATGACTGC ATACGAAATT GTGCTCATTT AATAATTCTT
GTTTGAATTC GGTCCTACTG TTCGACACGG AATGTCATAC TCTAAAAATC CATCTGGCCT
TGAAAATAAC CACACGAACG CAAATCGTGC GCATCCGTAA TATTACGAGA GACTCCTCGA
AGAACGATTT TTTCCCGTTT CCCTTTGCAG TAGCTCACTC AAAAGTACAA GGACGATAAC
CATGAAGCCA TCAGAGCCTT CCCAAGTCAC ACCTGATCAA CTGACACTCT TTGACTACGT
CGCTAGCGCT TTGAGCGAAG AAACAGATGA TCGTATTCCA GCTTTGGGTA ATTCCACACG
AGGTTCGAGT GGGCTAGACG TTCCACATCG CAGTGGCGGT CGAGCAAATA CAAACGGTCA
AGCTCAGCGG CGTTTCCTAG CCAAAATTAT CGAGGAGGCC TTATCACTTA CCCGAGGGGA
CGAATTTCGT GGTGCATCAG CTCAAAGCGA CTGCCACCGA CGCCAACAGT GAAGAAAATC
AGATCCGGGA CATCATGACA AAGCGTTCAG ATTCACTACA CGGAAGCAAT ATTTGTCTGA
CCCGTCCACT CAGAGCAAGC GTGGAGAGGT TTTCAGCAAA TTCCTATGAT ACAGTAACAA
AGTAGATGCC AAATCAATCA GTCTCTTAGC TTTTTATGTG TATGTCTTTT AAGGAAATAA
TGTCGCTGTT CCAGACAGAA GCGGAAATGG TCGATTCGAG ATAGTCTCGG TCACTCCTAC
CACTAGCCAT TGCGGAAGCC ACAGTACGTA GGCTCTCCTC TCTGGCATAC AGGTCACTGT
TCGCGACAAA GCGTGGCACG ATTAAGCGTA GACGTAAGGA CCTAACCCCA TCAGCTGAAC
ATTTTGCAAT GCTCTCTGAC ACAGTCTTGG ACGGAAGAAG CTAGCTAGTC CAATGATCAA
TATCGCGGTT ATGATTAAAT ACGAGGCGTT CTAGCCTATC AAGCCATACT AGCTAGCATC
GTTTCGTTCG CAAATGATCA TTTCGGCCAT CGTGAGAAAG GCCTTCGTTC GGTATAGATT
CTCCGTGAAT TCCGTATTGG GTGAGAAATT CTGCGAAGTT CTAGGCACGG TGGATATTTT
CAGCTCCAAA CACGGTTTCG TGAAAGGCCT ATCGATCCAC AAGGCCTTCT TCGTTCGATC
AGGGACTACT GCGACACGAA AAAACGTCAA AGAAGGACAA TGTGGAAAAT TATACTTATT
CACGTTGTCT GTAGTGGCGA TGGGCAGACG TGCATCAGCG TCCATTTTCA TGTATATCCG
ACGAATGACA ATTTTAATAG TCAAAGCAAA CGAATAATGC AACCCAGCAG TATTTTTCGT
CAAGGATTTG CTTCAGATCT TCTGACCCCG CGACGATCAC TTCCCCTGAC TTGACATTTA
TCGATGGTCG CATAAACAGC ATTTATTGCC TCTAAAATTC AGGCATGAGC AAGCCCCTGG
GTATTGCACA AACTGGCGTG GATGAGTCTG TCGTACCAGA AGATCCGAGC GAACTTCTGA
GATTCCGGCT CACTGCGTTT CTCAGGGAGC TTTTCCTGAC CGTGAACTGC ACGATGTGCG
TGCCGATGTA TTCGATGCTA ATTTGGATGT CTATTTATTG CACTCTGATT TACTTCGCGC
CCACCATACC GATTCGTATG GTCGTCTTAT CGTACGCCGT CTATTGCGTG TTCGACGAGA
CTCCAAGACT CGGCGTCCGG TGGCTATCCT ACCAACAGAT CGACTGGCTA CGCCGAAATA
TCTTCTTTCA GTTGGTGGCT CGTTACTTTC CCGTGAGTTT GCACAAAACA CAAGACCTGG
ATTCTGAGCA AGGCCCGTAC ATTTTTCTAT ATCATCCACA CGGAATTATT GGCATGGGCG
TGAACACGGC CATGAACATG AATGGATGCC ATTTCGACAA GGTATTTCCT GGCATTAAAC
GGTGGGCCGT AACCTTAAAT GCTTCGTTCT ATGCTCCTAT TTTTCGAGAA TGGATGATAT
GCTTGGGTAT CATTTCGGCC AACAAGAAGA CGCTAAAGCG AAAACTTTCA CAAAAAGAAT
CGATCGTGTT AGTCCCGGGC GGCGCGGCCG AAGCGCTACA CGCACACCGT AGTAATTTCA
AGCTGCAAAT ATTAAGTCGG AAAGGATTCG TTCGGTTGGC TTTAGAGACT AGAGCAAAAC
CGATCCCCTG CTTGGGATTT GGTGAGAACG AGGCGTTTGA TACACTGTAC GTGGCCGACG
AGGAAAAAGG ATCGTGGCTC TGGCAAGCCC AGTTACAATT GGAAAAGATT TTAAGCTTTT
CGACTCCCTT TATTACGTGG CCGGTTCCGA ACCGACATCC AATTCATGTG GTAATAGGTA
AACCTTTAGT TTTTCCCGAA ATGAAGAAAG GTCTCCGATA CGAAGAGTAC GTCGATCAGT
GTCACGACTC GTACTTGGAA GCGCTTCGCG AACTCTACAG CGACAATAAG GCCAAGTATG
GCTATCAAGA CATGCCACTG CAATTTGTCT AACAATACCT GTAAATGCGG TTACTCGTTG
CGTTAAGTTT GACTAAGGAG TAGGCAAAGG GGTTTGGAAC CCGTGCAATC AGTATCTCAA
TGAATATACG ACTCGTCTTT CTCTATTTCT CTCCAAAGAT CGGCTTTTCG TTGTCCCAAT
TTTTGGGATC GTGAAACCAG CTCATTGTCA TATTGGTGTA TAAAAGTGGT CGCTCCACGT
CGACACTTCT ATTCGCGAGC CCCATGTGCA GGATTCGGCA GTCAAATAGT AAGCAATCCC
CCAAATCCAT TGCCGGTCGA ACCAAGTGTT TCCAGAGCTC CGCATGATTA ATATGTTGGT
CGTTGCAGTA TTTCGCCGTG AAGTCGAGAC GGTGCGATCC GTGTATCAAC GCAGTTTGCC
CTACCTTGTC GTGGACGGCC GATCCCGGTG TGAACACATT AATGTAGTGA GCCGGCAAGC
ACGATTGGTG TTCGAACAAA TGAGGTGTGT CGGAGTGCAA GGCTTGCTCG GCGGCCCCGG
GTAAAGTGAC AATACCCCCG ACAATGCCGA CACGCAAATC TTGGTAGGAA GCGTCCGGAC
CCCGGCCTTG GAAGTTGTAG CGACCAAAAT TGCCGGATGC CAACGATTCG TCTTTTGGAT
TCATGACGCG CCGGACGATT TCTAGAATAT TGGCATTGCC CCGCAAGAAG TGCTGTGGGT
TGGACGAATC AAGTTCCGCG GTAGCGGTTA TGGTCCAAGG CTGATTACCT TTTGTACCGC
GCTTGCGTCG CAAGCTTGGT CCACCACGTA TATCCATGCG TAAGTCTTCC CGCATGGAAA
GTTCTCGATA CGAAGTGGGA TCATTTTTCG ATTGGTGTGG ATGATAGATA TCGACTTGGT
CTCGTTCCAG TAAAATGTTG GCCGCCTCGT GCAAGTCTCC CAGAACTGCA CACCCCCATT
CTCGACACGT TGCACGGTCC AGTAGTTGCG GTAGTAGACA ATAACCGTAC TTTTGTACCG
CAGCGACTGC CTTTTGCACG GTATGAGACG TTAAGGTTTG GGCTGCGCGT TCGGCAGTCG
ACAAGACCAC GGTAGCGTCG GTGACGCGGC GCAGTTCCTG GTTTAGAGAG TGATTGGACG
GTAACCAGGG ACAATTGGAG GTCTGGATGG GGCACGGCCA TGTGTGTATT GCCGCATACG
CCACGCCACT GGCTTTCGAC ACGAACACGT GGATTTGGTC GTTGCGATTC CATTGCAGCG
GGTCGGTGGC ATCGATTCGC TCGCGATCGG CAACGAGACA ACCACCCGGA ACCAATTGTT
GTTGAGCGTC TAGTAGAGAA TTGTTGTCGG CGTTTGTGCT ACTGTTCGCG TTCCCCAACA
GTACAATAGC ATCGAAGTGT TGCGGACCGT CCGCCGTTGT CGCGCCCCCC AACCGGAAGC
CGTGTCGATG CAAAGCAGCA CTCCATACGT TCGCGCGACT CGAAAGCTCG GCTTCGTCTC
CATTCACCGA GTTTCGTACG TCGACCGATC GATCGGATAA GGCTAATTGG GGATTGTCTT
TGAGAAAGAG TTGCGTCAAG TAGACGGCTA CCGCTTCGGC GTCCGTGCGG TCGACGGTAT
TATTGTTGTT GTTGTCGTTG TCGTCATCTG CTTCACTGTC ACTGCCAAAG GCATCCCAGG
GGTCTGGATC GTTCCCATTT GCCATGGAAG GTCACACTAG CGAGTCGTCC TCTCTCGTTG
GTGCGGGTTC GACGCGAGGA AGGGAACGAA CGAACTGTGC ACCGAGCAGG AATTTGGACG
GAATGTTGCA ACTGTGAGGG CACGACTTTG GACCGTGTGT CACTGTCGTG TCCCACTAGC
TCGTTTTATA GTAGGTGATT GTGCTTTTCT GTGTCTTGTT CTTATGCGGT ATACTGGACC
GTCTGTTTGG CGATTGGCAA ATGGTGATTG TGCTTCGTCT GATCCCGACG TCAACGGAAC
CGCGTACTCT ATGAACCATC GAAGGTAACC CTACCGTACC CTACCGTGCC GTCCCGTACC
CGTGTTCCAC ACCTGCCCTC GCATCCTGTT CTGTGTGTTT GTGTGTTCAG GATCCAGGGA
CACACGTCCC TCCACGATCC GCTTTCCACC CGTATGTAAA TGTGTATGTA CCGTATTCGA
AATCCTTCCC CCCATTCCTT TTCTCGCTCG TACCCTCAGC TGTGCCCCCC ACCATGACGA
CCGCGTCGCG ACCGCGGGAT CGTCGTTGGT CGGTTGCGAC CGAGCGGGAG GCGACGCGTC
GGCGGGTCTC GGCGATGCGG GCCGGGGAAA CCGCGTCGCA GCGTGCCGTG CGGCTCGCCC
AGGCGCGGAT CCGGGCGGCG CAACGACGTG CCCGTGAAAC GCCAACGGAG CGGCGGGATC
GGTTGGAGCG GGGGCGTCTC CGGGCGGCAC AGCGGCGGGC GAACGAGTCC GAGAAGGAAC
GCCGTGCGCG GTTGGAAGAA GGACGACTCC GGGCTGCGGA ACGGCGAGCG CGGGAAGAGC
CGCAGACATC CGGATCCGCG ACTACGGCAC GTTCCCAGAA TCCGGCAGTC ACATCGAGGA
CTGGGGCCGG CGGCAACAAA CGTGCTCGTC CGGCTCCGGC GAAGAAACCT AGAAGTCGTA
CCGTTCCCAA GGAGGGCGGA GGGCGTCCCC GGAAGAAGAA AAAGCCAGCG TCGGCAACCA
ACAGTAAACC TACCAAGCAA CCCAGACGCA ACAAGGTGGT GCCAACGAAA GAATCGCGTC
AGTTGGAGGC TCGCATGGAG AAAGCGCGTG CCCGGGCTGC CCAAATTCGT GCCAACGAAT
CGGAATACGA ACGACAGCTT CGACTGGAAA AGTCTCGCGC GAGAACGGCT CGTATCCGGG
CCAACGAAAC GGAAGAGCAA CGTCTCGCAC GGCTCGAACG GTCTCGCATT CGAGCCGCGC
AAATACGTGC CTCGGAGACT AAAGAACAGA GAGAGCTTCG TCTGGAGCAA AACAGGATTC
GAACGGCTAG ATTGCGCGCC AAGAAGGCGC AAGAACAACA GCAACAGCAA ATTTTGTCCG
ACGCTGAGCA TAGTGGTGCA TCTCGCGATG AACAAAGCGT CGGGTCCGAC GATGCGCAAA
GCGGGCTCTC GGACGACGCA CAAAGCATCG TCTCCGAGGA TGTACACAGC GGAGTGTCCG
ACGATTCACG CGGTGCAACC TCAGACGACT AGTGAGATAG GTGACGAGAA ATGCAAACCC
ACAATAGACT TGGTATTGAT AGTAACAGCG GCCCCCGTGG GGAAAAGCTA CCGTAACTCC
GAAAGTTGGA AAGGAATCTT CAAATTAGCG ACACAATTGT CGGCGGTGGC GAAAAATGCT
AACTGTAAAT GGACGAACCC AACGGACGAA GCATGAGTAC GCCAATTCGC ATCCTACGGT
AAAGCTCCGA AAAGCCTGCT TTTCGATTCT GGAAATGACG ACCAAGTTGA TCATTTACCG
AGCAAACGCC GCCGCAGAAG TAATGTGGCA ACGACAAGCG TACCCATCCG AAAAAGTCTA
ACCTAACGCA AATTTTTATT GTATAGTATT GTAAATTCGG CAAAAAGCTC ATAGAGAAGC
GCTAGCAGGT TCGTTTCGAC TCCTCATCCA AATCGTGCGG CACTTTCGCA TCCTTCGTGT
CGCCTACCTT ACCAGAAATG TCATCTTGTA CACCTTCTTC TTCGTCATCC GCATCTTCCG
TTCCCGGAAC AATCGTCCAG CATGGCCCAC ATTTGAGGCC CACTTTATGA TCATCAATGT
TTACGGTGAT TAAGGCAGCA ATAGCACTGC AGGCGGTCCC CGCAAAGAAG CAAGCCGCCA
AGGCTGCCAC CGCATCATTG CCAGAGTAAT CGTTAACTTT TTCACCCACG TACGTATACA
GCAAACCGCT ACCGAGTGTA CCGAAGAGTC TTCCCATTGC GTTATTCATG TAATAAAGGC
CGACCGAGAC GGCAACCTTT TCCTTGGCGG CGTAGTTAAC CACCAAGAAT GAATGAATGG
ACGAGTTTAT CGCAAACACT ACGGCGAACG CAATGATCAC CGTTACGAGA AAGGCCGTCA
TACCCGAAAC GTCGTAAGAA TCAAACATGT CGGACCCTTG CAACACAATG GCTGCGACCA
GCGTTGGTAG ACAATTAAGG CTTCCCCACA AAACTTCGGT CAGCTTGTTG GGCGGTGTTT
GTCGCAGCGG TCCCGTTACA AGCT
 
Protein sequence
MEAALNRLGR ATGALAVGTF TVSQCLYTVD GGERAVMFDT LRGGILPDVR KEGTHFIVPI 
IQRPVIMDIR TKPREVPSVT GTKDLQMVNI KLRVLWRPIE EELPTLYREL GTDFDERVLP
SIGNEVLKSV VAQYNAEELL SKRAEVSERI KNEMMKRAKH FHLTLDDVSI THLTFGREFM
KAIEAKQVAS QEAERQQWVV KKAEQERQAM VTRAEGEAES ARIITKAMEK TGNAIIEVRR
IDAAKEIAGK LANSRNIVYL PNTGGGNEY