Gene PHATRDRAFT_21660 details

Gene Information

Locus tag: PHATRDRAFT_21660
Symbol:
ID: 7202587
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 715648
End bp: 726235
Gene Length: 10588 bp
Protein Length: 2400 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181617
Protein GI: 219122574
COG category:
COG ID:
TIGRFAM ID:
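The gene length listed above agrees with the start and end coordinates if they are read as 1-based and inclusive. The record does not state this convention, so it is an assumption here. A minimal Python sketch of the check follows; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are assumed 1-based and inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
print(gene_length(715648, 726235))  # 10588
```

A 2400 aa protein needs 2400 x 3 + 3 = 7203 coding bases including the stop codon, which is fewer than the 10588 bp span and is in line with the gene being flagged as spliced.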


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAATC AGCGGGCAGC TGCAGAGTTG GCATCTCTGC TTTGGACGCT TGCGCACGAA 
ATGAGTGTGG AAGATTTTGG TGCCGTTGAA AGCGAAGTTT TTACGCTCGT GTTCGCTCTC
GTGCACGCTC CGGACAAAGA AAGTCGTATG GCCGGTTTGG CCGCCTTGGA TGGATTGTTG
GTGGCACCGA GTGCGGACGA AGAGAAAAAG GCCATCAAGT TCGCTAACGC CTTGAGTACA
GGGCTGCGAG CAGCGAATGG CGACTATGAA TTTTTTTCCG CCGTGTCGAA AGCGCTGGGT
CATATGGCGA TGCGCATTTC GAACGTGGAT TTTGTCGAGG CCGAAGTTAC CCGAGCGCTG
GAATGGTTGC GCACCGGACG TTCAGATAGA AGGTGGGTAA TCAAATTGTA TCAATGTTTT
ACTGCTCTGT CTATGGTCTC ACTCGCAGCC TGCTTCGTCT ACGAAGACTC GCGGCCTCTT
TGTCACTCAA AGAATTTGCA ATCCACGCTC CTACCACCTT TCATTCTAAG ACCAGTCAAT
CGACGCTAGG ACAGGGCGGT TCCAACGAGT TTTTGGATAC CATTTTTCAG TCAATACGTG
ATCCACAACC CATCGTGCGA GTCTGTGCTG CTGACGCTCT TTCACAATGT TTGCGCATTT
TGGTAGATCG TAGACATCTT TCTCTAACTG GACTATTGTG TCAGATTCAC TTCTCTACAA
TGGAAGGTCT TCAAGAAGCG ACCAAGAAGC AATCTTGGCA CGCGGCTTCG GAATCTGAAG
CAGCCAAGCA CGGGTCGCTC TTAGTAGTCG GAACAATGCT GGCGTACACT CGCGAATTCC
TACTGCCTCG TTTCGAAGAA ATATGTCGAG CAGTCTTAGC TTGCTCCAAA AACCCAAAAG
TTTTGATCCG TTTGGAGGTC GTTCGGTTGA TTCCTAAGTT AGCTGCTTGC TGCTCTAGTG
TGTTTGGCCG CCGGTATTTG GAGCAAAGCT TAGTCTTTTT GATTGACAAC GTATCTAGCC
CAGCATCTCT CCGCGACAAC GTCGATATAC GTCCTTCCGT CTATGACGCT ATTGGCGATT
TGATTATGGC AATGAGCGAT GAGAATACTG GAAGGGTCAT TGGAGGAAGT AAATTCCCGA
CCGCCAGATT CGTAAACGAT CCAGAAAATC CGGATATCTG CTTTGTTGAG CTAAGTAAGA
GAGGGTTCAT ATTCGAGAGG CTGGAGGAAA TCTTCGCCGT TGTTCGGAAA GGGCTGCACG
CTCCAACATC TCGAAGCTCT GTCGGTCACA CCTTATGTCC GGCACTGCAT TGCGCCTCAA
GCCTTGTCGA GGCGCTGAAA GACTTGGCTC TGCCTTATTT GGATGGCGTT ATCGATGACA
TGTTCCAATC GGGTTTGAGT ATTGATTTGA TTCAGTGTTT GCACTCTATC GCTCAGAGCA
TTCCGATGTA TAAAGATGAA ATTGAAGATC GCATGTTACA AGAAGTATCT CTGTCTCTGG
CTGGGAATAG AGTAATTTAC AATCCCATTA CGGCTTATAG AGCAGCTGCA CTTCGAGCGC
GAGCTCTTCG AAATGTTCGG AAGCGTTCCA GTGATCATAC TCTTGGATCG AGCTATCCTG
CGGGTGCAGC CTCAACTACT GAGTCTTCGA ATGTGCATAT TAATATGAAC AATGATGCAA
AAACGACCAA ATTGTTGGTC CTGAGTCTTC AGACTTTCGC ATCATTTTCA AACTCGACGG
CCCAAGTGAC TTTGTCTGGG AAGATTATTC CGCTTATGCC TTTCGTTCAA GATGTTACAG
CACGATATCT ATTGCACCCA TCGAACGAGG TACGCCGGGC GGCAGCGCTG GCATGCTGCG
TCCTGCTGAT TCCCCATGGT TCTATCTTTG CCAGTGCTGG GAGCTGCAGT GGTTTAATTA
TAGAGGATGT TCTGGAAGCT CTTCTGAGAG CTGCCGTTTC GGATAGCTCT GCTGTTGTTC
GCCTCTGCGT TGTTCGTGCG TTGGATACGC GATACGACCC TTTTCTGTGT CAGACACACC
ATCTGCAAGA CCTGTTCCTT GTGCTACAAG ATGAAACATT GGCAACAAGA GTTGCTGGTC
TTCAATTACT TGGTCGTCTT GCAAGTCTGA ATCCTGCTCC AATTCTTCCG GTTCTCCGGA
GGTTTTTAAA CGATCTTGTC GTTGAATTGC AGTGTGGAGT CGATACGGGA AGAGGCCGAG
AAGAAGCGAC ACGCTTGCTC GTCGTTTTTC TACGCGTGAA GCCTTTGCAA CGGTTGATTC
ATCCAGTACT TGCTACACTC GTTGGGGCAC TTCCTCTGAC AGGTGCTGCA CCACCGCGAT
TAGCTTCGGC GTCTTTGGAG GCACTCGGTG AGCTTGCACA AGCTACAGGT ACAGCCTTGC
AGCCGTGGGT CAATGATATT ATTCCTCATG TTCTGAATAC AATGAAGGAC CAAAGCAGCG
CCAGTAAGCA GAGGACTAGT CTCAGAACTC TTGGTCAGAT TGCTGGCTCA ACTGGGTACG
TTGTCCGTCT CTATCTTGAC TATCCGAATC TTTTAAGTCA AGCGACAGAT ATCTTGCCGG
CCACAAAGCG AGCTCCATGG ACTCTTCGTC GGGAAGTTAT CCGTACATTG GGGATTATTG
GTGCTTTAGA TCCTGACCGG TATTATTCTG TAGCTTCTAA GGCCCGAAAA GGAGGCGTTG
TCGGTGGTGC ATATTTCGAA GAGTTGGATA CGCGCCATCG ACCAAAGGAG GGATTTGATT
CGGATTCTGG CCGGAACACT TCGAACATAG AAACTACAAG GATTGCCGGT ATTTCTGGTA
CTTCGGACGA CAGTCGGCTA CTCCATGGGC TGCCAAAAGC AGCCTTAGAC TCTGATGAAG
AAGACCAGCC CGCATACTTG TCCATGTACG AGCAATACGC TATGGTTGCA CAACCTGTAT
CCAATCTTTC CCCTGCGAAG CGTATGACAC CATCTGAAGA AGATTTCTAT CCGACAGTCT
CTATACAGGC ACTGATGCGA ATTTTTCGCG ACTCGACGCT TACTGTGCAT CACGGTATGG
TTATTCAGGC AATTATGTTT ATCTTCAAGT CCCTGGGCGT GCGGTGTGTT CCATTCCTGG
GAAAAGTTCT GCCCCACATG ATATTGACAA TACGACACTG CCCCTCGAAT CTGAAAGAGT
CACTTTTCAT TCAACTCTCC AACCTAACCT TGGTTGTAAA GGCTCATCTG CGAATATTTG
TTGACGACAT TTTCGATATT GTAGAACAGT TTTGGGACTC AAGACACCTC TCTATTATTT
TGAAGTTGCT CTCAAATATA GCAATTGGAG TACCGGATGC CTTTAGGCAA TTCGTCCCCC
GCTTCATTCG GCGTCTCCTT ACATCACTTG ATGAGCTACA AGTTGCTGAC TGGTCCACAG
CTAAACAAAG CTTGCTTCCC CAAAATGGTA GGGCCGAGTC GGAAAAGCTC AGTCATATTT
TGAAGAGCAT TTCAAAGCTC AACAGCATGT TAAGGGAATA CTTGCACATT CTGATTCCCG
CTCTCTTGAA GCTTGCTGAC TCACTAGCTT CGTTATCCTT CAACGGTGCG ACGACCACTA
CGATATCTAT TTTGGATGGC TTCTCCGTAC TGAATTGTAG GACACTGTCA GCGCTGATAG
AGAGCCAGGC GCCGGCACCA AATCCAGTCG CACTGGCTCT GTTCACAGGT CTAAGCTGCA
CTCCACCGAT AAACTCCGAA AATGGACTTC CGTCTCGCGT TGTCCAGCCA TTGGTTAGAA
TTTTTCGAGA AACGCCACCG CGAAGTCTTG CTGTAGGCCT ATCGATGGTG GAAACACTCT
GTACTTGTGC AAAAGAAATT GGGGCTTCAA AGTGGCTGCA AATATACGAT TCCGTTGTTA
AGGCTGCCAT CACTTCATGG CACAACACAT TTTCATTGTC CCCCGGAGTC GATGTATCTA
TATCGACTGC TCGCCCTGAT GAGCGCCTTG CCGCTATCTT GGAGCTGTAT TATGAGACTG
TCAATGAGCT GACAGCTCCA CCACTGCTGA TCGATACCTC AACATCTAGT CATGCAATTT
CCAGTCGTAG ATCACACTCA TTGCTGGGAA TTGAAGGAAG GGCAAGCACT AACGATATGG
CTGTGATCGT TGAAGGTTTC GACAGCTCTA TTGAAGTCGA TGAACATCCC GTTGCTCCTG
CTCTTCGCGC ATCTTTCTCG GTCTCAAATA GACAGAAAGT TAATCAAGGG AATCTGCAAC
GAGCCTGGGA TGTATCGCAA CGATCATCTC GCGAGGACTG GGACGAGTGG ATGAGGAGAT
TTGCAATCCA GCTACTACGT GAAGCACCAT CGCCGGCCCT ACGAGCAAGC GCGAATCTTG
CTCATGCATA TCAACCTTTG GCGAGAGAGC TTTTCAGTGC TGCATTTGCT TGCTGCTGGA
AAGAATTGAG TCATCCGTAC CGGACAGACC TACTTAGTGC GCTGGAGACC GCCTTCGTTG
CTGATATTTC CCCGGAAATT CTTTTGGCGC TACTAAACCT GGCTGAGTTT ATGGAACATG
ACCCAAGTGG AGGTCTCCCT ATTGACATTT CAATCTTGGC AGATTTAGCA TTGAAGTGTC
GGGCGTACGC CAAAGCGCTG CACTACAAAG AACGGGAGTA CAGGAACGGA GGATCAGGTT
CCTGTGTGGA AGCGCTAATC AGTATCAATC GGAAGTTGGA TCTTCAAGGT ATGTGTATCT
TTGCTGTGCT GGCAAGGGAG TGGACCCGAA CTTATACATC TTTTATGTAC CGCAGAGGGT
GCGCTAGGGA TTTTGAAGGC TTCCGCGATT GACGATGAGG ATGCATCCAA GCAATCGGTT
GATGAAGCTT TCTCTGCGAG AGTTTCTAGG CATCAATGTC ATGATATGCT ATACAGTGTT
ATCTGCAGTA CAGAAGAACC CGCTCTAGCC AATAGGAGTC GCTTGAACAT ATCAGAAAAG
GAGGGTTGGT GGCTGGCCAA GCTTGGCAAC TGGACAGAAG CGCTGGAGGT ATATCGCGAG
AAGCTGAAGA GCGACCCTCA CGATTTTGAA GCTATTGTAG GATGCATGAG GTGCCTTGAT
GCGAGCGGCG AGTGGCGTAA AGTGCTTGAT CTTGCTGAGC AAAATTGGAC AGCTTTGAGC
CAGCATCGAT GTATCGGTGA CAGCAACTTT CACGGAGACC ATTCGGATCG CAACGAAAGG
CAGTCCGAAT GTGCGCCCAA GCTGCCTGGA GACTAGGTCA ATGGGATGAT CTCGAAAAGT
ACTCATCTCA GCTGACTTGT GGTCAGGGAA ACATGCATGT AGCGTCGTCG CACCTTGTCT
CGGGACTTCG TGACATCTCA ATCAAGAGAG TCGGATTTGA TGGCGCGTTT TACAGTGCGG
TTTTGCATGT TCATCGCCAA GACTGGTCTC ACGCCGCTGA TGCTATTGAT GCTGCTAGGA
AAGCAATGGA CAGTAGATTC ACCGCGTTGC TGGCTGAATC ATATAGTCGG GCATACCCTA
GCATGGTGAC GGCCCAGATG CTATCGGAGA TGGAAGAAAT TATTGAGTAC ATGAAAACTG
AAGAGAGGTC AAGAATCGAA ATTGACCATC ACCCGGCAAA TCGGCAGAGC ATAGAAAGGG
CTCGGGAGAG GCTAATATCT GTTTGGAAGG ATCGACTAGC AGGGTGCCGT ATGGATTCTG
AAGCCCACGC GTCGATTCTT GCGGTCAGAT CACTGGTGAT CGGACCGGAA GATGACGTAG
ATGCCGTGCT AACGCTGAGT AAATTATCGA GGCAGGCCGA ACGCCACAAG TTTGCAGAAC
GCGTTTTACT CGACCCTTTG CATTCGTTGA ATGCACATCT CGATGGTCCG ACATTCGGAA
TTGGTTTATC AGACACATTG GGAATAAGAG TTGATTTTTC CCAGTACAAT GATGCTGCAT
CTCAAACTCT CATCGATCGT GTCGTTTCAG GAAATCTTTC CAATATCCTC CCTACGTATG
GTCTAGCGCA CGAGCAGTGG AGCAGGAGCC TTGTCGATGA AGCAGGGGGA ATCGACCGGT
ATGACATCCG CTTTTTAATT GCCAGAAATT GAATCATATG AAAATTCCTC ACTATCACTT
CTCGTCTTAT TTACATTAGG CTCAAGATTC AGCATACTTT TTATTTTGCT TATGTTAAGC
ATTTGTGGTA TACTGGTGAA AAGCATGAAG CAACCAGGCG TTTAGAGCAT CTGTGTGATG
TTGTAGATAT GGTCTCACAT TGCGAACGGA TTAATGAAAC GTCGCTAAGA GTTGCATGTT
GGCTGGAATA CGGGGAGTGG AAGCTGTCAA CAACAACTTC GCTTGGGTCA TCCATGAGTC
CGCAATTTCA ACTAGATGTG TTGACGTCTT TGAAGCGGGC AACCCAGCCG GACGACTGCG
GGTATAAAGC ATGGCACGGG TGGTCGCTTT TGAACTTTCG CATTGCCTTG CAACTCAACG
ACCGGCATCA TTTATCGTCG CAAGCGGATG CTCAACGCCC AGGCGCAAGC TTCGATAAGA
GTATTCGGAA TCATGTTGTT GCTGCAGTCC GAGGGTTTGT GAACGCAATC AACTTAGGCA
CCATTAAACA GAGCGCCTCG GTGCAACAAG ATTTATTGAA CTTGTTGACT TGCCTCTTCA
AGTTTGGCAG TTTGCAGGAC GTCGCGGTTG TCTTGAATGA ATGCGTGAGC TCCGTCGCCA
TCGAAGCTTG GCTGGGCGTT CTTCCTCAGC TATTGGCTCG AATACATATA AAGGACCCCG
CCATAAGATC CGTCCTGCAC CCCCTGCTTA CACGTCTCGG TGAGAAGCAC CCTCAAGCAT
TGATGTACCA GCTTTCCGTT CTTCTCAAAA GCCCTGTCGT TGAGCGGAGA ACCGCTGCAG
AGAGCTTGAT GAACTCACTG AAGTCACATT CAAGTGATCT AGTGGAAGAG TCTCTGATGG
TTTCATCAGA GCTAATCAGG GTGGCGATCC TTTGGTCTGA AACGTGGCAC GCTGGACTAG
AGAATGCCTC AGCGTTTTTC TACGTTGAAA ATAACATAGC TGCGATGCTG GACCAGCTGC
ACTCTCTACA TGGTGAATTT GAGAAGGAGC CAGAAACGAG TATGGAGAGA GACTTTGCGG
AAGCCCATGG AGGAAACATT CGGCAAGCAT ACGACTGTAT TAAGAAATAC ATACAGCTGA
GCTCCAACGA TGACGAGAAT TTATCGCCAG AGCAGAAAGA CTCTCGCCGT GAAGAAGCCG
AAACTTTTTT ACACAAAGCT TGGGATTCAT ACTACGTTGT CTTTCGACCG ATCAACCAGG
ATCTGAAATC GATGTCGCTT CTTAGGCTAC CAGAGTGTTC TCCGGCGCTC AGTCGAGCCC
GAAATTTGGA ACTTGGAGTA CCAGGATCGT ATCGCGTCGA TGGTTCCTAC GTAAGGATTC
AGAAGTTCGT TCAGCGCGTT AGCATTATCA ATAGCAAGCA GAGGCCCCGT AAAGTGACGT
TGCGAGGGAG CGACGGGAAA CACTACGTAT TCCTCCTCAA AGGTCACGAG GATCTCCGTC
AAGATGAGCG TGTCATGCAG CTGTTTGGTC TCGTGAATGC ACTGTTGGTT CGGGACCCGC
AGACGAAGAA TCAAGATTTG ATGATCAAGC GATACACCAT TTCGCCTTTG TCACACAATT
GCGGCCTCGT CGGCTGGGTG CCTCACTGCG ACACTATGCA TGCGTTGATT CGAGACTATC
GCGAAGCCAA AAAAGTTCCT ATGAACATCG AGAATCGAGA AATGATGAAA ACCGCACCGG
ATTACGACCT TTTGACGGGA ATGCAAAAGG TGGAAGTTTT TACTGATGCG CTACAAAAAA
CACCTGGTAA AGGGGACGAT CTGGCTGAAA TTTTTTGGCT AAAGAGCACA AACAGTGAGG
AATGGCTCGA GCGGCGCACC AAGTATACGC GGAGTTTGGC AGTCATGTCG ATGGTGGGCT
ACATTCTCGG TCTCGGAGAT CGCCACCCTT CCAATCTCAT GATTGATAAG CTTTCCGGTC
GCGTCTTGCA CATTGATTTC GGGGACTGTT TCGAGATTGC CATGGTGCGC GACAAGTACC
CTGAGCGAGT TCCATTCCGC CTCACGCGTA TGCTAGTCAA AGCTATGGAG GTCTCGGGAA
TCGAAGGGAC CTACCGCAGT ACGTGCGAAA GGACAATGAA CCTTCTTCGC TCTAGTCGTG
ATACGTTAGT CGCTATGCTT GAAGCGTTTG TGCACGACCC TCTGATTAGT TGGCGGTTGG
TGAACTTGTC CAAAAGTGCT GAGAATTTAT CGAACGAGAT GGAAAGGGAA GGTCGGAAGG
AGTCGTCGGC ACAACCACCC GTTCAAAGCG AATCCGGAGC CTCGGCGGGG AGGGGTAGCG
CGGGATTGTT GGTCACGGGC CGCCAGCAGT TTCTCCACGA ATCCATTGCC GAAGGTGGTG
ACGAAGAGGG GGATGACGAC GAAAACAATC ACAGTCGTGG TAAAGTGGAA CAAAACCACC
ACAATTTGCT GCACCGATTG CCATCCGTAC CTAGATCCGG CTCGGCCCAG CCCACGTCCC
GGGATAGATT GGCGCGGAGT GCACAGTCGA TGCAAAGACA CGCCGAAATA CAGACAATGG
CGGCCAACAT CTCGACAAAT TCGCGGATTG CCAGTATTAC AGGGGGTGCG TCGGCACGAG
TGGCGACGGA AACTTCCATA GCACGATCCC GCATGGAGCG ATCGCTTATG GGAGTGATGG
GCGGGGAGAA CGGGGTCGTG CATGAGGAAG CGTTGAACGT CAAGGCTCTG CAAGTTATTC
GACGAGTTGA AGACAAACTG TCCGGTACGG ACTTTCCGGA TTGCGAAGGC GAGCCGCTGG
ATGTGTCAGA CCAAGTCCAG CGGCTGATTG TACAAGCGAC GAGTTCCGAA AATTTATGCC
AGCTCTTTAT TGGATGGTGT GCGTTTTGGT AAGATATCTC TATTAGAGCT ACACGTTTAC
GTAGACCAAG TATGTTGCCA AGTGGAGATC AACTCTGGCA GTGCCCGAAT AAAGGCGACA
GCGGAGGGCT TTTAGTTGTG CTTTCTTCGA CCCTACATGC CACCAAAAAA ATCACCGTTC
ATGGCTTTCA TACGCGCCAT GTGGTCTTCC ATGGAGGCTT TTTTCTCAAT CGCCGGTTTT
GCCACGGGCT TGGGGGGCGA TGGCGGGCTG GATACGGCGG TGGGAGGAGC GAATACCGGC
ACGACCGGTT GGGGTTCGAG CTGTTGTTGC CGCATCATGG ACTTGGCGAG TGCGGCATCC
AGCGCGACTT GTTCGGCTTC CTGCGTAGCC GACGCAGCCG CAATCTGCGT TGCTTCGACA
TCTTCAAAAT CGTCGGCGTC GTGTGTTACC GATGCGACGG CCAACTTATT GATCCTAAGT
TTACTGGTCA CGTTGCTGGA CTTCTTTTTG AGTAGATTTT TGGTCGCCGT GCTGGTGGTG
ACACCGTCGC CGGCTGGTTT GCGCAAAACG AGTTTGACGC CGCCGCTACC GGGTGGGGTG
ACGAGTTTTT TGGCACCGGG GAGTTGCGAG GCCAAGACGG CATTGGACTT GACCGGTCCA
GCAGCGGCGG CGCGGGCCGC TTGTAGCTTT TGTCGTGCCA GCGCGTCTCC TTGTTGTTGA
TCGGCGAGTT CGAGATTCGC CAACAGATTA TTCGTGTGGT CATTGTTGTT GCCTTGCTGG
GGGGTCCCTT CCCCACGTTG GGCGGCTTGG GCATCGACTA GTTTGGCGAG TTCGGCGCGG
TAGGACTGTG CGGCTTTGGA CTTGTACTTC TTCTCGCTCT TGCCGTGAAA GTCGGTGAAG
CCGTGTTTAC GGAGATAGGC CTGTGCGGTG CCGTTACCGC CAACCCGCAT GGCGTCCACT
TGTCGTTGAG TCCATTCATC CAAGTCCACC GAACGGACAA AAGTGGTGTG GACTCCCATC
CCGCGGTGTG TGGCGGAACA ATCCAGACAC AGAAAGACAC CGTAGGTTAC CGAAGCCCAC
GTGGGACGAG ATGCGGGACA ATCGAAACAA ACCTGATTAG CTTGAATAGC CTAACGTCGT
AAGTTCATTC ACCAAAACCC ACCACCACCA CCACAGAAGA GAGAGCGATA CACACGTATA
TACGGTGAGA GAGAAAACGT AACGTATGGC ATTGTATCGC AGCGTAGACG TTGACACCGT
GAGCCACGCT TGTCCGGCGT TGGCACTGGC GGCAACTCTA CCGTAACAGA AAGCTGGCTC
CACGTTCGTC TCGTGCCGTA CCAACTCCCA ATGCTTACAG TTAGTTTACT AACCAACTAA
CTTATCTACA CAACACGACG CACGTACCTT TAACTTTTTG AACTGGGCGT TCTTGTCCGC
GGAAGGAATG CAGACTTGTC CTTTCCCCGC CGCTGTAAAG GTTCCATCCT GGCGTAGATT
TCCCGACGTC GAATGCTGGT AATTTCGTGC GGCAGAGGTT GACATGGGAA AGCAAGGCAA
GGTAAGGAAT GGTCTCTGAC GGATGGGACT GTGGAATGAT GGTGGCAAAA GCCAAAACGC
TTCCCGTGAT GTAGCAACGG TGGAGTAG
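The GC content field of this record can be recomputed from a sequence printed in the layout used above (blocks of ten bases separated by spaces). The sketch below is one way to do that in Python, assuming the field is the plain fraction of G and C bases over the whole gene sequence. The function name `gc_content` is hypothetical, and the sample is only the first sequence line, so its value is not expected to match the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between base groups."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, used only as a small sample.
sample = "ATGGCGAATC AGCGGGCAGC TGCAGAGTTG GCATCTCTGC TTTGGACGCT TGCGCACGAA"
print(f"{gc_content(sample):.1%}")
```

Passing the full gene sequence as one string, line breaks included, gives the value to compare against the reported field.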
 
Protein sequence
MANQRAAAEL ASLLWTLAHE MSVEDFGAVE SEVFTLVFAL VHAPDKESRM AGLAALDGLL 
VAPSADEEKK AIKFANALST GLRAANGDYE FFSAVSKALG HMAMRISNVD FVEAEVTRAL
EWLRTGRSDR SLLRLRRLAA SLSLKEFAIH APTTFHSKTS QSTLGQGGSN EFLDTIFQSI
RDPQPIVRVC AADALSQCLR ILVDRRHLSL TGLLCQIHFS TMEGLQEATK KQSWHAASES
EAAKHGSLLV VGTMLAYTRE FLLPRFEEIC RAVLACSKNP KVLIRLEVVR LIPKLAACCS
SVFGRRYLEQ SLVFLIDNVS SPASLRDNVD IRPSVYDAIG DLIMAMSDEN TGRRLEEIFA
VVRKGLHAPT SRSSVGHTLC PALHCASSLV EALKDLALPY LDGVIDDMFQ SGLSIDLIQC
LHSIAQSIPM YKDEIEDRML QEVSLSLAGN RRSSDHTLGS SYPAGAASTT ESSNVHINMN
NDAKTTKLLV LSLQTFASFS NSTAQVTLSG KIIPLMPFVQ DVTARYLLHP SNEVRRAAAL
ACCVLLIPHG SIFASAGSCS GLIIEDVLEA LLRAAVSDSS AVVRLCVVRA LDTRYDPFLC
QTHHLQDLFL VLQDETLATR VAGLQLLGRL ASLNPAPILP VLRRFLNDLV VELQCGVDTG
RGREEATRLL VVFLRVKPLQ RLIHPVLATL VGALPLTGAA PPRLASASLE ALGELAQATG
TALQPWVNDI IPHVLNTMKD QSSASKQRTS LRTLGQIAGS TGYVVRLYLD YPNLLSQATD
ILPATKRAPW TLRREVIRTL GIIGALDPDR YYSVASKARK GGVQYAMVAQ PVSNLSPAKR
MTPSEEDFYP TVSIQALMRI FRDSTLTVHH GMVIQAIMFI FKSLGVRCVP FLGKVLPHMI
LTIRHCPSNL KESLFIQLSN LTLVVKAHLR IFVDDIFDIV EQFWDSRHLS IILKLLSNIA
IGVPDAFRQF VPRFIRRLLT SLDELQVADW STAKQSLLPQ NGRAESEKLS HILKSISKLN
SMLREYLHIL IPALLKLADS LASLSFNGAT TTTISILDGF SVLNCRTLSA LIESQAPAPN
PVALALFTGL SCTPPINSEN GLPSRVVQPL VRIFRETPPR SLAVGLSMVE TLFDEHPVAP
ALRASFSVSN RQKVNQGNLQ RAWDVSQRSS REDWDEWMRR FAIQLLREAP SPALRASANL
AHAYQPLARE LFSAAFACCW KELSHPYRTD LLSALETAFV ADISPEILLA LLNLAEFMEH
DPSGGLPIDI SILADLALKC RAYAKALHYK EREYRNGGSG SCVEALISIN RKLDLQEGAL
GILKASAIDD EDASKQSGWW LAKLGNWTEA LEVYREKLKS DPHDFEAIVG CMRCLDASGE
WRKVLDLAEQ NWTALSQHRC IVRMCAQAAW RLGQWDDLEK YSSQLTCVGF DGAFYSAVLH
VHRQDWSHAA DAIDAARKAM DSRFTALLAE SYSRAYPSMV TAQMLSEMEE IIEYMKTEER
SRIEIDHHPA NRQSIERARE RLISVWKDRL AGCRMDSEAH ASILAVRSLV IGPEDDVDAV
LTLSKLSRQA ERHKFAERVL LDPLHSLKIQ HTFYFAYVKH LWYTGEKHEA TRRLEHLCDV
VDMVSHCERI NETSLRVACW LEYGEWKLST TTSLGSSMSP QFQLDVLTSL KRATQPDDCG
YKAWHGWSLL NFRIALQLND RHHLSSQADA QRPGASFDKS IRNHVVAAVR GFVNAINLGT
IKQSASVQQD LLNLLTCLFK FGSLQDVAVV LNECVSSVAI EAWLGVLPQL LARIHIKDPA
IRSVLHPLLT RLGEKHPQAL MYQLSVLLKS PVVERRTAAE SLMNSLKSHS SDLVEESLMV
SSELIRVAIL WSETWHAGLE NASAFFYVEN NIAAMLDQLH SLHGEFEKEP ETSMERDFAE
AHGGNIRQAY DCIKKYIQLS SNDDENLSPE QKDSRREEAE TFLHKAWDSY YVVFRPINQD
LKSMSLLRLP ECSPALSRAR NLELGVPGSY RVDGSYVRIQ KFVQRVSIIN SKQRPRKVTL
RGSDGKHYVF LLKGHEDLRQ DERVMQLFGL VNALLVRDPQ TKNQDLMIKR YTISPLSHNC
GLVGWVPHCD TMHALIRDYR EAKKVPMNIE NREMMKTAPD YDLLTGMQKV EVFTDALQKT
PGKGDDLAEI FWLKSTNSEE WLERRTKYTR SLAVMSMVGY ILGLGDRHPS NLMIDKLSGR
VLHIDFGDCF EIAMVRDKYP ERVPFRLTRM LVKAMEVSGI EGTYRSTCER TMNLLRSSRD
TLVAMLEAFV HDPLISWRLV NFITGGASAR VATETSIARS RMERSLMGVM GGENGVVHEE
ALNVKALQVI RRVEDKLSGT DFPDCEGEPL DVSDQVQRLI VQATSSENLC QLFIGWCAFW