Gene Phep_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0107 
Symbol 
ID8251192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp115925 
End bp130615 
Gene Length14691 bp 
Protein Length4896 aa 
Translation table11 
GC content48% 
IMG OID644933757 
Productconserved repeat domain protein 
Protein accessionYP_003090395 
Protein GI255530023 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.615678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC GTTTACTCAT TTTTTTTATT GCGCAATTTT TTATCTGTAC TACAGGTTTT 
GCACAATTCA CTCTATCAGA AAATTTTAAA GGAAGTACTG CCAATGGTAT TGTACTTGGC
GGCTCTCCTA GCGCATCGCT GACTTCCGGG GGCGTTGATC CTGTAAATCA GGGATACCTC
AGGTTAACTA AGGACGCGCT GGACCAGCGT GGTTACGCCT ATATTGACAA AGCCTTTCCT
TCTACCCTGG GGGTACTTCT TGATTTCGAA TATAAAACCT GGCACTCGGT AACAAATGAA
GGTGCAGATG GAATTGTGGT ATTCCTGTTT GATGGTTCAA TTACCCAATC TAATTTCAGG
CTTGGCGGAA CCGGTGCCGG CCTGGGTTAT GCGCCAAGGG GCGATAAAAA CGAAGCCGGA
CTGGGAGGGG CTTATCTGGG GATCGGGATC GATGAATACG GAAATTTCTC TTCTAATTAT
GGTACAGTAG GTGCGGCTAC AGACCGGGCG GCATCACCCC TAACCCTGGG ACGAAGAATA
AACAGCATCA CACTTCGCGG AAGAGAATCG GATGCTTACC GGCTTCTTGC TACGACTACA
CCAACTGCTT CTAGTCAGAT TCAACACACG CCAACCTCTG GTTCACGTCC TAACGATGCA
ACTTTCTACA GAAGGGTACA GGTAGAGATC AAACCAAACG CAAATGGAAA ATATGACATT
ATTGTCAGGT GGGCAATCAG CCCGGGCGGG GCGCTTACCC AACAGTTTAG TTATTCTCTG
GTAGACCAGT CGGGCGTAAG CTATGCACCC CCTGCAACTT TAAAACTAGG GTTTACTTCT
GCAACGGGTG GGTCGCTCAA CAATCACGAA ATCCGGAACC TAACGGTAAC TACTCCAGGT
AATATCCGGG TAGGAAAAAG GGCAGATAAA GATGTGCTGA GAACGATCCC TGCCGGAACA
AGTGCCAACC AGGTAACTTA TACCGTTGAG GTGGTTAACG ATAACAGCGT TGCACTTAAT
AATATAGAAT TTAAAGACAG GCTGACAGAT GTTAACGGTA CCTTAATTCC TTTAAGTGCT
TTTAATATAG GTACTGTGAG TTTTACCGGA TTTTTGGCCG GCACAACTGT GAATAAGTCG
ACCACTGCCA ACGAAATTGC AGGCAGTCTG AACATGGCTG CTGGGGCTAC AGGAAAAATT
ACCGTAACCG GGACTTTAAA TGCCGTGCCT GCCGGAAACA CACTCAGGAA TACGGCAAGC
ATTAACCCGA CAGACATCAC CGATCTGGAT CCGGACAACA ACATTGCTGA AGTAAATACC
CCGGTACTGG CCGAGGATGT AGACCTTACC ATCGCAAAAA CTGCAGCTGT TCCATGTCTC
TCTACCAGCG GTAATGATTT TAACGTGGTG GTTTCCAATG TTGGTGCAGT AGCTACTACT
GCTATAAATA AGATTGTAGT TACCAAAACC TATCCAACCA GTTATACTTT TACTAACCTG
TCCAACCCAA ACTGGACTTT AGGTACAAGG GAAACTTCGG GCAGTAACTA CATTTATAAA
TATACTTATA CTGGGGGTAC TTTGGCTTCC GGGGCCAGCA CTTCGCCGAT AAGCTACCGG
ATCACGGTAC CCACAGCTGT AAGTACTTAT GCAGACGTTG CACAGGTGAG CTACTTAAAT
TCTTCCAATG CAAACATTGA AGTTACAGCT AACCAGGGTA ACAATTCCAC AAGCAATACC
ATTGCCACAG TTTCTGCCAA ACCTGCAGTA GCTGATAAAG TTACGTATTG CTTAAATGCT
ACAGCAACCG CACTTTCTGC TACCCCTACC AATGGAAATA CATTGTTGTG GTTTCGGACC
AAGGGTGGTG TATCCTCTAT AAATGCACCT GTACCCAGCA CCGCAACGGC GGGAAATACA
AGTTATTTTG TAAAACAAAC CAATGGAAGC TGCGAAAGTG ATTATGCAGA GATTGTAGTT
ACTGTACAGG CTGCGGTTAA TGCCGGCGGC ATTGGCAGCA ACCAGACCAT TTGCAATGGC
GCAACGCCTG CAGCCCTAAC TTCCTCAACC GCAGGAAGCG GTGGAACTGC AGGTTCGGGC
ACTGCTTACA GATGGGAAAG ATCTGTGGAT GGTGGCGCTA CATGGACACC CATAGTTGGG
GCTACGGCCA GCACCTATGC GCCGGCTGCG CTTACTGCCA CCACACAGTT CAGAAGGGTT
TACATTTCAA CGCTGAATGG TGTAGCCTGT GAGTCCGTAA CGGGGCCGGT TACCATTACT
GTTCAGGATA TCACCGGGGC CGGATCAATT GGTACCGACC AAACCATCTG TACCGGAACC
ATTCCTGCAG CTTTTACTTC AGTTACCAAC GGAAGTGGAT CGGCGGGTGC GGCCATCAGC
TACAAATGGG AAAGCTCTAC AAATGGAACG CTGTGGACTG CCATCAGCAA TGCAACATCA
GCTACTTATA CCCCTACTGC AGCCTTAACG GTAACAACCC AGTTCCGGAG AACAGCAATT
TCAACATTGA ACAGTATGGC CTGTTCTTCC GTGCCTTCCA ATGTGGTTAC GGTTACAGTT
AACCAAAACC CTTCGGTAGC AAATGCCGGC GCAGATGCAG AGCAATTTAA TTCGGGGGTA
TTTACCTTGC AGGGAAATGC CCCTGCTGTA GGAACAGGCA TGTGGTCTGT TGTAGCACCC
GGAACGGCTA CCTTTCAGGA TCCTTCCAAC CGCAATACGA CGGTAACCAT AGCAGAAAAT
ACCAGTGCAA TTTTAAGGTG GACCATCAGC AATGGCCCAT GTGCGGTATC GGTTGATGAT
GTAAAGATTA CCTATACCAA AAGGACCGAT TTACAGGTTA CCAAAACCAT TGATAAAACC
ACACCAAAGG TTGGGGATAA CGTAACCTTT ACCATAACAG CTAAAAACAA TGGACCAAGT
AATGCAACAG CGGTTAAAGT TGCAGATGCT TTAAAAGCAG GATATACTTT TATAAGCAGT
TCTTCCGCAA ATTACGCTTC CTCAAATGGT GAGTGGTTGA TAGGTAGCCT GGAAAATGGG
CTTTCCCAAA CCCTCACCAT TATTGCCCGT GTTAATGCCA ATGCCATAGC GGCAGATTAT
GCCAATGTTG CCACCATCTC CGGAGCAGAA ACTGATCCGG ATGGCAGTAA CAATACCAGC
ACTTTAAATA CCATAGTACC TGTTCCTTCG TCAGACATTG AGATCAATAA AACGGCAACA
CCAAAGCCCG CAATTGCAGG ACAGGCACTT ACCTATACCA TTACGCTAAA GAACAACGGG
CCAAGCACGC TCAGGACTTC TGACGTTTTT GCTCTTACAG AAAATCTTCC CGCAGGCTAT
ACTGCCAGTA GCTTTACGGC CTCTGCCGGA ACTTTTAATC CGGCTACCGG AAACTGGAAC
GGTTTAACCC TGGCTACTGG GCAGCAGGCT ACTTTAACCA TTGCAGGCGC AGTAACAGCA
ACGGCCAGCG GAACATTAAG CAATACCGTA TCGGTTGCTG TTCCGGCCAC AATTTCAGAT
CCTGATATCA GCAACAACAG TAAAACCGAC AATACGGTGA TCAGTCGCCT GCTTGATCTG
GGCATTACCA AAACATCCGC ACCAAAACCC GTAATTGCAG GTAGCAATTT AACTTATACC
ATTACCTTAA GCAACAATGG CCCGGCCAGC TTATTGGCTA CAGATGTAGT AAAGTTAAAC
GAAAACTTCC CGGCAGGTTA TACCGCCTCA ACTTTTACTC CATCAACGGG TACCTTTGAT
AAAAATACAG GTAACTGGAC CGGATTAAAC TTAACACAGG GGCAAACGGC AACTTTGACC
ATTGCAGGTA CAGTTGCCGC AAATGCTGTT GGAACGCTAA GTAATACTGT AAGCCTGGTT
GCACCCACAG GTACAACCGA TAACAATACC ACAAACAATT CTGCTACAGA AACAACAGCC
ATTAACCGTT CAGTCGATTT TGAAATTACC AAAACCGCAA CCCCAAAACC TGCGGTAGCC
GGCGAGGCCC TTACCTATAC CGTAACCGTT AGCAATAAAG GCGTCAGTGC AATGAATGCT
GCTGATGTAT TAAAAGTTAC CGATGCTTTG CCAGCTGGTT TTACAGCATC CAGCTATAAT
GCATCAACCG GAACTTACAA TGCGGCCAGC GGAAACTGGA CAGGTTTAAC TTTGGCCAGC
GGGCAAAGCG CCAGCTTCAC CATCACCGGA AGAGTGGCCG CTTCTGCAAC AGGAAATCTG
GTAAATAAGG CCGTACTGAC CGTCCCGGCC GGAATAACTG ATCCGACAGG AGCCAATAAT
GAGGCCACGG ACAACACCGT GATTAGCGTG AAACCTGCGC TGGCAATTAC CAAATCAGGT
GCTTCAGGCT TAACCGCAGG CTCGGCAGTA AATTATACAC TTAAAATTGT AAATACCGGA
AGCAGTGATG CCATCAATGC GGCCATTAAC GACGCAGTAC CTACTTCAGT ACAAAATGTG
ACCTGGACCG CCACTCCAAA TGGGGCTGCT ACAATTGTTT CCGGAGGGTC GGGAACCGGG
AATACGGTAG CCGTTGGCGC AAATATACCG GCAGGAAATG CCAATAATTA CATTAACGTC
AACATCAGCG GAACATTGAG CCCTGCGGCT ACGGGCAGCA TAACCAATAC CGCTTCGGTC
GTACCTGCGG AGCCTCAGGG TTCCGGAAGC AACTCTTCTG TAAATGCCAA TGTAACCAGT
TCATCAGGAA TTGTGATCTC TAAAGCGGGG GCTTCATCGG CATCGGCAGG AGATGCCATT
ACCTATAAAA TTGAAATGGG AAACAACGGG CCGAGTAATG CAACCGCAGT ACAGGTTGCA
GACCTTGTTC CTGCTGCACT CAGCAATGTA AGCTGGACAA GCCAGACGCA GGGAGGGGCC
AGCATAACCA GTGGTGCCAC GGGTTCGGGG AATACCATAA ATTTAATTGC AAATGTTCCT
GCCGGTATAC AGCATAAAGT AATTTTAAAT GTAACAGGCA CCATAAAGCC GGATTATACA
GGCGCCATTA CCAATACCGT TGTGGCCACA CCCCAGGAAA CTGGCAGCAG CCCTGTAAGT
GCACAAGCGG TAACCAGCAT CAACAGTAAG CCTGTATTTA CCATTGTAAA AAGCGGTCCG
GCTACAGTAA TTGCCGGCAA CACCATTGTC TACACCATCA CCGTAAGGAA CACCGGCCCT
TCCAACAGCT TAAATACGGT AATTACCGAT GATATCCCTT CGGGAATTAC CAATGTGACC
TGGACAAGCA GTGTCACTGA CGGTGTTGCC AGCATTACTT CTGGCGCAAG CGGTACGGGC
AATGCCTTAA GCTTAACCGG AAACTTTAAT GCGAACAGTA CGGTTCAGGT GCATGTAACC
GGAAAAGTAA GTTCAGGCCT GCTGGGCAAT GTCCTCAACT CGGCAACCGT AACCCCGGCA
GAGGCCGGTG TTGTGCCGGT AACATCCGGA CAGGTTATTA CCTCAGTTCA AACCAAATCC
GGTTTAACAG TTGCTAAAAA TGGTCCATCT TCAGTAATTT CAGGCACCAA TATCACTTAT
ACCATTGAGG TTGGCAACAC AGGGCCAAGC GATGCCATGA GTGCAAGGGT CACAGATTTA
ATCCCGGCTG AAATTCTGAA CCCTGTATGG AGTACAGCCA CGCAGGGAAC TGCAGCCATC
ATTAGCGGCG GAACAGGAAG TGGAAATGAC CTGCAAACTG TTGTAAATGT TCCGGCAGGG
GCAGGCAATA AGGTCATTGT TACCATTACC GGCAAAGCAA GTGCTTCCTA CAGTGGGGCC
ATAACCAATA CGGCATATGT AACAGCAACC GAACAGAACA GCCCTTCGCC GCAATCTTCA
GTTACTACAA CAATAAACAG GCTTCCATCG GTATCTGTTA CCAAAAATGG CCCTTCATCG
ATCGTTGCGG GTGCCGGCAT TAATTACAGT ATTGATGTAA CGAATACCAG TACAGCAGAT
GCACAAAACC TGGAAATTAA AGACCTGGTC CCATCAGAAA TCAGCGCGGT AAGCTGGACC
GCCGTCACAG CAGGTTCTGC CACGGTAAAC GGAACAGCTT CCGGAACCGG AAACAATATC
GTTCTTACAG GGAATATTCC GGCTGGAGCG GCAAACAAAA TAACGCTTAA CATTAGCGGA
AAGGTTAGCG CCGCTTATAG CGGTACCTTG AGCAACACCG CAACGGCAAC ACCGGCAGAA
ACAGGCACAG CAGTAAAAAC ATCATCGGTT AGCACAGCAG TACAAAAAAT ACCGGTACTG
TCTATTGAAA AGACCGGCCC GGCAAACATC AATGCGGGCC AGGATATCAG CTACACCTTA
AAAATTAAAA ATAACGCTAC TGCAAATGCA GATTTAGCTT TAATTACAGA CAATGTTCCT
GCGGCTATAC AGAATGTAAC CTGGTCTGCA ACAGTTGCCG GTGACGCAAC AATTAGTGGT
ACGGCAAACG GAACCGGAAA TGCGATTAGC CTAACGGGCA ATATTCCAGC AGGAACCGGA
AATGAGATTT TGGTTATCAT TAACGGAAAG GTAAGCCCAT CGGCAAATGC CAGCCTGGTA
AATGCCGCAA CGGTTACACC GGCAGAAACC GGGGCTGCTG CCAAAACATC GAATACCATA
ACTACCCAGG TAAGCAAAAC ACCTTCTGTT AGTTTAATTA AAACCGGGCC TGCCACAGCT
AAAGCCGGAG AAGTAGTTAC TTATATTATA GATGCGGTTA ACACGGGTCC TTCTAATGCA
GACAACCTGG CCATAGCAGA TGTGGTTCCG GCAATTTTAA CCAACGTTAC CTGGACAGCA
GTTGCCAATG GGATCTCAAC TCTGGCAAGC ACTTCAGGTA CCGGAAATCT GGCCTTAACA
GGAAACCTGA ATGTTGGTGC GGCCAATAAG ATCCGGATCA CCATTATCGG AACGGTTCCG
GCAAATCAGG TAAATACAAC CATTACAAAT ACCGCCAGGG CAATTCCGGC AGAAACTGGC
CTGACGGTTA GTTCTAATAC AATCAGCACA GTGATTACTA ACCAGAGCAA TATTTCCATT
GTAAAATCTG CCCCGGTAGC AGTTAACGCA GGCGAACTGG TAAATTATAC CCTACTGGTT
AAAAATGCCG GTCCAAGTAA TGCACTAAAT GCCACTATTG CAGATGCTGT TCCCGCAGCT
ATCCAGAATG TAAGTTGGAC CGCCGTTGCA AGCGGGACCG CAAACCTGAC TTTTGGCGCT
ACAGGGACCG GAAACGCGGT GAATATGAGG GCCAACCTTC CTGCAGGTGA TACCAATACC
GTTCTGGTAC AGATTACCGG TAAACTAAAC CCGGCATTTA GCGGAACGGC GTTAAGCAAT
ACCGCATCGG TTGCGCCTTC AGAAACCGGA AACCCGGTTG TCAATTCCAA TACAACAAAC
ACTTCGGTAA GCAAACAGGC GGATTTAAGG ATTTCGAAAA CCGGGCCCGC AAGCCTTTTT
GCCGGAGAAC AGGTTACTTA TACCATAACC TTAGAAAACC AGGGTCCTGG TGATGTTACA
GGTGCAGCAA TCAGCGACTT GCTTCCTGCG TCCATCATCA ATACCAGCTG GAGCGTAGCT
ACTCAGGGTA CAGCAAGCAC GAATGTTAAT AACGGCACCG GAAACCTGAA TCTTAGCGCG
TCGCTTAAAG CCGGCGGTAC AGATAAAGTT GTGGTTACGT TGCTGGGTAC GGTTGATCCG
GGATACCAGG GTGTAAATGT AACCAATACC GCCACCGCAA CTCCGCCATC CGGGGTTACC
GATCCGACTC CGGCCAGCGC CGTTGTAAGT ACTGCCATCA CACGTAAAGC CAATGTCAGG
ATGGTAAAGT CTGGTCCGGC AAATGCCAAA GCAGGGGAAG AAATTAATTA TACCTTGCGG
ATTACCAACC AGGGGCCAAG TAGTGCCATT GGTACAGCGA TAATTGATAA CCTGCCTGCC
GGAATTGTTG CCGGCTCGGT TACCTGGACT GCCACCGCCA CAACAGGGTC CAGTGTAAGC
TCAGCTTCGG GTACAGGAAA TATAAATTTA ACGGCTGATA TTGCGCCGGT TTCCGGAGTG
ATAGAAGTGA AGATCAAGGC CTTGGTCAGT CCTTCATTAA CAGATGGCAC TTCTATTGCA
AATACCGCCA CCGCCACCGT GGCTGTAGGT ATAACGGATC CGGAACCGGG AAACAATACT
TCAACATTTA ATACAGTTGT AGATAACGAT CCAAACTTTA CCGTTGCAAA ATCCGGACCT
GCAAATGCAA ATGTAGGCGA CCACATTACC TATACCATCC TGGTTAAAAA TACCGGACCT
GGCGATATTA CCGAGGCCTT TATTGTTGAC AATGTACCGG ACGATGTTGA GGTGCTAGAC
TGGACAGCAA CGGCTAATGG CTCGGCTGTA ATTAGCCCGG GTTCGTCATC ATCAGGTACA
ACAAATAATG TTTATACCGT TGCCGACATC CCGACAGGCA ACAACAGTAT ATTAATTACG
ATTAACGGGA TCATCAAACA AAGTGCAGGT TCATCTTTCA CCAACAAGGC CGAAGCCACT
TCTGGCGTTG TTAAAGGAAG TTCGGTATCA ACAAGCGTTA ACCGCTCTAC AGATATTGCA
GTGATCAAGG CCGGCCCTCA AAGTGCTTCG GCGGGAGAGA CAGTTACCTA TACCTTAAAT
GTGTACAATA ATGGTTCAGT AGATGTTGAC AACCTGGTGA TTACGGACCT GGTAAATACA
TTGCTTACCG ATGTAAGCTG GACCACAACT GCAATGGGTT CGGCCAGGGT CAGCTCTGTT
CCTTACGGGG CGGGCAATAC CGTACAGCTA ACCGGAAATA TCGCAGGTGG CCAGGGTAAT
TTTATTACGG TTACCATAAA CGGAAAAATA CCATCTAATG CAGCTTTGGG GCCATTATCC
AATACGGCAA CAGTAACCTT GCCTTCAGGG GTTACCGATT ACAATACAGC AAACAATACC
TCGCAGGTAA GTACAGCGAT CATCAGCACA CCAACATTGG TACTGCAAAA AACGGGACCA
GCTACAGCTG CTGCCGGAAA TCAGATCACG TATAGAATTA AAGTAGAAAA TACCGGTCCA
AGCGATGCTG CGGCGGTAAA TATTGCAGAT GTTTTGCCTG CCGAACTTAG CGGTGTGCAA
TGGCTGGCCT CGGCCGGGGG TACCGCAGCA GGTATTATTG GCACAAGCAG TGGTAATGGC
AATGTAGCTG TAAATGCAAA TATTCCTGCA GGTTCTTATA TAACTGTTGA TGTAACAGGA
ACCATCAGTG CTGATTTCTC CGGTACGATC AAAAATACCG CGTCAGCCAA AATAGGCAGC
AGCCCTGCAG TTTCCTCACC CGAAGTCAGT ACTGTCGTAA GCAAATTAAC CAACTTAACC
ATTGCTAAAT CTACTGCATC TACATTAAGT GCAGGTGAAC CCATTGTTTA TACCATCGAA
CTTGGCAACA ATGGCCCCAG CAATGCTATA GGGGCAGTAT TAACCGATAA CATTCCGGCA
ACAATTCTGG ATCCAACCTG GTCTGCCAGC GTTTCGGGTG GTGCCCTGGT TACCGCAAAT
GGAACAGGAA GCGGAAATAC TTTGTCGCTA ACCGGAAATA TCCCGAAAGG TGGAAAAATC
TATGTCACCG TTAATGGAAC ACTGGCGGCC AATGCCACCG GAAGTGTTTC AAATACGGCA
ACCATTACAC CTTCAGAACC AGGAAATCCT CCTGTGGTTT CCAGTCCGGC AGTTGCGGTG
GTTAAACAAA GCCCCAATCT GCTGTTAACA AAATCTGCCC CAACACTATC GTCGGGTGGC
AGCACCATTA CATATGCCCT TAAATTGGTT AACTCAGGAC CTTCTGATGC CTTGAATACC
ATATTAACGG ATGCTGTTCC GGGCAATGTG GCAAATGTGA GCTGGACAGG AACCGCTGCG
AATGGTGCAA GCCTGTTGTC GGGCAGTATG GGTACCGGAA ACAATGTTTC CTTAACAGCC
AACATTCCTG CAAACGGAAG CGTAGATGTG GTAATCAGCG GTACAATTGA TCCATTGTTC
AACAATACAT TGGTCAACAC GGCAAGTGCA AGCCCGTCGG AACCTGGTAT ACCGGCTGTT
CAAAGCACTG CCTCAACATT GGTTACACCA GCTGTAGATT TTGTGGTTTC GAAAAGCGGG
CCGGCACGTG TATCTGCAGG TGAAACCATA AACTATCAGG TTATAGTTAG GAACAATGGC
CCGAGTACCG CTTTAAATTC GGTTATTGCT GATGTTGTTC CGGCAGAAAT CGGCAATGTA
ACATGGACGG CCACCGCAGA GGGCACAACA GCAATTCTAT CAGCAAAACA AGGAACGGGA
AATAACATTC AGGTGAATGC CAACCTTCCA TCGGCAGCGG CTGACCGTAT TGTTATTGAT
ATCAGCGGAA CGGTTGCCCC ATCATTTAAC GGAACCATCA GCAATACGGC AAGTGTTACA
CCGGCAGAAA CCGGAACAGC AGTAACATCA GGCCCTGTTG CAACTGCAGT AAGCAGGCAA
CCGGTAATCA AGGTTACAAA AGGTGGCCCG GCCATATTGA GATCGGGCAA CAAAATCAGC
TACCTGATCA ATATTGCAAA CGAAGGTACC GGTGATGCAC TTGACCTGGC CATAAATGAT
GTTGCACCAT TGGCACTGGC CAATTTAAGC TGGACAGTGG AAAATATCGG TGCAGCAACA
ACTTCGGCAA CCAGTGGGAC TGGTGACATT AACATTAGGG CCGACCTTCC GGCAGGGGTG
GCAAATGCAG TCAATATTTA TGTAAGCGGA GAAATTCCTG CGGCTTTTGA CGGAACAATC
CTGAATTCAG TTACTGCACA GCCATCGGAA AGCGGGGCTT TGAGCTCAAC ATCTGCGGTA
GCTACAACGG TTTACCGCTC ATCTTTAACC CTGGTTAAAA CGGCTTTAAA TGCGGTTGGC
AAAGCTGGGG ACGTGATTGA TTACGAGCTT AGCATTAGCA ATACCGGTAC TTCTGCTTTA
ACCGGTCTGG TAATTGAAGA CGCAGGTGCT GATGCAGGGA GCATTACACC TGCCAGTATA
GCAACATTGG CCGCTGGCGC CACGGTTAAG GCAACCGCCA GGCATACGCT AAGTCAGGCC
GATGTAGACC TGGGTAAATT CAGCAATAGC GCTTCAGTTT TTGCAAAGGC GCCGGATGCA
TCTGAAGTCA GTGATATTTC CGGAAATACA GCAGGCGATG ATTTGCCTAC AGTGGTACAA
ATTATGCCGG TTCCGGCGGT TAGGCTGGTG AAAAAGGTAA ATGGCGGAGT TCCAAACCAG
GCTGGACAAG CCATAAATTA TGATTTAACA GTTAAGAACA CGGGTAATGT TACGCTAAAC
AATTTGCTGG TAACGGATGC AAATGCAGTA ATTTCTGGCA GTCCGATCAG CCAGCTGGCA
CCGGGTGCAA CCGCTGTTGT TACGGCAAGC CACGTACTTA CACAAGCAGA TGTAGATGCG
GGAACTTATA TCAATACTGC CGAAGTTACT GCAAGTCCGG CAATCGGGGC CAATGTTTCT
GATAAGTCCG GAACAGAGGA AACGAACGAC GAACCCACAG TAAGTACAAT TGTACCAAGC
GGTAGCATAA GCTTAACAAA AGCGGCAAAC AATACCGGGA CCAAGGCGGG CGATGTGGTT
AATTACACCC TGGTTGTAAA AAACACAGGC AACATAACTT TAAGCAATGT TGTGGTAACT
GACGCTGGTG CAGATGCAGG ATCTGTTTTA CCTGCATCGG TAGCCGTTTT GCCCCCCGGC
GCCACGGCCA CTTTAAGCGC CAGGCATACT TTAACACAAA GTGACATAGA TAACGGAAGC
TATAGCAACC AGGCCGCTGT TTCTGCTCAG GATACCAAAG GAACGATTGT TTCAAACCCT
AAATCTGACG ATCCGGCTAC TCCGGCTGCT GATGATGCTA CGGTGGTCAG CATTGTGGAA
AGTGGGGCCA TAACTTTGAC CAAAGTTGCC GGTAACACTG GAAGCAGGGC CGGCGATGTA
ATCAACTACA CCATGGTAGT GAAAAACACA GGCAATGTAA CCCTAAGCAA TATTGCCGTA
AATGATGCGG GCGCAGACGC GGGCTCAATT ACACCTGCAA GCCTGGTAAG TCTGGCGCCA
GGTGTAAGTG CAACTGTAAC CGCAAGGCAT ACCCTGACAC AGGCAGAGGT AGATAACGGA
ACATACAGCA ACCAGGCATC TGTTTCTGGC AATACGATAA AAGGATCACT TGTTTCAAAC
CCTAAATCTG ATGATCCGGC TACTCCGGCA GCGGACGATG CAACTGTAAT TGCTATTGTG
CCAAACGGTG CTATGTATTT GACCAAAATT GCCGACAATA CCGGAAGCAG GGCCGGCGAT
GTGATTAACT ATACGATTGT AGTGAAAAAT ACAGGCAATG TTACCCTAAG CAATATTGCA
GTAAGCGATG CAGGTGCAGA TGCCGGATCA GTAAGCCCGG CAAGCATTGC TGTTCTGTCA
CCGGGGGCAA CCGCCACAGT TAAGGCTAAA CATACCTTAA CCCAGACTGA AGTGGACAAT
GGCAGTTACA GCAACCAGGC ATCGGTTGTG GCCAAAGACC CAGCAAATAC AACGGTCTCA
AATCCGAAAT CTGATGATCC GGCTACTCCG GCGGCCGATG ATGCCACGCT GAGCACGATT
GTGCCAAACG GTGCTATGTA TTTGACCAAA ATTGCCGACA ATACCGGGAG CAAGGCCGGC
GATGTGATTA ACTATACGAT TGTAGTGAAA AATACAGGCA ATGTTACCCT AAGCAATATT
GCAGTAAGCG ATGCAGGTGC AGATGCCGGA TCAGTAAGCC CGGCAAGCAT TGCTGTTCTG
TCACCGGGGG CAACCGCCAC AGTTAAGGCT AAACATACCT TAACCCAAAC TGAAGTGGAC
AATGGCAGTT ACAGCAACCA GGCATCGGTT GTGGCCAAAG ACCCAGCAAA TACAACGGTC
TCAAATCCGA AATCGGATGA TCCGGCTACT CCGGCGGCCG ATGATGCCAC GCTGAGTATG
ATTGTGCCAA ACGGGGCCCT GCACTTAACC AAAGTTGCAG ACAATACCGG AACAAAAGCA
GGAGAGGTGA TCAGTTATAC TATAGTGGTA ACAAATACGG GCAACGTAAC TTTAAACAAT
TTGGTGATAA GAGACCCGGG TGCAAACGCA GGATCTGTTT TACCTGCAGT CATCGCTAAT
CTGGCGCCGG GTGCCAGGGC TACAGTTACA GCCAAACACA CCTTAACCCA GGCCGAGGTA
GATTATGGAA GTTACAGCAA CCAGGCATCG GTTTCGGGCG ATACACCAAA AGGGGCAACA
ATAAGTAATC CAGGATCGGA CGATCCGAAT ACACCTGCAC CGAATGATGC AACGGTGATC
AGCGTTACTT CCAGTCCGGC CATTACCCTG CTGAAAAGCG GTATTGTAAG TTCAGATGCA
AATAGCATAA CCTATAGTTT TACTATCAAA AATACCGGAA ACGTAACTTT ATCGGCCATT
ACCATCAGCG ATCCTAAGAT CGGGTTGACC AGAACAGTTT CAGCGCCACT GGCCCCAGGT
ACATCAATTG TGGAAACTGC GGTTTATACT TTAAGCCAGG CAGATAAAAA TGCCGGAACT
GTGAGCAACA GCGCAACAGT TGTAGCCCGG GCCCCGGGTG GACAAACAGT AAGTGATGTG
TCTGGAACTG CAGAGAACAA CAATACCCCA ACAGTAACGC TTGTACCTCA AAAACCTATA
ATCGCTCTGG TGAAGACAGC CAGCTTTAAC GGCAACAAAA TCACCTACAC TTTCGGCATT
AAAAACCTGG GTAATGTGAC CTTAAATACG ATTGAGCTGA GTGATATCAA ACTGGGTATA
AGCAATAAGG CTATAGTGGT AACAAACGGT TTATTGCCAG ATGCTGCTGT TGTGGTAACA
GAGGTATACA CCCTGACACA GGCCGATAAA GATCTGGGTC AGGTAAGCAA TACTGCAACC
GTTCAGGCCA GAGGCCCATC GGGCGCTGTG GTTCAGGATG TGTCAGGAAC AGCTGAGGGC
AACAATACAC CAACTGTAAT TGCTGTTCCG AAATCACCTG TTGCTGTTGA CGACAAGCTG
GAAACTAAAG CCAACAGTCC TGTTGTGGTA GCTGTTCTGG ACAATGATGA TCCTGGTAAT
TCGGCCTTTG ACCAGCTAAC TGTTGAGATT GTAACACAGC CCCGGCATGG TACAGTAAAA
GTCAATACAG ATGGAACCAT AACCTTTACA CCCAATCCGG GATATACCGG GTCAGATTCC
TTCCAGTATC GGGTAAAAGA TGCTTTTGGA TATTACACCA ATGTGGCTAC GGTAACGGTA
AATTCAAACT TCTTTGAGAT CCGTGTGCCG AACTTATTTA CGCCGAACGG GGATGGAATA
AACGATACTT TCGAAATAAG GGGATTGAAC CAATACCAGG ATAACGAGCT GAACATCTTT
AACAGGTGGG GCAATGAAGT TTTTAAACAA AAAGGTTATG CCAATACCTG GACGGGAGAA
GGCTTAAATG AAGGGACGTA TTATTACATT TTAAGGGTAA AGCGTATTGG AAGTAACCAG
TATGAAGTTT TCAAAGGTTA TGTAACCCTG GTAAGGGCAT TTAAAAAATA G
 
Protein sequence
MNKRLLIFFI AQFFICTTGF AQFTLSENFK GSTANGIVLG GSPSASLTSG GVDPVNQGYL 
RLTKDALDQR GYAYIDKAFP STLGVLLDFE YKTWHSVTNE GADGIVVFLF DGSITQSNFR
LGGTGAGLGY APRGDKNEAG LGGAYLGIGI DEYGNFSSNY GTVGAATDRA ASPLTLGRRI
NSITLRGRES DAYRLLATTT PTASSQIQHT PTSGSRPNDA TFYRRVQVEI KPNANGKYDI
IVRWAISPGG ALTQQFSYSL VDQSGVSYAP PATLKLGFTS ATGGSLNNHE IRNLTVTTPG
NIRVGKRADK DVLRTIPAGT SANQVTYTVE VVNDNSVALN NIEFKDRLTD VNGTLIPLSA
FNIGTVSFTG FLAGTTVNKS TTANEIAGSL NMAAGATGKI TVTGTLNAVP AGNTLRNTAS
INPTDITDLD PDNNIAEVNT PVLAEDVDLT IAKTAAVPCL STSGNDFNVV VSNVGAVATT
AINKIVVTKT YPTSYTFTNL SNPNWTLGTR ETSGSNYIYK YTYTGGTLAS GASTSPISYR
ITVPTAVSTY ADVAQVSYLN SSNANIEVTA NQGNNSTSNT IATVSAKPAV ADKVTYCLNA
TATALSATPT NGNTLLWFRT KGGVSSINAP VPSTATAGNT SYFVKQTNGS CESDYAEIVV
TVQAAVNAGG IGSNQTICNG ATPAALTSST AGSGGTAGSG TAYRWERSVD GGATWTPIVG
ATASTYAPAA LTATTQFRRV YISTLNGVAC ESVTGPVTIT VQDITGAGSI GTDQTICTGT
IPAAFTSVTN GSGSAGAAIS YKWESSTNGT LWTAISNATS ATYTPTAALT VTTQFRRTAI
STLNSMACSS VPSNVVTVTV NQNPSVANAG ADAEQFNSGV FTLQGNAPAV GTGMWSVVAP
GTATFQDPSN RNTTVTIAEN TSAILRWTIS NGPCAVSVDD VKITYTKRTD LQVTKTIDKT
TPKVGDNVTF TITAKNNGPS NATAVKVADA LKAGYTFISS SSANYASSNG EWLIGSLENG
LSQTLTIIAR VNANAIAADY ANVATISGAE TDPDGSNNTS TLNTIVPVPS SDIEINKTAT
PKPAIAGQAL TYTITLKNNG PSTLRTSDVF ALTENLPAGY TASSFTASAG TFNPATGNWN
GLTLATGQQA TLTIAGAVTA TASGTLSNTV SVAVPATISD PDISNNSKTD NTVISRLLDL
GITKTSAPKP VIAGSNLTYT ITLSNNGPAS LLATDVVKLN ENFPAGYTAS TFTPSTGTFD
KNTGNWTGLN LTQGQTATLT IAGTVAANAV GTLSNTVSLV APTGTTDNNT TNNSATETTA
INRSVDFEIT KTATPKPAVA GEALTYTVTV SNKGVSAMNA ADVLKVTDAL PAGFTASSYN
ASTGTYNAAS GNWTGLTLAS GQSASFTITG RVAASATGNL VNKAVLTVPA GITDPTGANN
EATDNTVISV KPALAITKSG ASGLTAGSAV NYTLKIVNTG SSDAINAAIN DAVPTSVQNV
TWTATPNGAA TIVSGGSGTG NTVAVGANIP AGNANNYINV NISGTLSPAA TGSITNTASV
VPAEPQGSGS NSSVNANVTS SSGIVISKAG ASSASAGDAI TYKIEMGNNG PSNATAVQVA
DLVPAALSNV SWTSQTQGGA SITSGATGSG NTINLIANVP AGIQHKVILN VTGTIKPDYT
GAITNTVVAT PQETGSSPVS AQAVTSINSK PVFTIVKSGP ATVIAGNTIV YTITVRNTGP
SNSLNTVITD DIPSGITNVT WTSSVTDGVA SITSGASGTG NALSLTGNFN ANSTVQVHVT
GKVSSGLLGN VLNSATVTPA EAGVVPVTSG QVITSVQTKS GLTVAKNGPS SVISGTNITY
TIEVGNTGPS DAMSARVTDL IPAEILNPVW STATQGTAAI ISGGTGSGND LQTVVNVPAG
AGNKVIVTIT GKASASYSGA ITNTAYVTAT EQNSPSPQSS VTTTINRLPS VSVTKNGPSS
IVAGAGINYS IDVTNTSTAD AQNLEIKDLV PSEISAVSWT AVTAGSATVN GTASGTGNNI
VLTGNIPAGA ANKITLNISG KVSAAYSGTL SNTATATPAE TGTAVKTSSV STAVQKIPVL
SIEKTGPANI NAGQDISYTL KIKNNATANA DLALITDNVP AAIQNVTWSA TVAGDATISG
TANGTGNAIS LTGNIPAGTG NEILVIINGK VSPSANASLV NAATVTPAET GAAAKTSNTI
TTQVSKTPSV SLIKTGPATA KAGEVVTYII DAVNTGPSNA DNLAIADVVP AILTNVTWTA
VANGISTLAS TSGTGNLALT GNLNVGAANK IRITIIGTVP ANQVNTTITN TARAIPAETG
LTVSSNTIST VITNQSNISI VKSAPVAVNA GELVNYTLLV KNAGPSNALN ATIADAVPAA
IQNVSWTAVA SGTANLTFGA TGTGNAVNMR ANLPAGDTNT VLVQITGKLN PAFSGTALSN
TASVAPSETG NPVVNSNTTN TSVSKQADLR ISKTGPASLF AGEQVTYTIT LENQGPGDVT
GAAISDLLPA SIINTSWSVA TQGTASTNVN NGTGNLNLSA SLKAGGTDKV VVTLLGTVDP
GYQGVNVTNT ATATPPSGVT DPTPASAVVS TAITRKANVR MVKSGPANAK AGEEINYTLR
ITNQGPSSAI GTAIIDNLPA GIVAGSVTWT ATATTGSSVS SASGTGNINL TADIAPVSGV
IEVKIKALVS PSLTDGTSIA NTATATVAVG ITDPEPGNNT STFNTVVDND PNFTVAKSGP
ANANVGDHIT YTILVKNTGP GDITEAFIVD NVPDDVEVLD WTATANGSAV ISPGSSSSGT
TNNVYTVADI PTGNNSILIT INGIIKQSAG SSFTNKAEAT SGVVKGSSVS TSVNRSTDIA
VIKAGPQSAS AGETVTYTLN VYNNGSVDVD NLVITDLVNT LLTDVSWTTT AMGSARVSSV
PYGAGNTVQL TGNIAGGQGN FITVTINGKI PSNAALGPLS NTATVTLPSG VTDYNTANNT
SQVSTAIIST PTLVLQKTGP ATAAAGNQIT YRIKVENTGP SDAAAVNIAD VLPAELSGVQ
WLASAGGTAA GIIGTSSGNG NVAVNANIPA GSYITVDVTG TISADFSGTI KNTASAKIGS
SPAVSSPEVS TVVSKLTNLT IAKSTASTLS AGEPIVYTIE LGNNGPSNAI GAVLTDNIPA
TILDPTWSAS VSGGALVTAN GTGSGNTLSL TGNIPKGGKI YVTVNGTLAA NATGSVSNTA
TITPSEPGNP PVVSSPAVAV VKQSPNLLLT KSAPTLSSGG STITYALKLV NSGPSDALNT
ILTDAVPGNV ANVSWTGTAA NGASLLSGSM GTGNNVSLTA NIPANGSVDV VISGTIDPLF
NNTLVNTASA SPSEPGIPAV QSTASTLVTP AVDFVVSKSG PARVSAGETI NYQVIVRNNG
PSTALNSVIA DVVPAEIGNV TWTATAEGTT AILSAKQGTG NNIQVNANLP SAAADRIVID
ISGTVAPSFN GTISNTASVT PAETGTAVTS GPVATAVSRQ PVIKVTKGGP AILRSGNKIS
YLINIANEGT GDALDLAIND VAPLALANLS WTVENIGAAT TSATSGTGDI NIRADLPAGV
ANAVNIYVSG EIPAAFDGTI LNSVTAQPSE SGALSSTSAV ATTVYRSSLT LVKTALNAVG
KAGDVIDYEL SISNTGTSAL TGLVIEDAGA DAGSITPASI ATLAAGATVK ATARHTLSQA
DVDLGKFSNS ASVFAKAPDA SEVSDISGNT AGDDLPTVVQ IMPVPAVRLV KKVNGGVPNQ
AGQAINYDLT VKNTGNVTLN NLLVTDANAV ISGSPISQLA PGATAVVTAS HVLTQADVDA
GTYINTAEVT ASPAIGANVS DKSGTEETND EPTVSTIVPS GSISLTKAAN NTGTKAGDVV
NYTLVVKNTG NITLSNVVVT DAGADAGSVL PASVAVLPPG ATATLSARHT LTQSDIDNGS
YSNQAAVSAQ DTKGTIVSNP KSDDPATPAA DDATVVSIVE SGAITLTKVA GNTGSRAGDV
INYTMVVKNT GNVTLSNIAV NDAGADAGSI TPASLVSLAP GVSATVTARH TLTQAEVDNG
TYSNQASVSG NTIKGSLVSN PKSDDPATPA ADDATVIAIV PNGAMYLTKI ADNTGSRAGD
VINYTIVVKN TGNVTLSNIA VSDAGADAGS VSPASIAVLS PGATATVKAK HTLTQTEVDN
GSYSNQASVV AKDPANTTVS NPKSDDPATP AADDATLSTI VPNGAMYLTK IADNTGSKAG
DVINYTIVVK NTGNVTLSNI AVSDAGADAG SVSPASIAVL SPGATATVKA KHTLTQTEVD
NGSYSNQASV VAKDPANTTV SNPKSDDPAT PAADDATLSM IVPNGALHLT KVADNTGTKA
GEVISYTIVV TNTGNVTLNN LVIRDPGANA GSVLPAVIAN LAPGARATVT AKHTLTQAEV
DYGSYSNQAS VSGDTPKGAT ISNPGSDDPN TPAPNDATVI SVTSSPAITL LKSGIVSSDA
NSITYSFTIK NTGNVTLSAI TISDPKIGLT RTVSAPLAPG TSIVETAVYT LSQADKNAGT
VSNSATVVAR APGGQTVSDV SGTAENNNTP TVTLVPQKPI IALVKTASFN GNKITYTFGI
KNLGNVTLNT IELSDIKLGI SNKAIVVTNG LLPDAAVVVT EVYTLTQADK DLGQVSNTAT
VQARGPSGAV VQDVSGTAEG NNTPTVIAVP KSPVAVDDKL ETKANSPVVV AVLDNDDPGN
SAFDQLTVEI VTQPRHGTVK VNTDGTITFT PNPGYTGSDS FQYRVKDAFG YYTNVATVTV
NSNFFEIRVP NLFTPNGDGI NDTFEIRGLN QYQDNELNIF NRWGNEVFKQ KGYANTWTGE
GLNEGTYYYI LRVKRIGSNQ YEVFKGYVTL VRAFKK