Gene Cpin_5290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5290 
Symbol 
ID8361467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6708988 
End bp6722277 
Gene Length13290 bp 
Protein Length4429 aa 
Translation table11 
GC content47% 
IMG OID644967438 
Producterythronolide synthase, 6-methylsalicylic acid synthase 
Protein accessionYP_003124922 
Protein GI256424269 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTTA ATCTTGGTGT AAATAAAGCG GAAATTATCA CAGCTTTACC TGCAACTGTT 
ATTACTGTAA TTGATAAACT GGAAGAACAT ATACCAGACA CAAATGAAGT GGTGAGCAAT
ATGAAGACGG AAAGCGCCTC CTATCTTTCC GTACTGGATA AGCTGCTGAC AGAATTGTTA
TGGTGGCAGC TGAAAAAATC AGATGTACTG AAAGCCAGTG GCCTTATCAG GAGCGAATAC
CACAGACCAC AACAATTGCC CGCTTTTTAT GAACGCTGGT TGGATGAAAC TTTACGCGTA
CTGGTAAGTG CAGGTTATCT GGAAAAGGAG GGTAACAAAT TTACCCCGAC AGGAAAAGGA
GGTGCTTCAA AAGATGCCTG GTCTGCCTGG GAAACGGCAA AGGTGACTGA CTTGTCTGAT
GTGAATGTCA AGGCGCAGGT GGCTTTACTG GATGCTACCA TGAAAGCCTT ACCGGATATT
CTTTCAGGAA AGGTGCCGGC AACCGATGTG CTGTTTCCGG AAGGTTCGAT GAAACTTGTA
GAGGGCATTT ATAAGCACAA TAAGGTAGCG GATTATTTTA ATGAAGTGTT GTCAGAAACC
CTGACTGCTT ATTGTCAGGA GCGAATCAGA CAAGACAAAA ATGCACGTAT TAATATCCTT
GAAATAGGCG CAGGTACAGG TGGTACCAGT GCCATCCTGT TTAAGCAATT GGCGCCGTTC
CGTGAGCATA TTGAAAGTTA TTGTTACACG GATCTCTCCC GCGCATTCCT GATGTATGCG
GAGAATACCT ATGCACCTTC TGTTCCTTAT CTGCAATGCC GCATTTTCGA CGTAGAGAAA
CCGGCAATCC CCCAGGGAAT AGAACAGGGT AAATATGATA TTGTGATTGC GGCAAATGTC
CTGCACGCTA CCCGTAACAT CCGCAGAACA TTGCGTAATG CAAAGGCGGC GATGAAGAAC
AGGGGCTTGC TGATGCTCAA TGAAATGAGT CACAATACCC TGTTCACACA CCTTACTTTC
GGTCTGCTGG AAGGATGGTG GTTGTATGAA GATGCACCGG TACGTATGGC AGGATGCCCC
GGATTGTCTC CGGAGTCCTG GAAAAATGTA TTTACTGCGG AAGGTTATGA CAAGGTATTT
TTCCCTTCAG CTTATGGGAT TGCGCTTGGC CAGCAGATCG TAGTGGCGGA GAGTAATGGT
ATTATCAGAC AGCCAAGACC CTCGTCTGTT ACCACTAACA GTGATTCCTC TTCTCCGGCG
CCCCGTTTGC CGGCAGGTAA AGCCGTTGCG GATACTACCA GCGTCCCTAA TGGAGACGGT
GACGGATTAA AGGGGAAAAC AGTTGCTTAT CTGCAGCAGA TCATCGGAGA AACATTACGT
ATCCCGGCAG GTAATATTAA GCCAACTGTT GCGCTGGAAG ACTATGGCAT AGATTCGATC
ATCGTGGTGC AACTGAATAA GGCATTGGGA GAAGTATTTG CAGATCTCAG TAATACACTC
TTCTTTGAAT ACCAGACGGT CAATGAACTG GCCGCTTATT TTATAAAAGC CTATCCTGAT
CAACTGAAGG CGAAACTACA GCTGGATGCA GAGCAGGATG AAACACCTGT TTTATCTGAA
AAGGTAACAA AGCCGGTGAT CGCTAAAAAA GGCCCTGTTT TTAAGGGAGT AAAGAAAAAA
GAAACGGTAG CAGCGGTTAA AGAAACAACA GACCTTCCTG TAAGAGAGCC CATTGCTATT
ATTGGTATCA GTGGACGTTA TCCTGGAGGA GATAACCTGG CCGCTTTCTG GGAAAGCCTT
AAAAACGGTA AGGACAGCAT CAGCGAAATA CCGGCAGACC GCTGGTCGTT GGATGACTTT
TATGTACCTG ATAAAACGGC TGCACTGGAG AATGGAAAGA GTTATAGTAA GTGGGGCGGA
TTTATCAACG GACATGCCGA TTTTGATCCG CAGTTTTTTA ATATTTCCCC AAGGGAAGCC
GTGAATATTG ATCCACAGGA GCGTTTGTTC TTACAGGCGT GCTGGGAAGT GATGGAAGAT
GCCGGTTATA CCCGGCGTCA GATCGCAGAG CAGTTCGCCC ACCGCGTAGG TGTATTCGCT
GGCATTACCC GTACGGGATT CGATTTGTAC GGACCGGAAT TATGGCGCAA GGGTGCTGCT
TATTTCCCAC GTACGTCTTT TAGTTCGCTG GCCAACAGAA CGTCTTATTT ACTCAATCTG
CGCGGCCCCA GTATGCCGGT GGATACCATG TGTTCTTCCT CACTGACGGC TATACATGAG
GCATGTGAGC ATCTGTATAG GAACGAGTGT GAAATGGCGA TCGCCGGTGG GGTGAACTTA
TACCTGCATC CTTCCAGTTA TGTACTGTTC TGTTCACAGC AAATGCTGGC TGCAGATGGT
AAGTGTAAGA GTTTTGGGGA GGGAGGTGAT GGTTTCGTAC CTGGTGAGGG AGTGGGGGTT
GTACTATTGA AAAAGCTGTC GCTGGCAGAA AGAGACAATG ATCATATCTA TGCTGTTATA
AAGGCTTCGG GTGTAAATCA TGGCGGGAAG ACCAATGGAT ATACCGTGCC TAATCCTGTG
GCACAGGCTG CATTGATCAG TGAAACAATT ACTAAAGCTG GTATTGAGGC AGAAGCAATC
AGTTATATAG AAGCGCATGG CACAGGCACT GAATTGGGAG ACCCGATTGA GATCACCGGT
TTAAGCCAGG GATTTGCTTC AACAGAGAAC CAGTTCTGTT CCATAGGTGC TGTCAAATCC
AATATCGGAC ATTGTGAAGC TGCTGCAGGT ATTGCGGGCA TTACAAAAGT AGTGCTGCAA
ATGAAACATG GATTGATAGC GCCTACATTA CATGCAGAGA CACTTAATCC TAAGATAAGA
TTTGAATCGA CTCCCTTTGT CGTGAAGCGC GAATTGACGG AGTGGAAACG ACCCGTAAGG
GAAAAGCATG GCCTGATGCA GGAAGTGCCG AGAATAGCAG GTATATCTTC TTTTGGCGCT
GGTGGATCTA ATGCACATGT ATTGGTGCAG GAGTATATTT CGCCTGCAAG AGAACAGGTG
GTGTATGACC AGTTGTTAAT CGTATTATCT GCCCGCACAA CAGCACAATT GCAGGAAAAG
GTCCGGGATC TTTTACAATT TATTGATAGT GAGGGCCAGT CCGCAGAACT GGCTGATATT
GCCTATACAC TTCAGACGGG GAGGGAAGAA ATGGAGCAGA GAGCTGCATT TGTCTGTTCG
TCTGCAGATA CACTCAGGCA GACACTCGCT GGTTATCTGC AATCAGATGG ACAAGCTGCC
GGTATTTATC GCGGACAGGT GAAGAAAGGG CAATCATCAA CAGTCACCTT TACCAATGAT
GAAGATTTCA GAGATACGCT GACTACATGG ATCAGCAAAG GTAAACTGGA TAAACTGGCT
GAGCTATGGA CTGGTGGTCT GGAGATTGAC TGGTCTTTGT TGTATGAAGA GGGAATGCCC
CGTAGGATTA GTCTGCCAGT CTATCCATTC GCCCGTGAAC GATATTGGTT CCCTGAGTTG
GATAAGCAAG ATAATGGTCA GATAGCGGAT AAGCCCTCGA TTATAAAATC AGCTGTTAAA
ACAGATACCC ATGAACTGCT GACCTTTGAA GAGCTTTGGG TACCAGCAGG TCTTTCAACT
CAGATACTGC CGACAGCCGG CGTATTAATA TGCTGTGTGG ATAATGCCGC CAGTATAGCA
GCAATATCCA CATCTGTAGT ATCACTATTG CCTGCTGCAA AATTGTTGTT TGTAAGCAAT
GATAGTATAG CGGGGCAGGA GCATGTATAC CAGCTTGGTA AGGATAAAGC AGCTGGTTAT
AAGACCGTAT TCCATAAGAT CAAAGAAACA TACGGTGCTG TAGACGGCAT ATTATATACA
GGCGCTGGTA ATGAGGCTGC CTATGAAGAT ATTCTCTACC TGTTGCAAGG CATACTTTCA
TCGGGATTAA AGACCAGCCG CTTATTACTG GAAGGACGTT ATTCCTCAGC GCTTGAGCAA
TGTTATACAT TTTCCTGGAT AGGTTATGCG CGGTCACTGG GCTTTGTATT GCCGGGTATG
GATATACATG TTGTATTACA TGCTGCAGCG CATAGCCTGA ACATGCAGAC ATTACTGGGA
GAATTATGGT ATAACGGAAG TCATGTAGCA CATTATGAAA ATAATGTCCG TACTATTCCT
CAGCTGGAAG AACGCACGAC AGAAACATCC GCAGTGCCCT TGTTGAAACA AGGTGGGACT
TATATTGTTA CAGGCGGTTT CGGCGGACTG GGATTATTAT TCAGCAGTTA TTTAAGCAAG
ACCTGGAAGG CGAATGTGAT AATGACGGGT CGTCGTCGTT TATCGACGGA AGAAGAGGCG
AAAGTAGCTG CTATACAGGG TAATCAGAAT AAAGTTGTCT ATGTACAGGC AGATGTGAGT
GATGCGGCTG CTATGCAAAA TGTACGGACT ACTGCAAAGC AGATTACGGG TGAGATATCA
GGTATATTAC ATATAGCCGG TGTACAGAGT CATACGACTA TCGGTGACAA ACAGTATGAA
GACTTTAAGA CAGTATTGTC TTCAAAGATC AGGGGAAGCC AGGTATTAGA CGAAGTGTTT
GGCCAGGAAG CTCTTGACTT TGTATGTTAT TTTTCTTCTT CCTCTGCTGT ATTGGGTGAT
TTTGGTTCCT GTGACTATGC CATTGCTAAC CGCTTTCAGA TGAGTTATGG CGCCTTACGT
CGGGCAGCTG GTTTTAATGG TGTTACAACT GTCATCAACT GGCCATTGTG GCGTGAGGGA
GGAATGAGTC TCGGTGAAAG TAGTTCGCTG GATCTATATC TCAGGAGCAG TGGTCAGAGT
TACCTGGAAG CGGCAATGGG ATTATCAACT TTCGAATCAT TGTTGCGTGC AGGTAATGTA
ACGCGTTTGG TGATGTATGG GGATAGATCC CGTATTTACC AGATGCCATT GTTGTCAGGC
GAAACGGAGT TACCTGTATC AACGATCGTA TCGGAGAATG GTGTTGGCCG CAGTGCAGAG
ATGCATGGCT GGTCAGTGGA TGAATGTTTG CTGTGGGACT TACGCCGCCA GGTGGGTGAA
TTACTGCAAC TAGGTATGGA AAAGGTGGCA GCAGATGTAA ACCTGGCTGA TTTTGGTTTT
GACTCGGTGA GTTTAATGCA GTTATCCAAG CGCCTGAGTG CCTACTATGA TATAGAAGTG
ACACCGGCCG TATTTTTTAG TTATGCTACA ATAGAGAAGT TACGGGAATA TTACTTAGGC
GAACACGCAG ACAAGATCAA TGCTTTCTAC AGTGAAGCCC AGACAGTAGT AGTCCCAGGA
AAACGTTCCA TACCAGTACG TCCGGTATCT CATACTGCTG CTACAGCCAG ACAAACGCCT
GTTTCAGGTA ATACTACAGA CGAACCTATA GCGGTAATTG GTATGAGCGG CCGTTTCCCA
CAGGCAGCTA CTATCGCTGA AATGTGGAAA TTGGTGGCAG CAGGAAAAAG TGCCATTGAA
GAGATACCGG CTACAAGATG GGACTGGCGG GAGTATCACG ATGAACAGGT GATGCCTGGT
AAGTCTAATT CCAAGTGGGG TGGTTTTATC CCGGATGTAG ACCAGTTTGA TCCTTTGTTC
TTTGAGATCG CACCGCTGGA AGCTACCTAT ATGGATCCGC GTCAGCGTTT ATTATTACAG
GAAGCCTGGG CTGCATTGGA AGATGCGGGT TATGGGCCGG CGCAGATCAA TAGCAATAAG
ATTGGAATGT TTGTCGGTTC TGAGGATGGT GAATACCAGA TACTGACCGG CGGACAGGGG
AGTATTACTT CCAACCATGC GGCTATCATG TCGGCAAGAC TCTCTTATTT CCTAAATCTC
GATGGACCCA ATATTAATGT CAATACAGCC TGTTCATCAG GGCTGGTCGC TTTACATCTT
GCCTGCCAGA GCTTACGTAG TAATGAATGT GATACGGCTT TAGCGGCAGG GGTGAACCTC
TTGCTGACAC CAATGTCTTA TGTGCAGATG AGTCAGGCTG GAATATTATC GCCTGATGGA
AAGTGTTTCA CTTTTGATAA ACGGGCTAAT GGCATGGTAC CCGGTGAGGC TGTTGCGGTA
GTGGTATTGA AGAAACTCTC TGCTGCCATC GCAGATGGAG ATCCGGTGTA TGCTGTTATC
GATGGCAGCG GTGTAAACTA TGATGGAAAA ACAAACGGTA TTACGGCTCC TAATGGTAAT
TCCCAACGGG CATTACTACA GGATGTATAT AGCCGGTACC ATATCGATCC TGCAGCAATT
GACTATGTCG TGGCGCATGG AACAGGTACT AAACTGGGTG ACCCGATTGA AGTAAACGCC
TTAGCACAGG CATTCAGGGG GTATACGGAT AAACAAGGCT ATTGTGCGAT CAGTTCTGTC
AAGACAAACT TCGGACATAC ATTCGCAGCT TCCGGACTGG TAAGCCTTAT TTCATTAGTG
AAAGCGGTAC AGGAAAAGGT AATCCCGGCA AATCTGCATT TTGAAGAACA GAATGAGTTC
ATTCACTGGA ATGGCAGTCC TTTTTACGTA CCTCGTCAGG CAGCGGCATG GCCGGAAGTT
AATGGGAAAG CAAGAACCGG CGCCGTAAGC TCATTCGGTA TGAGTGGTAC GAATGCGCAT
GTTGTGGTAA GCAGTTATTC AGGGGTGAAT GTTCCTGCAG CTGTTAGTAC AGCTCCGGTA
TTGCTGCTGT TGTCTGCAAA GACAGAAGAA GCATTACTGC GTAAGATGGA AGATATGATC
GCCTATCTGC AATCCGGTAG TGAAAACCTT TCCCAGGTCG CGTATACTTT ACAGGAAGGT
CGTCACCATT TTATCTATCG TGCAGCGATT GTTGTTCAAC ATCAACAGGA TGCGATACAG
ACATGGACCA GGGCTTTGAA CAAGGAGCAA CTACCACAGC TGTTCAGTAA TAAGGTATCC
CGTGATTTTA AAGCACAGGC AGCCTTAATG AGTTATGGTA CTGAGTTGTT GCGTAAATGC
GCCGACAATC AACTGCCGGC ACAACAGTTT CGTGAGTCTT TATATGCGCT CGCAGAACTT
TACTGCCAGG GCTACGATTT CCCTTGGCGT GAATTATACA GCGATGTGCC GCAGCGTTTA
CATCTGCCGG TATATCCATT TGAAAAGGAG CATTACTGGG TAGCAGAAAA ACAGACAGGC
AGTGGCAATA CGGGTGTTTC CCATATCGGC GGCTTGTTGC ATACCAACAC ATCTACGCTG
GATGGTCTTA CGTTTAGCTC CTCCTTTAAA GGAAATGAGT TTTTCCTTGC AGACCATGTC
GTAAAGGGTA AGAAGATCTT GCCAGGTGTG GCGCATATAG AGATGGCATA TGCTGCGCTG
AAGCAGGTGG CTGCGGAATA TGCGGCATCA GGCACCAGTG TGGTATTGAA GAATATTACC
TGGATGCGTC CTGCCATACA GGAAAAGGAA CCATTACAGA TGAAGATATC ACTGACCGCC
AATGAGGACG GACAGATCGA TTATCAGATT CAAAGCATGA CTGCTGATGG CCGTGAAAAG
ATCCTGAACA GTACCGGTAC CGCTTATATT GTACAAGAGA CAGTGCCATC TTCATATGAT
CTTGCTCAGT TGAGAAAAGA ACTGGGTAAT GCCATTGACG GAGAGATGGT ATACAATACA
TTCAAGGGAA TAGGACTTGA CTATGGCGCT TCCTTCCGGG GTATCAATGA ACTGTATACG
GATGGTGTAC AGGTATTGGC TCGCCTGTCA TTACCCGAAA ACCTTTTCTC TGCAGCATAC
AACCTGCATC CGGGAATGAT GGATGCTGCT TTGCAGTCAT TTATAGGGTT TGTATTTGGC
AGCATCGCTG ATCCCAAGGA ACTGGATATC CGGTCATTAA AACCAGCCTT GCCATTCGCG
TTGGATAGCA TCAGTATTTT TGGCGCCTGT CAGCCGGAAA TGTGGGCATT GACCCGTTTT
AGTGAAGGTT ACAGCGCAGC TGGTAGTATT CAGAAACTGG ATATAGATAT CTGTGACAAT
GCGGGAAATG TATTGATTAG TCTCAGCGGT TTTACAACCA GGACATTAAC AGAAGACGCC
GGAACTGATC ATAAGCAGGA TACCGGTACA CCAGTAGGTA AACTGTTGCT CCGACCAGTA
TGGAAGACTA ATCGTCAGCA GCAGGCAACT TTATATCCGG CGCTGGATGC AAAAGTACTT
GTAGTGCATA ATGGCGCTGA GCCGAAGGCG GTACTGGATG ACTATCCGGA TGCCCTTTTC
CTGGCAATCA GCGGTCAGGA GTCAGTGGCT GTTCTGGAAG AAAAATTGGC CGCTATCGGA
AAGATAGATC ATATCATCTG GCAATCTGCT TCATCCACCT CTTACACTGT AGATGCGGAT
GCCCTGATAA GTGCACAGGA AGGTATCGTC TACGCATTAT TCCGGTTTGT GAAGGCGTTA
CTAAAAGCAG GCTATGGCGC AGAAAAACTA GGTTGGACCA TTGTTACCAA ACAGAGTCTC
GCTGTTGCTG ATGATGAAGA GGTAAACGCT GCCCATGCTG CTATACATGG CCTTGCAGGC
GTTATGGCGA AGGAATATCC TAACTGGAAA GTAAGACTGG CAGATATCGA TAAGGACAAT
GAACTACCAC AAAGTCTTTT CCGGATGCCT GCCGATCCGC ATGGAAATGC ACTCGTTTAC
CGTGGCGGAC AGTGGAAGCA ACAGGAGCTG ATTGTTTATG AACAACAACA AAATGAAACG
ACGTCTTACA GACAGGAAGG TGTATATGTA GTGATCGGTG GTGCAGGTGG TATCGGGGAG
GTATGGACCG ATTATATGAT CAGTAAATAC AAGGCGCAGG TAATCTGGAT CGGAAGACGC
GCATATGATC AGACCATAGC AGCGCGTATC AGCAAACTGA GTATAAAGGG ACCTGCTCCT
GTTTATTATG CGGCTGACGC GGCCAGTTAT GCGTCATTAC ATGCTGTATA TGAACAGATC
AAACAGCGTT TTGGAACAAT AAATGGCATT ATCCACTCTG CCATTGTGCT CCAGGATCAG
GGGCTGGGAA ATATGACAGA GGAGAAGTTC CGTGCCGGAC TTTCTGCAAA GCTGGATGTA
AGTGTCCGGC TGGCGCAGGT ATTTGCCGCA GAGCAACTTG ATTTCGTGTT GTTCTTCTCT
TCCATGACCA CTTTTACCAA AGCACCCGGA CAGAGTAATT ATGCGGCCGG CTGTACGTTT
AAAGATGCCT TTGCATTACA GCTTGCCCAG ACCTGGGATT GCCCGGTGAA AGTCATTAAC
TGGGGTTATT GGGGTAGCGT CGGTATCGTA GCTGATGATA CCTATCGAGA ACGTATGGCC
TTGGCGGGAT TCGACTCGAT TGAACCCGTA GATGGAATGG CTGCCCTGGA AACATTACTG
GCGTCGCCAG TTGCCCAACT GGCCTTTATT AAAACCTGTA AGCCTCTACA GATGGAAGGG
ATTAAACTGT CAGAAAGAAT GACGGTATAT CCAGCTTCCA TACAAGTTAA TGCCACACAT
ACACCGAATG ATGTTACAGG TACCTTGAAG GAGATTGTAG CAGCTGCTAC GGGTATTATG
GCTGAAAATA TAGACATGCA GACGACATTT GGTGAGTATT GTGCTGATCC GGAGGTATTT
GCTGCAATAG CGGCCGGTAT CAGCGAACAG CTGGATATTC ATCTCAATGC GGGTCTGTTG
ATGTCTTTTA CGGATCTTGC AGAGTTGTGT AAATATGTAC AATCGCTGAC AGGGGTGAAT
ACGATTGATT ATAGTTCACC GGAAAGACTG GAACAGATCT GGCAAAAAGT GGGCGCGCTG
GCACTGGAAA TGGAACAACT CCTCGCCAGA CTACTATTTG TACATTTAAG AGACCTGGGC
TGCTTTACAA GCCCCGGAAC CTGCACGGTG TTGATGGAAA ATGGCCGGAT CGTCCCTGCT
TATAAAAGAT GGATGGAAGA AAGCCTGGCT GCATTAGTAA GAGCCGGACT TCTTACAACA
CAGACCGGAC AATATACAGT CGCTCAAGGA GCGGCACAAT TACCAGCGGA AGCTGTTCAC
CAGGAGTGGG CGGCGAAACG GTTATTATGG TTAGAGAATC CGAACCTGAA AAACCAGGTA
GTATTGGCTG ACGCTACGAT GGGTGCACTG CAACAGATCA TTACAGGACA AGTTCCGGCA
ACAGCAATCA TGTTCCCTGG TTCATCCATG GAGATGGTAC AGGGTGTATA CAAAGACAAC
CTTGTGGCCG ATTACTTTAA TGAACTGATG GCAGTTGCAC TGGTGAATTA CCTGGAACAA
CGAAAGTCAA CAGATCCGAA TCTCCGTCTC CGTATACTGG AAATTGGTGC AGGTACCGGT
GGAACCAGTG CCATGATCTT CGGCAAACTC AAGCCTTATC AGCAGCACAT CGCTGAATAC
TGCTATACAG ATATTTCCAG GGCATTCCTA TTACATGCAG AAAAGGCGTA TCTGCCAGAT
AATCCTTATA TCAAGACCAG GATATTTGAT GTAGAGAAGC CGATACAGCC ACAGGACATT
GAAGCAGGTG TGTATGATGT GGTGGTCGCT ACGAACGTAT TACACGCTAC TTCCAACATT
AGTAATACCC TGCGTAATGC CAAGGCGGTC TTACAGACAG GAGGCTTGCT CTTGTTAAAT
GAGTTGGCTG CAAGTTCTTT ATTTACACAC CTGAGCTTCG GATTATTAGA CGGCTGGTGG
TTATATGAAG ATCCGGAGGT GAGACTGTCT GGTAGTCCGG TACTGGTAGC AGACAGCTGG
AGAAAACAGT TGGCCTATGA AGGTTTCAAA GCATTGCAGT TCCCTGTAAA GGCAGCACAT
CATCTTGGTC AGCAGATTAT CATTGCTGTA AGCGATGGTA TTGTTCGTCA GCTTGTTAAT
AAACCAGCTG TTGCCGCTCC TGCTGCTGCA CCGGCGAAAC CGCCTGTAAA GAAAGCAACC
CCAGTACAAC AATCTGTAGC CAGCAACAAA CGCGCTGATA AGAAAGCGGG TCTGGAAGAG
AAAGCCCTGG CTTATTTCAA AGACCTGGTA GGTGGTGTGC TGAAGATTCC TGCACACAAA
ATAGATGTCA ATGCTTCGTT CGAATCTTAT GGTATAGATT CCATTCTTGT TGTACAGCTG
AACAATGCAT TAAAAGAAGT ATTCGGAGAA GTATCAAGTA CGCTGTTCTT TGAATACCAG
GATATCCGCT CCCTCAGCGC TTATTTTATA GATACACAGA GAGAAGCGCT CACAAGCTTA
CTGGGAGATG ATATACCGTC TGGTATTCAT AGTCCGCGGG AAGTATCCCC GGTGAATGAT
GCACCAATTG CTATTCCGGT AGCATTTGGG CGGAAAGCAG GTACAACACC TGTGCCATCA
GCCAATGGTT TTACAGAACA GGTAAACACG ACCATGCCCA TTGCTATTAT CGGTATCAGT
GGGAAATATG CACAGGCAGC TTCGCTGGAT GCTTTCTGGG AAAATCTGCA GACAGGTAAG
AATTGTATTA CGGAGATACC GGAAGAGCGC TGGAACTGGC GTACACACTT CGATGAAGAA
AAAGGCAAAT GGGGTACTAC TTATTCCCGT TGGGGTGGTT TTATTCCTGA TATAGATCGT
TTTGATCCTT TGTTCTTTAA TATATCTCCT ATAGAGGCAG AACGTATTGA TCCACAGGAA
AGACAGTTCT TAGAGACGTC CTACAATGCG ATTGCTGATG CAGGGTATAC GCCCGCTAAG
TTGGCGGCTG ATCGGAAAGT GGGCGTGTTT GCCGGTATTA TGAACGGCAA TTACATCACC
GGACCGAGCT ATCATTCTGT AGCTAACCGT GTGTCTTATG TGATGAACTT TCAAGGGCCA
AGTCTGGCGG TAGATACGGC TTGTTCGTCC TCACTGACAG CTATTCATCT GGCGATCGAC
AGTATACGCG GCGGCTCCTG CCATTGCGCG ATAGCAGGTG GTGTAAACCT GATCGTAGAT
CCTGTACACT ATATGCGATT GTCAGCCGCA GGTATGTTGT CGGCTGGCGA TCAGTGTAAG
GCATTTGGTG ATGGGGCCGA TGGCTTTGTA GACGGAGAAG GCGTAGGCGC AGTGGTACTG
AAACCTCTGG CCAGGGCAAT AGCAGACGGA GATCATATCT ATGGTGTGAT ACAGGGCTCC
GCTATTAATG CAGGTGGCCG TACGAATGGC TATACCGTAC CTAATCCTGT AGCGCAGGCA
CAGGTCGTGG CAGATGCACT GGATCGTGCG GGTATTGATG CCCGGACAGT CAGTTATGTA
GAAGCACATG GCACTGGTAC TGTGTTGGGT GATCCTATAG AAGTAAATGG TCTTACACGG
GCATTTGCAG CAACGACGGA TGATAAACAA TTCTGTGCCA TTGGTTCTGT AAAAACAAAC
ATCGGCCATT GTGAAAGTGC TGCAGGTATT GCGAGTCTGA CGAAAGTACT GTTACAGATG
AAACACGGTA AGCTGGCCCC ATCACTCCAT GCGGCTCAGC CTAATCCCAA TATCAATTTT
GCCAATACGC CATTTAAAGT ACAGGCACAG TTGTCAGATT GGTCTTCACC AAGAATAGCA
GGTATTTCTT CTTTCGGCGC TGGTGGCGCT AATGCACACC TGGTTGTTAC TGAGTATATA
CCGGAACCAA TAGTTGAGAC ACAGCAAATG CCTGTGATGA TTGTACTTTC TGCCCGCACA
GCGGAGCGTT TACAGGCACA GGTGAGTCAG CTGCTGGATG CCATATCCGT TGATGGTTTT
AATGAGCCTC TTGCTGCGAT TGCTTACACA TTACAGGTGG GTAGAGAAGC ATTGGAAGAA
AGGCTTGCAT TGGTCGCAGG TTCTGTCGAA GCGTTGAAGC AACAATTACA GGCGTATCTG
CAGGGAAATC ATACGGCGAT CTTCAGAGGG CAGGTAAAGC CTAACAAGGA TATTGTCGCT
GTTTTCTCTT CGGATGAAAC GCTGGCTCAG GTAGCCGATC AGTGGTTGTT ACAACACAAT
GATGCAAAGG TGCTGGAATG GTGGGTAAAA GGATTACAGA TCAACTGGGA GATCCTGTAT
ACAACGGATC GTCCCCGCAG GATATCATTA CCAGGTTACC CATTTGCGGG TGAAAGGTAC
TGGCAGGCTG CGTTGCCGCT TAGTGTTAAA GCCACAGCTC CCCTCCCAGC GAGAGATGAT
AAAGACTATT ACGATTTGCT TGATGCTGTG CTGAGCGATG AGCTGAGTGT ACAACATGCC
ACTAACGAAA TTGTAAAAAT GCTTAACTGA
 
Protein sequence
MNVNLGVNKA EIITALPATV ITVIDKLEEH IPDTNEVVSN MKTESASYLS VLDKLLTELL 
WWQLKKSDVL KASGLIRSEY HRPQQLPAFY ERWLDETLRV LVSAGYLEKE GNKFTPTGKG
GASKDAWSAW ETAKVTDLSD VNVKAQVALL DATMKALPDI LSGKVPATDV LFPEGSMKLV
EGIYKHNKVA DYFNEVLSET LTAYCQERIR QDKNARINIL EIGAGTGGTS AILFKQLAPF
REHIESYCYT DLSRAFLMYA ENTYAPSVPY LQCRIFDVEK PAIPQGIEQG KYDIVIAANV
LHATRNIRRT LRNAKAAMKN RGLLMLNEMS HNTLFTHLTF GLLEGWWLYE DAPVRMAGCP
GLSPESWKNV FTAEGYDKVF FPSAYGIALG QQIVVAESNG IIRQPRPSSV TTNSDSSSPA
PRLPAGKAVA DTTSVPNGDG DGLKGKTVAY LQQIIGETLR IPAGNIKPTV ALEDYGIDSI
IVVQLNKALG EVFADLSNTL FFEYQTVNEL AAYFIKAYPD QLKAKLQLDA EQDETPVLSE
KVTKPVIAKK GPVFKGVKKK ETVAAVKETT DLPVREPIAI IGISGRYPGG DNLAAFWESL
KNGKDSISEI PADRWSLDDF YVPDKTAALE NGKSYSKWGG FINGHADFDP QFFNISPREA
VNIDPQERLF LQACWEVMED AGYTRRQIAE QFAHRVGVFA GITRTGFDLY GPELWRKGAA
YFPRTSFSSL ANRTSYLLNL RGPSMPVDTM CSSSLTAIHE ACEHLYRNEC EMAIAGGVNL
YLHPSSYVLF CSQQMLAADG KCKSFGEGGD GFVPGEGVGV VLLKKLSLAE RDNDHIYAVI
KASGVNHGGK TNGYTVPNPV AQAALISETI TKAGIEAEAI SYIEAHGTGT ELGDPIEITG
LSQGFASTEN QFCSIGAVKS NIGHCEAAAG IAGITKVVLQ MKHGLIAPTL HAETLNPKIR
FESTPFVVKR ELTEWKRPVR EKHGLMQEVP RIAGISSFGA GGSNAHVLVQ EYISPAREQV
VYDQLLIVLS ARTTAQLQEK VRDLLQFIDS EGQSAELADI AYTLQTGREE MEQRAAFVCS
SADTLRQTLA GYLQSDGQAA GIYRGQVKKG QSSTVTFTND EDFRDTLTTW ISKGKLDKLA
ELWTGGLEID WSLLYEEGMP RRISLPVYPF ARERYWFPEL DKQDNGQIAD KPSIIKSAVK
TDTHELLTFE ELWVPAGLST QILPTAGVLI CCVDNAASIA AISTSVVSLL PAAKLLFVSN
DSIAGQEHVY QLGKDKAAGY KTVFHKIKET YGAVDGILYT GAGNEAAYED ILYLLQGILS
SGLKTSRLLL EGRYSSALEQ CYTFSWIGYA RSLGFVLPGM DIHVVLHAAA HSLNMQTLLG
ELWYNGSHVA HYENNVRTIP QLEERTTETS AVPLLKQGGT YIVTGGFGGL GLLFSSYLSK
TWKANVIMTG RRRLSTEEEA KVAAIQGNQN KVVYVQADVS DAAAMQNVRT TAKQITGEIS
GILHIAGVQS HTTIGDKQYE DFKTVLSSKI RGSQVLDEVF GQEALDFVCY FSSSSAVLGD
FGSCDYAIAN RFQMSYGALR RAAGFNGVTT VINWPLWREG GMSLGESSSL DLYLRSSGQS
YLEAAMGLST FESLLRAGNV TRLVMYGDRS RIYQMPLLSG ETELPVSTIV SENGVGRSAE
MHGWSVDECL LWDLRRQVGE LLQLGMEKVA ADVNLADFGF DSVSLMQLSK RLSAYYDIEV
TPAVFFSYAT IEKLREYYLG EHADKINAFY SEAQTVVVPG KRSIPVRPVS HTAATARQTP
VSGNTTDEPI AVIGMSGRFP QAATIAEMWK LVAAGKSAIE EIPATRWDWR EYHDEQVMPG
KSNSKWGGFI PDVDQFDPLF FEIAPLEATY MDPRQRLLLQ EAWAALEDAG YGPAQINSNK
IGMFVGSEDG EYQILTGGQG SITSNHAAIM SARLSYFLNL DGPNINVNTA CSSGLVALHL
ACQSLRSNEC DTALAAGVNL LLTPMSYVQM SQAGILSPDG KCFTFDKRAN GMVPGEAVAV
VVLKKLSAAI ADGDPVYAVI DGSGVNYDGK TNGITAPNGN SQRALLQDVY SRYHIDPAAI
DYVVAHGTGT KLGDPIEVNA LAQAFRGYTD KQGYCAISSV KTNFGHTFAA SGLVSLISLV
KAVQEKVIPA NLHFEEQNEF IHWNGSPFYV PRQAAAWPEV NGKARTGAVS SFGMSGTNAH
VVVSSYSGVN VPAAVSTAPV LLLLSAKTEE ALLRKMEDMI AYLQSGSENL SQVAYTLQEG
RHHFIYRAAI VVQHQQDAIQ TWTRALNKEQ LPQLFSNKVS RDFKAQAALM SYGTELLRKC
ADNQLPAQQF RESLYALAEL YCQGYDFPWR ELYSDVPQRL HLPVYPFEKE HYWVAEKQTG
SGNTGVSHIG GLLHTNTSTL DGLTFSSSFK GNEFFLADHV VKGKKILPGV AHIEMAYAAL
KQVAAEYAAS GTSVVLKNIT WMRPAIQEKE PLQMKISLTA NEDGQIDYQI QSMTADGREK
ILNSTGTAYI VQETVPSSYD LAQLRKELGN AIDGEMVYNT FKGIGLDYGA SFRGINELYT
DGVQVLARLS LPENLFSAAY NLHPGMMDAA LQSFIGFVFG SIADPKELDI RSLKPALPFA
LDSISIFGAC QPEMWALTRF SEGYSAAGSI QKLDIDICDN AGNVLISLSG FTTRTLTEDA
GTDHKQDTGT PVGKLLLRPV WKTNRQQQAT LYPALDAKVL VVHNGAEPKA VLDDYPDALF
LAISGQESVA VLEEKLAAIG KIDHIIWQSA SSTSYTVDAD ALISAQEGIV YALFRFVKAL
LKAGYGAEKL GWTIVTKQSL AVADDEEVNA AHAAIHGLAG VMAKEYPNWK VRLADIDKDN
ELPQSLFRMP ADPHGNALVY RGGQWKQQEL IVYEQQQNET TSYRQEGVYV VIGGAGGIGE
VWTDYMISKY KAQVIWIGRR AYDQTIAARI SKLSIKGPAP VYYAADAASY ASLHAVYEQI
KQRFGTINGI IHSAIVLQDQ GLGNMTEEKF RAGLSAKLDV SVRLAQVFAA EQLDFVLFFS
SMTTFTKAPG QSNYAAGCTF KDAFALQLAQ TWDCPVKVIN WGYWGSVGIV ADDTYRERMA
LAGFDSIEPV DGMAALETLL ASPVAQLAFI KTCKPLQMEG IKLSERMTVY PASIQVNATH
TPNDVTGTLK EIVAAATGIM AENIDMQTTF GEYCADPEVF AAIAAGISEQ LDIHLNAGLL
MSFTDLAELC KYVQSLTGVN TIDYSSPERL EQIWQKVGAL ALEMEQLLAR LLFVHLRDLG
CFTSPGTCTV LMENGRIVPA YKRWMEESLA ALVRAGLLTT QTGQYTVAQG AAQLPAEAVH
QEWAAKRLLW LENPNLKNQV VLADATMGAL QQIITGQVPA TAIMFPGSSM EMVQGVYKDN
LVADYFNELM AVALVNYLEQ RKSTDPNLRL RILEIGAGTG GTSAMIFGKL KPYQQHIAEY
CYTDISRAFL LHAEKAYLPD NPYIKTRIFD VEKPIQPQDI EAGVYDVVVA TNVLHATSNI
SNTLRNAKAV LQTGGLLLLN ELAASSLFTH LSFGLLDGWW LYEDPEVRLS GSPVLVADSW
RKQLAYEGFK ALQFPVKAAH HLGQQIIIAV SDGIVRQLVN KPAVAAPAAA PAKPPVKKAT
PVQQSVASNK RADKKAGLEE KALAYFKDLV GGVLKIPAHK IDVNASFESY GIDSILVVQL
NNALKEVFGE VSSTLFFEYQ DIRSLSAYFI DTQREALTSL LGDDIPSGIH SPREVSPVND
APIAIPVAFG RKAGTTPVPS ANGFTEQVNT TMPIAIIGIS GKYAQAASLD AFWENLQTGK
NCITEIPEER WNWRTHFDEE KGKWGTTYSR WGGFIPDIDR FDPLFFNISP IEAERIDPQE
RQFLETSYNA IADAGYTPAK LAADRKVGVF AGIMNGNYIT GPSYHSVANR VSYVMNFQGP
SLAVDTACSS SLTAIHLAID SIRGGSCHCA IAGGVNLIVD PVHYMRLSAA GMLSAGDQCK
AFGDGADGFV DGEGVGAVVL KPLARAIADG DHIYGVIQGS AINAGGRTNG YTVPNPVAQA
QVVADALDRA GIDARTVSYV EAHGTGTVLG DPIEVNGLTR AFAATTDDKQ FCAIGSVKTN
IGHCESAAGI ASLTKVLLQM KHGKLAPSLH AAQPNPNINF ANTPFKVQAQ LSDWSSPRIA
GISSFGAGGA NAHLVVTEYI PEPIVETQQM PVMIVLSART AERLQAQVSQ LLDAISVDGF
NEPLAAIAYT LQVGREALEE RLALVAGSVE ALKQQLQAYL QGNHTAIFRG QVKPNKDIVA
VFSSDETLAQ VADQWLLQHN DAKVLEWWVK GLQINWEILY TTDRPRRISL PGYPFAGERY
WQAALPLSVK ATAPLPARDD KDYYDLLDAV LSDELSVQHA TNEIVKMLN