Gene Sare_1246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1246 
Symbol 
ID5704871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1395747 
End bp1411211 
Gene Length15465 bp 
Protein Length5154 aa 
Translation table11 
GC content72% 
IMG OID641270761 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001536142 
Protein GI159036889 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.262553 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCACCG ATCTCATCAA GCCGCTGCCC GTCGCACTTC AGGAGAACGC GGCCCGCTTC 
CCCGGCAAGG TGGCCTTCGA GGACGATCGC CGAGCGGTGA CGTATGCCGA TCTCGAGGCA
CGGACGCGCC GGCTCGCCGG GCATCTGAGA GGCCTCGGCG TCAAACGCGG CGACCGGGTA
GCTATCTGGC TACGTCAATC TGTGTCGACC GTGGAAAGCT ACCTGGCGGT CGTGCGCGCG
GGTGGCGTCG GGGTGCCGCT CAACCCCGAC GCCGCGCAGG CCGAGCTGGA GTACCTGCTG
TCCGACAGCG GCGCCACGGC GGTCATCACC GATGCCGTCC AGGCGCAGCG GCTCCGGCCG
ACACCACACC GTGCGCTGGT GGTGACGGGT GACGTCCCGG CGGGAGCGCT GTCGTACGAC
GAACTCGCCG TCAGCGAGCC GGAGCAGCCA GCCGGGGACG ACCTCGGCCT CGACGACGTG
GCCTGGATGT TCTACACCTC GGGGACGACC GGTCGGCCCA AGGGCGTCCT GTCCACGCAG
CGCAACTGCC TCTGGTCCGT CGCCTCGTGC TATGTGCCGA TCCCCGGGCT GACGGACCAG
GATCGGGTGC TCTGGCCGCT CCCGCTGTTC CACAGCCTCT CGCACATCGC CTGCGTGCTG
TCCGTCACCG TGGTCGGCGC GACCGCCCGG ATCATGGACG GTAGTTCCGT ACAAGACGTG
ATGCGGGCCC TGCAGCAGGA GGAGCCGACG TTCCTGGCCG GTGTGCCCAC GACCTACCAG
CAGTTGGTGT CGGCCGCCAG GAGGCACGGC TTCACCGCAC CGAGCCTGCG GATCGGCCTG
GCCGGAGGAG CCGTTCTCGG GGCGGAGCTG CGCCAGGAGT TCGAGGAGAC CTTCGGGGTG
CCGCTGGTCG ACGCCTACGG CAGCACCGAG ACGTGCGGGG CGATCACCAT CAACCCGCCG
GACGGCCCGC GGATCAACGG GTCCTGCGGA TTGCCCGTGC CGGGTGTCGG GGTCCGCATC
GTCGACCCGA CCACCGGGGG AGACCTCCCC GCTGGCGCTG AGGGGGAGGT GTGGGTCAGC
GGGCCGAATG TCATGGTCGG CTATCACAAC AGTCCGGAGG CGACCGCCAA AGCGATGCGA
GACGGCTGGT TCCGCACGGG CGACCTGGCC CGCCGTGACG GCGCCGGCTA CCTCACCATC
AGCGGCCGGA TCAAGGAGCT GGTGATCCGC GGTGGGGAGA ACATCCACCC CGTCGAGGTG
GAAGCGGTGC TGCGGACCGT CCCAGGTGTC GCCGACGTCG CGGTGGCTGG CGTGCCGCAC
GAGACCCTCG GCGAGGTGCC GGTCGCGTAC GTGATCCCCG GCCCGGATGG GTTCGACGTC
GAGTCCCTGG TGACCCGCTG CCGCGAGCAG CTCTCCGCCT ACAAGGTGCC GCACCAGGTC
CACGAGGTCG CGAGCATCCC CCGGACCGCC TCGGGCAAGG TTCAGCGGCG GCTCTTGGTG
GAGCAACCCG GCCGGTTGAC GTACGCGGCA GTCGATCATC ACGACACGTA TCCGAGCGTG
GACGAGCCTG ACGCCGCCAA CACGGCAACG CTGCGCGACC AACTCGCTGA TTTGGACGAG
CGCGCGCAGC TGGCACTGCT GGAGGACCTG GTCCGAACGC AGACCGCGGC GGTGCTGCGC
CAACCGGGGC CCGACGCGGT TCCGGCCGGC CGGGCGTTCC GTGATCAGGG TCTCACCTCG
ATGGCCATCG TCGAACTGCG CAATCGGCTG ATGTCGCGCA CCGGGCTGCG GCTGCCGACC
AGCGTCGTCT TCGACCACCC CACCCCGACG GCGTTGGCCG CCTGTCTCCG GGCCGAGGTG
CTGGGAATCA CCCAGTCCGT CGCCGAGCCG GCCCTGGTGG CCGACCCCAC GGAACCGATC
GCCATCGTGG GGATGGCCTG CCGGCTCCCC GGTGGCGTGG CAGATCCGGA CGATCTGTGG
AACCTGGTGG CCGGCGAAGT CGACGCCGTC TCCGAGTTTC CCGGCGACCG GGGCTGGGAC
CTCGACCGCC TGTTCGATCC CGACCCCGAC CGCGCCGGCA CCTCGTACAC GGGTCAGGGC
GGGTTCCTGC CCGAGGCGGG GCTGTTCGAC GCGGCATTCT TCGGGATCTC GCCGCGTGAG
GCACTGGCCA TGGATCCGCA ACAGCGACTG CTGCTGGAAA CCTCGTGGGA GGCGCTGGAG
CGGGCGGGCA TCGACGCCCT GTCGCTGAAG GGCGCGGACG TCGGCGTGTT CGCCGGTGTT
TTCGGCCACG GCTACGGAGT CGGCACCGAC GCGGCGGAGC TGGAAGGATT CGCGACCACC
GGGTCGGCCG CCAGCGTGGC ATCGGGCCGG ATCTCCTACG TGTTCGGCTT CGAGGGGCCG
GCGGTCACCG TGGACACGGC CTGTTCGTCG TCACTGGTGG CGATGCACCT CGCCGCGCAG
GCGCTGCGGC AGGGCGAATG CTCCATCGCG CTGGCCGGCG GTGCGACCGT GATGGCCACG
CCCAGTACCT TCGTGGAGTT CTCCCGCCAG CGAGCGCTGG CACCCGACGG GCGGTGCAAG
GCGTACGCGG CCGCCGCCGA CGGCACCGGT TGGGCCGAGG GCGCGGGTGT GGTCGTACTG
GAGCGGCTGT CGGTGGCCCG CGAGCGCGGG CACCGAATTC TGGCCGTGCT GCGGGGCAGC
GCGGTGAACC AGGACGGCGC GTCGAACGGT CTGACGGCGC CGAACGGGCT GGCGCAGCAG
CGGGTCGTCC GGCGGGCTCT CGCCGCCGCC GGCCTCTCCC CATCCGAGGT GGACGCCGTC
GAGGGGCACG GTACCGGGAC CACCCTGGGT GATCCGATCG AGGCGCAGGC GTTGCTGGCG
ACCTACGGCA AGGACCGTGC CCCAGAGCGT CCGCTGTGGT TGGGCTCGCT GAAGTCGAAC
ATCGGGCATG CGCAGGCTGC CGCAGGTGTC GCCAGCGTGA TCAAAATGGT GCAGGCGTTG
CGGCACGGGA TCCTGCCGGC GACGCTGCAC GTCGACGCGC CCACTCCCGG CGTGCACTGG
TCGGCGGGTG CCGTGCAGCT GTTGACGGAG TCGCGTGACT GGTCGCGCAA CGGTCGTCCG
CGCCGGGCCG GGGTGTCGTC GTTCGGAATC AGCGGAACGA ATGCCCACGT GATCCTCGAG
GAGGCGCCCG AGGAACCAGC AGCCGACGAG GTGTCCGAGG AACCAGCAGC CGACGACGCG
CCCCGGGTGG GCGTGATGCC GCTGGTCGTG TCGGCGCGGA GTGTGGCGTC GCTGGCCGGC
CAGGCAGGTC GGCTGGCCGC GTTCGTGGAA GGCGTTGGGC CGCCGATGCC CGATCTGGCC
GGCGCGTTGG CGAAGAACCG GGCGGTGTTG GACGAGCGAG CCGTCGTGCT GGCCGGCTCC
GGCGGCGAGG CGGTGACCGG CCTGGCTGCC CTGGCCCGCG GTGAACCCGC GGTCGGTGTC
GTGACCGGCC GCGCCGACGT CGAGGGCAAG GTGGTGTGGG TGTTCCCGGG GCAGGGGACA
CAGTGGGTCG GGATGGGCCG GGAACTGTTG GACTCGTCCC TGGTGTTCGC CGAGCGGATC
GCCGAGTGCG CCGCCGCGTT GGAGCCGTGG GTCGACTGGT CCCTGGTGGA CGTGTTGCGC
GGTGACGCCG ATCCCGGGTT GATGGACCGG GTCGACGTGG TGCAGCCGGC GACGTTCGCG
GTCATGGTGG GTCTCGCGGC GGTGTGGTCG TCGGTGGGGG TGCGGCCGGA CGCGGTCGTG
GGCCACTCGC AGGGTGAGAT CGCGGCCGCC TGCGTGGCAG GTGCGCTGTC CCTCGCGGAT
GCGGCGCGGG TGGTGGCGTT GCGCAGTCAG GCTATCGCCT CGGTGCTGTC AGGCCGTGGT
GGTATGGCTT CCGTCGCGTT GAGCGAGGAA GACGCGGCGG CGCAACTGGC GCCCTGGGCG
GACCGCGTGG AGGTCGCCGC AGTCAACGGG CCCTCCTCGG TGGTGATCGC CGGTGACGGC
CCGCCGTTGG ACGAGGTGCT CGAGTCACTG GCGCACCGGG CTGTCCGGGT GCGGCGGGTG
GCGGTGGATT ACGCCTCGCA TACGCGTCAG GTGGAGGACA TCGAGGGTGT GTTGGCTGAG
TCGTTGGCTG GGATTGGTGC GGTGGCGCCG GTGGTGCCGT TCTTTTCCAC GGTCGGTGGT
GGGTGGGTTC GGGGGGCTGG GGAGGTTGAT GGTGGGTATT GGTATCGCAA TTTGCGTGGT
CAGGTGCGGT TTGGTGCGGC TGTTGCGGAG CTGATTGGTC AGGGGTACGG GGTGTTTGTG
GAGGTGAGTG CGCACCCGGT TTTGGTTCAG TCGATTGTGG ATGTTGTTGA TGGTGTTGAG
GCTGATGTGG TGGTGGGGGG TAGCCTGCGG CGCGACGACG GTGGTGTGCG GCGGCTTGTG
ACGTCGATGG CTGAGCTGTT TGTCCGTGGG GTGCCGGTGG ATTGGAGTGG TCTGCTGCCG
CCGGTCGTCG GCTGGGTGGA TCTGCCGACG TATGCCTTCG ACCACCGGCA CTACTGGCTA
CCGCCGGTGG AATCGGCGAC CGACGCGAGG TCGCTGGGAC AGGTGGCGAC TGACCACCCG
TTGCTGGATG CGGTCGTCGG CTCCCCGCGG TCGACCGGAT TCACGGCTAC CTCGAGATGG
TCGGTGCGGT CGCACCCCTG GCTCGCCGAC CACCTGGTGA CAGACACCGT CCTGGTGCCG
AACGCGGCCC TCGTGGAGGC CGCCATCCGG CTCGGCGATC TGGCCGCCAC CCCCGTCCTC
GAGGACCTGA CCGTCGACGT TCCCGTGCTG CTGCCGGCAC GGGACGGCCG CGACATCCAA
GCCCTCGTCG GTGAGCCCGA CGAGACACGG CGACGGCCCG TGGAGATCTT CTCCCGGGCG
GCGGAAGCCC CGCTGGACGC CGCGTGGACG CGCCACGCAC ACGGCACACT GGCCCCTGCC
GCATCCCGGT CGGCCCGCCC GGAGCCCGGG GCCGCCACCG AGATCGCCGT GGACGGCGCC
GCACTTCGGG ACGCCGACCG GTATGGGATG CACCCCGTCC TGCTCGACGC GGCCGCCCGC
ACCGTCATCC CGGACGGCAC GCTGCCGTCC AGGTGGACCG GGGTGACCCT GTTGGCCTCC
GGCGCCACCG CACTGCGAGT GCGACCGGGC TCGACCTCGC GGTGGGGGAC CGGTCTGGCC
TTGACCGACC CCACCGGACA ACCCGTCATG ACCATCGACG CGGTCCTCGG TACGCCCGCC
TCCCCCGATC AGGCGCAGAT CTCCGGCGGT CCGCCACCCG ACACCCTGTT CCGTGTCGAG
TGGACCGAAC TGCCGCTGCC GGTCGTGGAT CGGGCGGGCG CCGTCGTGCC CGTCGCGACC
GGCGAGGACG TCGCCGCGGC GGCCGGGGCG ACGCCGGACG TCCTGCGGTA CGAAGCCGGC
ACCGGAGACC CCCGCGCGTC GGTCGAGGCC GCCCTGGAGG TTCTCCAGGC ATGGCTTGCC
GAACCGGCCC TGGCGGAGAC CCGGCTAGCC GTCGTCACTG GCGACTGCAC CGAACTCGGC
GCAGCCGCGG TGTGGGGGTT GGTGCGCTCG GCGCAGTCGG AGCACCCGGG TCGGATCGTG
TTGGCCGACC TCGACGACGC GTCCCGGTCC GTGCTGCCTG CGCTGCTGCG CAGCGGCGAG
GCCCAACTGA GGGTGCGCGA CGGCGTCGCG CAGGTGCCCC GCCTGGCCCG CGCCTCGCTT
GACCTTCCGG AGGAGCGACG CCGGCTCGAC CCCGACGGCA CCGTACTGAT CACCGGCGGT
ACGGGAACGC TGGGTGCCAC GACGGCCCGG CATCTGGTCA CCGTGCACGG CATCCGGCAC
CTGGTCCTGC TCAGCCGGCG CGGCTGTGCC CCGGACCTGC AGGCCGAGTT GACCGAGCTG
GGGGCCTCTG TCACCGTGGC CGCCTGCGAC ACCGCGGACC GGGCCCAACT CGAGGCCGTG
CTGGCCGCCG TCGCGGCCGA GCATCCGCTC ACCGCCGTGG TGCACGTGGC CGGTGTCCTC
GATGACGGAG TGCTCACCGA GCTGACCCCG GAACGCGTCG ACACGGTGTT GCGGCCGAAG
CTGGATGCGG CACTGCATCT GCACGAGCTC ACCCGGGATC TGGACCTCGC CGCGTTCGTG
TTGTTCTCCT CCGCGGCCGG TGTGCTGGGC AACCCTGGGC AGGCGAACTA CGCGGCGGCC
AACGCCTGCC TGGACGCGAT CGCGCGTCAG CGTCACCACC TCGGCCTTCC CGCGGTGTCC
CTGGCCTGGG GCTACTGGAC GCCGGTCAGC ACCATGACCG AGCACCTGGG AGCCGCGGAC
CTGAGCCGCA ACAGGCGAAC GGGCATGAGT GGCCTGTCCG CCGCCGAGGG AATGGCCATT
CTGGACGCCG CCCTCGGCGC CGCCGACACG TTGGTCGCGG CGAAGCTGGA CGTTCCCGCC
CTGCGCAGGG CAGCGGCGGG TGGTGATCCG ATCCCGCCGC TGCTGCGTGC TCTGGCGCCA
CCGCCACGAC CGACAGCCAA GGCCACGGCC GGTCCCGTCT CGCTCGCCCA ACGCCTCGCC
GGTGTTGCCG AGGCCGAGGC CACCGAGGTC GTGCTCGACC TGGTGCGCAG GTACGCCGCC
GAGGTGCTCG GGCACGCCGA CGCCGACGCC GTCCACCCCG GCCGGACCTT CAAGGACGCC
GGCTTCGACT CGCTGACGGC GGTGGAACTA CGAAACCGGT TGGCCACCGC GACCGGCCTC
ACCCTGTCCC CGGCGCTGGT CTTCGACTAC CCGAAACCCG CGGCGCTGGC CGAGCACCTG
CACGCCAGGC TGCTCGGTGT GGCGCCGCGC CGTCAGAACG AACCCGGTGC CCCCACCAGG
ACGACCGACG AGCCCATCGC GATCGTCGCG ATGGCCTGCC GTTTCCCGGC GGGTGTGCAC
AGTCCGGAGG ACCTGTGGCG GGTGGTGGTC GACGGGGTCG ACGCGGTCAC CGAGTTCCCC
CGGGACCGTG GTTGGGACAC CGAGGGGCTC TACCACGAAG ACCCCGACCA CCCGGGCACC
ACGTACGTGC GGCACGGCGC CTTCCTCGAC GACGCCGCCG GATTCGACGC CGCCTTCTTC
GGTATCTCGC CGAAGGAGGC GACGGCGATG GATCCGCAGC AGCGGCTGCT GCTGGAGACC
TCGTGGGAGG CGTTCGAACG CGCCGCGATC GACCCGACCA CCCTGGCCGG TCAGGACGTC
GGCGTCTTCG TCGGCGTCAA CAGCCACGAC TACAGCGTGC GCACGCACCA GGCGTCGGAC
CTCGAGGGTT TCCGGCTCAC CGGAAGTTCG GGCAGCGTCG TCTCCGGTCG CGTCGCCTAC
CACTTTGGCT TCGAGGGCCC CGCCATCACG GTCGACACCG CCTGCTCCTC ATCGCTGGTG
GCGCTGCACA TGGCGAGCCA GGCGCTGCGC CGCGGCGAGT GCAGCATGGC GCTGGCCGGG
GGCGTGATGG TGATAGGCGC CCTCGAAACC TTCGTGGAGT TCTCCCGGCA GGGCGGGCTG
GCCCCGGATG GCCGGTGCAA GGCGTTCGCG GACGGCGCGG ACGGCACCGG CTGGTCCGAG
GGCGTGGGAC TGCTGTTGGT GGAGCGGCTG TCCGAGGCCC GCCGCCGAGG CCACCAGGTG
TTGGCGGTGG TGCGCGGATC GGCGGTGAAC CAGGACGGCG CGTCGAACGG TCTCACCGCC
CCGAACGGGC CGTCGCAGCA GCGGGTGATC CGCAAGGCAC TGTGCGACGC CGGCCTGGGC
GCCCCGGACG TGGACGCGGT CGAGGCACAC GGAACGGGAA CCGTACTCGG TGACCCGATC
GAGGCCCAGG CGCTGCTCGC GACCTACGGA CAGGACCGCC CCGCCGGCCG ACCGCTGTGG
CTCGGATCGG TCAAGTCGAA CATCGGGCAC ACCCAGGCAG CAGCGGGCGT CGCGGGCGTC
ACCAAGATGG TGATGGCCAT CCGGCATGGT GTCCTTCCGC GGACCCTGCA CGTCGATCGC
CCGTCCACCA ACGTGGACTG GGCGTCGGGC GAGGTGGAAC TGCTGACCGA GGCACGCGAC
TGGCCGGAGA CCGGCCGGGC CCGCCGGGCT GCGGTGTCGT CCTTCGGCAT CGGCGGCACC
AACGCGCACG TCATCATCGA GGCCGCCCCC GACCAGCCGA TGCCCCCCAA CCCGGAGCCG
GCCCCGGAGC CCGACGGTTT CCCGGCACCG CTCCCGTTGT CGGCGCGGAC GGCTGCCGGG
CTGCGTGGCC AGGCGGGCCG ACTCGCCCGG CACCTCGGCG ACCGGTCCAC GCTGCCCCTC
ACCGACACCG CCTACGCCCT CGCCACCACC CGTGCCCACC TCGAGCACCG GGCGGTCGTA
CTCGCCGCCG ACCGGCGGCA GGCCGAGGCC GACCTCAGCG CCATGGAGCG CGGCGGGACC
GGTCCCACGG TCCTGTCCGG TACCCCGGTC ACCGGCAAGC TGGCCATTCT CTTCACCGGT
CAGGGCAGTC AGTGGCCGGG CATGGGACGC GAGCTCGCCG AAACGTTCCC GGTCTTCCGC
GCCGCGTTCG CCGCCGCCTG CACCGCCGTC GAACAACACC TGGAACCGGT GCGCCCGCTG
CGCGACGTGG TGTTCGCGGC GGACGGTGAA CTGCTCGACC AGACCATGTA CACGCAGGCC
GCCCTGTTCG CCCTGGAGAC CGCGCTGTTC CGGCTCGTCG AGTCCTGGGG AGTGCGACCC
GACCTCCTCG CCGGTCACTC GATCGGGGAG GTCACCGCCG CTCACGTCGC CGGGGTGTTG
GACCTGGGGG AGGCGGCCAA GCTGGTCGCC ACGCGCGGCC GGCTGATGCA GGCCCTGCCC
GCCGGCGGCG CGATGGTCGC CGTTCAGGCG AGCGAGGCCG ACGTCGCGCC CCTGCTCGCC
CGAGCTGGCG GCGTGGTCTG TGTCGCGGCG GTGAACGGCC CCGACTCGGT GGTGCTCTCG
GGCCACGAGG ACGCGGTGCT CGCCCTGGCG GGCGAGCTGG CCGGTCGGGG CTGTAGGACG
CGTCGACTAC CGGTCAGCCA CGCCTTCCAC TCACCGCTGA TGGCACCCAT GCTGGACGAG
TTCCGGGCCG TCGTGGCGGA GCTGTCGTTC CAGCCGGGTA ACCTTCCCAT CGCCTCCACG
CTCACCGGTC TCCTCGCCCG GGACGGGCAG TTCCGCACGC CGGACTACTG GGTCGACCAG
GCACGCAGCC CGGTCCGATT CGGCGACGCC GTCACCGCGT TGCGTGAGCA CGGCGCGACG
ACCTTCCTGG AGCTGGGACC GGGCGGCTCC CTCGCCGCGA TGGCGCTCAG CACGCTCGGC
GCGCCGGAAC AGGCGTGCAT CGCAGCCCTG CGCAAGGACG GCGCGGAGGC CAGCGACCTC
GTGGCTGCGG TCGCGGGGCT GCACGTGCGT GGGGTGGCCG TCGACTGGGC GGAGCTGCTC
GGCCGGCGGT CCGCCGTGGC CGGCACCGAC CTGCCCACCT ACGCCTTCCA GCACCAGCGG
TACTGGGTCG AGACCGAGGA CACCGCGACG GGCGGCGCCG AGGCCCTCGG CGCCACCGAC
GCCCGGCACC CGCTACTCGG CACTGTGATC GAGGTGCCGG ACACCGGCGG CCTCGTGCTG
ACCGGCCGCC TGTCCCCGCT GAGCCCCGGC CTGCTTCCCG GCAGTGGCAA CCGGGTTCCG
GTGGCCGTGC TCCTCGAACT TCTGGTCCGG GCGGGCAGCG AGGTCGGATG CGGCAGCCTC
GACCAGATCG TTGTCGAGGT GCCGCTGGCG GTGTCGGAGC ACGGACACAC CCAGGTGCGG
GTGACCGTGG ATGCTCCCGG CCTGGACGGA CGTCGCGGCG TCACCGTCCA TGGCAGGCTC
GACGGTGGCC AGCGTGGAGC ATGGACCCGC CATGTTCGCG CGGTGCTGGT CCCCGGCATG
CCGCAGCCGA GCTTCGACCT CCGGGACCTG TCGGGGCCCG AGATCACCCT GCCGGACGAG
CTGACCGAGC AGGCTGCCAC GTTCGCCCTC CACCCGCTGC TCCTCGACGC CGCGCTGCGC
GTCCCGGCCG GCGACGACCA CCGGGACGTG TCCGCCTGCG CCAGCATGAC GGTGTACGCG
CAAGGGGCGA CCGCCCTACG CCTGCGGACG ACGCCGACGC GGGACGGCCG GCACCTGCTG
GAGCTGGCGG ACACCGCGGG GGAACCGGTC GCCGCCATCG GCCCGCTGAC CCTCGACGCC
GCCACCGCCG ACCCGGTGGA GCCCGACAGC GAGCCGCTGC CGGTGCCGGT GCCGCTCACC
CGCCGCGTGG CCGCGCGGGG TGAGGCGGCG GGCGGGGCGA TCCTGGGCCG GTTGGCGGGT
CTGGCCGAGG CTGAGCAGCT TCGGGTCGTG ACGGCCGTGG TCAAGGAGAG CATCGCCGCC
GTCCTCGGTC ATCGCGAGTC GGACGCGTTC GCCGAGGGAC AGGCCTTCAA GGATCTTGGA
TTCGATTCGC TGAGCGCGGT CAGACTGCGA AACCGGCTGC GGGACCTCAC CGGTGTGAGC
CTGTCCAGCA CCCTGGTCTT CGACCATCCC ACACCTGCCA TCCTCGCCGC CCATCTGCGC
GACGAACTAC TCGGCGAGCG CCGTGAGCTG CCCGTTGCCA CTGCTCGGGC CGCGCTGTCC
GACGAACCCG TCGCGATCGT CGCGATGAGC ACCCGGCTGC CCGGCGGCGT GGACACACCC
GAGGAGCTGT GGAACCTGGT GCTGGAGCGC CGGGACGCGG TCGCCGGCTT CCCGGTCGAC
CGGGGCTGGG ACCTCGACCG GCTGTACGAT CCTGACCCCG CCAGCCCGGG CACGAGCTAC
ACCCGCCTGG GCGGGTTCCT GTATGACGCC GCGCAGTTCG ACGGCACGCT GTTCGGGATC
TCGCCGCGCG AGGCGCTGGC GATGGACCCA CAGCAGCGGT TGCTGCTGGA GACGTCCTGG
GAGGCGTTGG AACGGGCGGG CATCGACCCA CTGTCGCTCA AGGGCAGTGA CGTCGGCGTC
TTCACCGGCA TCGTCCACCA CGACTATGTG AGCCGGTTGC ACCGGGTACC CGACGATGTC
CAGGGCTACC TCATGACCGG CGCCGCCTCC AGCGTGGCGT CCGGCCGGGT GTCCTACGTG
TTCGGCTTCG AGGGCCCGGC GGTGACGCTG GACACGGCGT GCTCGTCCTC GCTGGTGGCG
ATGCACCTCG CCGCGCAGGC GTTGCGACAG GGCGAGTGTT CGCTCGCGCT GGCGGGCGGC
GCGACCGTGA TGGCCAGTCC GGACGCGTTC CTGGAGTTCT CCCGGCAGCG GGGCCTGTCC
GCCGACGGCC GCTGTAAGGC CTACGCGGAC GGCGCGGACG GTACGGGATG GGCCGAGGGC
GTCGGTGTCG TACTGCTGGA GCGGTTGTCC GTGGCACGGG AGCGTGGGCA CCCGGTCCTG
GCCGTGCTGC GGGGCAGCGC GGTGAACCAG GACGGGGCGT CGAACGGTCT GACGGCCCCG
AACGGTCCGT CGCAGCAGCG GGTGATCCGC AGCGCGTTGG CGGCTGCCGG CCTGTCGCCG
GGTGACGTGG ACGTCGTGGA GGGGCATGGG ACCGGGACGG CGTTGGGCGA CCCGATCGAG
GCACAGGCCC TGCTGGCCAC GTACGGGCAG GGCCGGGATG CCGAGCATCC GCTGTGGTTG
GGGTCGTTGA AGTCGAACAT CGGGCACACC CAGGCAGCCG CGGGGGTGGC TGGTGTGATC
AAGATGGTGC AGGCGATGCA ACACGAGGTG CTGCCGGCGA CGCTGCACGT GGAGACGCCC
ACCACCGAGG TCGACTGGTC GGCTGGGGCG GTCCGGGTGC TGACCGAGCC GCGTGACTGG
CCCCGCGGCG CTCGTGTCCG CCGGGCCGCC GTGTCCTCCT TCGGCGCGAG CGGGACGAAC
GCGCACGTGA TCCTGGAGGA AGCGTCCGTC GAGCGGGCCC GGTCCGCCAC GGCGGAGGGC
CCCGCGGCCG GTGTGGTGCC GCTGGTGGTG TCGGCCGGGA GCACCGGTGC GCTGGCCGGG
CAGGCCAGCC GGCTGGCGTC CTTCATCAAG GGCTCCGTCG AGGTGCCGCT CGCCGCGGTG
GCCGGCGCGC TGGTGTCGGG CCGGGCGATG CTCGGCAAGC GGGCCGTCGT GGTGGCTGGC
TCCGGTGACG AGGCACTGGC CGGCCTGGCA GCGTTGGCGC GCGGCGAGAG CAGTCCCGCC
CTGGTGACCG GCAGTGTGGA CACGCCGGGA AAGGTCGTCT GGGTGTTCCC GGGCCAGGGC
TGGCAGTGGG TCGGCATGGG CCGGGAGTTG TTGGACACGT CGCCGGTCTT CGCCGAGCGG
ATCGCCGAGT GCGCCGCCGC GCTGGAGCCC TGGGTCGACT GGTCCCTGGT GGACGTGCTG
CGCGGTGAGG CGGCCGCCGA GCTGCTGGAG CGGGTCGACG TGCTGCAGCC GGCGAGCTTC
GCCGTGATGG TGGGTCTGGC CGCGGTGTGG TCGTCGGTGG GGGTGCGGCC GGACGCCGTG
GTCGGTCATT CACAGGGTGA GATCGCGGCC GCGTGTGTGG CCGGTGCGCT GTCCCTCGAG
GATGCCGCCC GGGTGGCGGC GCTACGCAGC CAGGCCATCG CCGTGAAGCT GTCCGGCCGC
GGTGGCATGG CGTCGGTCGC CCTGCCCGAG GAGGACCTGG TCGTGCGGTT GACGCCCTGG
GCGGAGCGGG TCGAGGTGGC CGCGGTCAAC AGCCCCTCCT CCGTGGTCAT CTCGGGCGAC
GCCGAGGCGC TGGACGACGC CCTGGAGTTC CTGTCCGGTC AGGGGGTGCG GGTGCGGCGG
GTGGCTGTCG ACTACGCCTC CCACAGCCGC CATGTGGACG ACCTCCGGGA CACCCTCGCC
GACATGCTGG CCGGGCTCGA CGCACAGGCA CCGGCCATTC CGTTCTTCTC GACCGTCACC
GGCGGCTGGG TCCGGGACGG CGGAGTCGTC GACGGTGCCT ACTGGTACCG GAATCTGCGG
GATCGGGTCG GCTTCGGCCC GGCGGTGGCG GACCTGGTCG AGCAGGGCCA CGCCGTGTTC
GTGGAACTCA GCGCGCACCC GGTGCTGGTG CAACCGGTCA ACGAGGTCGC CGACGCGGTG
GTGGTCGGGT CGCTGCGGCG CGACGACGGC GGCCTGCCGC GGCTGCTGGC CTCGATGGCC
GAACTCTTCG TACGCGGGGT GGCGGTGGAC TGGTCCCGGG TGCTCCCGGC CCGGTCCGGC
TGGGTGGGCC TGCCGACCTA CGCCTTCGAC CACCGGCACT ACTGGCTGCA CGAGGCTCAG
GCGGGTACCG ACGCGGCCGA CCCGGCCGAG GGCGCCGACT CCGATTTCTG GGCCGCGGTC
GAGCACGCGA ACCTGGACTC CCTCGCCGAA CTGCTGGAGA TGGCCTCAGC CGACCAGCGC
GGCGCTCTGA ACACCGTCGT GCCGGTACTG GCGGACTGGC GGAAGAAGCG CCGCGAGCTG
TCAACCGCGG AGGGGCTGCG CTACCACGTC AACTGGCAAC CGCTCGACGG CGAAGCCGCC
GGCGTGCCCG GCGGCCGGTG GCTCGTCGTC GTGCCGTCGG GGCACGGTGA GGACACGCTT
CTCGCCGGGC TGGGCGAGCA GGGGCTGGAC CTGATCCGAC TGGAGGTCGG CGAGCGGGAC
CGTGGCCGGG AACGTCTCGC CGAACGACTG AGCACCGTCC TCGCCGACCA CGAGCTGACC
GGGGTGCTGT CCCTGCTCGC CCTCGACCCC CGGGCGCAGG ACCCGACAAC CGTCGCGGCG
CCGACGCTCG CACTCGTGCA GGCGCTGGGC GACAGCGACG TCACCGCACC ACTGTGGTGT
GTGACCAGGG GCGCGGTGAA CATCGGCATC CAGGACGCGG TGACCGCACC GGCTCAGGCC
GCGCTCTGGG GTCTTGGCCG TGCCGTTGCC CTGGAGCGGC TCGATCGCTG GGGTGGTCTG
GTCGATCTGC CCGCGTCGAG CGACGCGCGT ACGGCGCAGG CCCTGCTCGG GGTGCTGAAC
ACCACCGGTG AGGATCAGCT CGCGATCCGG CGGTCGGGCA GCTACGGCAG GCGGCTGGTC
CGTAAGCCGC TGCCGGAGCC CGTGACCGGC GGACGCTGGC AGCCCCGGGG CACGGTCGTG
GTGACCGGCG GAGCCGAGGG GCTCGGCAGG CATGCCGCGG TCTGGCTCGC GCAGGCCGGC
GTGGACCGGC TCGTGGTCAC CACCACCGCC CACGCGCCCG TGGACGGCGT GCCGGAGCTT
CGCGCCGAAC TGGCCGGGCT CGGTCTGCAC ACGGTGGTGG AGTCCTGCGC CGATGCCGAC
CGGGACGCGA TCGCCGCGCT GGTGGCCGCG ACCGTACCGG AACAACCGAT CACGGCCGTG
GTGCACGCCG CCGACGTCAC GCAGACCAGT TCCGTCGACG ACACCGGTGA GCGCGACCTC
GCCGAGGTGT TCGCCACGAA GGTGGACAGC GCGGTGTGGC TCGACAACAT GTTCGAGGAC
ATCCCACTTG ATGCCTTTGT CGCGTTCTCC TCGATCGCCG GTGTCTGGGG TGGTGGCGGC
CAGGGCCCGT CCGGTGCGGC GAACGCGGTG CTCGACGCCC TCATCGAGTG GCGTCGGGCC
AGGGGCCTGC GGGCGACGTC CGTCGGGTGG GGGGCGTTGA ACCAGATCGG CGTGGGCATG
GACGAGGCCG CGCTCGCCCA ACTCCGTCGC CGTGGTGTGC TGCCGATGGA GCCCTCGGTG
GCGATGGCCG CGATGGTCCA GGCCGTCCAG GGCAACGAGA AGTTCGTGGC GGTCGCAGAC
ATGGACTGGG CTTCCTTCAT CCCGGCCTTC ACCTCCGTCC GTCCCAGTCC GCTGTTCGCG
GACCTGCCCG AGGCCCAGGC GGCTCTGGCA GCGTCGCAGC CCGACACCGA GAACAGCGAC
GCCGCCGCGT CCCTCGCGGA GTCCCTGCGC GCAGTGACGG ACAGTGAGCA GAACCGGATT
CTGCTCCGAC TGGTCCGCGG GCACGCCTCG ACGGTCCTCG GCCACGGCGG GGCGGAGGGC
ATCGGCCCAC GACAGCCGTT CCAGGAGGTC GGCTTCGACT CGCTGGCCGC CGTCAATCTC
CGCAACAGTT TGCATACCGC CACCGGGCTG CGCCTTCCCG CGACGTTGAT CTTCGACTAC
CCCACCCCGG AGACGCTGGT CGGCTACCTG CGTGCCGAGC TTCTCCGGGA GCCCGACGAC
GGCCTACCGG GGCGGGAGGA CGACCTGCGG CGGGTCCTCG CGTCCGTGCC GATCGCCCGG
TTCAAGGAGG CGGGCGTGTT GGAAGCCCTG CTGGGTCTGG CCGACGTGGA CGACGACCCG
GCCGTGCCGG ACGAACCAGT GCCCAGCGGG GCGAACGACG TGGAACTGAT CGACGCACTG
GACATCGCCG CCCTCGTCCA GCGAGCGCTG GGCAAGGCGA GCTGA
 
Protein sequence
MRTDLIKPLP VALQENAARF PGKVAFEDDR RAVTYADLEA RTRRLAGHLR GLGVKRGDRV 
AIWLRQSVST VESYLAVVRA GGVGVPLNPD AAQAELEYLL SDSGATAVIT DAVQAQRLRP
TPHRALVVTG DVPAGALSYD ELAVSEPEQP AGDDLGLDDV AWMFYTSGTT GRPKGVLSTQ
RNCLWSVASC YVPIPGLTDQ DRVLWPLPLF HSLSHIACVL SVTVVGATAR IMDGSSVQDV
MRALQQEEPT FLAGVPTTYQ QLVSAARRHG FTAPSLRIGL AGGAVLGAEL RQEFEETFGV
PLVDAYGSTE TCGAITINPP DGPRINGSCG LPVPGVGVRI VDPTTGGDLP AGAEGEVWVS
GPNVMVGYHN SPEATAKAMR DGWFRTGDLA RRDGAGYLTI SGRIKELVIR GGENIHPVEV
EAVLRTVPGV ADVAVAGVPH ETLGEVPVAY VIPGPDGFDV ESLVTRCREQ LSAYKVPHQV
HEVASIPRTA SGKVQRRLLV EQPGRLTYAA VDHHDTYPSV DEPDAANTAT LRDQLADLDE
RAQLALLEDL VRTQTAAVLR QPGPDAVPAG RAFRDQGLTS MAIVELRNRL MSRTGLRLPT
SVVFDHPTPT ALAACLRAEV LGITQSVAEP ALVADPTEPI AIVGMACRLP GGVADPDDLW
NLVAGEVDAV SEFPGDRGWD LDRLFDPDPD RAGTSYTGQG GFLPEAGLFD AAFFGISPRE
ALAMDPQQRL LLETSWEALE RAGIDALSLK GADVGVFAGV FGHGYGVGTD AAELEGFATT
GSAASVASGR ISYVFGFEGP AVTVDTACSS SLVAMHLAAQ ALRQGECSIA LAGGATVMAT
PSTFVEFSRQ RALAPDGRCK AYAAAADGTG WAEGAGVVVL ERLSVARERG HRILAVLRGS
AVNQDGASNG LTAPNGLAQQ RVVRRALAAA GLSPSEVDAV EGHGTGTTLG DPIEAQALLA
TYGKDRAPER PLWLGSLKSN IGHAQAAAGV ASVIKMVQAL RHGILPATLH VDAPTPGVHW
SAGAVQLLTE SRDWSRNGRP RRAGVSSFGI SGTNAHVILE EAPEEPAADE VSEEPAADDA
PRVGVMPLVV SARSVASLAG QAGRLAAFVE GVGPPMPDLA GALAKNRAVL DERAVVLAGS
GGEAVTGLAA LARGEPAVGV VTGRADVEGK VVWVFPGQGT QWVGMGRELL DSSLVFAERI
AECAAALEPW VDWSLVDVLR GDADPGLMDR VDVVQPATFA VMVGLAAVWS SVGVRPDAVV
GHSQGEIAAA CVAGALSLAD AARVVALRSQ AIASVLSGRG GMASVALSEE DAAAQLAPWA
DRVEVAAVNG PSSVVIAGDG PPLDEVLESL AHRAVRVRRV AVDYASHTRQ VEDIEGVLAE
SLAGIGAVAP VVPFFSTVGG GWVRGAGEVD GGYWYRNLRG QVRFGAAVAE LIGQGYGVFV
EVSAHPVLVQ SIVDVVDGVE ADVVVGGSLR RDDGGVRRLV TSMAELFVRG VPVDWSGLLP
PVVGWVDLPT YAFDHRHYWL PPVESATDAR SLGQVATDHP LLDAVVGSPR STGFTATSRW
SVRSHPWLAD HLVTDTVLVP NAALVEAAIR LGDLAATPVL EDLTVDVPVL LPARDGRDIQ
ALVGEPDETR RRPVEIFSRA AEAPLDAAWT RHAHGTLAPA ASRSARPEPG AATEIAVDGA
ALRDADRYGM HPVLLDAAAR TVIPDGTLPS RWTGVTLLAS GATALRVRPG STSRWGTGLA
LTDPTGQPVM TIDAVLGTPA SPDQAQISGG PPPDTLFRVE WTELPLPVVD RAGAVVPVAT
GEDVAAAAGA TPDVLRYEAG TGDPRASVEA ALEVLQAWLA EPALAETRLA VVTGDCTELG
AAAVWGLVRS AQSEHPGRIV LADLDDASRS VLPALLRSGE AQLRVRDGVA QVPRLARASL
DLPEERRRLD PDGTVLITGG TGTLGATTAR HLVTVHGIRH LVLLSRRGCA PDLQAELTEL
GASVTVAACD TADRAQLEAV LAAVAAEHPL TAVVHVAGVL DDGVLTELTP ERVDTVLRPK
LDAALHLHEL TRDLDLAAFV LFSSAAGVLG NPGQANYAAA NACLDAIARQ RHHLGLPAVS
LAWGYWTPVS TMTEHLGAAD LSRNRRTGMS GLSAAEGMAI LDAALGAADT LVAAKLDVPA
LRRAAAGGDP IPPLLRALAP PPRPTAKATA GPVSLAQRLA GVAEAEATEV VLDLVRRYAA
EVLGHADADA VHPGRTFKDA GFDSLTAVEL RNRLATATGL TLSPALVFDY PKPAALAEHL
HARLLGVAPR RQNEPGAPTR TTDEPIAIVA MACRFPAGVH SPEDLWRVVV DGVDAVTEFP
RDRGWDTEGL YHEDPDHPGT TYVRHGAFLD DAAGFDAAFF GISPKEATAM DPQQRLLLET
SWEAFERAAI DPTTLAGQDV GVFVGVNSHD YSVRTHQASD LEGFRLTGSS GSVVSGRVAY
HFGFEGPAIT VDTACSSSLV ALHMASQALR RGECSMALAG GVMVIGALET FVEFSRQGGL
APDGRCKAFA DGADGTGWSE GVGLLLVERL SEARRRGHQV LAVVRGSAVN QDGASNGLTA
PNGPSQQRVI RKALCDAGLG APDVDAVEAH GTGTVLGDPI EAQALLATYG QDRPAGRPLW
LGSVKSNIGH TQAAAGVAGV TKMVMAIRHG VLPRTLHVDR PSTNVDWASG EVELLTEARD
WPETGRARRA AVSSFGIGGT NAHVIIEAAP DQPMPPNPEP APEPDGFPAP LPLSARTAAG
LRGQAGRLAR HLGDRSTLPL TDTAYALATT RAHLEHRAVV LAADRRQAEA DLSAMERGGT
GPTVLSGTPV TGKLAILFTG QGSQWPGMGR ELAETFPVFR AAFAAACTAV EQHLEPVRPL
RDVVFAADGE LLDQTMYTQA ALFALETALF RLVESWGVRP DLLAGHSIGE VTAAHVAGVL
DLGEAAKLVA TRGRLMQALP AGGAMVAVQA SEADVAPLLA RAGGVVCVAA VNGPDSVVLS
GHEDAVLALA GELAGRGCRT RRLPVSHAFH SPLMAPMLDE FRAVVAELSF QPGNLPIAST
LTGLLARDGQ FRTPDYWVDQ ARSPVRFGDA VTALREHGAT TFLELGPGGS LAAMALSTLG
APEQACIAAL RKDGAEASDL VAAVAGLHVR GVAVDWAELL GRRSAVAGTD LPTYAFQHQR
YWVETEDTAT GGAEALGATD ARHPLLGTVI EVPDTGGLVL TGRLSPLSPG LLPGSGNRVP
VAVLLELLVR AGSEVGCGSL DQIVVEVPLA VSEHGHTQVR VTVDAPGLDG RRGVTVHGRL
DGGQRGAWTR HVRAVLVPGM PQPSFDLRDL SGPEITLPDE LTEQAATFAL HPLLLDAALR
VPAGDDHRDV SACASMTVYA QGATALRLRT TPTRDGRHLL ELADTAGEPV AAIGPLTLDA
ATADPVEPDS EPLPVPVPLT RRVAARGEAA GGAILGRLAG LAEAEQLRVV TAVVKESIAA
VLGHRESDAF AEGQAFKDLG FDSLSAVRLR NRLRDLTGVS LSSTLVFDHP TPAILAAHLR
DELLGERREL PVATARAALS DEPVAIVAMS TRLPGGVDTP EELWNLVLER RDAVAGFPVD
RGWDLDRLYD PDPASPGTSY TRLGGFLYDA AQFDGTLFGI SPREALAMDP QQRLLLETSW
EALERAGIDP LSLKGSDVGV FTGIVHHDYV SRLHRVPDDV QGYLMTGAAS SVASGRVSYV
FGFEGPAVTL DTACSSSLVA MHLAAQALRQ GECSLALAGG ATVMASPDAF LEFSRQRGLS
ADGRCKAYAD GADGTGWAEG VGVVLLERLS VARERGHPVL AVLRGSAVNQ DGASNGLTAP
NGPSQQRVIR SALAAAGLSP GDVDVVEGHG TGTALGDPIE AQALLATYGQ GRDAEHPLWL
GSLKSNIGHT QAAAGVAGVI KMVQAMQHEV LPATLHVETP TTEVDWSAGA VRVLTEPRDW
PRGARVRRAA VSSFGASGTN AHVILEEASV ERARSATAEG PAAGVVPLVV SAGSTGALAG
QASRLASFIK GSVEVPLAAV AGALVSGRAM LGKRAVVVAG SGDEALAGLA ALARGESSPA
LVTGSVDTPG KVVWVFPGQG WQWVGMGREL LDTSPVFAER IAECAAALEP WVDWSLVDVL
RGEAAAELLE RVDVLQPASF AVMVGLAAVW SSVGVRPDAV VGHSQGEIAA ACVAGALSLE
DAARVAALRS QAIAVKLSGR GGMASVALPE EDLVVRLTPW AERVEVAAVN SPSSVVISGD
AEALDDALEF LSGQGVRVRR VAVDYASHSR HVDDLRDTLA DMLAGLDAQA PAIPFFSTVT
GGWVRDGGVV DGAYWYRNLR DRVGFGPAVA DLVEQGHAVF VELSAHPVLV QPVNEVADAV
VVGSLRRDDG GLPRLLASMA ELFVRGVAVD WSRVLPARSG WVGLPTYAFD HRHYWLHEAQ
AGTDAADPAE GADSDFWAAV EHANLDSLAE LLEMASADQR GALNTVVPVL ADWRKKRREL
STAEGLRYHV NWQPLDGEAA GVPGGRWLVV VPSGHGEDTL LAGLGEQGLD LIRLEVGERD
RGRERLAERL STVLADHELT GVLSLLALDP RAQDPTTVAA PTLALVQALG DSDVTAPLWC
VTRGAVNIGI QDAVTAPAQA ALWGLGRAVA LERLDRWGGL VDLPASSDAR TAQALLGVLN
TTGEDQLAIR RSGSYGRRLV RKPLPEPVTG GRWQPRGTVV VTGGAEGLGR HAAVWLAQAG
VDRLVVTTTA HAPVDGVPEL RAELAGLGLH TVVESCADAD RDAIAALVAA TVPEQPITAV
VHAADVTQTS SVDDTGERDL AEVFATKVDS AVWLDNMFED IPLDAFVAFS SIAGVWGGGG
QGPSGAANAV LDALIEWRRA RGLRATSVGW GALNQIGVGM DEAALAQLRR RGVLPMEPSV
AMAAMVQAVQ GNEKFVAVAD MDWASFIPAF TSVRPSPLFA DLPEAQAALA ASQPDTENSD
AAASLAESLR AVTDSEQNRI LLRLVRGHAS TVLGHGGAEG IGPRQPFQEV GFDSLAAVNL
RNSLHTATGL RLPATLIFDY PTPETLVGYL RAELLREPDD GLPGREDDLR RVLASVPIAR
FKEAGVLEAL LGLADVDDDP AVPDEPVPSG ANDVELIDAL DIAALVQRAL GKAS