Gene BTH_II1666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1666 
Symbol 
ID3845272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1973189 
End bp1989889 
Gene Length16701 bp 
Protein Length5566 aa 
Translation table11 
GC content75% 
IMG OID637838967 
Productpolyketide synthase, putative 
Protein accessionYP_439860 
Protein GI83716786 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00517] acyl carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGACC AAGTCGTACT GACTTCGCAA CATCCGCTGC TCGACGCCCA CCGGATCGAC 
GGCGAACCCT GGCTGCCGGG GCTCGCCTAT CTCGACCTCG TCTATCAGCT CGGCGAGGCG
CGCGGCATCG CGTTCGAGCG TCACGCGCTG CACGACCTCA CGCTGCATGC GCCGCTTGCC
GTCGCCGATG GCGAGGCGGT CGTGCTGACG CTGCGGTGGC GCGACGAACG CGCCGACGGC
TGGCATCTGA CGATCGAAGG CGCGCCGCTC GACGGCGGCG AGGGCGCGGC GCCGCGCCGG
CTCGTCACCG TCCGGCTGCG CGAGGCCGCG CCGGCCGCCT TCGGCGCGAT GCGTGACGGC
GACATGGACG CGCCGGCCGT ACTCGGCGCG ACGCATGACG GCAACGTCGG CACAGCCGGC
GACATCGTTC ACGGCAACGG CGAAGCGACC GTCGTCGATC TCGCGTCGCT GTACGCACGC
TGCCGCTCGC TCGGGCTCGT GCACGGCGCG GCGATGCAGG CGCGGGGCAC CGCGCGGCTG
ACGGCCGAGC AGGTGCTCGT CGACGTGCGG CCGGCGACGG CCTCGCGACA GGCGGAAGGC
GATACGGAGC GCAATACGGA CGGCAATGCG GACGGCGACG CCGGTCCGTA TCTGTTCGAT
CCCGCGTTGA TCGACGGCAG CGGCATCGGC TCGGCTGCGT TCTTCGCGCA TCTGTTCGAC
GACGCCGATG CGCCGCGCCT CTACCTGCCG CTGCATATCG GCGCATTCCG GGCGACGCGG
CCGATTCGCG CGCACGTGCT CGCGCGCGTG CGCCGGGCGT CGGTGCGCCG GCGCCGCGAG
CTCGTTACGG TGACGTTCGA ATTCTTCGAT CCCGCGACCG GCGAGCAGAT CGCCGAGCTG
ACCGACGTGA CCTGCAAGGC GGTGCGCGAC GCGCGCGACG GCGCGCCGGC GCGGGCGCGC
ACGGATGCGG CGTCCGGCAG CGCGTGTTCC GGCGCGTGTT CCTGTGCGGC GCCCGCCGCG
GCGCGAACCG TTGACGGACG AACCGACGGC GCGCGAACGG AACGCGCGCT GCCGGACGAC
GCATCACCGC CGCCCGGCGC GCGGGCCGAG CGATTCGTGC GCGACGCGAT CGCCGCGCGG
CTCGGCGTCG CGCCGGCCGC GATCGATGCG GACGCGGGCT TTTACGATCT GGGCCTCGAC
TCGGGCATGC TGCTCGAACT GGCGGAGACG ATCGGCGCGG CGATCGGCGC GTCGTTGGCG
CCGACGCTGC TGTTCGAGCA CGCGAACGCG CGCGAGCTCG GCGCATGGCT CGACGCCGAG
CACGGCGCCG CGTTCGCGGC GCGCGCGACG CCGCCGCATC GCGAGACCGA AGACGCGGCC
CGGCATGCAG CGCCGCGAGC CGGCGCGGGC GATGCGAACG GTGCGCGCGA CGGTGCGCGC
GCGGGGACCG ACGCGAACGA CGCCGGCAAC GCGAACAGCT CGAACGGCTC GAATCCCGTC
GCCCGCCCGC CTGCGTCCGT GCGCTGCGGC TCGCTCGACA TCGCCGTCGT CGGCATGGCC
GGGCGCTACC CCGGCGCGCG CGATCTGGAC GAATTGTGGG CGAACCTGCG CGACGGGCGC
GACTGCGTGA CCGAGGTGCC CGCCGAGCGC TGGCGCGCCG ACGACGTCGC GCACGTGCGT
TCGCCGTCGG GCAAGCCCGT GTCGCGATGG GGCGGCTTCA TCGACGCGCC CGATTGCTTT
GACGCGCGCT TTTTCCGGAT CACGCCGCGC GAGGCGGAAG TGATGGACCC GCAGGAACGC
GTGTTCCTGG AAACGGCGTG GGCGGCGATC GAGGACGCGG GCCACACGCC CGACACGCTC
GCGCGCCGCG CGCCGGGCGG CGGCGACGCG GGCGCGCCCG TCGGCGTGTT CGTCGGCGTG
ATGCAGAGCG AATACGCGCT GATCCAGCGC GACGCGCTCG CGGGCGCGCC GACGCCGCTC
GCCGTCAACC GCGCGCCCGT CGCGAACCGC GTGTCGTACG TCTGCGATTT CCACGGGCCG
AGCGTCGCGA TCGACACCGC GTGCTCGTCG TCGCTCGTCG CCGTGCACCT CGCGGCGCAG
AGCCTCGCGG CCGGTGAATG CGCGGTTGCG ATCGCGGGCG GCGTGAACCT GTCGCTGCAT
CCGGCGAAGT ATGTGTCGTG CGCGCTGATG GACATGCACG CGTCGGACGG CCGCTGCCGC
AGTTTCGGCG CGGGCGGCGA CGGCTACGTG TCGGCCGAGG GCTGCGGCGC GCTCGTGCTC
AAGCCGCTTT CGGCCGCGCT CGCGGACGGC GACGCGATCC ACGGCGTGAT TCGCGCGAGC
GCGACGAACC ACGTCGGCGC GGTCGCCGGC ATCATGGCGC CGAGCCCCGC CGCGCAGGCC
GCGCTGATCG GCGAGTGCCT GCGTCGCGCG AACGTCGACG CGCGCACGAT CGGCTATGTC
GAGGCGCACG GCACGGGCAC CGCGCTCGGC GATCCGATCG AGATCGACGG GTTGACGCGC
GCGTATCGCG CGCACACGCG CGACACGCAG TATTGCGCGC TCGGCTCGAT CAAATCGGCG
GTGGGGCACG CGGAGGCGGC GGCGGGCGTC GCCGGACTCA CGAAGGTGCT GCTGCAGATG
CGCCATCGGA CGATCGCGCC CTCGCTGCAC GCGCAGCCGC CCAATCCCCA CATCGATCTC
GCGGCGTCGC CGTTCTTCGT GCCGGCGCGG GCGATGCGGT GGGAGCCGCG CGCGCTTGCC
GGCGGCGGCC GCGCGCCGCT GCGCGCGGCG GTGAGCTCGT TCGGCGCGTC GGGCACGAAC
GCGCACGTGA TCGTCGACGC GTACGCGGAC GCGCGCGTCG CGACGCACGA CGACGCGGGG
CCGTCCGTCG TCGTGCTCTC CGCGATGAGC GAGCCGGCGC TGCGGCAGTA TGCGCGCGCG
CTGCGCGACG CGATGCGCGC GGCGCCGGCG GCCGAGCCGC TCGTCGCGCG CATCGCCGAG
CTGCTCGCCG AGCAGCTCGG CGTCACAGCC GACATGATCG ATCCCGACGC GGACTGGCGC
GAGTTCGGCG TCGACGCCGC GCAGCGGCGG CGCCTCGCGC GCGCGCTCGA GGCATGGCTC
GACATCGACG ACGCGCTGCC CGAGCGCGCG GCGCTCGATT CGGTCGCCGG GCTCGCGCGG
CAGTTGCGGC GTGAGCACGC GCGCGCCGTC GATGCGGCGC TCGCCGGCGC GACGGACCTG
GGCGCGGCGC GCAACGACGG CGCGGCCGCG CTGTCGCTGC GCGCGATCGC GCATACGCTG
CAAGTGGGCC GCAAGGCGCT GCCCGAGCGC GCCGCGTTCG TCGCGCGCGA CGTCGCGCAT
CTGGCGCGGC TGCTCGACAC GTTCGCCGCG ACGGGCGACG CGCCGCGACC GGACGGCCAT
CGCGGGCGAG CCCGCGCCGA CGAACAGCCG GCCGAGCCGC TTTCGCCGCA ACACGCGGAC
CCGCACCGGC TCGCCGCCGC GTGGGTCGAG GGCCGGGCGA TCGACTGGGC GGCGGCGTAT
TCGAGCGGCG CGCCGCGCCG CGTGAGCCTG CCGACCTATC CGTTCGAGCG GCGGCGCTAT
TGGGTTCGCG CAGCGGATGC CCAGTCCGCC GTGCGCGCGA CGGCCGACGG CGCGACTGCG
CCGGACGCCG ACGCGAAAGC CGATGTAAAA GCCGATGCAA GGGCTAATGC AAGGGCCGAT
GCGCAAGCCG GCGCGCACGT CGACGCCGGT GCGGCGGCCG CAGCCGCAGC CGCGGTCGCG
GACATCCTGC AATCGGTCAC CGGGCTGGGG CCGGACGAAT GGCGCGACGA TACGACGTTC
GATGCGCTCG GTCTCGATTC GCTGATGATC GGCGAGTTCA CGCGCCGCAT CGAGGCGATG
ACGGGCGAGC GCGACACGAC GCTGCTGTTC CGGTTCCGCG ACTTGACCGC GCTCGCCGCG
CATCTCGCGC ATCGTCATCC GGCTGCGTGG CGGGCGGATG GGGCGCAGCG CGCGTCGTCC
GCCGCATCGA CGCCCGCGCC CGCGCCGGCG CAAGCCGGAT CGCCGCCCGC GGCCCTCGCA
TCGCGCGCCG CGCGGCCGGC GCCGCGCGAG ACGGCCGCCG CCGCGCCGCT CGACATCGCG
ATCATCGGCC TCGCGGGCCG CTATCCGCAT GCGCCGACGC TTGACGCGTT CTGGCGCAAC
CTCGTCGCGG GCCGCGACTG CGTCGACGAG ATCCCGCCGC AGCGCTGGCC GCTCGACGGC
TTCTACGAGG CCGACCCCGC GCGCGCGGCG GCCGAGGGCA AGAGCGCGAG CAAGTGGGGC
GCGTTCCTGT CCGATGTCGA CCAGTTCGAC CCGCTTTTCT TCGGCATCAC GCCGAACGAA
GCGCGGCTCA CCGATCCGCA CGAGCGGCTC TTTCTCGAAA CCGCGTGGGC GTGCGTCGAG
GACGCCGGCT ACACGCGCGC GTCGCTCGCC GCGCTGCGCG ACGGCCCGGG CGTCGGCGTG
TTCGTCGGCG CGAGCTTCAA CCAGTATCAG CTCATCGTGA GCGACGCGGC GCAGCGGCGC
GGCGCGCGGC AGTTCGCGGC GCCGAGCCAG ATCTTCTCGA TCTCGAACCG CGTGTCGTAC
GTGATGAACT TCACGGGGCC GAGCCTGACC GTCGACACCG CGTGCTCGTC GTCGCTCTAT
GCGATCCATC TCGCATGCGA AAGCCTGCGG CGCGGCGAAT CGAGCGTCGC GCTCGCGGGC
GGCGTCAATC TTTCGCTGCA CCCGAGCAAG TACGTGTCGC TGTCGCTCGG GCGCTTTCTC
GCCGCCGACG GCCGGTGCCG CGCCTTCGAC GAAGGCGGCA CGGGCTACGT GCCGGGCGAG
GCGGTGGGCG CGGTGCTGCT CAAGCCGCTC GCCGACGCCG AGCGCGACGG CGATGCGATC
CACGGCGTGA TCCGCGGCAG CGGCGTGAGC CACGGCGGCC GGACCAACGG CTTCGCGGTG
CCGAGCCCCG ACGCGCAGGC GCTCGCGATC CGGCGGGCGG TCGCGCAGGC GGGCGTCGCG
CCGCGCAGCG TCGGCTACGT CGAGGCGCAC GGCACGGGCA CCGCGCTTGG CGATCCCGTC
GAGATCGCGG GGCTGGAGGA CGTGTTCCGC GCGGGCACCG ACGACGTCGG GTTCTGCGCG
ATCGCGTCGG TGAAGACGAT GATCGGCCAC AGCGAGGCGG CGGCCGGCAT CGCGCAGTTG
ACGAAGGTGC TGCTGCAGAT GAAGCACGCG ACGCTCGTGC GCAACCTGTC GCACGGCGGC
GCGCCGAACC CGAACCTCGC GCTCGAACGC TCGCCGTTTC GCGTCGTGCG CGACAACGAG
CCGTGGGCGG CCGGCGACGG CGGCGTGCGC CGCGCGGGCG TGAGCAGCTT CGGCGCGGGC
GGCGCGAACG CGCACCTGAT CGTCGACGCG TACCGCAACG CGCCCGAGCC GCGCGCGGGC
GCGGACAGCG GCACGGACGA CGCGCCCGCG CTGTGCGTGC TGTCGGCGAA GGACGACGCG
CGCCTGCGCG AGATGGCGGC CCGGCTCGCC GCGCATCTGA GCGACGCGCG TCTCGGCGAC
GCGGACCTCG CCGACGTCGC GTTCACGCTT CAAGTCGGGC GCGAGCCGAT GAACCGGCGC
GCGGCGTTCG TCGTCAGCCG CGTGCGCGAC GCGATCGATC TGCTGCACGC GATCGCGCGC
GGAGACGAAG CGTCCGGCGC GCAGGCGTCC GGCGTGCTGC GCGGCGACGT GCGCGCGGCG
GACGCGCAGC ACGCGCCCGC CGTCGACGAG GCGACGCTCG CGCGCTGGCC GGCGGCCGAC
GCGGCGCGCG CGATCGCCTC CGCGTGGGTG CGGGGCGCGG CGATCGACTG GATGCGGCTG
CGCCGCGCGC CGTCGGTGCG ACGCATGTCG CTGCCGACCT ATCCGTTCGC GCGCGAATCG
TACTGGGTGC CCGGCGTCGC GCCGCAGTCG GCGCTTGCGC CGGCCGCCGG CCGCGCGAGC
CGGGCCGTTG ACGCGCACGG CGAGCACGGC GCGCCGCGCG CCGACGCGCC GGTTTGTCCG
CCGGCCGGCC GCGCGCAGGA AACGCCGGGC ACGGGCGTGC TCGTGCTCGC GCCGCAATGG
CGGCCGGCCG AATGCTCGCC GAGCGCGAGC GCCGCGCCCG CGTTCGATGC GCGCGTGCTC
GTGCTCTGCG AGACGCCGGC TTCGTTCGGC GCGGCGCTGG CGCCCGCGCT CGCGGACGGC
GCGCGGATCG TCGCGCTGCG CTCGGACGAG CGCGATCCCG CCGCCCGCTT CGCGTCGCAT
GCGGCGCGCC TCTTCGCGCT CGTCAAGGAC GAACTGCAGG CGCTCGCGAA GCGTCCGCTC
GCGCGCGCGC TGCTGCAGGT CGTCGCGCCC GCGAGCGACG ATGCGTGGGA CTGCGCGGCG
CTCGCCGCGC TGCTGAAGAC GGCGGCGCAA GAGAATCCGC GTCTCGTCGC GCAGTGGATC
GCGTTCGACG CGTGGCCCGA TGCCGGCGGC GCCGCCGCGC GGCTGCGCGA GAACGCATCG
GCGGCGGGCG ACGTCGACAT CCGCTATCTC GACGGCCGCC GCCACACGCT GCGCTGGGCG
CAGACGCGGG CGGCCGACCG CGCGTTCGGC TGGCGCGCGA ACGGCGTGTT CCTGATCACG
GGCGGCGCGG GCGGGCTCGG CGCGCACGTG GCCGCGACGA TCGCGCGCGG CGCGGTGCGC
CCGACGATCG TGATCGCGGG CCGCTCGGCC GAAGGCGCGG CGCAGACCGC GCTGCTCGAC
GCGCTGCGCT CGGCGGGCGC GCGTGCGCAG TACCGGCAGG CCGACGTCGC GTCGCATGCC
GATTGCGACG CGCTGATCGC CGGCTGCGTC GCCGAGTACG GCGCGCTGCA CGGCGTCGTT
CACTGCGCGG GCGTGATCCG CGACGGCTTC ATCGTGCGCA AGCGCGCCGA AGACCTCGCG
GCGGTATGCG CGCCGAAGGT GGCGGGCGTC GTCCATCTGG ACGCGGCGAC GCGGCGGCTG
CCGCTCGATT TCCTGCTGTG CTTCTCGTCG GCGGCCGGCG CGCTCGGCAA CGTCGGCCAA
AGTGACTATG CGATGGCGAA CGCGTTCATG GACGCGTTCG CCGCTCACCG CAACACGCTC
GTCGCGCGCG GACAGCGGCG CGGGCGCACG TGCTCGATCG GCTGGCCGAT CTGGCGCGAC
GGCGGCATGC GCGTCGATGA CGCGACGCTC GCGGCGCTCG AAGCGCGCGC GGGGATGCTG
CCGCTCGCGA TCGACGAAGG AATGCGAACC ATGCAAGACT GCCTTGCGCT CGATGCGCCG
CACGTGATCG TCGCGGCTGG ACACTTGACG ACGTTGCGCC GGCTGCTGCG CGTCGATGCC
GGCGCGGCCG GCGGGCATGC TGCGGATTCG GCCGATGCGG AAGCGGCCGA TGCGGACGCG
CGCGCGGCGA GCCCGGCCGG CGCGAATGCG GCCGTTGCCG ATGCGGCAGG CGGCGATCGG
GCCGGCATGA ACACGGTCGG GCCGGACGCG CCCGGCGCGA CGGCGCTCGC CGAAGGTCAG
GCGGGCGCAC GGGCGGCCGG CGAATCCGCG CCGGGCGACG ACGTTGCCGG GACGCACGCC
GCTTCGGGTG ACGCGTCGCT TGCGGCGGCC GTCGCCGAGC GGCTCAAGCA CGTGCTGAGC
GAGGCGACCG GGATCGCCGT GTCGCGCCTT GACGCCGACG AGCCGTTCGA CGCGTACGGC
GTCGATTCGG TCGTCGTGAT GACCGTCAAC CGCGCGCTCG ACGACGTGTT CGACGCGCTG
TCGCAGACGC TGCTCTACGA ATACGGCACG CTCGGCGCGC TCGCGGGCCA CCTCGCGCAC
GCGCACGAAG CGGCGTGCCG CGACTGGTGC GGCGTCGCCG CGGCCGCGCG GCCGGGCGCG
CGCGCCGCCG ACGTGCGCGT GCGCGATACG CTCGCCGTCG ACATGTCTCG CCCGGACGTG
CGGACCCCGG ACCTCCGGGT ATCCGACGCG CGGGGCTTCG ACGAGCGGGT ATCCGACATG
CGGGACTCCG ACGCACAGGC CTCCAGCGCG CAGGCCTCCG ACATGCTGGC CCGCGACACG
CGCGACATCG ACGCGCAGCG CATCGCCGAC GCCGAACGGC AGCCCGCCGA GGACGGCGAC
GCGACGGCCG TCGGCCGCGC GGGAGCGCGC GCGCCCGGTG CGTCGTCCGA TGCATTCGTC
GGGGCGGCCG GCGCGGCCGC GGCCCCGGCC GGCCGCGACG AGCCGATCGC GATCATCGGC
ATCGCGGGCC GCTACCCGCA GGCGAGCGAT CTCGACCAGT TCTGGCGCAA CCTCGCGGGC
GGCGTCGACA GCGTGACGAC GATCCCGCCG GGCCGCTGGC CGCTCGAAGG CTTCTTCGAG
CCGGACCGGG CGCGCGCGCT CGAAGCGGGC AAGAGCTACG GCAAGTGGGG CGCGTTCCTC
GACGATCACG AGCGCTTCGA TGCGCCGTTC TTTCAGATGT CGCCGCTCGA GGCGATCAAC
CTTGATCCGC AGATCCGGCT GTTCATCGAG ACCTGCTGGG CGGCGCTCGA GGACGCGGGC
TACACGCGCC GCCTGCTCGC GGAGCGCCAC GGCAAGCGCG TCGGCGTGTT CGCGGGCATC
ACGAAGACGG GCTACGCGCT GCACGGCCCG CGCGTGTGGC CGCTCAGGCA GAGCTACAAC
CCGCAGACGT CGTTCAGCGC GCTCGTGAAC CGCGTGTCGT ACGTGCTCGA CCTGCACGGC
CCGAGCGTGC CGATCGACAC GATGTGCTCG TCGTCGCTCA CCGCCGTGCA CGAGGCGTGC
GAGCACATCC GCTCGGGCGC GTGCGCGCTC GCGCTCGCGG GCGGCGTCAA CCTGTACCTG
CATCCGTCGA ATTTTCTCGA GCTGGCGGGC GTGCAGATGC TGTCGGCCGA CGGGCGCTGC
AAGAGCTTCG GGCAGGGCGC GGACGGCTTC GTGCCGGGCG AGGGCGTGGG CGCGGTGCTG
CTCAAGCCGC TGTCGCGCGC GCTTGCCGAC GGCGATCACG TGCACGCCGT CATTCGCGCG
ACGGGCATCA ACCACGGCGG CAAGGTGAAC GGCTTCACGG TGCCGAACCC GAACGCGCAG
CGCGAGCTGA TCCGCGCGAC GCTCGAACGC GCGGGCGTGC GCGCGCGCCG CGTGAGCTAC
GTCGAGGCGC ACGGCACCGG CACCGATCTC GGCGATCCGA TCGAAGTCGC CGCGCTCGCG
CAGGCGTTCG CCGCGAGCGG CGGCGATCGC GACACGGGCT ACTGCGCGCT CGGCTCGGTG
AAGTCGAACA TCGGCCACCT GGAGGCCGCG GCCGGCATCG CCGGATTGAC GAAGGTCGTC
CTGCAGATGA CGCACGGCAT GCGCGCGCCG ACGCTGCACG CGCGGATTCC GAACGAGAAG
ATTCCGTTCG CGCGCACGCC GTTCGTGCTG CAGCGCGAGC GCGACGACTG GCCGCACGCG
CACGGCGACG CCGCGCAGTC GCGGATCGCG ACGGTATCGT CGTTCGGCGC GGGCGGCGCG
AACGCGTTCG CGGTCGTCGA GGAAGCGCCG CGCCGCGAGC GCGACGCGCG GCCCGACCGC
CCCGAGGTGA TCGTGCTGTC GGCGCGCGAC GCCGACCGGC TCGCCGCACG CATCGCGCAG
TTGCGCGGCC GTCTGCGCGA GGGCGTGCCG TACTCGCTCG CCGAGATCGC CTACACGCTG
CAAGCCGGCC GTGAGGCGAT GGCCGAGCGG ATCGCGTTCG TCGCGGACTC GCTCGACGCG
CTCGTCGACA CGCTCGACAC GCTCGACACG CTCGACACGG CGGGCGGCGC GCCGAATCCG
GCGGCCGGCG CGGCGGCGGC GCGCGTGCAT CGCGGCAGCG GCGTGACGCG CAAGACGCCG
CTGTCGTTCC TGGCCGACGA CGATCTCGCG GACACCGTGC GCGCGTGGCT CGCGAAGGGC
AAGCTCGATC TCGTCGCGCA GGCGTGGGCG AGCGGCGTCG ACATCGACTG GACGTTGCTG
CACGGCGATC GCACGCCGCG CCGCGCGAGC CTGCCGGCCT ATCCGTTCGC GGGCGAGCGG
CACGCGTTGC CCGATGCGGC GCAGCCCGTC GCGGCGCACG CGCTGCATCC GCTGCTGCAG
CGGAACACGT CGACGCTCGA GCGGGTGAGC TTCGCGTCGG TGTTCGGCGG CGACGAAGCG
TTCCTGCGCG ATCATCGCGT GTCGGGCCGG CCCGTGCTGC CCGCCGTCGC CTACATCGAA
ATCGCGTATC GCGCGCTGCG CGAGGCGGGC GTGCCGGGCG ACGCGGTGCG CTTGACCGAT
CTCGTGTGGA TGCGGCCCGC AATCGTCGAA GGCGAGCCGC TCCACGTGCG CGTCGAGCTC
GCTTCCGAAG CTTCCGAAGC TTCCGAAGCT TCCGATACGT CCGATACGTC CGATACGTCC
GATGCGGCGG GCGACACGCT CGCGCTGACG ATCGGCGCGG CCGCGCCGTC CGGGCGCGGG
CAACCGGGCT TCGCGACGGT GTTCGCGCGC GCTCGCGCGC GTCGCGTGCG CGTCGCCGCG
CCGGCCGCGC TGCCGCTCGC GCGCTGGCGC GCGCAGTGCG GCGCCGCAAT GACGGCGGAT
GACTGCTATC GCGTGTTCGA ACGCGTCGGC CTGGCGTACG GCCCGGCGTT CCGCACGCTC
GACACCCTCT GGCGCGGCGA CGGCGTCGCC GTCGCGAAGC TGCGCGCCGC GTCGCCGCCG
GCCGCGCCGC ACGCGGCGGC CGACGTCGTC GATTCGCTCG ACTGCGGCCT GCTCGACGGC
GCGCTGCAGG CGGTGCTGGG CCTCGCGCCC GCATCGGACG CCGAGCGGCC GCTGCTGCCG
TTCGCGCTCG ACGCGTGCGA GCTTTACGGC GAGCGGCCCG ACGAGCCGTG GTGCGTCGTG
CGGCGCACCG AAGACGCGCG CGTCGACGTC ACGCTGTGCG GCCCGCAGGG CGAGCCGTGG
ATCGCGCTGC GCGGCCTCGT GCTGCGCCGC TACGAAAGCG CGCCCGCCGC CGAGCGCGGG
CTCGTCGCGC TCGCGCCGCA ATGGCGGCGG GCGGACGCGA CGCCCGTCGC GGCCGCCGCC
GCGCCCGCAT TCGCCGACAC GCTCGTCGTG CTCTGCGGCG CGCAGCGTCT AGACGCCGGC
GGCAGCGCGG ATCGCATCCG CGTCGTGACT GTCGCGAACG ATGCGACGCC GATCGGCGAG
CGCTTCGCAC AGACGCTGCA GGCGCTGATC GCCGAGACGC AGGCGCTGCT GCGCGACCGG
CCGGACGCGC CCGCGCTGTT GCAGGTGGTC GTGCCGGATA CGCCGGCGTT CGCGCCGATG
ATCGGGCTGG CGGGCTTTCT GCGCACGCTG TCGCACGAGG CGCCGCACAT GTGCGGCCAA
TTGCTGCTGG TTCCGGCGCA CGACACGGGC GCCGCGCTGC GCGCGAGGCT GGCCGAATGC
GCGGCGCGCG CGCACGAAAC GGTGGTGCGC TACGGGCAGG GGCCGGCGCG CGACGTGCTG
CAGTGGCGCG AGTGGGCGGA CGCGATGCCG CCGGCCGATG CCGGCCAGAC GGCCGACGCG
CATGCGGCGC CCGTCTGGCG CGACGACGGC GTGTATCTGG TGACGGGCGG CGCGGGCGGG
CTCGCGTGGC TGCTCGCGCA GGACATGGCC CGGCGCGCCC CGGGCGCGCG CATGGTGCTT
GTCGGGCGGC GGCCGCTCGA CGCGCGCGCC GCATCGTCGC TCGATGCGCT GCGCGCGCGC
ATCGAATACG TCGAGGCCGA TTGCGCCGAC GCCGACGCGA TCGCCGCGGC GGTCGCGTCC
GTGCTCGAGC GCCACGGCCG GCTCGACGGC GTGGTGCACG CGGCGGGCGT GCTGCGCGAC
GAATTCATCC GGCGCAAGAC GGCGGACGGC GTCGCGGCGG TGCTGCGCCC GAAGGTCGAC
GGCACGCTCG CGCTCGACCG CGCGACGCGC GACGTTCGCC TCGCGTTCTT CGTGCTGTTC
TCGTCGGCCG CCGCGCAGGC CGGCAATCCG GGGCAGGCCG ACTATGCGGC CGCGAACGGT
TTCATGGACG GCTTCGCGCA GTATCGCCGC GCGCTCGTCG CGCTCGGCGA GCGGCACGGC
GCGACGACGA GCATCGACTG GCCGCTGTGG CGCGACGGCG GGATGCGCAT CGGCGCCGAC
GCGCAGCGCC TGCTCGAAAC GCAGACGGGG ATGTCGGCGA TGCCGGGCGA CGCGGGGCTC
GCGTTCTTCC ATCGCGCGGT GCGCGCGGGC GTGGCGCAGG TGATGCCGCT GTACGGGAAT
CCGGCGCGTC TGCGTGACGC GCTTGCGCCC GCCGCGCGCG ACGACGCGGG TTCGGCCGCC
GATGCGGCCG GCGCGAGCGA CGCGGGCAGC GCGTACGCGG ATGCCGGCGA CGCGCGCGGG
ATGGCCGGCG CCGTGCTGAT CGCGCGTCTT CGCGATGCGC TTGCCGCGGC GCTCGGCGTC
GCGCGCGACG CGCTCGACGC GCACGTGCCC GTCGGCGAAT TCGGCGTCGA CCGGCAGGTG
CTCGACGCGC TTGTCGAGCA GGTCGAGCCC GGCGGCGCGG TGCGTCGGGC GGAACTGCTC
GATACGCGGC TCACGCTCGA CGAGATCGCG TCGCGTCTTG CCGCCGCGAC GCGCGGCGAC
GCGCACGCGG CGCACGCCGC TTCGCCGCGC GGCGAAGCGG GCGAGGCGCA GGAAGCGCCG
CAAGCATACG AAGCGCACGG CGCGCATGAA GTGCCGCCGT TGGCCGAGCC GCGCGCGTCC
GAGCTCGTCG ACGCATCGCT CGCGCTGCTG ACCGAGCGGC TTGCCGACGT GATCAAGCTG
CCCGCGCCGC GCATCGATCC CGATGCCGAG CTCACGACGT ACGGCATCGA TTCGGTCGTC
GTGATGCAGG TGACGACGCA ACTCGAAAAG CAGCTCGGGC CATTGTCGAA GACGCTGTTC
TTCGAATACG GCACGCTGCG CGCGATCGCG GCGCGCGTCG CGCACACGCA TCGCGAGCGG
GTCCGCGCGC TCGTCGAGCG GCGCGCCGCA GGCACGCAGG CGGCATCGGC GGCATCCGTC
GCATCGCACC CGGCTGCGCC GCACGGCGCG AGGCCCGCGC ACGAGCGGCG CGCCGACGCC
GACGCGGCGA GCGTTCGCGG GGCACCGCGC GCCGTTGCCG CGAGGTCGTC GGCGGAGGTC
TCGCGGGCAA GCGCGACCGC GAACACGAAC ACGACCACGA CCACGACCAC GACCGCGACC
GCGACCGCGA CCGCGACCGC GACCGCGACC GCGACCACCA ACACGACTGA GAACGCCGCG
CCGCACGCGT CGCGCGGCGA TCGCGACGCG CGGCCGCCTG CGCAGCAGCC GCCCGGCGCG
GCCGCGTTCG CGGCGGGCGA CATCGCGATC GTCGGCATCG CGGGCCGCTA TCCGCAGGCG
GACGATCTCG CGCAGTTCTG GCGGAACCTC GCGCGCGGCG TCGACAGCGT GACGGAGATT
CCCGCCGACC GCTGGGATTA CCGGCGCTTC TACGATCCGC AGAAGGGCCG GCTCGGCAAG
AGCTACAGCA AGTGGGGCGG CTTCCTGTCC GATGTCGCGC GCTTCGACGC GGCGTTCTTC
AACATCTCCG CGCGCGAGGC GCAGATCATG GACCCGCAGG AGCGGCTGTT CCTCGAATGC
GTGTATCACA CGCTCGAGGA TGCGGGCTAC ACGCGCCGCA ACGTGAGCCG CAGCCGCCGG
GTCGGCGTGT TCGTCGGCGT GATGTACGAG GAGTATCAGC TGTACGGCGT CGAGCGCATG
CTGGAAGGCA CGCCCGTCGC GCTCGCGGGC AATCCCGCCG CGATCGCGAA CCGCGTGTCG
TACTTCTGCG ATTTCCACGG CCCGAGCATG GCGATCGACA CGATGTGCTC GTCGTCGCTG
AGCGCGATCC ATCTCGCGTG CCAGAGCCTC ATGCTCGGCG AATGCGAGGT CGCGGTGGCG
GGCGGCGTCA ACGTGTCGAT TCATCCGAAC AAGTATCAGA TGCTGTCGCA GGGCCGCTTC
GCGTCGAGCA ACGGCCGCTG CGAGAGCTTC GGCGCGGGCG GCGACGGCTA CGTGCCGAGC
GAGGGCGTGG GCGCGGTGCT GCTCAAGCCG CTCGCGCGCG CGATCGCGGA CGGCGACCGG
ATTCACGCGG TCATCAAGGC GACCGCGCTC AACCACGGCG GCAAGACGAA CGGCTACACG
GTGCCGAACC CGAACGCGCA GGCCGACGTG ATCGGCGACG CGCTCGCGCG CGCGGGCATC
GACGCGCGCA GCATCGGCTA TGTCGAGGCG CACGGCACCG GCACGTCGCT CGGCGATCCG
ATCGAGATCG CGGGGCTCGC GCAGGCGTTC GGCCGCCACA CGCCGGACAA GGGCTTCTGC
GCGATCGGCT CGGTGAAGTC GAACATCGGC CACGGCGAGA GCGCGGCCGG CATGGCGGGC
CTCACGAAGA TCGTGCTGCA GATGCGGCAC CGTCGGCTCG TGCCGTCGCT GCATGCCGAC
ACGCCGAATC CGAACATCGA TTTCGCCGAT ACGCCGTTCG TCGTGCAGAC GACGTTGGCG
CCGTGGCGCG GCGCGATCCT GCCCGACGAG GCGGGCCGGC CGGCCGAGCT GCCGCTGCGC
GCCGGCCTGT CGTCGTTCGG CGCGGGCGGC GCGAACGCGC ACGTCGTCGT CGAGGCGTAC
GACGTCGCGG GCGCGGAGCG GGGCGCGCAC GAAGACGGTC CGGCCGTCGT CGTGCTGTCG
GCTCGCACCG ACGAGCGGCT CGCCGCGCAG GCGCGCAACC TGCTCGCGCA TCTGTCGCGC
GAGCCGCATC GCGACGGCGA GCTCGACCGC GACGGCGGTG CGACGCTCGC GTCGATCGCG
TACACGCTGC AGGTCGGCCG CGAGGCGATG CCGGCGCGAC TCGCGGTGAT CGCGGCGTCG
CTCGACGATC TGCGCGAGAA GCTCGCGGCG CTCGTCGACG GCGGTGAAGG GCTCGACGGC
GTGTCGCGCG GCCGCGTCGA TCCGCTCGGC GAGCCGCTCG CGCCCGGCGA GCTCGCGCAA
TGGCGCGCCG GCGCGCGGCT CGCCGAGCTC GCCGCCGCCT GGGTCGCGGG CCGCTTCGAC
GACTGGACGC CGTGTTACGG CGACGTTCGG CCGGCGCGCG TGTCGCTGCC GGGCTACGCG
TTCGCGCCGA CGCGCTACTG GGTGCCCGAC GCGGAGCAAC TGGCGAAGGC GGCCGGTGTG
CACGCGGACG CAACGCAAGC GAAGGGGATC GCGGGCGTGG CGCAGGGTTC GGTGGCGTCG
GGAAGTTCGG GCGCTTCGGC TGAGGCGGCC GGGCCGGCCG AGTCGAACGG GGCGGCTGGG
TCGAACGGGT CGGCCGGGTC GGCCGGGTCG AACGACGCGG GACCGGGCGA TGCGCGCAAG
CGCGTGGCGG GGCCTTCGAT CGACGCGGCG GCCCCGGCGA ACACGGCTCA ACCGGCCGAA
CCGGCCGAAC CGGCCGCGCG CCGTCGCGGC GGCGTGACGT TGTCGCCGCC CGCCGAGGCG
GCCGCATTCG ATCTGTCGGC GCGCGCACCG GCCGCGAAGC CGACGGTGAC GTTCGCGCAG
CCCGCCGGCC GCGCGCACGA AGCCCTGCCG CCGGCATCTC GTCCGGAGGA ATCCGCACCG
CGCGTCGTCG CGACGCAGGC GCGCCGCGCG GATGCGGTTA GCGCCGCGCC CGCCGTCTCG
CCGGAGGCGG TCGTGCCGGC GCTGCGCGCG CTGCTCGCCG ATGCGCTGTA CGTCGATGCG
CAGACGATCG ACGCCAACGC CGAATTCGTG TCGCTCGGCG TGGATTCGAT CATCGGCGTC
GAGTGGATCG ACGCGGTGAA CCGGCGGTTC GGCGTCGCGC TGTCGGCGAC GACGATCTAC
GACCATCCGA CGCTGTCGAG CTTCGCCCGG CATCTGGCGT CGGCGGCGGC ATCGGCGCAG
GCGTCGATGG CCGCGGCGGC GTCGCCGTCC GCGCCTGCGT CCGGCGCGCG GGCGGACACG
GCCGCGCCCG AGGCTGCGGC GGAGGCGTCC GCGCCGTTCG AGCGCGAGCA GGCGGCGCGC
GCGCGCGCCG CCGCGCACGG GGCGGACGAC GCGGGCGATC CGGCGCCCGC CGCGCTCGAA
GCGGCGCTGC GCGAGACGCT CGCCGACGCG CTGTTCGCCG ACGCGCACGA CATCGAGCCC
GACGCGACGT TCCAGGACCT CGGCGTGGAT TCGATCATCG GCGTCGAATG GGTGCAGGCG
ATCAACCGCC GCTACGGCAC GTCGATCCCG GCGCCGCAGA TCTATCAGTA TCCGACGCTG
CGTGCGTTTG GCGGCCTCGT CGCGTCCGAG CTGGCGCTCG CTCGCCGTCG CGCGGGCGGC
GCGCCGCTGC CCGACGCCGA GGCGGCATCA GCGACGACGG CGACGACGGC AAGCGCGTCG
GGCACCGCGC ACGCGCCGAA CGCAGCAGGC GACGCGCATG CGCCGAACGC AGCCGGGGCG
CGCGCGGCGA CTCCCGCGCC CGCGGAAAAC GCGGCGGGCG CGGTACATGC CGCGCGGGCC
GCAAACCCCG CAAACCCCGC CACTCTCGCA AACCCCGCGC CGGGCGGCGG CGTCCCGCTC
GACGACGTGC TCGCGCGCGT GCATCGCGGC GAGCTGAGCG TCGAGGCGGC CGAGGCGCTG
CTGGCGGGCG CGCTCGGTTG A
 
Protein sequence
MEDQVVLTSQ HPLLDAHRID GEPWLPGLAY LDLVYQLGEA RGIAFERHAL HDLTLHAPLA 
VADGEAVVLT LRWRDERADG WHLTIEGAPL DGGEGAAPRR LVTVRLREAA PAAFGAMRDG
DMDAPAVLGA THDGNVGTAG DIVHGNGEAT VVDLASLYAR CRSLGLVHGA AMQARGTARL
TAEQVLVDVR PATASRQAEG DTERNTDGNA DGDAGPYLFD PALIDGSGIG SAAFFAHLFD
DADAPRLYLP LHIGAFRATR PIRAHVLARV RRASVRRRRE LVTVTFEFFD PATGEQIAEL
TDVTCKAVRD ARDGAPARAR TDAASGSACS GACSCAAPAA ARTVDGRTDG ARTERALPDD
ASPPPGARAE RFVRDAIAAR LGVAPAAIDA DAGFYDLGLD SGMLLELAET IGAAIGASLA
PTLLFEHANA RELGAWLDAE HGAAFAARAT PPHRETEDAA RHAAPRAGAG DANGARDGAR
AGTDANDAGN ANSSNGSNPV ARPPASVRCG SLDIAVVGMA GRYPGARDLD ELWANLRDGR
DCVTEVPAER WRADDVAHVR SPSGKPVSRW GGFIDAPDCF DARFFRITPR EAEVMDPQER
VFLETAWAAI EDAGHTPDTL ARRAPGGGDA GAPVGVFVGV MQSEYALIQR DALAGAPTPL
AVNRAPVANR VSYVCDFHGP SVAIDTACSS SLVAVHLAAQ SLAAGECAVA IAGGVNLSLH
PAKYVSCALM DMHASDGRCR SFGAGGDGYV SAEGCGALVL KPLSAALADG DAIHGVIRAS
ATNHVGAVAG IMAPSPAAQA ALIGECLRRA NVDARTIGYV EAHGTGTALG DPIEIDGLTR
AYRAHTRDTQ YCALGSIKSA VGHAEAAAGV AGLTKVLLQM RHRTIAPSLH AQPPNPHIDL
AASPFFVPAR AMRWEPRALA GGGRAPLRAA VSSFGASGTN AHVIVDAYAD ARVATHDDAG
PSVVVLSAMS EPALRQYARA LRDAMRAAPA AEPLVARIAE LLAEQLGVTA DMIDPDADWR
EFGVDAAQRR RLARALEAWL DIDDALPERA ALDSVAGLAR QLRREHARAV DAALAGATDL
GAARNDGAAA LSLRAIAHTL QVGRKALPER AAFVARDVAH LARLLDTFAA TGDAPRPDGH
RGRARADEQP AEPLSPQHAD PHRLAAAWVE GRAIDWAAAY SSGAPRRVSL PTYPFERRRY
WVRAADAQSA VRATADGATA PDADAKADVK ADARANARAD AQAGAHVDAG AAAAAAAAVA
DILQSVTGLG PDEWRDDTTF DALGLDSLMI GEFTRRIEAM TGERDTTLLF RFRDLTALAA
HLAHRHPAAW RADGAQRASS AASTPAPAPA QAGSPPAALA SRAARPAPRE TAAAAPLDIA
IIGLAGRYPH APTLDAFWRN LVAGRDCVDE IPPQRWPLDG FYEADPARAA AEGKSASKWG
AFLSDVDQFD PLFFGITPNE ARLTDPHERL FLETAWACVE DAGYTRASLA ALRDGPGVGV
FVGASFNQYQ LIVSDAAQRR GARQFAAPSQ IFSISNRVSY VMNFTGPSLT VDTACSSSLY
AIHLACESLR RGESSVALAG GVNLSLHPSK YVSLSLGRFL AADGRCRAFD EGGTGYVPGE
AVGAVLLKPL ADAERDGDAI HGVIRGSGVS HGGRTNGFAV PSPDAQALAI RRAVAQAGVA
PRSVGYVEAH GTGTALGDPV EIAGLEDVFR AGTDDVGFCA IASVKTMIGH SEAAAGIAQL
TKVLLQMKHA TLVRNLSHGG APNPNLALER SPFRVVRDNE PWAAGDGGVR RAGVSSFGAG
GANAHLIVDA YRNAPEPRAG ADSGTDDAPA LCVLSAKDDA RLREMAARLA AHLSDARLGD
ADLADVAFTL QVGREPMNRR AAFVVSRVRD AIDLLHAIAR GDEASGAQAS GVLRGDVRAA
DAQHAPAVDE ATLARWPAAD AARAIASAWV RGAAIDWMRL RRAPSVRRMS LPTYPFARES
YWVPGVAPQS ALAPAAGRAS RAVDAHGEHG APRADAPVCP PAGRAQETPG TGVLVLAPQW
RPAECSPSAS AAPAFDARVL VLCETPASFG AALAPALADG ARIVALRSDE RDPAARFASH
AARLFALVKD ELQALAKRPL ARALLQVVAP ASDDAWDCAA LAALLKTAAQ ENPRLVAQWI
AFDAWPDAGG AAARLRENAS AAGDVDIRYL DGRRHTLRWA QTRAADRAFG WRANGVFLIT
GGAGGLGAHV AATIARGAVR PTIVIAGRSA EGAAQTALLD ALRSAGARAQ YRQADVASHA
DCDALIAGCV AEYGALHGVV HCAGVIRDGF IVRKRAEDLA AVCAPKVAGV VHLDAATRRL
PLDFLLCFSS AAGALGNVGQ SDYAMANAFM DAFAAHRNTL VARGQRRGRT CSIGWPIWRD
GGMRVDDATL AALEARAGML PLAIDEGMRT MQDCLALDAP HVIVAAGHLT TLRRLLRVDA
GAAGGHAADS ADAEAADADA RAASPAGANA AVADAAGGDR AGMNTVGPDA PGATALAEGQ
AGARAAGESA PGDDVAGTHA ASGDASLAAA VAERLKHVLS EATGIAVSRL DADEPFDAYG
VDSVVVMTVN RALDDVFDAL SQTLLYEYGT LGALAGHLAH AHEAACRDWC GVAAAARPGA
RAADVRVRDT LAVDMSRPDV RTPDLRVSDA RGFDERVSDM RDSDAQASSA QASDMLARDT
RDIDAQRIAD AERQPAEDGD ATAVGRAGAR APGASSDAFV GAAGAAAAPA GRDEPIAIIG
IAGRYPQASD LDQFWRNLAG GVDSVTTIPP GRWPLEGFFE PDRARALEAG KSYGKWGAFL
DDHERFDAPF FQMSPLEAIN LDPQIRLFIE TCWAALEDAG YTRRLLAERH GKRVGVFAGI
TKTGYALHGP RVWPLRQSYN PQTSFSALVN RVSYVLDLHG PSVPIDTMCS SSLTAVHEAC
EHIRSGACAL ALAGGVNLYL HPSNFLELAG VQMLSADGRC KSFGQGADGF VPGEGVGAVL
LKPLSRALAD GDHVHAVIRA TGINHGGKVN GFTVPNPNAQ RELIRATLER AGVRARRVSY
VEAHGTGTDL GDPIEVAALA QAFAASGGDR DTGYCALGSV KSNIGHLEAA AGIAGLTKVV
LQMTHGMRAP TLHARIPNEK IPFARTPFVL QRERDDWPHA HGDAAQSRIA TVSSFGAGGA
NAFAVVEEAP RRERDARPDR PEVIVLSARD ADRLAARIAQ LRGRLREGVP YSLAEIAYTL
QAGREAMAER IAFVADSLDA LVDTLDTLDT LDTAGGAPNP AAGAAAARVH RGSGVTRKTP
LSFLADDDLA DTVRAWLAKG KLDLVAQAWA SGVDIDWTLL HGDRTPRRAS LPAYPFAGER
HALPDAAQPV AAHALHPLLQ RNTSTLERVS FASVFGGDEA FLRDHRVSGR PVLPAVAYIE
IAYRALREAG VPGDAVRLTD LVWMRPAIVE GEPLHVRVEL ASEASEASEA SDTSDTSDTS
DAAGDTLALT IGAAAPSGRG QPGFATVFAR ARARRVRVAA PAALPLARWR AQCGAAMTAD
DCYRVFERVG LAYGPAFRTL DTLWRGDGVA VAKLRAASPP AAPHAAADVV DSLDCGLLDG
ALQAVLGLAP ASDAERPLLP FALDACELYG ERPDEPWCVV RRTEDARVDV TLCGPQGEPW
IALRGLVLRR YESAPAAERG LVALAPQWRR ADATPVAAAA APAFADTLVV LCGAQRLDAG
GSADRIRVVT VANDATPIGE RFAQTLQALI AETQALLRDR PDAPALLQVV VPDTPAFAPM
IGLAGFLRTL SHEAPHMCGQ LLLVPAHDTG AALRARLAEC AARAHETVVR YGQGPARDVL
QWREWADAMP PADAGQTADA HAAPVWRDDG VYLVTGGAGG LAWLLAQDMA RRAPGARMVL
VGRRPLDARA ASSLDALRAR IEYVEADCAD ADAIAAAVAS VLERHGRLDG VVHAAGVLRD
EFIRRKTADG VAAVLRPKVD GTLALDRATR DVRLAFFVLF SSAAAQAGNP GQADYAAANG
FMDGFAQYRR ALVALGERHG ATTSIDWPLW RDGGMRIGAD AQRLLETQTG MSAMPGDAGL
AFFHRAVRAG VAQVMPLYGN PARLRDALAP AARDDAGSAA DAAGASDAGS AYADAGDARG
MAGAVLIARL RDALAAALGV ARDALDAHVP VGEFGVDRQV LDALVEQVEP GGAVRRAELL
DTRLTLDEIA SRLAAATRGD AHAAHAASPR GEAGEAQEAP QAYEAHGAHE VPPLAEPRAS
ELVDASLALL TERLADVIKL PAPRIDPDAE LTTYGIDSVV VMQVTTQLEK QLGPLSKTLF
FEYGTLRAIA ARVAHTHRER VRALVERRAA GTQAASAASV ASHPAAPHGA RPAHERRADA
DAASVRGAPR AVAARSSAEV SRASATANTN TTTTTTTTAT ATATATATAT ATTNTTENAA
PHASRGDRDA RPPAQQPPGA AAFAAGDIAI VGIAGRYPQA DDLAQFWRNL ARGVDSVTEI
PADRWDYRRF YDPQKGRLGK SYSKWGGFLS DVARFDAAFF NISAREAQIM DPQERLFLEC
VYHTLEDAGY TRRNVSRSRR VGVFVGVMYE EYQLYGVERM LEGTPVALAG NPAAIANRVS
YFCDFHGPSM AIDTMCSSSL SAIHLACQSL MLGECEVAVA GGVNVSIHPN KYQMLSQGRF
ASSNGRCESF GAGGDGYVPS EGVGAVLLKP LARAIADGDR IHAVIKATAL NHGGKTNGYT
VPNPNAQADV IGDALARAGI DARSIGYVEA HGTGTSLGDP IEIAGLAQAF GRHTPDKGFC
AIGSVKSNIG HGESAAGMAG LTKIVLQMRH RRLVPSLHAD TPNPNIDFAD TPFVVQTTLA
PWRGAILPDE AGRPAELPLR AGLSSFGAGG ANAHVVVEAY DVAGAERGAH EDGPAVVVLS
ARTDERLAAQ ARNLLAHLSR EPHRDGELDR DGGATLASIA YTLQVGREAM PARLAVIAAS
LDDLREKLAA LVDGGEGLDG VSRGRVDPLG EPLAPGELAQ WRAGARLAEL AAAWVAGRFD
DWTPCYGDVR PARVSLPGYA FAPTRYWVPD AEQLAKAAGV HADATQAKGI AGVAQGSVAS
GSSGASAEAA GPAESNGAAG SNGSAGSAGS NDAGPGDARK RVAGPSIDAA APANTAQPAE
PAEPAARRRG GVTLSPPAEA AAFDLSARAP AAKPTVTFAQ PAGRAHEALP PASRPEESAP
RVVATQARRA DAVSAAPAVS PEAVVPALRA LLADALYVDA QTIDANAEFV SLGVDSIIGV
EWIDAVNRRF GVALSATTIY DHPTLSSFAR HLASAAASAQ ASMAAAASPS APASGARADT
AAPEAAAEAS APFEREQAAR ARAAAHGADD AGDPAPAALE AALRETLADA LFADAHDIEP
DATFQDLGVD SIIGVEWVQA INRRYGTSIP APQIYQYPTL RAFGGLVASE LALARRRAGG
APLPDAEAAS ATTATTASAS GTAHAPNAAG DAHAPNAAGA RAATPAPAEN AAGAVHAARA
ANPANPATLA NPAPGGGVPL DDVLARVHRG ELSVEAAEAL LAGALG