Gene BURPS1106A_A0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0440 
Symbol 
ID4906294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp428722 
End bp437220 
Gene Length8499 bp 
Protein Length2832 aa 
Translation table11 
GC content74% 
IMG OID640143547 
Productpolyketide synthase 
Protein accessionYP_001074483 
Protein GI126457144 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3208] Predicted thioesterase involved in non-ribosomal peptide biosynthesis
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCAAA TAGTTTCAAC GAGCAACGGA TCTCTCGATA GTCGGGCGCG GATCATCGCG 
TTTCCGTTCG CGGGCGGCAG CGGCGCCAGC TATGCGGACA TCGCCCGCGA ACTCGGCGAC
GCGTTCGCTT TCACGACACT CAGCCTGCCC GGTCGCGGCG CGACCCAGCA CATTCCGTTC
TACGACGACT GGCCGTCTCT CGTCGACGAT CTCGCCGCCG AGGTCGCGCG GCTCGACGAC
GGCACGCCCT TCTTCCTGTT CGGCCACAGC CTCGGCGCGC TGCTCGCCTA CGAAGTCGCG
CGCACGCTCG AGCAGCGCGG CGGCGCGCGG CCGACCGGCG TCTTTCTGTC GGGCCATCCC
GCGCCGTCGC GCGAACGCGC CGCGCCCGCG TGGCGTCGAC AGACGCATGC GCTACCCGAC
GCCGAATTCG TCGAGGCCGT GCGGCGCTGG GGCTTCTTCC CGGAAGGCGC GCTCGACGAC
CCCGACGTCG CCCGCTACGT TCTCCCCCCG CTGCGAGCGG ACCTTCGGCT CGCCGAGACC
TACCGGCACG CACGCGGCGA CGCGCTGCGC GCGCCATGCG CGATCTACGG CGGCGCGGCC
GATCCGAGCA CGAGCGTCGA CGATCTGCGT GCGTGGCGCG CGCACGTTTC GCCGCAACAC
GCGTGCCCGG TCGAATCGTT CGACGGTCAT CACTTCTATT TTCTCGACCC TTCGCCCCGC
GCGGCGCTGT GTGCGAGCCT CGCCGGCCAC ATCGAGCAGC GGCTCGCGCA ACGCCCCGCG
TCGATCGTCG CGGCGGCGCC CGATCCGGGC TCGCGCGTCG CCGCGTGCCT GCGCACGGCC
GGCGATTGGG GCGATGCGCA AGCGGTGCTC GCCGCGATTC GCGAACACGT CGCGCGCACG
CCGGACGCGC TCGCGCTGCA GGACGGCGAG CGCACGTGGA CCTACCGGCA GCTCGCGTCG
CATGCGGCCG AGCTCGCCGA CGCGTTTCGC GCGGCGGGCG TCGGCCGGCA GGATGTGATC
GGCGTGTTCC TGCCGCACGG CGCCGAATAC GTGCTGACGA TCGTCGGCGC GTGGTCGATC
GGCGCGTCGG TGTGCCTGCT CGAGAAAAGC TGGCCCGATT CGCTCGTCGG CGAATTCGTC
GCGAGCTGCC GCGTCGCGCA GCTCGCGACG ACGCCCGCGT TGCTCGCGCG CGCGGCCAAA
CACCTGCCCG ATGCGCGCTG CACGCTCGTC GGCGCGCGGC CGCCGCTCGC GCAGCGCGCA
TGGACGAGCG TCGCGCCGCG CCCCGACGAC ATCGCGTTCG TCTCGTTGAC GAGCGGCTCG
ACGGGCAAGC CGAAGGCCGT CCTCACGACG CACGTCGGCA CGAGCTACTG CTTCCACGCG
CGCGACGCGC TGTATCCGTA CGCCGACGGC GAGCGCGAAG GGCTCAACGT GTTTCTCGCC
TGGGAATGCC TGCGCCCGCT GATGTTCGGC CGCCCGGCCG TCGTGATCGG CGACGACGTG
ATCTTCGATC CGCCCCGGCT CGTCGCGCTG CTGCGGCGCG AGCGGATCAC GCGGCTCGTC
GTCACGCCGT CGCTGCTCGA AAGCGTGCTC GACTTCCCCG GCATCGCCGG GCAACTGCGC
GATGCGCTCG CGCACATGTC CGCATGGTTC CTGATGGGCG AAGTCGTGCC GCAGCGCGTC
GTCGACAAGG CGCGCGCGGC GTTCCCGCCA TCGGTGCGGC TCGTCAACGC GTACAGCACG
TGGGAAAGCC TCGACGTCTG CTACGCGGAT CTGCTGCCGT CGCGCGCGAG CGACGACGGC
AGGCGCGTGC CGATCGGCCG GCCGCTGCCC GGCTGCGCGC TCGCGGTGCT CGACGAAGCG
GGGCGCGCGG TGCCGGCGGG CGAAACAGGC GAGCTGTACA TCGCGTCGCC CGCCCTCGGG
CCCGGCTATC TCGACGACGC GGCGCGCACC GCCGAGAAAT TCCTGCCGTC GGTGGCCGCG
CTCGCCGAGC GCGGCCACGA CGCGCCCGCC TATCGGACGG GCGACCGCGC GCGGCTGCTG
CCCGACGGCC AGATCGCGAT CCTCGGCCGC ATCGACAACA CCGTGAAGAT CCGCGGCTTC
AAGGTGCTGC TGCACGCGAT CGAGAACGTG CTCGATGCGG TCGACGGCGT CAGCAAGACG
CTCGTCGTGC CGATCGACGA TCCGCACACG AAACAGCCGT CGGCGCTCGC CGCCTATGTC
GTCGGGCGCG GCGGCGCGCC GTCGGAGACG ACGCTCGCGC GGCTGCGCCA GCAGGCGCGC
GCGAAGCTGC CCGAATACGC GGTGCCCGCG CACTTCATCG GCCTGGAGGC GTTCCCGCTG
CGCGCGGGCA CCTCGCGCAA GCTCGACAGG CGGGCGCTGC CGCCGCCCGA ACCGCCGGCG
GGCGCCTCGC CCGCCGAGCG CGGCGCCGTG CCGGCCGCCG CGGGCGACGG GCTCGAAGCG
CGGCTCGCGG AGGTCTGGCG CGATGTGCTC GGCGTCGCGT CGGTCGCGCG CGACGACAAC
TTCTTCGAGC TCGGCGGCAA TTCGCTGAGC GCGGCGAAAG CGGTCGGCGC GCTCGGCGAG
CGACTCGGGC TCTCGCTCGC CGTCGTCGAT CTGTATCAGC ACAGCCGGCT GCGGAACCTC
GCCGACTACT GCCGCCGCTC ACGCGAGGAA GACGCCGCGC GCGATGCGCG CGCGGCACGC
ACGGCTCCGG GCGCGGGCGC GCCGAAGGTC GCGATCGTCG GCATGGCGGG ACGCTTCCCG
GGCGCCGATT CGGTCGATGC GTTCTGGCGC AACCTGACGC AAGGCGTCGA CAGCCTGTCG
CGCTTCACGC GCGAGCAACT GCTCGCGAAG GGCGTCGACG CCGCACAGCT CGACCATCCC
GACTGGGTGT CGGCCGCGCA GGTCGTCGGC GACGCGGACA AGTTCGATGC GCTCTTCTTC
GGCATCGGCC GGCGCGAAGC GGTGCTGATG GATCCGCAGC ACCGGCTCTT CATGGAAGTC
GCGTGGAGCG CGCTCGAGCA GGCGGGCTAC GCGCGCGCCG AGAACCGCTA TCGCACGCGC
ACGGGCGTGT TCGCGAGCTG CGGGATCGAC GGCTATCTCG TCCATCACCT GCAAGGCGGC
GGGCTGCTCA CGCCGCTCGA TCCGGGCCGG CTGATGCTCA CCGAGATCGG CAACGAGAAG
GACTACGTCG CGACGCGCGT CGCGTATCAG CTCGATCTCG GCGGACCCGC GGTGTCCGTC
GGCGCCGCAT GCAGCAGCGC GCTCGTCGCG GTCGTGCAGG CGGTGCAGGC GATTCGCGCC
GGCCAGTGCG AGATGGCGAT CGCGGGTGCG TCCGCGCTGT CGTTCCCGAA CTTCGGCTTC
TGCTACGAGG ACGGCCTCGT CGGCAGCGCC GACGGCCACG TGCGTCCGTT CGACGCGCGC
GCGAGCGGCA CGCTGTTCGG CGACGCGGTC GGCGCGGTCG TACTCAAGCG GCTCGACCTC
GCCGAAGCGG ACGGCGATCC GATCCTCGCG GTGATCTCGG GCGTCGGGCT GTCGAACGAC
GGCCGGATGA AGGCGGGCTA CACCGCGCCG AACGCGGACG CGCAGCGGCG CTGCATCGTC
GACGCGCTCG ACATGGCCGA CGTGCGCTCC GAGCAGATCT CGTACGTCGA GTGCCACGCG
ACCGCGACGC TGATCGGCGA TGCGATCGAG CTGAAGGGCC TCTCCGACGC GTTCGCGCAG
ACGCGCGGCG CCGATGCCGA TGCCGCTGCC GTCGCGGGCC GCTGCGCGAT CGGCTGCGTG
AAAGGCAACA TCGGTCACGC GAACTGCGCG GCCGGCATCA CGGGCCTCAT CAAGACGGTG
CTGCAGTTGC GGCATCGGCA ACGCGTGCCG ACCGTGCATT TCGACACGCT CAACCCGAAG
CTCGTGCCGT TCGTCGAGCA CGATGCGTCG CCGTTCGTCG TGCAGCGTCA CGCCGACGAC
TGGACCGTCG CCGATCCGGC GACGCAGTTG CCGCGGCGCG CGGGCGTGTC GAGCTTCGGG
ATCGGCGGCA CGAACGCGCA CGTGATCGTC GAGGAAGCGC CCCCGCCCGC GCGTGCCGCC
GCGCCCGCCG AGAGCGTGCC GCGCGCGCAT CACCTGATGA CCGTGTCCGC GCGCACGCCG
GGCGCGCTCG CGCGCAACCT GCAGGCGCTC GCCGGACGTC TCGCGGCGCT GGACGCGGCC
GATCTCGGCT GCGCCGCCCA CACGCTGCAC GTCGCGCGCG AAGCGCATCC GCTGCGCGTC
GCGCTCGCGG TGCCCGCGCG GCCGGCCGAC GCGGCCGCCG CGCTGCGCGC AAGCGCCGCC
GCGCTCGCCG AAGCGAGCGT CGACATGCCG GACATGTCCG ACGCGCCGGC CGCCCCCGTT
CGCGCCAAGC CCGGCGCGAC CGTCGCGTTC TGCTTCTCCG GCCAGGGCTC GCAACATCCG
GGCATGGCGC GCGAGCTGTA TCGCTCGAAC ACGGAGGCCG GCCGCTTCCG CCATCACTTC
GACGCCGCGT GCGCGGCGCT CGAACGCGCG CTCGGCGCGC CGATCGCCTA TGCGATCCTG
AACGCCGACG ACGAGGCGAT GCGCCGCCCG CTCGTCACGC AATGCGGGCT GTTCGCGCTC
GAGCACGCGC TCGCGGCGGT GCTCGGCGAG TACGGCGTGC GGCCCGTCGC CGTCGCGGGG
CACAGCATCG GCCAGTATGC GGCCGCCGTC GCCGCCGAGG CGCTCACGCT CGAGCAGGCG
GCCGCGCTCG TCGCGGCGCG CGCCGGCGCG ACCGAAGCGT TGAATGCGTT CTCCGCCGGC
GGCGGCGCGC CCGCGCGCGG CGGCATGCTC GCCGTGACGG GCGACGACGC ACGCATCGAG
CAGTGGGCGG CCGGGCGCGC CGACGTGTGG ATCGCGGTGC GCAACGCGCC GCGCACGCTC
GTGCTCGCCG GCGCGCAACC CGCGCTCGCC GACGCCGCGC GCGCGCTCGC CGCGCTCGGT
TGCCGGTGCC GGCCGGTGCC GGTGTCGCAT CCGTTCCATA CGCCGCTGAT GCAACCCGTC
GCCGACGCGA TCATCGCGCA GCGCATCGCG GGCGCCGCGC CGCGCATTCC GATGACCTGC
AACGTGACGG GCGGCTGGCT GGGCGCCGAC GCCGCCTCGC CCGGCTACTG GGCGCGCCAT
CTGCTCGAGC CGGTGCGCTG GTCGGACAAC GTTGCCGCGC TGCTGCGCTG GCAGCCCGAC
GTCGTGCTCG AAATCGGCCC GGGCACCGTG CTGTGCAGCC TGATCGGCAA GCATCTCGCG
GGCGGAGCGA ACGCCGGCGC GCCCGCGCGG CCAAACGGGC ACGCGGGCGC GAACGCGGCG
GCGCTCGCGC CGCGCGTGCT GCCGACGCTG CCCGCCGCGG GCGACGACGC GCACGACGCC
GCTCATTTCT GCAACACGAT CGGCCAATTG TGGTGCGCGG GCGTGCCGAT CGACTGGCGC
GCGTATCACG CGCACGAAGC GGCCGCGCCC GGCCGCGCGC TCGCGCGCGT GCCGCTGCCC
GGCTATGCGT TCGAGCGCGA CAGCTACTGG ACGCGGCCGC GGGCGTCGAT CTATGTCGAC
GCGGCGCAAG CCGATGCCGA ACCCGATGCC GCCCCGGCGG CGGCCGATCA CACGGACGCG
GCCGATGCCG ACAGCGCGCG CGCCGCGCTC GCCGATGCAT CGGACAAAAC GCCCGCCGCG
CGTGCGGCGT CGACGCACAC GTGTTCGGGC GAATGCGCAT GCGAGCGCGC ATGCGCGGCC
GAACCGCGAC CGAGCGGCGG CGTGGAGCGC GTCGCCGCGC CGCCGGCGCC CTCGTCGCGC
GGGCTCGTGC GGCTGAAGCC GCGCCGCCGG CCGCGCATGA AGCTGTACTG CTTCCCGTAT
GCGGGCGGCA GCAGCCGCAG TTTCGACGGC TGGGCGCGCA CGGCGCCCGA CTGGCTCGAC
ATCGTCGCGA TCGAATGGGC CGGGCGCAAC GCGCGCGCCG AGGAATCGCT CGCGCGAGAC
GACGCCGCCG ACCTCGCCGC GCGCGAGGCG ATCGCCGCCG CGATCGTCGC GGACGCGGGC
GAGTTGCCCG TCGCGTTCTG CGGGCTGAGC TACGGCGGCG CGGCCGCCAC CGAACTGCTG
GGCGGGCCGC TGCGCGCATG GGCCGCGAGC GGCCGGGTGA AGGGGCTCGC CATCGTGGGG
CGCGCACCGC TCCTCGAGCA GCCGGCGATC GACGCCCCCG CCGACAGCTT CCTGCTCGTC
CCCGACGCGC TGCGCGACGA TCCGCTGTGG CAGGAGATCT TCCGGCCGGT GCTCGAAGCC
GATCTCGACG CCGATGCGCG CTGCGCGCAC CGCATCGCGC AGCGCTGGCG CGACGGCGGC
CGGCAGCCGT TGCTCACGGT CCCGCTGCAG ATTCACGGCG CGCACGACGA TCCGGCATTC
GACTGGCGGC TCGCCGGCGA CTGGGCGCGC ATCTCGTCCG CGCCGCTCGC GAGCCTGCAC
CGGTATCCGG GCGGCCACGA CTTCATGATC CGATGCGAAG CCGAGATCGT CGCGCGCGTG
GCCGCGTGGC TGCGGCCGCA ACGCGCCGCG CGAACGGCGG CGCCCGCGCC GACGTTCGCG
CTGCATTGGC AGCCGCGCGC GATCGCGCCG ATCGCGCCGA TCACGCCGAA TCCGCTCGCG
GCCGCCGACG GCTCGAGCGC CGGCGCTTCA CTCGCCGATG CTTCGCAGGC CGATACATCG
GGCGCCGCCA TGGGTGCAGC GCCGGGCGCC GCGCCGTTGC CGGAATGCGC GGTCTATGCT
GCGGGCGGCG AAGCCAGCGC GTTCGAACGG CTCGCGCCGC TTCTGCGCGA AAGCGGCGAC
GAGGCGGCGC TGCTCTGCAT CGGCGCGGCC GACGATCCGC TCGGCGCCGC GCAATGCGCG
GGCTTCGTCA AACTATGGCA GGCGCTCGCC GCGCGCGAAT GCGCGGGCAC GCTCACGCTC
GTGCTGCCGG CCGGCGCGGC GAGCGGCCCG CTCGTCGGCG CGGCGCGCGT CGCGAGCGCC
GAGCACGCGG CGCTGCGCAT GCGGCTCGTG CTTGCCGACG ATCATCCGGA TCTCGCGCCG
CGCGCCGCCG ATCCGCGTTG GGCCGCCGGG CTTGCACGCG ACGCCGCGCA CTGCGGCGAC
GAGCCGTGGC TGCTGCGCCG CGCCGGCCGG ATGTTCGCGC CGCGCCTGAT GCCGCACGCG
GCGCCCGCGC TGCCCGAAGG CACGCTCGGC GCGGCGGGCG GCCCGTATCT CGTCACCGGC
GCGGCGGGCG GCGTCGGCCG CGCGCTCGTC GACTGGCTGA TCGACGAGCA GCAGGTGCCG
CCGCAGCGAC TCATCGCGCT GTGCCGCAAC GCCGATCTCG CGCCGCGCGG CGTGCGCGCC
GTCGCCGCCG ATCTCGCCGA TGCGCGCGCG CTCGGCGCCG CGCTCGAATC GATCGGCGAA
GTCGAGGGCA TCTTCCATCT GGCCGGCGTG CTCGATGACG GCGTCATCGC GAATCTCGAC
GAAGCGCGGC TGCGCCGCGT GCTCGCGCCG AAGCAGAGCC TCGCCGCGCT GCTCGCACAC
GCGCCGCGCT GGCGCACGCG CTGGGCGGTG GCGTTCTCGT CGACGAGCGC GCTGCTCGGC
GTGCCCGGCC AGGCGAATTA CGCGGCCGCG AACGCGTGGC TCGATCAGCT CGCGTGCTGG
CCCGCGCGGG CGGGCGAGCC GCCGGTGCTG AGCATCCAGT GGGGCAGTTG GGCCGACGTC
GGCATGAGCG CGCGCAGCGA CCGCGCGCAG CGCCGCGCGA TGCAGGACGG CGAGCGTGCG
CTCGCGCCCG ACGCCGCGTT CGCCGCGCTC GGCGCGCTGC TCGCCGGCAT GCTCGGCGGC
GCGGCGGCGG CACGCCAGTT CGCGGTCTGC GACGTCGACT GGCCGCGCTC ACCGTGGCGC
GACGCGCCGA TCGTCGAAGC CATCGCACGC GGCGCGCCGA ACGCGGGCAG CGAAGCCGAG
GCGGCCAAAG TGGCGAGCGC GACGAACAGG ACGAAGCGAG CGGACGAAGC GAGCGGCGCG
AACGGCTGGA TCGCGGCGAG CTCGACGCCG GTCGCCATGT CGAGCGCCAC GCCGGCCGCT
CATGTTTGCG GCAACGCTTG CGTCGATGCG GCTGCTCCGC GCGCCGGCGA TGAATCGGAC
CGCGCACGGT CGCCCCGCCC GGCAAGCGAG CCCGCGCGCT TTGCCGCGCC AACCGTCGAA
GCCGCCGCGG TCGCGCAAGC GCCGGCCGCG CGGGCGACGC GATCCGCGAG CGCACCCGAC
CCCGTCCGCG CGTTCCTCGA AGACTACGTG AGCCGCTGGG ACGAGCGGCT CGATCTCGCG
ACGCTCGGGC TCGATTCGCT CGATCTCGCG CAGATGCGCA ACGGCTTTTT CAAGAAATTC
GGTGTGCAGA TACCGCTTTC GACGCTCGCA TCGCCGACGC TGAAGATCGG CGAGCTGGCC
CGCCGGATGC GGAACGTCGC CGGCATCGCG GACGAGTAG
 
Protein sequence
MRQIVSTSNG SLDSRARIIA FPFAGGSGAS YADIARELGD AFAFTTLSLP GRGATQHIPF 
YDDWPSLVDD LAAEVARLDD GTPFFLFGHS LGALLAYEVA RTLEQRGGAR PTGVFLSGHP
APSRERAAPA WRRQTHALPD AEFVEAVRRW GFFPEGALDD PDVARYVLPP LRADLRLAET
YRHARGDALR APCAIYGGAA DPSTSVDDLR AWRAHVSPQH ACPVESFDGH HFYFLDPSPR
AALCASLAGH IEQRLAQRPA SIVAAAPDPG SRVAACLRTA GDWGDAQAVL AAIREHVART
PDALALQDGE RTWTYRQLAS HAAELADAFR AAGVGRQDVI GVFLPHGAEY VLTIVGAWSI
GASVCLLEKS WPDSLVGEFV ASCRVAQLAT TPALLARAAK HLPDARCTLV GARPPLAQRA
WTSVAPRPDD IAFVSLTSGS TGKPKAVLTT HVGTSYCFHA RDALYPYADG EREGLNVFLA
WECLRPLMFG RPAVVIGDDV IFDPPRLVAL LRRERITRLV VTPSLLESVL DFPGIAGQLR
DALAHMSAWF LMGEVVPQRV VDKARAAFPP SVRLVNAYST WESLDVCYAD LLPSRASDDG
RRVPIGRPLP GCALAVLDEA GRAVPAGETG ELYIASPALG PGYLDDAART AEKFLPSVAA
LAERGHDAPA YRTGDRARLL PDGQIAILGR IDNTVKIRGF KVLLHAIENV LDAVDGVSKT
LVVPIDDPHT KQPSALAAYV VGRGGAPSET TLARLRQQAR AKLPEYAVPA HFIGLEAFPL
RAGTSRKLDR RALPPPEPPA GASPAERGAV PAAAGDGLEA RLAEVWRDVL GVASVARDDN
FFELGGNSLS AAKAVGALGE RLGLSLAVVD LYQHSRLRNL ADYCRRSREE DAARDARAAR
TAPGAGAPKV AIVGMAGRFP GADSVDAFWR NLTQGVDSLS RFTREQLLAK GVDAAQLDHP
DWVSAAQVVG DADKFDALFF GIGRREAVLM DPQHRLFMEV AWSALEQAGY ARAENRYRTR
TGVFASCGID GYLVHHLQGG GLLTPLDPGR LMLTEIGNEK DYVATRVAYQ LDLGGPAVSV
GAACSSALVA VVQAVQAIRA GQCEMAIAGA SALSFPNFGF CYEDGLVGSA DGHVRPFDAR
ASGTLFGDAV GAVVLKRLDL AEADGDPILA VISGVGLSND GRMKAGYTAP NADAQRRCIV
DALDMADVRS EQISYVECHA TATLIGDAIE LKGLSDAFAQ TRGADADAAA VAGRCAIGCV
KGNIGHANCA AGITGLIKTV LQLRHRQRVP TVHFDTLNPK LVPFVEHDAS PFVVQRHADD
WTVADPATQL PRRAGVSSFG IGGTNAHVIV EEAPPPARAA APAESVPRAH HLMTVSARTP
GALARNLQAL AGRLAALDAA DLGCAAHTLH VAREAHPLRV ALAVPARPAD AAAALRASAA
ALAEASVDMP DMSDAPAAPV RAKPGATVAF CFSGQGSQHP GMARELYRSN TEAGRFRHHF
DAACAALERA LGAPIAYAIL NADDEAMRRP LVTQCGLFAL EHALAAVLGE YGVRPVAVAG
HSIGQYAAAV AAEALTLEQA AALVAARAGA TEALNAFSAG GGAPARGGML AVTGDDARIE
QWAAGRADVW IAVRNAPRTL VLAGAQPALA DAARALAALG CRCRPVPVSH PFHTPLMQPV
ADAIIAQRIA GAAPRIPMTC NVTGGWLGAD AASPGYWARH LLEPVRWSDN VAALLRWQPD
VVLEIGPGTV LCSLIGKHLA GGANAGAPAR PNGHAGANAA ALAPRVLPTL PAAGDDAHDA
AHFCNTIGQL WCAGVPIDWR AYHAHEAAAP GRALARVPLP GYAFERDSYW TRPRASIYVD
AAQADAEPDA APAAADHTDA ADADSARAAL ADASDKTPAA RAASTHTCSG ECACERACAA
EPRPSGGVER VAAPPAPSSR GLVRLKPRRR PRMKLYCFPY AGGSSRSFDG WARTAPDWLD
IVAIEWAGRN ARAEESLARD DAADLAAREA IAAAIVADAG ELPVAFCGLS YGGAAATELL
GGPLRAWAAS GRVKGLAIVG RAPLLEQPAI DAPADSFLLV PDALRDDPLW QEIFRPVLEA
DLDADARCAH RIAQRWRDGG RQPLLTVPLQ IHGAHDDPAF DWRLAGDWAR ISSAPLASLH
RYPGGHDFMI RCEAEIVARV AAWLRPQRAA RTAAPAPTFA LHWQPRAIAP IAPITPNPLA
AADGSSAGAS LADASQADTS GAAMGAAPGA APLPECAVYA AGGEASAFER LAPLLRESGD
EAALLCIGAA DDPLGAAQCA GFVKLWQALA ARECAGTLTL VLPAGAASGP LVGAARVASA
EHAALRMRLV LADDHPDLAP RAADPRWAAG LARDAAHCGD EPWLLRRAGR MFAPRLMPHA
APALPEGTLG AAGGPYLVTG AAGGVGRALV DWLIDEQQVP PQRLIALCRN ADLAPRGVRA
VAADLADARA LGAALESIGE VEGIFHLAGV LDDGVIANLD EARLRRVLAP KQSLAALLAH
APRWRTRWAV AFSSTSALLG VPGQANYAAA NAWLDQLACW PARAGEPPVL SIQWGSWADV
GMSARSDRAQ RRAMQDGERA LAPDAAFAAL GALLAGMLGG AAAARQFAVC DVDWPRSPWR
DAPIVEAIAR GAPNAGSEAE AAKVASATNR TKRADEASGA NGWIAASSTP VAMSSATPAA
HVCGNACVDA AAPRAGDESD RARSPRPASE PARFAAPTVE AAAVAQAPAA RATRSASAPD
PVRAFLEDYV SRWDERLDLA TLGLDSLDLA QMRNGFFKKF GVQIPLSTLA SPTLKIGELA
RRMRNVAGIA DE