Gene Francci3_2931 details

Gene Information

Locus tag: Francci3_2931
Symbol:
ID: 3903995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3454985
End bp: 3462667
Gene Length: 7683 bp
Protein Length: 2560 aa
Translation table: 11
GC content: 73%
IMG OID: 637880252
Product: beta-ketoacyl synthase
Protein accession: YP_482018
Protein GI: 86741618
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR00517] acyl carrier protein
            [TIGR02813] polyketide-type polyunsaturated fatty acid synthase PfaA
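
The gene and protein lengths listed above can be checked against the start and end coordinates. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3454985
end_bp = 3462667

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# One codon per three bases; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.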


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.103099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0735457
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGAGT CCGACCGCCC TGACGACCGA CGACTCGCGA ACAACCCGAT CGCGATCGTC 
GGGCTCGCCG GACTGTTCCC GATGGCGCGC GACGTGCGGG AGTACTGGAG CAACGTCATC
GACGCCGTCG ACTGCGTCAC CGAGGTGCCG TCGACGCACT GGCGCCCGGA GGACTACTAC
GACCCGGATG CGACGGTGCC CGACCGTACC TACGCGCGCC GGGGCGGCTT CCTCCCCGAG
GTCGCGTTCA GCCCGGTGGA GTTCGGCTTT CCGCCCAACC TGCTGGACGT GACCGCGGTC
GTGCAGCTGC TCAGTCTGAT CGTCGCGCGG GATGTGCTCG TCGACGCGGG CGCACCGGGA
TCGGCGTGGT ACGACTCCGA GCGCACCGGG GTCATCCTGG GCATGGCCGG CCCCACCACG
ATGGCCCACC CGCTGGCGGC CCGGCTGGAG ACGCCGGTCC TGCGCGAGGT CGTGCGCGCC
GCGGGACTCG ACGAGGCCGA AACCGCCGCG GTCTCGGAGG CCTTCTCGCT CGCGTTCCCG
CCGTGGGAGG AGAACTCGTT CCCCGGGCTG CTCGGCAACG TCGTCTCCGG CCGGATCACC
AACCGGCTCG ACCTCGGAGG CACGAACTAC ACGGTCGACT CCGCGTGCGC GAGTGGGCTC
ACGGCCGTCC GCGGGGCCAT CGCCGAACTC ATCGACCACC GTGCCGATCT CATGATCACC
GGTGGTTGTG ACACCGAGAA CTCGATCTTC AGTTACATGT GCTTCTCGAA GACGCAGGCG
CTGTCCAAGT CCGACCGCAT CCAGCCGTTC GCCGCCGGCG CCGACGGCAC CCTGGTCGGC
GAGGGCATCG GGATGCTGGC GCTCAAGCGG CTGGCCGACG CCGAGCGCGA CGGGGACCGG
ATCTACGCGG TCATCCGGGG TCTCGGGTCG TCCAGCGACG GTCGGTTCAA GAGCGTCTAC
GCGCCGCGCG CGGAGGGCCA GGTCCGCGCG CTGCGCCGGG CCTACGCGGA CGCCGGGGTG
AGCCCGGCGT CCGTCGAGCT GTTCGAGGCG CACGCCACCG GTACCGCCGT CGGTGACGCC
ACCGAGCTGT CCGCGCTCGG GGCCGTCCTG GCCGAGGCCG GGGCACGCCC GGGCCGGGCG
GCGATCGGCA GTGTGAAGTC GCAGATCGGG CACACCAAGG GCGCTGCCGG CGCGGCCAGC
CTGATCAAGC TGTCGCTGTC GCTGTACAAC CGGGTGCTGC CGCCCACGAT CAACGTCCGC
GAACCGAATC CGGCGCTCGC CCAGGAGGGC TCGCCCTTCT ACCTCAGCAC CCGCACCCGG
CCCTGGGTGC TCGACCCGGA CGTGGGGCGG CGCCGGGCGG CCGCCTCCGC CATGGGGTTC
GGCGGTACCA ACTTCCACGT CGTCCTGGAG GAGACCCCCG ACACGGCACG CCCGGACGCC
GATAAGCGTG CCGCCCGGGC GACGCACCGT GCCGTCCGCG CCCACCTGTG GCATGCCGCG
ACGCCCGCGG AGCTCCTCGA CCTGGTGCGC GGACAGGCCG AACCGAACGG TGACACGGGC
ATCCCGGGCA GCGCCGCGCG GGTCGGGTTC GTCGCGCGCA CCGGCGCCGA GGCGGAGCGC
CTGCGCGCGA TCGCGATCGA GCAGCTCGCG GCGCGAGCCG ACGCCGCCGA ATGGTCGCAT
CCCGCCGGGG TGCACTACCG CCGGCAGGCG GCCGAGCAGC CCCGCGTCGG CGTCGTCTTC
GCCGGCCAGG GCAGCCAGTA CGTCGACATG GGGCTCACCG CCGCCGTCGA GATCCCGCCG
GTGCGGGAGG CCTTCGACGC GGCGAACGCC ACCTTCGCCG GGGCCGAGCT CACGCTCGCC
CGTGCGGTGT TCCCGCCGCC GAGCTTCGAT CCGGGCGACC GTGCCGACCA CGAGATGAAC
CTGACGCGCA CCGAGTACGC GCAGCCCGCG ATCGGTGCGC TCTCCGTCGG CCAGTTCCGG
TTCCTGCGGG AGCTGGGACT GCGCCCGGTG AGCCTGCTCG GCCACAGCTT CGGGGAGCTG
ACCGCCCTGT GGGCCGCCGG TTCCCTCGCC GACGCCGACT TCTTCCGGCT CGCCCGGCGT
CGCGGCCAGG CCATGGCCCC GCCGGACGAA CCCGGCTTCG ACCCCGGCAC CATGGCCGCC
GTCCGCGCCG ACCGCGACGC GGTCGCCGAG CTGCTGACCG CGCACCCGGA CGTGGTCGTG
TGTAACCACA ACGCCCCCGA CCAGGTCGTC GTTGGTGGTC CGACGCCCGC CGTCGAGGAG
TTCGTCACCG CCGCCACCGC GAACGGGCTC GACGCCCGGC GCATCTCCGT GGCCGCGGCG
TTCCACACCT CGCTCGTCGG GCACGCGGTG GACGACTTCG CCGCCGCGGT GGACGTCACC
GAAATCGGAT CGCCGGCCGT GCCGGTGCAC GCCAACACCG CCGGCGCCCG GTACGGCGAC
GACCCCGCCG AGAACCGCCG GGTCCTCACC CGGCAGCTGG GCAACCCCGT CGAGTTCGTC
GCCGGGCTGC AGGCCCTGCA CGCGGACGGC GCGACGGTGG TTGTCGAGAT CGGGCCGAGC
CAGGTGCTGA CCGGTCTGGT CCGGCGCACC CTCGGCGACG ATGTCGTGGC GATCCCCACC
GACGCCGGTC GCGACGGCGA CGGCGCGGTG GCTCTGATGA CCGCGGCCGT GCGCCTCGCG
GTGCTCGGCG TCGAGCTCAG CGGGCTCAAC CGCTATACGG CCCCGCTGCC GCAGCAGGTG
CAGCCGACCG GGATGACGGT TGCGCTGACC GGGGCGGACT ACGTCTCGGA GCCTCGCCGT
CGCGCCTACG CCGACGCGCT GGTCACGCTC GCGGCCGCCG CGGCCGTCCG GCCGCGGGAG
TCGGCCGCCG CGGCGCTCGA AAACGTCCCG ACCGCCACCG AGCGGCGCGT CCCCGAAGCG
GCACCGGCGC CCACCCGGCC AGTACCCACC CAGCCGGCAC CGGCGCCCAC CCGGCCAGTA
CCCACCCAGC CGGCACCGGC GCCCACCCGG GCGGTACCCG TCCCGACGGC CATCGCTCCG
GTAGCGCCTG CTCCGGTAGC GCCTGCTCCG GCGGCAGCCA CCCTGGCGGC CCCGGCCCCG
AGCGCCGGTG AGGTCGCCGG CATCAGCGAG GTCGCCCGCG ACCATCTGGC CCTGCACTCC
CAGTTCCTCG ACACCCAGCT CCGGGTCGCC GAGGGCCTCG TGCAGGTCCT TCGCGCGGCC
CCCGCGGACA ATGGGGCGGG TCAGGAGATC CGGGCCGCTG CCGACGCTGT GACCCGGCAG
AGCCTGGCCG TGGGTGAGGC CCATATCCGG GTCAACGAGA TCCTGGCGTC CCTGACCGAC
CTGGAGTACG GCAACGGCGT CGGCGGCCAC GGCGCCGGCG GGCTCGCGGC CGGCGGGGCG
CCGGGCGGCG CGCTCACCAC CGGTACGGAA CCGACGGACC TGTACGGACG GGGAACGGCG
GGCGATCTGC GTACGCCGGG AATCCCGGCG CCCGGGCTGC CGGCCGTCCC GCTTCCGATC
CCCAGCACCG GGAACAGTAA CGGGCGTCCG GCGGGTGCGA TCGCGGCTGC CCCGCCCGTG
CCTCCGGTGA CCGTCCCATC CGTCCCATCC GTCCCATCCG TGCCTCCGGT GACCGTCCCA
TCCGCGGCCC CGGCGACCGT GGGGTCGCAG CCGGCCACCG CTGGGATTGC CCAGCCGGCG
ACGGCCGAGC CGCCGGCGGG CGTCGACGTG TCGGTTATTG AGTCGGCGGT GGCGGAGGTG
GTGGCGGAGA AGACGGGTTA TCCGGTGGAT GTGCTGGAGC CGTCGATGGC GTTGGAGTCG
GATCTGGGGG TTGATTCGAT TAAGCGGGTG CAGATTCTGG GGTCGTTGCA GGAGCGGTTC
CCGGGTGTTC CTGCGGTGGG TCCGGAGCGG GCGGCGGAGA TGCGGACGCT GGCGGATGTG
GCCTCGTTCC TGCGTGATGG CCTGGGTTCG GTGGTTCAGG GTTCGGCGGG TGCGGGTGCG
GGTGCGGGTG CGGGTGCGGG TGTGGGTGCG GGTGTGGGTG TGGCTGCGGT CGACGTGTCG
GTTATTGAGT CGGCGGTGGC GGAGGTGGTG GCGGAGAAGA CGGGTTATCC GGTGGATGTG
CTGGAGCCGT CGATGGCGTT GGAGTCGGAT CTGGGGGTTG ATTCGATTAA GCGGGTGCAG
ATTCTGGGGT CGTTGCAGGA GCGGTTCCCG GGTGTTCCTG CGGTGGGTCC GGAGCGGGCG
GCGGAGATGC GGACGCTGGC GGATGTGGCC TCGTTCCTGC GTGATGGCCT GGGTTCGGTG
GTTCAGGGTT CGGCGGGTGC GGGTGCGGGT GCGGGTGCGG GTGCGGGTGC GGGTGTGGCT
GTGGGTGCGG GTGTGGCTGC GGTCGACGTG TCGGTTATTG AGTCGGCGGT GGCGGAGGTG
GTGGCGGAGA AGACGGGTTA TCCGGTGGAT GTGCTGGAGC CGTCGATGGC GTTGGAGTCG
GATCTGGGGG TTGATTCGAT TAAGCGGGTG CAGATTCTGG GGTCGTTGCA GGAGCGGTTC
CCGGGTGTTC CTGCGGTGGG TCCGGAGCGG GCGGCGGAGA TGCGGACGCT GGCGGATGTG
GCCTCGTTCC TGCGTGATGG CCTGGGTTCG GTGGTTCAGG GTGCGGGTGC GGGTGTGGCT
GCGGTCGACG TGTCGGTTAT TGAGTCGGCG GTGGCGGAGG TGGTGGCGGA GAAGACGGGT
TATCCGGTGG ATGTGCTGGA GCCGTCGATG GCGTTGGAGT CGGATCTGGG GGTTGATTCG
ATTAAGCGGG TGCAGATTCT GGGGTCGTTG CAGGAGCGGT TCCCGGGTGT TCCTGCGGTG
GGTCCGGAGC GGGCGGCGGA GATGCGGACG CTGGCGGATG TGGCCTCGTT CCTGCGTGAT
GGCCTGGGCC TGACCGCGCC GGCCAGCGCC GCACCCGAGT CGACCCCGCC CACCGCTACT
GAGACGGCGG CGTCGATGCC CACGGCGACT GCATCGGCGC CGCACCCCGT TCCGACCGAG
GCGCCCATCG AGCGGCTGCG CGCGGTGCCG CGCCTTCTGC CGGCCGTAGA CCCGCTCGAC
GATCCTTTCG GCGTCCAGCC CAGCGCGCTC GTGATCGAGG TCGGTGACGC CGGTGGTGTG
GCGCTGGTCC GCCACCTCAG GGAGCGCGGC ATCTCGGTCG AGCGGATCGC GATCGGGTCC
CTTGCGGGTT CCGACACCGC CGGCCTCCTC GGCTGGGATG GCGCCGCGGT CGAGGCGGCG
CTCGCGGCTG TCGCCGGATC GGTCCGCCTG GACCTCTGCC TGCTCGCCCT CGGCGCTCCG
GCATCACCGG GCGCCGGATC CGGGGCTGAG CCGGACCGCG CCGGGGAGGA CGATCGAGCC
GGGATCGGCG CCACCGGGCT GGCCGACTTC GACGTCGCCG CGCGTGGGCT CGCCGACGCC
ATCCTGGCCA CGAAGGGGCT CGTCGTACCA CTGCGGGCGG CGGCCTCGGC GGGCCGGCGC
GCGGCCTTCC TAACGGTGAC GCGGGTCGAC GGGCTGCTCG GCCATCGGGG CGGGTCCTCG
ATAGCCGGGG CGGTCACCGC GGGGATCACC GGGCTGGTGA AGACCGTCGC GCTCGAGGCA
CCTGAGCTGG TCTGCCGGTC GGTCGACGTC CATCCGGCGC TCGACGACGA CACCTTCGCG
GCGGCCGTCC TGGCGGAGGC CGTCGATTCC GCCCGGGACG TCGTCGAGGT CGGTCTGGAT
CCCGAGGGAC GCCGCTGGTC CGTCGAGTAT GTCCAGGGCG GGGTGACCGG GGTGACCGGG
GTGACCGGGG TGGCTGAGGC GGCCGGGGTG GCTGAGGCGG CCGGGGTGAC CGGGACGACC
TCCCCGGCGA CGCTCCGGCC CCCGGTGGGC CCGGATGACG TCCTGGTGGT GACCGGCGGA
GCCCGCGGAG TGACCGCGCG CTGCGTCGAG GAGCTGGCGC GCCGCGCCTC CTGCGAGTTC
GTCCTGCTCG GACGCACGAA TCTGGTTGAG GAGCCGGGCT GGGCGGCCGG AGTCGAGGAT
GACGGCCTCA AGGCGGCGTA TCTCGCCGGG GCCCGTGCCG AGGGACGCAA GGTGCGGCTG
CCCGACGTCG ATGCCGCGGC TGCCGCCGTG ATCGCCGTCC GGCAGGTGCG GGCCACCCTC
GGTGACCTTG CCGCGACCGG CGCGCGGATC CACTACCAGG CCGTGGACGC CTCGGATCCG
GTGGCGACGG CCGCGGCGCT CGCGCCCTGG CGTGACCGGG TCACCGGCGT CGTGCACGGC
GCCGGCGCTC TCGCCGACGC GATGCTCACG GACAAGACCG TCCCGGCCAT CCGGTCGGTG
CTCACCCCGA AGCTCGCCGG CCTGCGCTCC GTGCTCACGG CGCTGGAGGG GGCACCGCTG
CGTCATCTCG TGCTGTTCGG TTCGGTCGCC GGTGTCTTCG GTAACCCGGG GCAGGCCGAC
TACGCCACGG CCAACGAGGC GCTGACCAGG GTCGCGGCCT GGGCCCGGAC CGGGCTCACC
GGCCGGGTGG TCGCCCTGAA CTGGGGCGCC TGGGACGGCG GGATGGTCAC GGCCGACCTG
CGGGAGCTCT TCCGCTCCCG GGGCGTGACG CTGCTGCCGC CGGACGAGGG CGCGCGCCGC
TTCGTCGCGG AACTCGTCGA CGGCCACGGC GCGGATGGCG CGGTCCTCAT CGGTGGATCC
GCTCCGCTCG CGCGGGCCGC GGGTGGCACG GCGCCCGCCG CGTTCGTGGC GCACCGTTCG
ATAGCGGCTA TGGACACCGA GACGGTCATC GACCAGCACC AGATCGGGCC GGCCCCGGTG
CTGCCCGCCA CCGCCGGTCT CGGCTGGTTG GTCAACGGGG TCGAACGAGC CTTGCCCGGG
CGTCAGGTCG TCGAGGTCCG GAACTTCATC GTGCACAAGG GGATCGTGTT CGACGGGGCG
CACCTGCGGG ACTACTGGCT CGACGCGCGG CCCGTCGACG GGACCGGCGG TGGGCCCGGT
GACGGGCGTG ACGGCGGGCC GGTGACGGTG TCCGCGACCG TTCGCGGGGA CAGCGGCGGC
GCGTTGCCGA CCTCGCACTA CGCGGGCACC CTCGTGCTCG CCGAGGTCAC CGAGGTCGGC
AAGGCCGCCG GGGCCGCCCC GACCGCCCCG GGCTGGCCCG CCGATGGTTA CCGGCTCGGC
GAGCAGGGCG AGGACGCGGC CTGGGTCTAC ACCGACGGGC TGCTCATCCA CGGGCCGTTG
ATGCGGGGCA TCCGGCGCCT GCTCGTGCGG GAGCCGGGCC GTCTGGTGGC TGAGTGCCGC
CTGCCGGAAC TGCCCTTCGC CGGCGGCGCG TTCGCGGGCG CGCTGCACCG CCCGGTCCTG
TGCGACATCC TCGCCGCGGC CGGGAGCGTG CTCGCCCGGT GGACCCTGAA CGAGGTGGGC
GCCCTGCCGA TCGAGATCGG GCACGCGGAG TTCTTCGCTC CGCTGCCGGC CAACGAGACG
TTCATCGTGG TCGTCGACGA CGTCCGACCC GGCCCGATGA CGGTCACGGT GACCGTGACC
GCCCACACCC CGGATGGACG GGTGCTGCAG CGCCTGTCCG ACTTCACCTT CGTCGGCACC
CCGGACATGG CGGATCTGAT CCGTCAGGGG GCGCAGCGGT GGCAGTCGGG TGACCGGGCA
TGA
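
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence printed in 10-base blocks. This is a minimal sketch, not the method used by the database; `gc_fraction` is a hypothetical helper name, and the example input is just the first two blocks of the gene sequence above rather than the whole gene.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    # Join the space- and newline-separated blocks into one continuous string.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence above (illustration only).
example = "GTGGACGAGT CCGACCGCCC"
print(f"{gc_fraction(example):.0%}")
```

Passing the full gene sequence, pasted as printed, would give the gene-wide value.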
 
Protein sequence
MDESDRPDDR RLANNPIAIV GLAGLFPMAR DVREYWSNVI DAVDCVTEVP STHWRPEDYY 
DPDATVPDRT YARRGGFLPE VAFSPVEFGF PPNLLDVTAV VQLLSLIVAR DVLVDAGAPG
SAWYDSERTG VILGMAGPTT MAHPLAARLE TPVLREVVRA AGLDEAETAA VSEAFSLAFP
PWEENSFPGL LGNVVSGRIT NRLDLGGTNY TVDSACASGL TAVRGAIAEL IDHRADLMIT
GGCDTENSIF SYMCFSKTQA LSKSDRIQPF AAGADGTLVG EGIGMLALKR LADAERDGDR
IYAVIRGLGS SSDGRFKSVY APRAEGQVRA LRRAYADAGV SPASVELFEA HATGTAVGDA
TELSALGAVL AEAGARPGRA AIGSVKSQIG HTKGAAGAAS LIKLSLSLYN RVLPPTINVR
EPNPALAQEG SPFYLSTRTR PWVLDPDVGR RRAAASAMGF GGTNFHVVLE ETPDTARPDA
DKRAARATHR AVRAHLWHAA TPAELLDLVR GQAEPNGDTG IPGSAARVGF VARTGAEAER
LRAIAIEQLA ARADAAEWSH PAGVHYRRQA AEQPRVGVVF AGQGSQYVDM GLTAAVEIPP
VREAFDAANA TFAGAELTLA RAVFPPPSFD PGDRADHEMN LTRTEYAQPA IGALSVGQFR
FLRELGLRPV SLLGHSFGEL TALWAAGSLA DADFFRLARR RGQAMAPPDE PGFDPGTMAA
VRADRDAVAE LLTAHPDVVV CNHNAPDQVV VGGPTPAVEE FVTAATANGL DARRISVAAA
FHTSLVGHAV DDFAAAVDVT EIGSPAVPVH ANTAGARYGD DPAENRRVLT RQLGNPVEFV
AGLQALHADG ATVVVEIGPS QVLTGLVRRT LGDDVVAIPT DAGRDGDGAV ALMTAAVRLA
VLGVELSGLN RYTAPLPQQV QPTGMTVALT GADYVSEPRR RAYADALVTL AAAAAVRPRE
SAAAALENVP TATERRVPEA APAPTRPVPT QPAPAPTRPV PTQPAPAPTR AVPVPTAIAP
VAPAPVAPAP AAATLAAPAP SAGEVAGISE VARDHLALHS QFLDTQLRVA EGLVQVLRAA
PADNGAGQEI RAAADAVTRQ SLAVGEAHIR VNEILASLTD LEYGNGVGGH GAGGLAAGGA
PGGALTTGTE PTDLYGRGTA GDLRTPGIPA PGLPAVPLPI PSTGNSNGRP AGAIAAAPPV
PPVTVPSVPS VPSVPPVTVP SAAPATVGSQ PATAGIAQPA TAEPPAGVDV SVIESAVAEV
VAEKTGYPVD VLEPSMALES DLGVDSIKRV QILGSLQERF PGVPAVGPER AAEMRTLADV
ASFLRDGLGS VVQGSAGAGA GAGAGAGVGA GVGVAAVDVS VIESAVAEVV AEKTGYPVDV
LEPSMALESD LGVDSIKRVQ ILGSLQERFP GVPAVGPERA AEMRTLADVA SFLRDGLGSV
VQGSAGAGAG AGAGAGAGVA VGAGVAAVDV SVIESAVAEV VAEKTGYPVD VLEPSMALES
DLGVDSIKRV QILGSLQERF PGVPAVGPER AAEMRTLADV ASFLRDGLGS VVQGAGAGVA
AVDVSVIESA VAEVVAEKTG YPVDVLEPSM ALESDLGVDS IKRVQILGSL QERFPGVPAV
GPERAAEMRT LADVASFLRD GLGLTAPASA APESTPPTAT ETAASMPTAT ASAPHPVPTE
APIERLRAVP RLLPAVDPLD DPFGVQPSAL VIEVGDAGGV ALVRHLRERG ISVERIAIGS
LAGSDTAGLL GWDGAAVEAA LAAVAGSVRL DLCLLALGAP ASPGAGSGAE PDRAGEDDRA
GIGATGLADF DVAARGLADA ILATKGLVVP LRAAASAGRR AAFLTVTRVD GLLGHRGGSS
IAGAVTAGIT GLVKTVALEA PELVCRSVDV HPALDDDTFA AAVLAEAVDS ARDVVEVGLD
PEGRRWSVEY VQGGVTGVTG VTGVAEAAGV AEAAGVTGTT SPATLRPPVG PDDVLVVTGG
ARGVTARCVE ELARRASCEF VLLGRTNLVE EPGWAAGVED DGLKAAYLAG ARAEGRKVRL
PDVDAAAAAV IAVRQVRATL GDLAATGARI HYQAVDASDP VATAAALAPW RDRVTGVVHG
AGALADAMLT DKTVPAIRSV LTPKLAGLRS VLTALEGAPL RHLVLFGSVA GVFGNPGQAD
YATANEALTR VAAWARTGLT GRVVALNWGA WDGGMVTADL RELFRSRGVT LLPPDEGARR
FVAELVDGHG ADGAVLIGGS APLARAAGGT APAAFVAHRS IAAMDTETVI DQHQIGPAPV
LPATAGLGWL VNGVERALPG RQVVEVRNFI VHKGIVFDGA HLRDYWLDAR PVDGTGGGPG
DGRDGGPVTV SATVRGDSGG ALPTSHYAGT LVLAEVTEVG KAAGAAPTAP GWPADGYRLG
EQGEDAAWVY TDGLLIHGPL MRGIRRLLVR EPGRLVAECR LPELPFAGGA FAGALHRPVL
CDILAAAGSV LARWTLNEVG ALPIEIGHAE FFAPLPANET FIVVVDDVRP GPMTVTVTVT
AHTPDGRVLQ RLSDFTFVGT PDMADLIRQG AQRWQSGDRA
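
The protein sequence starts with M although the gene sequence starts with GTG. Under translation table 11, GTG can serve as an initiator codon and is then read as methionine. The sketch below illustrates this for the first 30 bases only. It is a hedged example: `CODON_TO_AA` is deliberately partial (only the codons needed here), `START_CODONS` lists only a subset of the table 11 initiators, and `translate_prefix` is a hypothetical helper name.

```python
# Partial codon table: only the codons that occur in the example below.
# A complete table has 64 entries; unknown codons raise KeyError here.
CODON_TO_AA = {
    "GAC": "D",
    "GAG": "E",
    "TCC": "S",
    "CGC": "R",
    "CCT": "P",
    "CGA": "R",
}

# Subset of the initiator codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate the start of a CDS, reading the initiator codon as M."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any incomplete trailing codon
    codons = [bases[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a known start codon")
    # The initiator codon is translated as methionine regardless of its
    # usual meaning (GTG would otherwise encode valine).
    return "M" + "".join(CODON_TO_AA[codon] for codon in codons[1:])


# First three 10-base blocks of the gene sequence above.
print(translate_prefix("GTGGACGAGT CCGACCGCCC TGACGACCGA"))
```

The result can be compared with the first 10-residue block of the protein sequence above.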