Gene Francci3_0926 details

Gene Information

Locus tag: Francci3_0926
Symbol:
ID: 3906090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1073531
End bp: 1081417
Gene Length: 7887 bp
Protein Length: 2628 aa
Translation table: 11
GC content: 75%
IMG OID: 637878260
Product: beta-ketoacyl synthase
Protein accession: YP_480039
Protein GI: 86739639
COG category: [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
    [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR02813] polyketide-type polyunsaturated fatty acid synthase PfaA
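
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. The function names are illustrative only and do not belong to any database API. The calculation assumes inclusive 1-based coordinates and a CDS that ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1073531, 1081417)
print(length)                  # 7887
print(protein_length(length))  # 2628
```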


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.526066
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCACTG CACCCAGCGC GTCACCCGTG GCCTCGCCGG CCAATCCCGC CGACCGCGAT 
CCGCTCGACG CCCGCGAGCT GGTCGTGGTG ATCACGCCGT TCGCCGAACC GTCCGCGTCC
CTGGTCGCCG CGGTGCAGCG GGCCGGAGGG CTCGGGGTGC TGGATCTCGG GCTGCTGGCC
GGACCGGCGC GTGATGCGCT GCGGGACGCC GCGCGGTGGG CGCCGGGCCG GTTCGGGGTG
CGGGCTGCCC CCGGCTGCCC CCTCGATCCC GACGAGCTGC CCGCGAGCGT CGGGACCGTC
GTACTCGTCG CCCGATCCGG TCCGGATCCC CGATCCGGCC CCGTCGCGTG GGCGGATGCC
GGGCCGGGCC CGGGGAGCCG CTGGGACATC GGCGCGCTGA CGAGCCGGTG GCGGACGAGC
GGGCCGCGTG GAGAGGCGCG GCGCCGGGTC TGGGCGGAGG TCACCTCGGT TGCGCAGGCG
CGCGACGCGG TGGCGGCGGG GGTCGAGGGA CTGATCGCGC GAGGCAACGA GTCCGGGGGG
CCTGTGGGCG ACCTCACCAC CTTCACCCTC CTGCAACATC TGCTCGCGCA GGCGTTCGTC
ACCACCGAGG GCGCGCCGCT GCCCGTGTTT GCCGCTGGCG GGATCGGTCC GGCCACCGCG
GCGGCCGCGG TGGCCGGCGG CGCGGCCGGT GTGGTCCTGG ACTCCCAGCT CGCCCTGGTG
CGTGAGATGG AGCTGCCGGC ACAGGTCGCG GCGGCGATCC GCGCGATGGA CGGCAGCGAG
ACCACCCTCC TCGCCGGCTA CCGGGTGTAC ACCCGCCCCG ATCTGCCCGT CGCCGCCATG
ACGGGCGCCG CCATGGCCGG CGCTGCCACG GCGATCCCGT TCCGGCTGGG TGCGCGTGAT
CTGCGTAACC AGTTTCTCCC GGTCGGCCAG GACGGAGCGT TCGCCGCCGA TCTCGCCGAC
CGCCACGTGA CCGCGGGCGG TGTCGTACGG GCCGTGCGTC AGGGCATCCG CGAGGGCATC
GCTGCCGCCG CCCGTACCCG TCCCCTCGCC CCCGGTGCGC CCTTCGCCGC TGCCCGCGGC
CTGCGCCACC CGGTCGCGCA GGGACCGATG ACGCGGGTCA GCGACCGGGC GGCGTTCGCT
CATGCCGTCG CCGACGAGGG CGGGCTGCCT TTCCTCGCCC TGGCGCTGAT GAGCGGCGAG
GACAGCCGCG CCCTGCTGGA GGAGACCGCG GACCTGCTCG GGGACGCGCC CTGGGGGGTC
GGGGTCCTCG GGTTCGCCCC GGCGCCGCTG CGGGCCGCCC AACTCGCCGC CGTGCATGAC
GTGCGTCCTG CCTGCGCGAT CGTCGCCGGA GGTCGCCCGG CACAGGCGGC CCCGCTGGAG
GCCGCCGGCA TCGACACGTT CCTGCATGTC CCGTCCCCGG GGCTGCTGAA CCGTTTCCTG
GCCGACGGCG CGCGCAAGTT CGTCTTCGAG GGGCGTGAAT GCGGCGGGCA CGTCGGGCCG
CGGGCGAGCT TCCCGCTGTG GGAGGCGCAG ATCGCCGGCC TGCTGGAGTT CGGCACGACC
GATCAGGCCG GTCCGGACTT CTTCGCGGGC CTGCATCTGC TGTTCGCCGG GGGCGTGCAT
GACGCCCGTT CCGCTGCGAT GGTCGCCGCG GCGGCCGGGC CGCTCGCCGA GCGGGGCGCG
AGCGTCGGCG TGCTGATGGG CACCGCGTAC CTGTTCACCG CCGAGGCCGT CGCGTCCGGG
GCGATCCTGC CGGGTTTCCA GCAGGCCGCT CTCGACTGCG AGCGCACCGT GCTGCTCGAG
ACCTCTCCCG GTCATGCCAC CCGGTGCGTG GAGTCCGCGT ACGTCCACAC CTTTCTGGGC
CGTCGCCGTG AGCTGATCGC CGCCGGCAAC TCCCGCCAGC AGATGTGGGA GGAGCTCGAG
GCGCTCAACC TCGGCCGGCT GCGGGTCGCG AGCAAGGGCC TGCGCCGTGA GGGCAGCGAG
GTCGTCACGG TCGATCCCGC GACCCAGGCC CGGGACGGCA TGTACATGAT CGGACAGGTC
GCCGCCCTGC GTTCGCGGCG GACGAGCATC GCCGAGCTGC ATGACGAGGT CACCGCGCAG
GCCACGGCCG GGCTCGCCGC CCGCTCCGCC GAGCTGCGCT CCGCCGAGCC GCGGCCCGCC
GACAGGAGCG CGGCGCCCGG CCGCCCGGCC GCCCGGCCGC TCGACATCGC GATCGTCGGG
ATGTCCGCGA TCTTCCCGGA CGCCCCGGAC CTGGCCAGCT TCTGGGCGAA CATCGTCGCC
GGCAACGACG CCATCCGCGA GGTCCCGGCC GATCGCTGGG ATGCCGAGGT CTACTACCAC
CCGGACGCGG TGCTCAGGGA CGCTGGCCGC AGGACGCCGT CGAGATGGGG CGGTTTCCTC
CCGCAGGTGC CGTTCGACGC GCTGGCCTAC GGTATCCCGC CGAAGTCGCT GCGCAGTATC
GAGACCAGCC AGCTCCTCGC GCTGGAGGTC GCCGCCCGCG CGCTGCGCGG CGCCGGGTAC
GACGCGGGCC AGAAGGACGT GGGCCAGAAG ACCGCCGGCG AGAGTCGGGC GTTCGACCGC
TCCCGCACGT CGGTGGTCTT CGGCACGGAG GCCGGGACGG ATCTCTCCGG TGCCTACAGC
TTCCGCTCCC TGTGGCCGGC GCTGCTCGGG GACCTGCCGG CCGAGCTGGA GGAGTTCCTG
CCGGAGCTGG ACGAGGACTC GTTCCCGGGC ATGCTCGCGA ACGTCATCGC TGGCCGGGTC
GCCAACCGGC TGGACCTCGG CGGGGTGAAC TTCACCGTGG ACGCGGCCTG CGCGTCGTCG
CTCGCCGCCC TCGACGCCGC CTGCAAGGAG CTCGTCGCGG GCACCTCGGA CATGGTGCTG
TGCGGCGGCG TGGACACCCA CGCGGGCGCC CACGACTTTC TGCTGTTCTC CTCCGTCCAT
GCGCTGTCTC CGGGCGGTCG GTGCCGCAGC TTCGACGCGG GCGCCGACGG CACCAGCCTC
GGTGAGGGGG TGGCCGTCGT CGTGCTCAAG CGGCTCGCGG ACGCCGAACG GGACGGCGAC
CGGATCCACG CCGTCATCCG GGCCGTCGCC GGTTCCTCCG ACGGGCGGGC CCTGGGGCTG
ACCGCGCCGC GCAAGGCCGG CCAGGTGCTC TCCCTGGAAC GGGCCTACGG GCGGGCGGGG
ATCTCGCCCA GCGAGATCGG TCTGCTGGAG GCGCACGCGA CCGGAACGGT CGTGGGGGAC
CGCACCGAGC TCGCCACCCT CACCGAGGTC TTCACCAACC ACGGTGCGAC GCCGGGCCAG
TGCGCGGTGG GTTCGGTGAA GTCGCAGATC GGGCACACGA AGTGTGCGGC GGGCCTGGCC
GGTCTGATCA AAATGGCCAA GGCCGTCGAG ACCGGGGTAC GGCCGCCGAC CCTGCACATC
GACACGCCGA ACGCCTATTG GGACGCCGAG AGCAGCCCGT TCTACTTTGA CGACGTCGCC
AAGCCGTGGG TTGCCCCGGC CGAGCGGCGC CACGCCGGCG TGTCGGCCTT CGGATTCGGC
GGCACCAACT TCCACGTCGT GCTGTCCGCC TATGACGGCG GGCCCGAGCC CGCCCACGGC
CTCGACAGCT GGCCCGCGGA GTTGTTCGTC GTCCGCGGCA CCGACCGGGC GGCGGCCACC
CGGGCGCTGG ACCGGCTGGC CGAGCTGATC GTCGCGAACG ACGCGGCCGG ACGGCCGTTC
CGCCTGCGTG ACCTCGCCTC CACCGTCTGC GCGGAACGCG GCGGCCCGGT CCAGCTCGCG
TTCGTCGCCG ACGACCTCGA CGCGCTGCCC ACCGCCATCG ACCAGGCGCG CACGTTCACG
TCGAACCCTC GGGCGGGTCT GTTCGTCCGG GACCAGGACG CGGATCCGGG TGCCACCGCG
TTCCTCTTCC CCGGTCAGGG CAGTCAGCGG CCCGGCATGC TCGCCGACCT GTTCATCGCC
TTCCCACGGC TGCGGTCCGT GCTGGACCTC GCCCCCCGGT GGGCGGACAC GATGTTCCCG
CCGGCGGCCT TCTCCCGGGA GCAGAAGGCC GCCCAGGCGG CCGCGATCAC CGACACCCGT
GCGGCCCAGC CGACGCTGGG GCTCGCCGGG CTTGCCGTCC ACGACCTGCT GACCAGCCTC
GGGGTGCGCC CGGACCACGT CGCCGGGCAC TCCTACGGCG AACTGGTCGC GCTGTGCGCC
GCCGGGGCGC TGGACCGGCG GGACCTCATC GGCCTGAGCG AGGCCCGTGC CGCGGCGATC
CTGGCAGCCG CTGGCGACGA CCCGGGCACG ATGGCGGCGG TCTCCGCGTC GGTCGAGCAG
GTCCGCGCCG CGCTCGACGG GGCCGTTACG GGTGGGGAGG CCGGACCGGG CCGGGTCGTC
GTGGCCAACC ACAACGCCCC GCGGCAGAGC GTGATCTCCG GGCCGACGGA CGCCGTTGCC
GCCGCCGTGG CGGCTCTCGG CCGGGCCGGC ATCAGCGCCA AGCCGATCCC GGTCGCGTGC
GCCTTCCACA GCCCGGTGGT GACGGAAGCG GCGACGGCGC TGGCCGCCCG GCTCGATGAG
GTCGAGGTCG AGGCGCCGAC CCTGCCGGTC TGGTCCAACA CGACCGCCGG GCCCTACCCC
GGCAGCCCGG ACGAGGTGCG GGCGACGCTC GCCGGTCAGG TGGCGGCGCC GGTCAGGTTC
GTCGAGCAGA TCGAGGCGAT GTACGCCGCG GGGGTGCGGA CCTTCGTCGA GGCCGGGCCC
GGCCGCGTCT TGGCGCAACT CGTCGGCAGA ATTCTGGCCG GTCGACCGCA CCGGGTGGTG
GCCACCGACG TCGCCGGCGA GCCGGGGCTG CGCCGGTTGC TGCTGGCCCT CGCCGAGCTC
GCCGCCGTCG GTGTCGAGCT CGACCCCTCC GCGCTGTTCA CCGGCCGCGA CGCCCGGGTG
GTCAGCGCGA CGGACCTGCC GCGCCGTCCC GGCTGGCTCA TCGACGGGGC CTATGTGCGC
ACGCCCGACG GCCGGTTCCT TCCTGGTGGG CTGCGGCCGC CGACCCGGCT GACGGTGCGG
ACGACGTCAC CCGGGGTCGC CTCCGTGCCC GGGGCCGCAT CCGTGCCCGA GATCGTCGGC
GTGCGCGGGT TCGTCGAGGA CACCGTGGCG CCGGCTCCCC CGTTCGACCG CGCCGGCGTG
ACCGACCACA GCGAGACGAC CGACGACCCT GCGGCGGGGC CCGCCGGACT GGCCGAACTT
GGAGGTGACG GACCGGTGGA CGCCGTGGTG CTGGAGTTTC TGCGAACCAC CCGGGAACTG
GTCGCGGCGC AACGTGACGT CGTACTGGGC TACCTGGGCA TCGACCCGAC CGGCCTGGCC
CGGCCCGGCC AGCTCCCCGA GGGGCGGCCG GCGGACCTTG TGCGGCCGCG GCCGGGGGTC
GTGTCTGACG ACGTGCCGGG CTTCGTGCCG CCGCGGCCGG GGGTCGTGTC TGACGACGTG
CCGGGCTTCG TGCCGCCGCG GCCGGGGGTC GTGTCTGACG ACGTGCCGGA CATGGCTGCC
GCGAGTCGGG TGGTGCTGGA CCGGGCCGCG GTGTCGGCCG CGGTGGTGGG GGTGATCGGG
GAACAGACGG GTTATCCGGT GGAGATGCTG GAGCCCGATC TGGATCTTGA GGCCGACCTG
TCGATCGACT CGATCAAGCG GACGGAGATT CTGGGGGAGC TGGCGCAGCG GCTGGGTCTG
GTGGATGCCG GGTCGGGGGA GCTGGGCGAG GAGGCCGTCG AGGAGCTGGC GGCGATCAAG
ACTGTCCGCG GCATCGTCGA CTGGCTGACG GACCCGCCGG GTGCGGTGTC CGGGGTGCCC
GGTGCGGGGG AGGTGGATCT GGTGGCTGCG GTGGATCGGG TGGCTGCCGC GAGTCGGGTG
GTGCTGGACC GGGCCGCGGT GTCGGCCGCG GTGGTGGGGG TGATCGGGGA ACAGACGGGT
TATCCGGTGG AGATGCTGGA GCCCGATCTG GATCTTGAGG CCGACCTGTC GATCGACTCG
ATCAAGCGGA CGGAGATTCT GGGGGAGCTG GCGCAGCGGC TGGGTCTGGT GGATGCCGGG
TCGGGGGAGC TGGGCGAGGA GGTCGTCGAG GAGCTGGCGG CGATCAAGAC TGTCCGCGGC
ATCGTCGACT GGATCGTCGC GTCGGTGGAT CCGGATGCCG CCGGCGCGAC CGAACCGGCC
GAGCCCACCG GGCCCGGTCG GTCCGCCGGA GTCGCGAAGT CCGGGGGGGC CGGCTCGACA
GCCGCGATCC CGCTACGGCG GTTCGTTGTC GAACCGGAGA TCGTCCCCGT GCCCTCGACC
CCGGAGCCGG TCGGACGTGC CGGGTCGCTG GCCGGGGCCC GCTTCCTTCT CGTCGAGGGT
GGTCTCGGGG TCGGACTCGA GCTCGCCACG TTGCTGGAGC AGGCGGGTGC CGAGGCCCGC
ATCCTCGCTG CCGACGACCG GGGCCTGGTC GGTCAGGTTC GAGCCGGCGC CGGCGCGGAC
GGCCTGGTGT GGATCGCCTC GATCGATCCC GGTGCCGGCG ACCACGCCGT TCTTCCTGCC
GCGTTCGCGG CACTGCGGGC CGCGGCCCTC GGCGGCAGCC GGCGGATGCT CGTCGCGACC
GGGCACGGCG GGCGCTTCGG TCGCCCGTCG CCCCACGGCA CCGGAGCGCT CGCCCCGGGC
GACGCCGGGG ACGAGGGGAT CCAGCTGACC CCGGGGATGG GCCTCGCCGG CCTGGTGCGC
ACGATCGCCT GGGAGGTTCC CGACATCGCG GTCCGCCTCG TCGACGTCGA ACCGAAGGAC
GAACCGCGTC GGATCGCCGA GAGCCTGCTC GTCGAGCTGC TCACCCCCGG CGGGCCGTCC
GTGGTGGGTT ACCGTGACGG GGTCCGCGCG ACCCCGCGGA TCCGGCCGGT GGAGCTGGGC
GTCGCGGGCG ACGTCGGCGA CGTGGGTGCG GCGGGCTATG CGGGCGAACT GCCCGCCGAG
CTCGGTCCCT CCTCGGTGGT GCTGCTCACC GGTGGCGCGC GGGGTATCAC GGCGGCCGTC
GCCGTGGAGC TCGCCCGGCG CAGCGGCTGC GCCATCGAGC TGATCGGGCG TACCCCACCG
CCCACCGGAC CGGAGGATCC GGTGACGGCG GATGCGCTCG ACGCCCCGGC GCTGCGCCGC
GCGCTTGTCG CCACCGGCGT CCGCCGGCCG GCCGAGATCG AGTCCCAGGT GGCCCGGTTG
CTCGCCGAGC GGGAGGTGCG CACCACGCTG GAGGTCCTTG GTCGGCTGGC CTCGTCGGTG
CGTTACCACG CGATGGACGT GCGTGACGGC GCCGCCGTTG CCTCGGTCGT GGCCGACGTC
TACGCCCGCC ACGGTCGCCT CGACGGCGTG GTGCACGGTG CGGGCATCCT GGCGGACCGA
CTCCTGCGGG ACAAGACGCC CGAGTCGTTC GACCAGGTGT ACCGAACGAA GATCGACGGT
GCCCGGGCGC TGCTGGCGGC GCTGCGGGAC GACGTCGGCT TCGTCGCCCT GTTCGGGAGT
GTCGCGGGCG TCTTCGGCAA CCGCGGCCAG GTCGACTACG CCGCCGCGAA CGACGCCCTG
GACACGCTCG CCGGTGCCTG TGCGGGCCGC TTCGCCGGCC GGGTGGTCAG CGTCGACTGG
GGTCCCTGGG GATCCGCGCG GGACTTCGTG GCACGGGCCG GCGGACCGGC CCCCGCGGTG
ACCGGGGGCG CGGTGGCCGG TGCGGCCGGT GTGACCGGCG GTGGGATGGT CTCCCCGGAG
CTTGCCCGGG AGTACGCGCG CCGTGGCATC GGCCTGATCG ATCCCGCCGA CGGGGTGGCC
GCCCTGCTGC GGGAGCTGGC CGCTCCGGCG GGCACGCCCG CGCAGGTCGT CTACATGTGC
GCCTCGGTGG AGTCGTTCGA TGCCTGA
 
Protein sequence
MTTAPSASPV ASPANPADRD PLDARELVVV ITPFAEPSAS LVAAVQRAGG LGVLDLGLLA 
GPARDALRDA ARWAPGRFGV RAAPGCPLDP DELPASVGTV VLVARSGPDP RSGPVAWADA
GPGPGSRWDI GALTSRWRTS GPRGEARRRV WAEVTSVAQA RDAVAAGVEG LIARGNESGG
PVGDLTTFTL LQHLLAQAFV TTEGAPLPVF AAGGIGPATA AAAVAGGAAG VVLDSQLALV
REMELPAQVA AAIRAMDGSE TTLLAGYRVY TRPDLPVAAM TGAAMAGAAT AIPFRLGARD
LRNQFLPVGQ DGAFAADLAD RHVTAGGVVR AVRQGIREGI AAAARTRPLA PGAPFAAARG
LRHPVAQGPM TRVSDRAAFA HAVADEGGLP FLALALMSGE DSRALLEETA DLLGDAPWGV
GVLGFAPAPL RAAQLAAVHD VRPACAIVAG GRPAQAAPLE AAGIDTFLHV PSPGLLNRFL
ADGARKFVFE GRECGGHVGP RASFPLWEAQ IAGLLEFGTT DQAGPDFFAG LHLLFAGGVH
DARSAAMVAA AAGPLAERGA SVGVLMGTAY LFTAEAVASG AILPGFQQAA LDCERTVLLE
TSPGHATRCV ESAYVHTFLG RRRELIAAGN SRQQMWEELE ALNLGRLRVA SKGLRREGSE
VVTVDPATQA RDGMYMIGQV AALRSRRTSI AELHDEVTAQ ATAGLAARSA ELRSAEPRPA
DRSAAPGRPA ARPLDIAIVG MSAIFPDAPD LASFWANIVA GNDAIREVPA DRWDAEVYYH
PDAVLRDAGR RTPSRWGGFL PQVPFDALAY GIPPKSLRSI ETSQLLALEV AARALRGAGY
DAGQKDVGQK TAGESRAFDR SRTSVVFGTE AGTDLSGAYS FRSLWPALLG DLPAELEEFL
PELDEDSFPG MLANVIAGRV ANRLDLGGVN FTVDAACASS LAALDAACKE LVAGTSDMVL
CGGVDTHAGA HDFLLFSSVH ALSPGGRCRS FDAGADGTSL GEGVAVVVLK RLADAERDGD
RIHAVIRAVA GSSDGRALGL TAPRKAGQVL SLERAYGRAG ISPSEIGLLE AHATGTVVGD
RTELATLTEV FTNHGATPGQ CAVGSVKSQI GHTKCAAGLA GLIKMAKAVE TGVRPPTLHI
DTPNAYWDAE SSPFYFDDVA KPWVAPAERR HAGVSAFGFG GTNFHVVLSA YDGGPEPAHG
LDSWPAELFV VRGTDRAAAT RALDRLAELI VANDAAGRPF RLRDLASTVC AERGGPVQLA
FVADDLDALP TAIDQARTFT SNPRAGLFVR DQDADPGATA FLFPGQGSQR PGMLADLFIA
FPRLRSVLDL APRWADTMFP PAAFSREQKA AQAAAITDTR AAQPTLGLAG LAVHDLLTSL
GVRPDHVAGH SYGELVALCA AGALDRRDLI GLSEARAAAI LAAAGDDPGT MAAVSASVEQ
VRAALDGAVT GGEAGPGRVV VANHNAPRQS VISGPTDAVA AAVAALGRAG ISAKPIPVAC
AFHSPVVTEA ATALAARLDE VEVEAPTLPV WSNTTAGPYP GSPDEVRATL AGQVAAPVRF
VEQIEAMYAA GVRTFVEAGP GRVLAQLVGR ILAGRPHRVV ATDVAGEPGL RRLLLALAEL
AAVGVELDPS ALFTGRDARV VSATDLPRRP GWLIDGAYVR TPDGRFLPGG LRPPTRLTVR
TTSPGVASVP GAASVPEIVG VRGFVEDTVA PAPPFDRAGV TDHSETTDDP AAGPAGLAEL
GGDGPVDAVV LEFLRTTREL VAAQRDVVLG YLGIDPTGLA RPGQLPEGRP ADLVRPRPGV
VSDDVPGFVP PRPGVVSDDV PGFVPPRPGV VSDDVPDMAA ASRVVLDRAA VSAAVVGVIG
EQTGYPVEML EPDLDLEADL SIDSIKRTEI LGELAQRLGL VDAGSGELGE EAVEELAAIK
TVRGIVDWLT DPPGAVSGVP GAGEVDLVAA VDRVAAASRV VLDRAAVSAA VVGVIGEQTG
YPVEMLEPDL DLEADLSIDS IKRTEILGEL AQRLGLVDAG SGELGEEVVE ELAAIKTVRG
IVDWIVASVD PDAAGATEPA EPTGPGRSAG VAKSGGAGST AAIPLRRFVV EPEIVPVPST
PEPVGRAGSL AGARFLLVEG GLGVGLELAT LLEQAGAEAR ILAADDRGLV GQVRAGAGAD
GLVWIASIDP GAGDHAVLPA AFAALRAAAL GGSRRMLVAT GHGGRFGRPS PHGTGALAPG
DAGDEGIQLT PGMGLAGLVR TIAWEVPDIA VRLVDVEPKD EPRRIAESLL VELLTPGGPS
VVGYRDGVRA TPRIRPVELG VAGDVGDVGA AGYAGELPAE LGPSSVVLLT GGARGITAAV
AVELARRSGC AIELIGRTPP PTGPEDPVTA DALDAPALRR ALVATGVRRP AEIESQVARL
LAEREVRTTL EVLGRLASSV RYHAMDVRDG AAVASVVADV YARHGRLDGV VHGAGILADR
LLRDKTPESF DQVYRTKIDG ARALLAALRD DVGFVALFGS VAGVFGNRGQ VDYAAANDAL
DTLAGACAGR FAGRVVSVDW GPWGSARDFV ARAGGPAPAV TGGAVAGAAG VTGGGMVSPE
LAREYARRGI GLIDPADGVA ALLRELAAPA GTPAQVVYMC ASVESFDA
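
As a spot check that the protein sequence matches the gene sequence, the first 60 bases of the gene can be translated with a small sketch. The helper names `translate` and `gc_fraction` are hypothetical. The codon table is built from the standard NCBI codon ordering, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGACCACTG CACCCAGCGC GTCACCCGTG GCCTCGCCGG CCAATCCCGC CGACCGCGAT"
print(translate(first_line))  # MTTAPSASPVASPANPADRD
```

The output corresponds to the first two blocks of the protein sequence (MTTAPSASPV ASPANPADRD). Running `gc_fraction` on the complete gene sequence gives a value that can be compared with the GC content listed in the record.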