Gene Franean1_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3887 
Symbol 
ID5672248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4622848 
End bp4632324 
Gene Length9477 bp 
Protein Length3158 aa 
Translation table11 
GC content76% 
IMG OID641242766 
Producterythronolide synthase 
Protein accessionYP_001508183 
Protein GI158315675 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.108023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGGGAC ACGAACACCC CACCTGGCGG GAGGCCCTCA CCGGGCTGCC CGAGGCCGAA 
CAGCACCGGC TTCTGCTCGA CCTGGTACTC CGGCAGGTGC GCCGGGCCGT GGGCGACGGG
CGCGGCCCGT TCGACCCGCG GACGCCGTTC GTCCGGCTGG GGCTCGGCGG AGGAAGCACG
GCCGGTTTCC GGGACGCCCT GGGCGGCGAG CTCGGCCTGG CACTGCCCGC CACGCTTCTG
TTCGACCATC CGTCCCCGGC GCGTCTCGCC GACCATCTCA GGACACGGGT TCTCGACGCA
CCGGACCACC CGATTCCGCC TCCGATCGCG CCGGACCCCC CGGACCTGCC GGACCTGCCG
GCTTCGCGGG CGCCGTCGGC CTCGGCCGCG CCGCCGGACG ACCCGATCGT GATCGTGGCC
ATGGCCTGCC GGTTCCCCGG AGGGGTGCGT AGCCCGGAGG ATCTCTGGCG GCTGGTCGCC
GAGGGCCGCG ACGCCGTCGT CGACATCCCC GCGGACCGCG GCTGGGACCT GGCCACCCAC
TTCGATCCCG ATCCGCGCCG CTCCGGCACC TTCTACACGA CCCAGGGCGG CTTCCTGGAC
GACGTCGCGG GCTTCGACGC GGCCTTCTTC GGGATCTCGC CGCGGGAGGC GCTGGCGATG
GACCCGCAGC AACGGCTGCT GCTCGAGGTC TCCTGGGAGG CGTTCGAGCG GGCCGGCATC
GACCCGCTCT CCCTGCGCGG CAGCCGGACC GGCGTCTACA CCGGCCTCCC CGTCGCGGAC
TACAGCCCGC CCTGGCGGAC GGCCCCGCCC GTTGTCGAGG GCCACCTGAT GAGCGGGACC
CTGCCCAGTG TCGCCTCGGG CCGGGTCGCC TACACCCTGG GGCTCGAGGG CCCCGCGATG
ACGATCGACA CGGCGTGCTC GTCGTCGCTG ACGGCCATGC ACCTGGCGGC GCAGGCCCTG
CGCGCCGGCG AGTGCACCCT GGCGCTGGCC GGCGGGGCCA CGGTGATGGC CACCCCGGAC
GTCCTGGTGG AGTTCAGCCG CCAACAGGGC CTGGCGCCGG ACGGCCGCTG CAAGTCGTTC
GCGGACGCCG CCGACGGCAC CGGCTGGGCC GAGGGCGCGG CCGTGGTGCT GCTGGAACGG
CTCTCCGCCG CGCGCCGCCA CGGGCGCGCG GCGCTGGCGG TGCTGCGCGG CAGCGCGGTC
AACTCCGACG GCGCGTCCAA CGGGCTGACC GCGCCCAACG GGCTGGCGCA GCGGCGGGTC
ATCCGGGCGG CGCTGGCGAG CGCGGGCGTC TCGCCGTCCG ACGTGGATGT CGTGGAGGCG
CACGGCACCG GGACCAGGCT GGGCGACCCG ATCGAGGCCC AGGCGGTCAT CGCCGCCTAC
GGGCAGGACC GGCTGGCCGA CCGGCCGCTG TGGCTGGGCT CGTTGAAGTC GAACGTCGGG
CACACCGTCG GGGCGGCGGG GGTCGGCGGC GTGATCAAGA TGGTCCTGGC GCTGCGGCAC
GGCGTGCTCC CGCGCACCCT GCACGTGGAC GAGCCTTCGG CCCGGGTGGA CTGGTCGGCC
GGGCACGTGC GCCTGCTGAC CGAGCCGGTG GTCTGGCCGC GCGGGAACGG CGTCCGCCGG
GCCGGTGTCT CCGCCTTCGG GGTCAGCGGC ACGAACGCCC ACGTCATCAT CGAGGACGCC
CCCCGCGCCG AGCCGGTTCC ACCCTGGCTC GACCAGGCCG GGCTCGACGA GGCCGGACCG
AATACCGGCG GCCGTGACCA GGCTGCGGGT GACCCGCCCG GGGGCGGGCC CCCGGCGGGC
GAGGAGCTTC CATGGCTGCT GTCGGCCGGG TCGGAGCCGG CGCTGCGCGA GCAGGCCGTC
CGCCTGGCGG CCCGGGTCGA GAGCGAGCCT CGGCCGGGCC TGGCCGATGT CGGCTGGTCG
CTGGCGGCGG GCCGCGCGGC GCTGGCGCAC CGCGCGGCCG TCGTTGCCGG CGACCGGGAG
GGCATGCTGC GCGGCCTCGG CGCGCTGGCG CGGGGTGAAC CGTCGGCGGC ACTGATCCGC
GGCTCCGCAC GCCCGGACCG CCAGACCGAC CGGCTTGCGT TCCTCTTCAG CGGCCAGGGC
TCCCAGCGGC CGGGGATGGG CCGGTCGCTG CACGGCCGGT TCCCGGTGTT CACCGAGGCG
TTCGACGCGG TCACCGCCCG CCTCGACCGG CATCTTCCCC GGCCGCTGCG GGACGTCGTG
TTCGCCGGCC CGCTCCCGGA CGCGGCGCTG GAGCGCACAG AGTTCACCCA GCCGGCGCTG
TTCGCGCTCG AGGTGGCGCT GTTCCGCCTG CTGGAGTCCT GGGGCGTGCG GCCGGACGCG
CTGCTCGGGC ACTCCGTCGG CGAGCTGGCG GCCGCGCATG TGGCCGGCGT GTTCGACCTG
GACGACGCCT GCGCGCTGGT CGCCGCGCGC GGCCGGCTGA TGCAGGCGAT GCCGACCGGC
GGCGCGATGG TCTCGCTGCG GGCGTCGGAG GAGCAGGCCC GTGAGCTGAT CGCCGAGCTG
GGTGCGCGCG CGGCGGCCGC ATCCGGTTCC GCGTCGGGAG CGGTGGACGT CGCGGCGGTC
AACGGACCGG AGTCGGTGGT CGTCTCCGGA GACCTGCGCC CGGTCGAGGA GATCGCGCAG
CGGTGGCGCG CCCGCGGCGG AACGGCGACC CGCCTGCGGG TCTCGCACGC ATTCCACTCC
GCGCACATGG ACGGCATGCT AGAGGAGTTC CGCGGGGTCG CCCGCGGCGT GACCTTCCGG
GCGCCGGCCG TCCCGGTCGT CTCCAACGTC ACCGGCGCGA TCGTCGCTCC CGACGAGCTG
TGCTCGGCAG AGTACTGGGT GCGCCAGGTC CGGGCCGCGG TGCGGTTCCG CGACGGCGTC
GACCGGCTGC ACGAGCGGGG CGTGCGGACC TTCCTGGAGA TCGGCCCGGA CGCGGTGCTC
TGCGGCATGG GGCAGGACTG CCTCCCGGCC GGCTCCGACG CCGTGTTCGT CCCGAGCCTG
CGCCGTGACC AGGCGGAGCA GCCCGCGCTG GCCCGGGCCG TGGGCAGGCT GTGGATGGCT
GGCGCCGATG TCGACTGGGC GGCGTTCTTC GCCGGCGCGC GCCGGGTCGA TCTGCCGACC
TACCCGTTCC AGCACCGCCG CTTCTGGCTT GGCGACCACC TCGCCACTGC GAACGCGCCC
ACCCCGGGCA CCGCGGGCGA GCCTGCCACC GCGGACGGCG CGACGCCGGG CCGGCCGCCC
GCCGCCCGCC GGTACCGGGT CACCTGGACG CCGGTGCCCA CGGCGAAGGC CGCGAAGGCC
GCGGGGCCCG GCCGGTGGCT GGTGGTGATC CCCGCCGATC TGACCGCGGA GGACAGTGTC
GGCGACTGCG TCGAGAGCTG CGTCCGGGGT CTTGAGGGCA GGGACGCCGA GGTCGTCCGG
TTCGAGGTGG ACCCCGCCGC AGCCGACCGG AGCTCGCTGG CCGCCCGTCT CGCCTCGCTC
GACCCGCCGC CGGCCGCCCG GGACGTCTCA GGACGCAACG CTCCGGGGCA TGACGTTCCG
GGGAGTGACG TTCCGGGGTG TTCGGTGGAT GTCCCGCTGG CGGGGGTGCT GTCCCTGCTC
GCGCTGGATG TCCGGGAGCT GCCCGGCCGT CAGCCGGTCA CCCGGGGCCT GGCGGGCACG
CTGGTCCTGG TGCAGGCCCT CGGCGACGCC GGGGTCGAGG CGCCGCTGTG GTGCGTGACG
CAGGGGGCGG TGTCGATCGG CGGAGCCGAC GCGGTCACGA GCCCACCGCA GGCGCAGGTG
TGGGGCGCGG GGCGGGTGGC CGCGCTGGAG CTGCCACGCC GGTGGGGCGG CCTGGTCGAC
CTTCCCCCGG CCGTCGACGG GACGGCGGCC GACCGGCTCG CTGCCGTGCT CGAGGGCGGG
CCGGGCGAGG ACCAGGTGGC GATCCGCCCG TCGGGCGTGT TCGGCCGCCG GCTCGTACCG
GCCCGAACGA CCCGCACCGA CAGCGACAGT GACAGCGCCG GTGGCGGGTG GCGTCCGCGC
GGAACGGTGC TGATCACCGG TGGGACGGGG GCGCTCGGCG CGGCCGTCGC ACGGTGGTGC
GCGGATGCCG GGGCGGCCCG GCTGGTTCTG ACCAGCCGTC GGGGCGCGGC GGCCCCAGGC
GTGCCCGAAC TCGTCGCCGA CCTGGCCCGG CGCGGCGCGG CGACGACCGT GGCCGCGTGC
GACGCGGCGG ACGGCGACGC GCTTGCGGAG GTGCTGGCGG CCATACCGGC GGACTGCCCG
CTGACCGCCG TAGTCCACGC CGCGGGCGTC GCCGGCGGGT TCACACCGCT GCAGGACGTG
GACGTCGCCG AGCTCGCCGA GGTGCTCGCG GGCAAGGCCG CGGGCGCCCG GGTCCTCGAC
GAACTGGTCG GCGACCGGCC GCTGGACGCC TTCGTGCTGT TCTCCTCCAT CGCCGGCACC
TGGGGCAGCG GCGGCCAGTG CGGGTACAGC GCGGGCAACG CCTACCTCGA CGCGATGGCC
GAGTACCGCG GCGCGCGCGG GCTGGCCGGC ACGGCGCTCG CGTGGGGGCC GTGGGCGGAG
GGCGGGATGG CCGTCGATCC CGCGATCGCG GGCCACCTCC GGGACCGCGG GCTGGTTCCC
ATGCCGCCGG ACGAGGCGGT CGCCGTGCTG GCGGATGCAG TGGGTGACGA GCGGTGCCTG
ACCGTCACCG ACGTCGACTG GGCCCGGTTC GCCCCGGCCT TCACCGCCGC CCGCCCCAGT
CCGCTGCTCG GCGACCTGCC GGAGGTCCGC GCCGTCCTGA ACCCTCCCGA CCCGTCCACG
GGACCCGCCG GTGGACCGGT CGCCGGCGGG GAGGCCGGTG ACCGCGGCGG CGACGGGCCG
GCGGCGCTGC TCGGGCGCCG AGCCACGATG CGCACGGCCG AGTGGCGACG GTTCCTGCTC
GACCTGGTGC GGGCCGAGAC CGCCGCCGTC CTCGGGTACG ACGCCGCCGA CCAGATTCCC
GCCGACCGGG CCTTCGTCGA TCTCGGGTCC ACCTCGCTGA CCGCCGTCCA ACTGCGGGCG
CGGCTGGCTG AGCGCACCGG GCTGCGGCTG CCCACCACCG TGGTCTTCGA CCATCCGACG
TCGACCGCCC TCGCCGAGTA CCTGGCCGCC GAGCTCCCCG GCGGCCCCGA CGGCGACCCG
GCACCGGCCG GGGCGGCGGA ACCGGCCTTC GCCGGGAATC ACGTGACGCC GGCACCCGGC
CCGCACGGCG CGGCCGGGGA CGACCCGATC GTGATCGTCG GCATGGGGTG TCGCCTGCCC
GGCGGGGTGG CCTCGCCGGC AGAGCTGTGG CGGCTGCTGG CGGAGGGCAC CGACGCGGTG
TCGGAGTTTC CGGTGGATCG GGGTTGGGAT GTGGCGGGTC TTTATGATCC GGTTCCTGGT
CGGGTGGGTC GGAGTTATGT GCGGTCGGGT GGTTTTCTGG CTGGTGCGGC TGATTTTGAT
GCGGGTTTTT TTGGGATTTC GCCGCGTGAG GCGTTGGCGA TGGATCCGCA GCAGCGGTTG
TTGTTGGAGG TGTCGTGGGA GGCGTTGGAG CGGGCGGGGG TGGATCCGTC GTCGCTGCGT
GGCAGCGACA CCGGCGTCTT CGTCGGCGCC AACATCCACG ACTACGCGAC GCTGCTCGCG
GCAGGCGGGG AGAACACCGA GGGGTATTTC GCGACCGGCA CCGCGAACAG CGTGATGTCC
GGTCGGGTGG CGTATGTGTT GGGGTTGGAG GGGGCGGCGG TGACGGTGGA TACGGCGTGT
TCGTCGTCGT TGGTGGCGTT GCATCTGGCG GTGCGGGCGT TGCGTGCGGG TGAGTGTGGG
ATGGCGTTGG TGGGTGGGGT GACGGTGATG TCGACGCCGG TGGGTTTTGT GGAGTTTTCG
CGGCAGCGGG GGTTGGCTGC TGATGGGCGG GTGAAGGCGT TTGCGGAGGG TGCGGATGGG
ACGGGGTGGG GTGAGGGGGT GGGTGTGTTG GTGGTGGAGC GGTTGTCGGT GGCGCGGGCT
CGGGGGCATG GGGTGTTGGC GGTGGTGGCG GGTTCGGCGG TGAATCAGGA TGGTGCGTCG
AATGGTTTGA CGGCGCCGAG TGGTCGGGCG CAGGAGCGGG TGATCCGGGC CGCGCTCGCG
GACGCGGGTG TGGGCCCGTC CGAGGTCGAC GTGGTGGAGG GGCACGGGAC GGGCACCCGG
CTGGGCGATC CGATCGAGCT CGGCGCGTTG CTGGCGACCT ACGGGCAGGG GCGGGAGCCG
GGCCGCCCGC TGTGGCTGGG GTCGGTGAAG TCGAACATCG GGCACGCGCA GGCGGCCGCC
GGGGTGGCCG GGATCATCAA GATCGTGGAG GCGCTGCGCC ATCAGGCGGT GCCGGCGACC
CTGCACGTGG ACGCGCCCTC GTCGCGGGTC GACTGGTCGA CTGGGTCGGT CGAGGTGGTC
ACCGAGCCGG TCGCCTGGCC GCGGCAGGCC GCCCGGGTCC GCCGCGCGGG GGTCTCCTCG
TTCGGGATGT CGGGCACGAA CGCGCACGTG ATCATTATGG AGGCCCCCGA ACCCGACCTG
GAGGCCACGC CCGGCGCCGG CGCCAGCGCC GTCGCCGTGC CGTGGCTGCT GTCGGCGCGT
TCGCCGGCGG CGCTGCGCGG GCAGGCCCGG CGGCTGCTGG ACCACCTGGC GGGCTCCGCC
GATCGGCTGC CGGTGCCTGA TGTGGGTTGG TCGCTGGTGC GGGGCCGGGC AGCACTGGAA
CACCGGGCGG TGGTGCTGGG CCACGGCGCG GGAACGGGCA GCGACGGGGA CCGACCGGCG
GCGTTGCGTG CGTTGGCGGA GGGGGAGTCT GCCGAGTCTG TGGTGCGGGG GCGGGTTGGC
ACAAGTGGTG ATCGGGTGGT GTGGGTGTTT CCGGGGCAGG GGGCGCAGTG GGTGGGGATG
GGCGCGGAGC TGTTGGACGC CTTGCCGGTG TTTGCGGGTC GGGTCGCGGA GTGCGCGGCG
GCGTTGGCGC CGTTTGTGGA TTGGTCGTTG GTGGATGTCC TGCGTGGTGT GCCGGATGGG
TCGGGGATGT CGGGTGCGGG GGTGGTGTCG CTGGAGCGGG TGGATGTGGT GCAGCCGGTG
TTGTGGGCGG TGGCTTTGGG GTTGGCGGCG GTGTGGGAGT CGTGGGGGGT GCGGCCGGAT
GTGGTGGTGG GGCATTCGCA GGGGGAGGCG GCGGCGGCGT GTGTGGCGGG GCTGCTGTCG
TTGGAGGACG GGGCGCGGGT GGTGGCGTCG CGTAGTCGGG TGATCGCGGG TGGGTTGGCT
GGTGGTGGGG GGATGGTGTC GGTGGGGTTG CCGGTGGTCG AGGTGGAGGT TCGGTTGCGG
GAGTGGGCTG GTCGGGCGTG GGTGGGGGCT GGTCGGGTGG AGGTGGCGGC GGTGAACGGG
CCGTCGGCGA CGGTGGTCGC CGGTGAGCCT GCGGCGTTGG ACGATCTGCT TTCCGTCTGG
GAGGCGGAGG GTGTTCGGGT TCGTCGGTTG CCGGTGGACT ATGCGTCGCA TACGGCGCAG
GTGGACCGGG TCCGCGACCG ACTCACCACG GAACTTGCCG GGATCGTCCC GCGCCCGGGT
GACGTGGCGA TGTGGTCGGC GGTGACGGGA CGTCCGGTGG GGCCGGGGGC GCTGGACGGG
GAGTACTGGT TCCGGAATCT GCGGTCGCGG GTGCGCTTCG ACGAGGCGGT GCGCGGGCTG
CTGGCCGAGG GGCACCGGGT GTTCGTGGAG ATCAGCCCGC ATCCGGTGCT GACTGCGGCG
ATCACCGAGA CGGCCGAGGT GGCCGGCGTC CCGGACGCTG TGGTCGTGGG GTCGCTGCGG
CGCGGCGACG GCGGGCCGGA CCGGCTGGCG GCGGCCGCGG CGCAGCTGTG GGTGCGCGGT
GTTCCGGTGG ACTGGGCCCC GCTGGTGGCC GGGGGCCGTC CGGCCGAGCT GCCGACCTAT
GCCTTCCAGC ACCAGCGCTA CTGGCCTGAC GCGCCCGCGG TGCGTGGCGG CGCGCCCGAT
CCGGGCGTGC GGCCCGCCGA CCATCCGATG CTGGGCGGCG TGGTACCGCT GGTCGACACC
GGTGGGCTGC TCCTCACCGG TCTGCTCTCA CGGGGCTCAC AGCCCTGGCT CGCCGACCAC
GCGGTCGGTG GTGCCGCACT GCTGCCCGGT GCCGTGTTCC TCGAGCTGGC CGTCCAGGCC
GCGGACCAGA CCGGCTGCGA CGAGGTCGCC GAGCTCACCC TTGAGGCGGC CCTGGTCGTT
CCCGAGCACG GCGGTGTCCA CGTGCAGGTC GCGGTGGGTC CGCCCGACGG CGCGGGCCGG
CGCCCGCTGG CCGTCTACGC GCGTCCGGAC GGTCGAAGTC GGCCGGGGGA CCCGGAGGGG
CCGCAGGAGC CGTGGGTCCG GCACGCCCGG GGCGCGCTCG CGGTGCGGGC CCCGGGCGGC
GTCGCCGCGG ACCCGGCGTT CGACCTGGTC GTGTGGCCGC CGAGCGGAGC GCGTCCCGTC
CCGGTCGACG ACCTGTACCC GCGCCTCGCC GCGGCGGGCG TGGACTACGG CCCGGCCTTC
CACGGGGTGC GTGCGGTGTG GCGCCGCGAT GCGGAGCTGT TCGCCGAGGT CGGCCTGCCC
GAACCGCTGC GGGGGACGGC CCGGCGGTTC GCCCTGCATC CGGTGCTTGT CGACGCCGCC
TTCCAGCCGC TCGTGCTCGA TCCGGAGCTG GGGCCGCGGC GGCTGCGCCC GTTCTCCTGG
AGCGCCGTCC GCGTGCTCGC GGACGGCGCG TCGACGCTGC GTGTGCGGCT GTCGCCGGCC
GGCGCCGACG AGGTGGCCGT GACTCTCGCC GACGGCACGG GCCAACCGGT CGCCGAGGGC
ACGCTGGCAC TCCGGCCGGC CTCCCCGGCA CCCGCGGTGG CACCCCCGGG CCCGCCGCCC
GCGGCAGCCG CGCCGGTGCC GGTCATTCCG GCCCGGCGGG CCGCCGCGGA CGCGGACCAG
CCCGATGCCT CCTCGTTCGC CCGGCGGCTG GCCGGCCTCG CCGGGCCCGA CCAGGAGCAG
GCCCTGCTGG AACTGGTCCG CGCTCAGGCG GCCGCGGTGC TCGGCCACGA GGGTGCGACG
GCCGTCGCGG ATGACCGCGC GTTCCGCGAT CTCGGCTTCG ACTCGCTCAG CGGGGTCGAG
CTGCGCGACC GGCTGACCGC CGTCACCGGG CTGCGCCTGC CGGCCGCGCT GGTCTTCAAC
CATCCGACGC CGCGGGCCCT GGCCGGCTAC CTGCGGGCCG GGCTCGAGCC GGGACGGGCG
GGCACCGCCG CCGCCCTGGC GGAGCTGGAC CGCCTCGAAG CCGCGCTCGG CACGGCCGCC
GAGGGCGACC GGTCCGTGCT CACCGAGCGG CTGCAGGCCC TGCTTTGGCG GTGGACCGAC
ACCCCGGCCG CGCCCGCCGG CGGCACGTAC GTCCCCGCCG GGGAGGACGA CGACTTCGAG
TCGATGACCG ACGACGAGAT GCTGGAGGTG ATCGACCGGG AGCTCGGCGC GCTGTGA
 
Protein sequence
MAGHEHPTWR EALTGLPEAE QHRLLLDLVL RQVRRAVGDG RGPFDPRTPF VRLGLGGGST 
AGFRDALGGE LGLALPATLL FDHPSPARLA DHLRTRVLDA PDHPIPPPIA PDPPDLPDLP
ASRAPSASAA PPDDPIVIVA MACRFPGGVR SPEDLWRLVA EGRDAVVDIP ADRGWDLATH
FDPDPRRSGT FYTTQGGFLD DVAGFDAAFF GISPREALAM DPQQRLLLEV SWEAFERAGI
DPLSLRGSRT GVYTGLPVAD YSPPWRTAPP VVEGHLMSGT LPSVASGRVA YTLGLEGPAM
TIDTACSSSL TAMHLAAQAL RAGECTLALA GGATVMATPD VLVEFSRQQG LAPDGRCKSF
ADAADGTGWA EGAAVVLLER LSAARRHGRA ALAVLRGSAV NSDGASNGLT APNGLAQRRV
IRAALASAGV SPSDVDVVEA HGTGTRLGDP IEAQAVIAAY GQDRLADRPL WLGSLKSNVG
HTVGAAGVGG VIKMVLALRH GVLPRTLHVD EPSARVDWSA GHVRLLTEPV VWPRGNGVRR
AGVSAFGVSG TNAHVIIEDA PRAEPVPPWL DQAGLDEAGP NTGGRDQAAG DPPGGGPPAG
EELPWLLSAG SEPALREQAV RLAARVESEP RPGLADVGWS LAAGRAALAH RAAVVAGDRE
GMLRGLGALA RGEPSAALIR GSARPDRQTD RLAFLFSGQG SQRPGMGRSL HGRFPVFTEA
FDAVTARLDR HLPRPLRDVV FAGPLPDAAL ERTEFTQPAL FALEVALFRL LESWGVRPDA
LLGHSVGELA AAHVAGVFDL DDACALVAAR GRLMQAMPTG GAMVSLRASE EQARELIAEL
GARAAAASGS ASGAVDVAAV NGPESVVVSG DLRPVEEIAQ RWRARGGTAT RLRVSHAFHS
AHMDGMLEEF RGVARGVTFR APAVPVVSNV TGAIVAPDEL CSAEYWVRQV RAAVRFRDGV
DRLHERGVRT FLEIGPDAVL CGMGQDCLPA GSDAVFVPSL RRDQAEQPAL ARAVGRLWMA
GADVDWAAFF AGARRVDLPT YPFQHRRFWL GDHLATANAP TPGTAGEPAT ADGATPGRPP
AARRYRVTWT PVPTAKAAKA AGPGRWLVVI PADLTAEDSV GDCVESCVRG LEGRDAEVVR
FEVDPAAADR SSLAARLASL DPPPAARDVS GRNAPGHDVP GSDVPGCSVD VPLAGVLSLL
ALDVRELPGR QPVTRGLAGT LVLVQALGDA GVEAPLWCVT QGAVSIGGAD AVTSPPQAQV
WGAGRVAALE LPRRWGGLVD LPPAVDGTAA DRLAAVLEGG PGEDQVAIRP SGVFGRRLVP
ARTTRTDSDS DSAGGGWRPR GTVLITGGTG ALGAAVARWC ADAGAARLVL TSRRGAAAPG
VPELVADLAR RGAATTVAAC DAADGDALAE VLAAIPADCP LTAVVHAAGV AGGFTPLQDV
DVAELAEVLA GKAAGARVLD ELVGDRPLDA FVLFSSIAGT WGSGGQCGYS AGNAYLDAMA
EYRGARGLAG TALAWGPWAE GGMAVDPAIA GHLRDRGLVP MPPDEAVAVL ADAVGDERCL
TVTDVDWARF APAFTAARPS PLLGDLPEVR AVLNPPDPST GPAGGPVAGG EAGDRGGDGP
AALLGRRATM RTAEWRRFLL DLVRAETAAV LGYDAADQIP ADRAFVDLGS TSLTAVQLRA
RLAERTGLRL PTTVVFDHPT STALAEYLAA ELPGGPDGDP APAGAAEPAF AGNHVTPAPG
PHGAAGDDPI VIVGMGCRLP GGVASPAELW RLLAEGTDAV SEFPVDRGWD VAGLYDPVPG
RVGRSYVRSG GFLAGAADFD AGFFGISPRE ALAMDPQQRL LLEVSWEALE RAGVDPSSLR
GSDTGVFVGA NIHDYATLLA AGGENTEGYF ATGTANSVMS GRVAYVLGLE GAAVTVDTAC
SSSLVALHLA VRALRAGECG MALVGGVTVM STPVGFVEFS RQRGLAADGR VKAFAEGADG
TGWGEGVGVL VVERLSVARA RGHGVLAVVA GSAVNQDGAS NGLTAPSGRA QERVIRAALA
DAGVGPSEVD VVEGHGTGTR LGDPIELGAL LATYGQGREP GRPLWLGSVK SNIGHAQAAA
GVAGIIKIVE ALRHQAVPAT LHVDAPSSRV DWSTGSVEVV TEPVAWPRQA ARVRRAGVSS
FGMSGTNAHV IIMEAPEPDL EATPGAGASA VAVPWLLSAR SPAALRGQAR RLLDHLAGSA
DRLPVPDVGW SLVRGRAALE HRAVVLGHGA GTGSDGDRPA ALRALAEGES AESVVRGRVG
TSGDRVVWVF PGQGAQWVGM GAELLDALPV FAGRVAECAA ALAPFVDWSL VDVLRGVPDG
SGMSGAGVVS LERVDVVQPV LWAVALGLAA VWESWGVRPD VVVGHSQGEA AAACVAGLLS
LEDGARVVAS RSRVIAGGLA GGGGMVSVGL PVVEVEVRLR EWAGRAWVGA GRVEVAAVNG
PSATVVAGEP AALDDLLSVW EAEGVRVRRL PVDYASHTAQ VDRVRDRLTT ELAGIVPRPG
DVAMWSAVTG RPVGPGALDG EYWFRNLRSR VRFDEAVRGL LAEGHRVFVE ISPHPVLTAA
ITETAEVAGV PDAVVVGSLR RGDGGPDRLA AAAAQLWVRG VPVDWAPLVA GGRPAELPTY
AFQHQRYWPD APAVRGGAPD PGVRPADHPM LGGVVPLVDT GGLLLTGLLS RGSQPWLADH
AVGGAALLPG AVFLELAVQA ADQTGCDEVA ELTLEAALVV PEHGGVHVQV AVGPPDGAGR
RPLAVYARPD GRSRPGDPEG PQEPWVRHAR GALAVRAPGG VAADPAFDLV VWPPSGARPV
PVDDLYPRLA AAGVDYGPAF HGVRAVWRRD AELFAEVGLP EPLRGTARRF ALHPVLVDAA
FQPLVLDPEL GPRRLRPFSW SAVRVLADGA STLRVRLSPA GADEVAVTLA DGTGQPVAEG
TLALRPASPA PAVAPPGPPP AAAAPVPVIP ARRAAADADQ PDASSFARRL AGLAGPDQEQ
ALLELVRAQA AAVLGHEGAT AVADDRAFRD LGFDSLSGVE LRDRLTAVTG LRLPAALVFN
HPTPRALAGY LRAGLEPGRA GTAAALAELD RLEAALGTAA EGDRSVLTER LQALLWRWTD
TPAAPAGGTY VPAGEDDDFE SMTDDEMLEV IDRELGAL