Gene Franean1_4838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4838 
Symbol 
ID5673179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5787437 
End bp5802229 
Gene Length14793 bp 
Protein Length4930 aa 
Translation table11 
GC content75% 
IMG OID641243694 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001509110 
Protein GI158316602 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.15888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATGC TGCGCACCGA ACTGATTGCA CCGCTGCCCA AGCTGCTGCG CGGGAACGCC 
GAGCGTCTCG GTGACAGGAT TGCATTCCGC GATGCCAGCC GTGCCGTCGG CTGGGCGGAG
CTGGAGCGGC GCACCGGCTG GCTGGCGGGC CACCTGGCCG ACCTGCGGCT GCAGCCGGGT
GACCGGGCCG CGATCGTGCT CGGCAACTGC GTCGAGGTCG TCGAGAGCTA TCTCGGCTTC
GCCCGCGCGT CCGTGGTGGG CGTGCCGATC AATCCTCGGG TCACCGAGAC CGAACTGGCC
TACCTCCTCG ACGACTGCGG GGCTCGGCTG GTCGTCACCG ACCCGGCGCG GATCGACATG
GTCGGTCGCG TCCTCCGGGA CCGGCCCGGC CTGCGAGTCG TGGTGACCGG CGGCCATGCC
CCGCCGCCGT CCGCCCCGGC CGGGACGCTG TCGTTTGCGG CGCTGGCAGG CGCCCAGCCC
CGCTCGGCGG CCCGCGACGA TCTGGGACTC GACGACGTCG CCTGGATGCT CTACACGTCG
GGGACGACGG GACGTCCAAA GGGCGTGTTG TCGACCCAGC GAAGCTGCCT GTGGTCGGTC
GCGGCCTGCT ACGCCCCGAT CCCCGGCCTG TCCGAGCAGG ATCGCGTCCT GTGGCCGCTG
CCGCTGTTCC ACAGCCTGTC GCACATCGCC TGCGTGCTCG GGGTGACGGC GGTCGGGGCC
AGCGCCCGGC TGCTGGACGG ATTCGCCGCG TCCGAGGTAC TCGCCGCGAT CCAGGAGGAC
GGCTCGACCT TCCTGGCCGG GGTCCCGACG ATGTACCACT ACCTCGTGCG GGCGGCGCGC
GAGAGCGGCT TCAGCGCGCC GAGCCTGCGG ATGTGCCTCG TCGGGGGCGC GATCACCACC
GCGCGGCTGC GCCGCGACTT CGAGGAGGCG TTCGGCGCCC CGCTGCTGGA CGCCTACGGC
AGCACCGAGA CCTGTGGGTC GATCACCATC AACTGGCCGA CCGGCGCGCG GGTCGAGGGC
TCCTGCGGAC TGCCCGTACC CGGCCTGGGG GTACGCCTTG TCGACCCCGA GACCGGGCTC
GACGTGGGAG CCGGCGCCGA GGGAGAGGTG TGGGTGCGCG GGCCGAACGT GATGGTCGGC
TACCACAACC AGCCCGAGGC CACCGCCGCC GCACTGCGCG ACGGCTGGTA CCGAACCGGT
GACCTGGCCC GCCGGGACGA CGCCGGCTAC TTCACCATCA CGGGCCGGAT CAAGGAGCTC
ATCATCCGGG GTGGCGAGAA CATCCACCCC GGCGAGGTGG AGGAGGTGCT GCGCGGTGTC
CCGGGCGTAG CCGACGTCGC GGTCGTCGCC CGCCCGCACG ACCTGCTCGG TGAGGTGCCG
GTGGCCTTTC TGGTGCCCGG CCCGGAGGGC CTCGACCCCG ACCGCCTCCT GGCCACGTGC
CGGGAGCGGC TGTCGTACTT CAAGGTGCCT GAGGAGCTCT ACGAGATCGA CCGTATTCCG
CGGACTGCCT CGGGGAAGAT CACCCGACAC GTGCTGCTGG AACGTCCGGC GCGGCTGCGT
GCCGCCAGCA GCGGGCACCA CGACACGCTG TTCCGGGTCG ACTGGATCCC GCGCCCGTCG
GTGACGTCCT CCTCGGTCCG CGCCCGCCCG GACCACACGT CGAGCCCCGT TCAGGCCGGT
GGGCCCGGCC GCCAGCCCCT TTCGGCTCCG CCGGTCGGGA CGTCCGTTCC GCCGGTCGGC
GGCGAGCGGA GGTGGGCGAT CATCGGCGCG GACGCGTTCG GATTCGCGCC GGTGCTCACC
GAAGCCGGGA TCCTGGTCTC CCAGTACCCG AACCTGGACG CCGTGCGACG GGCCGCCGCC
GACGGCGACG AGGTTCCCGA CCTCGCAGTG CTGACGTGCG GGTCGGTGCT CGGCAAGGCG
GGCGTTCTCT CCGACGACGC GGCGCGGGCG GTCACCTGGC TGTCCGGCGA GATCGCCGGC
TGGCTCGCCC AGGAGCGCCT GGCCGGCGTG AAACTGGTGA TCGCCACCCG CGGCGCGGTG
GCGGTCGGCC CCGACGACGA CATCGAGGAC CTGATCCGCG CGCCGCTGTG GGGCCTGCTG
CGCACGCTGC AGACCGAACA TCCGGACCGC TTCGTCCTGT TCGACCTCAC GGTCGACGAC
CCGGCGGGGG CGGCGGCCCT GCCGGCCGTC GTCGAGTCGG GCGAGCCGCA GGTCGCGGTG
CGGCAGGGGG TCGTCCTGCT GCCCCGGCTC GCCCGGGTGG CCGCCGTCCC GACCGACCAG
GACTCACCGA CGGGCGGGAA CCTGTTCGCC GACCCGCTGC GGACGGTGGT CGTCACCGGC
GCGGACGGCC CGGTCGGGGC CGCACTGGCC CGCCACCTGG TGGCGAACTA CGGAGTCCGC
AGGCTGCTGC TCGTCAGCCG TCCCGGCGTC GCGGGTGACC TGGGTGACCT GGGTGAACTG
GGCGGCGCGG GTGGCCTGGG CGGCGCGACC GGTGAGCACT CCGCCGAGGC ACTGCGGGCC
GAACTGGCGC ACGCGGGGGC GACAGTCACG CATGTCCCAT GCGATCTTGC CGACCGGGCG
GCGCTCGCGG CGGTGCTGCA CCGGCACGCC CGTTCGGTCT GCGCGGTCTT CCACGCCCAG
AGCGCCGCGG GACCGGTCGG CGGGCTCCGG AACGACGAGC AGCGGCTGGC CGCGACCCTG
GCTGGCGCGC TCAACCTGCA CGACCTGATC GGCGGGCCGG AGACCCGGGC GTTCGTCCTG
TGCTCCTCGG CCGCCGGCCT GCTCGGCGGG GCGGGACTGG CCGAGCCGGC CGCGGTGAGC
GTGTTCTTCG GCGCGCTGGT CCAGCACCGC GCCGCGCGCG GCCTGCCGGC TGCCAGCGTG
AGCTGGGGGC CGTGGGAGGG CAGCGAGCTG CCCGAGCTGC CCGGCTCCCG CGCGCTGCGC
GTCCGGGAAG GCCTGGCCAT GTTCGACGCG GCCCTCGGCG CCGATCAGAG TGTGCTGGCG
GTCCTGCGCC CGGACGCCGC CGTGCTCGCC GACGACGCCC CGCCCGCGCC GCTGCGCGGA
CTGATCGACG TTCCGGTGGC GCACCGGCCC CCGGACGACG CCGTCGCGGT CGAGCTGCGG
ACGAAGCTTG CCGGTCTGCC CGAGGCCGAC GCTCTGCGGC TGTTGACGAC CCTCGTCCGG
ACCGAGGTCG CGCGGGTCGC AGGCGGGTCC GCCGGCGCGG CCGTCGAGGC GGGGACGGCG
TTCCGCGACC TCGGACTGAC CTCGGTGACC ACGGTCGAGC TGCGCAACCG GCTGACCGCG
AGCACCGGCC TGCGGCTTCC CGTGACCGTG GCCTTCGACC ATCCCACCCC GCTCGAGCTG
GCCCGCGAGC TGCGGCGCGG GCTGCTCGGC GAGGCATCCG CCGCCGTGGC GGTCCGGAAC
CGGCGCGCCG TGTCGGACGA GCCGGTGGCG ATCGTGGGTA TGGCGTGTCG GTTGCCGGGT
GGGGTGGTGT CGGCGGAGGG GTTGTGGGAT GTGGTGGCGG GTGGGGTGGA TGCGGTGTCG
GGTTTTCCGT CGGATCGTGG GTGGGATCTC GCGGGTTTGG CGGGGGACGG TGTTGGTGGC
GTGGGTGGCG TGGGTGGCGT GGGTCGGGTG GGTTCGTCGG TGGCGGGTTC GGGTGGTTTT
CTGCGGGATG TGGCGGGGTT TGATGCGGGT TTGTTCGGGG TTTCGCCGCG GGAGGCGTTG
GCGATGGATC CGCAGCAGCG GTTGTTGTTG GAGGTGTCGT GGGAGGTGTT GGAGCGGGCC
GGGATTGATC CTGGTTCGTT GCGGGGGGAG CCGGTGGGTG TCTTCACCGG CCTGATGCAT
CACGACTACG CGCGGGGGAA CACCGCGGCG GCCGAACGCC TCGAGGGTTA TCTCAGTATC
GGAACAGCGG GCAGTGTGGC GTCGGGTCGG GTGGCGTATT CGTTTGGTTT TGAGGGGCCT
GCTCTGACGG TGGACACGGC GTGTTCGTCG TCGTTGGTGG CGTTGCATCT TGCGGTGGGG
TCGTTGCGGT CGGGTGAGTG TTCTCTTGCG TTGGCTGGTG GGGTGGCGGT GATGGCGACG
CCGGAGGTGT TCGTGGATTT CTCGCGGCAG GGTGGTCTTG CCGTGGATGG CCGGTGCAAG
GCGTATGCGG ACGCCGCCGA CGGAACCGGA TGGGCGGAGG GCGCCGGGGT TCTCCTGCTG
GAGCGGTTGT CGGACGCGGA GCGCAATGGT CATCGGGTGT TGGCGGTGGT GCGTGGTTCG
GCGGTGAATC AGGATGGTGC GTCGAATGGT CTGACGGCGC CGAGTGGCCG GTCGCAGGAG
CGGGTGATCC GGGCGGCGTT GGCGGATGCG GGGTTGACGA CGGCTGATGT GGATGTGGTG
GAGGGGCATG GGACGGGGAC GCGTCTCGGC GACCCGATCG AGGTCGGTGC GCTGCTGGCG
ACCTACGGCA CCGGTCGCTC GCCGGACCGT CCGGTGTTGT TGGGGTCGTT GAAGTCGAAC
ATCGGGCACA CGCAGGCCGC CGCGGGGGTG GCCGGTGTCA TCAAGACGGT GCAGGCGCTG
CGGCACGGCG TCGTCCCGAG GACGCTGCAT GTGGACGCGC CCTCGTCGCG GATCGACTGG
TCGGCGGGTG CGGTGTCGCT GGTGACGGAG CCGGTGGTGT GGCCGGAGAC GGGCCGGCCG
CGGCGGGCGG GTGTGTCGTC GTTCGGGGTG AGCGGGACGA ACGCCCATGT GATCATCGAA
CAGGCCCCGG ACGAGTCCCC CGACACCGCC ACGGATGAAG TCGCAGATGC CGCCCCGGAT
GGCGTCGTGG CGGGGGAGGC CGTGCCCTGG CTGCTGTCCG CGGCGTCGCC GGCGGCGCTG
CGCGCCCAGG CCGACGCCCT GGCGGAGTTC GCCGCCGCGC GGCCGGCACC TGCCGCGGCC
GAGATCGGCC GGTCCCTGGC GACGACCCGC GCGCGTCTCG CGCGACGTGC GGTCGTGGTG
GCGGGCTCCC ACGACGAGCG GGTCACCGCG TTGCGCGCGC TGGGCGCCGG CACGCCACAT
CCGGACGTGA TCTTCGAACC GTCCCGGCCG CAGGTCGGCG CCGACCGCGG AAACGTTTTT
GTTTTTCCGG GTCAGGGGGC GCAGTGGGTG GGTATGGGTG CGGGTTTGTT GGGGGGTTCT
TCGCGGTTGT CGGAGGTGTT TCGGGGGGTG GTGGAGGAGG TTTCTGGGGT GTTGGCGGGG
TTGGTGGACT GGTCGTTGGT GGATGTGTTG CGGGGGGTGG GGTCGGATGG GGTGTTGGAG
AGGGTGGAGG TGGTGCAGCC GGCGTCGTTC GCGGTGGGTT TGGGGTTGGT GCGGGTGTGG
GGTGAGTTGG GGGTGGTTCC GGGTGCGGTG CTGGGTCATT CCCAGGGTGA GGTGGTGGCG
GCGTGTGTGG CGGGGGCTTT GGGGGTGGGT GATGCGGTGC GGGTTGTGGT GGGTCGTAGT
CGGGTGGTGG CGGAGAGGTT GTCGGGTCGG GGTGGGATGG TGTCGGTGTT CCTGCCGGTG
GATGAGGTTG TGGGGTTGTT GCCGGTGGGG GTTGAGGTGG CGGCGGTGAA TGGTCCGGGG
GTGACGGTGG TGTCGGGTGA GCGGGCTGGT CTGGTGGAGT TGGTGGGGGT GTTGGAGGGT
CGGGGTGTGC GGGTGCGGTG GGTGGCGGTG GATTATGCGT CGCATTCGTC GCAGGTGGAT
GGGGTGGCGG GGGAGTTGCG GGAGTTGTTG GCGGGGGTGC GGTCGGTGGT GCCTCGGGTT
CCGTTTTTTT CGACGGTGGA GGGTCGGTGG GTGTCGGGGG CGGGGGAGTT GGAGGGGGAT
TACTGGTTTC GGAATCTGCG GTCGAGGGTG GGGTTCGCGG GTGCGGTGGG GGTGTTGGCG
GGGGAGGGGT TCCGGTCGTT TGTGGAGGTG GGGGCGCATC CGGTGTTGGT GGGGGCGGTG
GGTGAGGTGT TGGAGGAGGT GGGGGTGTCG GATGCGGTGG TGGTGGGGTC GTTGCGGCGT
GGTGAGGGGG GTGGTGGGCG TGTTCTGCGG TCGGCGGCGG AGTTGTTCGT GCGTGGGGTT
CGGGTGGACT GGTCCGGGGT GTTCGACGGG CGTGGGGGGG TCGCGGTGGG TGTGGGTGTG
GGGCCTGACC TGCCGACCTA TCCGTTCCAG CACGAGCGCT ACTGGCTGGA CCCGGATCCC
TCCGGCGGGG ACGGCCTCGG CGACGTCTCC ACGGCCGGAC TCGATCCGGT CGACCATCCG
CTTCTCGGCG CCGGTGTCAC CGTCGCAGGA GAGTCCTTCC CGGATACGTC CCCCGACCCG
CTCGCGGCCG GTCTCACGGA AGGCGGCCCC GGCGGCGATG GCCGTCTGCT CTACACAGGC
CGGCTGACGA CGGATCGCCA CCCGTGGCTG GCCGACCACC GTGTTGGCGG TGCCGTGGTG
CTGCCCGGTT CGGCGCTGCT CGACGTCCTG GCGTGGATCG GAGACCGGGT GGGGCGGCCC
GTCCTCGCCG AGCTGACGAT CTCCGCGCCG ATCCTGATCG ACGAGCCCGC TGAGAACACC
GCCGCGAACC CGGCCGCCGG CACGGACATC CAGATCGTCG TCGGTCCGCC GGATGGCGAG
CGGTCGCGCC CCGTCACGGT CCACTCCCGG ACCAGCCCGG ACGAGGCCTG GACCGAGAAC
GCCCGCGGAA TCCTCGCCCA GCCAGCGGGC ACCGCGACCG GCGGCCCGAC CGATCCGACC
GGTCGGCCCG GCTGGGCGGC GTCGTGGCCC CCGGCCGCCG CCAGCCCTGT CGACATCGAC
GAGTGCTACG ACCGGCTTCC TGTGGACTAC GGCCCCGCGT TCCACGCGCT GCGCGCGGCG
TGGGTGGCCG AGGGCACCGT CTACGCCGAG GTCACGCTCC CGGACGCGGC CGTCGCCCCC
GCCGGGGACG TTTCACCCGC GTCCCGGTGG CTGCGCTCGG GCGGCGTGGA ACTCTCCGAC
GGCACCGACG GCACCGAGCC CTCGGACGGA TATGGCCTGC ATCCGGTGCT GTTGGACGCG
GCGCTGCACC CGCTGGGCGT GGCCGGGTTC TTCCCCGACC CGGACCAGCC ACGCCTGGCC
TTCGCCTGGT CGGGGGTGCG GCTGTGGGCG ACCGGGGCCC GCTCCCTGCG GGTGCGGATC
GCGCCTGCCG GGCCCGACAC CATGACGATC AGCGCGGTCG ACGACGAGGG CGCCGCGGTG
ATCGACGTCG AGGCACTGGT GGTCCGCCCC GTCGACCCGG AGCGGCTGAC GGCCGGGCGC
GGTGCGGCGC GGCGGTCGCA CGCGCTGTTC GAGGTGCGAT GGGAACAGAC GGACGCCGCC
GCGGTGCGCG GCACGGAATG GGCCTTCCAC CCCGACTCCG CCGTTCGTGA CTCCGCCGTT
CCCGGCGGTA GCGGCGGAGT CACGGGCACT CCCGTGGCCG AGCGGGCGGC CCGTCCCCCG
TTCGTGGTCT TCGTTCCCGC CAGCGCCACG GGTGTCGACG GTGGTGCCGG TGGGGACGCC
GGCACGGTCC CGGGCGCAGC CGTCGCCGTC CGGGACGTCC CCGGCGCGGT GCGTGCCGCC
GGGCTGCGGA CGCTGCGCGT TCTCCAGGAC TGGCTGGCCG ATCCCGACAG CGCCTCGTCC
CGTCTCGTGG TCGTCACGCG GGACGGCGAC CTGGCCCACG GCTCGGTGCG CGGGCTGGTA
CGCGCCGCGC AGGCGGAACA CCCGGGGCGG TTCGGCCTGC TCAACCTGGA CCGGGCCTAC
CACCTGCCGG AACCCCCCTC GGTCGGCGAG ACTCCCTGGC TGGAGGCGAT CGCGGCGGGT
CTGGGCGCGC CGGCCGACGA GCCGTGGGTC GCCGTCCGGC CGGCGGGGGC AGGCGGGACC
GAGGTGTTGG CCGCGCGGCT GCGGCGCGCC GAGCGGGCCC CGGTGCCGGC GCGGTCGCTT
CTCGAGGGAG GCACCGTCCT CGTCACGGGC GCCACCGGTG GCCTCGGCCG GCTGGCCGCC
GACCATCTCG CCCGGGTCCA CCGGGTGCGG GAGCTGGTTC TCGTCGCCCG GTCCGTGGCG
GGGGAGGAGC AGGTCGCCGA GCTGCGCGCG ACGGGTGTCA CCGTGCGCGC GGTCGCCGCC
GACGTGGCCG ACCGCGCTGC CATGGCCGAG ATCGTCGCGT CCGTCGCCGA CCGCCTCACC
GCGGTGGTGC ACATCGCTGG CATCGTCGAG GACGGCGTCA TCGAGGCCCT CGACGAACCG
CGTTGGCACG CCGTCCTGCG GACCAAGGCG GATGCCGCCT GGCACCTGCA CGAGCTGACC
GCCGGCCTCG ACCTCGCCGC GTTCGTGCTC TACTCCTCGG CCGCCTCAGT CTTCGGTGGC
GCGGGCCAGG GCAACTACGC GGCCGGCAAC GGCTTCCTCG ACGCGCTCGC CCGGCATCGG
CGGGCGGCTG GTCTGCCCGC GGTGTCGCTG GCGTGGGGAC TGTGGCAGGA GGCCGCCGGA
ATGGGCGGCC GGCTGAGCGC GACCGACCGG GCCCGCATGA CCCGCGATGG AACCAGGGCA
CTGACGGCGG CCGACGGCCT CGCCCTGTTC GACTCCGCCC TGGTGGACAG CCGGCCGGAG
CTGGTCCCCG TACTGCTCGA CCTGGGCGCG CTCCGGCGGC GCGACTCGCT GCCACCCCTG
CTGCGCGGAC TCGTGCCGGC GCCGGCGCCG GCACGCCGCC GCGCCGCCGT GTCGGCGGGC
TCCGCGGCGC CTGACCGCTC CACTCTGCGC GACCGGCTCG CCGCTGCGTC GGCCGGTACC
CGGGACGAGA TTCTGCTGGA ACTGGTCCAG GCGACCGCGG CGGTCGTCCT CGGCCACACC
GATCCGGCCG CGGTCGAGCC GGACCGGGCC TTCCGGGACC TCGGCTTCGA CTCGCTTACC
AGCGTGGAGC TGCGCAACGG CCTGATCACC GCGGCCGGGG TGCGGCTGTC GGCGACCGTG
GTGTTCAACC ATCCCACGCC CCGATCGCTC GCCGGCCACC TGGCGGCGCA GCTCGCCCCG
GACCTCCCGG ACCTCCCGGA CGGGCCCTCC GCGTCCGGCG ACCCGCCGCG GGCCGTCACC
GCCGTGGCGG CCGGCACCGT CCGGGGGAGC GGCGGCCGTA CCCGGGAGCA GGACCGAACC
GACGACCCGA TCGTGATCGT CGCCATGGCC TGCCGGTTCC CGGGCGGGAT CGGCTCGCCG
GCCGACCTGT GGCAGGTCGC CAGCGACGGC GTGGACGTCG TCGGCCCGCT GCCGACGGAC
CGCGGCTGGA ACCTCACCGA GCTCTACGAC CCCGACCCGG ACCGCCCGGG ACGGACCTAC
GTGCACGCCG GCGGCTTCCT CACCGACGTG ACGGGCTTCG ACGCCGGCCT GTTCGGGATC
TCGCCGCGGG AGGCGCAGGC GATGGACCCG CAGCAGCGGC TGCTGCTGGA GACGTCCTGG
GAGGTGCTGG AGCGGGCGGG TCTCGACCCG ACGTCGCTCG CGGGCACGCC GACCGGGGTG
TACGTCGGCA CGCACGGCCA GGACTATGCG AGCGAGGTGT CCGGCGAGCG GGCGGACGAG
GGATATCTCG TCATCGGCCG GGCCGCGAGC GTGCTCTCCG GCAGGGTGTC CTACGCGTTC
GGCTTCGAGG GGCCCGCGTT GACCGTGGAC ACGGCGTGCT CGTCCTCGCT CGTCGCCCTG
CATACCGCGG CCGCCGCTCT GCGGGCCGGC GAGATCGGCC TGGCGCTGGT CGCGGGCGTG
TCCATCATGT CGAGCCCGGA GGGCCTGCTC GGTTTCTCCC GGCAGCGTGG CCTCGCCGCC
GACGGCCGGT GCAAGGCCTA CGCGGACGCC GCGGACGGCT TCGGGATGGC CGAGGGCGTG
GGCGTTCTGC TGGTGGAGCG GTTGTCGGAC GCGCGTCGCC ACGGCCGCCG GGTGTTGGCG
GTGGTGCGTG GTTCGGCGGT GAATCAGGAT GGTGCGTCGA ATGGTCTGAC GGCGCCGAGT
GGCCGGTCGC AGGAGCGGGT GATCCGGGCG GCGTTGGCGG ATGCGGGGTT GACGACGGCT
GATGTGGATG TGGTGGAGGG GCACGGCACG GGGACGACGC TGGGCGACCC GATCGAGGCC
CAGGCGCTGC TGGCCACCTA CGGCCGGCGG CCGGCGGACC GGCCGGTGCT GCTCGGGTCG
GTGAAGTCGA ACATCGGGCA CACCCAGGCG GCCGCCGGGG TGGCCGGGAT CATCAAGATG
GTTCAGGCGC TGGACCATGC GGTGGTCCCG AAGACGCTGC ACGTGGATCG GCCGTCGGGG
CATGTGGACT GGTCGGCGGG TGCGGTGTCG CTGGTGACGG AGCCGGTGGC GTGGCCGGAG
ACGGGCCGGC CGCGGCGGGC GGCCGTGTCG TCGTTCGGGG TGAGCGGGAC GAACGCCCAT
GTGATCATCG AACAGGCCCC GGTCGCATCG ACCGAACCGG CTGAAACAGA GGCACCGGCC
GCACCGGGTG AATCAGAGGT ACCCGCCGGA CCGGTCGTTC CGGTCGTGCT GTCGGCGGCG
TCCCGGGAGG CTCTGCGCGG GCAGGCCGGG CGGCTGGCGG AATTCGTCCG GACCAGGACC
GACGTCCCGG TGGCCGCGGT CGCCGCGACG CTGCTGACCC GCGCGCGGCT CGGCCAGCGC
GCGGTCGCGG TGGCCGCGGA GCGCGGCGAG CTCGTCGCCG GCCTGGAGGC GCTGGCCGGT
GATCTTCCTG ACCCGGCCGT GGTCTCCGGG GCGGCGAGCC CCCGGGGCAG GGGCCCGGTC
TTCGTGTTCC CGGGCCAGGG CGCGCAGTGG GTGGGCATGG GCGCCGGCCT GCTGTCGGGT
CCCTCGGTGC CGTCGTCGGC GTTGTCGTCG GTGTTCCGGG AGACGGTGGA CGAGGTCGCC
GGGGCGCTGG CGGGGCTGGT GGACTGGTCG CTCGTGGACG TCCTGCGGGG GGAGGGGCCG
GACGGGGCGC TGGAGAGGGT GGAGGTGGTC CAGCCGGCGT CGTTCGCGGT GGCCCTGGGT
CTGGCGCGGG TGTGGCGGGA ACTGGGGGTC ACCCCGGGCG CGGTGGTGGG CCACTCGCAG
GGAGAAGTGG CGGCGGCGTG CGTGGCGGGA GCGTTGGGCA CGGCCGACGC GGTACGGGTG
GTGGTGGCCC GCAGCCGGGT GGTGGGCGCG CGGCTGGCCG GCCGTGGCGG GATGGTGTCG
GTGTCCCTGC CCGCGGCGGA ACTGGCTGGT CTGCTGCCAC CGGGGGTGGA GGTGGCCGCG
GTCAACGGCC CGGGAACGAC CGTGATCTCG GGCGCCTCCG CGGCGTTGGC CGAGCTGGTG
GCGGCGCTGG AGGTGCGGGG CGTGCGGGCG CGGTCGGTGG CGGTCGACTA CGCGTCGCAC
TCCGCCCAGG TGGACGCCGT GGCCGAGGAG CTGGCGGAGC TGCTGGCGGG GGTGCGGCCG
GAGTCGCCCC GGATCCCGTT CTTCTCCACG GTGGAGGGCC GCTGGATCGC GGGAGCGGAG
CTGACAGGCG ACTACTGGGT CCGCAACCTG CGGCGGACGG TGGGCTTCGC CCAGGCGGTC
GGAGTCCTGG CGGGCGAGGG GTTCCGCTCG TTCGTCGAGG TCGGTGCCCA TCCGGTGCTG
GTGCCCTCGA TCGGTGAGGT GCTGGAGGAA GCCGGGTTCG GTGACACGGC GGTGGTGGGA
TCGTTGCGCC GGGGCGAAGG CGGCCCCGAG CGGCTCCTGC GCTCCGCGGC CGAGCTGTTC
GTGGCCGGCG TCCCCGTCGA CTGGACGAAG GCGTTCCCCG CCGCCGCGCC GCGGGCCGCC
GGCCGGCTCG GCGCGGCGGC GGAGCTGCCG ACGTACGCCT TCCAGCACGA GCGTTACTGG
CTCGCGCCGG GAGCACCCGG TTCCGGTGAC GTGACCGCCG CCGGGCTCGA CGCGACCGGT
CATCCTCTGC TCGGCGCCGC GGTCGACCTC CCCGGCTCGG CGCCCGACGC GGCCGAGGTG
GCGTTCACCG CCCGGCTGTC GGCGCGGACC CATCCCTGGC TGGCCGATCA CGCGGTGCGC
GGCGTGCGGC TGCTGCCGGC CACGGCGTGG ATCGAGATCG GGTTGCACGC GGGAGACCGC
GTCGGCCACC CCGTACTCGA CGAGCTGTTG ATCGAGGCGC CGCTCGCGGT TCCGGCCGAG
GGCTCGGTCA CCCTGCGGGT GATCGTGGAC GGGCCGGACG CCGATGGTCG GCGCCCCGTG
CGGCTGTACG CCCGCCCGGA CGGCGCCGCC GCGCCCGGTG ACGGCGACGG CGACGGGACC
GAAGGCACCG GCACCCGGTG GACCCGCCAC GGCACGGGGC TGCTGTCGGC CGAGGTCACC
GAGCCCGGAC ACGGCTACGA GGCATGGCCC CCGGCCGATG CCCGCCCGGT GGACCTCGCA
GACTTCTACC CGCGGCTGGC CGACCGGGGA TACGACTACG GCCCGGCGTT CACGGGGCTG
CGCGCCGCCT GGACGCGGGG CCGGGAGGTC TTCGCCGAGG TGGAGCTGCC GGCCGCCGAG
GCGTCCGCCG AGCCCCCCGG CGCGTACGGT CTGCACCCCG CGCTGCTGGA CGCCGCCCTG
CAGGCGACGA ACCTCGGCGC GGTGCCCGCC GCCGAGGAGG GGCACGTCCT GCTGCCCTTC
GCCTGGAGCG GCATCCGTCG CTTCTCCTCG GGCGTGACAG CCCTGCGCGT GCACGCCACG
CCGAGTGATC TCGCCGCCGC CCCCGGGTCG CACGGGGTGA GCGTCCGGAT GTCCGACCGC
GGCGGTGCGC CCGTCGCCGA GATCGGCTCC CTGGTGCTGC GCGCGACGCC GCTGGCGCAG
CTCGACCGGC TCGACCGTTC GGCCGGCAGT GGTGGGGCGG GGACCGCCGA GGCGCTGTTC
CGGGTGGAAT GGGTCGGCGT ACCGACCGGC CCGGCCGGTA CGCCGCTCGG GAGACCGGCG
GTGTCCCCGG CGGCGGGCGC GGAGCCGCCG GAGGCGGACG TTCTCGACGT CACCGGCCGC
GCGAGCGTTG ACCCGACCGC CGTCCGCGCG CTGGTCGCCG ACGTCCTGGA GGCCCTCCAG
AAGCGGCTCG CGCCGGCCGG GGGTGCGCCG GCGGTGACCG CCCCCGGCCA CGGCTGGCGG
GTCCCGGACG GGCCCCTGGT GATCCTCACC GACGATCCCG CGGGTGAGCC CGCCTCGGCT
GCGGTGTGGG GCCTGGTCCG CTCGGCGCAG GCCGAGCATC CAGGGCGGTT CGTCCTGCTC
GGCGGTGCGC CGGAGGAGTC CCGGGCCGTG CTGCCCACCG TGCTGGCGAG CGGCGAACCG
CAGGCGGTCG TGCGGGATGG GCAGGTGCTG GTGCCCCGCC TGGCTCGCGC GGCGCGGCCC
TCGGCCGACA CGTCTGCTCC GGACGGTGCC TCGCCGCTCA GCGGCATGTC GCCGGTGGAC
GGCACCACGC TGGTCACGGG TGGCACGGGA ACCCTGGGAG CGATCGTCGC GCGAGAGCTG
GTGCGTACGC ACGGTGTCCG GCACCTGGTG TTGCTCAGCC GCACCGGCGC GGCCACGCCG
GGCGCCCCCG AGCTGGTCGC GGAGCTCCGT CACGCCGGTG CCGCGGTCGA GGTGGTCGCC
GCCGACGCGG CGGACCGCGA GGCGGTGCGG GCCCTGCTGG CGGACATCCC GCCCGCGCAC
CCACTGACCG CGGTCGTCCA TGTGGCCGGG GTGGTCGACG ACGGCCTGGT CACCTCACTG
GACCGGGGAC GGCTCGACTC CGTCTTCCGG CCGAAGGCCG ACGCCGCCTG GAACCTGCAC
GAGCTGACCG CTGAGCTCGG TCTCGCCGCC TTCGTGCTCT TCTCCTCCGC CGCCGGCGCC
TTCGGCGGCG CGGGGCAGGG CAACTACGCC GCGGCGAACG GCTACCTCGA CGCGCTCGCC
GAGTACCGCG CCGGCCTCGG ACTGCCCGCG GTGGCGGTGG CCTGGGGCCT GTGGGAGCGG
GCGAGCAGCC TGACGGCGAG CCTGACACCG GCCGACCGCG ACCGGATGGC GCGGGGCGGT
GTCCGCGGCC TGTCCGACGC CGAGGGCGCC GCGTTCTTCG GCGCGGCGCT GCGGTCGCCG
GACGCGGTGC TGGTCGCCGC CGCGGTCGAC GTGCCGGCGT TGCGGCGGCG CGCGTCGGCC
GGCGGGCTGC CGCCGCTGCT GCGCGGACTG GTTCCCGCGC CGGTACCGGC CGCGGACTCG
GCGGCCCCGG ACAGCCCCGG CGGTATCACC GGTACCGGCA GCGCCGGCGC CGGCTCAGCC
TCGGGTCAGC CGGGGCGGGC GTTGGCCCGG CGACTCGCCG GCCAGGCGGA GCCGGAGCGC
CGGCGCGTCC TGCTCGATGT CGTCCGTGCC CACACCGCGA CGGTCCTCGC ACACCGGTCG
GGGAACGCGG TGGGTGTCGG CCAGACGTTC CAGGAGCTCG GGTTCGACTC GTTGACCGGC
GTCGAACTGC GCAACCGGCT CGCCGCGGCG GTGGGGGTGC GGCTGGCCGC GACGATGGTC
TTCGACCATC CCACCCCCGC CGCGCTCGCC GAGCACCTGC TCCGTCTGCT CGACCTGGAC
GGTCCGGACG ACCCGGACAG CCATCACGGC CTGGACGGCC CCGGTACTCC GGCGGTGCGG
AACTCCTCGG TGCTCGACCA GCTCGCCCGT CTGGAACGGG TCCTGACCGG CGCCACGGAC
GCCCGGATCG CCGTCCCGGA CGAGGTGGCC GCCCGGCTGC GGGCGCTCGC GGCATCGCTG
GGCCCCCGGC GCGACGACGG CGCCGGTCTC GACCTGACCT CCGCGACGGA CGACGAGATG
TTCGAGCTGC TCGACCGCGG CCTCGGTTCC TGA
 
Protein sequence
MRMLRTELIA PLPKLLRGNA ERLGDRIAFR DASRAVGWAE LERRTGWLAG HLADLRLQPG 
DRAAIVLGNC VEVVESYLGF ARASVVGVPI NPRVTETELA YLLDDCGARL VVTDPARIDM
VGRVLRDRPG LRVVVTGGHA PPPSAPAGTL SFAALAGAQP RSAARDDLGL DDVAWMLYTS
GTTGRPKGVL STQRSCLWSV AACYAPIPGL SEQDRVLWPL PLFHSLSHIA CVLGVTAVGA
SARLLDGFAA SEVLAAIQED GSTFLAGVPT MYHYLVRAAR ESGFSAPSLR MCLVGGAITT
ARLRRDFEEA FGAPLLDAYG STETCGSITI NWPTGARVEG SCGLPVPGLG VRLVDPETGL
DVGAGAEGEV WVRGPNVMVG YHNQPEATAA ALRDGWYRTG DLARRDDAGY FTITGRIKEL
IIRGGENIHP GEVEEVLRGV PGVADVAVVA RPHDLLGEVP VAFLVPGPEG LDPDRLLATC
RERLSYFKVP EELYEIDRIP RTASGKITRH VLLERPARLR AASSGHHDTL FRVDWIPRPS
VTSSSVRARP DHTSSPVQAG GPGRQPLSAP PVGTSVPPVG GERRWAIIGA DAFGFAPVLT
EAGILVSQYP NLDAVRRAAA DGDEVPDLAV LTCGSVLGKA GVLSDDAARA VTWLSGEIAG
WLAQERLAGV KLVIATRGAV AVGPDDDIED LIRAPLWGLL RTLQTEHPDR FVLFDLTVDD
PAGAAALPAV VESGEPQVAV RQGVVLLPRL ARVAAVPTDQ DSPTGGNLFA DPLRTVVVTG
ADGPVGAALA RHLVANYGVR RLLLVSRPGV AGDLGDLGEL GGAGGLGGAT GEHSAEALRA
ELAHAGATVT HVPCDLADRA ALAAVLHRHA RSVCAVFHAQ SAAGPVGGLR NDEQRLAATL
AGALNLHDLI GGPETRAFVL CSSAAGLLGG AGLAEPAAVS VFFGALVQHR AARGLPAASV
SWGPWEGSEL PELPGSRALR VREGLAMFDA ALGADQSVLA VLRPDAAVLA DDAPPAPLRG
LIDVPVAHRP PDDAVAVELR TKLAGLPEAD ALRLLTTLVR TEVARVAGGS AGAAVEAGTA
FRDLGLTSVT TVELRNRLTA STGLRLPVTV AFDHPTPLEL ARELRRGLLG EASAAVAVRN
RRAVSDEPVA IVGMACRLPG GVVSAEGLWD VVAGGVDAVS GFPSDRGWDL AGLAGDGVGG
VGGVGGVGRV GSSVAGSGGF LRDVAGFDAG LFGVSPREAL AMDPQQRLLL EVSWEVLERA
GIDPGSLRGE PVGVFTGLMH HDYARGNTAA AERLEGYLSI GTAGSVASGR VAYSFGFEGP
ALTVDTACSS SLVALHLAVG SLRSGECSLA LAGGVAVMAT PEVFVDFSRQ GGLAVDGRCK
AYADAADGTG WAEGAGVLLL ERLSDAERNG HRVLAVVRGS AVNQDGASNG LTAPSGRSQE
RVIRAALADA GLTTADVDVV EGHGTGTRLG DPIEVGALLA TYGTGRSPDR PVLLGSLKSN
IGHTQAAAGV AGVIKTVQAL RHGVVPRTLH VDAPSSRIDW SAGAVSLVTE PVVWPETGRP
RRAGVSSFGV SGTNAHVIIE QAPDESPDTA TDEVADAAPD GVVAGEAVPW LLSAASPAAL
RAQADALAEF AAARPAPAAA EIGRSLATTR ARLARRAVVV AGSHDERVTA LRALGAGTPH
PDVIFEPSRP QVGADRGNVF VFPGQGAQWV GMGAGLLGGS SRLSEVFRGV VEEVSGVLAG
LVDWSLVDVL RGVGSDGVLE RVEVVQPASF AVGLGLVRVW GELGVVPGAV LGHSQGEVVA
ACVAGALGVG DAVRVVVGRS RVVAERLSGR GGMVSVFLPV DEVVGLLPVG VEVAAVNGPG
VTVVSGERAG LVELVGVLEG RGVRVRWVAV DYASHSSQVD GVAGELRELL AGVRSVVPRV
PFFSTVEGRW VSGAGELEGD YWFRNLRSRV GFAGAVGVLA GEGFRSFVEV GAHPVLVGAV
GEVLEEVGVS DAVVVGSLRR GEGGGGRVLR SAAELFVRGV RVDWSGVFDG RGGVAVGVGV
GPDLPTYPFQ HERYWLDPDP SGGDGLGDVS TAGLDPVDHP LLGAGVTVAG ESFPDTSPDP
LAAGLTEGGP GGDGRLLYTG RLTTDRHPWL ADHRVGGAVV LPGSALLDVL AWIGDRVGRP
VLAELTISAP ILIDEPAENT AANPAAGTDI QIVVGPPDGE RSRPVTVHSR TSPDEAWTEN
ARGILAQPAG TATGGPTDPT GRPGWAASWP PAAASPVDID ECYDRLPVDY GPAFHALRAA
WVAEGTVYAE VTLPDAAVAP AGDVSPASRW LRSGGVELSD GTDGTEPSDG YGLHPVLLDA
ALHPLGVAGF FPDPDQPRLA FAWSGVRLWA TGARSLRVRI APAGPDTMTI SAVDDEGAAV
IDVEALVVRP VDPERLTAGR GAARRSHALF EVRWEQTDAA AVRGTEWAFH PDSAVRDSAV
PGGSGGVTGT PVAERAARPP FVVFVPASAT GVDGGAGGDA GTVPGAAVAV RDVPGAVRAA
GLRTLRVLQD WLADPDSASS RLVVVTRDGD LAHGSVRGLV RAAQAEHPGR FGLLNLDRAY
HLPEPPSVGE TPWLEAIAAG LGAPADEPWV AVRPAGAGGT EVLAARLRRA ERAPVPARSL
LEGGTVLVTG ATGGLGRLAA DHLARVHRVR ELVLVARSVA GEEQVAELRA TGVTVRAVAA
DVADRAAMAE IVASVADRLT AVVHIAGIVE DGVIEALDEP RWHAVLRTKA DAAWHLHELT
AGLDLAAFVL YSSAASVFGG AGQGNYAAGN GFLDALARHR RAAGLPAVSL AWGLWQEAAG
MGGRLSATDR ARMTRDGTRA LTAADGLALF DSALVDSRPE LVPVLLDLGA LRRRDSLPPL
LRGLVPAPAP ARRRAAVSAG SAAPDRSTLR DRLAAASAGT RDEILLELVQ ATAAVVLGHT
DPAAVEPDRA FRDLGFDSLT SVELRNGLIT AAGVRLSATV VFNHPTPRSL AGHLAAQLAP
DLPDLPDGPS ASGDPPRAVT AVAAGTVRGS GGRTREQDRT DDPIVIVAMA CRFPGGIGSP
ADLWQVASDG VDVVGPLPTD RGWNLTELYD PDPDRPGRTY VHAGGFLTDV TGFDAGLFGI
SPREAQAMDP QQRLLLETSW EVLERAGLDP TSLAGTPTGV YVGTHGQDYA SEVSGERADE
GYLVIGRAAS VLSGRVSYAF GFEGPALTVD TACSSSLVAL HTAAAALRAG EIGLALVAGV
SIMSSPEGLL GFSRQRGLAA DGRCKAYADA ADGFGMAEGV GVLLVERLSD ARRHGRRVLA
VVRGSAVNQD GASNGLTAPS GRSQERVIRA ALADAGLTTA DVDVVEGHGT GTTLGDPIEA
QALLATYGRR PADRPVLLGS VKSNIGHTQA AAGVAGIIKM VQALDHAVVP KTLHVDRPSG
HVDWSAGAVS LVTEPVAWPE TGRPRRAAVS SFGVSGTNAH VIIEQAPVAS TEPAETEAPA
APGESEVPAG PVVPVVLSAA SREALRGQAG RLAEFVRTRT DVPVAAVAAT LLTRARLGQR
AVAVAAERGE LVAGLEALAG DLPDPAVVSG AASPRGRGPV FVFPGQGAQW VGMGAGLLSG
PSVPSSALSS VFRETVDEVA GALAGLVDWS LVDVLRGEGP DGALERVEVV QPASFAVALG
LARVWRELGV TPGAVVGHSQ GEVAAACVAG ALGTADAVRV VVARSRVVGA RLAGRGGMVS
VSLPAAELAG LLPPGVEVAA VNGPGTTVIS GASAALAELV AALEVRGVRA RSVAVDYASH
SAQVDAVAEE LAELLAGVRP ESPRIPFFST VEGRWIAGAE LTGDYWVRNL RRTVGFAQAV
GVLAGEGFRS FVEVGAHPVL VPSIGEVLEE AGFGDTAVVG SLRRGEGGPE RLLRSAAELF
VAGVPVDWTK AFPAAAPRAA GRLGAAAELP TYAFQHERYW LAPGAPGSGD VTAAGLDATG
HPLLGAAVDL PGSAPDAAEV AFTARLSART HPWLADHAVR GVRLLPATAW IEIGLHAGDR
VGHPVLDELL IEAPLAVPAE GSVTLRVIVD GPDADGRRPV RLYARPDGAA APGDGDGDGT
EGTGTRWTRH GTGLLSAEVT EPGHGYEAWP PADARPVDLA DFYPRLADRG YDYGPAFTGL
RAAWTRGREV FAEVELPAAE ASAEPPGAYG LHPALLDAAL QATNLGAVPA AEEGHVLLPF
AWSGIRRFSS GVTALRVHAT PSDLAAAPGS HGVSVRMSDR GGAPVAEIGS LVLRATPLAQ
LDRLDRSAGS GGAGTAEALF RVEWVGVPTG PAGTPLGRPA VSPAAGAEPP EADVLDVTGR
ASVDPTAVRA LVADVLEALQ KRLAPAGGAP AVTAPGHGWR VPDGPLVILT DDPAGEPASA
AVWGLVRSAQ AEHPGRFVLL GGAPEESRAV LPTVLASGEP QAVVRDGQVL VPRLARAARP
SADTSAPDGA SPLSGMSPVD GTTLVTGGTG TLGAIVAREL VRTHGVRHLV LLSRTGAATP
GAPELVAELR HAGAAVEVVA ADAADREAVR ALLADIPPAH PLTAVVHVAG VVDDGLVTSL
DRGRLDSVFR PKADAAWNLH ELTAELGLAA FVLFSSAAGA FGGAGQGNYA AANGYLDALA
EYRAGLGLPA VAVAWGLWER ASSLTASLTP ADRDRMARGG VRGLSDAEGA AFFGAALRSP
DAVLVAAAVD VPALRRRASA GGLPPLLRGL VPAPVPAADS AAPDSPGGIT GTGSAGAGSA
SGQPGRALAR RLAGQAEPER RRVLLDVVRA HTATVLAHRS GNAVGVGQTF QELGFDSLTG
VELRNRLAAA VGVRLAATMV FDHPTPAALA EHLLRLLDLD GPDDPDSHHG LDGPGTPAVR
NSSVLDQLAR LERVLTGATD ARIAVPDEVA ARLRALAASL GPRRDDGAGL DLTSATDDEM
FELLDRGLGS