Gene Franean1_3473 details


Gene Information

Locus tag: Franean1_3473
Symbol:
ID: 5671844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4116715
End bp: 4131237
Gene Length: 14523 bp
Protein Length: 4840 aa
Translation table: 11
GC content: 76%
IMG OID: 641242361
Product: Beta-ketoacyl synthase
Protein accession: YP_001507781
Protein GI: 158315273
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR00128] malonyl CoA-acyl carrier protein transacylase
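
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (both are assumptions about this record's conventions, not stated in it):

```python
# Values copied from the Gene Information fields above.
start_bp = 4116715
end_bp = 4131237

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 14523 4840
```

Under these assumptions the computed values agree with the reported Gene Length (14523 bp) and Protein Length (4840 aa).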


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0216506
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAACG ATCAGGTCCC GCCCGGCAAC GGGAAGCCGT CCACCCCTGG GCAGCCGTCC 
ACCCCTGGAC AGCCGTCCGA CCCTGGGCGG TCGCCCGCCG CGGTCGTCTC CCCCGCGACG
GCCGTGGGAG CCGTGGCGGG CGGGGCGGGC GTGCCGGGCG AACCGGTGGC CGTCATCGGC
CTGGCCGCCC GGCTGCCCGA GGCGCCGGAC GTCGCCGCGT TCTGGCGGCT GCTGGAGCGC
GGCGGGCAGG CGGTCGGCGC CCCGCCGGCG GACCGCTGGT CCGCGGACGC ACCGCCGTTC
GGCGCGTTCC TCGACGAGGT CGACCGCTTC GACGCCGACT TCTTCGGGAT CTCACCCCGG
GAGGCCGCCG CGACCGACCC CCAGCAGCGG CTGCTGCTGG AGCTGGGCTG GGAGGCGATC
GAGGACGCCC GGATCGTGCC GGCCGCCCTC GCCGGCAGCC GCACCGCCGT GTTCACCGCG
GCGATCTGGG ACGACTACGC CGCCCTGCAC CATCAGCGGG GCGGCCCGGC GGCGGGCCGG
CACACCATGA CCGGGCTCGG CCGGGGACTG CTCGCCAACC GGCTGTCCTA CCTGCTCGGG
CTCACCGGGC CCAGCCTCAC CGTGGACGCC GCCCAGTCGT CGTCGCTGGT CGCCGTGCAC
CTGGCCGCCG AGTCGCTGCG CCGCGGCGAG GCGGAGCTGG CGATCGTCGG CGGGATCAAC
CTCATCCTGG CCCCGGCCAG CTCCGCCGCG AGCGCCCGCT TCGGCGCGCT CTCGCCGAAC
GGCCGCTGCT TCACCTTCGA CGCGCGGGCC AACGGCTACG TGCGTGGCGA GGGCGGGGTC
GCCGTCGTCC TCAAGCCGCT CGCCCGCGCC CTCGCGGACG GCGACCCGGT GCGCTGCGTG
CTGCTCGGCG GCGCGGTCAA CAACGACGGC GGCGGCGCCG GCCTGACCGT GCCCGACCAG
CACGCCCAGG AGGCGGTGCT GCGCGCCGCC TACCAGGCCG CCGGTCTCTC GCCGGCCCGG
GCCCGCTACG TCGAGCTGCA CGGCACCGGC ACACCCGTCG GCGACCCGGT GGAGGCGGCG
GCGCTGGGCG CGGTGATCGG CACGGCCCGC GCCGGGGAGA ACGCCGGGCC GCTGCTGGTC
GGCTCGGCCA AGACTAACGT GGGTCACCTC GAGGGGGCCG CCGGCCTGGT CGGCCTGCTC
AAGACCGCCC TGGCCCTCGC CCACCGGCGG CTGCCCGCCA GCCTGAACTT CGAGCGGCAG
AACCCGGCGA TCCCCCTGGA CGAGCTCGGC CTGCGGGTCG TCACGGAGGC CACCGACTGG
CCGGCGGACG ACAGCGCGCC GATCGCCGGG GTCAGCTCCT TCGGGATGGG CGGCACCAAC
TGCCACCTGG TGCTGGCCGC GCCCCCGGTC CCGGCCGACG TGATCCCGGC CGACGTGGTG
CCGGGAACGG CGCCCGGCGG GGATCTCCCG CTGCCCTGGG TGCTCTCCGG CCGCGGCGAG
CCGGCCCTGC GCGGGCAGGC CCGGCGCCTC CTCACACTGT TGGAACAGGA CGTCACCCGG
CTGGAACAGG ACGAGACCCC GCTGGAGCAG GACGACGCCG TCCGCTCGGT GGACGTCGGC
TACTCGCTGG CGGCGACCCG CACCCACTTC GAGGACCGCG CGGTGATCCT GGCCGCCGAC
CGGCCGTCCC GCCTGGCCGC GCTGCGCGCG CTGAGCCAGG GTGACCCGGC CACCGGCCTG
GTCGTGGGGC AGGCGCCGGG CGGGACGGGC GGCTCCTGGG CGTTCCTGTT CACCGGGCAG
GGCAGCCAGC GGCCCGGGAT GGGCCGCGAG CTGTACGACG CGTTCCCCAC GTATGCCGCC
GCGTTCGACG AGCTGTGCGC CGCGTTCGAC CCGCACCTGG ACCGACCGCT GCGCGACGTC
GTGTTCGCCG CCGAGGGCAG CTCCGAGGCG GCCCTGCTCG ACAGCACCCG CTACACCCAG
CCGGCGCTGT TCACCGTGGA GGTCGCGCTC TTCCGGCTGG TGACCGGCTG GGGGCTGCGC
CCGGCCCTGC TCGCCGGCCA CTCCATCGGT GAACTGGCCG CCGCGCACGC CGCCGAAGTG
CTCGATCTCG CCGACGCGGC GGCCCTCGTC GCCGCCCGCG GCCGGCTGAT GGGCGCGCTG
CCGCCCGGCG GCGCGATGGT GGCCGTCGAG GCCGACGAGC CCGAGGTCCT GGAGCTGCTC
GCCGACCGGG ACGGGCGGGT GGCCGTCGCC GCGGTGAACG GGCCGCGGGC GGTCGTCCTC
TCCGGAGACG AGCCGGCCAC GCTCGAGGTG GCCGCGGCGC TGGCGGAACA CGGCCGGCGC
ACCCGCCGGC TGACCGTCAG CCACGCCTTC CACTCCCCGC ACATGGACGG CATGCTCGCC
GACTTCCGGG AAGCCGCCAC CGCGGTGGAA CTACGCGCAC CGCGCATCCC GCTGGTCAGC
AACGTCACCG GGGCCCTCGC CACCACGGAC CAGCTGACCT CCCCGGACTA CTGGGTGCGG
CACGTCCGCG ACACCGTGCG GTTCGGCGCC GGGGTGGCCG CGCTGACCGC CGCCGGGGCC
ACCGGTTTCG TCGAGCTCGG ACCGGACGCG GTGCTCAGCG CCCTCGTCCC CGCGGCGGTA
CCGACCCTGC GCCGCGGCCG GCGGGAGGTC GCCACCCTGC TGGGCGCGCT GGCGGCCGCG
CACGTCCGGG GCGCCGAGGT GGACTGGGCC GGGCTCCTCG CCGAGCACGG CGGCCGCGCG
ACCGACCTGC CCACCTACGC CTTCCAGCGC AGCCGGCACT GGCTCCCGGA CGCACCCGCA
CCCGCACCCA CGAGCACCGT GCCCGCCAGC ACTACTGTTC CCGCCAGCAC GGCGGCCGAC
GGCACCGCGG CGCACGGCGT CACGGTGGAC GGCACCACGG CAGACGACGC CGGGATCGAC
CGGGCCACGG CGGACGACTC CGGGTCGGGC GACCGTCCCG GGGCCGCTCT CGCCGGGCTG
TCCGACGCGG AGCGGGCGCG GCGGCTGCGT GCCCTCGTCC TGGACCGGAC CGCGGCCGTG
CTCGACCACG GCTCGGCGGA CGACATCGAC CACCGCCGCA CCTTCCGCGA CCTCGGCTTC
GACTCCCTGG CGGGCGTCGA GCTGCGGGAC CGCCTCGCCG AGGCGACCGG CCTGACGCTG
CCGGCCGGCC TGGTGTACGA CCAGCCCACC GTCGACGCCC TCGTCACCCA CCTCGACGGC
CTGCTCACCG GCCGCGCGGC CGCCGCCGCG CCCAGGCGGC GGGAGGGCGC GGGTACGGAC
GACCCCATCG TCATCGTGTC GATGGCCTGC CGTCTGCCCG GCGGGGTCGC GTCTCCCGAG
CAGCTGTGGG AGCTGGTCGC CGCCGGCGGG GACGCCGTCG GCCCGTTCCC CACCGACCGC
GGCTGGGACC TCGACGGGCT CTACGACCCG GACCCCGACC AGCCCGGGAC CGTCTACACC
CGGCAGGGCG GGTTCCTGCA CGACGCCGGG GACTTCGACC CGGAGCTGTT CGGGATCTCC
CCCCGTGAGG CGACCGCGAT GGACCCGCAG CAGCGGCTGC TGCTGGAGAC GTCCTGGGAG
GCGTTCGAGC GGGCCGGGAT CGTCCCGGGC TCGCTGCGCG GCTCACGCAC CGGCGTGTTC
ACCGGCGCGA CGTCGATGGA CTACGGCCCG CGGCTGCACG AGCCGGCGGG CGGGGTCGAG
GGCTACCTCC TCACCGGCAC GACGACAAGC ATCGTGTCCG GGCGGGTGTC CTACACGTTC
GGGCTGGAGG GCCCGGCGGT CACCGTCGAC ACCGCCTGCT CCTCCTCCCT GGTCGCCTTG
CACCTCGCCG CCCAGGCACT GCGCAGCGGC GAATGCGACC TCGCCCTCGC CGGCGGCGTC
ACGGTGATGG CGACCCCGGG CATGTTCCTG GAGTTCAGCC GCCAGCGGGG GCTGGCCGCC
GACGGCCGGT GCAAGTCCTT CGCCAACGCC GCCGACGGGA CCGGCTGGTC CGAAGGCGCC
GGCATCCTCC TGCTCGAACG GCTCTCCGAC GCCCGGCGCA ACAACCACCC GGTCCTGGCG
CTGGTCCGCG GCACCGCGGT CAACCAGGAC GGGGCGAGCA ACGGCCTCAC CGCTCCGAAC
GGACCGTCGC AGGAGCGGGT GATCCGCCAG GCTCTCGCCA ACGCCGGGCT CAACCCGGCC
GACATCGATG CCGTCGAGGC GCACGGCACC GGCACCCGCC TCGGCGACCC CATCGAAGCC
CAGGCCCTGC TGGCGACCTA CGGGCAGGAC CGACCTGACG ACCAGCCCCT GTGGCTGGGG
AGCCTGAAGT CCAACATCGG CCACGCCCAG GCCGCTGCCG GGGTCGCCGG CATCGTCAAG
ATCGTCCAGG CCCTCCACAA CAGCGAACTC CCCCGCACCC TGCACGTCGA CGAACCCACC
CCCCACGTCG ACTGGCACAC CGGAAACGTC GCACTACTCA CCGAGAAGCA ACCCTGGGAA
CCAGGCACCC GGCCCCGCCG CGCCGCCGTC TCCTCCTTCG GCATCAGCGG CACCAACGCC
CACGTCATCC TCGAAGAACC CGGAGGACTG TCGGGCGGCA CAAGCCACGC CGGCGGGGAC
GGCGACGACG TCCCGGCGGA GGGGGCGCCG GCGTCAACCG CCGCGGCCCA GGGCGAGCCC
CGGGAGCCGG TCCAGCCACT GCCGTGGTTG TTGTCGGGTC ACAGCGAGCA GGCACTCCAA
GCCCAAGCCC GCGCGCTCCA GGCCTACCTC GCAGACCAGC CGGACACCCA CCCCGCCGGC
ATCGCCGCCA CCCTCACCCA CCACCGCACC CACCACACCC ACCGCGCCGT CATCCTCATC
GACACCAGCG CCAACCCCAG CCGGGGCAGC GCCGGCCCGG TCGACGGTCA CATCGCGGTT
CCAGCCGACG CTCTGGCGGC GTTGGCCGCT CTGGCCACCG GTGGCGACCA CCCCCACCTC
ACCCGCGGAC ACGTCCCCCC CACACCCCGC CTCGGCTACC TCTTCACCGG CCAAGGCAGC
CAACGCCCCG GCATGGGCCG CGAACTCTAC AAAGCATTCC CCGTCTTCAC CGCCGCCTTC
GACGAAACCG CCGCCGCCTT CACCCCCCAC CTCCAACACC CACTCCACGA CATCATCTTC
GCCGCACCCA ACACCCCCCA AGCCCACCTC CTCAACCAAA CCCACTACAC CCAACCCGCC
CTGTTCACAT TGCAAACCGC CCTCTACCAC CTCCTCACCC ACCACGGCCT CACCCCCGAC
TACCTCACCG GCCACTCCAT CGGCGAAATC ACCGCCGCCC ACCTCGCCGG CATCCTCACC
CTCCCCGACG CCGCCACCCT CGTCGCCACC CGCGGCCACC TCATGAACAC CGCCCCACCC
GGCGGCACCA TGATCGCCAT CGACACCACC GAAGAAGACA TCACCCCCCA CCTCACCCCC
ACCGTCTCCA TCGCCGCCAT CAACAGCCCC ACCACCCTCG TCATCGCCGG CGACCCCACC
GACACCCACC GCATCGCCAC CCACTACCAA AACAAAGGCA CCCGCACCCG CAAACTCACC
GTCAGCCACG CCTTCCACTC ACCCCACATG GACCCCATCC TCAACCAGCT CCACCACACC
CTCACCACCC TCACCCTCCA CCCACCCCAA ATCCCCATCA TCAGCACCCT CACCGGCACC
CTCGCCGACC ACACCATCAC CACCCCCGAC TACTGGACCC GCCACACCCG CCACACCGTC
CGCTACCACC AAGCCGTCCA AACCCTCACC ACCCTCGGCA CCACCCACCA CCTCGAACTC
GGACCCCACC CCACCCTCAC CACCCTCACC CCCCACACCA CCCCCACCCT CCGCACCAGC
CACCCCGAAA CACACACCCT CACCACCGCC CTCGCCACCA CCCACACCCA CGGCTTCCCC
ACCCACTGGC ACACCCCCAC CACCACCCCA CCCACCAACC TCCCCACCTA CCCCTTCCAA
CGACGGCACT TCTGGTTCCG GGCGCCGACG GCCTGGGCCC GGGAGGAGTA CCTGCTCGGC
GCCGCGGTGG AGCGCGCCGA CGACGGCGGC CTGCTGTTCA CCGGAACCCT CGACCTCGAC
CGCCACCCCT GGCTCGCCGA CCATGTCATC GGCGGCCGGC CGCTCGTGCC GGGCAGCCTG
TTCGTCGACC TCGCGCTGCG CGCCGGCCTG CGCGCCGACG CTCCCGTGCT CGACGAGCTG
ACCCTGCAGA CCCCGCTGCT CCTGCCGGAG CGCGGCACGC TGAGCGTGCA GGTCTCGGTC
GGCGGCGTGG ACGAGCACGG CCGCCGTGCG CTGACCGTGC ACTCCCGGCC GGGTGAGGAG
GACCCCTGGG CCCTGCACGC CACCGGCGCC CTGCGGCAGG AGGAGATCGC GGGCGCGGAC
AGCCCGGCGC CGGCCGCGGA TCCCTGGCCA CCGGCCGGGG CGGACCCGCT CGACGTGGCG
GACGTGTACC AGCGCCTGGG CGCGCTCGGC TACGACTACG GCGCGGGCCT GCGCAACGTG
ACGGCGGCCT GGCAGGCCGG GGACACCCTG CTCGCCGAGG TCCGCCTCGG CGCCGGCGCG
GACGTCCGCG AGAACACCGG CGAAACCGGC GACCCGGGTG ACTCGGGTGG CCCGGCCAGG
GCCGACGGGA CGGACGGGAC CGACGGGACC GACGGGACCG ACGGGGAGCC CTTCGCGCTG
CATCCGGCGC TACTCGACGC CGCCCTGCAC CTGCTGCCCC TGTACGGCGG GGCGGACGGG
GTGCGGGTGC CGTTCTCCTG GACCGGGGTC CGGCTCGCCG CCGTCGGCGC GACCACGGTG
CGGGTCCGGC TCACGCCGGC CGACGACGGC TCCGTCGCGG TTCTGCTCAC CGATCTCGAC
GGCCTGCCGG TGGCCTCGGC GCGTTCCCTG ACGCTGCGCG CCGTGACCGC GGACGCGTTC
GCTCCGGCGA CCGACGCCCT GTACACGCTG GAGTGGCGCC CGGTCGCGAC GGACACCGCC
ACGATCGACG CCGCCACGGC CGGCACCGCC GGAGGCGTTC CGGCGGAAGG CATCGAGCTA
CGGCGGGTCG CCGGCGGCCG GGGGGCCGCG GTCGTGGCCG GTGAGGTCCT GGCGTTGGTC
CAGGACGCGC TGGCGCGCGA CGCGGGAGCC GACGCGGGCC CGGGTTCCCG GCTGGCCGTC
GTCACCTCCG CCGCGGTCTG GACCGGCCCC GCCGACCTGG ACGTCGACCC GGCCGCGGCC
ACCGCCTGGG GCCTGCTGCG CAGTGCGATG TCGGAGCATC CCGACCGGTT CCTGCTGCTG
GACACCGACG GTGACGCCGC GTCGGAATCC GCCCTGACCG CCGCGTTGGC CACCGCGCTG
CGCACGGGTG AGAGCCAGCT GGCGCTGCGC GCCGGACGGC TGCTCGCACC CCGGCTGCAA
CGGCTGCCGG CAGGCTCGTC GCCCGCGGGC GACGAGCCGG CGGACGAGAC GCGACACGAG
CACGGCACCG CGCTGATCAC CGGGGCCACC GGGGCGCTCG GCGGGTTGAT CGCCGAACGG
CTGGTGACCC GACACGGGGT GCGCCACCTG CTGCTGGTCA GCCGCCGCGG CCCGGACGCA
CCGGGCGCCG ACCGCCTGCT CGCCCGGCTA CGGGAGCTGG GCGCGCACGC CCGCCTGGTC
GCCTGCGACA CGGCCGACCG GGACGCGCTG GCCGCGCTGC TCGCGACCAT CCCGGCCGAG
CAGCCGCTCA CCGCCGTCGT GCACGCCGCC GGGGTGCTCG ACGACGGCAC CATCGAGACC
CTCACCCCCG ACCGCCTCAC CACCACCCTC ACCCCCAAAG CCGACGCCGC CACCCACCTC
CACACCCTCA CCCACCACCA CCCCCTCCAC ACCTTCCTCC TCTTCTCCTC CATCACCGCC
ACCACCGGCA CCGCCGGCCA AGCCAACTAC GCCGCCGCCA ACGCCTACCT CGACGCCCTC
GCCCACCACC GCCACACCCA CCACCTCCCC GCCACCAGCA TCGCCTGGGG CCTCTGGCAC
CCCACCGAAA CCACCACCAC CGACGCCGAC ACCAACACCG ACGGCGGCAT GGCCGCCACC
CTGGCGCAGG CGGACCTGAA CCGGCTCGCC CGGGCCGGCA TCGCGCCGCT GCCCGCCGAG
CAGGCCCTGC GGCTGTTCGA CGAGGTCTTC GCCGGCGGGC ACGCCGACCG TCCCCTGCTG
GTGGCCTCCC GTTTCGACAC CCGCGCGCTC ACCGCGCCGG GAGCCGTCGT GCCGCCCCCG
CTGCGCTCGC TGGTGCGCGG CACCCCCCGC CGCGGTGACG GCCCGCGCGC GGCCAGCGCC
GGCCGGCCGG CGCTGGCGTC CCGGCTGGCC GGGGTGAGCA CCACCGAGGC GGACCGGCTG
CTGCTCGAGC TCGTCCGGGA GACGGTCGCG CTCGTCCTCG GGCACACCGA CACCGCCGCG
GTGCCCGAGG ACCGGGCCTT CACCGACCTC GGTTTCGACT CGTTGGCCGC CGTCGACCTG
CGCAACCGGC TCGGCACCGC CACCGGCCTG CGACTGCCCG CCACCGTCGT CTTCGACCAC
CCAACCCCGA ACGCGCTGGC GGGCCTGCTG CGCGAGGAGC TACTCGGCGC GGCGGCCGAC
GGCGCAACCC CAGCGACGGC CGCCGGCGCG GCCCCGGCGG CGGGCGACGA TCCGATCGTC
ATCGTGTCGA TGGCGTGCCG CCTGCCCGGC GGGGTCCGCT CGCCGGAGGA CCTGTGGCGA
CTGCTCGCCG ACGGCACGGA CGCGGTCTCC GGCTTCCCCA CCGACCGCGG CTGGGACCTC
GACGCGCTCT ACGACCCCGA CCCCGACCAC CTCGGCACCT CCTACGCCCG CGAGGGCGGG
TTCCTGCACG ACGCCGGGGA CTTCGACCCG GAGCTGTTCG GCATCAGCCC GCGGGAGGCG
ATGACGACCG ATCCGCAGCA GCGGCTGCTG CTGGAGACGT CCTGGGAGGC GTTCGAACGG
GCCGGCATCG CGCCCGACTC GCTGCGCGGC TCGCGCACCG GCGTGTTCGC CGGCGTCATG
TACAACGACT ACGGGGCCCG GCTGCACCAG TCCGGCACGC CTGCGGCGGG CTTCGAGGGG
TACCTGGTCA GCGGAAGCGC CGGCAGCGTC GCCTCAGGCC GGGTGTCCTA CACGTTCGGG
CTGGAGGGCC CGGCGGTCAC CGTCGACACC GCCTGCTCCT CCTCCCTCGT CGCCCTGCAC
CTCGCCGCCC AGGCCCTACG CAGCGGCGAA TGCGACCTCG CCCTCGCCGG CGGCGTCACC
GTCATGGCCA GCCCCGCCAC CTTCATCGAG TTCAGCCGCC AGCGCGGCCT CGCCGCCGAC
GGCCGGTGCA AACCCTTCGC CAACGCCGCC GACGGCACCG GCTGGTCCGA AGGCGCCGGC
ATCCTCCTGC TCGAACGGCT CTCCGACGCC CGCCGCAACA ACCACCCCGT CCTCGCCATC
GTCCGCGGCA CCGCCACCAA CCAGGACGGC GCCAGCAACG GCCTCACCGC CCCCAACGGA
CCCTCCCAAC AACGCGTCAT CCGCCAGGCC CTCACCAACG CCGGACTCAA CCCCGCCGAC
ATCGACGCCG TCGAAGCCCA CGGCACCGGC ACCCGCCTCG GCGACCCCAT CGAAGCCCAA
GCCCTCCTCG CCACCTACGG CCAGAACCGA CCCGACAACC AACCCCTCTG GCTCGGCAGC
CTCAAATCCA ACATCGGCCA CACCCAGGCC GCCGCCGGCG CCGCCGGCAT CATCAAAATC
ATCCAGGCCC TCCACCACAA CGAACTCCCC CGCACCCTCC ACGTCGACGA ACCCACCCCC
CACATCGACT GGCACACCGG AAACGTCGCA CTACTCACCG AGAAGCAACC CTGGGAACCA
GGCACCCGGC CCCGCCGCGC CGCCGTCTCC TCCTTCGGCA TCAGCGGCAC CAACGCCCAC
GTCATCCTCG AAGAACCCGG CCAGTCCACC GGGCCCGACG GAACCGGAGC ACCCGACGGG
AGCATCGGCG TCCCGCCGGT GTGGGTGCTG CGCGCGCACA GCGCGGCGGG GCTGCGAGCC
CACGCCGCGA AGCTGGGCGA CGATCTCGAA CACCGGCCGG CAGTGGCGCC GATCGACATC
GCGCGGCAGC TCGCCCGGGT CCAGGCGGGG CTGGAGTGGC GGGCCTCGTT CGTCGCGGCC
GAGCACGCGG ACGCGCTGCG CGCGCTGGAC CTGCTGGCCC GGGACGAGCC CGACCCTGGA
CGGGCGACCG GGCAGGCCCT CGGCACGCTG CGCACCGCGT TCCTGTTCAC CGGGCAGGGC
AGTCAGCGCC CCGGCATGGG CCGCGAACTG TACGAGGCGT TCCCGGCGTT CGCCACCGCG
TTCGACGAGG TGGCCGCGGC CTTCGCGCCG GGCCTAGACC AGCCGCTGCG CGACGTCGTG
TTCGCCGCGC CCGGCAGCCC GGAGGCGGCG CTGCTGGACT CGACCGCCTG GACGCAGCCG
GCGCTGTTCG CGCACGAGGT CGCCCTGTTC CGGCTGTTGG CGGGCTGGGG TGTGCGTCCC
GACGCGGTGC TGGGCCACTC CATCGGCGCG ATCGCCGCGG CGCACGCGGT CGGGGTGCTG
TCGCTGGCCG ACGCCGCGAC GCTGGTCGGC GCCCGCGCCC GGCTGATGGC CACGCTGCCG
GTCGGGGGCA CGATGGCGGC CCTGTCCGTC CCGGCCGACG AGGCGGCGGC CCTGCTCGCC
GACGTCGACG GGGCGGTGTC GCTCGCCGCG GTCAACGGCC CCCGGGCGAC CGTGGTGTCC
GGGGCGGAAC GGGCCGTCGC CGAGGTCGTC CGCCGGGCGG CGGCGGGTGG CGCCCGCACC
CGGCTGCTGT CGGTCAGCCA CGCGTTCCAC TCCCCGCTGA TCGAGCCGAT CCTCGACGAG
TTCCGGGACG TCGTCGCCGG CCTGGCCTTC GCCGCGCCGA CCGTCACGGT CGTCTCCGAC
CTCACCGGGG AGCCGGTCCC GGCCGACGTC CTGGCGACGC CCGAGTACTG GGTGCGCCAC
GCGCGGGAGA CCGTGCTGTT CGCGCCCGCC GTCCAGGCGC TGCGCGAATC GGGCATCCGC
GGGTTCCTGG AGATCGGCCC GGACACCGTC CTGGCCACGA TGGCCGCCGA CACACTCGCC
CAACCACCGG TCACGGCCGA GGTGGTCACG CTGGCCACCC AGCGGCGGGG CCGTCCCGAG
CCGGAGACGC TCGCCGCGGC GCTGGGCGCC CTGGACGCCG CGGGCGGGCG GGTCGACTGG
TCGGCCTACT TCGGCCCAGG GCCCGCGGTG GAGCTGCCCA CCTATCCGTT CCAGCGGTCC
CGCTACTGGC TCGACGTCAC CGCCTCCGGC CCGGGACCGG ACCTGGGTGC GGCCGGGCTG
GATCCGGCCG GGCACCCGTT GCTGGGCGCG GCGCTGGAAC TGGCCGACGG CTCCGGCTCG
GTGCTCACCG GACGGCTCTC GCTGGCCGAC CAGCCGTGGC TGGCCGACCA CCGGGTCGCC
GGGGCGGTGG TGGCGCCGGC GACGGCGCTG CTCGGCCTCG CCCTGCACGC GGCCCGGGTG
GCCGGGGCGG CCGCGCTGGA GGAGCTGACC CTGGCGGCAC CGCTGGTGCT GCCCGAGCAG
GGCAGCCTGG CACTGCAGGT CGTCGTCGGG ACAGCCGACG ACTCCGGCCG GACCACGCTG
CGCGTGCACT CCCGTCCGGA CGGGGCGCGC CAGCCCTGGA CCGTTCACGC CAGCGGGCTG
TTCGGCCCCG ACGCGGCCTC CGCGCCGCCC GCCCGGCCCG AACCGGACAC CTGGCCGCCG
GCGGACGCCC AACCACTCGG CGGCCCCGCC GACAGCGACA GCGACACCGA CACCGGCTAC
AACGGCTACG ACGATCTCGC CCGCCTCGGC TACGACTACG GGCCGATCTT CCAGGGCCTG
CGAGCCGGTT GGCGGCGCGG GAACGAGCTG TTCGCCGAAC TCGTGCCACC GGGCGACGCG
ACGTTCGGCG CGGCCCCGCA TCCCGCGTTG CTCGACGCGG CCATGCACCC GCTCGCCCTC
GCCGGCACCG TCGGCACCGA CCCTGACGGC ACCGGCACCG TCGGCACCGG CACCGTCGGC
ACCGACACCG TCGGCACCGC CGCCGGCCCG GCCGGTGCCG GCGGCATCCG GGTGCCGTTC
TCCTGGCACG GGGTGTGGAC GGCGACGTCC GGCGGCTGGG CCGGCCCGCT GCGGGTGCGC
ATCACCCCGG TCGGGCCGGA CACCGTGCGG CTGCTGCTCA CCGACACCGC GGACACCCCG
CTGCTGGCCG TGGACCGGCT CGTGGTGCGC GCGGTCGACC CGGCCCGGCT GGCCGGGGCA
CGCGGCACCG ACGGCGCGCT GCACCGCCTG GCCTGGGTCC CGCCGGTCGG CCCGGCACCC
GCTCGGGCGG TCGCCGCGGC GACCGCCCGA GAGCTGGCCG GCTGGGCGCA GGCCGAGCGG
CTGCTGGCCG ACGTCGCGGC CGGCGGTGCG GCGCCCGAGG CGGTGCTGCT GGACGTCGCC
GGTCCCACCG CCGACTCCGT GTCGCCGGCG GCGGCCCGGG AGATCCTGCT CGCCGCGCTG
CCGGTGCTGC GCGGCTGGCT CGCCCAGGAA CAACTGGCGG ACAGCCGGCT GGTGCTGCTG
ACCCGCGGTG CCGTCGCCGC CCGGGTCGGG GACCAGGTCG ACGGACTCGC CCAGGCCGCG
TTGTGGGGAC TCGTCCGCAC GGCGCAGACG GAGAACCCGG GCGTGTTCAC CCTGGTCGAC
ACCGACGGCA CCCCGTCCTC CGAACGGGCG CTGGGAGCCG CCGTCGCCAC CGGGGAGCCT
CAGGTCGCGG TGCGCGACGG GCAGCTCCTG CTGCCCCGGC TGGAACCGCT GCCGGCGCCA
CCCGGCGCGC CCGAGGCTCC TGTCACAGCC GACACGCCGC AGGAGCAGGA CGACAGCGCG
ACCGTCCCCG GACCGCTGTT CCCCTCGACC GGCACCGTGC TGGTGACCGG GGCGACCGGT
GCCCTGGGCG GCCTCGTCGC CCGCCGGCTG GTGGCCCGGC ACGGCGTCCG CCGGCTGCTG
CTGACCAGTC GGCGCGGCCC GGCCGCGCCC GGCGCCGACG ACCTGGTGGA GCGGCTGACC
GGACTGGGAG CCGAGGTGCG GCTCGTCGCC GGCGATCTCG CCGATCCCGA CGAACTGCGC
CGGCTGCTCG CGGAGGTTCC CGCCGCGCAT CCACTGTCCG CTGTCGTGCA CGTCGCCGGG
ATCACCGCCG ACGCGACGCT GGGGGCGCTG GACGCCGAAC GCCTCGACGA GGTGCTGCGC
CCCAAGGCGG AGGCGGCCTG GCTGCTGCAC GAGCTGACCC GGGAGGCCGG GCTGGCAGCC
TTCGTGCTGT TCTCCTCGGT CGCGGGCGTC GTCGGCAACG CCGGCCAGGC CAACTACGCC
GCCGCCAACT CCTACCTCGA CGCCCTCGCG GAGCATCGCC AGGCCCTGGG CCTGCCGGCC
GTGTCACTGG CCTGGGGGCT GTGGGAGCAG GCGGAGGGCA TGGGCGGCGC GCTGTCCGTG
GCCGACCGGG CCCGCATCGC ACGCGGCGGC ATCCGGGCGA TCAGCGAGCA GGAGGGGCTG
GATCTGTTCG ACGCCGCCGT GGCCGCCGGA GCACCGGTGG CGGTACCGGC CCGCTTCGAT
CTCGGCGCGC TGCGCGCCGC CGCCCGCGAG CAGCGGCTGG CCCCGCCGCT GTACGGACTG
CTGCCGGCCG GTTTCGCCGC CGCGGCGGCC GGGCCGGTGC CGGGCGGCGC CGCGGGCGGG
ACCGCGGCGT CGAACGGGAC CGCGGGGCCG AACGGGTCGT CCCCGCGGGA GCCGGGCCGG
GACGGGGCCA ACGGCTCCGG CCCGGCAGAC CCGCCGTGGC TGCGCCGGCT CCGCGAGGTG
TCGGCCGGCG AACGGCCCCG GGTGTCGCGG GAACTGGTCC GCGCCACGGT GGCCGAGGTC
CTCGGGCACC CGGCCAGCTA TCCGGTGCCG GTCGACCGGG GCCTGCTCGA CTTCGGTTTC
GACTCGCTCA CCGCGGTGGA GCTGCGCAAC CGGATCGGCG CGGCGACCGG TCTGCGGCTG
CCGCCGACGA TGCTGTTCGA CCATCCGACG GTCGAGACGC TCGCCCGGCA TCTGCTGGCC
GAGCTGGCCC GGGCGCTGCC GGGTGGGACC GCCGACGCGC TGTCCCGCCT TGACGAGCTG
GAAGCGGCCC TCGCCCAGCT GGACCCGCCG GACGGACCGG ACGGGCCAGA CGGGCCAGAC
GGGCTGACAG TTCCCGACGT ACCGGGCCGG TCCGACGGCG CAGGGCAGCC GCCCGCCGCC
GACCCCCTGC GGGACCAGCT CGCGGAACGG CTGTCCGCCC TGCTCGCCCG GATCGCGCCG
GGCCGGGCCG GGCGTCCGCT GCCCCTCGCC TCCGCGGTCG CCCCCGTCTT CGCCCCCGCG
CTCGGCACGG CCTCGGACGA CGAGCTCTTC GAGCTCATCG ACGAGCAACT CGGCACCGAC
TGA
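
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before any computation such as the GC content reported above. A minimal sketch, with hypothetical helper names `clean_sequence` and `gc_fraction`, applied here only to the first displayed line rather than the full gene:

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGTCAACG ATCAGGTCCC GCCCGGCAAC GGGAAGCCGT CCACCCCTGG GCAGCCGTCC"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_fraction(fragment), 2))
```

Passing the complete sequence text through the same two functions would give the whole-gene GC fraction for comparison with the reported 76%.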
 
Protein sequence
MVNDQVPPGN GKPSTPGQPS TPGQPSDPGR SPAAVVSPAT AVGAVAGGAG VPGEPVAVIG 
LAARLPEAPD VAAFWRLLER GGQAVGAPPA DRWSADAPPF GAFLDEVDRF DADFFGISPR
EAAATDPQQR LLLELGWEAI EDARIVPAAL AGSRTAVFTA AIWDDYAALH HQRGGPAAGR
HTMTGLGRGL LANRLSYLLG LTGPSLTVDA AQSSSLVAVH LAAESLRRGE AELAIVGGIN
LILAPASSAA SARFGALSPN GRCFTFDARA NGYVRGEGGV AVVLKPLARA LADGDPVRCV
LLGGAVNNDG GGAGLTVPDQ HAQEAVLRAA YQAAGLSPAR ARYVELHGTG TPVGDPVEAA
ALGAVIGTAR AGENAGPLLV GSAKTNVGHL EGAAGLVGLL KTALALAHRR LPASLNFERQ
NPAIPLDELG LRVVTEATDW PADDSAPIAG VSSFGMGGTN CHLVLAAPPV PADVIPADVV
PGTAPGGDLP LPWVLSGRGE PALRGQARRL LTLLEQDVTR LEQDETPLEQ DDAVRSVDVG
YSLAATRTHF EDRAVILAAD RPSRLAALRA LSQGDPATGL VVGQAPGGTG GSWAFLFTGQ
GSQRPGMGRE LYDAFPTYAA AFDELCAAFD PHLDRPLRDV VFAAEGSSEA ALLDSTRYTQ
PALFTVEVAL FRLVTGWGLR PALLAGHSIG ELAAAHAAEV LDLADAAALV AARGRLMGAL
PPGGAMVAVE ADEPEVLELL ADRDGRVAVA AVNGPRAVVL SGDEPATLEV AAALAEHGRR
TRRLTVSHAF HSPHMDGMLA DFREAATAVE LRAPRIPLVS NVTGALATTD QLTSPDYWVR
HVRDTVRFGA GVAALTAAGA TGFVELGPDA VLSALVPAAV PTLRRGRREV ATLLGALAAA
HVRGAEVDWA GLLAEHGGRA TDLPTYAFQR SRHWLPDAPA PAPTSTVPAS TTVPASTAAD
GTAAHGVTVD GTTADDAGID RATADDSGSG DRPGAALAGL SDAERARRLR ALVLDRTAAV
LDHGSADDID HRRTFRDLGF DSLAGVELRD RLAEATGLTL PAGLVYDQPT VDALVTHLDG
LLTGRAAAAA PRRREGAGTD DPIVIVSMAC RLPGGVASPE QLWELVAAGG DAVGPFPTDR
GWDLDGLYDP DPDQPGTVYT RQGGFLHDAG DFDPELFGIS PREATAMDPQ QRLLLETSWE
AFERAGIVPG SLRGSRTGVF TGATSMDYGP RLHEPAGGVE GYLLTGTTTS IVSGRVSYTF
GLEGPAVTVD TACSSSLVAL HLAAQALRSG ECDLALAGGV TVMATPGMFL EFSRQRGLAA
DGRCKSFANA ADGTGWSEGA GILLLERLSD ARRNNHPVLA LVRGTAVNQD GASNGLTAPN
GPSQERVIRQ ALANAGLNPA DIDAVEAHGT GTRLGDPIEA QALLATYGQD RPDDQPLWLG
SLKSNIGHAQ AAAGVAGIVK IVQALHNSEL PRTLHVDEPT PHVDWHTGNV ALLTEKQPWE
PGTRPRRAAV SSFGISGTNA HVILEEPGGL SGGTSHAGGD GDDVPAEGAP ASTAAAQGEP
REPVQPLPWL LSGHSEQALQ AQARALQAYL ADQPDTHPAG IAATLTHHRT HHTHRAVILI
DTSANPSRGS AGPVDGHIAV PADALAALAA LATGGDHPHL TRGHVPPTPR LGYLFTGQGS
QRPGMGRELY KAFPVFTAAF DETAAAFTPH LQHPLHDIIF AAPNTPQAHL LNQTHYTQPA
LFTLQTALYH LLTHHGLTPD YLTGHSIGEI TAAHLAGILT LPDAATLVAT RGHLMNTAPP
GGTMIAIDTT EEDITPHLTP TVSIAAINSP TTLVIAGDPT DTHRIATHYQ NKGTRTRKLT
VSHAFHSPHM DPILNQLHHT LTTLTLHPPQ IPIISTLTGT LADHTITTPD YWTRHTRHTV
RYHQAVQTLT TLGTTHHLEL GPHPTLTTLT PHTTPTLRTS HPETHTLTTA LATTHTHGFP
THWHTPTTTP PTNLPTYPFQ RRHFWFRAPT AWAREEYLLG AAVERADDGG LLFTGTLDLD
RHPWLADHVI GGRPLVPGSL FVDLALRAGL RADAPVLDEL TLQTPLLLPE RGTLSVQVSV
GGVDEHGRRA LTVHSRPGEE DPWALHATGA LRQEEIAGAD SPAPAADPWP PAGADPLDVA
DVYQRLGALG YDYGAGLRNV TAAWQAGDTL LAEVRLGAGA DVRENTGETG DPGDSGGPAR
ADGTDGTDGT DGTDGEPFAL HPALLDAALH LLPLYGGADG VRVPFSWTGV RLAAVGATTV
RVRLTPADDG SVAVLLTDLD GLPVASARSL TLRAVTADAF APATDALYTL EWRPVATDTA
TIDAATAGTA GGVPAEGIEL RRVAGGRGAA VVAGEVLALV QDALARDAGA DAGPGSRLAV
VTSAAVWTGP ADLDVDPAAA TAWGLLRSAM SEHPDRFLLL DTDGDAASES ALTAALATAL
RTGESQLALR AGRLLAPRLQ RLPAGSSPAG DEPADETRHE HGTALITGAT GALGGLIAER
LVTRHGVRHL LLVSRRGPDA PGADRLLARL RELGAHARLV ACDTADRDAL AALLATIPAE
QPLTAVVHAA GVLDDGTIET LTPDRLTTTL TPKADAATHL HTLTHHHPLH TFLLFSSITA
TTGTAGQANY AAANAYLDAL AHHRHTHHLP ATSIAWGLWH PTETTTTDAD TNTDGGMAAT
LAQADLNRLA RAGIAPLPAE QALRLFDEVF AGGHADRPLL VASRFDTRAL TAPGAVVPPP
LRSLVRGTPR RGDGPRAASA GRPALASRLA GVSTTEADRL LLELVRETVA LVLGHTDTAA
VPEDRAFTDL GFDSLAAVDL RNRLGTATGL RLPATVVFDH PTPNALAGLL REELLGAAAD
GATPATAAGA APAAGDDPIV IVSMACRLPG GVRSPEDLWR LLADGTDAVS GFPTDRGWDL
DALYDPDPDH LGTSYAREGG FLHDAGDFDP ELFGISPREA MTTDPQQRLL LETSWEAFER
AGIAPDSLRG SRTGVFAGVM YNDYGARLHQ SGTPAAGFEG YLVSGSAGSV ASGRVSYTFG
LEGPAVTVDT ACSSSLVALH LAAQALRSGE CDLALAGGVT VMASPATFIE FSRQRGLAAD
GRCKPFANAA DGTGWSEGAG ILLLERLSDA RRNNHPVLAI VRGTATNQDG ASNGLTAPNG
PSQQRVIRQA LTNAGLNPAD IDAVEAHGTG TRLGDPIEAQ ALLATYGQNR PDNQPLWLGS
LKSNIGHTQA AAGAAGIIKI IQALHHNELP RTLHVDEPTP HIDWHTGNVA LLTEKQPWEP
GTRPRRAAVS SFGISGTNAH VILEEPGQST GPDGTGAPDG SIGVPPVWVL RAHSAAGLRA
HAAKLGDDLE HRPAVAPIDI ARQLARVQAG LEWRASFVAA EHADALRALD LLARDEPDPG
RATGQALGTL RTAFLFTGQG SQRPGMGREL YEAFPAFATA FDEVAAAFAP GLDQPLRDVV
FAAPGSPEAA LLDSTAWTQP ALFAHEVALF RLLAGWGVRP DAVLGHSIGA IAAAHAVGVL
SLADAATLVG ARARLMATLP VGGTMAALSV PADEAAALLA DVDGAVSLAA VNGPRATVVS
GAERAVAEVV RRAAAGGART RLLSVSHAFH SPLIEPILDE FRDVVAGLAF AAPTVTVVSD
LTGEPVPADV LATPEYWVRH ARETVLFAPA VQALRESGIR GFLEIGPDTV LATMAADTLA
QPPVTAEVVT LATQRRGRPE PETLAAALGA LDAAGGRVDW SAYFGPGPAV ELPTYPFQRS
RYWLDVTASG PGPDLGAAGL DPAGHPLLGA ALELADGSGS VLTGRLSLAD QPWLADHRVA
GAVVAPATAL LGLALHAARV AGAAALEELT LAAPLVLPEQ GSLALQVVVG TADDSGRTTL
RVHSRPDGAR QPWTVHASGL FGPDAASAPP ARPEPDTWPP ADAQPLGGPA DSDSDTDTGY
NGYDDLARLG YDYGPIFQGL RAGWRRGNEL FAELVPPGDA TFGAAPHPAL LDAAMHPLAL
AGTVGTDPDG TGTVGTGTVG TDTVGTAAGP AGAGGIRVPF SWHGVWTATS GGWAGPLRVR
ITPVGPDTVR LLLTDTADTP LLAVDRLVVR AVDPARLAGA RGTDGALHRL AWVPPVGPAP
ARAVAAATAR ELAGWAQAER LLADVAAGGA APEAVLLDVA GPTADSVSPA AAREILLAAL
PVLRGWLAQE QLADSRLVLL TRGAVAARVG DQVDGLAQAA LWGLVRTAQT ENPGVFTLVD
TDGTPSSERA LGAAVATGEP QVAVRDGQLL LPRLEPLPAP PGAPEAPVTA DTPQEQDDSA
TVPGPLFPST GTVLVTGATG ALGGLVARRL VARHGVRRLL LTSRRGPAAP GADDLVERLT
GLGAEVRLVA GDLADPDELR RLLAEVPAAH PLSAVVHVAG ITADATLGAL DAERLDEVLR
PKAEAAWLLH ELTREAGLAA FVLFSSVAGV VGNAGQANYA AANSYLDALA EHRQALGLPA
VSLAWGLWEQ AEGMGGALSV ADRARIARGG IRAISEQEGL DLFDAAVAAG APVAVPARFD
LGALRAAARE QRLAPPLYGL LPAGFAAAAA GPVPGGAAGG TAASNGTAGP NGSSPREPGR
DGANGSGPAD PPWLRRLREV SAGERPRVSR ELVRATVAEV LGHPASYPVP VDRGLLDFGF
DSLTAVELRN RIGAATGLRL PPTMLFDHPT VETLARHLLA ELARALPGGT ADALSRLDEL
EAALAQLDPP DGPDGPDGPD GLTVPDVPGR SDGAGQPPAA DPLRDQLAER LSALLARIAP
GRAGRPLPLA SAVAPVFAPA LGTASDDELF ELIDEQLGTD
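
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first and last few codons only. It uses the standard codon assignments, which translation table 11 shares for internal codons; it does not model table 11's alternative start codons, and the names `CODON_TABLE` and `translate` are hypothetical:

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGTCAACG ATCAGGTCCC GCCCGGCAAC"))  # MVNDQVPPGN
print(translate("CTCGGCACCGAC TGA"))                  # LGTD*
```

The two outputs match the first block (MVNDQVPPGN) and the final residues (LGTD) of the protein sequence, with the trailing `*` corresponding to the terminal TGA stop codon.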