Gene Franean1_3472 details


Gene Information

Locus tag: Franean1_3472
Symbol:
ID: 5671843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4104327
End bp: 4116662
Gene Length: 12336 bp
Protein Length: 4111 aa
Translation table: 11
GC content: 74%
IMG OID: 641242360
Product: Beta-ketoacyl synthase
Protein accession: YP_001507780
Protein GI: 158315272
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
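
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4104327
END_BP = 4116662
LISTED_GENE_LENGTH_BP = 12336
LISTED_PROTEIN_LENGTH_AA = 4111


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
    print("matches record:",
          length == LISTED_GENE_LENGTH_BP
          and protein_length(length) == LISTED_PROTEIN_LENGTH_AA)
```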


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0447934
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACG AGACGACCGC CGGCACCGAG AGCCGCCTGC GGGACTACCT GAAGCGGGTG 
ACGAGCGATC TGCTCAGCAC CCGGGCGCGG CTGTCCGAGC TGGAGGCCGA GCGCACGGAG
CCGATCGCGA TCGTCGGGAC GGCCTGCCGT TACCCCGGCG GGGTCCGCAC ACCGGAGGAC
CTGTGGCGGG TGGCGCGGTC GGGCACCGAC GCGATCTCGT CCTTCCCGGA GAACCGCGGC
TGGGACGTGC CCGGCCTGTT CGACCCGGAC CCGGAGCGGG TGGGCCACAC CTACGCGCGG
GAGGGCGGGT TCCTGCACGA CGCCGACCTG TTCGACCCGG CGTTCTTCGG GATCTCCCCG
CGGGAGGCGA CCGCGATGGA CCCGCAGCAG CGACTGCTGC TGGAGACGTC CTGGGAGGCG
TTCGAACGGG CCGGCATCGC ACCGGACTCG CTGCGCGGCT CACGCACCGG CGTGTTCGCC
GGGGTGATGT ACAGCGACTA CGGCGGCAGG ATCAGGCAGG CACCGGACGG CCTGGAGGGG
TACATCGGCA TCGGCAGCGC CGGGTCCGTG GCCTCCGGGC GGATCTCCTA CACGTTCGGG
CTGGAGGGCC CGGCGGTCAC GGTCGACACC GCCTGCTCCT CCTCCCTGGT CGCCCTGCAC
CTGGCGGTGC AGTCGCTGCG CCGCGGCGAA TGCGACCTGG CCCTGGCCGG CGGCGCCACC
GTGATCGCGA CCCCCGGGCT GTTCGTCGAG TTCAGCCGGC AGCGCGGACT CTCCCCGGAC
GGGCGGTGCA AGGCGTTCGC CGCCACCGCC GACGGCACCG GCTGGGGTGA GGGGGTGGGG
CTGCTGCTGG TGGAACGGCT CTCCGACGCC CGCCGCAACA ACCACCCGGT GCTCGCGGTC
GTGCGCGGCT CCGCCGTGAA CCAGGACGGC ACCAGCGGCC AGCTCTCGGC ACCGAACGGC
CCGGCCCAGC AGCGGGTGAT CCGGCAGGCC CTCGCGGACG CCGGCCTGAA CACAGCCGAT
GTGGACGCCG TCGAGGCGCA CGGCACCGGC ACCCGCCTCG GCGACCCCAT CGAAGCCCAG
GCCCTCCTCG CCACCTACGG CCAGAACCGA CCCGACGACC AGCCCCTGTG GCTTGGGAGC
CTGAAGTCCA ACATCGGCCA CACCCAGGCC GCCGCCGGCG CCGCCGGCAT CATAAAAATC
ATCCAGGCCC TTCATCACGA CGAGTTGCCG CGCACTCTGC ACGTCGACGA ACCCACCCCG
CACGTCGACT GGGACAGCGG AAACGTCCGA CTGCTCACCG AGAACCGGCC CTGGGAATCC
CGCGACACCC CCCGCCGCGC CGCCGTCTCC TCCTTCGGCA TCAGCGGCAC CAACGCCCAC
GTGATTCTCG AGGAAGCCGC CCGGGAGACC GACGACAACT CCTCGGGTGA GGCCGGTTCA
GCGCCGGCCG AAACGGCTCC GACGGCCGCC GCCCCGGTGG AGCGGGAAAC GCCGGTTCCG
ATTCTTTTGT CGGGGCACAC CGAACAAGCC CTCCACGACC AGGCCACAAA CCTCAACACC
TACCTCACCC ACCACCCGAA CACCACCCCC CACCAGCTCA CCCACACCCT CACCCACGGC
AGAACCCACC ACCAACACCG CGCCGCCATC ATCACCAGCG CCACCAACAC CACCGAACTC
CGCGACACCC TCACCGCCCT CACCCACCAC CAACCACACC CCAACCTCAC CCAAAGCCAC
ACCACCACCC CCGGCAAAAC CGTCTTCGTC TTCCCCGGCC AAGGCAGCCA ATGGCACGGC
ATGGCCCGCC ACCTCTACAC CACCTCACCC ACCTTCGCCC ACCACCTCAC CACCGCCACC
AACGCCCTCA ACCCCCACCT CGACTACGAC CTCCTCCACA CCCTCACCTC ACCCACCCCA
CCACCCAACA CCGTCACCTA CGTCCAACCC GCCCTCTTCG CCGTCATGAC CAGCCTCGCC
CAACTCTGGC AACACCACGG CATCCACCCC GACACCGTCC TCGGCCACAG CCAAGGCGAA
ATCGCCGCCG CCCACATCGC CGGCGCCCTC ACCCTCACCG ACGCCGCCAC CATCGTCGCC
CTGCGCGCCA CCGCCCTCAC CACCCTCACC GGCACCGGCA CCATGGCCTC CATCACCCTG
CCCGCCCACG CCCTCCAACC CCTCCTCGAC ACCCACCCCG ACCTCCACAT CGCCGCCCAC
AACAGCCCCA CCCACACCAT CGTCGCCGGC AACCCCCAAT CCATCACCGA ACTCCTCGAA
CACTGCCAGC AAAACAACAT CCAAGCCCGC CAGATCCCCG TCGACTACGC CTCCCACACC
CCCCACATCG AACCCCTCCA CACCACCCTC ACCACCGCAC TCGCCCACAT CACCCCCACC
CCCGCCACCA TCCCCTTCTA CTCCACCCTC ACCAACACCT ACCTCGACAC CACCCAACTC
ACCCCCGAAT ACTGGTACCA AAACCTCAGA AACCCCGTCC AATTCCACCA AGCCATCACC
ACCCTCCACC ACAACGGCCA CACCACCTAC ATCGAAACCA GCCCCCACCC CGTCCTCACC
ACCACCATCA CCGAAACCCT CGAAAACCCA CCCACCAACA CAGACAAGAA CAGCACCCAA
AACACCAACC AGAACGGCAC CGCGGAACAC CCCACCGAAC GACCCGCCAT CACCGTCACC
GGAACCCTCC GCCGCGACCA CGGCACCCTC CACACCTTCC ACACCGCCCT CGCCCACCTC
CACACCCACG GCCACACCCC CACCTGGCAC ACACCCCCAC CCCCCACACC CCCCACACCC
CTGCCCACCT ACCCCTTCCA ACACCACCGC TACTGGCTGG AGGACGCGAC CCCTCCCGGC
GACGCGGACG GCCTCGGCCT GCAGGCGACC GGGCACCCGG TGCTGCGCGC GGCCACCACA
CTCGGCAACG GCCAGGGCCT GCTGCTCACC GGTCGGGTCT CCGCCCGCAC CCACAGCCTC
CTGGCCGGGC ACACCGTCGC CGGCACGGCT GTTCTGCCGC CGGCCGCGAC CCTGGACCTG
GCCTTCCACG CGGCCAGGCT GCTCGGCGGA CTCGACGTCG AGGAGCTGAC GATCAGCACA
CCCGTCGTGC TGCCGGCCGC GGACGGGGTC GACCTGCAGC TCATCGTCGA GGCCGAGCGG
GACGACTCCC GGCGCCCCTT CACCGTGCGC GCCCGCCCCG ACCAGCCGTC CGCTCGGGCC
GCCGACGGCA CCGCCGCGGC CGGGGAGGAC GACGACCTCG GCGAGCAGCT CAACCGCCCC
TGGACGACAC ACGCCGTCGG CGTGCTGGCA CCGGCGGTGC CGGTGCTGCC CGGGGCCACC
CCGGCGCCGG AGAGCTGGCC GCCCGCGGGT GCCGAGGAGA TCGCGCTGGA CGATCTCGAC
GCCCATCTCA CCACGCTCGG CCTCGAGCCC GGCCCGGCCC TGGACGGCCC GCACCGGCTG
TGGCAACGCG ACGAGGAGCT GTTCGCCGAG GTCGAGCTCG CCGAGGACGG CGCGGGCGAG
CCGGGCCGCT TCGGCCTGCA CCCGGCCGCG CTGACCGCCG CGTTGAGCCC CCTGCAGGCC
GGCCTCGGCC CTGCCGCCGA CCACGCCCGT CAGGCCAGTG GCGACACCGC CGGGAACGGC
TCCGGAGCGC TGCCGCTGAC CTGGCGCGGA GCGACTCTGC ACACCGCGAG CTCCTCCCCG
CTGCGGGTAC GCCTCGCGCC GGGCGCTCCG GACCCGGCCG GGCCGGGCCG CGGCTGGGCG
GTGGATCTGT TCACCCCCTC CGGCACCGCC GTCGCGAGCA TCGCGGACGT ACGGCTGCGT
CCCCTTGATC CCCGGGACCT CGGCGCCCCC ACCGGCGGCC TGGAGGCGGC CCGGGCGGAC
TGGGTACTCA CCCTGGACTG GCCGGTGCTC GAGCCCCCGG TTCGGGAGTC CGCCACAGCG
CAGCGCTGGT CGCCCCATCT TCTCCTCCCC ACCGCCGACG AGGACCGCTC CGCCGTGGCG
GACCTGCTCG CACGGCTCAC CCGGGTCCTG GGCGGCGCGG ACGCCCCGGC CGGCGGTTCG
GGCGACGACC CGACCGGTAT CCCGGCTGAC GACACGGCCC GGGAGCCGCT GGGCCCCGAG
GTGACCCTGC TCCCCGTCAG CAGCACGGGG GCGGCGTTCG ACGCGCTGCG TTCCTGGCTG
ACCGACCCGG TGACCGAGGG CGGCCGCCTG CTCGTGCTGA CCACCCGCGC CGTCGCCACC
GGGCCGGACG ACACACCGGA CCCGCCGGGC CTGGAGGCCG CCGCGGTCTG GGGGCTCGTC
CGCGCCGCCC AGACGGAGCA CCCGGGCCGG ATCACCATCG TCGACCTCGA CGGCCAGGAC
GCCTCGCTCG CCGCGCTGCC CGCCGCGCTC GCCACCGGCG AGCCCCAGCT GGCCCTGCGC
GGCGGGGTAG CCCACCGCCC CCGCCTCACC CGCGTGGCCC CCGCGGACCT CACCATCCCC
GCCGACGGGA CCACCGACGG GACTGTCTCC ACCGAGGGGA CCGTCCCCGC CGGCGACACC
GTGTTCATCG GCGACACCGT GCTCGTGGCC GGCGGCGGCC CGCTGGCGGG GGTGCTCGCG
CGGCGTCTCG CCGCGGAGGA GTACGGGGCT CGCCTGCTGC TGCTCGGCGC CGACGACGGC
CTGGCCGAGG AGCTGGCCGC GCTGGGCGCG GACGTGCTCG TGGCCGAGGG CGACGCGAGC
GACCGGGAGG CGCTCACCGC CGCGCTCGCT CTGGTGAGCG ACGCGAGCCC GGTGCGCGCG
GTGCTGCACG TGGCGGCCGC GCCCCCGGTC GCCGCGCTGG AGACGACGAG GACGGCGACG
TTCACCGCGG CGGTGGACAC GGCGACCGCC GCGGCCCGCA ACCTGCACGA GGCCACCCTG
GGCCTCGACC TGGACTGGCT GGTTCTCGTC GCCCCGGATG TCGCCGGCAC CCTCGGCGGC
GTCGGGACGG CCGCCCGGGC CGCGATCGGG GCCGTCCTCG ACGGGCTCGC GCGGCAGCGA
CGCTCGCTCG GGCTCGTGGG GGTCTCGCTG GCGCTCGGTC CGCGCGCGAC CGGCGACACC
GCCAGCACCA CCAGCACCGG CGTCACCGGT GACGCCGGGG ACGCAGGGGA CGCAGGGGCT
GCCGCGGGTC TGGTGCCGTT CGACTCCGCC CAGGCGGGTG ACCTGTTCAC ACTGGTGCGG
CAGGCCCGGC CGGCCGCGCT GGTCGCCGCG CGGCTGGACC ACGCCGCGCT GCGCCCGGCC
GCCGCGGCCG GTCTGCTGCC CGCAGTGCTG CGTTCGCTGG TGCGCGGTGC GCGCACGCCC
GCGGCCGGCC AGGGGGCGGG CCGGCTGGTC GCCTCGCTGG TCGGCCGGCC CGAGGATGAG
CAGCGGGCGG TGCTGCTCGG GCTGATCCGG GCCACCGCGG CCGCCATCCT CGGCCACACC
GACGGGGCGG GGGTCGACGT CACCCGGGCC TTCAAGGATC TCGGGTTCGA CTCGCTCACC
GCGGTGGAGC TGCGCAACCG GCTGGCCGCG GCGACCGGGC TGAGCCTGCC GAGCACGCTG
CTGTTCGACT ATCCCTCCGG GGAGGTGCTC GCCGACCACC TGCGCTCGCA GCTGCTGGGC
CTGGTCCCCG AGGACGGGCG GTCCGTGGCC CGGGGGCGGG CCGGGGACGA CGGTGAGCCG
ATCGCGATCG TGGCGATGGC CTGCCGTTAC CCCGGTGGGG TACGTACGCC GGAGGACCTG
TGGCGGGTGG TCGAGCAAGA GCTCGACGTC ATCTCGCCGT TCCCGGACAA CCGCGGGTGG
GATCTGGACG TCCTGTTCGA TCCGGACCCC GCGCACCCCG GCACCTCCTA CGCGCGGGAG
GGCGGGTTCC TGCACGACGT CGACCAGTTC GACCCGGCGT TCTTCGGGAT CTCCCCGCGT
GAGGCGACGG CGATGGACCC GCAGCAGCGG CTGCTGCTGG AGACGTCCTG GGAGGCGTTC
GAACGGGCCG GCATCGCCCC CGACTCGCTG CGCGGCTCAC GCACCGGCGT GTTCGCGGGC
GTCGTCTACA CCGACTACGG CTCCCGGGTG CGGCTGCCTG CCGACATGGA GGGCTACCTG
GGCATCGGCA GCGCGGGCAG CATCGCCTCC GGGCGCATCG CCTACACCCT GGGGCTGGAG
GGCCCGGCGG TCACCGTCGA CACCGCCTGC TCCTCCTCCC TGGTCGCCCT GCACCTGGCG
GTGCAGTCGC TGCGCCGTGG CGAGTGCGAC CTGGCCTTGG CCGGCGGCGC GACCACGCTG
GCCAACCCGG ACATCTTCGT CGGTTTCAGC CGGCAGCGCG GGCTGGCTCC GGACTCGCGC
TGCAAGCCGT TCGCCGCGGC CGCCGACGGC ACGGCGTTCG GTGAGGGGGT GGGGCTGCTG
CTCGTGGAGC GGCTCTCCGA CGCCCGCCGC AACAACCACC CGATCCTCGC GCTGGTCCGC
GGCACGGCGA CCAACCAGGA CGGCGCCAGC AACGGCCTCA CCGCACCCAA CGGCCCCTCC
CAGCAGCGGG TGATCCGCCA GGCCCTCACC GACGCCGGCC TCAACCCCGC CGACGTCGAC
GCCGTCGAGG CGCACGGCAC CGGCACCCGC CTCGGCGACC CCATCGAAGC CCAGGCCATC
CTCGCCACCT ACGGGCAGGA CCGGCCCGAC GATCAGCCGC TCTGGCTGGG CAGCCTCAAG
TCCAACATCG GCCACACCCA GGCGGCCGCC GGGGTCGGCA GCATCATCAA GATCATCCAG
GCGTTCCACC ACGGCGAGCT GCCGCGCACC CTGCATGTCG ACGAACCCAC CCCGCACGTC
GACTGGCAGG CCGGAAACGT CGCCCTGCTC ACCGAGAAGC GGCCCTGGGA ACCGGGAGAC
CGGCCCCGCC GCGCCGGTGT CTCCGGATTC GGCATGAGCG GCACCAACGC CCACGTGATT
CTCGAGGAGG CACCGGCCGC GACACCGGCG GAGGAAACGA CGGACGGCCC GGTTCCGATT
CTTTTGTCGG GGCACACCGA ACAAGCCCTC CACGACCAGG CCACAAACCT CAACACCTAC
CTCACCCACC ACCCGAACAC CACCCCCCAC CAGCTCACCC ACACCCTCAC CCACGGCAGA
ACCCACCACC AACACCGCGC CGCCATCATC ACCAGCGCCA CCAACACCAC CGAACTCCGC
GACACCCTCA CCGCCCTCAC CCACCACCAA CCACACCCCA ACCTCACCCA AAGCCACACC
ACCACCCCCG GCAAAACCGT CTTCGTCTTC CCCGGCCAAG GCAGCCAATG GCACGGCATG
GCCCGCCACC TCTACACCAC CTCACCCACC TTCGCCCACC ACCTCACCAC CGCCACCAAC
GCCCTCAACC CCCACCTCGA CTACGACCTC CTCCACACCC TCACCTCACC CACCCCACCA
CCCAACACCG TCACCTACGT CCAACCCGCC CTCTTCGCCG TCATGACCAG CCTCGCCCAA
CTCTGGCAAC ACCACGGCAT CCACCCCGAC ACCGTCCTCG GCCACAGCCA AGGCGAAATC
GCCGCCGCCC ACATCGCCGG CGCCCTCACC CTCACCGACG CCGCCACCAT CGTCGCCCTG
CGCGCCACCG CCCTCACCAC CCTCACCGGC ACCGGCACCA TGGCCTCCAT CACCCTGCCC
GCCCACGCCC TCCAACCCCT CCTCGACACC CACCCCGACC TCCACATCGC CGCCCACAAC
AGCCCCACCC ACACCATCGT CGCCGGCAAC CCCCAATCCA TCACCGAACT CCTCGAACAC
TGCCAGCAAA ACAACATCCA AGCCCGCCAG ATCCCCGTCG ACTACGCCTC CCACACCCCC
CACATCGAAC CCCTCCACAC CACCCTCACC ACCGCACTCG CCCACATCAC CCCCACCCCC
GCCACCATCC CCTTCTACTC CACCCTCACC AACACCTACC TCGACACCAC CCAACTCACC
CCCGAATACT GGTACCAAAA CCTCAGAAAC CCCGTCCAAT TCCACCAAGC CATCACCACC
CTCCACCACA ACGGCCACAC CACCTACATC GAAACCAGCC CCCACCCCGT CCTCACCACC
ACCATCACCG AAACCCTCGA AAACCCACCC ACCAACACAG ACAAGAACAG CACCCAAAAC
ACCAACCAGA ACGGCACCGC GGAACACCCC ACCGAACGAC CCGCCATCAC CGTCACCGGA
ACCCTCCGCC GCGACCACGG CACCCTCCAC ACCTTCCACA CCGCCCTCGC CCACCTCCAC
ACCCACGGCC ACACCCCCAC CTGGCACACA CCCCCACCCC CCACACCCCC CACACCCCTG
CCCACCTACC CCTTCCAACA CCACCGCTAC TGGCTGGAAG GACCCGCGAT CGGGCCGGAC
GGCGACGGGG ACACCGCGGG GCACGGCTTC GTCAGCGCCG TCACCGAGCT CGCAGACGGC
GACGGCCTGC TGCTCACCGG CCGGCTCTCC CTGCGCTCCC ACCCCTGGCT CGCCGACCAC
GCGGTGCGCG GAACGGTGCT GCTGCCGGCC ACCGCGCAGC TCGAACTCGT CTTCCAGGCG
GCCCTGCACA CCGGCGCCGC CGGGATCGAG GAGCTGACGC TGGAGGCGCC GCTGCTCCTG
CCGGAGCGGG GCGGCGTGCG GCTGCAGGCC CGGGTCGAGG CCGCGGACGA CCAGGGCCGG
CGCCGGGTGA CGGTCCACTC CCGCCCGGGC GCCGAGGACC CCTGGACCCG CCATGCCGCG
GGCGTGCTGG CCGCGGCGGC GCCGACAGCC ATCCCGCCGC GGGGCGTCAG CGCCTGGCCG
CCGCCGCAGG CCACCCCGGT ATCGGTCGAG GGCCTCTACG ACCAGCTCGC CGAGCTCGGC
TACGAGTACG GGCCGGCCTT CCAGAATCTG CGCGCGGCCT GGCGCGACGG TGACACCGTG
TACGCGTCGG TCGTCCTCGG GCCCGAGCAC CACGTCGACG CGGCGCGGTT CGCGGTGCAC
CCGGCGCTGT TCGACGCGGC GCTGCACGCG ATGGGCCTGG GTGGCTTCCT GGGCTCCGGG
GTGCTGCTGC CGTTCGCCTG GTCCGGGATC ACGCTGGCGG CGGCCGGCGC CACCGAGCTG
CGGGTCACCG TCGCGCCGGG GCCCGGTGAG CGGGACACCG TGACGGTGAC GCTGGCCGAC
CAGACGGGGG CACCGGTCGC CCGGGTGGAC CAGCTGACCC TGCGGCCGGT GGACCCGGCG
AGCCTGCCGT CGGCGCCCGG CCGGGGCGGC TCCGAGGGGC TGTACGAGCT CGAATGGCAG
GAGCTCCCGC CGCTCGCGGC GGTCGCGCCG CCGCCCTACC GGCTGGCCGC GGCGCCGGGC
GGGCCGTCCG CCGGCGCTCC CGGCGGGGAG CCGCGGTCGG ACCTGAGCTC CCACCTGGAG
CTGGACCGGG AGCTCGACCT TCAGCTGGCG CTCGGCGCCC CCGGCGGCGG CGAGGCGGAG
CTGACCCTGC TGCCCTGGTG GGCCCCGCCC GGCCCGGCGG ACGCCGCCGC GGGCGCAGGC
GTGGACACGG ACGCGGACCG CGATCCCAGC CCCGACCTGC CCGGCACGGA CCTGCCCGGC
GCCGTCGGCG CCGGGGTGCT GGCGCTGCTG GGCCGGGTGC GCGACTGGCT CGGCCAGGAC
GACCGGCCCC AGGCGCGGCT GGTGGTGCTC ACCCGGCGCG CAGTGGCCGC CCGCCCCGGC
GACCCGACAG AGCTCACCGC CGCGCCACTG TGGGGCCTGC TGCGCTCCGC GGCGACGGAG
AACCCGGACC GGATCGTCAT CGCCGACGTC GACGGCACCG CCGCGTCGCT GGCGGCGCTG
CCCGGCGCGC TCGCCACCGG CGAGCCCCAG TTCGCCCTGC GCGACGGGGT GGCGCTGGTG
CCCCGGCTGG TCCGCCGCGC CGCCGGCGGG GCGCTCACAC CGCCGCCGGA CGAGCCGGCC
TGGCGGGTGG AGACACCCGG CGGGTCACCC GACGACCTGT TCCTCGCCCC GTTCCCGGCG
GCCCGCGCTC CGCTGGGGGC CGGGCAGGTC CGGCTGGCGG TGCGCGCCGC CGGGCTGAAC
TTCCGGGACG TGCTCACCGC TCTCGGCATG GTGCCCGTGG GCGCCCCGCT GGGCACCGAG
GCCGCCGGAG TGGTGCTGGA GGTCGGCCCG CAGGCAAGCG GGCTCTCCCC GGGGCTCTCC
GTCGGGGACC GGGTGCTCGG GCTGGTGCCC GGCGCCCTCG GCCCGCTGGC GGTGGCGGAC
GCGCGGCTGC TCGCGCCGAT CCCCGCCGGC TGGTCGTTCG CCGAGGCCGC CGGAGTGCCG
GCGGTGTTCA CCACCGCCTA CTACAGCCTG GTGGAGCTGG CCCGGGTCCG CCCGGGTGAG
CGGGTGCTGG TCCACGCGGC GACCGGCGGG GTGGGCCTGG CGGCCCTGCA GGTGGCCCGC
CACCTGGGCG CGGAGGTCTT CGCGACCGCC AGCCCGGCGA AGTGGGGGGC GCTGCGGGCC
CTGGGCGTGC CGGCGGAGCG GATCGCCTCC TCGCGCACCA CCGACTTCGA GGCGGCGTTC
CTGGACGCCA CCGGCGGCGC GGGGGTCGAC GTCGTGCTCA ACTCGCTCAC CGGCCCGTTC
CTGGACGCCT CGCTGCGGCT GCTGCCCCAG GGCGGGCGTT TCGTGGAGAT GGGCATCGCG
GACCCGCGGG ACCCCGCCGA GGTGGCCGCC ACCCACCCGG GCGTGCGTTA CCAGGCGTTC
GAGCTGCTCG ACATGGATCC CGACCAGGTG GGACGGTCGC TGGCCGGTGC CCTCGACCTG
CTCGCCGGCG GGCCGCTGCG CCCGCTGCCG GTGACCACCT GGGACGTCCG CTCGGCACCG
GCCGCGTTCC GGCACTTCTC GAAGGCCCGG CAGGTCGGCA AGGTCGTGCT CACCATCCCG
GCCGCGGTCG GGGAGGGGGA ACGGCCCGAC CTGGGCGAAC CCACCGGTGA GCGGGCCGCC
GAGCGGGCCG GGACGGTGCT GCTCACCGGT GGCACGGGCA CCCTCGGCGC GGCGCTCGCC
CGGCATCTGG TGACCGCGCG CGGCGTGCGC CACCTGCTGC TGACCAGCCG TCGGGGCGCG
CAGGCCCCCG GCGCGGCGGA GCTGGCCGGC GAGCTCACCG AGCTCGCCGG GCCCGGTGCG
ACCGTGCGGG TCGAGGCGTG CGACGCCGCC GACCGGGACG CGCTGGCCGG CCTGCTGGCC
TCCGTCGACC CGGCGCATCC GCTGACCTCG GTGATCCACG CCGCCGGGCT GCTCGACGAC
GGCGTGCTCA CCTCGCTGAC CCCGGAGAAG GTCGACGCGG TGCTGCGCCC CAAGGTCCGC
GCGGCGTGGA ACCTGCACGA ACTGACCCGC GACGCCGACC TCGCCGAGTT CGTGCTGTTC
TCCTCGGCGT CGGGCCTGCT CGGCGGGGCC GGGCAGGCGA ACTACGCGGC AGCGAACGTC
TTCCTCGACG CGCTCGCCGC GCACCGGCGG GCGGCCGGGC GCCCCGCGGT CTCGCTGGCC
TGGGGCCTGT GGCAGTCGGC CAGCGGGATG ACCGGCCACC TGGACGAGGC GGACCTGGCC
CGTATCGCCC GCGGCGGGCT GGTGCCGATG TCGACGGAGG CGGGCATGGC GCTCTTCGAC
GCGGCGCTGA CGGCCGGCCC GCCGCTGCTG GCGCCGGCCC CGCTGGACCT CGCCGCGCTG
CACCGGCAGG CGCAGACCGG CGGGCTGCCC GCGGTGCTGC GCGCGCTGGT CCGCGGGCCG
GTGGTCCGCC GCGCGGCGGG CGCGACCACG GTGGAGCCGA CCGCGCCGGG TGGCCTGGCC
GAGCGGCTGC GCGGGCTGCG GGGCGTCGAG CGGGAGCGGC ATCTGTTCGC CCTGGTCCGC
GAGCAGATGG CGGCGGTGCT GGGTCACGCG GGGGTCGAGG ACATCGCCCC CGACCAGGCC
CTCAAGGACA TCGGCTTCGA CTCCCTGACG TCCGTCGAGC TACGCAACCG GCTCGGCGCG
GCGACCGGGC TGCGGCTGCC GACGACCCTG GCGTTCGACT TCCCGACCCC GGCGGCGCTC
GCGGCCCGGC TGCTGTCGCT GCTGGCTCCG GACGACACCG CCGCCGAACT CGACCGGCTG
CTCGCCGAGG TGTCGGTGGA CGGCCCCGAG TTCGGCGCGA TCCGCGACCG GCTGCGCGAC
GCCCTGTGGC GATGGGAGGA GGCGGTCGCG GCCGGCTCCC CGGCCGGATC GGCCGTCCCG
GCCCAGCCGT CGTCGGCGGA CGAGCTGGCG GAGCTCGCCG ACGCCACCGA CGAGGAGCTC
TTCCGCGCGC TCGACGAGGA GCTGGACGCG CCGTGA
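
The GC content field in the record can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted in as a string with the 10-base groups and line breaks still present. The helper name `gc_percent` is hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the listing)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Toy input: 6 of the 8 bases are G or C.
    print(gc_percent("ATGC GCGC"))
```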
 
Protein sequence
MTNETTAGTE SRLRDYLKRV TSDLLSTRAR LSELEAERTE PIAIVGTACR YPGGVRTPED 
LWRVARSGTD AISSFPENRG WDVPGLFDPD PERVGHTYAR EGGFLHDADL FDPAFFGISP
REATAMDPQQ RLLLETSWEA FERAGIAPDS LRGSRTGVFA GVMYSDYGGR IRQAPDGLEG
YIGIGSAGSV ASGRISYTFG LEGPAVTVDT ACSSSLVALH LAVQSLRRGE CDLALAGGAT
VIATPGLFVE FSRQRGLSPD GRCKAFAATA DGTGWGEGVG LLLVERLSDA RRNNHPVLAV
VRGSAVNQDG TSGQLSAPNG PAQQRVIRQA LADAGLNTAD VDAVEAHGTG TRLGDPIEAQ
ALLATYGQNR PDDQPLWLGS LKSNIGHTQA AAGAAGIIKI IQALHHDELP RTLHVDEPTP
HVDWDSGNVR LLTENRPWES RDTPRRAAVS SFGISGTNAH VILEEAARET DDNSSGEAGS
APAETAPTAA APVERETPVP ILLSGHTEQA LHDQATNLNT YLTHHPNTTP HQLTHTLTHG
RTHHQHRAAI ITSATNTTEL RDTLTALTHH QPHPNLTQSH TTTPGKTVFV FPGQGSQWHG
MARHLYTTSP TFAHHLTTAT NALNPHLDYD LLHTLTSPTP PPNTVTYVQP ALFAVMTSLA
QLWQHHGIHP DTVLGHSQGE IAAAHIAGAL TLTDAATIVA LRATALTTLT GTGTMASITL
PAHALQPLLD THPDLHIAAH NSPTHTIVAG NPQSITELLE HCQQNNIQAR QIPVDYASHT
PHIEPLHTTL TTALAHITPT PATIPFYSTL TNTYLDTTQL TPEYWYQNLR NPVQFHQAIT
TLHHNGHTTY IETSPHPVLT TTITETLENP PTNTDKNSTQ NTNQNGTAEH PTERPAITVT
GTLRRDHGTL HTFHTALAHL HTHGHTPTWH TPPPPTPPTP LPTYPFQHHR YWLEDATPPG
DADGLGLQAT GHPVLRAATT LGNGQGLLLT GRVSARTHSL LAGHTVAGTA VLPPAATLDL
AFHAARLLGG LDVEELTIST PVVLPAADGV DLQLIVEAER DDSRRPFTVR ARPDQPSARA
ADGTAAAGED DDLGEQLNRP WTTHAVGVLA PAVPVLPGAT PAPESWPPAG AEEIALDDLD
AHLTTLGLEP GPALDGPHRL WQRDEELFAE VELAEDGAGE PGRFGLHPAA LTAALSPLQA
GLGPAADHAR QASGDTAGNG SGALPLTWRG ATLHTASSSP LRVRLAPGAP DPAGPGRGWA
VDLFTPSGTA VASIADVRLR PLDPRDLGAP TGGLEAARAD WVLTLDWPVL EPPVRESATA
QRWSPHLLLP TADEDRSAVA DLLARLTRVL GGADAPAGGS GDDPTGIPAD DTAREPLGPE
VTLLPVSSTG AAFDALRSWL TDPVTEGGRL LVLTTRAVAT GPDDTPDPPG LEAAAVWGLV
RAAQTEHPGR ITIVDLDGQD ASLAALPAAL ATGEPQLALR GGVAHRPRLT RVAPADLTIP
ADGTTDGTVS TEGTVPAGDT VFIGDTVLVA GGGPLAGVLA RRLAAEEYGA RLLLLGADDG
LAEELAALGA DVLVAEGDAS DREALTAALA LVSDASPVRA VLHVAAAPPV AALETTRTAT
FTAAVDTATA AARNLHEATL GLDLDWLVLV APDVAGTLGG VGTAARAAIG AVLDGLARQR
RSLGLVGVSL ALGPRATGDT ASTTSTGVTG DAGDAGDAGA AAGLVPFDSA QAGDLFTLVR
QARPAALVAA RLDHAALRPA AAAGLLPAVL RSLVRGARTP AAGQGAGRLV ASLVGRPEDE
QRAVLLGLIR ATAAAILGHT DGAGVDVTRA FKDLGFDSLT AVELRNRLAA ATGLSLPSTL
LFDYPSGEVL ADHLRSQLLG LVPEDGRSVA RGRAGDDGEP IAIVAMACRY PGGVRTPEDL
WRVVEQELDV ISPFPDNRGW DLDVLFDPDP AHPGTSYARE GGFLHDVDQF DPAFFGISPR
EATAMDPQQR LLLETSWEAF ERAGIAPDSL RGSRTGVFAG VVYTDYGSRV RLPADMEGYL
GIGSAGSIAS GRIAYTLGLE GPAVTVDTAC SSSLVALHLA VQSLRRGECD LALAGGATTL
ANPDIFVGFS RQRGLAPDSR CKPFAAAADG TAFGEGVGLL LVERLSDARR NNHPILALVR
GTATNQDGAS NGLTAPNGPS QQRVIRQALT DAGLNPADVD AVEAHGTGTR LGDPIEAQAI
LATYGQDRPD DQPLWLGSLK SNIGHTQAAA GVGSIIKIIQ AFHHGELPRT LHVDEPTPHV
DWQAGNVALL TEKRPWEPGD RPRRAGVSGF GMSGTNAHVI LEEAPAATPA EETTDGPVPI
LLSGHTEQAL HDQATNLNTY LTHHPNTTPH QLTHTLTHGR THHQHRAAII TSATNTTELR
DTLTALTHHQ PHPNLTQSHT TTPGKTVFVF PGQGSQWHGM ARHLYTTSPT FAHHLTTATN
ALNPHLDYDL LHTLTSPTPP PNTVTYVQPA LFAVMTSLAQ LWQHHGIHPD TVLGHSQGEI
AAAHIAGALT LTDAATIVAL RATALTTLTG TGTMASITLP AHALQPLLDT HPDLHIAAHN
SPTHTIVAGN PQSITELLEH CQQNNIQARQ IPVDYASHTP HIEPLHTTLT TALAHITPTP
ATIPFYSTLT NTYLDTTQLT PEYWYQNLRN PVQFHQAITT LHHNGHTTYI ETSPHPVLTT
TITETLENPP TNTDKNSTQN TNQNGTAEHP TERPAITVTG TLRRDHGTLH TFHTALAHLH
THGHTPTWHT PPPPTPPTPL PTYPFQHHRY WLEGPAIGPD GDGDTAGHGF VSAVTELADG
DGLLLTGRLS LRSHPWLADH AVRGTVLLPA TAQLELVFQA ALHTGAAGIE ELTLEAPLLL
PERGGVRLQA RVEAADDQGR RRVTVHSRPG AEDPWTRHAA GVLAAAAPTA IPPRGVSAWP
PPQATPVSVE GLYDQLAELG YEYGPAFQNL RAAWRDGDTV YASVVLGPEH HVDAARFAVH
PALFDAALHA MGLGGFLGSG VLLPFAWSGI TLAAAGATEL RVTVAPGPGE RDTVTVTLAD
QTGAPVARVD QLTLRPVDPA SLPSAPGRGG SEGLYELEWQ ELPPLAAVAP PPYRLAAAPG
GPSAGAPGGE PRSDLSSHLE LDRELDLQLA LGAPGGGEAE LTLLPWWAPP GPADAAAGAG
VDTDADRDPS PDLPGTDLPG AVGAGVLALL GRVRDWLGQD DRPQARLVVL TRRAVAARPG
DPTELTAAPL WGLLRSAATE NPDRIVIADV DGTAASLAAL PGALATGEPQ FALRDGVALV
PRLVRRAAGG ALTPPPDEPA WRVETPGGSP DDLFLAPFPA ARAPLGAGQV RLAVRAAGLN
FRDVLTALGM VPVGAPLGTE AAGVVLEVGP QASGLSPGLS VGDRVLGLVP GALGPLAVAD
ARLLAPIPAG WSFAEAAGVP AVFTTAYYSL VELARVRPGE RVLVHAATGG VGLAALQVAR
HLGAEVFATA SPAKWGALRA LGVPAERIAS SRTTDFEAAF LDATGGAGVD VVLNSLTGPF
LDASLRLLPQ GGRFVEMGIA DPRDPAEVAA THPGVRYQAF ELLDMDPDQV GRSLAGALDL
LAGGPLRPLP VTTWDVRSAP AAFRHFSKAR QVGKVVLTIP AAVGEGERPD LGEPTGERAA
ERAGTVLLTG GTGTLGAALA RHLVTARGVR HLLLTSRRGA QAPGAAELAG ELTELAGPGA
TVRVEACDAA DRDALAGLLA SVDPAHPLTS VIHAAGLLDD GVLTSLTPEK VDAVLRPKVR
AAWNLHELTR DADLAEFVLF SSASGLLGGA GQANYAAANV FLDALAAHRR AAGRPAVSLA
WGLWQSASGM TGHLDEADLA RIARGGLVPM STEAGMALFD AALTAGPPLL APAPLDLAAL
HRQAQTGGLP AVLRALVRGP VVRRAAGATT VEPTAPGGLA ERLRGLRGVE RERHLFALVR
EQMAAVLGHA GVEDIAPDQA LKDIGFDSLT SVELRNRLGA ATGLRLPTTL AFDFPTPAAL
AARLLSLLAP DDTAAELDRL LAEVSVDGPE FGAIRDRLRD ALWRWEEAVA AGSPAGSAVP
AQPSSADELA ELADATDEEL FRALDEELDA P
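
The protein sequence above is the translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation. It assumes the compact codon layout used in the NCBI genetic code listings (bases ordered T, C, A, G), relies on table 11 sharing its amino-acid assignments with the standard code, and omits the special handling of alternative start codons, which is not needed here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...), paired
# with the amino acid each one encodes; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace in the input (10-base groups, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence listing (60 bases, 20 codons).
    first_line = ("ATGACGAACG AGACGACCGC CGGCACCGAG "
                  "AGCCGCCTGC GGGACTACCT GAAGCGGGTG")
    print(translate(first_line))
```

Translating the first line of the gene listing this way gives the first 20 residues of the protein listing (MTNETTAGTE SRLRDYLKRV).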