Gene Franean1_3889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3889 
Symbol 
ID5672250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4642187 
End bp4652668 
Gene Length10482 bp 
Protein Length3493 aa 
Translation table11 
GC content77% 
IMG OID641242768 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001508185 
Protein GI158315677 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.49017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACG AGGACCGGCT GCGCCATTTT CTGAAGCAGG CGACAGCCGA GCTTCGTCAG 
GCCACCCAGC GGGTGCGCGA GCTGGAGGAG GCCGACCACG AGCCGATCGC GATCGTGGGA
ATGGCCTGCC GTTTCCCGGG CGGGGTGTCG TCCCCGGAGC AGCTGTGGGA TGTCGTCTCC
GGCGGGCACG ACGCGGTGTC GGAGTTTCCG GTGGATCGGG GTTGGGATGT GGCGGGTCTT
TATGATCCGG TTCCTGCTCG GGTGGGTCGG AGTTATGTGC GGTCGGGTGG TTTTCTGGCT
GGTGCGGCTG ATTTTGATGC GGGTTTTTTT GGGATTTCGC CGCGTGAGGC GTTGGCGATG
GATCCGCAGC AGCGGTTGTT GTTGGAGGTG TCGTGGGAGG CGTTGGAGCG GGCGGGGGTG
GATCCGTCGT CGCTGCGTGG CAGCGACACC GGCGTCTTCA CCGGCCTGAT CTACCAGGGA
TACGGCGGCG AGTCGGTGAC GGCGTCCGAG GGTGTGGAGG GCTATCGGAT AAGTGGGACC
GCCTCGAGCG TGGCCTCCGG TCGGGTGGCG TATGTGTTGG GGTTGGAGGG GGCGGCGGTG
ACGGTGGATA CGGCGTGTTC GTCGTCGTTG GTGGCGTTGC ATCTGGCGGT GCGGGCGTTG
CGTGCGGGTG AGTGTGGGAT GGCGTTGGTG GGTGGGGTGA CGGTGATGTC GACGCCGGTG
GGTTTTGTGG AGTTTTCGCG GCAGCGGGGG TTGGCTGCTG ATGGGCGGGT GAAGGCGTTT
GCGGAGGGTG CGGATGGGAC GGGGTGGGGT GAGGGGGTGG GTGTGTTGGT GGTGGAGCGG
TTGTCGGTGG CGCGGGCTCG GGGGCATGGG GTGTTGGCGG TGGTGGCGGG TTCGGCGGTG
AATCAGGATG GTGCGTCGAA TGGTTTGACG GCGCCGAGTG GTCGGGCGCA GGAGCGGGTG
ATCCGGGCCG CGCTCGCGGA CGCGGGAGCG GCGCCGGGCG ACGTGGACGT GCTGGAGGCG
CACGGGACCG GGACGGCGTT GGGCGACCCG ATCGAGGCGG GGGCGCTGTT GGGGGTGTTC
GGCCCGGGCC GCCCGGCCGA CCGGCCGCTG TGGCTGGGGT CGGTGAAGTC GAACATCGGG
CACGCGCAGG CGGCCGCCGG GGTGGCCGGG ATCATCAAGA TCGTGGAGGC GCTGCGCCAT
CAGGCGGTGC CGGCGACCCT GCACGTGGAC GCGCCGACCA GCCGGGTGGA CTGGGCCTCG
GGCGGGGTCC GGGTGGCGAC CGAGCAGGTC GCGTGGCCGC CGGAGCCGGG CCGCCGCCGG
CGCGCGGGGG TCTCCTCGTT CGGGATGTCG GGCACGAACG CGCACATCGT CATCGAGGAG
GCCCCGCCAG CCGCCCCCGA GTCCGAATCG GAGCCCGGGG CTGTGGCCGG GCCCGGGGCC
GCGGCCGGGC CGGTCGGTGC GGTGCCGTGG ATCCTCTCGG CACGGTCGGA GCAGGGCCTG
CTGGCGCAGG CCGCCCGGCT GGCGTCGCGC CTGGACGCCG AGCCGACGCC CGGTTCGCCC
ACGCCACGAC CGTCTTCCCG TGCCGCGGAC CGGCCGTCAG CGCGGGATGT CGGTTGGTCG
TTGGCGCGGG GCCGCGCGGC GTTGGAGCAC CGGGCCGTGG TGCTCGGGGA GCGGGGCGCC
GGCGACGCCC AGGTCGGTGG GGACCGGCTG GCCGGGCTGC GTGCGCTGGC GGCGGACGAG
CCCGTGGAGA CGGTGGTGCG GGGGCGGGTT GGCACAAGTG GTGATCGGGT GGTGTGGGTG
TTTCCGGGGC AGGGGGCGCA GTGGGTGGGG ATGGGCGCGG AGCTGTTGGA CGCCTTGCCG
GTGTTTGCGG GTCGGGTCGC GGAGTGCGCG GCGGCGTTGG CGCCGTTCGT GGACTGGTCG
CTGGTCGACG TCCTGCGGGG GGCCTCGGAC GCCCCGGGTG CCGCCGGGAT CGCGGGTGTC
GAGGGGATCG TGGGTGTCGA GGGCGCGGGT TTGCTGGAGC GGGTTGATGT GCTGCAGCCG
GTGTCGTGGG CGGTGGCGTT GGGGTTGGCG GCGGTGTGGG AGTCGTGGGG GGTGCGCCCG
GATGTGGTGG TGGGGCATTC GCAGGGGGAG ATCGTTGCGG CGTGTGTGGC GGGAGTGCTG
TCCGTCGATG ACGGGGCGCG GGTGGTGGCG GCCCGTAGTC GGGTGATTGC GGGGGAGCTG
GCTGGGTGCG GGGGCATGGC GTCTGTGGCG CTACCGGTGG GTGAGGTGGA GGCGCGGCTG
CGGGTGGGGG CTGGTCGGGT GGAGGTGGCG GCGGTGAACG GGCCGTCGGC GACGGTGGTC
GCCGGTGAGC CTGCGGCGTT GGACGATCTG CTTTCCGTTT GGGAGGCGGA GGGTGTTCGG
GTTCGTCGGT TGCCGGTGGA CTATGCGTCG CATACGGCGC AGGTGGACCG GGTCCGCGAC
CGACTCACCA CGGAGCTCGC GGGCATCGCT CCGCGTGCGG GGACTGCCGA GATGTGGTCC
ACGGTCACCG GCTCCCCGGT CGATCCCGGG CAGCTCGACG GGGAGTACTG GTTCCGGAAC
CTGCGGTCGA CCGTCCGCCT GCACGAGGTC GCCGACGTGC TGGTGGGCGA GGGGCACCGG
GTGTTCGTGG AGATCAGCCC GCATCCGGTG CTGACTGCGG CGATCACCGA GACGGCCGAG
GCGGCCGGCG TCCCGGACGC TGTGGTCGTG GGGTCGCTGC GGCGCGGCGA CGGCGGGCCG
GACCGGCTGG CGGCGGCCGC GGCGCAGCTG TGGGTGCGCG GTGTTCCGGT CGACTGGGCC
CCGCTGGTGG CCGGCGGCCG GCAGGTGGAC CTGCCGACCT ACCCCTTCCA GCACCAGCGG
TACTGGCTGC TGGGCGACGG CCGCCCGGCC GGCGGTGCCG CATCCGGCGG TGGCGCGGCC
GGTGGGACGG ATCTTGACGG GCTGCGGTAC CGCGTCGGTT GGCATCCGCT CGGTCCGGAG
CCGTCGCGGG CCGCCCCGGC CGCCGCTACC GGCCGATGGC TGATGGTCAC GCCGTTCCCG
GCCGCCGATG ACGTACCGCC GCTCGCTGCC GACGCCGTGC GCGCCCTGGC CGAGGACGGC
GCGGAGATCG TCAGCGTCCC GGTCGGCCCC GCCGAGCTCA CCCGGGAGGG CCTCGCCGGG
CGCCTGCGAG CCGTGACCGG CGCCGGGGAG AGCGACCCGC GGGGCGTGCT GTCGCTGCTC
GCGCTGACCG GCACGGCACA GCCCGACGGG CCCCCGCGCG CCGACGGCGC TGACAGCGCT
GACAGCGCCG TGGGCGCTGA CGGTGCCGTG GGCACGGTTG GTGCCACAGC CACCCTGCTA
CTCGTCCAGG CCCTGGGCGA CGCGGGGATC GGCGCGCCGC TGTGGTGCCT CACCCGCGGT
GCCGTGTCCG TCGGCGGCGG TGACCCGGTG ACCGACCCGG CGCAGGCGCA GACCTGGGGG
CTGGGGCGGG TGGCCGCGCT GGAGCACCCG TCCCGCTGGG GCGGCCTCGT CGACCTGCCC
GACACGCTGG ACGGGTCCGC CCGGCGCCGG CTGGCAGCGG TTCTGCGTGG CGCGACCGAT
GAGGACCAGG TCGCGATCCG GGCGGGCACG GTCTTCGCCC GTCGGCTGCT GCGCGCGCCG
CTCGGCGGGA CCGAACCCGC CCGGCGCTGG CGGCCCCGCG GAACCGTGCT GGTCACCAGC
ACCGGACCGC TCGGCCCCCA TCTGGCCGGA TGGCTGGCCG AGCGGGGAGC CGGGCACGTC
GTGCTCGCCG GATCGCACCG CGCGGACGAT CCCGACATCG CGGCGCTGCG TCACGGCGCG
TTCACCGGGG ACGTCCGGCT GACCGTGGCC ACCTGTGACC TCGCGGACCG GGAGGCGTTG
GCCGCGGTCG TCCGCGGGCA GGCCGCCGAC GGCGAGGGGA TCCGCGCCGT CGTGCACACC
GCCGCGCTGC TCGGGCTGCG CCCGCTGGCC GACACCTCGG TCACCGACCT GGCCGGCGCG
ATCGCCGCCA AGGCCACCGC GGCCGACTGG CTCGACGAGC TGTTCCCCAC CGACGACCTC
GACGCGTTCG TGCTCTTCTC GTCGGTCGTC GGCGTGTGGG GCGGCGCAGA CCACGCCGCC
TACGCCGCGG CCAACGCCCA CCTGGACGCC CTGGCCGAGC GGCGGCGTGC CCGCGGCCTG
CCGGCCACCT CGATCGCCTG GGGGCTGTGG CGTCCGCTGG CGGAGAGCGG CAGCGCCGGC
GCCGGCGCCG CGGCAGCCGA GGACCCCGGC GTGCGGCGGC AGCGGGCACG GGGGCTCGGG
TACCTCGAGC CGGACCGCGC CCTCGATGCC CTGCGGCAGG TGCTGGACCA CGACGAGACC
CACATCGTCG TGGCGGAGAT CGACTGGACG CGGTTCATCC CGGTCTTCAC CTCGGCGGGC
CGCCGCCCGC TGTTCGACAC GCTCGCCGAG GACGCCGGGG CTGCTGGTGA GGCCCGGATG
TCCGGCGTCG CGGACGGCGC CGCCTCCCCG GCCGCGTCCG GAACCGGCTC GCCGCTGGCC
CGGAGACTCG CCGCGCTGCC TCCGGCCGAG CGGGACCGGG CCCTGGTCGA GACGGTCCGC
ACCCGGGCAG CGGCCGTGCT GGGGCACTCC GACGCCACCG CCGTGGAGGC GCACCGGGCC
TTCCGCGACC TCGGCTTCGA CTCGCTGACC TCCGTCGACC TGCGTAACGG GCTCGGCGCG
GCGCTCGGCC TGCGGCTGCC GTCCACGCTG GTGTTCGACT ACCCGACCCC GTTGACGCTG
GCGGAGTTCC TGCGGGTGGA GATCACCGGC GGCGACGCGG ACGACGCGCC CGCCGCCGGC
GTCCCGGGCG CTGTGGCGTC CCGCGACCGG AGCGGGCACG GCGGTGAGCA CGACCTGCTG
GACGCCGAGC CGGTGGCGAT CGTGGCCATG GCCTGCCGGC TGCCCGGTGG CGTCGACTCC
CCGGAGGACC TGTGGCGGCT GCTGGCCGAG GGCCGGGACG CGATCGGCGA GTTCCCGGCC
GACCGGGGCT GGGATCTCGA CGCCCTGCAC GATCCCGACC CGGAGCGGTC CGGCACCTCC
TATGTGCGCC ACGGCGGCTT CCTGGCCGGG GTCGCCGACT TCGACGCCGA GTTCTTCGGC
ATCAACCCCC GCGAGGCCCT GGCGATGGAC CCCCAGCAGC GGCTGCTGCT GGAGCTGTCG
TGGGAGGCGG TGGAACGCGC GGGCATCGAC CCGTCGACGC TGCGCGGCAC CCCGGGCGGG
GTGTTCGTCG GCACGAACGT CCAGGACTAC GGCCCGCGCG CGCTGGCCGC CGGGGCGGAG
ACGGAGGGCT ACATCGGCAT CGGCAACGCG GCGAGCGTCA TGTCCGGGCG CATCTCCTAC
ACCCTGGGGT TGCAGGGGCC GGCGGTGACC GTCGACACGG CCTGCTCGTC GTCGCTGGTG
GCCCTGCACC AGGCCGTCCA GGCGCTGCGG CGGGGGGAGT GCTCGCTGGC GCTGGCCGGC
GGCGCGGTCG TGATGTCCTC CCCGCTGATG TACGTCGAGT TCAGCCGTCA GCGCGCGCTG
TCACCGGACG GCCGCTGCCG GGCCTTCGGT GCGGGCGGTG ACGGCACCGG CCTCTCCGAG
GGCGTCGGCC TGCTGGTCCT CGAACGGCTC TCCGACGCCC GCCGGGCGGG CCGTCCGGTG
CTCGCCGTCG TCGAGGGCAC GGCGGTCAAC TCGGACGGCG CGTCGAACGG ACTGAGCGCG
CCGAACGGCC CGGCGCAGCA GCGGGTGATC CGCCAGGCGC TCGCCGTCGC CGGGCTGTCG
CCGTCCGAGG TGGACGCGGT GGAGGCGCAC GGCACCGGGA CGAAGCTCGG CGACCCCATC
GAGGCGCAGG CCCTCATCGC CGCGTACGGC CGCGACCGGC CGGCCGGGCG GCCGCTGTGG
CTGGGGTCGC TGAAGTCCAA CATCGGGCAC ACCCAGGCGG CGGCCGGCGT GGCCGGTGTG
ATCAAGATGG TGCTGGCCAT GCGGTACGGG GTGCTGCCGC GCACCCTGCA CGCCGACGAG
CCCACCCCGC AGGTCGACTG GGACGGGGCC GGCGTCCGGC TGCTGACCGA TGAGGTCGCG
TGGCCGGCCC GCCCGCTCCC GGACGCCGCC CCGGCGGGGC GGGAGCGGGC CGGTGAGCGC
GGACCGCGGC GGGCCGCGGT CTCCGCGTTC GGCATCAGCG GTACGAACGC GCACGTGATC
CTCCGCGAGG CCCCCGCCAC CGAGGAGCCG CGGGTCGGGC CGGCGCCGTC ACCGGCGTCG
TCGCCGCGGC CGGTGCCGCT GCTGGTGTCG GCGCGCACCA CGGAGGGGCT GCGCGCCCAG
GCCCGCCGGC TGGCCGCCCA CCTCGACGAC CGTCCCGGCC CGGCCGCCGC GGATGTCGCG
CTGGCCACCG CGGCGACCCG GGCCGCCTTC GACCACCGCG CGGTCGTGGT CGGCGCCGAC
CACGACGAGC TGCGGGCCGG CCTCGTGGAG CTCGCCGGCG ACCAGGATCC CGCCGACGGG
ACCGCCCCCG TGGGCGTCCG CCAGCCGCGC GGCCTTGTCG TGCGCGGCGT GGCCGTTCCC
GACCCGCGAG TGGTGTTCGT CTTCCCCGGG CAGGGATCAC AGTGGTCCGG CATGGGCCGG
GAGCTGCTGG AGTCGTGCCC GCCGTTCCGC GAGCGGATGA CCGAGTGCGC CGCGGCGTTC
GAGCCCTACC TCGACTGGTC CCTGCTGGAC GTGATCCGCG GGGCCGGTGA CACGCCGCCG
CTGGAGGCCA TCGAGGTGAT CCAGCCCGCG CTGATGTCGA TGATGGTTTC GCTGGCCGCG
GCCTGGCGCG CCTACGGGGT GACGCCGGCC GCGGTGGTCG GCACCAGCCA GGGGGAGGTC
GCCGCCGCGC ACGTCGCCGG TGCTCTCTCC CTGGCCGACG CGGCCCGGAT CATCGCGCTG
CGCAGCAGGC TGCTGGCCAC CCGCCTGCTC GGCCGGGGCG CGCTGGCGTC GGTCGGGCTG
CCGCCCGCGG ACGTCGCCGC CCGGCTCGAC CGCTTCGGCG GCCGGCTCGC CGTCGGCGGG
ATCAACGGGC CGCGGCAGGT CACCGTCGCC GGCGAGACCG GCGCACTGGA CGACCTGGTC
GCCGAGCTCA CCGCCGAGGG CGTCCGCGCC CGGCTGGTCG CGGCGTCGGT CGCGACGCAC
TGCGCGCAGG TGGACGGCAT CCGTCCCGAG CTGATGGAGA TCCTGCGCCC ACTGGCTCCC
ACCCCCGCCC GGATCCCGGT CTACTCGACG GTGACCGGGG GGCCGCTGGA CGCCGCGGCG
CTGGACGCCG AGTACTGGTA CTCCAACACC CGGGAACCGG TCCTGCTCGA CGGCGCCGCG
CGGGCGCTGC TGGCCGCCGG ATTCGACACC TTCGTCGAGG TCAGCCCGCA TCCGGTGGTG
GGATTCGGGC TGGCCGAGAC GGCGCACGAC GCCGGCCGGG ACGCCGTGGT CGTCAGCACG
CTCCAGCGCG GCCGCGGGGG TGCCGACCGC CTGCTGGCCG CGCTGGCCGA GGTGCACGTC
CACGGGGTCG GCGTCGACTG GCGGGAGGCG TTCGCGGGCA CCGGCGCCCA CCCGGCCGAC
CTGCCCACCT ACGCCTTCCA GCGTCGGCGG TTCTGGCCGC GGCCGGCACC GGCCCGCCCG
GCCGACGCCG CCGCCGCGCC GGGCCGCGAC GGCCCGGCTC ATCCGCTGGT CACCGCCGCC
GTCCCGATGG CCGACGGGAG CCTGCTGCTG ACGGCCCGGA TCAGCCCGGC CGCGCAGCCG
TGGCTGGCCG GCCGGACCGT GGCCGGTGTC CCGGTCGTGC CGGACGGGGT GCTGCTGGAA
CTGGCCATCC AAGCGGGCGA CGAGGTCGGC TGCCCGGAGG TGACCGACCT GGCCCTGACC
GGGCCGCTGG TGCTGCCGGA GAGTGGGGCG CTCCGGCTCC AGCTGCACGC GGGAGCACCG
GACGGCACCG GGAACCGTGC GCTGACGGTG CATGCCCGTA CCGACGGGGC CCCGGCGGAC
GCGCCCTGGA CCGCGCACGC CACCGCGACG CTGGCGGGCA CCGCCGACCC GGCCGCGTCC
GTACCCGACC CGCGGGCGTG GCCGCCGCCC GGCGCGGTTC CGGTGGACGT CGACCGGCTG
TACGGCGAGC TGCTCGCCGC CGGTCACGGC CACGGGCCGG ACGGCCCGCG GCTGACCAGG
GCCTGGCAGG CGGGGACGGA CCTGTTCGCC GAGGCCGTCC TGGACGGCGA GCACGAGGTG
GCCGGGTTCG GCCTGCACCC GGTGCTGCTC GAGGTGGCCG GCCACCTGGC CCTGTGGGGA
CGGCTCCCGG CAGCCGGCGC CGAGCCGCCC GCGGAGCCGC CCGCCGTGCT CGGCGTGGAG
CCGCCCGCGG TGTCGGAGCC GGTGGTGGCG GTCGACGGCG GCTACCGGGG CGTGCGGCTG
TATGCCACCG GTGCGCGGGC GGTGCGGGTG CGTGTCGGCC CTGCGGGCGC CGGCGCGGTC
TCGGTGGAGC TGGCCGACCA CACCGGCGCG GCGGTCGCGG CCTACGGCTC GACCGGCCTC
CGGCCGCTGG CCGCCGCCCG GCTGCGTGCG CTGCGCACCT CGGCGGCGGA CTCGATGTTC
GCCCTCGACT GGGTGCCGGT GGCCCCCGCA CGGGCCTCGG GTGCCGGCGC GCCGGGCTGG
GCGGTGCTCG GGCCGGACCC GCTCGGGCTG CGCACGGCCC TCGCGCTGGC CGGCGTCCAC
GCCCCGGCCT GCCTGGACCT GGCGGCGGCC GCGGCCCTGC CGGCCATCCC CGACGTCGTG
CTGGTGACCA GCGACTCGGC CGCGGCCGGC CCGGATCTCG CGGCGGCCGC GCGAACCGCC
CTGCGCTCGG TGCTCGCCCT GGCGCAGGAA TGGCTGCGTG ACGACCGGTT CGCCGCCGCC
CGGCTGGTCG TGGTCACCCG GGGGGCGGTG GCGGTCGGCG CGCACGACGA CGTCACCAGC
CTCGCCGACG CCGCGGTGTG GGGCCTGCTC GGCTCGGCGG CGGCGGAGAA CCCGGACCGG
TTCCTGCTCG TGGACGTCGA CGGGACGCCT GCGTCGGAGC GGGCCCTGCC GGCCGCGGTC
GCCTGCGGCG AGCCGCGGGT CGCCCTGCGC GAGGGCGCCG TCCTGGGCCA ACGCCTGGTC
CGCGCGCCGG TCGCCGGCGC CGGTGTCGAC GCCGGCCGGG CACGGTGGCG GCCGGACGGG
ACGGTGCTGA TCACCGGTGG CACCGGCACG CTGGGCGGGC TGGTCGCCCG GCACCTGGTC
ACCGGCTACG GGATCGGCAA CCTGCTGCTG GTCAGCCGCG GCGGAGCCGG TTCGCCCGGC
GTCGCGGAGC TGACCGACGA TCTGACAGCC CTGGGCGCGC GGGTGACGGT GGCGGCCTGC
GACGTCGCCG ACCGCGCCGC GCTGGCCGGC GTGCTCGCCG CGATCCCCGC GGCGCATCCC
CTGACGGCGG TCGTGCACGC CGCCGGTACC CTCGACGACG GGGTGTTCTC CGCGATGACC
CCGGAGCGCC TCGACACCGT GCTGCGGCCC AAGATGGACG CCGCCCTGCA CCTGCACGAG
CTCACCCGGG GCGACGACCT GGCCGCCTTC GTGCTGTTCT CCTCGTCCGC GGCCGCGTTC
GGCAGCGCGG GGCAGGCGAA CTACGCGGCC GCCAACACCT TCCTCGACGC CCTGGCGCAC
CACCGGCGGG CAGCCGGCCT GCCCGCGCTC GCCCTCGGCT GGGGGTACTG GGCCCCGCGC
AGCGCCCTGG CCGGGCACCT CGACGCCGCC GCGCTCGACC AGCGGATGGC CGCCAACGGG
CTGCGGCCGA TCTCCGCCGC GGCCGGCATG GCGCTGTTCG ACGCCGCGCT CACCGCGGAC
CGCCCCGTGC TGCTGCCCAT GCGGCTCGAC CCGCGGGCAC CGCGGGCGGG GGCCGTCCCG
GCGCTGCTGC GCGGCCTGGT CCGCCACCCG GCCCGGCGGT CGGTCGACGG CGCGGCCGAT
GATCCGGTCG AGACGCTGGC GCGGCGGCTG GCGGGGGCCG GCGGCCCGGA ACGGGACCGG
GCCCTGCTCG ACGTCGTGTG CGCCACCGCC GCCGCGGCGT TGGGGCACAC GTCCGCGGCG
GAGGTGGCAC CGGACCGGCC GTTCCGGGAG CTCGGTTTCG ACTCGCTCGC CTCGGTGCAG
TTCCGCAACC GGCTGACCGC GGCCACCGGG TTGCGCCTGC GGCCGATGCT CGTCTTCGAC
TTCCCGACAC CCGAGGCCAT CGCCGGCCAC CTCCGCGACC AGTTGTTCCC GGAGGTGGCC
CCGGCCGCCG CGGGCCCACC CACCGCCGCG CCCGCGCCCG CGCCCGCGGG CCCGTCCGAC
CCGGAACCGC CGGCCGCGCC CGGCGGCGGC GAGCCGGCCA GCGCGTTCGA CGAGATGGAC
GCCGAGACGC TGGTCCGGCT GGCCCTCGGT GAGGGCACCT GA
 
Protein sequence
MSNEDRLRHF LKQATAELRQ ATQRVRELEE ADHEPIAIVG MACRFPGGVS SPEQLWDVVS 
GGHDAVSEFP VDRGWDVAGL YDPVPARVGR SYVRSGGFLA GAADFDAGFF GISPREALAM
DPQQRLLLEV SWEALERAGV DPSSLRGSDT GVFTGLIYQG YGGESVTASE GVEGYRISGT
ASSVASGRVA YVLGLEGAAV TVDTACSSSL VALHLAVRAL RAGECGMALV GGVTVMSTPV
GFVEFSRQRG LAADGRVKAF AEGADGTGWG EGVGVLVVER LSVARARGHG VLAVVAGSAV
NQDGASNGLT APSGRAQERV IRAALADAGA APGDVDVLEA HGTGTALGDP IEAGALLGVF
GPGRPADRPL WLGSVKSNIG HAQAAAGVAG IIKIVEALRH QAVPATLHVD APTSRVDWAS
GGVRVATEQV AWPPEPGRRR RAGVSSFGMS GTNAHIVIEE APPAAPESES EPGAVAGPGA
AAGPVGAVPW ILSARSEQGL LAQAARLASR LDAEPTPGSP TPRPSSRAAD RPSARDVGWS
LARGRAALEH RAVVLGERGA GDAQVGGDRL AGLRALAADE PVETVVRGRV GTSGDRVVWV
FPGQGAQWVG MGAELLDALP VFAGRVAECA AALAPFVDWS LVDVLRGASD APGAAGIAGV
EGIVGVEGAG LLERVDVLQP VSWAVALGLA AVWESWGVRP DVVVGHSQGE IVAACVAGVL
SVDDGARVVA ARSRVIAGEL AGCGGMASVA LPVGEVEARL RVGAGRVEVA AVNGPSATVV
AGEPAALDDL LSVWEAEGVR VRRLPVDYAS HTAQVDRVRD RLTTELAGIA PRAGTAEMWS
TVTGSPVDPG QLDGEYWFRN LRSTVRLHEV ADVLVGEGHR VFVEISPHPV LTAAITETAE
AAGVPDAVVV GSLRRGDGGP DRLAAAAAQL WVRGVPVDWA PLVAGGRQVD LPTYPFQHQR
YWLLGDGRPA GGAASGGGAA GGTDLDGLRY RVGWHPLGPE PSRAAPAAAT GRWLMVTPFP
AADDVPPLAA DAVRALAEDG AEIVSVPVGP AELTREGLAG RLRAVTGAGE SDPRGVLSLL
ALTGTAQPDG PPRADGADSA DSAVGADGAV GTVGATATLL LVQALGDAGI GAPLWCLTRG
AVSVGGGDPV TDPAQAQTWG LGRVAALEHP SRWGGLVDLP DTLDGSARRR LAAVLRGATD
EDQVAIRAGT VFARRLLRAP LGGTEPARRW RPRGTVLVTS TGPLGPHLAG WLAERGAGHV
VLAGSHRADD PDIAALRHGA FTGDVRLTVA TCDLADREAL AAVVRGQAAD GEGIRAVVHT
AALLGLRPLA DTSVTDLAGA IAAKATAADW LDELFPTDDL DAFVLFSSVV GVWGGADHAA
YAAANAHLDA LAERRRARGL PATSIAWGLW RPLAESGSAG AGAAAAEDPG VRRQRARGLG
YLEPDRALDA LRQVLDHDET HIVVAEIDWT RFIPVFTSAG RRPLFDTLAE DAGAAGEARM
SGVADGAASP AASGTGSPLA RRLAALPPAE RDRALVETVR TRAAAVLGHS DATAVEAHRA
FRDLGFDSLT SVDLRNGLGA ALGLRLPSTL VFDYPTPLTL AEFLRVEITG GDADDAPAAG
VPGAVASRDR SGHGGEHDLL DAEPVAIVAM ACRLPGGVDS PEDLWRLLAE GRDAIGEFPA
DRGWDLDALH DPDPERSGTS YVRHGGFLAG VADFDAEFFG INPREALAMD PQQRLLLELS
WEAVERAGID PSTLRGTPGG VFVGTNVQDY GPRALAAGAE TEGYIGIGNA ASVMSGRISY
TLGLQGPAVT VDTACSSSLV ALHQAVQALR RGECSLALAG GAVVMSSPLM YVEFSRQRAL
SPDGRCRAFG AGGDGTGLSE GVGLLVLERL SDARRAGRPV LAVVEGTAVN SDGASNGLSA
PNGPAQQRVI RQALAVAGLS PSEVDAVEAH GTGTKLGDPI EAQALIAAYG RDRPAGRPLW
LGSLKSNIGH TQAAAGVAGV IKMVLAMRYG VLPRTLHADE PTPQVDWDGA GVRLLTDEVA
WPARPLPDAA PAGRERAGER GPRRAAVSAF GISGTNAHVI LREAPATEEP RVGPAPSPAS
SPRPVPLLVS ARTTEGLRAQ ARRLAAHLDD RPGPAAADVA LATAATRAAF DHRAVVVGAD
HDELRAGLVE LAGDQDPADG TAPVGVRQPR GLVVRGVAVP DPRVVFVFPG QGSQWSGMGR
ELLESCPPFR ERMTECAAAF EPYLDWSLLD VIRGAGDTPP LEAIEVIQPA LMSMMVSLAA
AWRAYGVTPA AVVGTSQGEV AAAHVAGALS LADAARIIAL RSRLLATRLL GRGALASVGL
PPADVAARLD RFGGRLAVGG INGPRQVTVA GETGALDDLV AELTAEGVRA RLVAASVATH
CAQVDGIRPE LMEILRPLAP TPARIPVYST VTGGPLDAAA LDAEYWYSNT REPVLLDGAA
RALLAAGFDT FVEVSPHPVV GFGLAETAHD AGRDAVVVST LQRGRGGADR LLAALAEVHV
HGVGVDWREA FAGTGAHPAD LPTYAFQRRR FWPRPAPARP ADAAAAPGRD GPAHPLVTAA
VPMADGSLLL TARISPAAQP WLAGRTVAGV PVVPDGVLLE LAIQAGDEVG CPEVTDLALT
GPLVLPESGA LRLQLHAGAP DGTGNRALTV HARTDGAPAD APWTAHATAT LAGTADPAAS
VPDPRAWPPP GAVPVDVDRL YGELLAAGHG HGPDGPRLTR AWQAGTDLFA EAVLDGEHEV
AGFGLHPVLL EVAGHLALWG RLPAAGAEPP AEPPAVLGVE PPAVSEPVVA VDGGYRGVRL
YATGARAVRV RVGPAGAGAV SVELADHTGA AVAAYGSTGL RPLAAARLRA LRTSAADSMF
ALDWVPVAPA RASGAGAPGW AVLGPDPLGL RTALALAGVH APACLDLAAA AALPAIPDVV
LVTSDSAAAG PDLAAAARTA LRSVLALAQE WLRDDRFAAA RLVVVTRGAV AVGAHDDVTS
LADAAVWGLL GSAAAENPDR FLLVDVDGTP ASERALPAAV ACGEPRVALR EGAVLGQRLV
RAPVAGAGVD AGRARWRPDG TVLITGGTGT LGGLVARHLV TGYGIGNLLL VSRGGAGSPG
VAELTDDLTA LGARVTVAAC DVADRAALAG VLAAIPAAHP LTAVVHAAGT LDDGVFSAMT
PERLDTVLRP KMDAALHLHE LTRGDDLAAF VLFSSSAAAF GSAGQANYAA ANTFLDALAH
HRRAAGLPAL ALGWGYWAPR SALAGHLDAA ALDQRMAANG LRPISAAAGM ALFDAALTAD
RPVLLPMRLD PRAPRAGAVP ALLRGLVRHP ARRSVDGAAD DPVETLARRL AGAGGPERDR
ALLDVVCATA AAALGHTSAA EVAPDRPFRE LGFDSLASVQ FRNRLTAATG LRLRPMLVFD
FPTPEAIAGH LRDQLFPEVA PAAAGPPTAA PAPAPAGPSD PEPPAAPGGG EPASAFDEMD
AETLVRLALG EGT