Gene Hoch_2957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2957 
Symbol 
ID8545345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4055634 
End bp4070282 
Gene Length14649 bp 
Protein Length4882 aa 
Translation table11 
GC content67% 
IMG OID646387636 
Productamino acid adenylation domain protein 
Protein accessionYP_003267364 
Protein GI262196155 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.221364 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGG ACACCGCATG CTCATCCTCC TTGGTAACAG TTCACTTGGC CTGCGAGAGC 
CTGCGCCGAG GCGAATCGAC GGTCGCGCTG GCGGGCGGTG TGAACCTCAA CCTGATCGCC
GAGAGCGCGC TCGAGATGAG CAAGTTTGGT GGGCTCTCTC CCGATGGGCG CTGCTTCACC
TTCGACGCAC GCGCCAACGG CTACGTACGC GGCGAAGGTG GCGGGGTCGT CGTGCTCAAG
CGTCTCTCCC ACGCGTTGAT GGATGATGAT CCGATCATCT GCATCATCCG GGGCTCCGCC
ACGAACAACG ACGGCGCTAG TAACGGGCTC ACCGCTCCCA ACCCGCGCGC TCAAGAGGCA
CTCCTCCGCC TTGCGTATCG GCAGGCCGGC ACAGACCCGG GTGAGGTCCA GTACGTGGAG
CTACACGGCA CCGGAACGCC GCTGGGCGAT CCGATTGAAG CGGCCGCGCT AGGCAGCGTC
CTCGGTCAGG CACGCTCCTC CCAGAATCCC TTATTGGTCG GCTCGGCCAA GACCAACGTT
GGCCACCTGG AGGCCGCCGC CGGCGTGGCA GGCCTTCTGA AGGTGGCGCT GTGCCTGCAG
AACAAGCAGC TCGCGCCAAG TCTGCACTTC GATACGCCGA ACCCCCACAT TCCACTGGCC
GACCTCAACC TGCGCGTGGT CGATGATCTG ACCACCTGGC CGGCTCCAGG GCGCCCGACG
GTGGCAGGGG TGAGTTCGTT CGGGATGGGC GGGTCGAATT GTCACGTCGT AGTCTCAGAA
CATCTGGAGA GCCAAGCAGA GATAGTGGCT CTCACGGCCC CAAGTGCTGA AGAGTTAGAA
GCCAAGGCCC GAGGGTGGGC CGAGGCGCTT GCCAAGACTT CACCGGCCGA CTCAGCCGCC
ATGCTGTGCG CCCGCGCCTC CGCGGAGCCA GTTGATGGGG ACCACCGGCT TGCGGTCACG
GGACGCTCCG CCGCGCTTTT GAGTCAGCGA CTGCTCGAGT TTGCGACGGG CGACGCGCGG
CTTGGCGTGT CGGTAGGCTG TGTGGTGCCG GGCACCTCGC CGCGGGTCGC GTTCGTTTTC
GGTGGGCAGG GCGCCCAATG GTTCGGGATG GGAACGCAGC TGCTGCGCCG CGAGCCGGTG
TTTCGACGCT CGATCGAGCG GGCCAGCAGC TTGATCCAGC AGCACCTCGG GTGGTCGCTC
CTCGAGGAGC TCACCGCTCC GCGCGAGCGT TCCCGCCTCG ATAGCGTCGC CGTGAGCTTT
CCGGCCATCG TGGCGTTCGA GATCGCGCTC GCATCGTTAT GGCAAAGCTG GGGTATCCGG
CCGGCAGCCG TGCTTGGCCA CAGCATCGGG GAGGTTGCCG CCGCGCACGT TGCGGGCGCT
CTCAGCCTCG AGGACGCGAT GCTGGTGATC TGTGCTTACG CTCGCGGACT CGAGCGCTTC
CGAGGACGCG GGGCCATGGG GCTCGTGGCG CTCTCGTGGC AGGACGTCGC TCAAGCTCTC
GAGCGCCACG AGGGGCGTCT GTTCCGGGCC ATCCAGATGG CCCCGGACTC CACCGTCGTC
GCCGGTGAGC CCGAGTCTCT GCTGGAACTC CTCGACGAGC TGGAAAGCCA GGGGACTTTT
TGTCGCCGAA TCGCCACAGA CGCGGCTCCA CATTGCCCCT TGGTGGACGG GCTGCGGGAT
GAACTCCGCA CAGCGCTTCG CGCCCTCCGC CCGCGTGCGG GCGACATCCC ATGGATCTCT
GAAGTGACGG GCGCGCAGCT CGACGGGGAA AGCCTGGGCA CCAGCCACTG GGTTCGCAAC
CTCTGCGACC CCATTCGGTT CGGCGATGCC CTGGAGCACC TCGTGAATCA GGGGCCGGAC
GTGTTCGTTG AAGTGAGTCC GCACCCCGTC GTGCTACCGG TCATCGAGAC CTACCTACGC
CGCGCTGACA GGACCGGCGA GGCCGTCCCA ACCCTGCGTC GTGATGAAGA CGAAGGCGAA
GCCATGCGGG ACGCCTTGGG GGCGCTGTTC GTGCGGGGCG CATCGAGTCA CTGGGATGCG
GTGCATGGCT GCGCCGCCCC CCACGGCACT AGCTCGGACG GTTTCCTTCC TGCACCGATC
CTTCTCTCCG GAGGGACACC GGCCGCCCTG CGCGCTCAGG CGATGCAGCT GCATACCCTC
CTCACCGCAA AAGAGTCCCT GCGCGTGCAG GACGTCGCCT ATTCGTTGGC CGTGACGCGC
ACGCACTTCG AGAAGCGGGC CTCATGGCTC GCGACCAGCC GCGAGGGCCT GCTAGGCGCT
CTCGATAAGG TCGCCCGGGG AGTACCGGTC GCGTCGGTCA CCGTCGGCGA AGCCAGAGAC
ACAGGCAAAC TCGGGTTCCT CTTCACTGGC CAGGGTAGTC AGCGTCCCGG CATGGGGCGC
TCCCTCTACA GGGCATTTCC CAACTTTCGC AGCGCGCTCG ACGCGGTCTG CGACGAACTG
GATCCCCACC TCCCTCGCCC CCTCCGCGAA CTCCTCTTCT GTACCGACGG CTCCCCCGAG
GCCGCGCTGC TGGGGCAGAC GGGGTTCACG CAGCCCGCGT TGTTCGCTCT AGAGGTCTCT
CTGTTTCGAT TGCTTGAGGC CTGGGGGCTC GTCCCAGACG TCCTGCTGGG GCACTCCGTC
GGCGAGCTCG CGGCTGCTCA TGTGGCAGGG GTGCTCTCCC TTGAGGACGC ATGCACGCTG
GTCGCAACCC GAGCACGGCT CATGCAGGAG TTGCCGGCCG CGGGCGCGAT GGTGGCGCTC
CAAGCTTCCG AGCAGGAGGT CGAAGAGTCG CTGTCCAGCC ACCCCGGCGT GACCATCGCC
GCCGTCAACG GGCCCCGCGC GACGGTGGTC TCGGGCCACG AGGCAGAAGC GCTGGCGGTG
GCCGCGCACT TCGAGGCTCA GGGCCGCAAG TGCAAGCGCC TGGCCACCAG CCACGCCTTC
CATTCGGCAC ACATGGAGCC CATGCTCGAG GCGTTCCGCA GCGTCGCGAG CGGCCTCTCT
TTCCATCCGC CGCGTATCCC CATTGTCTCC AACGTTTCTG GCGCTGTGGC TTCCGCCCTC
GACCTCTGCT CACCAGAGTA TTGGGTCCGG CACGTACGCG CCGCGGTGCG CTTTGCCGAC
GGCGTGCAAA GCGCTGCAAA CCTGGGTGTC ACCTCATTCC TTGAGCTCGG CCCAGATGGT
GTCCTCTGTG CCCTGGGCCG CGACGCGGCT TCGGAGGTCT CTCCCGCCCC CCCGTCATTC
CTCCCGGGGC TGCGTGGGGC GCGACCGGAG CCAGACGCGC TCCTCGGCGC CCTGTCGGCC
CTTCACGCTC GCGGCCACAA CCCCGACTGG GAGGCCGTCT TCGAGCCCTG GGGGGCGCGC
CGCGTGCCGC TTCCTACCTA CCCGTTTCAG CGCGAGCGCT ACTGGATCGA CTCGAAGCGA
GCGCCCTCCG CGTCGAGCCA GGAGCTCGAA GGGGGGGGCC ACCCGCTCCT AGGCGCCTGC
ACGCGCCTTG CGTCCTCCGC GGAGGTTGTC TTCACCGGCC GTCTGTCGTT GGAGGACCAG
CCCTGGTTGG CAGGACATGT AGTCCTCGAC ACGACGTTGC TTCCCGCGAC CGCGTTCCTC
GAGCTCGCCT TCATGGCAGC GGATCGGATC GGGTTGTGCG CGGTGGACGA GTTCACGATC
GAACTCCCCC TGACGCTGCC CCCCGAAGGC GCGCTGCGCT TTCAGTTCAC GATCGGTGCG
GCTGACGAAA CAGGCCGCCG CACGATTTCG CTCTACGCAC GAGATGATCA GGCCGCTGGG
GACGCGCCGT GGACGCGGCA CGCCAGCGGC GCGCTCGTCG CCACGTCGGT GTCGCCGAAG
GCTGCGTTCG CGCTGCGCAC CTGGCCGCCC GCCAACGCAA GGCCGCTAGA CACGTCCGGA
TTCTACGAGC GGCTCGCACA GGCGGGCCTT CACTACGGCA GCGAGTTTCA GAATCTCCGC
GCGGCGTGGA CACTCGACGA AGAGCTGTTT GCGGAAGTCA GTCTGGCGAG CGAGTCCGCG
GTCGATACCG AGGCCTTTGG CCTCTATCCC GCCCTACTGG ATGCAGCTCT GCAGCTCCTG
GTGTTCGCCG GGCTAGAGCG GTCGAGTGAG CTTCTCCTCC CGCTGTCCTG GACGGGAGCT
CACCTGTACG CAACTGGTGC TTCGACCTTG CGCGTACACT TGACCCACCG AAAGGACGGC
GCGTTCGCCG TACGCATCGC CGACGGGGCT GGCGAGCCCG TCGCATTGGT GGAGGCGCTC
CATCTGCGTC CCGCCACGTC CAACTCCATC GAAGCCGGTC GAGCTCGTCC AAGCGAGCTG
CACTATCTGC AGTGGCTCCC GCTCCCCCAC GAGACTGCGG CGGCACTCCC GGAGCAGTCG
ATTGTCACGA TCTCGAGCGT GCATCAACTT CAGGCCGCCT TGGCCTCCGG CGAGCCGCTG
CCAGACGTCG TCGTTTTTAG CCCGGTCAAC GGGGAACGCG GCGAGCTCGC CAAGGCGGCT
AGCAACGCAA CCTGCGCGCT GTTGCAACTT CTGCAGACCT GGGTGCGCGA AGATCGGTTT
GCCGGTCGCA GGCTCGTCAT CTTGACCCGG GGAGCCTTGG CCACACGAGA CGGCGAAGAA
GTGGTCGACT TGGCGCACGC CCCTCTCTGG GGCCTCGCAC GCTCAGCCCA GTCTGAGTTC
CCAGATAGCG GCATCGTCCT GCTTGATCTG GACCGCGACG TCGGCTCCCT GAGCGACGTC
GTGACCGCAG CGTTGGCCAC CGGAGAGAGC CAGCTCGCCC GACGCGGGGA CGCCCTGCTG
GCGCCCTCGC TCACACGGCG GCAAACGCCC GCCCCAGGGC GAGAATCCTT CGCTTTCCCC
GAGTCGGCAA CCGTCCTCAT CACGGGCGGC ACCGGTACGC TGGGGGCCCT GTTCGCGCGG
CACCTCGTAC ACAACCATGG AGTTCGCCAC CTGCTCCTCG TGTCTCGCGC CGGCCGGGCG
GCGAGCGGCG CCGAGGCGCT CGAGGCCGAG CTACGCGCAG AGGGGGCCGA AGTCAGCCTT
GCCGACTGTG ACGTATCAGA CCACGCTGCG CTGCAGACGC TCTTGGCCTC GATACCAGAG
GAGCACCCCC TCGGCGCGGT CATTCACGCT GCGGGAGTCC TGGACGACGG CGTCCTCTCT
GCGCTCACGC CCGAGCGGTT GGCCACGGTT CTCCGCGCCA AGGTCGATGC CTCGCTCCAT
CTGCACGAGT TGACCCAGGC GCATGATCTC GCCGCCTTCG TGTTGTTCTC GTCTGTCGCC
GGTCTGCTGG GGAGCCTCGG CCAGGCGAGC TACGCCGCTG GCAACGCATT CCTCGACGCG
CTTGCGCAAC ACCGAGCAGC CAAGGGCTTG CCCGCCACCT CGTTAGTTTG GGGCCTCTGG
GACGAGCTTG GCACGATGAC CGCCCAGCTG AGCCAGGCCG ACCACAAGCG TATGGCGCGC
CAGGGGATGA CATCGCTCTC CGCCGCGGAG GGCACGGCGC TGTTCGATGC TGCGCTCGCG
CAAGCGTCGG AGCCCGGTCG TCTGCGACAG GCCGCTGTGG TGGCCGCGCG CTTCGATCTC
GTCGTGCTGG GCGCGCAAGA TCCCTCCACT CTCTCGCCTG TTATTCATGG CCTGATGCCC
GCACGGAAGC GCCGGGTGAC GACCGCTGCA GCTCAGGCAG CGGACTCTCT AGCCCAGCGG
CTTGCTGCCC TCTCCGAGGT CGAGCGCGAG CGCATGCTGG TCGACCTGGT GACCACAGAG
GCCGCGACCG TGCTCGGCTT CGGCTCCGGC GACGATATCG ATCCCGCTCG CCCCTTGCAG
GGGCTGGGTG TCGACTCGCT CATGGCGGTC GAGTTGCGCA GTCGCCTCGG CCAAGTGGTC
GGCCTGCGCC TGCCGGTCAC CTTGCTCTTT GACCACCCAA CCCCTGCGTC GATCGCCCAG
CGAATCCAGG CTGAGCTGCT GGGCGATGAG CACGCGGAGC GTAGTCCCGC GACCTCGGTC
GGGTCGGCCC GGGGCGACGA GGAGGATCCC ATCGCCATCG TCGCGATGGC CTGTCGTTAC
CCCGGCGGGG TAGCCACCCC AGAGCAGTTG TGGGAGCTGG TCTGTCAGAA CACCGACGCA
ATCTCTCCGT TCCCCGATCG CCGCGGTTGG CCACTCGACG ACCTGTTCGA CGCTGACCCC
ACCGCCCCGG GCAAGAGTTA CGTGCGCGAG GCAGGCTTTC TCCACGACGC GGATCTATTC
GACCCGACCT TCTTCGGGAT CAGTCCGCGC GAAGCTCTTG CGGTCGATCC TCAGCAACGT
CTGTTGTTGG AGACCGCGTG GGAGACTTTT GAGCGCGCCA GAATCATCCC GGCATCTCTG
CACGGCAGCC GCACCGGGGT CTTCGTCGGC GTCATGTACA ACGACTACAG CGCCCGACGC
ATGATGTCCC CAGACCAGCT AAATGGGCAC GTGTGGTTGG ACAGCGCCGG CAGTGTGGCC
TCGGGGCGCA TCTCCTACAC ATTCGGGCTC GAAGGCCCGA CGCTGACCAT CGATACCGCC
TGCAGCTCGT CGCTGGTCGC ACTGCACCTG GCTAACCAGG CGCTGCGTCA AGGCGAATGC
TCGCTCGCGT TAGCCGGCGG GGTGACCGTG ATGGCCACCC CCACCAACTT TATCGAGTTT
AGCCGCCAGG GTGCGCTTTC CCCTGATGGG CGCTGCCGCG CATTCTCCGC CGATGCGAAC
GGCACGAGTT GGTCAGAAGG TGCTGGCCTT CTCCTGCTTG CGCGCCTGTC CGAGGCCAAG
CGTCGAGGGT ACCCCGTGCT GGCCACCTTG CGAGGCTCGG CCGTCAACCA TGACGGCAGG
AGCCAGGGCC TGAGCGCCCC CAACGGCCCT TCGCAGCAGC GCGCGATCTT GCAGGCGCTC
GACGACGCGC GCCTGACGCC CCGCGACGTC GACGTCGTGG AAGCCCACGG CACGGGTACC
AGCCTCGGTG ATCCGGTCGA GGCGCAGGCC CTGCTCGCCA CGTACGGTCG CGAGCACTCC
GCCGAGGAGC CACTTTGGCT TGGCACGCTC AAGTCGAACC TAGGGCATAC GCAGGCGGCC
GCCGGCGTCG GCGGTGTGAT CAAGATGGTG CAGGCTCTGC AGCACGAGCG CCTACCTGCG
ACGCTGCACG CCGAACGCCC GTCCGACCAT GTCGATTGGT CCTCCGGCTC GGTCCGCCTC
TTGAACGAAT CCAGGCCCTG GACGAAGGGA ATTCGTACAC GTCGCGCTGC GGTCTCTTCG
TTCGGCATCA GCGGGACGAA CGCGCACGTC ATCGTGGAGG AGGCGCCGCC TGCGGGCCAG
CCGGCGAACG AGCGGGGGGT CGCGGGCGAG TCGCCCGTGA CCTACCCGGT GCTGCTCTCC
AGTCGTAGTG ACTCGGGTCT CAGGAGCCAA GCGCAGCGGC TATTGGACTG GGTAACCGAG
CGAGCCGACG TGGAGGTCGT GGACGTTGCG TACTCGTTGG CGACGACTCG CTCCCATTTC
GAGAGCCGCG CGGTGGTGTA TGCGCGCGAC CGGCAAGAGC TGCTCGCCTC GCTGCAGGCG
CTGACGCAAG GGGCGCCGGG GAGCGACGCG AGCAAGACGA GGGTGGGCCG AGACAGGCTG
GCCGTTCTGT TCACCGGCCA GGGCAGCCAG CGAGCGCGGA TGGGCGCCGA GCTGTCGGCA
CTCTATCCCG TGTTCCGGGC CTCGCTCGAA GAAGCGTGCG CTCTGCTCGA TCGCGAGCTG
GGAGTCGAGC CACCGCTGCT CGAGGTCCTG TCGGCCGATG ACGAGTCACC GGCGGGCAAA
CTCCTCGAGC AGACGATGTA CGCTCAGTGC GGCCTGTTTG CCCTGGAGGT CTCGCTGTTT
CGCCTGCTCC AGTCGTGGGG GCTCGAACCT ACCTGGCTGC TGGGCCACTC CATCGGCGAG
CTTGTCGCCG CCCACGTGGC GGACGTGCTC AGCCTCGAAG AGGCCTGTAC GCTGGTCGGC
GCACGGGCGC GCTTGATGCA GGCGCTGCCG GCGACCGGCG CGATGTACAC CGTGCAGGCC
TCTGAGCGCG AGGTGCTGGA GGCCCTTGCC GGCCATGGCG AACGAGCCGC CGTTGCCGCC
AGCAACAGCC CGACCTCCAC CGTCATCAGC GGCGACCTGC AGGTCGTACA GCAGGTCGCC
GCCGCGTTCG AGGCGCGGGA GCGCAAGACC GCCCGGCTGC GCGTCAGTCA TGCATTCCAC
TCCCACCACA TGGACGGGAT GCTGGCGGAG TTCGGACGCG TCGCGGAAGG CCTGCGCTAC
CGGGCGCCGC GCATCGCTAT CGTCTCGACC GCCACGGGAA CGCTGGCGCA ACCAGCGGAG
CTGTGCTCGC CCCAGTATTG GGTGCAGCAG GTCCGGGCCA CGGTTCGGTT CGGGGACGGC
CTTCAGACCC TGCAGCAAGA GGGGGCGCGT ACGTTCCTCG AGCTCGGCCC CCACGGGGTG
CTCACCGCGT TGGGCCAAGA GGCCCTGCCG GATGCCGAGG GCTTGGCCTT CCTCCCCACG
CTGCGCAGGG CTCGTTCGGA GTCGGCCACC ACGGCCGAAG CGCTCGGCGG CCTAGCTCGC
CGCGGCTACT CGCCCGACTG GGAAAGCGTC TTCAAGCCCT ATCGGCCTCA GAAAATCGCC
CTGCCCACCT ACGCATTCCA GCGAGAGCGC TACTGGGTGG ACGCAAGCGC CGCTCGGCCG
GGCGACCTCC GGTCCGCAGG GCTCCGCTCC GCCGAACACC CCCTCCTCGG TGCAGCCGTG
GCGTTCGCGG ACAACGACGG CGTTTTGTTC ACTGCGCGGC TCGCCCTCAC CGACCACCCC
TGGCTTGCCG GCCATCAGCT CTTTGGCACC GTCATCTTTC CCGGCTCGGG CTACGCCGAG
CTCGCCCTGG TCGCCGCGCG GCACGTCGGC CTCGACCGTG TCGAAGAGCT GACCCTGGAA
GCACCGCTCG CCCTCCCCCA GGTCGGGGCC ATCACGCTGC AAGTCTCCGT CGGCGCCGCA
GAGTCCGATG GACGACGCCC GCTCGGGTTC TACGCCCGCT CCGAGGCCAC CCCGGACGCT
GCCTGGACCC GCCACGCCAG CGGCTGGCTC GCTCCCGTGG CGAGCGACTT CCCCTTCGAG
TTCCGAAGCT GGCCACCGAC GGGTGCCAGC GCCGTTTCGC TCGACGGCTT CTACCCAGAC
CTCGCCGACC TCGGCTTTCA CTACGGCCCC GAGTTCCGCG GCCTCCAGGC CGTCTATCGG
CGCGGAGATG AACTCTTTGC CGAGGTCGCC CTGCCCGAGC CCATCGCGCA TACCGCCTCT
CAGTTTGCCA TTCACCCCGC GCTCCTTGAC GCCGCGCTGC ACCTGGTCAA TCTGATTCCG
AGCTTCAAAG GGGTGGGACT GAGCCTGCCC TTCTCGTGGA GCGGCGTCTC GCTTCGTTCC
GGCAGCCAGC GCTTGCGAGT ACGCGTGACA AGCCGCGGCG ACTCTGCGAT CGGACTCCAG
ATTGCGGACG ACCGCGGAGA GCCGCTAGCC GTCGTCGACA CCCTATCCCT GCGCGCGACC
TCCGCGGTAC AGCTGCGCAG CCTGTTGGGC TCCCAGCACG ATGGCTTGCT TCGCCTCGAA
TGGACCACTC CCGAAGGTGG CGCCTCCGCG CGGCTGCCAA GCCACGGAGC GCTGCTCGGT
GACGACTCTC TCGGGCTGGC CTCTCTGATC GCGGAGAGCG GCCTCCAGCT CACGCACTAT
CTGACACTCG AGGCTCTTCA AGAGGCCCTG GCGCAGGGCC TGCCCGTGCC AGACGTGGTG
CTCGTCCCAT CGCTGTCCCC TGAACTCTCT GGCCTTGACC CGATCTCCAC ATCCCATTCG
GCGGCCGTCG AGGCCCTCTC GTTTTTGCAG ACGTGGTTGG CCGACGAGCG ACTGGGCGCT
GTCCCTCTCG CTCTGGTCAC GCGGAACGCG GTCGCCGTCC GTCCTGACGA AGCCGTGCAG
GACCTGGGTC ACGCCGCGCT GTGGGGTCTT GTACGCTCCG CGCAGAGTGA AAACCCCAAC
CACCCCATCC TCCTGCTCGA TCTGGACGGT CAGGACGCCT CGTCGCGCGC ACTGCGGGGA
GTTCTCGGAA CCCGCGAACG GCAACTCGCC CTGCGCAGCG GCCGGCCGCT CGCGCCCAGG
CTGGTGCGCG TGGCGCCCGC GCGAGAGGCA CATCCGTTTC CCGACGAGCA CGGCACGGTC
CTCATCACCG GCGGAACAGG CGCGCTCGGC GGTCTGTTGG CGCGGCACCT GGTGCGCACT
CACGGGGTGA AGCATCTCGT CCTAGCCTCT CGACGCGGCC CAGAGGCCGA GGGTGCCTCG
GCCCTCAAGC GCGACCTCGA GGGCGAGGGC GCCAGCGTCC GGATGGTGGC TTGCGACATT
TCGGTCCGCC AGGCGCTCTC TGCGTTGCTC GACTCGATTC CTGCCGAGCA GCCCCTCACG
GCGATTGTGC ATGCTGCGGG GGTCCTCGAC GACGGAGTAC TCGGTACCCT TACGAACGCT
CGGCTTCAGA GCGTCCTTCG CGCCAAGCTG GACTCGGCCT GGCATCTCCA CGCGCTCACG
GAGCACCTCG AGCTATCCGC GTTTGTGCTG TTTTCCTCGC TCGCAGGGGT GATGGGCACG
CCGGGTCAGG CCAACTACGC CGCCGCCAAC GCCTTCCTCG ACGCGCTGGC TCACCAACGC
AGAGCCCGCG GTCTCGTCGC TCTATCGCTC GACTGGGGCT ACTGGTCCGA GCACAGCGCC
ATGACCGCGC ACCTCGGCGA GGCGGATCTA CAGCGCATGG CCAGCAGCGG CATCCTCTCC
CTCTCCGCGA GCGAAGGCTT GGCGTTGTTC GACGCTGCCC TCGGCCTATC ACACGCGGCA
GTCGTCGCCG CTCGCTTCGA CACCCGCGCG CTTCGGACTC TCGGCAAGGC GCTCCCCGCG
CTGTTCCGCC GACTGGTGCC AGAAGCTCCA ACCTCTCGGG GCGAGGCCTC CGACGCTGAG
GCTTCGCAGC AACGTCAGCT CCTGGCGAAA TCCCCGGAGG AGCGCGAGCA AGCTGTGTTG
GATCTCGTGC GGGGTGAGAT CGCTCGGATC CTACACTACT CGGCCGCGAC CACGCTCGAT
ACCGAGCGCA CGCTGCAAGA GCTCGGCCTC GATTCGCTGA TGGCTGTCGA GCTGCGCAAC
GCGCTTGGAA GCCGGCTCGG AATGGCCCTG CCCGCGACGC TGGCTCTCGA CCACCCTTCG
CAGCAAGCAC TCACCAGCTA CTTCGTTCAG CGCCTCTCGC AGCTGACCGC GACCGATCAG
GAGCGCGATT CGGCCTGGGA TCTCATCGCT GGCGCCACTG AATCCCCTCG CTCGACACGG
ACCGACGAGG TTCGGCTCGC TCCGCTATCT CACGGGCAAG AGCGCCTTTG GTTCCTAGAT
CGCTTGGCGC CCGACAGCCG CCAATACAAC GAGCTGCTCG CGCTAGAGCT TCACGCGGAG
CTCGATCTCG ACCTGCTGCG GCGATGCCTT GCCGTCCTGG TCGCCCGCCA TGAGTCGCTC
CGCACCACCT TGCCCGAGAT TGTGCGCACG CCGGGCGGCG CCGAGGTACC CAGCGCTCTG
ATCGCTCCGG GCGCTCCCAT TTCGCTCGAG GTGGTGGATC TGCAGGAGCA GGGCGTCGAG
TCGGACTTCA CCCGTCTGGT CGCGCAGTTC CGCAACCGCC CATTCCATCT CCACAAGGGC
CCGCTGTGGC GCTCCCTCGT GGTGACGCAC GCGGAGCGCC GGCACAGCCT TCTGTTCGCC
AAGCACCACA TCATCACGGA TGCCAGCTCC CTGGGCATTT TCGGTGAAGA GCTATCGCGC
CTGTACCGCA GTGGTGGAGA CCCGCGCGTG CTGCCGGAGA GACGCTTCGG TTACTCGGAC
TTCGTTCGCT ACGTGCGTGC GCGCGCAGCC GACCCAAGCC ACCAAGCACG CCTCGCGTGG
TGGCGTGACC GGCTCGCCAA CTTGCCACGC CTGGAGCTGC CCTACCGCGC TCAGACAGGT
ACCACCGCGC CCAGCCATCA GGGCGACGCT GTCCCTATCG AGCTATCCCT CGTACAGAGC
CGGGCCGCCC ACGACCTCGC GCGCCGTAAG GGCGTGACTC TATTTGCCGT GCTGTCGGCT
GCCTGGGCCT GCGTTCTCCA GCGCTACTCG GGCCAGTCGG ACTTCGCGAT CGGCACGGTA
GTCGCGAACC GTGGACGCGC TGAATTCGAC GGCGTTCTCG GCTTCTTCGT CAACACAGTC
GTCCTGCGCT GCGACCTATC CTCGAACCCG AGCTTCTCCG AGCTGGTCCA GCGCATGTCG
GACACCACTC GGCAGGCGCT GCAGTACCAA GACGTAGATT TCGGACAGGT CGTCCACGGT
TATCAGGGGG AACCCAGCCA GGGCCTGAAC CCGATCGTCC AGACCACGTT GAACCTGTAC
CCGGCTTTCT ACTCCGCGCA GACCCCCGAC GCGTCGGGAA TCATCTGGGA CGAGCAGACA
TCCTTTCCCA TCCCCGCGGC GAAGTTCGAT CTGGCGCTCG AGTTCATCGA TCGGGAGGAG
GGTCTGAAGG GCAAGCTCGA GTACGCCACG GATCTGTTCG AACGGGCCAC GGTCGAACGC
ATGGTGGGCC ATCTGAAGGC CCTGCTCGAG GCGGCGCTGG CAAATCCCGA CGCGGCGAGC
GGGAGCCTCG AGATGCTCAC GGCCAGCGAA CGCCGTCGGA TTTTGGTGGA GTGGAACGCC
ACCGCGCGAG ACTATCCCGA GGACACCTGC GTCCATGAGC TGTTCGCGCA GCGGGCGGCC
GAGACCCCAG ACGCTGTAGC CCTGACGTTC GGGGACCGGG CGCTCAGCTA TGCAGAGTTG
GAGGCGCGGG CGAACCAGCT AGCGCGGCAC ATTCGCGCGC GCGCGCTGCG TCTGGGCGTC
CGCGTCGGCC CTTCCGTGCT CATCGGTCTA TGCCTGGAGC GCTCCTTGGA GATGGTCATC
ACCGTGCTGG CCATCGCGAA AACCGGCGCG GCCTACATGC CGCTCGACCC GGCCTACCCC
TCCGAGCGCC TCGCCTTCAT TCTGGACGAC TCGCAGACCG CGTTGATCGT GACCCAGAAG
GCGTTCGCGG GTCGCCTGAG CGACCACGCC GAGCGCTTGC TTCTGCTCGA TGCGCAGCCG
GAGGAACTCG ACCGCTACGA GCGCACCCCG CTCGAATCCA CGGTGTCCCC GTCAGACCTC
GCATACGTCC TCTACACGTC GGGCTCTACG GGAAAGCCCA AGGGGGTCGA GGTCAATCAC
CGCGGCCTGA CCAACGTGAT CTGGGACTGC GCACGCGAGC TGAAGGTCGG CGCCGAGGAC
ACCCTGGCGG CTGTCATATC CATCGCATTC GACATGTCGG AGCTAGAGTT CTGGATGCCG
CTCACGCATG GGGCCACCTG TCGCGTGCTC CCGCAGGAGG CGTTGGTAGA TGGCTATCGA
CTCAAGGAGG AGATCGAGGG CGCGACGATT GTCCAAGCCA CGCCGGCTAC CTGGCACGTG
CTGCTGGAGG CTGGCTGGCA GGGAGCCCCG GGCCTGCGTG CGATGGCGGG CGGCGAGGCG
CTCTCGCTGC AACTGGCGAC CCGACTCGCC GAGCGCACGC TGTGCGTCTG GAACGGGTAC
GGGCCCACCG AAGCCACCAT CTACGCGAGC CTATGGGCCG TCGACCCGGG GCGCGGTAGC
GTGAGCCTCG GGCGTCCAGT CGCCAACACC CGAATCTATG TACTCGACGA ACACCACAAT
CCGGTGCCGG TCGGTGTTCC CGGTGAGCTC TACATCGCCG GCGTGGGCCT CTCGCGAGGC
TACCGCGGAC GCGCCGATCT AACCGCCGAA CGCTTCGTGC CAGATCCATT CGCTTCGGAT
CCCCAAGAAC GCATGTACCG GACCGGCGAT CGCGTGCGCT GGTGTGAAGA CGGCACGCTC
GACTACCTCG ACCGCCTCGA CCATCAGGTC AAGATCCGTG GCGTCCGCGT AGAGCTTGGG
GAAATCGAGC ACGCACTCAT GGAGCATCCT GCCGTCATAC GTGCCGTTGT CGTGATCACC
AAGAAGGGTC TCGACGCTCG CCTGGCCGCT TACTACGTAG CCGTCGATGG ATTCGAACTC
AGCTCCGAGC AGCTGCGTCG CCACCTGAGG GCGTCGCTGC CGGAGGCCAT GATCCCGGGC
AGCATCGTCT GTTTGCCAGA GCTGCCGATC AACCCCAACG GGAAGGTGGA CCGACGACTC
CTGTCCAACC GGGTGAACCC GCAGGTCGAG GAAGTCGTCG CGCCCCGCGC GCCACAGAGC
GACCTCGAAC AGGCGATCGC CCAGGTCTGG CGTGAGGTCC TCGCGTGCGA CTCAATCGAT
GTAGGGCGGA CGTTCTACGA GCAGGGGGGC AGCTCGATTC AGCTGGTTCA CGTGCAGCGC
CAGCTTCGCG ACAGTCTGCA GGTCGAACTC AGTGTCGCGG AGCTGTTTGC GTATCCAAGC
GTCGAAGCGC TCGCCGACTA CCTGCGTCTG CGCAGAGCGG AGCGCGCGCG ACCCACGCAA
GCGGATCCGG TCGTCGACCA GCCCGACAGC GAAGCGCGCG ACCTCGATGC ACTGGCGACC
GCGGACCTCT ATCGGAACGT CAGAGCACGT TTGCAGGCCG CGCTTTCCGA CTACGACCGG
GAAGGCTGA
 
Protein sequence
MAVDTACSSS LVTVHLACES LRRGESTVAL AGGVNLNLIA ESALEMSKFG GLSPDGRCFT 
FDARANGYVR GEGGGVVVLK RLSHALMDDD PIICIIRGSA TNNDGASNGL TAPNPRAQEA
LLRLAYRQAG TDPGEVQYVE LHGTGTPLGD PIEAAALGSV LGQARSSQNP LLVGSAKTNV
GHLEAAAGVA GLLKVALCLQ NKQLAPSLHF DTPNPHIPLA DLNLRVVDDL TTWPAPGRPT
VAGVSSFGMG GSNCHVVVSE HLESQAEIVA LTAPSAEELE AKARGWAEAL AKTSPADSAA
MLCARASAEP VDGDHRLAVT GRSAALLSQR LLEFATGDAR LGVSVGCVVP GTSPRVAFVF
GGQGAQWFGM GTQLLRREPV FRRSIERASS LIQQHLGWSL LEELTAPRER SRLDSVAVSF
PAIVAFEIAL ASLWQSWGIR PAAVLGHSIG EVAAAHVAGA LSLEDAMLVI CAYARGLERF
RGRGAMGLVA LSWQDVAQAL ERHEGRLFRA IQMAPDSTVV AGEPESLLEL LDELESQGTF
CRRIATDAAP HCPLVDGLRD ELRTALRALR PRAGDIPWIS EVTGAQLDGE SLGTSHWVRN
LCDPIRFGDA LEHLVNQGPD VFVEVSPHPV VLPVIETYLR RADRTGEAVP TLRRDEDEGE
AMRDALGALF VRGASSHWDA VHGCAAPHGT SSDGFLPAPI LLSGGTPAAL RAQAMQLHTL
LTAKESLRVQ DVAYSLAVTR THFEKRASWL ATSREGLLGA LDKVARGVPV ASVTVGEARD
TGKLGFLFTG QGSQRPGMGR SLYRAFPNFR SALDAVCDEL DPHLPRPLRE LLFCTDGSPE
AALLGQTGFT QPALFALEVS LFRLLEAWGL VPDVLLGHSV GELAAAHVAG VLSLEDACTL
VATRARLMQE LPAAGAMVAL QASEQEVEES LSSHPGVTIA AVNGPRATVV SGHEAEALAV
AAHFEAQGRK CKRLATSHAF HSAHMEPMLE AFRSVASGLS FHPPRIPIVS NVSGAVASAL
DLCSPEYWVR HVRAAVRFAD GVQSAANLGV TSFLELGPDG VLCALGRDAA SEVSPAPPSF
LPGLRGARPE PDALLGALSA LHARGHNPDW EAVFEPWGAR RVPLPTYPFQ RERYWIDSKR
APSASSQELE GGGHPLLGAC TRLASSAEVV FTGRLSLEDQ PWLAGHVVLD TTLLPATAFL
ELAFMAADRI GLCAVDEFTI ELPLTLPPEG ALRFQFTIGA ADETGRRTIS LYARDDQAAG
DAPWTRHASG ALVATSVSPK AAFALRTWPP ANARPLDTSG FYERLAQAGL HYGSEFQNLR
AAWTLDEELF AEVSLASESA VDTEAFGLYP ALLDAALQLL VFAGLERSSE LLLPLSWTGA
HLYATGASTL RVHLTHRKDG AFAVRIADGA GEPVALVEAL HLRPATSNSI EAGRARPSEL
HYLQWLPLPH ETAAALPEQS IVTISSVHQL QAALASGEPL PDVVVFSPVN GERGELAKAA
SNATCALLQL LQTWVREDRF AGRRLVILTR GALATRDGEE VVDLAHAPLW GLARSAQSEF
PDSGIVLLDL DRDVGSLSDV VTAALATGES QLARRGDALL APSLTRRQTP APGRESFAFP
ESATVLITGG TGTLGALFAR HLVHNHGVRH LLLVSRAGRA ASGAEALEAE LRAEGAEVSL
ADCDVSDHAA LQTLLASIPE EHPLGAVIHA AGVLDDGVLS ALTPERLATV LRAKVDASLH
LHELTQAHDL AAFVLFSSVA GLLGSLGQAS YAAGNAFLDA LAQHRAAKGL PATSLVWGLW
DELGTMTAQL SQADHKRMAR QGMTSLSAAE GTALFDAALA QASEPGRLRQ AAVVAARFDL
VVLGAQDPST LSPVIHGLMP ARKRRVTTAA AQAADSLAQR LAALSEVERE RMLVDLVTTE
AATVLGFGSG DDIDPARPLQ GLGVDSLMAV ELRSRLGQVV GLRLPVTLLF DHPTPASIAQ
RIQAELLGDE HAERSPATSV GSARGDEEDP IAIVAMACRY PGGVATPEQL WELVCQNTDA
ISPFPDRRGW PLDDLFDADP TAPGKSYVRE AGFLHDADLF DPTFFGISPR EALAVDPQQR
LLLETAWETF ERARIIPASL HGSRTGVFVG VMYNDYSARR MMSPDQLNGH VWLDSAGSVA
SGRISYTFGL EGPTLTIDTA CSSSLVALHL ANQALRQGEC SLALAGGVTV MATPTNFIEF
SRQGALSPDG RCRAFSADAN GTSWSEGAGL LLLARLSEAK RRGYPVLATL RGSAVNHDGR
SQGLSAPNGP SQQRAILQAL DDARLTPRDV DVVEAHGTGT SLGDPVEAQA LLATYGREHS
AEEPLWLGTL KSNLGHTQAA AGVGGVIKMV QALQHERLPA TLHAERPSDH VDWSSGSVRL
LNESRPWTKG IRTRRAAVSS FGISGTNAHV IVEEAPPAGQ PANERGVAGE SPVTYPVLLS
SRSDSGLRSQ AQRLLDWVTE RADVEVVDVA YSLATTRSHF ESRAVVYARD RQELLASLQA
LTQGAPGSDA SKTRVGRDRL AVLFTGQGSQ RARMGAELSA LYPVFRASLE EACALLDREL
GVEPPLLEVL SADDESPAGK LLEQTMYAQC GLFALEVSLF RLLQSWGLEP TWLLGHSIGE
LVAAHVADVL SLEEACTLVG ARARLMQALP ATGAMYTVQA SEREVLEALA GHGERAAVAA
SNSPTSTVIS GDLQVVQQVA AAFEARERKT ARLRVSHAFH SHHMDGMLAE FGRVAEGLRY
RAPRIAIVST ATGTLAQPAE LCSPQYWVQQ VRATVRFGDG LQTLQQEGAR TFLELGPHGV
LTALGQEALP DAEGLAFLPT LRRARSESAT TAEALGGLAR RGYSPDWESV FKPYRPQKIA
LPTYAFQRER YWVDASAARP GDLRSAGLRS AEHPLLGAAV AFADNDGVLF TARLALTDHP
WLAGHQLFGT VIFPGSGYAE LALVAARHVG LDRVEELTLE APLALPQVGA ITLQVSVGAA
ESDGRRPLGF YARSEATPDA AWTRHASGWL APVASDFPFE FRSWPPTGAS AVSLDGFYPD
LADLGFHYGP EFRGLQAVYR RGDELFAEVA LPEPIAHTAS QFAIHPALLD AALHLVNLIP
SFKGVGLSLP FSWSGVSLRS GSQRLRVRVT SRGDSAIGLQ IADDRGEPLA VVDTLSLRAT
SAVQLRSLLG SQHDGLLRLE WTTPEGGASA RLPSHGALLG DDSLGLASLI AESGLQLTHY
LTLEALQEAL AQGLPVPDVV LVPSLSPELS GLDPISTSHS AAVEALSFLQ TWLADERLGA
VPLALVTRNA VAVRPDEAVQ DLGHAALWGL VRSAQSENPN HPILLLDLDG QDASSRALRG
VLGTRERQLA LRSGRPLAPR LVRVAPAREA HPFPDEHGTV LITGGTGALG GLLARHLVRT
HGVKHLVLAS RRGPEAEGAS ALKRDLEGEG ASVRMVACDI SVRQALSALL DSIPAEQPLT
AIVHAAGVLD DGVLGTLTNA RLQSVLRAKL DSAWHLHALT EHLELSAFVL FSSLAGVMGT
PGQANYAAAN AFLDALAHQR RARGLVALSL DWGYWSEHSA MTAHLGEADL QRMASSGILS
LSASEGLALF DAALGLSHAA VVAARFDTRA LRTLGKALPA LFRRLVPEAP TSRGEASDAE
ASQQRQLLAK SPEEREQAVL DLVRGEIARI LHYSAATTLD TERTLQELGL DSLMAVELRN
ALGSRLGMAL PATLALDHPS QQALTSYFVQ RLSQLTATDQ ERDSAWDLIA GATESPRSTR
TDEVRLAPLS HGQERLWFLD RLAPDSRQYN ELLALELHAE LDLDLLRRCL AVLVARHESL
RTTLPEIVRT PGGAEVPSAL IAPGAPISLE VVDLQEQGVE SDFTRLVAQF RNRPFHLHKG
PLWRSLVVTH AERRHSLLFA KHHIITDASS LGIFGEELSR LYRSGGDPRV LPERRFGYSD
FVRYVRARAA DPSHQARLAW WRDRLANLPR LELPYRAQTG TTAPSHQGDA VPIELSLVQS
RAAHDLARRK GVTLFAVLSA AWACVLQRYS GQSDFAIGTV VANRGRAEFD GVLGFFVNTV
VLRCDLSSNP SFSELVQRMS DTTRQALQYQ DVDFGQVVHG YQGEPSQGLN PIVQTTLNLY
PAFYSAQTPD ASGIIWDEQT SFPIPAAKFD LALEFIDREE GLKGKLEYAT DLFERATVER
MVGHLKALLE AALANPDAAS GSLEMLTASE RRRILVEWNA TARDYPEDTC VHELFAQRAA
ETPDAVALTF GDRALSYAEL EARANQLARH IRARALRLGV RVGPSVLIGL CLERSLEMVI
TVLAIAKTGA AYMPLDPAYP SERLAFILDD SQTALIVTQK AFAGRLSDHA ERLLLLDAQP
EELDRYERTP LESTVSPSDL AYVLYTSGST GKPKGVEVNH RGLTNVIWDC ARELKVGAED
TLAAVISIAF DMSELEFWMP LTHGATCRVL PQEALVDGYR LKEEIEGATI VQATPATWHV
LLEAGWQGAP GLRAMAGGEA LSLQLATRLA ERTLCVWNGY GPTEATIYAS LWAVDPGRGS
VSLGRPVANT RIYVLDEHHN PVPVGVPGEL YIAGVGLSRG YRGRADLTAE RFVPDPFASD
PQERMYRTGD RVRWCEDGTL DYLDRLDHQV KIRGVRVELG EIEHALMEHP AVIRAVVVIT
KKGLDARLAA YYVAVDGFEL SSEQLRRHLR ASLPEAMIPG SIVCLPELPI NPNGKVDRRL
LSNRVNPQVE EVVAPRAPQS DLEQAIAQVW REVLACDSID VGRTFYEQGG SSIQLVHVQR
QLRDSLQVEL SVAELFAYPS VEALADYLRL RRAERARPTQ ADPVVDQPDS EARDLDALAT
ADLYRNVRAR LQAALSDYDR EG