Gene Hoch_0798 details

Gene Information

Locus tag: Hoch_0798
Symbol:
ID: 8543180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 1031549
End bp: 1044100
Gene Length: 12552 bp
Protein Length: 4183 aa
Translation table: 11
GC content: 73%
IMG OID: 646385572
Product: amino acid adenylation domain protein
Protein accession: YP_003265307
Protein GI: 262194098
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
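
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Hypothetical helpers for checking the record's length fields.
# Assumption: coordinates are 1-based and inclusive on the replicon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(1031549, 1044100)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (12552 bp) and Protein Length (4183 aa) fields.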


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCACG CTTCTCCTCC CCGAGATGTC GTGCTCGAGA CGAGTGCGGC CCAACGACGC 
ATCTGGACTC TTGAGCAACG TTCGTCCGGC GGCGGCTACA ATGTCTGCTT TGCCATTTCC
CTGCCGGTGG GGACCTCGGA GCAGGCGGTG CGCGACGCAC TTCGCGCGCT CATGGCGCGC
CACCGTTCGC TGCGCTCGAC GGTTCGTCTC GTCGCAGGAA CCCTGCGTAT GGTTCCCGAA
GAGCCGGAGC CATCCCTCGA TGTCGTGGAT CTGCCCGAGC AGGCGCTCGC CGAGCAGTAC
ACGCGGATGG CCACACGACG GCTCGACCTG GCGGTGGATC GGCCGCTGCA GTCCGTGCTG
ATTCGCCATC CGGACGGTCT GCGTCTGGTG GTTCTCGACC ACCACGTACG CATCGACGGC
ACCACCTTGC CCATCCTCTG CCGCGAGCTG TGCCTGCTGA TCCGCGGCGA CGAACTGCCG
CCGCTGGATT TGGATTATCA GGACTACGTG GAGGCAGAGG CGCAGTGGCT GGCGAGCCCC
GAGGCACAGG CGAGTCGCGA TTGGTGGCGC GCGCACCTCG AGGGCCTGCC GGCGTCGGCG
TACCCGGGCG ACCGTCCGCG TCGCGGTGCC CCATCCGGAC GCGGCGCGCT GGTGCGCGTC
GACCTACCGT CCGAAGAGAG TCGGTCCCTG GTGAGGCTCG CGCGGGAGCG CGGCTGCACT
CCGTTTCGGG CTCTGGTCGC ATGTACGCAG GCGCTGCTTC TGCGCAGCAC CGGGGTGGAG
GAGTTGCCGG TCGGCATCAC GACCCATGGC CGCTGGGACA AGCGGTGGCG ACCGCTCGCG
GGCATGTTTA CCAATCCGCT GCTCCTGCGT CTGCCGGTCT CCGCCGACGA CTCCCTGGCG
TCGCTGCTCG GGCGCTGTCA GGGCGCGATG GACGAGGCCT TGGCCCACGC CCGGTTGCCG
TACGACGAGG TCGCCCGCGT CGCACGAGAT GCCTGGGGGA TTTCGGAGCC GTTCGCGGTG
ATGCTCGGCA GTCAGGTGGC ACCGCACGCC ATGGACCTGC CCGAGGGCTT CGCGGTCGAA
CTGCACGCGC TCGATACCGG CGCTCCGGGC AGCAGTCGTA TGGACCTCAA GCTCAGCTCC
GCGCCCCTCG AGGACGGCAG CCTGCGGCTC GAGTGGGAGC TCGACACCGA TATCCTGCGC
GAGCAGACGG TGCACAGCCT GGGCGAGAAT CTGGTGCGCC TGGTGCGCGC CGGCCTCGCG
ACGCCCGAGC GCCCTATCGG CTCACTCGAC CTGCTGTCCG CAAACGAGCG GCGCTCCGTT
CGCGCGCTGG CGACGAATGA CGACACGCCG AGACCGCTCG GCGCTGTACT CGACCAGGTG
TCCGCGTGGC CGGCGGAGTC GCTCGCCATC AGCGCGGGTG AGGAGCGGAT GAGCTACGGG
GAACTGGTGG CCCACGCGCG TCGGGTGGGC GGCGCGCTCG CGGCTCGCGG CGTGGGTCCG
GGCGAGGTGG TCGGGCTGCT CCTGCATCGC CGGCCCGCGG CCATCGCGAC CATGCTCGGC
GTGATGGCTG CGGGGGCGGC GTGGCTGCCG ATCGAGCCCG ATCTGCCGGC GCAGCGCATC
GCCCAAATGA CCGCGGAGGC CGACGCTCGC TTCGTCATCG CGGACGAGGA TTTGCGCCAT
CTGCTGCCCG ACGGGGTGGA GGCTCTCGCG CCCTCGCTCG CGGCCGAGCC GCTGAGCGAG
TGTCGCGGAG CGCCGAGCGA TCCCGCGTAC TTGCTGTACA CCTCGGGCTC GACCGGGCGT
CCGAAAGGCG TGCTCGTGCC GCGCTCGGCG CTGCGTCATC TGCTCGCGTA CGCGGGCGAG
CTGTTCGGGA TGAAGCCCGG CGTCACCGTC GCCGCGCTGG CGACGTGGAG CTTTGACATC
GCGCTGGCCG AGCTGCTGCT GCCGCTGGTC CACGGCGCCA GCGTGCGCCT GTTGGATCGC
TCCCTTGCCC TCGATCCGCC GGCGCTCGGC GCGGCCCTCC AGAGGGTCGA TGTCGCCCAG
GCGACGCCCA CGACCTGGGC CCTCCTGGTG CGCCGGGGCT GGCGTCCGGA GGCGCCGCTC
ACGCTCAGCA GCACGGGCGA GGCGCTGCCG CCGGATCTGG CGCGCGCGCT GTGTCGCGAC
GGCGTGCGGC TGCTGAACCT GTACGGCCCA ACCGAGACCA CGGTGTGGGC GAGCGGCAGC
GTGGTGGATC CGGAGCGCCT CGACATCGGT CGTCCGGTGC CGGGTCTGCG CTGCCTTGTG
CTCGACCGCG AGGGCCAGGT GGTTCCGCCT GGCGTGATGG GCGAACTGCA CATCGGCGGC
CCCTCGCTCG CGCTGGGCTA CCTCAAGCGG CCCGAGCTCA CCGCGGAGCG GTTCATCGCC
GATCCGGAGA CGCCCTCCGA GCGCCTGTAT CGCAGCGGCG ACCTCGCGCG CATGAGCGCG
GATGGCCGCA TCGAGTGCCT GGGCCGCATC GACGAGCAGC TCAAGATCCG CGGCCACCGC
ATCGAGCCCG GCGAGATCGA GGCCGCCCTG CGCGAGCACC CGGCGGTGAG CGAGGCCGCG
GTGGCGCCGC TGCGCGACCC CGAAGGCGGC GATCGCCTGG TGGCCGTCTA CGTGTGCCGC
GGCGCGGACC CGGGGGAGCG CGCGCTCCGC GATAGTCTGG CCGCGCGGCT TCCCAAGTGG
ATGGTGCCGG CTCGGATGAG CGCGGTAGCT CTGCTGCCGC GGACCTCGAG CGGCAAGGTC
GATCGCAACG CCATCGTGTC GCTGTTCGCC CAGGTGAGCG TGCAGCGCGC GGCGGATGCG
GGTCTGGTGT CGCGCATCGC CGATACCTTT GCCCGGGTGC TCGGCGTGGC TTCGGTCGAG
AGCGAGCGCA GCTTCTTCGA GCAGGGCGGA ACCTCCATGC AGCTCGTCGT CGCGCGCGAG
CGATTGACCG AGATGGGGCT TGAGGTCACG GTCGCCGACC TCTTCGATCA CCCGAGCCCC
GAGCGCCTCG CCGCCTACCT CGGCGGCTCG CGCGCCGCGA TCCGCGAGGT CCGCGAGCAG
ATCGAGCCGG TCGCGATCGT CGGTCTGGCC TGCCGCTTCC CGGGCGGGGT CACGGACTCC
GCCAGCTTCC TGTCGCTGCT CGACCAGGGC CGCGACGCGA TCACCGAGAT CCCTCTGTCC
CGCTGGGATG CGGACGCGCT GTACGACCCC GAGCCGGGGC GCCCGGGCCG GTTGCCCACG
CGCTGGGGCG GCTTCCTCGA GGATGTCGAG TACTTCGATC CCGGCGCGTT TGGACTGAGC
CCGCGCGAGG CCCGCGCGAT GGATCCGCAG CACCGCTTGC TGTTGGAGCT GGGCCAGGAG
GCGGTGCTGG CCGCCGGCTA TCAACCGGCT GAGTTCGCCG GTCGCGAGGT CGGCGTGTAC
GTGGGTCTGT GCGGCACGGA CTACCAGGGG CGCGCGGTGC AGCGGCCCAC GCTCGACGCC
ATCGACCCGC ACGCGGCCAC CGGCAGCGCC CACAGCGTCG CCGCCGGCCG GGTGGCCCAC
ATCTTCGATC TCCGCGGCCC TGCGGTGGTC GTCGATACGG CCTGTTCCTC GTCGCTCGTG
GCGGTGCATC TCGCGGTCTC GGCGCTCCGT GCCGGCGAGT GCGAAGCCGC GCTGGTCGGC
GGCGCCAACG TGGTGCTGTC GCCGCGCTGG GGCGCCGGGT TTGCGAGCCT CGGCTTTCTG
TCGCCCTCCG GCCGCTGCTC GGCCTTCGGC GCCGAGGCCG ACGGCTACGT GCGCAGCGAG
GGCGCCGGCA TGCTGCTGCT CAAGCCGCTG TCCGCGGCGC TCGCCGACGG CGACACGGTC
CACGCGGTGA TTCGCGGCAC GGCCATCAAC CACGACGGCC GCGCGGCCAG CCTCACCGCG
CCCAGCGGCC TCGCGCAGCA ATCGGTCATC CGCAGTTGCC TTGAGCGCGC CGGCCTCGAG
CCCGGCGATA TCGACGTGGT CGAAGCCCAC GGCACGGGCA CCGAGCTCGG CGATCCGATC
GAGGTCCAGG CGCTGGCCAC TGCGCTCGGC GAGGGCCGGG AGCATCCGCT GCTCGTCGGT
TCGGTGAAGT CCAACCTGGG TCACTGCGAG GGCGCCGCGG GTGTCGCCGG CATGATCAAG
GCGGTGCTGG CCGTGCGCGA GGGGCGCGTG TTCCGGACGC TGCACGTGGA CTCGCTCAAC
CCGCACGTGC CGTGGCAGCA GATGCCGCTC GAGGTCGCCG GGGAGGCTCG GGCCTGGCCA
GAGACTGGCC GCCCGCGCCG CGCCGGCGTC AGCGCCTTCG GCTTCTCGGG GACCAACGCC
CACGTCATTC TCGAGCAGGC GCCCAACCAG GACGCGAGCG CCCTGCGCGC CGCTCTCCCC
GCCACTGCCT ATGCCCGGCG ACGCCTGTGG CTCGACGAGC CCATCGACGG CCACGGGGTG
ATCGAGCACC CGATGCTGCG CTCGCGCCAG ACGCTGGCTG ACGGCCGCGA GATCTTCGAG
GGACGGGTGT CGCTGGCCCT GTTCCCCGAG CTCGGCGACC ACCGCGTGCG TCAGGACGCG
GTGCTGCCGG CCACCTGGCT CATGGAGTTG GGACGCGCCG CCGGCGCGGA GGTGCTCGGC
TCGGCGACCA TCACGCTGTC GGAGATCGGC CTGTTCGCGC CGGTCGTGAT CCCCGAGCGC
GGCACCCTGC GCCTGCAGGT GCTGCTCGAG CCGCTCGCCG ATGCGCAGCT CGGTTGGATC
CTGGCGTCGC GTCGCGAGGT CGGCGAAGAC GGCTGGACAC GGCACGCGGC CGGCCGGCTC
GAGCGCGGGC TGGTGGAGAT CGCCGAGGGC CCGCAGGTAC CGGCGGACGC GTCGGATATG
GACGCGGTGT ACGCGCGGCT CGGTGAGTTC GGCGTCGACT ACGGTCCTGC GTTCCGCGCC
CTGCTCGGCC TGGCGTTGCA CGAGCAGGGC CTGGTGGCGG CGCTCGGCGC GAGCGAGCCG
CGTCCGGGCA CGAGCGATCC GGTGCGGATG GACGCGGCGT TGCAGGCACT CGCCTGGCAC
CGCATCTCGG GACCCGACGC GCGCCTGGTG CTGCCCTTCG GGATCGAGCG GGTCACGCTC
GGCCCCGGTG CGCCCACGCT GGCGACGGTG CAGACCGAGG GTGAGCGCGC GGACATCGAC
GCGTTCGCCG ACGATGGCCG TCTGGTGGCC CGGATTCGCG GCCTGACCCT GCGCGCGCTC
GAGGGCACCA GCCCGCAGGG GCTGAGCGGC CTGATCTGGC GCACGAGCGA GCGGACGCTC
GCCAGCCGTC CGCTGAGCGG GCGCTGGTGG GTGGTCGGTG GTCCCGCCGC GCTCGCCGAC
GCGCCCGGGG TCGAATGGGT CCCGGCAGCG GCGCCGTTCG CCGATCTGCC CGCGCCGGAC
GGCGTGATCC GGTTTCAAGA TCCGGCGCAG CCGAGCGGCG AGGCGCTCAC GGAATCGCTG
GCGCTGGTCC GGGCGCTGCT TGAGCTGGCC AGGCCGCCGC GCACGCTGTG GCTCACGCGC
GCCGGCGCGG GGGCGGAGCC GACGAACGAG ACCAGCGCCG CGCTGTGGGG GCTGGTGCGG
GTGCTGCGGC AGGAGCATCC CGAGCTCGAG CCCGAGCTGA TCGACGGTCT CGAACGCCCG
CAACTCGAGG CGCTCGAGAC GCTCGACGAG CTGGCGCTGT CGACTCCGGA GCGCGTGATG
CAGGACGGAC AGCGGCTGCT GCCGGCGCTG ACCCGGGTGT CGTCCGAGCC GGCCGCCATC
TCCGGTCGCT GGCTGATCAC GGGCGCCTCG GGCGGGCTCG GCCAGGCGCT GGCGCAGCAC
CTGGTCGCGC GCGGTGCCGA GGAGCTGGTG CTGGTGTCGC GCACCGCCCC GCCGGCGGAG
CTGCTCGATA CCTTGCCCGC AACCTTTATC GTCGCCGACG TGTCCGAACC CGGAGCCCTC
GAGGCGGTGC TCCTGCGGGC CGGAGCGCTC GATGGCGTGG TCCACGCTGC CGGGCAGCTC
TCGGACGGCG TACTCCTGCA GCTCGATGCG GCCGCGTTTG CGACGGTGTC GGGGCCGAAG
CTCGCCGCCG CGCGCCAGCT CGCCGAGGCG CTACCGCTGC CGACCCGGCT GGTGCTGTTC
TCGTCGGTGT CGGCGTGGCT GGGCGCGGCC GGGCAGGCTG CCTATGCGGC GGCCAACGCC
GGACTGGAGG CCATCGCGGG CGCCCGGCGG GCGCGCGGGG GCGAGGCCAT CGCCATCGCC
TGGGGACCCT GGGCCGAGGT CGGTATGGCG GCCCGCGCCT CGGGTCGCGA TCGCGCCCGG
AGCGCGCGGA TGGGCCTCGA GCCGCTAGCC ACGGGCCGGG CCCTGGCGCT GTTCGATCGG
GCGCTGGGCG TGGACGCGGC GAGTGTGGGC GCGTTCTCAC TCAACGAGGA GACCTTGCGC
CGGGCCTTGG GCGCGGAGCG CGTGCCCGCC TGGCTCACGC AGCGGGCCGA GGACAGCGCG
ACCCTGGTAC TGCCCGAGAC CGGCCGGGCC CACGCCATCC GCGAGGCGCT CAAAGAGCAG
ATCGCCCAGG TGCTGGCGCT GTCCGCCGAT GACGAGCTCG ACTGGAGCCG ACCCTTCCAG
GATCTCGGCC TCGACTCGCT CATGGCCGTG GAGCTTCGCG ATCGCCTGGG TCGCTGGTGT
GGTCAACGCC TGCCGGCGAC GCTGCTCTTC GATCGCCCGC AGCTCGAGGA GCTGGTCGCC
TGGCTCGACG AGACGCTGCC CGGCGGCGAG GCCGCCGCGG TCGAAGTCAA CATCGCGCCC
ATCGTGCACG ACGAACCCAT CGCCATCATC GGCATGGGCT GCCGCTATCC GGGCGGTGTG
GTCGATCCCG AGAGCTTCTG GGAGCTGCTC GAGGGCCAGG TGGACGCGGT GACCCCGGTG
CCGGCCGATC GCTGGGACCG GGACGCCTGG CACGATCCGG ATCCCGCCTC GGTCGGTCAC
ACGATCACGC GCGAGGGCGG GTTTGTCGAC GGGGTGTTCG ATTTCGACCC GGCGTTCTTT
GGCATCAGCC CGCGCGAGGC GCGGCAGATG GACCCGCAGC AGCGCGTGGT GATGGAGGTG
AGCTGGAACG CGCTGGTCGA CGCCGGCCTG CGCCCCGAGG AGATGCGGGG GAGCAACACG
GGCGTCTACC TGGGCTACAT GAACCACGAC TATTTTCTGC TCCACGGCAC CCACACCGAC
GAGATGGACG GGCACTTTCT CATCGGCAAC AGCGGCGCCG TGGTGTCCGG GCGCGTGGCC
TACCACTTCG GCTTCCACGG CCCCGCGCTC ACGCTCGACA CCGCGTGCTC CTCGGCGCTC
GTGGCCGCCC ACCTGGGCGC CAAGGCCCTC CGCGGCGGCG AGTGCGATGT CGCCCTGGTC
GGCGGCGTGG CGCTGGTGCT GCAGCCCAAT GTCGCGGTCG AGTTCAGTCG GCTACGGGCG
ATGTCGCCCG AGAACCGCTG CCGCAGCTTC GCCGCGAGCG CCAACGGCGT CGGCTGGAGC
GAGGGCTGCG GCATGTTGGT GCTCAAACGC CTGTCCGACG CCGAGCGCGA CGGCGACCGC
GTGCTGGGTG TGTTGCGCGG CTCCGCGGTC AACCAGGACG GGCGCAGCAA CGGACTCACC
GCGCCCAACG GCCCGGCGCA GGAGGACGTG TTGCGCCGCG CCCTGCGCGA CGGCGGGCTG
GCCTCGCACG AGGTCGACTA CGTCGAAGCG CACGGCACGG GGACGGCGCT CGGCGACCCG
ATCGAGGCCA ACGCGCTCGG TCGGGTGATG GGCCAGGGGC GCTCCGATGG CGAGGTCCTG
TACATCGGCT CGGTGAAGAG CAACCTCGCG CATACCCAGG CCGCGGCCGG CGCGGCGGGC
GTGATGAAGG TGCTGCTGGG CATGGAGCGC GACACGCTGC CGGCGCAGAT CCATTTCGAC
GCCCCCAGCC CGCACATCCC GTGGGACGCG CTGCCGCTCA AGGTGGTCGA TGAGCCGCTG
CCGTGGTCGC GCGGTGAGCG GGCGCGCGTC GCCGGCGTCA GCAGCTTCGG CGTCAGCGGA
ACCAACGCGC ACGTGCTGAT CGAGGAGCCG CCGGAGCGCG TGCCGACTGA CCATGCGGCG
CTCGAGCCGC CGCTGCTGCT GCCGCTGTCG GCCCACAGCG CCGCGGCGCT CGCACGCATG
GCGGCCGACA TCGCGGCGCT GATCCGGGCC GGGCGCGTGC CGCTGCGCGA CATCCTGTTC
ACGACCTGCC GCCGGCGCTT CCGTCTCAAC GAGCGGCTGG TCGCGCTCGG CGGCGATGCC
GAGGCGCTGG CCGAGGCGCT CGAGGCCTTT GCCAAGGGGC AGAATCATCC CGGTATCGTG
CGCGGCGCGG TCACCCACGA GCGTCCGCGT CCGGTGTTCG TGTTTCCCGG ACAGGGCGCG
CAGTGGGCGG GTATGGCGCG CGAGCTGTAC GCCCGCGAGC CGGCGTTCCG GGACGCGCTC
AAGGCCTGCG ATCGCGCCAT CCGCGACGAG GCCGAGTGGT CGCTCATCGC CTGGCTGCAC
GGCGAGGGCG AGGCCGAGCG CATCGACCGG ATCCAGCCGG CGCTGTTCGC CGTGATGGTG
TCGCTGGCCG GCCTGTGGCG CGACTGGGGG TACGAGCCCG CGGAGGTCGT CGGGCACAGC
CAGGGCGAGG TCGCTGCGGC CTATGTCGCC GGCGCGCTGT CGCTCGAGGA CGCGGTCGCG
ATCATCGTGC GACGCAGCGC CATGCTGCGG ACGCTGTCCG GTCGCGGCGC CATGATGGTG
GTCGAGCTGA CGGCAGACAA GGCGGCCGAG CGCATCGAGA GCGTTCGCGA TCGCGTCGCG
GTCGCGGTCG TGAACGGGCC GCGCTCGGTC GTGCTGTCCG GGGACGTCGA GGCCCTCGAG
ACGCTCGGCG CGGAGCTCGA GGCCGAGGGC GTCTACCAGC GCTTCGTGAA GGTGGATGTG
GCCTCGCACA GCCCGCAGAT GGACCCGATC CGCGCGAAGT TGCTGGGCGC GCTGTCCGAA
ATCGCGCCGC AGCGGGGCAC CACGCCGATC CGCTCGACCG TGAGCACCCG GACCATCTCC
GGCGAGGAGA TGGACGCCGA CTATTGGTGG AGCAATCTGC GCCGCCCGGT GCGCTTTGGC
GCCGTGGTCG AGGCCATGGC GCAGGAGCGG GACATCCTGT TCCTCGAGAT CAGCGCGCAT
CCCCTGCTGC GTCCGGCGGT GGAGGAGCAG GCGCCGGGTC GCGCGGTGTC CAGCCTGCGG
CGCGAGCAGC CCGAGCGCGA GACCCTGCTG CGCGCTGTTG CCGAGATGCA GGTCCGCGGC
CTGGAGCCCG ACTGGGGCAA GCTGTCGCCC GCGGGTGAGC TCGCGCGGTT GCCGGTGTAT
CCCTGGCAGC GCGAGACGCT GCGGGTGGAC TGGGGCAGCA TGGACACGCC GGGCTCGATG
CAGCGCGCCG GGACCGAGAC CGGCCATCCT TTCCTGGGCC AGTCCTTCGA GCCGGCGACG
GGCACGGGGT GGCGCTACTG GACCAGTCAC CTGAGCACCC GCAGCCACCC CTGGCTGGCG
GATCACAAGG TCGGTGACGC CGTGCTCCTG CCCGGCGCGG CCTACGTGGA TATCGCTCTG
GCCCTGGGCG AGGCGCCGAT GTCGCTGCGC TCGCTGCGGT TCGAAGAGGC CATGGTGCTC
GACGAGGCCG GGCGCACGGT CCAGGTGGCG CTCGACCGCG ACAATCGCTT CACGATCAGC
TCGCGCTCCG CCGAGGGCAC GTGGGTGCAC CACGCCCGCG GTCAGCTCGC GGACGCGCCG
GGCGGCGCCA GCGAGCCCGG GTTTGACGAG TCCGCGGTGG CGCCCGCCGA GGTCGCGGCT
CTGTACCAGC GACTCGCCGC GGTGGGATTG CACTACGGCC CGGTGTTCCA GGGCATCGCA
GGGCTGCTGA CCGGCGACGG TGTGGCGGTC TCCGAGCTCG AGCTGCCGCA GCGCGTGCGC
GGCCGGCGCG GCTGGCAGAT CCATCCCGCG CTGCTCGATG CGGCGCTGCA GACGCTGGCG
GCGGCGGTGC CCGCCGAGAA CCAGCCCAAG GGCCCGGTCG TGCCCATCGG GGTGGCATCG
GTGACGCTGC ACCGGCCGGT GCCCGAGCGC GTGCGCGTGT ATGCTGATTG GCGCCTGGGC
GAGCAGCCGG GCACCAGCGC GGGCGATCTC TGGCTGGTCG ATCTCGACGG CCAGCCGGTC
GCCGAGCTGC GCGGGCTCGA GGCCAAGGAG ATCGGCGTGG GCGGCGACAG CGATCGGACG
CTCGAGGCCG ACTCCTGGCT GGCGAACGTC TGGGTCCCGG CCGCGGAGCC CGACGCGCCG
GCACACGCCA GCCGCTGGCT GCTTGTGGGC GACGGCGACG GCCTGCGCGA CCGGCTCGCC
GAGTCCCTGC GCAGCCACGG CCATACGCTC GTCGCCGAGG GCGAGCCGCA CGATGAGATC
GTGTGGCTGG AGGCCCTCGA CGCTGACAGA GATGGGCCGG AGGCGGCTCG GGTGGCCTGC
GTGACCCTGA TGGAGCGAGT GCAGAGGCTG GTGCAGGAGA GCGAGCACAC GCCGCGGCTG
TGGCTGGTCA CCCGCGGCGC CTGCGAGGCC CAAGGCTGCC TCGTCGATCC CGCTCACGCC
GCGGTTTTCG GGTTCGCGCG CTCGCTGGCG GCGGAACACG CCGAGCTGCG TCCGGGCCGC
ATCGATCTCG ATCCCGGGTT GGCGCCGGTG GCGCAGGCGG CGGAACTCAC TCGCGCGCTG
CTGGCTGGCG ATGACGAGGA CGAGACCGCT CTGCGCGGCG GGGCCCGTTT CGTCGGGCGT
CTGCAGCGGC GGCTGCCCGG CCAGGCGCTG GGACGCCGCG AGCGGGTCGC CGGGGCTTTC
GAGATCCAGG ACGGACAGGC CCGCGCGCTG GAGCTCGGCG CGATTCCGGC GGGGGCGCTG
CGCGTGCGCT TCGATGCGCT CGGCGTCGAC GGCGGCGCTG GCGCGATCAT CGGCCTCGGC
GATGGCGTCG AGGGGTTCGA GCTGGAGCAG CGGGTCATGC TCAGCGAGCC CGGGCTGGGT
CAGGTCTCGC ACTGGACCGG ACCGGCGTCC GCGGTCGCCG CGCTGCCGTC GGGCTGGTCG
GCGACCCGCG CCGTGTGCTG GGGGCTGCCG CTGATGGCGT CGGCGCGCCA GCTCGCGCGC
TCGGGGGACG ACGAGCCCGA TCGCTGGCTG CCGCTCACCC TCGCCGCGCT GCCTCGCGAT
CTGCCCGAGC CCGAGCTGCC CGTGCTCGGC CAGCAGCCCA GCGCGTCAGC GTGGCGCATC
GACGGCGCCT CGCTGACCCA GCGGGTGCCC GCCCACAGCG GGTTTTCGGA CAACGCGACC
TGGTTGATCA GCGGCGGCCT CGGCGGCCTC GGTCTGTCGC TCGCCGAGGG CCTGGTGTCC
TGGGGCGTGC GGGCCGTGGC CTTGCTGGGC CGCAGCGGCG TGCGAACCCA GCACCAGCGC
CAGGCCATCG AGCGCATGCG CGCGGCCGGT GCCCGGGTGC GCGTCCTCGA AGCCGACGTC
TCGCAGCAGG CGTCGCTCGA GGCCGCGCTG GCGACGCTGC ACGACCTGCC TCCGCTGCGC
GGGGTGGTGC ACGTGGCCGG CGTGATCGAC GATGGCATGA TCGCGGGCCA GAGCGCCGAG
CGGCTCGCGG CCGTGTTCGC GCCGAAGGTC TCGGGCGCCT GGAATCTCCA CCGGGCGACC
GCCGCGTGCG AACTCGACCA CTTCGTCATC TACAGCTCGG GCGCATCGCT GCTCGGCTCC
CCGGGACAGT CGAGCTACGC GGCCGCCAAC GGCTTCGTCG ACGGTCTGGC CTGGGCGCGG
CGCAGCGCAG GACTGCCGGC GCTGTCGATC AACTGGGGTG CCTTCTCCGA CGTCGGTCTG
GCGGCCGCCG AGGCGCATCG CGGCGAGCGT CTGGCGTCGA AGGGGCTGCG CAACCTCACC
CCCGAGGAGG GGCTGCGTCT GGTGCGGAGC CTGCTGGCGA CCGATGTCGT CCAGGTGGGC
GCGCTGCCCT TCGACATCGC CCGCTGGCTC GAGTCGCTGC CGCAACTGGC GTCTTCGTCG
CGCTTCGCGG AGCTGGTGTC GAGCGAGGAT GTGGACGCGG GGCCAGCGGT CGAGGACCTC
GCCGCGCTGC TGGCAAGCTC GGTGTCCGCA CGGCGCGTCG CGCTGTGTGA GGACTACGTT
CGCCGCCAGC TCGCCCAGGT CATCGGCATC GCCGAAGATC AGCTCCCGCT GCGCCGCCCG
CTGACCGAGA TGGGACTGGA TTCGCTCACC GGTCTCGAGC TCCGCAACCG CCTCGAGGCC
GGGGTGGGCA AGAAGCTGTC GGCCACGCTG GCGTGGTCCT ACCCGACCAT CGAAGCGCTC
GCCGGGCATC TGCTCGAGCA GCTCGCCAGC CAACCCGAGG CCGCTGCGCA ACCCGCGCCC
GAGCCCGAGC CCGAGCAGGC GACGGCCGCG CCGGACCTCG ACGCCCTCGA ATCCGAACTC
AAGGACCTCG ACGAGTCCGA TCTCGCCGCG CTGCTCGACG ACGAACTCGA CGACCTGGAA
GGACGTATCT GA
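
The gene sequence above is printed in space-separated blocks of 10 bases, so the spacing has to be removed before any analysis. The following is a minimal sketch of how that might be done and how a GC fraction comparable to the GC content field could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
# Hypothetical helpers for working with the block-formatted sequence.

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first line of the gene sequence in this record.
first_line = "ATGCCTCACG CTTCTCCTCC CCGAGATGTC GTGCTCGAGA CGAGTGCGGC CCAACGACGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `clean_sequence` in the same way gives a string that can be compared against the Gene Length and GC content fields.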
 
Protein sequence
MPHASPPRDV VLETSAAQRR IWTLEQRSSG GGYNVCFAIS LPVGTSEQAV RDALRALMAR 
HRSLRSTVRL VAGTLRMVPE EPEPSLDVVD LPEQALAEQY TRMATRRLDL AVDRPLQSVL
IRHPDGLRLV VLDHHVRIDG TTLPILCREL CLLIRGDELP PLDLDYQDYV EAEAQWLASP
EAQASRDWWR AHLEGLPASA YPGDRPRRGA PSGRGALVRV DLPSEESRSL VRLARERGCT
PFRALVACTQ ALLLRSTGVE ELPVGITTHG RWDKRWRPLA GMFTNPLLLR LPVSADDSLA
SLLGRCQGAM DEALAHARLP YDEVARVARD AWGISEPFAV MLGSQVAPHA MDLPEGFAVE
LHALDTGAPG SSRMDLKLSS APLEDGSLRL EWELDTDILR EQTVHSLGEN LVRLVRAGLA
TPERPIGSLD LLSANERRSV RALATNDDTP RPLGAVLDQV SAWPAESLAI SAGEERMSYG
ELVAHARRVG GALAARGVGP GEVVGLLLHR RPAAIATMLG VMAAGAAWLP IEPDLPAQRI
AQMTAEADAR FVIADEDLRH LLPDGVEALA PSLAAEPLSE CRGAPSDPAY LLYTSGSTGR
PKGVLVPRSA LRHLLAYAGE LFGMKPGVTV AALATWSFDI ALAELLLPLV HGASVRLLDR
SLALDPPALG AALQRVDVAQ ATPTTWALLV RRGWRPEAPL TLSSTGEALP PDLARALCRD
GVRLLNLYGP TETTVWASGS VVDPERLDIG RPVPGLRCLV LDREGQVVPP GVMGELHIGG
PSLALGYLKR PELTAERFIA DPETPSERLY RSGDLARMSA DGRIECLGRI DEQLKIRGHR
IEPGEIEAAL REHPAVSEAA VAPLRDPEGG DRLVAVYVCR GADPGERALR DSLAARLPKW
MVPARMSAVA LLPRTSSGKV DRNAIVSLFA QVSVQRAADA GLVSRIADTF ARVLGVASVE
SERSFFEQGG TSMQLVVARE RLTEMGLEVT VADLFDHPSP ERLAAYLGGS RAAIREVREQ
IEPVAIVGLA CRFPGGVTDS ASFLSLLDQG RDAITEIPLS RWDADALYDP EPGRPGRLPT
RWGGFLEDVE YFDPGAFGLS PREARAMDPQ HRLLLELGQE AVLAAGYQPA EFAGREVGVY
VGLCGTDYQG RAVQRPTLDA IDPHAATGSA HSVAAGRVAH IFDLRGPAVV VDTACSSSLV
AVHLAVSALR AGECEAALVG GANVVLSPRW GAGFASLGFL SPSGRCSAFG AEADGYVRSE
GAGMLLLKPL SAALADGDTV HAVIRGTAIN HDGRAASLTA PSGLAQQSVI RSCLERAGLE
PGDIDVVEAH GTGTELGDPI EVQALATALG EGREHPLLVG SVKSNLGHCE GAAGVAGMIK
AVLAVREGRV FRTLHVDSLN PHVPWQQMPL EVAGEARAWP ETGRPRRAGV SAFGFSGTNA
HVILEQAPNQ DASALRAALP ATAYARRRLW LDEPIDGHGV IEHPMLRSRQ TLADGREIFE
GRVSLALFPE LGDHRVRQDA VLPATWLMEL GRAAGAEVLG SATITLSEIG LFAPVVIPER
GTLRLQVLLE PLADAQLGWI LASRREVGED GWTRHAAGRL ERGLVEIAEG PQVPADASDM
DAVYARLGEF GVDYGPAFRA LLGLALHEQG LVAALGASEP RPGTSDPVRM DAALQALAWH
RISGPDARLV LPFGIERVTL GPGAPTLATV QTEGERADID AFADDGRLVA RIRGLTLRAL
EGTSPQGLSG LIWRTSERTL ASRPLSGRWW VVGGPAALAD APGVEWVPAA APFADLPAPD
GVIRFQDPAQ PSGEALTESL ALVRALLELA RPPRTLWLTR AGAGAEPTNE TSAALWGLVR
VLRQEHPELE PELIDGLERP QLEALETLDE LALSTPERVM QDGQRLLPAL TRVSSEPAAI
SGRWLITGAS GGLGQALAQH LVARGAEELV LVSRTAPPAE LLDTLPATFI VADVSEPGAL
EAVLLRAGAL DGVVHAAGQL SDGVLLQLDA AAFATVSGPK LAAARQLAEA LPLPTRLVLF
SSVSAWLGAA GQAAYAAANA GLEAIAGARR ARGGEAIAIA WGPWAEVGMA ARASGRDRAR
SARMGLEPLA TGRALALFDR ALGVDAASVG AFSLNEETLR RALGAERVPA WLTQRAEDSA
TLVLPETGRA HAIREALKEQ IAQVLALSAD DELDWSRPFQ DLGLDSLMAV ELRDRLGRWC
GQRLPATLLF DRPQLEELVA WLDETLPGGE AAAVEVNIAP IVHDEPIAII GMGCRYPGGV
VDPESFWELL EGQVDAVTPV PADRWDRDAW HDPDPASVGH TITREGGFVD GVFDFDPAFF
GISPREARQM DPQQRVVMEV SWNALVDAGL RPEEMRGSNT GVYLGYMNHD YFLLHGTHTD
EMDGHFLIGN SGAVVSGRVA YHFGFHGPAL TLDTACSSAL VAAHLGAKAL RGGECDVALV
GGVALVLQPN VAVEFSRLRA MSPENRCRSF AASANGVGWS EGCGMLVLKR LSDAERDGDR
VLGVLRGSAV NQDGRSNGLT APNGPAQEDV LRRALRDGGL ASHEVDYVEA HGTGTALGDP
IEANALGRVM GQGRSDGEVL YIGSVKSNLA HTQAAAGAAG VMKVLLGMER DTLPAQIHFD
APSPHIPWDA LPLKVVDEPL PWSRGERARV AGVSSFGVSG TNAHVLIEEP PERVPTDHAA
LEPPLLLPLS AHSAAALARM AADIAALIRA GRVPLRDILF TTCRRRFRLN ERLVALGGDA
EALAEALEAF AKGQNHPGIV RGAVTHERPR PVFVFPGQGA QWAGMARELY AREPAFRDAL
KACDRAIRDE AEWSLIAWLH GEGEAERIDR IQPALFAVMV SLAGLWRDWG YEPAEVVGHS
QGEVAAAYVA GALSLEDAVA IIVRRSAMLR TLSGRGAMMV VELTADKAAE RIESVRDRVA
VAVVNGPRSV VLSGDVEALE TLGAELEAEG VYQRFVKVDV ASHSPQMDPI RAKLLGALSE
IAPQRGTTPI RSTVSTRTIS GEEMDADYWW SNLRRPVRFG AVVEAMAQER DILFLEISAH
PLLRPAVEEQ APGRAVSSLR REQPERETLL RAVAEMQVRG LEPDWGKLSP AGELARLPVY
PWQRETLRVD WGSMDTPGSM QRAGTETGHP FLGQSFEPAT GTGWRYWTSH LSTRSHPWLA
DHKVGDAVLL PGAAYVDIAL ALGEAPMSLR SLRFEEAMVL DEAGRTVQVA LDRDNRFTIS
SRSAEGTWVH HARGQLADAP GGASEPGFDE SAVAPAEVAA LYQRLAAVGL HYGPVFQGIA
GLLTGDGVAV SELELPQRVR GRRGWQIHPA LLDAALQTLA AAVPAENQPK GPVVPIGVAS
VTLHRPVPER VRVYADWRLG EQPGTSAGDL WLVDLDGQPV AELRGLEAKE IGVGGDSDRT
LEADSWLANV WVPAAEPDAP AHASRWLLVG DGDGLRDRLA ESLRSHGHTL VAEGEPHDEI
VWLEALDADR DGPEAARVAC VTLMERVQRL VQESEHTPRL WLVTRGACEA QGCLVDPAHA
AVFGFARSLA AEHAELRPGR IDLDPGLAPV AQAAELTRAL LAGDDEDETA LRGGARFVGR
LQRRLPGQAL GRRERVAGAF EIQDGQARAL ELGAIPAGAL RVRFDALGVD GGAGAIIGLG
DGVEGFELEQ RVMLSEPGLG QVSHWTGPAS AVAALPSGWS ATRAVCWGLP LMASARQLAR
SGDDEPDRWL PLTLAALPRD LPEPELPVLG QQPSASAWRI DGASLTQRVP AHSGFSDNAT
WLISGGLGGL GLSLAEGLVS WGVRAVALLG RSGVRTQHQR QAIERMRAAG ARVRVLEADV
SQQASLEAAL ATLHDLPPLR GVVHVAGVID DGMIAGQSAE RLAAVFAPKV SGAWNLHRAT
AACELDHFVI YSSGASLLGS PGQSSYAAAN GFVDGLAWAR RSAGLPALSI NWGAFSDVGL
AAAEAHRGER LASKGLRNLT PEEGLRLVRS LLATDVVQVG ALPFDIARWL ESLPQLASSS
RFAELVSSED VDAGPAVEDL AALLASSVSA RRVALCEDYV RRQLAQVIGI AEDQLPLRRP
LTEMGLDSLT GLELRNRLEA GVGKKLSATL AWSYPTIEAL AGHLLEQLAS QPEAAAQPAP
EPEPEQATAA PDLDALESEL KDLDESDLAA LLDDELDDLE GRI