Gene Hoch_2971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2971 
Symbol 
ID8545359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4102462 
End bp4112736 
Gene Length10275 bp 
Protein Length3424 aa 
Translation table11 
GC content79% 
IMG OID646387648 
Product3-oxoacyl-(acyl-carrier-protein) reductase., 6- deoxyerythronolide-B synthase 
Protein accessionYP_003267376 
Protein GI262196167 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000594073 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTGGCGC TCAAGCGCGC CAGGGAGCAG CTCGAGGCCA ACGAGCGCCA GCGCAGCGAG 
CCCATCGCCG TGGTCGGCCT GGCGTGTCGC TTCCCGGGCG GCGCCGACGA CCCGGAGCGC
TTCTGGCGCC TGCTCGAGCG CGGCGTCGAC GCGGTGGGCG AGATCCCGGC CGAGCGCGTG
CCCGCGCACC CGGCGTTCCC GCAGACGGTG TCCTCGGCCG CCCTGCTCGA CGATATCGAG
CACTTCGACG CCGAGTTCTT CGGCATCTCG GCGCGCGAGG CCGAGCGCCT CGACCCGCAG
CAGCGGCTGC TGCTCGAGGT CGCGTGGGAG GCGCTGGAGA CCGCCGGCCA CGCGCCCAGC
GGCCTGGTCG GCAGCCGCAC CGGCGTGTAC GTGGGCATCG GCTCGCCCGA CTACCTGCGC
CGGCTGCTGC GCGAGAGCCC GGGCGCGGTC GACGGCCAAG CGTTCACCGG CAGCCTGGCC
AGCGTCGCCG CCGGACGACT GTCCTATATC CTCGGCCTGC AGGGCCCGTG CATGGCGCTC
GACACCGCGT GCTCGTCGTC GCTGGTAGCC GTGCACCAGG CGTGCGCCAG CCTGCGCAGC
CGCGAGAGCG ACCTGGCGCT GGCCGGCGGC GTCAACCTCA TCCTCTCGCC CGAGAGCGGC
TACGTGCTCG GCCAGCTCAA GGCGCTGTCG CCCGATGGCC GCTGCAAGAG CTTCGACGCC
CGCGCCAACG GCTACGTGCG CGGCGAGGGC TGCGGCGTGG TGGTGCTCAA GCGGCTCTCG
GACGCGCAGC GCGACGGCGA CCACGTCTGG GCGCTGGTGC GCGGCTCGGC GGTCAATCAG
GACGGCCGCT CGGCCGGGCT CACCGCGCCC AACATGCTGG CGCAGCAGGC GGTGCTGCGG
CAGGCGCTCG ACAGCGCCCG GGTCGAGCCC GCCGCCATCG GCTACGTCGA GACCCACGGC
ACCGGCACGC CGCTCGGCGA TCCCATCGAG ATCGAGGCGC TCACCGCGGT GCTCGGGCAG
CCGCGCGCCG ACGGCTCGCG CTGCGTGCTC GGCGCGGTCA AGACCAACAT CGGCCACCTC
GAGGCCGCCG CCGGCATGGC TGGTCTGATC AAGGCGATCC TGACCTTTCG CCACCAGGCG
ATCCCGCGCA ACCTGCACTT CCAGACGCTC AACCCGCGCA TCTCGCTGAG CGGCACGCCC
TTCGTCATGG CCGACGGCGA GCAGCCCTGG CGCGCGGGCG AGCGGCCGCG GCTGGCCGAG
GTGAGCTCGT TCGGCATCAG CGGCACCAAC GCGGCCGTGA TCCTCGAGGA GCCGCCGACC
CAACCGGCCC AGGCGCCGGC CCTGACCGAT CAGACGCCGG CCCCGGCGCG CGCGCATCAG
GTGGTGACCC TGTCGGCGCG CTCGCCGCAG GCGCTGAGCG GCGCGGTGGC CCGGCTGGCC
GCGCATCTGC GCAGCGAGCC GGCGCCAGCG CTGGCCGACC TGGCCTTCAC CACCCGCGCC
GGGCGCGCGC ACTTCGCGTA CCGGGCCGGC TTCGCGGCCA GCTCCATCGC CGCGCTGCGC
GAGGCCCTGG ACGCCTGGCT GACCGCTGAC GCGGCGGGGG CTGCCGACGC CGCCGCGGGG
GCGCCGGACG TCGCCGGCCG GCCGCGGATC GCGTTCCTGT TCACCGGCCA GGGCTCGCAG
CGCGCGGGCA TGGGCCGGGC GCTGTACGAC GCCGAGCCGG TGTTCCGCGA GACCCTCGAT
CGCTGCGCCG CGCTGCTCGA CGGCGCCCTG CCGCGGCCGC TGCTCGAGGT CATGTGGCAG
GACGACAGCG CCGAGCTCGA CCAGACCCTG TACAGCCAGC CGGCGCTGTT CGCGCTCGAG
GTCGCGCTGA CCGCGCAGTG GCGGGCCTGG GGCGTGGCGC CCGATGTGGT GCTCGGCCAC
AGCCTGGGCG AGTACGCGGC CGCCGCCGCC GCCGGCGTGA TGTCGCTCGA GGACGCCTGC
GCCCTGGTCG CCGCGCGCGC GCGCGCCTGC CACGGCCTGG GCCCGGGCGG CGCCATGGCC
GCGGTCGAGG GCGAGCCGGC CGTGGTCGAG GAGGTCGTGG CCGCGATCGC GGGCGACCTG
GTCATCGCGG CGCACAACGC GCCCCGGAAC CTCACCGTGT CGGGGCCGGC GGACGCGGTG
CAGCGGGCCG GCGAGGAGCT GCGCGCGCGC GGCGCCAAGG TCAAGCCGAT CAACGCCTCG
TGCGCGTTCC ACTCGCCGCT GATGGCGCCG ATGGTGGCGG CCTTTGCCGA GCGCGCGGCG
CAGGTCGACT ACCGCCCGCC GCGGGTCGCC TGGGTCACCA ACGTCACCGG CGAGCGGGCG
GGCGAGGGCT TCGACGCCGC CGACTACTGG CGCGAGCAGC TCTGCGCGCC GGTGCGCTTC
GTCGAGTGCG TGCGCCAGGC GCGGGCGCTG GGCTGCACGG TGTGGGTCGA GATCGGGCCC
GGGCCGGTGC TGCTCGGCGC GGTCTCGCGC ACGCTCGACG AGCCGCCGAG CGCGGCGCCC
TCGCTGCACC GCGAGCGCGA CGACGGCGCG GCGATGGCGC GCGCGCTGGC GACCCTGTAC
ACGCGCGGGG TGGCGCTCGA CTGGGGCGCC TACGACGCGG GCGCGGCCCA CCGCCGCGTG
CCGCTGCCGA CTTACGCCTT CGCGCGCGAG CGCTTCTGGC TGTCGCCGAG CGCGGCGCCC
GCGGCTGCGT CCGAGCCCGC GTTGCCGCGC TCGGCGGGCG CCGGTCATCC GCTGCTGGGC
ACGCGCGCGG CGCTGCCCGG CCGCGAGGTC CACTATCTCA ACCGGCTCAC GGCCGCGCAC
CCGCCCGAGC TGGGCGAGCA CCGCGTCTAC GGCCAGGTGG TGGCGCCGGC CGCGCTGTAT
CTGGCCGCGT TCCTGGCCGC GCTGGACGAG CGCGGCGGCG GCGCGCTGGT CGACGTCGCG
TTCACGCGCG CGCTGACCAT CCCCGAGGGC GGCGACGCGG GCGCCGAGGG CGCCTCGGTG
TGCGTGGCGC TGACGCCGGA CGAGCCGGGC GCGCGGCTGT CGTTCTTCGC CGCGGCCGAG
CGCGGGAGTC AGCCGGCGGG CCAGGGCGAG GGCGGGGAGG CGTGGACCCT GCACGCGCGG
GCCGCGCTGG CCCCGGCGGG CGACGCGGGC ACAGACGCGG CGCTGGCGTC GGCGGATCTG
GCGGCGCTGC GCGCGCGCTG TCCGCGGGCC GCGGCGCCGG CCGAGCTGCT GGCCGCGGTG
TGGACCGAGG GCGCCGCCGC GCGCACCTCG GCCATCCACC TGGGCCCGCG CTTTCGCCGC
ATGGCGGCGC TGTGGCTGGG CGACGCCGAG GCGCTGACGC GGCTGGCGCT CGACCCCGCT
GCGCTCGACG CGGCGGCCGC GGACGAGGCG GCGGCCGGCG CGGTGCCGCT CACGGTCTTC
GATAGCTGCT TTCAGCTCCT GGGGCTGGCG GCGGCGGCCA GCGAGGCCGA GCAGGCGTGG
ATGCCGATCG CCATCGAGCG GGTCGAGCTG CACGCCGGCG CGGCGCTCGC GGCCGAATTG
TGGGCCCACG CCGCGGCCGC CGTGGTCGGC GAGGGCGCGC TCATCCGCGG CGACGTGCGC
CTGCTGACGG CCGCGGGCGA GCCCGTGGCG ACGCTCTCGG GCGTGAGCCT CAAGCGGGTG
TCGCAGGCCG CGCTGCTGCA GCCGCCGGCG TGGCGGCGGT GGCTCTACGA GCTGGACTGG
CGCCGCTGCG AGCCGCCGGC GGAGGCGGCC GCGGCGGCGT TTGCGGCCGC TGAGCTGGCC
GTGATCCTCG ACCGCGGCGG CCGCGGCGCG CGCTTCGCCG ACGCGCTCGA GCGCCGCGGC
GCCCGCGTGC TGCGCTGCGC CGCGCTCGAG GAGCTTACGC GCGAGCCGCG GCCGGCGCTC
GCCCACATCG TCTCGTTCGC CGCGCTCGAC GCCGCCAAGA CCGACGACGC CGCCGACGCC
GACGACGCGG CGCCGAGCGC CCACGCGCTG GCGTTGGTGC AGACCGTGAG CCGGCTGCGC
GGGGAGCGCG CGCCGCGGAT CTGGTGGGTG ACGGCCGGGG CCCAGGTGGT CGCCGGCGCG
CCCGCGGCGC CGTCGCCGGC GCAGGCCGCG CTGTGGGGCC TGGCGCGCTG CGCGGCGCTG
GAGCTGCCCG AGATGTGGGG TGGGCTGGTC GATCTGGCGC CCGAGGCGGC CGACGCCGAG
GGCGACGCGC TGCTCGCGGC GCTGGCCGGC GAGCGCGATC AGTGCGCGCT GCGCGGCGGC
GCGCTGTACG GCGCGCGGCT CGAGCGGCGC AGCGCCGAGC CCGCGAGCGC GGCGCCGCCG
CAGCCGCTGC GGGCCGACGC GACCTACCTG GTGAGCGGCG GCCTGGGCGG GGTCGGCTGG
GAGCTTCTCG AGGCCTGGGC CGCCCGCGGC GCGCGCCACC TGGTGGCCAT CGGCCGCTCG
GCGCCGTCCG CCGCGCAGCG GCAGCGCCTG GCCGAGCTGG CGCGCGCGGG CGTCGAGGTC
CGGGTGCTGG CCGCCGACGT CCGCGACCGC GACGCCCTGG CCGCCGCCTG GGCCGCGCTG
GCGCCCGCGC TGCCGCCGAT CGCGGGCGTG GTGCACGCGG CCGGCGTGCT CGACGACGCG
GCCCTGCTGC GCCACCAGCC GGACCAGCTC CGCGCGCTGC TGGCGCCCAA GCTGACGGGC
GCGCGCAACC TGCTCGCGAT CGCGACGAGC GCCGCGCCGG GGGAGGGCGC GGCCGGCGCG
CCGGCCGGGG CCGGGGGGCT CGACTTCTTC GCGCTGCTGT CGTCGCTGGC CGGGGTGCTC
GGCTCGCCCG GTCAGGCCGG CTACGCGGCC GCCAACGCGG CCCTGGACGC GCTCGCCGCC
GCCTGGCGCA GCCGCGGCGT GCCCGCGCTG AGCGTGGCCT TTGGACCCTG GCAGGTCGGC
TTCGCGGCCG CTCACCAGCA GGCGCTCGCC GCCCGCGGGG TCGCCGCGAT GCCGGCCAAC
CTGGCGGTCG ACGCGCTCGC CGCGTGCATC GCCCGGGGCC CGGGCGACGC CGTGGTCACG
GCCGCCGACC TGGGCGCGGT GGCGGCCCAC ATGCCGGAGC GCTCGCGCGG CGTGCTGGCG
GCCTTCGCCG CGGCCCCGGA CGCGGCGGCC GGCGCTCCTG CGGGCGGGCG CACGGCGGCC
CTGGCCGAGG CGCTGCGCAA GGCGCCGCCG CGGCAGCGCG CGCGCGTGCT GCTCGAGGGC
TTGGCCGCCG CGGTGCGCGG GGTCCTGGGC CGGGCCGCGC ACGTGCGCAT CGAGCCCACC
CAGAGCCTGT TCGAGCTGGG CCTCGACTCG CTGAGCGCGG TCGAGTTCCG CACCTCGCTG
CAGCGCGCGC TGGGCCGCTC GCTGCCGGCC TCGCTGGCGT TCGAGCACCC CACGCTCGAG
GGCCTGAGCG AGGCGCTGCT GGCGATGCTG GCCGACGAGC TGGCCGCCGA CGAGTCCGAC
GCCGAGACTG ACGCGCCCGC CGCCGCCGCC GCGCCGCCGC CGCGCGCGCG CGCCGAGGAC
CAGCCCGAGG ACCAGCCCGT GGACCAGGCC CTGAGCGCGC TGGCCGCGGG CTCGCTGGAT
CTCTCCGCGC TGCCGCCGGC GACCCTGGCC GCGCTGGCCG CGCGGGCGCG CTCGCAGCGT
CCGGAGCTGG CCGTGCTCGG CGCCGAGCCC ATCGCCATCG TCGGCATGGG CTGCCGCTTC
CCGGGCGGCG CCGACTCGCC CGAGGCGCTG TGGGCGCTCT TGCGCGACGG CGTCGACGCC
ATCACCGAGA TCCCGGTCGA GCGCTGGGAC CCCGACACCT GGTACGACCC CGACCCCGAG
GCCGCGGACA AGACCACGAT CCGTCACGGC GGCTTCCTCG GCGACGTCGC CGGCTTCGAC
GCCGGCTTCT TCCGCATCTC GCCGCGCGAG GCCCGGTGCA TGGACCCGCA GCACCGGCTG
CTGCTCGAGG TCGCGTGGCA GGCGCTCGAG GACGCCGGCC AGGACCCGGA GCGGCTGCGC
GGCACCAGCG CCGGGGTGTT CATCGGCTTC ATGAACAACG ACTACGCCAG CATCGCCGAG
CTGGCCGAGC TCGAGGGCCA CCTGGCCACC GGCAACGGCA TCAGCAACGC GGTCGGCCGG
CTGTCGTTCG TGCTCGGCGT CCACGGCCCC TCGGTGGCCC TCGATACCGC CTGCTCGTCG
TCGCTGGTGG CCATCCACTC GGCGCTCGAG AGCCTGCGCA AGCGCGAGTG CGACCTGGCC
CTGGCCGGCG GCGTGAGCCT GATCCTGTCG CCCGGCCTCA CGATCCTCAT GTCCAAGCTG
GGCGGGCTGG CTGCGGACGG CCACTGCAAG ACCTTCGACG CCGCCGCCGA CGGCTACGTG
CGCTCGGAGG GCTGCGGCGT GCTGGTGCTC AAGCGACTGG TCGACGCGCA GCGCGATCGC
GACCGCGTGG TCGCCGTGAT CCGCGGCTCG GCGACCAACC ACGGCGGCCG CAGCGGCGGC
TTCAGCCAGC CCAACGCGCG CGCCCAGCAG GCGCTGATGC GCGAGGCCCT GGCGCGCAGC
CAGACCCTGC CGCAGCAGGT GAGCTACCTC GAGGCCCACG GCACCGGCAC GCCGCTGGGC
GACCCGATCG AGTTCCGCGC GGCCGCGGCC GCCTACGGGC CCGGGCGCGG CGGCGATCGC
CCGCTGCACA TCGGGTCGAT CAAGACCAAC GTCGGCCACA CCGAGGCCGC CGCCGGGGTC
GCCGGGGTGA TGAAGGTCGC GCTGTCGCTG CGCGCGCGCC AGATCCCGCC GCACCTCAAC
CTGCGCACGC TCAGCCCCGA GATCGATCTC GACGCGGTGC CGGCGCGCAT CCCGATCGCG
CTGACGCCCT GGGAGCCGAT CGCCGGGCGG CGCATCGCGG GCGTGAGCAG CTTCGGCATG
AGCGGCATCA ACGCCCACGT GATCCTCGAG GAGGCGCCGC CGCCCGCCGC TGCCGGCGCG
AGCGGCGCGG CCGAGCCGGC GCCGGCGCGG GCCGAGCTGG TGGCGCTGTC GGCGGCCTCC
GAGGCGGCGC TGATGGCCCT GGTGCGCGCG TACCAGGCTC ACCTCGGCGA GCCAGGCGAG
GGCGCGCCCG CGCGGCCCGC CGCGCCGACG CTGCTCGACA TCGCCGGCAC CGCCGGGGCC
CGGCGCGCCC ACCACGGCCA TCGGCTGGGC CTGGTCGCCC GCGATCTCGC CGAGCTGCGC
ACCGGGCTCG CGGCCGCGCT CGCCGGCCGC GGCGGCGGCG TGGTCCGCGG CGCCAGCGAG
GCCGGCAAGC GGCCGCGGGT GGTGTTCGTG TGTCCCGGCC AGGGCTCGCA GTGGCTGGGC
ATGGGCCGCG CGCTGTACGC CGACGAGCCC GCGTTCCGCG CGGCCATCGA TCGCTGCGAG
CGGGCCATCG GCGAGTACGT CTCGTGGAGC CTGGTCGAGC GCCTGCACGC GGCCGAGGCG
CCGGCCGGCA TCGACGTCAT CCAGCCGATG CTGTTCGCGA TCTCGGTGGC CCTGGGGGCG
CAGTGGCGGG CCTGGGGCGT GGAGCCCGAC GCCATCGTCG GCCACAGCAT GGGCGAGGTC
GCGGCCGCGC ACCTGGCCGG GGCCCTGAGC CTCGACGACG CCGCGCGCGT CATCTGCCGG
CGCAGTCGGC TGATGCTCGA GGTCGCCGGG CGCGGCGCCA TGGCCGTGGT CGAGCTGGGC
GCCGAGGCGC TGGCCCCGCG GCTGGCGCCC TACGGTGAGG CGCTGTGCGT GGCGGCCGCC
AACAGCACGC GCTCGTCGGT GGTCACCGGC GAGGCCGACG CGGTCGAGCG GCTGCTGGCC
GAGCTCGAGC AGGCGCGCGT GTTCGCCCGG CGCATCAAGG TCGACGTGGC CTCGCACGGG
CCCCAGGTGG ACCCGCTGGC CCAGCCCCTG CTCGCGCAGC TCGAGGGCCT CCGCCCGGGG
CTCGCCGAGG TGCCGCTGTG GTCGGCCGTG AGCGCGTCGC CGGCCGAGGG CCCCGAGCTT
TCGCCCGACT ACTGGATGCG CAACCTGCGC CAGCCGGTGC GCTTCGCCGA GACCATCACG
GCCCTGCTCG CGAGCGAGCA CGCGCTGTTC GTCGAGCTGA GCGCGCACCC GATCCTGGCC
CCCGCGATCG AGCACACGGC CGAGGACCGC GGCGCCGCAG CCTGGGTGGT GGGCAGCCTC
GAGCGCGACA AGCCGGAGCG GGCGTCGATG CTGAGCGCGC TGGCCGCGCT GTACGCGCGC
GGCTACCCGC TCGACTTCCG TCGCCTCTAC CCCGAGAGCG CGGCGGTGTC GCTGCCGACC
TATCCGTTCC AGCGCGAGCG CTACTGGGTC GAGCCGGTCA AGACCACGCC GCTGCTGCTG
CGCCACCGCG CCGATCTGCG CGGCGCGGCG CTGGCGCCGG CGGCGAGCGC GGCCGGGGCC
GGGGCCGGGG ACGAGGCCAC GCGCGGGCTG AGCTACGAGC TGTCGTGGGA GCCGGTCGCC
GACCCGGGCG CGGGCGAGCC CGGCCGCCGC TACGCCCTCG TCGGCGACCG CGGCGGGGTC
GCCGAGCGGC TCGCGGCCGC GCTGCGCAGC GCCGGCCACG CGTGCGCGGT GGTGGCCGCC
GAGGCCCCGG ACGAGGCCGC GCTGGCGGCG CTGGCCGACG GCCGCCCGCT GGCCGGCTGC
GTGTACCTCG CCGGCCTCGA CGGCGCGGCC GCGAGCGACG CCGAGGCGCT GCCGCTGGCG
GCCGCGCACG CCGCCGGTGT GGCCCGGGCG CTGGCGGTGC TGGGGCGCTT CGGCGCGCCG
CCGCTGTGGC TGGTGACCCG GGGCGGCCAG AGCGTGGCCG GCGAGGCGCC GAGCCTGTCG
CAGGCGGCGC TGTGGGGTCT GGGCCGCACC CTGGCCGCGG AGCAGCCGGC GCTGCGCAGC
GCGCTGATCG ACCTCGCGCC GGCCGCCGCC GAGCCGGGCG GCGCGGGCGC CCAGGCCGCC
GCAGGAGACG AGATCGCGGC CCTGACGCGC TGGCTGAGCG CGCCGCACGG CGAGACCCAG
CTCGCGCTGC GCGACGGGCG GACGCTGGCC GCGCGCCTGG GCGTGGCCCC CGAAGCGGCG
GCGGCGGCCG CGGTGACGAT CCGCGCGGAC CGGACCTACT GGATCGCGGG CGGGCTCGCC
GGCCTGGGCG GCGCCGTGGC CGAGGAGCTG GTGCGGCGCG GGGCCCGGCA CGTGTTGGTC
ACCGGCGCGC GCGAGGCCGC GGCCGACGAG GCCTGCCGGC AGCGGCTGGC GGCGGCCGGG
GCGAGCGTTA GCTACGCGGC GGCCGCGCAC GCCGAGCCCG AGGCCCTCGC GGCCGCGCTC
GAGGCCCTGC CGGCGCCGCT GGCGGCCGTG ATCCACGCGG TGCCGCCGGC CGCCCCGGCG
CTGCCGCTGG CGGCCACCGA GGACGCGTTC GCCGGGGCCG TGCGCGCGCT GGCCGACGAC
GCCTGGCGCC TGCACCACGC CGCGGGCGAG CGGGAGCTCG ACTTCGCGCT GTTCTTCGGC
CCGGCGGCGT CGCTGCTCGG CGGCGTCGGC CAGGCCGCGG CCGCGGTCGC CGCGGAGGTG
GCGGCGGCGC TGGCCGCGCG CTGGCGGCGG CGCTCGCGGC CGGCGCGGGC GCTGCTGCTG
GCGAGCTGGC CGGGCGCCGA GGACCCCGCG CTCGCCGGCC TGGGCATCGC GCCGGTGGCG
GTCGAGACCG CGGTCGCGGC GGCGCTGGCG GCGGCGGTCG CGGGCGGGGC GCCGGCGCGC
ATGATCGCGG CCATCGACTG GCCGCTGTTC CGCAGCGCCT ACGCCCAGCG CGCCGACGAG
CGCTTCCTCG GGCAGCTCGG CCGGGCCGAG GACGCCGCCG CCGAGGCCGC GCCGCTGCGT
GAGCGGCTGC GCGAGCTGGC GCCAGAGCGA GCGCGCGCGC TGCTCGCCGA GCTGGTCGCC
GACGAGACCC GCGAGCAGCT CGGCCTGGCG CCCGAGCAGC CACTGCCCAG CCGCACGCCG
TTCACCGAGC TGGGCATGGA CTCGGTGATG TCGCTCAAGC TCACCACGCG TCTGGGCCGC
GCGCTCGGGG TGCGCCTGGC CACGGTGGTG ATCTTCGAGT ACCCGACGGT GGCGGCGCTC
ACCGAGCACC TCGCCAGCGA GGTCCTGGAG CTGCCGACGA GCGCCCCGGC CGACGCCGCG
CCCGGCTACG CGCCCGACCC CGCGCTCGCC GGCGCCGACA CCGACACCGA CACCGACACC
GACACCGACG ACGACGACGA CGACCTGTCC GAAGACGAGC TCGCCGCGTT GCTGACGCGC
AAGCTCGAGA GCTGA
 
Protein sequence
MVALKRAREQ LEANERQRSE PIAVVGLACR FPGGADDPER FWRLLERGVD AVGEIPAERV 
PAHPAFPQTV SSAALLDDIE HFDAEFFGIS AREAERLDPQ QRLLLEVAWE ALETAGHAPS
GLVGSRTGVY VGIGSPDYLR RLLRESPGAV DGQAFTGSLA SVAAGRLSYI LGLQGPCMAL
DTACSSSLVA VHQACASLRS RESDLALAGG VNLILSPESG YVLGQLKALS PDGRCKSFDA
RANGYVRGEG CGVVVLKRLS DAQRDGDHVW ALVRGSAVNQ DGRSAGLTAP NMLAQQAVLR
QALDSARVEP AAIGYVETHG TGTPLGDPIE IEALTAVLGQ PRADGSRCVL GAVKTNIGHL
EAAAGMAGLI KAILTFRHQA IPRNLHFQTL NPRISLSGTP FVMADGEQPW RAGERPRLAE
VSSFGISGTN AAVILEEPPT QPAQAPALTD QTPAPARAHQ VVTLSARSPQ ALSGAVARLA
AHLRSEPAPA LADLAFTTRA GRAHFAYRAG FAASSIAALR EALDAWLTAD AAGAADAAAG
APDVAGRPRI AFLFTGQGSQ RAGMGRALYD AEPVFRETLD RCAALLDGAL PRPLLEVMWQ
DDSAELDQTL YSQPALFALE VALTAQWRAW GVAPDVVLGH SLGEYAAAAA AGVMSLEDAC
ALVAARARAC HGLGPGGAMA AVEGEPAVVE EVVAAIAGDL VIAAHNAPRN LTVSGPADAV
QRAGEELRAR GAKVKPINAS CAFHSPLMAP MVAAFAERAA QVDYRPPRVA WVTNVTGERA
GEGFDAADYW REQLCAPVRF VECVRQARAL GCTVWVEIGP GPVLLGAVSR TLDEPPSAAP
SLHRERDDGA AMARALATLY TRGVALDWGA YDAGAAHRRV PLPTYAFARE RFWLSPSAAP
AAASEPALPR SAGAGHPLLG TRAALPGREV HYLNRLTAAH PPELGEHRVY GQVVAPAALY
LAAFLAALDE RGGGALVDVA FTRALTIPEG GDAGAEGASV CVALTPDEPG ARLSFFAAAE
RGSQPAGQGE GGEAWTLHAR AALAPAGDAG TDAALASADL AALRARCPRA AAPAELLAAV
WTEGAAARTS AIHLGPRFRR MAALWLGDAE ALTRLALDPA ALDAAAADEA AAGAVPLTVF
DSCFQLLGLA AAASEAEQAW MPIAIERVEL HAGAALAAEL WAHAAAAVVG EGALIRGDVR
LLTAAGEPVA TLSGVSLKRV SQAALLQPPA WRRWLYELDW RRCEPPAEAA AAAFAAAELA
VILDRGGRGA RFADALERRG ARVLRCAALE ELTREPRPAL AHIVSFAALD AAKTDDAADA
DDAAPSAHAL ALVQTVSRLR GERAPRIWWV TAGAQVVAGA PAAPSPAQAA LWGLARCAAL
ELPEMWGGLV DLAPEAADAE GDALLAALAG ERDQCALRGG ALYGARLERR SAEPASAAPP
QPLRADATYL VSGGLGGVGW ELLEAWAARG ARHLVAIGRS APSAAQRQRL AELARAGVEV
RVLAADVRDR DALAAAWAAL APALPPIAGV VHAAGVLDDA ALLRHQPDQL RALLAPKLTG
ARNLLAIATS AAPGEGAAGA PAGAGGLDFF ALLSSLAGVL GSPGQAGYAA ANAALDALAA
AWRSRGVPAL SVAFGPWQVG FAAAHQQALA ARGVAAMPAN LAVDALAACI ARGPGDAVVT
AADLGAVAAH MPERSRGVLA AFAAAPDAAA GAPAGGRTAA LAEALRKAPP RQRARVLLEG
LAAAVRGVLG RAAHVRIEPT QSLFELGLDS LSAVEFRTSL QRALGRSLPA SLAFEHPTLE
GLSEALLAML ADELAADESD AETDAPAAAA APPPRARAED QPEDQPVDQA LSALAAGSLD
LSALPPATLA ALAARARSQR PELAVLGAEP IAIVGMGCRF PGGADSPEAL WALLRDGVDA
ITEIPVERWD PDTWYDPDPE AADKTTIRHG GFLGDVAGFD AGFFRISPRE ARCMDPQHRL
LLEVAWQALE DAGQDPERLR GTSAGVFIGF MNNDYASIAE LAELEGHLAT GNGISNAVGR
LSFVLGVHGP SVALDTACSS SLVAIHSALE SLRKRECDLA LAGGVSLILS PGLTILMSKL
GGLAADGHCK TFDAAADGYV RSEGCGVLVL KRLVDAQRDR DRVVAVIRGS ATNHGGRSGG
FSQPNARAQQ ALMREALARS QTLPQQVSYL EAHGTGTPLG DPIEFRAAAA AYGPGRGGDR
PLHIGSIKTN VGHTEAAAGV AGVMKVALSL RARQIPPHLN LRTLSPEIDL DAVPARIPIA
LTPWEPIAGR RIAGVSSFGM SGINAHVILE EAPPPAAAGA SGAAEPAPAR AELVALSAAS
EAALMALVRA YQAHLGEPGE GAPARPAAPT LLDIAGTAGA RRAHHGHRLG LVARDLAELR
TGLAAALAGR GGGVVRGASE AGKRPRVVFV CPGQGSQWLG MGRALYADEP AFRAAIDRCE
RAIGEYVSWS LVERLHAAEA PAGIDVIQPM LFAISVALGA QWRAWGVEPD AIVGHSMGEV
AAAHLAGALS LDDAARVICR RSRLMLEVAG RGAMAVVELG AEALAPRLAP YGEALCVAAA
NSTRSSVVTG EADAVERLLA ELEQARVFAR RIKVDVASHG PQVDPLAQPL LAQLEGLRPG
LAEVPLWSAV SASPAEGPEL SPDYWMRNLR QPVRFAETIT ALLASEHALF VELSAHPILA
PAIEHTAEDR GAAAWVVGSL ERDKPERASM LSALAALYAR GYPLDFRRLY PESAAVSLPT
YPFQRERYWV EPVKTTPLLL RHRADLRGAA LAPAASAAGA GAGDEATRGL SYELSWEPVA
DPGAGEPGRR YALVGDRGGV AERLAAALRS AGHACAVVAA EAPDEAALAA LADGRPLAGC
VYLAGLDGAA ASDAEALPLA AAHAAGVARA LAVLGRFGAP PLWLVTRGGQ SVAGEAPSLS
QAALWGLGRT LAAEQPALRS ALIDLAPAAA EPGGAGAQAA AGDEIAALTR WLSAPHGETQ
LALRDGRTLA ARLGVAPEAA AAAAVTIRAD RTYWIAGGLA GLGGAVAEEL VRRGARHVLV
TGAREAAADE ACRQRLAAAG ASVSYAAAAH AEPEALAAAL EALPAPLAAV IHAVPPAAPA
LPLAATEDAF AGAVRALADD AWRLHHAAGE RELDFALFFG PAASLLGGVG QAAAAVAAEV
AAALAARWRR RSRPARALLL ASWPGAEDPA LAGLGIAPVA VETAVAAALA AAVAGGAPAR
MIAAIDWPLF RSAYAQRADE RFLGQLGRAE DAAAEAAPLR ERLRELAPER ARALLAELVA
DETREQLGLA PEQPLPSRTP FTELGMDSVM SLKLTTRLGR ALGVRLATVV IFEYPTVAAL
TEHLASEVLE LPTSAPADAA PGYAPDPALA GADTDTDTDT DTDDDDDDLS EDELAALLTR
KLES