Gene Hoch_3816 details


Gene Information

Locus tag: Hoch_3816
Symbol:
ID: 8546209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5244748
End bp: 5256558
Gene Length: 11811 bp
Protein Length: 3936 aa
Translation table: 11
GC content: 72%
IMG OID: 646388486
Product: TPR repeat-containing protein
Protein accession: YP_003268209
Protein GI: 262197000
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
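
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon (the gene sequence listed on this page ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 5244748, 5256558

length_bp = gene_length(start_bp, end_bp)
residues = protein_length(length_bp)

print(length_bp, residues)  # 11811 3936
```

Under these assumptions the computed values match the Gene Length (11811 bp) and Protein Length (3936 aa) fields of the record.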


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAGT TCCAACCGTA TCTTCGCATC CTCGATACCG ATCCCACCGA CACCCACGCG 
CTGCAGGCGC TGGAGGCGGC GATCGGCGAA AACCACCAGG GCACCGCCTC GGCCGACATC
CTGAGCGCCC TGGCCGAGAG CCGCTCGGTG CTGCGCGAGC GCGGCGCGGT CGACACCGCC
CTGCGGCTGC TCGAGCTGGC GCTGCGCGTG GTCGGTGACG ACGCCGCTCA CGCCGGCCAG
CGCGCCGACC TGCTGCTCGA GCGCGGCCGC ATCTACGTCG AGGATCTGCT CGACGACGCC
GCGGCCGAAG AGAGCTTCCG CCAGGTGCTG GCGCTGCGCG AGGGCGACGA GAGCGCCAGC
GAGTCGCTGG CCCAGCTCGA GATGGAGCGC GAGAACTGGG CCAAGTTCGT CGAGAAGAAC
CTGTCCGAGG CCGAGTCGGC GACCGATCGC AAGCTGGTCA CGCACATGTA CCTCTCGGCG
GCCGAGTTCT ACGGCCGCTA TCAGCCCGGG GCCGAGGAGT TCGAGCGCTA CCTGCGCAAG
GCGCTCGAGG TCGACCCCGA GAACCACGGC GCCGCGCGTC TGCTGGTGCG CGCGCTGCGC
ACCGAGGAGC GCTGGGAGGA CCTCTTGGCC TTCCTCGACG AGCGCGCCGA GCAGGTCGGC
AAGGACGAGC GCATCCAGGC GCTGCTCGAC CTCGGCGAGA TCGCCAACGA GAAGCTCGGC
CACACCGAGG TGGCCGAGAA CAACATGAAG AAGGTCCTGG CCATGGATCC GGTGCATCCG
CGCGCGCTGC GGTTGCTCTC CGATCTCTAC GAGCGGCTCG AGGACTGGTC GGCCCTGGTC
ATGCTGTACA CCGGCGCGCT CAAGTCGCGC CGCGGCCGCG GCAGCGACTT CGAGATGGGC
ATGCTGCTGC AAGTGGCCAT GCTGCACTGG CGCCGGCTCG ACAACCTCGA CGCCGCCGAG
GAGTATTTCC GCCAGATCCT CAAGGCCCAG CCCGCGCACC CGGCGGCGCT GTCGTTCTAC
CGCGCCTACT ACGCCGCGCG CGGCGAGGGC GCCAAGCTGT TGCAGATCCT GCGGCAGGCG
CAGAAGAGCC TGCCGCGCTC GAGCGGCGAG GGCGGCCCCG AGGCCGAGCG CTGGCGCGAG
CTGGCGGTCG AGGTCGCGGA GCTGGCCGAG AACGACCTCG GCAACCCCGA AAAAGCCATC
GACGCCTGGA AGTCGATCCT GCGCGCCGAC GCCGACGCCG CCGACGCGCG CGTCGCGCTG
CTGCGCCTGT ATCGCAAGAC CGAGAAGTGG AACGCGCTGC TCGACCTGAT GAAGGACGAG
ATCGAGCGCC TGCCCGAGGA CGACCTCGCC GGGCGCGTGA GCCGATTGAT GGAGGTCGTC
GAGATCTACC GCGATCGCCT CAAGCTCGAC GTCATGGTGA TCAACACCTA CAAGGCGATC
CTCGACCTCG ACCCCGGCAA CCTGCGCGCG GTCGACGACC TGGCCTCCAA ATACACCGAG
CTGGGCCGCT GGAAAGACCT CATCTCGGTG CTCTCGCGCA AGTCCGAGAT GCCCGAGGTG
CCGCGCGCCC AGCGGGCCGA GATCCTGCGC GAGATGGCGC GGCTGTGGAG CGAGCGCTTC
GGCAACTTCG CCCAGGCGAT CAAGCCGCTG GAGGCGCTCG TCGACCTCAT GCCCGACGAC
GCCGCGGCCT TTGCCCAGCT CAAGGACATC TACTCGCGGC GCCGGCAGTG GCGCCAGCTC
ATCGATCTGC TCGGCCGCGA GGCCCTGGGA CGGCCGCCCG AGTCGCGCCG CGAGGTGCTC
ACCGAGATGG CGCGCCTGGC CGCCGAGCGC GTGGGCGACA GCAAGCTCTC GGTCGAGCTG
TGGAATCGGG TGCTCGAGCT GCCCGCGCCC CCGGCCCCGG CCCCGGCCGA CAGCGAGGGC
GAAGGCGAAG GCGAAGACGA AGACGGACAG GCGGACGAAA ACGCCGCGGC CGAAGGCGCG
CAGGCGGGCG AGGGCGAGGC CCTGGCCGCG CTGGCGCACC TCTACGAGCG CGAGAAGCGC
TACCCGGCGC TGGCCGAGGT CTACCATCGC CAGTGGCGCC TGGCCGAGGA CGAGACCGAG
GCCGTGGGTC TGCTCGAGAA GCTCGGCAGC CTGCTCGCCG ACCGCATGGA CGCGCCCGCG
CTGGCGGCCC AGACCTTCCG CCAGCTCCTC ACCCTGCGTC CGCGCCACAA CCGCGCGCTG
CGCACGCTGC GCGAGATCTA CGCCACCGAG GGCAACTACG CCGAGCTCGA GTCGCTGTAC
GCCGAGCTCG GCCAGTGGAC CGAGCTGGTC GACGCCTTCC ACACCATCGC CGATCGCATC
AGCGACGAGG GCGACAAGCT GTATCTGTTC GAGCGCAGCG CGGCCATCGC GGCCGAGCAC
TTCGACAAGC CCGATCGCGT GGCCCGCGCC TACGAGCGCG TGCTGGCGGT GGCGCCCGAG
CATCTCGAGG CCGCGCGCGC GCTGGTGCCC ATCTACGAGG CCACCGAGAA GTGGGCGCGG
CTGCTCGGCA CCTACGAGGT GCTCCTCGGA CACGCGGCCG ACGACGACGA GCGGCTCGAT
CTCGAGCTCA AGATCCGCGA CCTGTGCGAG GCCCGCCTGG GCTCCAAGGC GCTGGCCTTC
CAGTGGGCCG CGCGCGCCTA TCGCCTGGCG CCCGGCCGCG AAGACCTGCG CGCCGACCTC
ATCCGCCTGG GCGCCGACGC CGACCAGTGG AACGAGGTCG CCGAGATCCT CGACGCCCGC
GCCCGCGACC GCGCCACCGG CACCGACGAG AAGCTGGCGC TGTTGCGCGA GCTTGGCCGC
ATGGCGGCCG TGCGCCTGCA CGAGCCCGAG CGCGCGCGCG CCTACCAGCG CGAGGTGCTG
GTGCTCTCGC CCGACGATCC CGAGGCCATG GACGCGCTCG AGGAGCTGGC CACCCAGCTC
TCGGACTGGC CCGATCTGCT CACCGTGCAG CGTCGCCGCG TGGCCCTGGC CGACGCCGAC
GACGCCAAGA TCGCGCTGCT GTTCAAGATC GCGTTCGTCG AAGAGGAGCG CCTCGAGGAC
CTCGACGCCG CGGTGGCGAC CTACGAGTCG CTGCTGGCCC TGGTCCCCGA GTCGCAGCGC
GCCATGCGCG CGCTGGCTCG CCTGCAGGAG GAGCGCGGCG ACTGGGAGGG CCTGGTCGAC
GCCCTCGGCC GCGAGCTCTC GCACAGCGAC GACACCGACA CCAAGGTGGC GCTGCTCATG
CGCATCGGCG GCCTCTACGA GCGCAATCTC GAGCGCCCGT CCGAGGCCCT CGGCGCCTAC
TGCGAGGCGC TGTCGGTAGC CCCCGGGCGC GGCCAGATCC ACAGCGCGCT CGAGCGCTTC
CTGAGCCCGC CCAGCGCGAC GGCGACGGCC GCAGCCGCCG AGGAGGCGAG CGAGGCCGCG
GCGGCGTCCG CGACCACGGA CGAGGAAACG TCGCTGAGCG GTGATGTAGG CGAATCGGAT
TCGCCTGATG ACGGCTCTGG CGAATCGGAT GCGGCCGATG CGTCGGCGAG CGAAGGCGCC
GCCGAGACCG AAGGGGCCGG CGAGACCGCG GGCGATGAGA CCTCGGGCGA CGAAGCCGCT
GCCGACGCAG ACGCAGAAAC CGGCGACGAA AGCGAAGCCG ACGACGAGGT CACCACCGAG
CGCTCGCCGG TCGCGGCCGC GCCGGCGAGG CCGCGGGTGT CCGAGGCCGA GCAGTATCGC
GTGGCCGAGC TGCTGCTGCC GGTGTATGAG CAGTCCGGCG ACGTTGTGCG CATCGCGCGC
GCCGTCGAGG TCTTGCGCGC GGGCACGCCC GATGAGGACG AGGCGCTGGC CTACGATCGC
CGTCTGCGCG ACCTCTACGG CGATCGCCGC GGCGACGAGA CCCGGGCCTA CGAGGCCGCG
CTGCGGGTGC TGCGCCGCGC GCCCGAGGAC GGCGCTAACC GCGCCGCGCT GCTCACGCAT
TCGGCCGTGC TCGGCCGCGA CGAGGAGCTG GCGCGGCACT TCGAGAACGT GCTCGCGGGC
GCCGAGGGCT ACGAGCTGCC CTTCGATCCC AAGGCCCAGC GCGGCCTGGC CGTCGAGCTG
GCCGTGCTCT ACGAGCAGCG CCTCGAGGAC AACGAGCGCG CCGAGCGCGC CTGGCAGACC
GTGCTCGAGC TGGCCGCCGA TCAGGGCGTG GTCGAGAGCC GCGCCTACGA CGCGCTCGAC
CGCATCTACC GCACCGCCGG CCGCTGGCGC GACCTGCGCG ATCTGCTGCT GCGGCGCGAG
GAGGTCACCC TCGACGACGC CGCGCGCAAG GACATCGTGC TGGCCATCTG CGAGCTCGAG
GAGGGCGTGC TCGAAGACCG CGACGGCGCC ATCGCCGCGT ATCTGCGCGT GCTCGACATC
GACCCCTCGC TCGACCGCGC CTACCGCTCG CTCGAGCGCC TGCTGGCGCG CGCCGAGCGC
CACTTCGAGC TCGAGGAGCT GCTGGCGCGC GAGGGCGAGT ACGCGGGCGA CGCCACCCGC
GTCGAGCTGC TCTTCCGCCG CGCCGAGCTG CGCGCCCAGC ACTTGGGCGA CACCCTGGGC
GCGGTGCCGC TGCTCGAGGA GGTGGTGCTG CATCGCCCCG GCCACCGCGC CGCGCGCACG
CTGCTCGAGT CGCTGCTGCC CGAGCCCGAG CTGCGCCTGT CGGTGGCGCG GATTCTCGAA
CCGCTGTACG AACAAGATCG CCGCTGGCTG GAGCTGTGCC GCGTGCTGCG CGCGCAGCGT
GAGTTCTCGG CCACGCCCTA CGAGGCCGCC GAGCTGCTGG CCCGGGTCGC CGCCGTCGAA
GAGGACGAGA TGGGCCGGCC GACCGAGGCC TTCAACACCT GGGTCGAGAT CCTGGCGCTG
GCGCCCGGCG ACCAGGGCGC GCGCGCGTCG CTGCGCCGCC TGGGCACGGC CCAGAACCGC
TGGGCCGACG TCGCCGCGGC CTACGAGATG GCGCTCGAGA AGGCCGATCT CACCGATGTG
GCGCTGGCCA TCGAGCTGCT CACCGAGCTG GCCGAGATCT ACGACCAGCG CTTCACAGAG
CGCGACCGCG CCATCTCGGC GTATCGCCGC CTGCTCGATC TCGACCTCGG CAGCCCGGAC
ACCGCGGCCA TGGCCGCGCG CGCGCTCGAC CGCCTGTACA CGGGCGAGTC GCGCTGGGAC
GATCTGGTCG AGATCCTGCG CCGCCAGGCC GACTGGGCCG ACGACATCGA GGAGCGCAAG
CGCATCCTGG CGCGCGTCGC CCGGGTGTCC GAAGAAGCCC AGGAGGACGT CGAGGCCGCG
ATCCTCACCT GGCGCGAGGT GCTCAGCGAG GACCCCGAGG ACGGCGACGC CCTGGACGCG
CTCGAGCGGC TCTACCAGCA GGAGAGCCGG GCGATGGAGC TCATCGAGAT CCTGCGCCGC
CGGGTCGAGC TGGCCGAGGA CGCGGGCGAG CGCAAGGCGC ACCTGTGGCG CATCGCCGTG
CTCTTCGAGC ACGCCATCGA GGACCGCATC GAGGCCATCA CCGCGCACCT CGAGGTGCTC
GACCACGTGC CCGAGGACCC CGAGACGCTC ATGGAGCTGG GGCGCCTGTA CCGGGCCGAG
GAGCGCTACG CCGACCTGCT CGACGTGCTC GAGCGCCGGC TGGCGCAGAG CGAGGAGCCG
GGCGAGCGCA TCGCGCTCAC CTGCGAGGCG GCCGAGCTGC TGGCCACGCA GCTCGGACGC
GAGGCCGAGG CGCTCGAGCG CTACGCGCGC GTCCTCGAGG ACGACCCCGA TCACGGCGAG
GCGCTGGCTG CGGTCGAGAA GCTCACGGCC GGCCCCGAGC TGCTGATGCG CGGCGCCGAG
ATCCTGCGCC CGATCTACGA GCGCGCCGGC GCCCGCGACG ACGTCGCCGG CCCCGAGCAG
GCGCAGGCCC CGGACGGCGC CCACAGCAAG CTGGCGGCGC TGCTTCTGCG CGTGGTCGAG
GCCACCCTCG ATCCCCGCGA GCAGCTGCGC GCGCTGCGCG AGGTGGCGCG CATCCGCGAG
CAGCGTCTGG GCGACCCGAG CGGCGCCTTC GAGGTCGCCG TGCGCGCGCT GCGCGTGGGC
GTGGCCGAGC CCGAGATGCC CGAGCTGCTC GACGAGGTCG AGCGCCTGGG CTCGGAACTC
GAACGACCGT CCGATCTCAT CGAGATCTAC CAGGAGATCG CGCCCGACGT GCTCGACGGC
GAGCAGGCGC GGCGGCTGCA TCTCGACATC GCCGACCTGG CCCGCGCGGT GCACGAGGAT
CCCTCGCTGG CCCGCTCCTA CTACCAGCGC GTGCTCGACG ATCAGCCCGA GGACTCGCGC
GCCATGGTGG CGCTCGAGAG CATCTATCGC GAGACCGACG AGCACGAGGC GCTCTACGAC
ATCCTGGTGC GCAAGGCCGA CATGCTGGCC GACGATCTCG ACGCCCGCAG CGCGGCCCTG
GCCGAGGCCG CCAGGCTGTG CGCCGAGGCC CTGGGCCGGC CCGAGGACGC GATCCTGGCC
TGGGAGCAGG TGCTCGAGCT CACGCCCGAC AGCCGCGAGG CCACGGTCGC GCTCGAGCGT
CTGTACGAGG CCGCCGAGCG CCACCACGAC CTGGTCGACC TGCTCGAGCG CCGCCTCGGC
TTCGCGTTCA CGGTCGAGGA GGCGGTGGCG CTGCGCTACC GGCTCGGCGA CCTGTGCGAG
CACAAGCTCT ACGACCCCGA CACCGCGCTC GAGAACTACA GCGCCGCGCT CGGCGGCGAC
CCGGGCCACG TGCGCGCCAC CGAGGCGCTC GAGCGCTTCC TCGACGATCC CGGGCTGCGC
GCGCGTGCGG CCGAAGTCCT CGAGCCGATT TATGTGGCGC AACAGGATTG GGTCAAGCTG
GTGCGCATCT ATGAGATCAA GCTCGATTCG GCCGAGGACG CCGACGAGCG GCTGGCGCTC
ACGCGCTACA TCGCGCGCCT GCACGAGGAG CAGCTCGAGG ACCTCGAGGG CGCCATGCAC
TGGTGCGGGC GGGTGTTCCG CGAGATCCCG AGCGACGTCG ACATCCGCGA GCAGCTCGCG
CGCCTGGCCT CGATCCTCGA CCGCTGGGAG GAGCTGTCGC ACATTTTCCA GGCGTATCTC
GACGACGAGC CCGGCGAGCC GCCCGAGCTG GCCTCGGTGG CGCGGGCCCT GGGCGACATC
TACGAGCGCC GCCTGGACGA GATCGAGCGC GCCCAGGCCG CCTATCGCCG CGTGCTGCAG
GTGCGCCCCG ACGATCTCGA CACCTTCGAG CGGCTGCGCG ACATGCTCAC GCGCGCCGAG
CGCTGGTACG CGCTCATCGA GGTCTACGAC GAGGCCATCG CCCGGGCGCC GATCGACGAG
AGCGGCGACC GCCGGCGCAT CGAGCTGTTT TTGCGCATGG CCTGGGTCTA CGAGGAGCAC
CTGCACGACG CCGAGCAGGC CATCAACTCC TACCGCTCGG TGCGCGACAT CGACCCGGGC
CAGCCCACCG CGCTGGCGGA GATCGACCGC CTGTACCAGG CCGAGGCGAT GTGGTTCGAG
CTGGCCGAGC TGCTCGCTCA GCGCGTGGCC ATGGCCGAGG CCGAGGACGA TGTGCACGCG
GCCGTGGACC TGCGCATCCG GCTGGCCGAG GTGCTCGGCC GGCGCCTCGA GGACGTGGCC
AGCGCCATCG ACCAGTACGA GCAGGTGCTG CGCGCGTCCG AGGGCTGGGA GCGGGCGCTG
CCGCCGCTCG AGCGGTTGGT GCTCCACGAG GACTACCGCG CGCGCATCGC CGAGCTGCTC
GAGCCCGTGT ACCGGGCCAA CGACTGGTGG AAGAAGCTGG TCGTCATCCT CGACACCCAG
GTCGGCTACG TCGACGACCC CGACCGGCGC GTGGCCATGC TGCGCGAGAT CGCGCACATC
CACGAGACCC GCGGCGGCGA CGAGCGATTG GCGCTCGAGG CGCTGTCGCG GGCCTGGCGC
GAGAATGTGC GCGACAGCGA CGCGCTGGCC GAGCTGACCG CGCTCGCGGC CAAACTCGGC
GCCTGGGAGA CCCTGGCCGA GACCCTGGCG GCCGGCGTCG CCGAGGAGTT CGACCCCGAT
CTGCTGGCGC TGGTGTGGTC GCGCATCGCC GAGATCCACG AGGAGCGGCG CGGCGAGCCC
GCGCGCGCGA TCGAGGCCTG GCGCAAGGTG CTCGAGGTGC GCGACGACGA CGACGCCGCG
CTCAGCGCGC TCGACCGGCT GCTGCTGATG GAGGCGCGCT ACGAGGAGCT CGTCCGCGTG
GTCGAAAAAC GCGCCAACCT GGCCCAGGAT GAGGGCACGC GGCGGGTGTT CTTGCACCGC
ATCGCGGCGC TCTACGAAGA GGATCTCGAG CAGCGCGGCG AGGCCATCGC GGCCTACAAG
AACGTGCTCA CCGAGGCGCC CGGCGATCCG GTCGCGCTCG ACGCCCTCGA GCGCCTGTAC
CGCGAGGAGA GCGACTGGCA CGAGCTGGTC GCGGTGCTGC AGCAGAAGAT CGAGCAGGCT
CAGGAGCGCA CGCAGCGCCG CGAGCTGCGC CTGGCGGCCG CCGACGTCTA CGAGCGCCAG
CTCGAGGACA TCTACGAGGC CATGGCCATG CTGCGCGCCA TCCTCGACCC CGAGGACGGC
GACCCCGAGG ACGGCGAGGC GCTGGCGCGG CTCGATGTCC TGTACCAGAG CGAGTCGGCC
TGGCCCGATC TCATCGACAT CCTCGACCGC CGGGCGGCGC TCGAGAGCGA GCCCATCAAG
CGCGCCGAGC TGGCCTTCCG CGGCGCGCAG GTGGCCGAGA CCGAGCTGCT CGAGCGCGAC
GACGCTATCG AGCGCTACGC GGCGCTGCTC GCCTACGCGC CCGGCCACGG CGGCACGCGC
GCGGCGCTCG ACGCGCTGGC GCAGCGCGAG GAGACCGCCG AGCGCGCCTC GGCGGTTCTC
GAACGGCTGT ACGAAGACGA GCAGAACTAC GACGCGCTGG CCGCGCTCTA CGAGCAGCGG
CTGTCGATGC CGACGCCCAA TCCCGAGCGC CGCTTCGAGC TCTACCGCAT GCTGGCCCAG
GTCTGCGAGG AGCGCCTCGG CGACCTCGAC CGGGCCTTTG AGGTCTGGGC GCTGGCGCTG
TCCGAGTACT CCTCCAGCGA GGAGGTGCAG GACCATCTCG AGCGCCTGGC GGCCTCGCGC
GGCGCCTGGG AAGACCTGGT GGCGCTGCTC GAGCAGCGCC TGGCCGAGCT GCTCGACGCC
GAGCTCGAGT ACGCCTACGC GCTCAAGCTC GCGAGCCTGT ACGAGGACGC CCTGGGCGAC
CTCGAGGGCG CGGCCGAAAA GTACCGGCGG GCGCTCGATG TGGCCGCGGA CGAGCGCGAG
CCGCTGGCCG CGCTGGACCG CATCTACGGG CGCTCCGAGC GCTACGAGGA GCTGGCCGAG
GTGCTCGCGC GCGAGGCCGA GGCGACCCTC GACGAGGGCG AGCAGTGCCA GTTTTTGTTC
CGCCTGGGCG ACCTGCGCGA GGTGCGCCTG CGCGACCTGC CGGGCGCGGT CAACGCCTAC
CGCGACGTAC TCGAGCGCAT CCCCCAGCAC TCGGCCGCGC GCGGCGCGCT CGAGCGGCTG
CTGCACAGCG CCGAGAGCGT GCGCGCCGAC ATCATCCGCA TCCTCGAGCC GCTGTACGAG
CAGGAGGGCG ACTTCGCGCG CCTGGCCGAC CTGCTGGCGG CCAAGCTCGG CACCACCGGC
GTCCACTTCG AGCGCGCCCA GATCTACAGC CGCATCGCGG AGCTGGCCGA AAATCAGCTC
GGCGACCCGG TGCGCGCGCT CGACGCCGCC GGCGGCTGGC TGGCCGAGGA CCCGCAGTCG
CAGCAGGCGC TGGCCGAGCT GGCGCGGCTG GCCGAGGCCG TCGATCGCTT CGGCGAGATG
GCGGCGCGGC TCTCGGGCAT CGTCGAGTCG GCCGACGATC CCGACATCCA GCGCGCGCTG
CTGTTCCAGA TGGGCACCAT CGAGCTCGAG CGGCTGCGCG ACGACGCCGC GGCCGAGGCC
TCGTTCAAGC GCTGCCTCGA GATCTCGCCC GAGTTCACCG AGGCGCTGGA CGCGCTGCAG
CGCATCTACC GCGAGCGCGG CGGTGAGGGC GACCGCGCCC GCCTCGCCGA CGTTCTCGGA
CGAATGGCCG AGATCACCTA CGAGCCCGAG AACAAGCGCC GCTACCTGGT CGAGGTGGCC
GAGCTGCGCG GCGAGCTGGG TGAGCTCGAC GCCGCGGTCG AGGCCTGGCG CGAGGTGCTC
GCGCTCGACG AGGGCGACCG CGACGCGCTG GCCCGCCTGG CCATCATCCA CGAGCAGCGC
GGCGACTGGT ACGCGCTCAT CGATATTCTC GGACAGTCGG CCCGATACGC GGCCAACAGC
GACGAGGAGC GCCGCTTCCG CAGCCGCATC GCCCAGCTCC AGAGCGACAC CCTCGAGGAT
CTCGACGCCG CGGTCGAGGC CTGGCAGTCG GTGCTCGACG TCGCCCCCGA CGCCGAGGAC
GCGCTCACCG CGCTCGAGGG CATCCACACC CGGCGCGAGG ACTGGGGCGC GGTGCAGGAC
ACCATGGCGC GCCGGCTCGA CCTGCTCGAC GCGCCCGCGG ACCGGGTCGC GGTGCTGCAC
CGCCTGGCCG ATCTGGCCGC CGACAAGCGC GACGCCAATG AGGACGCGAT CGTCTACCTG
TTCCAGGCGC TCGACCTCGA CGACACCCAC CTGCCGACCT ACGAGAAGCT CGACGAGCTG
CTCGGAAAGG CCGAGCGCTG GCACGATCTG GTCGATCTGC TGGAGCGCGC GGCCGGCGTG
TACGCGCGCC TGGCCGGGAT GGGCGCGGCC GGACAGCCGC AGCGCAAGGA GATCGATTGT
CTGGCGCGCG CGGCCGACAT CTGGGAGGGG CCGCTGGCCA ATCCCGACGC CGCGGCCGAG
ATCCTCGAGA AGATCCTGGC CCGCGAGCCG GCCTACGTGC CCGCGCTCAC GCGGCTGTCG
AAGATCTACG AGAGCGCCGG CGACTGGGAC CGCTGCGCCG AGGTGCTCGA CCGCGCGCTG
GCGCTCGGGC CCACGGGGCG CGACGCCGCC GAGCTGTACT TCCGCATGGG CGAGGTGGCG
CGCGAGCAGA GCGGCGACGC CGCCGCGGCG ATGTCGCGCT GGCAGCAGGC GCTGGCCTCG
GACCCGAGCT ATCTGCCGGC CATCGCGAGC ATCGAGGCGG CCGCGCGCGA GGCCGAGGAT
TGGCCCGTGG TCGCCGATAT GCTCACGCGC CGTCACAACC AGGTGCAGAA GCCGGCCGAG
CAGCGCGAGC TGGCGCTGGC CCTGGTCGAT ATTCTGCGCA AGAAGCTCGG CCAGCGGGCG
CAGGCGATTC CGCTGCTCGA GGGTCTGGTC AGCGAGGGCG AGGACGACCC CGAGGTGCTG
CGGCCGCTGG CCGATCTCTA CTGCGCCGCG CAGCAGCACG ATCGCGCGGT GCCGATCTAC
GAGCGCCTGG CGGACGCGGC CAAGAAGGCG CGTCAGCTCC GCGATGTCGC CGTGTACCGG
CAGCGGCTGG CCAGCATCCT CGAGGCCCGC GGGCAGATGG ACGAGGCCCT GGCCGCCTAC
GAGGAAGCCT TCCGCGTCAA CCCCACCGAC ATCGCCACCA TGGCGGGGCT GGGCCGCATC
TACCTGGCGC GCGAGGCCTG GGAGAAGGCG CGCCGCGTGT ACCGCTCCAT GGTGCTGCAG
AACCTCGACG AGGACGCCGG CATCAGCAAG GCGCAGGTGT ACGGCAACCT CGGGCGCATC
CACGTGGCCC TGGGCGAGCC GCGCAAGGCC AAGGGCATGT TCCAGCGCGG TCTCGAGCTG
GAGCCGCAGA ATCCCGAGCT GCTGCAGGGG CTCGAGTCGC TCTCGGAATA G
 
Protein sequence
MAEFQPYLRI LDTDPTDTHA LQALEAAIGE NHQGTASADI LSALAESRSV LRERGAVDTA 
LRLLELALRV VGDDAAHAGQ RADLLLERGR IYVEDLLDDA AAEESFRQVL ALREGDESAS
ESLAQLEMER ENWAKFVEKN LSEAESATDR KLVTHMYLSA AEFYGRYQPG AEEFERYLRK
ALEVDPENHG AARLLVRALR TEERWEDLLA FLDERAEQVG KDERIQALLD LGEIANEKLG
HTEVAENNMK KVLAMDPVHP RALRLLSDLY ERLEDWSALV MLYTGALKSR RGRGSDFEMG
MLLQVAMLHW RRLDNLDAAE EYFRQILKAQ PAHPAALSFY RAYYAARGEG AKLLQILRQA
QKSLPRSSGE GGPEAERWRE LAVEVAELAE NDLGNPEKAI DAWKSILRAD ADAADARVAL
LRLYRKTEKW NALLDLMKDE IERLPEDDLA GRVSRLMEVV EIYRDRLKLD VMVINTYKAI
LDLDPGNLRA VDDLASKYTE LGRWKDLISV LSRKSEMPEV PRAQRAEILR EMARLWSERF
GNFAQAIKPL EALVDLMPDD AAAFAQLKDI YSRRRQWRQL IDLLGREALG RPPESRREVL
TEMARLAAER VGDSKLSVEL WNRVLELPAP PAPAPADSEG EGEGEDEDGQ ADENAAAEGA
QAGEGEALAA LAHLYEREKR YPALAEVYHR QWRLAEDETE AVGLLEKLGS LLADRMDAPA
LAAQTFRQLL TLRPRHNRAL RTLREIYATE GNYAELESLY AELGQWTELV DAFHTIADRI
SDEGDKLYLF ERSAAIAAEH FDKPDRVARA YERVLAVAPE HLEAARALVP IYEATEKWAR
LLGTYEVLLG HAADDDERLD LELKIRDLCE ARLGSKALAF QWAARAYRLA PGREDLRADL
IRLGADADQW NEVAEILDAR ARDRATGTDE KLALLRELGR MAAVRLHEPE RARAYQREVL
VLSPDDPEAM DALEELATQL SDWPDLLTVQ RRRVALADAD DAKIALLFKI AFVEEERLED
LDAAVATYES LLALVPESQR AMRALARLQE ERGDWEGLVD ALGRELSHSD DTDTKVALLM
RIGGLYERNL ERPSEALGAY CEALSVAPGR GQIHSALERF LSPPSATATA AAAEEASEAA
AASATTDEET SLSGDVGESD SPDDGSGESD AADASASEGA AETEGAGETA GDETSGDEAA
ADADAETGDE SEADDEVTTE RSPVAAAPAR PRVSEAEQYR VAELLLPVYE QSGDVVRIAR
AVEVLRAGTP DEDEALAYDR RLRDLYGDRR GDETRAYEAA LRVLRRAPED GANRAALLTH
SAVLGRDEEL ARHFENVLAG AEGYELPFDP KAQRGLAVEL AVLYEQRLED NERAERAWQT
VLELAADQGV VESRAYDALD RIYRTAGRWR DLRDLLLRRE EVTLDDAARK DIVLAICELE
EGVLEDRDGA IAAYLRVLDI DPSLDRAYRS LERLLARAER HFELEELLAR EGEYAGDATR
VELLFRRAEL RAQHLGDTLG AVPLLEEVVL HRPGHRAART LLESLLPEPE LRLSVARILE
PLYEQDRRWL ELCRVLRAQR EFSATPYEAA ELLARVAAVE EDEMGRPTEA FNTWVEILAL
APGDQGARAS LRRLGTAQNR WADVAAAYEM ALEKADLTDV ALAIELLTEL AEIYDQRFTE
RDRAISAYRR LLDLDLGSPD TAAMAARALD RLYTGESRWD DLVEILRRQA DWADDIEERK
RILARVARVS EEAQEDVEAA ILTWREVLSE DPEDGDALDA LERLYQQESR AMELIEILRR
RVELAEDAGE RKAHLWRIAV LFEHAIEDRI EAITAHLEVL DHVPEDPETL MELGRLYRAE
ERYADLLDVL ERRLAQSEEP GERIALTCEA AELLATQLGR EAEALERYAR VLEDDPDHGE
ALAAVEKLTA GPELLMRGAE ILRPIYERAG ARDDVAGPEQ AQAPDGAHSK LAALLLRVVE
ATLDPREQLR ALREVARIRE QRLGDPSGAF EVAVRALRVG VAEPEMPELL DEVERLGSEL
ERPSDLIEIY QEIAPDVLDG EQARRLHLDI ADLARAVHED PSLARSYYQR VLDDQPEDSR
AMVALESIYR ETDEHEALYD ILVRKADMLA DDLDARSAAL AEAARLCAEA LGRPEDAILA
WEQVLELTPD SREATVALER LYEAAERHHD LVDLLERRLG FAFTVEEAVA LRYRLGDLCE
HKLYDPDTAL ENYSAALGGD PGHVRATEAL ERFLDDPGLR ARAAEVLEPI YVAQQDWVKL
VRIYEIKLDS AEDADERLAL TRYIARLHEE QLEDLEGAMH WCGRVFREIP SDVDIREQLA
RLASILDRWE ELSHIFQAYL DDEPGEPPEL ASVARALGDI YERRLDEIER AQAAYRRVLQ
VRPDDLDTFE RLRDMLTRAE RWYALIEVYD EAIARAPIDE SGDRRRIELF LRMAWVYEEH
LHDAEQAINS YRSVRDIDPG QPTALAEIDR LYQAEAMWFE LAELLAQRVA MAEAEDDVHA
AVDLRIRLAE VLGRRLEDVA SAIDQYEQVL RASEGWERAL PPLERLVLHE DYRARIAELL
EPVYRANDWW KKLVVILDTQ VGYVDDPDRR VAMLREIAHI HETRGGDERL ALEALSRAWR
ENVRDSDALA ELTALAAKLG AWETLAETLA AGVAEEFDPD LLALVWSRIA EIHEERRGEP
ARAIEAWRKV LEVRDDDDAA LSALDRLLLM EARYEELVRV VEKRANLAQD EGTRRVFLHR
IAALYEEDLE QRGEAIAAYK NVLTEAPGDP VALDALERLY REESDWHELV AVLQQKIEQA
QERTQRRELR LAAADVYERQ LEDIYEAMAM LRAILDPEDG DPEDGEALAR LDVLYQSESA
WPDLIDILDR RAALESEPIK RAELAFRGAQ VAETELLERD DAIERYAALL AYAPGHGGTR
AALDALAQRE ETAERASAVL ERLYEDEQNY DALAALYEQR LSMPTPNPER RFELYRMLAQ
VCEERLGDLD RAFEVWALAL SEYSSSEEVQ DHLERLAASR GAWEDLVALL EQRLAELLDA
ELEYAYALKL ASLYEDALGD LEGAAEKYRR ALDVAADERE PLAALDRIYG RSERYEELAE
VLAREAEATL DEGEQCQFLF RLGDLREVRL RDLPGAVNAY RDVLERIPQH SAARGALERL
LHSAESVRAD IIRILEPLYE QEGDFARLAD LLAAKLGTTG VHFERAQIYS RIAELAENQL
GDPVRALDAA GGWLAEDPQS QQALAELARL AEAVDRFGEM AARLSGIVES ADDPDIQRAL
LFQMGTIELE RLRDDAAAEA SFKRCLEISP EFTEALDALQ RIYRERGGEG DRARLADVLG
RMAEITYEPE NKRRYLVEVA ELRGELGELD AAVEAWREVL ALDEGDRDAL ARLAIIHEQR
GDWYALIDIL GQSARYAANS DEERRFRSRI AQLQSDTLED LDAAVEAWQS VLDVAPDAED
ALTALEGIHT RREDWGAVQD TMARRLDLLD APADRVAVLH RLADLAADKR DANEDAIVYL
FQALDLDDTH LPTYEKLDEL LGKAERWHDL VDLLERAAGV YARLAGMGAA GQPQRKEIDC
LARAADIWEG PLANPDAAAE ILEKILAREP AYVPALTRLS KIYESAGDWD RCAEVLDRAL
ALGPTGRDAA ELYFRMGEVA REQSGDAAAA MSRWQQALAS DPSYLPAIAS IEAAAREAED
WPVVADMLTR RHNQVQKPAE QRELALALVD ILRKKLGQRA QAIPLLEGLV SEGEDDPEVL
RPLADLYCAA QQHDRAVPIY ERLADAAKKA RQLRDVAVYR QRLASILEAR GQMDEALAAY
EEAFRVNPTD IATMAGLGRI YLAREAWEKA RRVYRSMVLQ NLDEDAGISK AQVYGNLGRI
HVALGEPRKA KGMFQRGLEL EPQNPELLQG LESLSE