Gene Hoch_2956 details

Gene Information

Locus tag: Hoch_2956
Symbol:
ID: 8545344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4043794
End bp: 4055586
Gene Length: 11793 bp
Protein Length: 3930 aa
Translation table: 11
GC content: 68%
IMG OID: 646387635
Product: KR domain protein
Protein accession: YP_003267363
Protein GI: 262196154
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
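
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon (the gene sequence listed in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4043794, 4055586)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (11793 bp) and Protein Length (3930 aa) fields.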


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0245476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCG ACCGAGACGA ACTTCTTGCC ATCGAGAAGA TGGCGCAGCG CCTTGAGCGG 
CTCGAGTCGG AGAAATGCGA GCCGATCGCG GTGGTAGGCG CGGCCTGCCG CGTGCCGGGT
TGCGATAATC CTGAGCAGCT CTGGGCTCTG CTCATTCAAG GCCGAGACGC TGTCACGAGC
CTCGCCATGT CCGACGGAAC CTCGGTGCCG GCCGGTGTCG TCCGCGACCT GGACATGTTT
GACGCCGAGT TCTTCCGGAT GGCTCCGCGC GAAGTGCTGA GCATGGATGC ACGCCAACGA
CTCGTGCTCG AGGTCAGTTG GGAGGCGCTC GAGCGAGCCC GCACGCCGGC CGCCAAACTC
GAAGGCACGA GTGTCGGCGT GTACGTAGGC GTATCTGGGC TCTCGCAACT CGAGGGCAAA
GCGGAAGCTC ATGACATGAC GGGCAGCCTG GCAGCGGTGG TCGCTGGGCG CGTAAGCCAT
TTCCTCGGAC TGCGTGGGCC GTGCCTGTCG GTGGATACGG CCTGTTCCTC CTCGCTGGTG
GCTGTGCATC TCGCCTGCAA AGCACTGCGC CAGCGCGACT GCTCGGTGGC GCTCGCGGGA
GGCGTCGGCC TACATACCCG TGATCCGCGC GAGCTCGAAT CGTGGCTTGC GCTCGGCAAC
CTGGGGAAGA GCGGGCGCTG TCGGCCCTTC GACGCCGCCG CAGACGGCAT CGTGGGTGCG
GACGGGTGCG GCATGCTGGT GCTAAAACGA CTCAGCGACG CGCGCAAGGA CGGCGACCCC
TTGCTGGCCG TGTTGCGTGG CAGCGCCGTG AACCACGACG GCCGCGCGCA GGGATTGACT
GTCCCGAGCG GCTTGGCCCA AGCAGCGCTG CTCCGCCGAG CGCTCTCCGA CGCACGAGTA
ACGGCCCAAG CCGTATCCTA CGTCGAGTGC CACGGCACAG GCACCTCTCT TGGCGACCCT
ATCGAAGCCC AGGCTCTCGC CGAAGTGTTC GGCGAGAGTG AGCGCGAACA TCCCCTTCTG
ATCGGCTCGC TGAAGAGTAA TATTGGACAT ACCGACGCTG CTGCGGGGGT ACTGGGCCTG
TTGAAACTCG TGCTCGCGCT TCAGCACGGC CGCATTCCCC AAAGCCTGCA TTTCGATACG
CCCAACCCCC TCATCCCCTG GGACAGCCTC CCGTTGCGCG TCGCAGGCGA AGCAACCGCT
TGGCCGAGCA ACGGCAAGCG GCGACTAGGT GGCGTCTCTT CGTTCGGAAT GAGCGGGACG
AACGCGCACG TCATCGTGGA GGAGGCGCCG CCTGCGGGCC AGCCGGCGAA CGAGCGGGGG
GTCGCGGGCG AGTCGCCCGT GACCTACCCG GTGCTGCTCT CCAGTCGTAG TGACTCGGGT
CTCAGGAGCC AAGCGCAGCG GCTATTGGAC TGGGTAACCG AGCGAGCCGA CGTGGAGGTC
GTGGACGTTG CGTACTCGTT GGCGACGACT CGCTCCCATT TCGAGAGCCG CGCGGTGGTG
TATGCGCGCG ACCGGCAAGA GCTGCTCGCC TCGCTGCAGG CGCTGACGCA AGGGGCGCCG
GGGAGCGACG CGAGCAAGAC GAGGGTGGGC CGAGACAGGC TGGCCGTTCT GTTCACCGGC
CAGGGCAGCC AGCGAGCGCG GATGGGCGCC GAGCTGTCGG CACTCTATCC CGTGTTCCGG
GCCTCGCTCG AAGAAGCGTG CGCTCTGCTC GATCGCGAGC TGGGAGTCGA GCCACCGCTG
CTCGAGGTCC TGTCGGCCGA TGACGAGTCA CCGGCGGGCA AACTCCTCGA GCAGACGATG
TACGCTCAGT GCGGCCTGTT TGCCCTGGAG GTCTCGCTGT TTCGCCTGCT CCAGTCGTGG
GGGCTCGAAC CTACCTGGCT GCTGGGCCAC TCCATCGGCG AGCTTGTCGC CGCCCACGTG
GCGGACGTGC TCAGCCTCGA AGAGGCCTGT ACGCTGGTCG GCGCACGGGC GCGCTTGATG
CAGGCGCTGC CGGCGACCGG CGCGATGTAC ACCGTGCAGG CCTCTGAGCG CGAGGTGCTG
GAGGCCCTTG CCGGCCATGG CGAACGAGCC GCCGTTGCCG CCAGCAACAG CCCGACCTCC
ACCGTCATCA GCGGCGACCT GCAGGTCGTA CAGCAGGTCG CCGCCGCGTT CGAGGCGCGG
GAGCGCAAGA CCGCCCGGCT GCGCGTCAGT CATGCATTCC ACTCCCACCA CATGGACGGG
ATGCTGGCGG AGTTCGGACG CGTCGCGGAA GGCCTGCGCT ACCGGGCGCC GCGCATCGCT
ATCGTCTCGA CCGCCACGGG AACGCTGGCG CAACCAGCGG AGCTGTGCTC GCCCCAGTAT
TGGGTGCAGC AGGTCCGGGC CACGGTTCGG TTCGGGGACG GCCTTCAGAC CCTGCAGCAA
GAGGGGGCGC GTACGTTCCT CGAGCTCGGC CCCCACGGGG TGCTCACCGC GTTGGGCCAA
GAGGCCCTGC CGGATGCCGA GGGCTTGGCC TTCCTCCCCA CGCTGCGCAG GGCTCGTTCG
GAGTCGGCCA CCACGGCCGA AGCGCTCGGC GGCCTAGCTC GCCGCGGCTA CTCGCCCGAC
TGGGAAAGCG TCTTCAAGCC CTATCGGCCT CAGAAAATCG CCCTGCCCAC CTACGCATTC
CAGCGAGAGC GCTACTGGGT GGACGCAAGC GCCGCTCGGC CGGGCGACCT CCGGTCCGCA
GGGCTCCGCT CCGCCGAACA CCCCCTCCTC GGTGCAGCCG TGGCGTTCGC GGACAACGAC
GGCGTTTTGT TCACTGCGCG GCTCGCCCTC ACCGACCACC CCTGGCTTGC CGGCCATCAG
CTCTTTGGCA CCGTCATCTT TCCCGGCTCG GGCTACGCCG AGCTCGCCCT GGTCGCCGCG
CGGCACGTCG GCCTCGACCG TGTCGAAGAG CTGACCCTGG AAGCACCGCT CGCCCTCCCC
CAGGTCGGGG CCATCACGCT GCAAGTCTCC GTCGGCGCCG CAGAGTCCGA TGGACGACGC
CCGCTCGGGT TCTACGCCCG CTCCGAGGCC ACCCCGGACG CTGCCTGGAC CCGCCACGCC
AGCGGCTGGC TCGCTCCCGT GGCGAGCGAC TTCCCCTTCG AGTTCCGAAG CTGGCCACCG
ACGGGTGCCA GCGCCGTTTC GCTCGACGGC TTCTACCCAG ACCTCGCCGA CCTCGGCTTT
CACTACGGCC CCGAGTTCCG CGGCCTCCAG GCCGTCTATC GGCGCGGAGA TGAACTCTTT
GCCGAGGTCG CCCTGCCCGA GCCCATCGCG CATACCGCCT CTCAGTTTGC CATTCACCCC
GCGCTCCTTG ACGCCGCGCT GCAGACGCAG ATCGTCGACC ACAATCAACG TTCGTCTGAG
GTCACTGTGC CCTTTGCGTG GAGTGAGTTC TCGCTGCACG CGGTGGGGGC TACTCGGCTG
CGGGTTCGTT TCCACCGTCT CGACGAAGGC GTTGCCGAGC TGGATGTCGC CGACAGCCAA
GGTGAGCCCG TAGCGCGAAT GCGCGGATTG CGCTCGCGTC CCGCCGCAGC GAGTCAGATG
CGGCGTACGC CCTCCACGAG TGAGCACCTG TTGCGTGTGG TCTGGACAGA AGCCCAGGCA
TGGAAAGAGT CAACGCCGCG AGCTGCCGAG GGTGCCGTGC TGGGCTCCTG GACGCCTGCG
AGCGGGTGGT CTGCGGGACC CGAGCCCGCA CGCTATGACG ACCTCGCCTC GTTGGCCGCG
GCGCTCCGTC AGGGCGCCGC CGCCCCCGCC TGGGTCCTGG TGCCCTGCAT CTCGCGCGAC
ACCCCTGTGG ACCGCACAAG CGAGACCGAG CGCGTGACCG CCGAGGTGCT CGCGCGGCTC
CAGGAATGGC TCGCGGACGA TCGCCTCTCC GCGTTGCGCC TCGTCTTCCT CACGCACGGC
GCGGTCGCCA CGGGACCCGA CCAGGACGTG TCGGACCTAG TCCACGCGCC GCTTTGGGGC
CTCGTGCGCT CTGCCCAGTC CGAACACCCT GACGCCGCAC TGGTCCTGCT CGATAGCGAC
GAGAGCGAGG CGTCGCGAGC CGTTCTCCAG GCAGCGCTCG CCGGCTGCGA AGAACGAGAC
GAGGGCTTTC TCCCCGACCG TCAGCTCGCG CTCCGAGCGG GACGCCTGCT CGCGCCTCGC
CTTGTGCGCG GACGAGAGGC CGATACCCTG GCCCCTCCGC AGGTCGCGAC CTGGCGCCTC
GACATTCCCC AGAAGGGCAC GCTCGAACGT CTGGCGCTGG TTCCGAATCC CGAGGCCAAC
GCCCCGCTCT CGCCGGGACA GGTCCGCATT GCCGTTCGTG CCGCTGGCCT CAATTTTCGC
GATGTTCTCG ACGCGCTCGG CATGTACGCC GGCGAGCCCG GCCCGCTCGG AAGCGAATGC
GCCGGGGTCG TTCTAGAGGC CGCCGACGAC GTCTCGGACG TTGCCCCTGG CGATCGCGTG
ATGGGCCTCA TGCGCGCGCC TTTCAGCCCC ACGGCCATCG CGAACCACCG CAAGATCGTG
CGGATGCCGG CCGGCTGGTC CTTCCAACAA GGCGGCGGCG CGCCTCTGGT CTATCTTACC
GCCTACTACG GCCTCGTCCG AGTGGCGAAG CTCCAGCCTG GCGAGCGGGT CCTGATTCAC
GCGGCAACTG GCGGCGTGGG AATGGCTGGC GTCCAGATCG CTCGCGCCTT AGGCGCCGAC
GTGTTTGCCA CCGCAAGCCC CGCGAAGTGG GACGTGCTGC GCCGGATGGG CCTCGACGCA
GACCACATCG CGTCTTCGCG GACACTCGGG TTCGAGCGGG AGTTCCTCGC CAAGACGGGC
GGTCAGGGCG TCGACGTCGT GCTCAATTCC CTAGCGCACG AGTTCGTGGA CGCGTCCCTT
ACGCTGCTGT CGCGCGGGGG CCGCTTCGTC GAGATGGGCA AGACCGATCT TCGGGATTCC
AGCGCCGTCG CCGCAGATCA TCCGACAGTC CGCTACCAGG CGTTCGACCT CGTAGACCTC
GCACCGGCTC AGCTCGGAGA GCTGCTCTCC GAGCTGAGCC GGTTGTTCGA GCGTGGGGAA
CTGCGCCCCG ACGTGATCCA TGGCTGGGAC GTTCGGCACG CCCCGCGGGC GTTTCGTGCA
TTGGCGCAGG GTCACCACGT GGGTAAGTCC GTCTTCACCT TCGCGCGTCC GATTGACCCA
GCGGGCACCG TCCTGATCAC CGGTGGCACC GGCACCTTGG GCGGTCTTCT AGCGCGACAT
CTGGTCCAGC ACTACGGCGT TCGCCATCTC ATGCTCATCT CACGCCGCGG CCCGGGCACG
CCCGGGGTGG AAGCGCTCGC GGAAGAGCTC GAGTCGGCCG GGGCCCACGT TCAGCTGTTA
GCCTGCGATG TCACGGATCG CGCCGCGTTG CAGCACGCGC TGGAGGTCGT CCCCAAAGAC
CACCCCCTCA CAGCTGTCAT CCACGCCGCC GGCGTCATCG AGGACTCCGT ACTCAGCACG
CTTACGCCCG CGCGCCTCAG CTCCGTCATG CGCGCAAAGG TCGCCGGGGC GCTGCATCTG
CACGAACTGA CCGAGTCCGT CGATCTGTCG ACGTTCATCC TCTTTTCGTC GTTGTCCGGC
GTGATCGGGA GCCCGGGTCA GGCCAACTAC GCGGCCGCCA ACACCTTCCT CGATGCGCTC
GCCCAGCACC GCCGGGCCGG CGGTCTGCCC GCGCTGGCCC TCGCGTGGGG GGCATGGGAC
GAGACGAGCG AGCTTACGGC TCACCTTACT GAGGCGGATC GGCAACGCCT CAGGCAGATG
GGATTCCACC CGCTCCCCGC TACGACGAAC CTCGCGCTCT TCGACACCGC GCTGGCCCAG
CCCGACGCCG CGGTGGTGGC GGCGCGCTTC GACAGAGGCT CGCTCGATAA ACACCGCAAC
CAGACCCTGC CAGCCCTGTT CCGCGACCTA GCGAAACAGC CGCTCGACCG CCCACGCGCG
AATAATCTCG CCGCCCATAT GTCGCTCGGC GAGCGCCTGC GGCCCCTATC TCCCGGAGAG
CGTCAGCGCA TCCTGCTCGA ACTGGTTCAG ACCGAGGTTG CGAGCGTGCT CATGCTCTCC
TCGCACAACT TCGATTCACA GCAGCCGTTG CAGAAGCAAG GACTCGACTC GCTGATGGCG
GTCGAGATCC GCAACCGGCT CGGCGCGGCC ACCGGTCTGC GCCTGCCGGC AACGCTGCTC
TTCGATTATC CAACGCCCGC GGCGCTGGCG AGCTTTTTTG CCAAGCAGCT GAATGAGGCC
GACGAAGAGA GTAGCGCCGC ACCTGTTACG GCTGGGCCGA GGGAGGCGGA AGAGAGCGAG
ATCGCCATCG TGGGGATGGG CTGCCGCTTT CCCGGGGGCA TAAACTCGCC TCAGGGTCTT
TGGCAGCTGC TCGAGCAGGG CCAAGGCGTA ATCGGGGACT TTCCGTCAGA ACGCGGTTGG
CAGGTCTCAG AGCTCTACGA CCCAGACCCA AACGCCCCAG GCAGGAGCTA TACCCGTCGC
GGGAGCTTCC TCTACGACGC CGATCGCTTC GATCCCGCGT TTTTCGGGAT CAGTCCGCGC
GAGGCTCTGA CGGTCGATCC CCAGCAGCGC TTGTTGCTGG AGACATCGTG GGAAGCACTG
GAGCAAGCCG GAATCGATCC GACAACCCTG CAAGGAGCCA ACGCAGGAGT TTTCGCCGGC
GTCATTTACA ACGACTACGG CACGCGGCTG TGGTCGCTGC ACCGCGACAA GGGCTTCCTG
CTGCACGCGC CCGATGACCT CGAGGGCTAC ATGGGCGTGG GGAGCTCACC CAGCGTGGCC
TCGGGTCGCA TCGCCTATAC CTTCGGCCTC CAGGGCCCGG CCCTGACCGT GGACACGGCG
TGCAGCTCAT CGCTCGTGGC GATCCATTTG GCCTGCCAGG CTCTGCGTCA GGGCGAGTGC
ACGCTCGCGC TCGCTGGCGG CGTGACCGTC ATGGCCACGC CAGGGGTCTT CATGTCATTC
AGTCGCCAGC GGGCGCTATC CCCGGACGGC CGCTGCAAGG CGTTCTCGGC GGACGCCGAC
GGCACGGGCT GGGGCGAAGG CGCGGGCATG CTGCTGCTGG AGCGCCTGTC GGACGCCAAG
CGCAACGGCC ATCCGATTTT GGCGGTCCTG CGCGGCTCCG CGGTGAACCA GGATGGCAAG
AGCCAAGGCC TCACGGCTCC GAACGGCCCC GCTCAGCAGC GGGTGATCCT GCAGGCGCTC
GACAACGCCC GGCTCACGCC CAATCAGGTG GACGCCGTCG AAGCCCACGG CACGGGCACC
AAGCTGGGCG ATCCCATCGA AGCGCAGGCC CTGCTCGCCA CCTACGGCCA GGCGCACACG
CCAGAGGCCC CCGTGTGGCT GGGAAGTCTG AAATCGAACC TGGGCCACAC CCAGGCCGCG
GCCGGGGTCG CTGGCGTCAT GAAGATGGTC TTGGCGCTGC AGCACCAGAT GCTGCCGGCG
ACGCTGCACG CCCAGACGCC CTCGCCACAT ATCGACTGGT CGTCCGGCAC GCTCCAGCTG
GTGCAGAGCG CACGCCCCTG GCAAACGAAC GGCCAGCCGC GGCGAGCGGG CGTCTCCTCC
TTTGGAGTCT CTGGAACCAA CGCTCACCTG ATCTTGGAAG AAGCGCCGCT GGAGCAGGCG
GCGACCGAGC AACGAGCAGC TGCGACTGCG CCCGTGGCGG CCTTGCCGTT TTTGCTCTCA
GGCAAAACCG AAGAAGCGCT GAAGGCCCAA GCCCAGCGCT TGCACCAGCA CCTACAACGC
CACGAGGACG CCGCGCTGGT GGACGTGTCC TATTCGCTCG CCACTACCCG CGCGCACTTC
GAGCAGCGAG CCGCCCTCGT CGCCTCCACG CGGGAAGAGC TCCTCGCCGC TCTCGCCGCC
CTCGCGAACG GAGAGAGCGC TCCGTCGCTG GTGGTCGCTC CGCGAAGCGC CGACGGCAAG
GTCGTGTTCG TGTTCCCCGG CCAAGGGTCG CAGTGGCAGG GCATGGGGCG AGCGCTCTTG
CGGAGTTCGG ACGCGTTCCG AGCCGAGGTC GAGGCCTGTG AGGCCGCCTT CGCTCCGTAC
ATCGACGGCT CGCTGCGCGA AGCGCTCGAG GGCGGCAGCT CAGACCGGGT CGACGTGCTC
CAGCCCGTGC TCTTCACGAT GATGGTCTCG CTCGCTGCCC ATTGGCGCTC GCTCGGCGTG
GTGGCAGCCG CCGTTGTCGG TCACAGCCAG GGCGAGGTGG CCGCCGCCTA CGTGGCGGGG
GCGCTCTCCC TCGACGACGC GGCCCAGATC GTGGCCCTCA GAAGCCGCGC CTTGCGGCGG
GTCGCGGGGC GCGGCGCAAT GGCCGCGGTG GAACTTGGAG CCGAGCAGCT CGCCACCTAC
CTCGCGCCGT TTGAAGAACA GCTCGCCATC GGCGCCGTGA ACAGCCCTCG CGCGAGCCTC
GTCGCCGGGC AGCCGGCGGC GCTCGACGCG CTGCTCGAGA AACTCGCCGA GGACGGCGTC
TACACCCAGA AGGCCCGCGG AAACCACGCC TCCCACTGCC GCCTGGTCGA GCCGCTCGCG
CAAGAGCTGA CCGACGCGCT CCAGGGCATC CGCCCCAGCA CCTGCGCCAT CCCCCTGTAT
TCGACCGTCA CCGGGACCAG GCTCGAAGGA CACGAACTCG ACGCCGACTA CTGGTACCAA
AATCTGCGCG CCCCGGTGCT CTTTCAATCC GCTACCGAGC GGCTGCTCGC CGACGGTCAC
GACCTCTTCG TCGAGCTGAG CCCTCACCCC GTCCTCAGCC TCCCGCTCTA CGAGACCTTC
GACGCACGTG AGCACTCCGC CCAGGTCGTC ACCTCCCTGC GTCGCGGCGA CGGTACGCAC
GCTCGCATGC TCCTGAGCTT GGGCGAGCTG CACAACCGCG GTCACAAGCT CGACTGGCAC
GCCTTCTTCG CCCCCTGGCA CCCCCGCACC GTCCCCCTGC CTACCTACGC GTTTCAGTAC
GAGCGCTTCT GGCTCGAAGC GCCTGCCCCC TCAGACGCCG ACCTCACCTC CGCTGGCCTG
TCTGCGGCCG AGCATCCCTT GCTCGGCGCC TCTCTCTCGT TGGCTGAGTC GAACACCGAT
ATCTTCACTG CGCGATTGGC GGTGGCTCAG CACGCCTGGC TCACCGGCCA CAAGGTCTTC
GATACCGTGA TCTTACCGGG AACCGGGTTT GTCGAGCTCG CGCTTGCGGC TGCGCAACGC
GTTGGGCTTG CCCGCATCGA AGAGCTCATC TTGGAGGCGC CGCTGGTGCT GCCCGAGCAC
GAGGCGGTGC TGGTGCAGCT CTCCGTCGGG GCGCCTGATG TCAGCGGCCA TCGACCTCTG
CAGATCTACG CACGACCCGA ACAAGCCGCC CACGACGCCT GGACAAAGCA CGCCAGCGGG
CTTCTCGCTG CCGCCGAACG ACACCCCGAC GACCTCGACT TCCACCTTCA CCGCTGGCCT
CCCCAAGGAG CCACTCCAGT GGCACTTGAA GGCCTCTACG CACAGCTGCA CGACGCCGGA
CTCGACTACG GGCCCGACTT CCAGGGGCTC CGCGCGGTTT ACAAACGCGG CGATGACCTG
TTCGCCGAGG TAGAGCTCCC CGAGAGCCTC GCCAAAGACG CCGCTCGCTT CGTCCTCCAC
CCCGCGCTGC TCGACGCCGC GCTGCACACG CTCGCGTTGG ACTCGATCCA GAGCGCGGCG
GACGTGGCGC TCCCCTTCTC GTGGAGCGGC TTGTCGCCGC TGCGCGCGCA GGGAACGGGT
GCGCTTCGCG TACGTTTCAA GCGCGCTCAG GACGCTAGCG ACGTCTTCCT TCAGGTCGCC
GATGCTACCG GCGAACCCCT CCTTCAGGTT CAAGCGCTCA CGGCACGCCC GGTCGCCGCA
GAACAGGTGC GCGCCGCCGC CGCAAGTCAC GGCGACCTCT TCAAGCTTCA TTGGACGCCA
CTCTCGCGCT CCCAGTCGCA CGGTCAGAAC GACACGCGCG GCTGGGTGAT GCTGAGCACC
GATCTTGACC TCGCCTCCTC CCTGGGTCTC TCTTGCCACG CAAGCCTCGC ATCACTGTGC
GAAGCTCAGA GCACCGACGC CCCCTGGCCA GCGTGCGTGG TCGTTCCGTT CCTGAGACGC
AGCGACGCGA AGGTTGTAGG TTCGGCCGTC CATGAGGCAA GCCGCCGCGC GCTGACCCTG
CTGCAGGAGT GGCTCGCCGA GGAGCGGCTG CACCAAACAC GACTCGTTTT CCTGACGTAT
GGAGCCGTGG CGGCTCACGC AGAAGAGGAT GTACCCGACC TGATTCACGC ACCTCTGTGG
GGGCTCGTAC GCGCAGCGCA GGCCGAGCAC CCTCACCTCC CGATCTTCTT GGTCGATAGC
GACGATACGG AGGCCTCGCG ACGCACGCTT ATGAACGCGC TGGATGAGCT TGAGGAGGGC
CGCGAGTTCA TCTTGCGAAC CGGCCAGGCC CTTATCCCCA GCCTCGCCCG GGCGGCGCGC
AGCACCGATT CCGAGATGGG TGGGCTGGCG ACAGAGGGTA CCGTGCTTAT TACAGGCGGA
ACCGGCACCC TGGGCAGCCT ATTGGCAAGG CACTTGGTGC AGCACCACGG TGTCAAACAC
CTCGTCCTCG TATCGCGCCA GGGAGCCACG GCCGAGGGAG CCAACTCCCT CGCGCGCGAG
CTTCTAAACG CCGGCGCAGC GGTGACGCTA GCAGTTTGCG ATGTCACAGA TAGAGCGGCG
CTCGCGGAGC TTCTCGCGTC GATTCCCGCC GCTCATCCGC TCGCTGCCGT AGTCCACACC
GCAGGACTGC TCGAGGATGC CCTGATCGAC TCCCTCACCT CGGAGCAACT CCACCGCGTC
ATGCGCGCGA AGGTCGAGGC CGCGATACAC CTGCACGAGC TTACGAGTGC GTCCAAGCTG
TCCGCCTTCA TATTATTCTC CTCCTTTGCG GGTGTGCTTG GCAGTGCCGG TCAGGCCAAC
TATGCCGCCG CCAACACCTT CCTCGACGGG CTTGCCCAGC ACCGCCGGGC ACGGGGCCTG
CCGGCTCTGG CGATCGACTG GGGCTACTGG GCTGACCGAA GCGCGATCAG CGAACACCTC
AGGGACAGCG ACCTGCAACG TTTCGCTCGC CACGGTTTGC GCGCGCTCTC CGCCGAGACT
GGGCTCGCTC TGTTCGATGC GGCGCTGAGG CGCCCCGAAC CAGTGCTCGT GGCAACCGAA
CTCGATACAA CGCTCCCGTC GCGCCAAGCC GATGCCTTAC CGCTCCTGTT GCGAAGGCTC
ATACAACCCA AAGCCCTGCC GCGGACCACG GCCGACACGT ATTCTTTTGC GCGGGACCTC
GCCGAGCTCG CACCAGAGGA GCGCGAACGA GCCCTGGTGG AGTTGGTGCG TCGAGAGGCC
GCCAGCGTGC TCGGCATATC ATCTCCAACT ACGGTCGATC CGAGGCGTCC GCTGCAAGAG
CTGGGGCTCG ACTCGTTGAT GGCCCTTGAG ATCCGCAACC GCCTCATGCA GGCGACCAAG
CTCCGCCTCC AGGCGACCGT CCTTTTTGAT CACCCGACTC CCGCGGCTCT GGCGCAGCTG
ATCGGCGGGC AGATGTTCGC GGCGGAGGCA GGTGACGAAC GTGTGCTCGC ACAGCTCGAC
CAGTTGGAGG CCTTGGTATC AGAGCTGTCT CCCGGCGAGT TGGGGCGCTC ACAGCTGATC
TCGCGTCTGA AGTCCCTTTC ATCGCGGCTG AGCGTCGCCG CCTCGGCCGC CGACGCCAGC
GCGGCGCCGA GCCTTGAGGC CGCCACCGAT GATGAGCTGT TCGAGTTCTA CCAGCAAGTG
TCAGTCACCC GGAGCCTGAG CGATGACACC TGA
 
Protein sequence
MKLDRDELLA IEKMAQRLER LESEKCEPIA VVGAACRVPG CDNPEQLWAL LIQGRDAVTS 
LAMSDGTSVP AGVVRDLDMF DAEFFRMAPR EVLSMDARQR LVLEVSWEAL ERARTPAAKL
EGTSVGVYVG VSGLSQLEGK AEAHDMTGSL AAVVAGRVSH FLGLRGPCLS VDTACSSSLV
AVHLACKALR QRDCSVALAG GVGLHTRDPR ELESWLALGN LGKSGRCRPF DAAADGIVGA
DGCGMLVLKR LSDARKDGDP LLAVLRGSAV NHDGRAQGLT VPSGLAQAAL LRRALSDARV
TAQAVSYVEC HGTGTSLGDP IEAQALAEVF GESEREHPLL IGSLKSNIGH TDAAAGVLGL
LKLVLALQHG RIPQSLHFDT PNPLIPWDSL PLRVAGEATA WPSNGKRRLG GVSSFGMSGT
NAHVIVEEAP PAGQPANERG VAGESPVTYP VLLSSRSDSG LRSQAQRLLD WVTERADVEV
VDVAYSLATT RSHFESRAVV YARDRQELLA SLQALTQGAP GSDASKTRVG RDRLAVLFTG
QGSQRARMGA ELSALYPVFR ASLEEACALL DRELGVEPPL LEVLSADDES PAGKLLEQTM
YAQCGLFALE VSLFRLLQSW GLEPTWLLGH SIGELVAAHV ADVLSLEEAC TLVGARARLM
QALPATGAMY TVQASEREVL EALAGHGERA AVAASNSPTS TVISGDLQVV QQVAAAFEAR
ERKTARLRVS HAFHSHHMDG MLAEFGRVAE GLRYRAPRIA IVSTATGTLA QPAELCSPQY
WVQQVRATVR FGDGLQTLQQ EGARTFLELG PHGVLTALGQ EALPDAEGLA FLPTLRRARS
ESATTAEALG GLARRGYSPD WESVFKPYRP QKIALPTYAF QRERYWVDAS AARPGDLRSA
GLRSAEHPLL GAAVAFADND GVLFTARLAL TDHPWLAGHQ LFGTVIFPGS GYAELALVAA
RHVGLDRVEE LTLEAPLALP QVGAITLQVS VGAAESDGRR PLGFYARSEA TPDAAWTRHA
SGWLAPVASD FPFEFRSWPP TGASAVSLDG FYPDLADLGF HYGPEFRGLQ AVYRRGDELF
AEVALPEPIA HTASQFAIHP ALLDAALQTQ IVDHNQRSSE VTVPFAWSEF SLHAVGATRL
RVRFHRLDEG VAELDVADSQ GEPVARMRGL RSRPAAASQM RRTPSTSEHL LRVVWTEAQA
WKESTPRAAE GAVLGSWTPA SGWSAGPEPA RYDDLASLAA ALRQGAAAPA WVLVPCISRD
TPVDRTSETE RVTAEVLARL QEWLADDRLS ALRLVFLTHG AVATGPDQDV SDLVHAPLWG
LVRSAQSEHP DAALVLLDSD ESEASRAVLQ AALAGCEERD EGFLPDRQLA LRAGRLLAPR
LVRGREADTL APPQVATWRL DIPQKGTLER LALVPNPEAN APLSPGQVRI AVRAAGLNFR
DVLDALGMYA GEPGPLGSEC AGVVLEAADD VSDVAPGDRV MGLMRAPFSP TAIANHRKIV
RMPAGWSFQQ GGGAPLVYLT AYYGLVRVAK LQPGERVLIH AATGGVGMAG VQIARALGAD
VFATASPAKW DVLRRMGLDA DHIASSRTLG FEREFLAKTG GQGVDVVLNS LAHEFVDASL
TLLSRGGRFV EMGKTDLRDS SAVAADHPTV RYQAFDLVDL APAQLGELLS ELSRLFERGE
LRPDVIHGWD VRHAPRAFRA LAQGHHVGKS VFTFARPIDP AGTVLITGGT GTLGGLLARH
LVQHYGVRHL MLISRRGPGT PGVEALAEEL ESAGAHVQLL ACDVTDRAAL QHALEVVPKD
HPLTAVIHAA GVIEDSVLST LTPARLSSVM RAKVAGALHL HELTESVDLS TFILFSSLSG
VIGSPGQANY AAANTFLDAL AQHRRAGGLP ALALAWGAWD ETSELTAHLT EADRQRLRQM
GFHPLPATTN LALFDTALAQ PDAAVVAARF DRGSLDKHRN QTLPALFRDL AKQPLDRPRA
NNLAAHMSLG ERLRPLSPGE RQRILLELVQ TEVASVLMLS SHNFDSQQPL QKQGLDSLMA
VEIRNRLGAA TGLRLPATLL FDYPTPAALA SFFAKQLNEA DEESSAAPVT AGPREAEESE
IAIVGMGCRF PGGINSPQGL WQLLEQGQGV IGDFPSERGW QVSELYDPDP NAPGRSYTRR
GSFLYDADRF DPAFFGISPR EALTVDPQQR LLLETSWEAL EQAGIDPTTL QGANAGVFAG
VIYNDYGTRL WSLHRDKGFL LHAPDDLEGY MGVGSSPSVA SGRIAYTFGL QGPALTVDTA
CSSSLVAIHL ACQALRQGEC TLALAGGVTV MATPGVFMSF SRQRALSPDG RCKAFSADAD
GTGWGEGAGM LLLERLSDAK RNGHPILAVL RGSAVNQDGK SQGLTAPNGP AQQRVILQAL
DNARLTPNQV DAVEAHGTGT KLGDPIEAQA LLATYGQAHT PEAPVWLGSL KSNLGHTQAA
AGVAGVMKMV LALQHQMLPA TLHAQTPSPH IDWSSGTLQL VQSARPWQTN GQPRRAGVSS
FGVSGTNAHL ILEEAPLEQA ATEQRAAATA PVAALPFLLS GKTEEALKAQ AQRLHQHLQR
HEDAALVDVS YSLATTRAHF EQRAALVAST REELLAALAA LANGESAPSL VVAPRSADGK
VVFVFPGQGS QWQGMGRALL RSSDAFRAEV EACEAAFAPY IDGSLREALE GGSSDRVDVL
QPVLFTMMVS LAAHWRSLGV VAAAVVGHSQ GEVAAAYVAG ALSLDDAAQI VALRSRALRR
VAGRGAMAAV ELGAEQLATY LAPFEEQLAI GAVNSPRASL VAGQPAALDA LLEKLAEDGV
YTQKARGNHA SHCRLVEPLA QELTDALQGI RPSTCAIPLY STVTGTRLEG HELDADYWYQ
NLRAPVLFQS ATERLLADGH DLFVELSPHP VLSLPLYETF DAREHSAQVV TSLRRGDGTH
ARMLLSLGEL HNRGHKLDWH AFFAPWHPRT VPLPTYAFQY ERFWLEAPAP SDADLTSAGL
SAAEHPLLGA SLSLAESNTD IFTARLAVAQ HAWLTGHKVF DTVILPGTGF VELALAAAQR
VGLARIEELI LEAPLVLPEH EAVLVQLSVG APDVSGHRPL QIYARPEQAA HDAWTKHASG
LLAAAERHPD DLDFHLHRWP PQGATPVALE GLYAQLHDAG LDYGPDFQGL RAVYKRGDDL
FAEVELPESL AKDAARFVLH PALLDAALHT LALDSIQSAA DVALPFSWSG LSPLRAQGTG
ALRVRFKRAQ DASDVFLQVA DATGEPLLQV QALTARPVAA EQVRAAAASH GDLFKLHWTP
LSRSQSHGQN DTRGWVMLST DLDLASSLGL SCHASLASLC EAQSTDAPWP ACVVVPFLRR
SDAKVVGSAV HEASRRALTL LQEWLAEERL HQTRLVFLTY GAVAAHAEED VPDLIHAPLW
GLVRAAQAEH PHLPIFLVDS DDTEASRRTL MNALDELEEG REFILRTGQA LIPSLARAAR
STDSEMGGLA TEGTVLITGG TGTLGSLLAR HLVQHHGVKH LVLVSRQGAT AEGANSLARE
LLNAGAAVTL AVCDVTDRAA LAELLASIPA AHPLAAVVHT AGLLEDALID SLTSEQLHRV
MRAKVEAAIH LHELTSASKL SAFILFSSFA GVLGSAGQAN YAAANTFLDG LAQHRRARGL
PALAIDWGYW ADRSAISEHL RDSDLQRFAR HGLRALSAET GLALFDAALR RPEPVLVATE
LDTTLPSRQA DALPLLLRRL IQPKALPRTT ADTYSFARDL AELAPEERER ALVELVRREA
ASVLGISSPT TVDPRRPLQE LGLDSLMALE IRNRLMQATK LRLQATVLFD HPTPAALAQL
IGGQMFAAEA GDERVLAQLD QLEALVSELS PGELGRSQLI SRLKSLSSRL SVAASAADAS
AAPSLEAATD DELFEFYQQV SVTRSLSDDT