Gene Haur_3965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3965 
Symbol 
ID5735826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5013803 
End bp5027743 
Gene Length13941 bp 
Protein Length4646 aa 
Translation table11 
GC content66% 
IMG OID641281115 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001546725 
Protein GI159900478 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATA GAGCAGCATA TGATGCCGCA TACGATACAG CGGTAGCGAT CGTGGGTATG 
TCCGGCCGCT TTCCAGGCGC GTCGACCGTC GATGCGTTCT GGCAGAATCT GACTGCCGGC
GAGCGCTCGA TTCGCACCCT GGGCGACGCG GAATTGCTGG CGGCGGGCGT CGATCCCGAA
CTGCTTCGTG ACCCGCAGTA TGTCAAGGCC GGCGCATTTG TCGATGATAT CGAGCTGTTC
GATGCGGCGT TCTTCGGCTA CACGCCGCGC GAAGCTGAGG TGATGGACCC GCAGCACCGC
CTGTTCCTGG AGTGCGCCTG GCAGGCCTTG GAGCAGGCCG GTTACGACCC CGATGGCTTT
CGCGGCTCGA TCGGCGTCTT TGCTGGCTCA GCCACGTCGT CGTATCGCGT TCATAATATT
CACACCAACC CTGAAATCGC CGAATCGGTG GGCGGCTTGC AACTGGCCGT CGGCAACGAC
AGCGACTCGC TGGCGTCGAC CGTGTCGTAC AAGCTGAACC TGCGCGGGCC GAGCGTGGCG
GTGCAGACGT TCTGCTCGAC CTCGCTGGTC GCAGTCCACA TGGCTTGCCA GAGCCTGCTC
ACCTATGAGT GTAACATCGC CCTGGCCGGC GGCGCCGCGA TTACGGTGCC GCAGGGTGTG
GGATATCTGT ACCAGGAAGG CGGTATCCTA TCGCCCGATG GACACTGCCG CACCTTCGAT
GCCAAGGCCC AAGGCAGCGT GATGGGCAGC GGCGTCGGCG TGGTAACCCT CAAGCGCTTC
GAGGATGCGC TCAACGATGG CGACACGATC TACGCGGTTA TCCGTGGCTC GACCGTCAAC
AACGACGGCA TCCGCAAGGT CGGCTACACC GCCCCCGGCT TGAATGGCCA GTCCGCAGTG
ATTACCATCG CTCAGAACCG GGCCGAGGTT GATCCCGACA CGGTCAGCTA TATTGAGGCC
CACGGCACGG CCACGCCGCT CGGCGACTCG ATCGAACTAG CCGCTTTGAT CAAGGCCTTC
GAGCGCGGCA CCGAGCGCAA ACAATTCTGC GCGCTGGGGT CGGTCAAACC CAACATCGGC
CACCTCGACC GGGCGTCGGG GGTAACGGGC CTGATCAAAA CGACCATGGC CCTGCACCAC
CGCCAGCTGC CGCCCAACCT TGATTTCGAG ACGCCTAGCC CGGATATTGA TCTGGCCAAC
AGCCCGTTCT ACGTCAACAC GCAACTGCGT GACTGGCCAG CAGATGGCGC TGCGCCACGC
CGGGCCGGCG TCAACTCGTT TGGCCTAGGC GGCACCAACG TCCACGTCGT GCTGGAGGAA
GCGCCTGCAC CGGCGCCCGT GGCTCCGGCT CGCCCGGCCC AGCTGCTGGT ACTGTCGGCC
AAGACCGCGA CCGCCCTGGA GGCCATGACC GACAACTTGG CTGCCTATCT GGCCGGCGCC
CCCGCCGATT TGGCCGACGT GGCCTTTACC CTCCAGGCAG GACGGACAAG GTTTAACCAT
CGGCGCGCTT TTGTGTGCGA AAGCGCGGCC GATGCGGCGC AGGTGCTGCA AACCCGCGAC
CTGCGGCGGA TCACGACGGT TGAGCAGAGC GGGCGCAACC GGCCGGTGGC GTTTGTGTTC
CCCGGCGTGG GTGACCATTA CGCCGGCATG GCCAAGACGT TGTATGCGAC CGAGGCGGTC
TTCCGCGAGG CGGTTGATCA GTGTGCCGAA TTGCTGGCCC CGCGCCTTGG CCAGGATCTG
CGCGCCGCCC TGTATCCCGC CGATCAGCCA GCCGCAGCCG CGGCCCACAC GCTGTTTGCG
GCTACTGCGG CGAGCAGTCG TGTGGCGGGA GCGCTGCACC AGACGGCGCT GGCCCAGCCG
GCGGTGTTTG TGGTGGAGTA TGCGTTAGTC CAACTGCTGG CGAGCTGGGG TATCCGGCCG
CAAGCGCTGC TCGGCTATAG TCTGGGCGAA TACGTCGCGG CGACGGTCGC CGGGGTGTTG
AGCTTGGAGG ATGCCTTGAC CCTGGTCGCC AAACGCGCCC AATGGATCCA GGCCCAGCCG
CATGGCGCGA TGCTGGCCGT CTCGCTGGGC GTCGAGGCCA TCCAGCCCTA CCTGAATACC
GAGGTGGCGC TGGCGGTGGT CAACAGCCCG ATGACCTGTG TCCTAGCTGG CCCGCACGCC
GCCCTAGAGT TAGTCAAAAT CCATTTGGAA GAAGATGAGG TGGCGAGCCG CTGGCTGGAG
ACGAGCCACG CGTTCCACTC GCCGATGCTG GCGCCGGTGG CGGCCGAACT GACCGCGCTG
GTACGCACGC TGCGGCTCCA GACGCCCAAA ATTCCCTATA TCTCCAATGT GACTGGCACG
TGGATCACCG ATGCCGAGGC GACCGACCCG GGCTACTGGG CGCGGCATAT GGTCGAAACG
GTGCAGTTTG CCGATGGCGT CGGTACGCTG CTGGCCGATG CTCAGCTGAC GCTGCTCGAA
GTGGGACCGG GCCAAGCGCT GGGGTCGTTT ATCCGCCAGC ATCCGGCCTG CGGGCGTGAC
CGGTTTGGCC AGATTGTCGC GACCCTGCCG GGTGCAGCCG AGGCCACTAA TGATCTGGTG
GCGCTGCTCA ACGGGCTGGG GCGGTTGTGG TTGGCCGGTG TCCCCATCGA TTGGGCAGGC
CTGCACGGAA AGCGTCCGCG CCGCCGTGTC CCGCTGCCGA CATATCCCTT TGAGCGCAAG
CGCTTCTGGA TCGACGCCGC GATCATGCCG GCGGCGCTGG CGGCCGGCGA ACGGTTGCGG
GGTCGGCACG CGGATGTGAC CGACTGGTTC TATCGGCCCG ACTGGGCGCC AGCTGCGCTC
GGCGTCCCGG CCGCTCCCGG TCACTGGCTG ATCCTGCCCG ATGCCCACGG ACTGGGCAAG
GCGGTGGCCT CGTCTCTTCG CGCGGCTGGC CACACCGTCA CCCTGGCCAC CGCTGGCCCG
GCCGATACCA CACTGATGGT GCCGCAAATT CCAATTGATC CGACCGATGC TGCGGCTTAT
GAGCTCCTGC TCGACACGCT GCGCGCCGAC GGCGGCCTAC CCAGCCACAT CCTGTGGCTG
GGCGGCCTGA CGCCGCTCGA CTCGGCCCTG ACCGGCCCGG CCCGCTTTCA GGCGGCCCAG
GCGACCGGCT ACTACGACCT GCTCCAGCTG GCGCAGGCGC TGGTGACACG GGTGATTGAT
GAAGCGGTGC AGCTCCTGGT AGTGACCGCC GGGATGCAGG CAGTCGGTGC GAATGCGATC
CCCGTGGCCG AACACGCCAC GCTGCTGGGG CTGGCGACGG TGATCGGCCA GGAAAACGTG
ACCATTCGCG TGCGCAGCGT CGACCTTGCG ATGGCCGATG ACGCCGCAGC CGAGCTGCTG
GCGGCCGAGT GCTTGGCGAG CGGCGATGCG TTGCGCGTCG CCTACCGCGA TGGTCAACGT
CTGGAAGAAC GCTACCAGCC GATTCGCCTA GAGACGCCGG GATCGTCGGT GCTGCAATCG
GGTGGGGTGT ACGTGATCAC AGGTGGGCTG GGCGGGGTCG GGCTGGTGCT GGCGGAGCAT
CTGGCGCGGA CGGCGCAGGC GAAACTGGTG CTGGTTGGCC GGCAGGGCTT GCCGGAGCGG
GCGGCCTGGG ACGCGTGGCT GCGCGAACAT GGCGCGGACG ACGCCACGAG CCAGCGTATC
CAGCGGGTGC GAATGATCGA AGCCGCTGGT GGTACGGTCG AGGTTATGGC CGCCGATGTG
GCTGACCCTG AACAACTGCG CGGGGTGTTT GCTGAGACTG AGGCGCGATT TGGCACGCTC
TACGGGGTGC TGCACGCCGC TGGCATATCC GACTCCCAGG CGTATCTGCC ACTGGAGACG
ATTGGGCCGA AGGAGTGCGA GTGGCACTTT CAGCCCAAGG CCTACGGCCT GTACGCGCTA
GAAGCGGCAC TGGATGACCG GTCGCTGGAC TTCTGCGTGG TCTTTTCGTC GGTGTCCTCG
GTGCTGGGCG GGCTGGGCTT CGCGGGCTAC GCGGCGGCGA ACAGCTTTAT GAATGCGTTC
ACCCAGCGCC ATAACCGCAC GCACGCGGTA CCTTGGGTCA GTGTCAACTG GGACACCTGG
CACCTGCGCG CCGGCCAGCA CGATGTGATC GGCGCGACCG TCGCTCAATA CGAGATGAGT
CCAGCCGAGG GCGCCGACGC CTTTGAGCGA GCGGCCGCGA CGCGCAACGA GCCGGTGATC
ATTAACTCAA CCGGCGACCT GGATGCCCGC ATTCGCCAGT GGGTCCGCCT GGAATCGGTG
CGCGAACAGC CGGAACGCGA ACGCGAGGCG GCTGGCTCGA CCCAGCAGGC TGTGTCAGTC
TCTGTCCCAT TGCGATCAAC CAGCGAGTAT GAGCAGCGGA TTACCGCAGT CTGGCAGCAC
GTCTTGGGCA TAGAGACGAT TGGCATCCAT GACAACTTCT TCGACCTGGG CGGCAACTCG
CTGATCGCAC TCCAGCTGAT CGCCCGGCTC AAGAAGGAGT TCAAGACCCA GGTGCCGGCG
GTGGCGATTT TTGAAGCGCC CACTATTAGC GCGCTAGTCC AGTACATGTT GCCGGATGCG
CCCGTCGTGG CGCCCGCCGA TGCGCTGTTG GTCGAGCGAC GCCAGCGCGT GCGCCAGACC
GCCGAGCAGG ATGGCATCGC GATCATCGGC ATGGTTGGAC GCTTCCCCGG CGCCTCCACT
GTCGATGCCC TTTGGCAGAA CGTGGCCGAC GGAGTTGAAG CGTTCACCCG TTTCACCGAT
GAGGAACTGC GTGCGGCAGG CGTGCCGGCC GACTTGATCA ACGATACCAA CTATGTGAAG
GTCCGCCCGG TGTTGCATAA CGACATCAGT CTGTTCGATG CGGCGTTTTT CGGCTACACG
CCGCGCGAGG CGGAGTTTCT CGACCCACAG CAGCGCCTGT TTCAGGAATG CGCCTGGGAG
GCCCTAGAAC AGGCTGGCTA CGACACCCAG CGCTATCCCG GTTTGGTCGG CGTCTTCGGC
GGCACCAATG TGAACGCCTA CCTTTATCGC TTGGTGGAGG ATCCGGAACT TCGCGATCTG
ATGAGCGAGT CCATCACGCT GCAAAACGAC AAGGATGCGC TGGCGACCTA TGTATCCTAC
AAGCTCAACC TGCGCGGGCC AAGCTTCAGC ATCCAGACCT ATTGCTCGAC CTCGCTGGTC
GCTACCCACC TGGCCTGCCG CAGCCTGCGT GCTGGCGACT GCGACATCGC GTTGGCCGGC
GGTGTGTCTA TCCGCGTCCC AGTCAACACC GGCTATCTGT TCCAGGAAGG CGACCAGGGC
GCTCCTGACG GCCGTTGCCG CACCTTCGAC GCGCTTGCCG AAGGGACGAA TTTCGGCGAC
GGGGTGGCGA TCGTGGTGCT GAAGCGGCTG GCGGATGCGC TGGCCGACGG CGACACTATC
CACGCGGTGA TCCGTGGGTC GGCGATCAAC AACGACGGCG GCCTCAAGGT CGGCTACACG
GCACCCAGCG TGGTTGGGCA GGCGGCGGTG GTGCAGGCTG CCCTTGCCGA CGCCAACCTG
GCCGCCGATG CCATCTCGTA TGTCGAGGCC CACGGCACTG CCACCAAGCT CGGTGACCCG
ATCGAAGTGG CGGCATTGAC CAAGGCCTAC CGCACGATGA CTGATAAAGT TGGCTTCTGC
GCGATCAGTT CGGTCAAACC GAACATCGGC CACCTCGACC GCGCTTCGGG GGCGACCGGC
TTGATCAAGA CCGTCATGGC GCTCAAGCAC AACGTAATTC CGCCGACCTT GCACTTCCAG
GCGCCCAACC CCGAGATCGA CTTCGCCAGC AGCCCGTTCT TTGTGCCGAC CGCGCTCACG
CCGTGGACGC GCAATGGCAC GCCGCGCCGG GCCGGGGTCA ACTCACTGGG TGTGGGTGGA
ACCAATGCGC ACGTCATCGT GGAGGAAGCA CCGCAGGTCG GGCCAAGTGG CCCCGGTCGG
GCGGCCGAAC TGCTGGTGCT GTCGGCCAAA ACGGCGACCG CGCTGGAGGC AGCGACCACG
AATCTGGCGG CTCATCTGGA GGAGCAGCCG ATGGTGAATC TGGCTGATGT GGCCCACACG
CTCCAGGTTG GGCGGCGGGT GTTTGAACAT CGCCGGGTTG TGGTCGCCCG CAACGTGGCG
GACGCGGTGG GCCTGCTGCG GAGCGGCGAT GCGCGGCGGG TGCTGACGCT GGCACAGAAG
CCGACCAGTC GGGGTGTGGC CTTTGTGTTC CCGGGCGTGG GCGACCACTA CGTCGGGATG
GCGGAGGGAT TATACGCGAC CGAGGGAGTA TTCCGCGCGA CGGTTGACCG CTGCTGTGCG
CTGCTGACGC CACTGCTCGG ATCGCCCATT CGGAAGGAAA TCTACCCCGA TGGCGGTGCG
CCGGTCTCGG CGGGCATCGA CCTGCGTGCT ATGCTGCGCG AGAACGCGAC GCCGAGGTCG
GCGGGGCGCT TGCACCAGAC GGCGTGGGCG CAACCGGCGG TGTTCGTGGT GGAGTATGCG
TTGGTGCAGC TGCTGGCGAG TTGGGGCATC CGGCCGCAGG CGTTGCTCGG CTACAGTGTG
GGTGAGTACG TGGCGGCGGC GGTCGCTGGG GTGTTGAGCC TGGAGGATGC CTTGACCGTG
GTCGCCAAGC GTGCCCAGTG GATTCAGGCC CAGCCGGCCG GGTCGATGCT GGCCGTCTCG
CTCGGCGCTG ACGCGATCGG TGCGTATGTG GGCGGCGCGG TGGCGCTGGC GGTGGTCAAC
AGCCCGATGA CCTGCGTCTT GGCTGGTCCG CAGGCGGCGT TGGAGGCGGT GAAAACCCGC
TTGGACGGTG ATGAGGTGGC CAGCCGCTGG CTGGAGACGA GCCACGCCTT CCACTCGCCG
ATGTTGGCGC CGGTGCAGGC CGAGCTGACC GCACTGGCTG GTACGCTGCG GCTCCAGGCA
CCGCGCATCC CGTATATCTC CAACGTGACC GGCACGTGGA TCACCGATGC GGAAGCGACC
GACCCGGGCT ACTGGGCACG GCATATGGTT GAGACGGTGC AGTTTGCGGA CGGCGTTGGC
ACGCTGCTGG CCGATGCCCA GCTCGTGGTG CTGGAAGTGG GGCCAGGGCA GGCGCTGGGG
TCGTTTATCC GGCAGCACCC GGCCTGCGGG CGCGACCGGT TCGGCCAGAT CGTGGCCACG
GTGCGTGGGA TGACGGACAC GAGCGATGAC CTGGAGGTGC TATTGAGCGC GCTGGGGCGG
CTGTGGCTGC ACGATGTGGT GGTCGATTGG GCCAGCTTCC GTGGCAGCGA AGTCCGCCAG
CGTATCCCGC TGCCGACCTA CCCCTTCGAG CGCCAGCGCT TCTGGGTCGA ACCGCGGTCG
TATGTTCGCA CGCCGGTTCA AGAAGCCGCC GTTATCGGCC GCAAACCGAA TATCGCCGAC
TGGTTCTACA CGCCGGTCTG GGAAGCCCAA CCGCTGCCGG CCAAGGGCAG CGCCCAGCCG
GCCGGGCCGT ATCTCGTTTT TGTTGATGAG CAGGGCTTTG GGGCGCAGGT CGTCGGGCGA
CTTGAAAGCA ATGGTGTGAC AGTGATCAAG GTTCGCCAGG GCGGTGCGTT CGCACAGCTT
GACGCCACGA GTTTCGCTGT CCGGCCTGAT ATCCGCGATG ATTACGCCGT GCTTTTCAGT
GCGCTGCAAA CCAACCGTCT GCTTCCGCAG GCCATCGTGC ATCTGTGGAA TGTGGCTTCG
AATGCGCGGG TCAGCGCGGA CGAGGCCGGC TTCGGCGCTT GCCAGGTCTA CGGCTTCTAC
AGCCTTCTGC ACCTGGCCCA GGCGGTCGGC GGTATTGATC TGGAATCGAC GCTGCCGATC
ACTGTGTTTT CCAACAGCAC CCAGCCCGTG ACCGGAAACG AGCGGCTGTA TGCCGAACAA
TCGCCCAGCG CCGTGACCTG CCGCGTGATC GGTCAGGAAA ATCCGGCGGT GTTCTGCCGC
AACGTTGATA TCTGCGTTCC AGAGCAGGCT GGGGCCGAAG CGGACCAGCT GGCCATGCTG
CTTGAGCAGG AACTGCTGGT TCCATCGCAA GATATCGCGG TGGCCTACCG CGAGGGACGG
CGGTTTGTGC AGCACTACCG GGCCGATCGG CTCGAGCCGA TCTCCCGCGC TGTCCCTGCG
CCGTTGCGCA TGGGCGGCGT GTATCTGATC ACCGGCGGCC TCGGCGGCAT CGGCACGGCG
ATCGCCGGCT ACCTAGCCGA AAAAGCTCAG GCCAAGCTGG TGCTGCTGGG GCGCACGCCA
CTGCCGCCGC GCGAGGAATG GGATGGGCTG GCGGTCGCCC GCGGCCCCGA AGATGGCCTT
GTTCAGAAGA TCGCGAAGAT CCGGGCGATG GAGGCTCATG GCGCCGAGGT GCTCACCCTC
AGCGCCGATG TCGGAGACCC GGCCCAACTT AGCGCCGCGT TGGCAGAGAT CCGGCGGCGC
TTCGGCGCCC TCCACGGCGT CGTCCACGGC GCCGGCCACC TTGATCAGTC CGGGTTCCAG
TTGATCCAGG ATGTCGGCCA CGAGCCGTGC GAGGCCCACT TCAAGCCGAA AGTCTATGCG
CTGTATCATC TGGAAGCCAT GTTGCGCGAC CAAGAGCTGG ATTTCTGCCT CCTGCTGTCG
TCGGTTTCGT CGGTGCTGGG CGGCCTGGGC TATGTCGGCT ACACGGCGGC CAACTACTTC
ATGGATATCT TCACCCACCG CCTGCGCCAA TCGCCGTCCA ACCGCTGGAT TAGCGTCAAC
TGGGACACGT GGCACCTGAA GGCCGGCCAG CATGATGGGG CGACCGTCGC CCAGTACGAG
CTGTTCCCGT TCGAGGGCGT CGACGCGTTC GGGCGCATCC TGGAACGCGC GCCGGTGCAG
ATTATCAACT CGACCGGCGA CCTCGACACG CGCATCAAGC AGTGGGTGCT GCTCGAGTCT
ATCCGCAGCA ATGCCGCGTC CAGCAGCCCG GCCGCCAGCC ACGAGCGGCC GGCGATCGAT
ACCCAGTATG TTCCGGTCAA CAGCGAGTAC GAGCGGCGGA TCGCGGCGGT GTGGCAGCAG
GTGCTGGGCA TCGGCCAGAT CGGCATCGAC GACAACTTCT TCGACCTGGG CGGCAACTCG
CTGACGGCGT TGCAGCTGAT CTCGCGGCTC AAGAAGGAGT TCAAAACCCA GATTTCGGCA
GTGGCGATCT TCGAGGCACC GACGATCCGG GCGATGGCGC AGTATCTTAT GCCCGATGCA
CCGCCTGCGG TAGATCTGGC CGAAACTCTG CTGGTGCAGC GGCGGCAACG GGTGCGCCAG
ACTGCCGAGC AGGATGGCAT CGCGATCATC GGCATGGCTG GGCGCTTTCC TGGCGCCTCG
AATGTCGATG AGTTCTGGGA CAATCTTGCC AACGGCGTCG AGGCCTTCAC CGCCTTCACT
GATGCGGAGT TGCTGGCGGC CGGCGTGCTG TACGAGCAAG TCCACGACCT GAACTACGTC
AAACGAAGGC CGATCTTGAA AGAGGATGTC ACCCTCTTCG ACGCGGCGTT TTTCGGCTAC
ACGCCGCGCG AGGCGGAGTT TCTCGACCCG CAGCAGCGCT TGTTCCACGA GTGCGCCTGG
GAGGCCCTGG AGCAGGCCGG CTACGACACC CAGCGCTATC CCGGTTTGGT CGGCATCTAT
GGCGGCGCGA ACCTCAACAC CTACCTGATG CAACTGGCGT TTGATCCGGA TGTCGCCAGA
AACTTCACCG ACTCGGTGTT TCTCGAAAAT GATAAAGACG CGTTGACAAC CAACGTGTCG
TACAAGCTGA ACCTGCGCGG GCCAAGCTTC GCGGTGCAGA CCTATTGCTC GACCTCGCTG
GTCGCCACCC ACCTGGCCTG CCGCAGCCTG CGCGCCGGCG ACTGCGATAT CGCGTTGGCC
GGTGGCGTGT CTATCCGCGT CCCAGTCAAC ACCGGCCATC TGTTCCAGGA AGGCGACCAG
GTGTCGCCCG ATGGAAGCTG CCGGACGTTT GATGCCCAGG CAGCCGGGAC CACATGGGCC
GATGGGGTGG CGGTGCTTGT GCTGAAGCGG CTGGCGGATG CGCTGGCCGA CGGCGACACC
ATCCACGCGG TCATCCGTGG GTCGGCGATC AACAACGACG GGGGGCTGAA GGTCGGCTAT
ACGGCACCCA GCGTGGTTGG GCAGGCGGCG GTGGTGCAGG CGGCGCTGGC TGATGCCAAT
CTGGCCGCCG ATGCCATCTC GTATGTCGAG GCCCACGGCA CCGCCACCAA GCTCGGCGAC
CCGATCGAGG TTGCCTCATT GACCAAGGCC TACCGCACGA TGACTGATAA AGTTGGCTTC
TGCGCGATCA GTTCGGTCAA ACCGAACGTC GGCCACCTCG ACCGGGCGGC CGGCGCGACT
GGCTTGATCA AAACGGTTAT GGCGCTGAAG CACAACGTGA TTCCGCCGAC CTTGCACTTC
CAGACGCCCA ATCCCGAGAT CGACTTCGCG AGCAGCCCGT TCTTTGTGCC GACCGCGCTC
ACGCCGTGGA CGCGCAATGG CACACCGCGT CGGGCCGGGG TCAACTCACT GGGTGTGGGT
GGGACCAATG CCCACGTCAT CGTCGAGGAA GCGCCGCAGG TCGGGCCGAG CGGCCCCGGT
CGGGCGGTCG AATTGCTGGT GCTGTCGGCA CGCACGCCCA GCGCCTTGGA GACGATGACG
GTGAATCTGA CCGCCTATCT GGAGGGGCAG CCAACGGTGA ATCTGGCCGA TGTGGCCCAC
ACACTCCAGG TTGGGCGGCG GGTGTTTGAA CATCGCCGGG TCGTGGTCGC CCGCGATGCG
ACGAGCGCTG CGGCGCTGTT GCGGAGCGGC GATGCGCGGC GGGTGCTGAC GCTGGCACAA
AAGCCGACCA GTCGCGGCGT GGCCTTTGTG TTCCCGGGTG TGGGCGACCA TTACGTCGGG
ATGGCGGAGG GGTTGTACGC GACCGAGGGA GTATTCCGCG CGACGGTTGA CCGCTGCTGC
GCGCTGCTGA CACCGCTGCT CGGATCGCCG ATCCGAAAGG AAATCTACCC GGATGGCGGC
GCGCCGGTCT CGGCGGGCAT CGACCTGCGT GCTATGCTGC GCGAGGACGC GACGCCGAGG
TCGGCAGGGC GCTTGCACCA GACGGCGTGG GCGCAACCGG CGGTGTTTGT GGTGGAGTAT
GCGTTGGTGC AGCTGCTGGC GAGCTGGGGC ATCCGGCCGC AGGCGTTGCT CGGCTACAGC
GTGGGTGAGT ACGTGGCGGC GACGGTCGCT GGGGTGTTGA GCCTGGAGGA TGCCTTGACC
CTAGTCGCCA AGCGTGCCCA GTGGATTCAG GCCCAGCCGG CCGGGTCGAT GCTGGCGGTG
AGCTTGAGTG CCGAGGCGAT CGGTGCGTAT GTGGGCGGTG CGGTGGCGCT GGCGGTGGTC
AATAGCCCGA TGACCTGTGT CCTGGCCGGT CCGCAGGCGG CGTTGGAGGC AGTGAAAACC
CGCTTGGACG GTGATGAGGT GGCCAGCCGC TGGCTGGAAA CGAGCCACGC CTTCCACTCG
CCGATGTTGG CGCCGGTGCA GGCCGAGCTG ACCGCACTGG CTGGTACGCT GCGGCTCCAG
GCACCGCGCA TCCCGTATAT CTCCAACGTG ACCGGTACGT GGATCACCGA TGCGGAAGCG
ACCGACCCGG GCTACTGGGC GCGGCATATG GTCGAGACGG TGCAGTTTGC GGACGGCGTT
GGCACGCTGC TGGCCGATGC CCAGCTCGTG GTGCTGGAAG TGGGGCCGGG GCAGGCGCTG
GGGTCGTTTA TCCGGCAGCA CCCGGCCTGC GGACGCGACC GGTTCGGCCA GATCGTGGCC
ACGGTGCGTG GGATGACGGA CACGAGCGAT GACCTGGAGG TGTTGTTGAG CGCGCTGGGG
CGGCTGTGGC TACACGATGT GGTGGTCGAT TGGGCCGGCT TCCGTGGCAG CGAAGTCCGC
CAGCGTATCC CGCTGCCGAC CTACCCCTTC GAGCGCCAGC GCTTCTGGAT CGAGCCGAAC
CTCAATAGTC GCCTTGCGGC GGCCCACCGG CCGATCCGCC GCCCGGATAT CGGCGATTGG
CTGGCTGCGC CATCGTGGAA ACGCAGCATT CAGGCCGGCA GCGCCGGAGT TGCGGCGCGT
TTGGCCGAGC CGCACTGCTG GCTGATGCTG GCGGATGGCG AGGGGCTGGC CGCCGAGTTG
ACTGCCTGGC TGGAGCAGCG CGGCCAGACC GTGATCACGG TCATGCCCGG CGCGAGCTTC
GCGGCACTCG GTCAGGCGCG CTATATGCTC CGCCCGACCA GCCGCGAGGA TTTCACGGCC
TTGTTGCAAA CGTTGGAGCG CCAAGGCCAC GCCCCCAGCC GCGTCGTCCA TTGCTGGCTG
TTCGGTGCCC AGGAGGATGC CGATTCGCTC AGCGATGCTA CTCTGACCGC AACACTAGAT
GTCGGCTTCT ACAGCCTGCT GGCACTGGCC CAAGCGCTGG GCGACCAGGG TGTCGAGTGG
TGCGAGATCA ACGCCGTCAC TGCGGCCATG CAGGAGGTCA CCGGACAGGA GAATCTTCAG
GTAGCGGCAT CAACGGTGAT CGGGCCATGC AAGATTATTC CGCAGGAATA TCCCAACTTA
ACGGCGCGCT CGATCGATAT CCTGTTGCCA GCTGGGGCTG CCGAACGCAC GGCGCTGGTC
GCACAGCTGG GCACTGAACT GGCCACTCCG CCGACCGGTG ACCAGGTCGC CTTCCGCGGC
GCCCATCGCT GGGTCCAGGT TTTCGAACCA ATTACCGTGC CAGCAGCGCC CGCGTCGCAT
CCGCGCTTGC GGACGGGCGG GGTGTACCTG CTGACCGGCG GCCTGGGCGG GATCGCCCTC
GGCTTGGCCC GCGACCTGGC GGCGACGCTT CAGGCGAAAC TGGTGCTGGT CAACCGCTCC
GGCCTGCCCG ACCGCGCCAC CTGGCCGGCG TTGCTCGAAC GCGACGGTGC CGAGCAGGGC
GTGGGGCGGC GCATCCAGCA GGTGCTGGAC TTGGAGGTGC TGGGCGCGGA GGTGCTGGTC
ATCCAGGCCG ACGTCACCGA CACGGTGGCG ATGGCGCGGG CGGTGGCTGA GGCCCAGGCA
CGCTTCGGGA CGATCCACGG TGTGCTCCAC ACGGCCGGCG TGCCCGGCGT GGGCTTGATG
CAGCTTAAGG ATGCCGCGAC GGCAGCGGCT GAGCTGGCGC CCAAAGTCCA AGGCACGCTG
GCGCTGACCA GCGCGTTAGC TGGGATTCCA CTCGATTTCT TGGTGTTGTT CTCGTCGGTG
ACGTCGGCGA CGGGCGGCGG GCCGGGCCAA GTGGCCTACT GTGCCGCCAA CGCCTTCCTC
GACGCCTACG CCCGCAAGCA CGCCACCGAC CACGGTCAGA CCGTCGCGGT GAGCTGGGGC
GAGTGGCTGT GGGATGCCTG GTCTGAGGGC TTGCAGGGCT TCGCGCCCGA GGTCCAGGCG
CGCTTCCGCG CCTACCGCAC GACCTTCGGG ATCACCTTTG ACGAAGGCGC CGACATTCTA
CGCCGCATCC TGGCCGAGCC GCTGGCCCAC GTGTTTGTGA CCAGCGAGGA TCTGCTGCCG
ATGGCCGAGC GGAGTCGGCG CGAATCGGCG GCTCGCGGCC TCGAAGAATT ACAGCGGCAG
CAGGAAGCGC GGCCGACCTA TCCGCGCCCC GAGGTCGGCA CCTCGTTTGT CGAGCCGCAA
AGCGTGATTG AACAGCAGAT CGCCGGTATC TGGAGCACGG TGTTAGGCAT CGCACCGATC
GGCCTCCACG ACAATTTCTT CGACTTGGGC GGCAACTCGC TACTGGGTCT GGATCTCTTT
TCTCGGATCC GCAAGGCACT GAAGGTCGAC AAGCTGCCGG CCTACGTACT CTACGAGGCT
CCGACCGTCG CCACCCAGGC CGCCTATCTC ACACCGGCGC CTGAGGCAGC CCTGGTTACG
GAAGCGGTGC CGGATCTTGA TGTCAAAATC CGGCAAAAGG TTAACCGGTT TAAACAGCAA
TCGTCGCTGG AGGATGCATG A
 
Protein sequence
MTDRAAYDAA YDTAVAIVGM SGRFPGASTV DAFWQNLTAG ERSIRTLGDA ELLAAGVDPE 
LLRDPQYVKA GAFVDDIELF DAAFFGYTPR EAEVMDPQHR LFLECAWQAL EQAGYDPDGF
RGSIGVFAGS ATSSYRVHNI HTNPEIAESV GGLQLAVGND SDSLASTVSY KLNLRGPSVA
VQTFCSTSLV AVHMACQSLL TYECNIALAG GAAITVPQGV GYLYQEGGIL SPDGHCRTFD
AKAQGSVMGS GVGVVTLKRF EDALNDGDTI YAVIRGSTVN NDGIRKVGYT APGLNGQSAV
ITIAQNRAEV DPDTVSYIEA HGTATPLGDS IELAALIKAF ERGTERKQFC ALGSVKPNIG
HLDRASGVTG LIKTTMALHH RQLPPNLDFE TPSPDIDLAN SPFYVNTQLR DWPADGAAPR
RAGVNSFGLG GTNVHVVLEE APAPAPVAPA RPAQLLVLSA KTATALEAMT DNLAAYLAGA
PADLADVAFT LQAGRTRFNH RRAFVCESAA DAAQVLQTRD LRRITTVEQS GRNRPVAFVF
PGVGDHYAGM AKTLYATEAV FREAVDQCAE LLAPRLGQDL RAALYPADQP AAAAAHTLFA
ATAASSRVAG ALHQTALAQP AVFVVEYALV QLLASWGIRP QALLGYSLGE YVAATVAGVL
SLEDALTLVA KRAQWIQAQP HGAMLAVSLG VEAIQPYLNT EVALAVVNSP MTCVLAGPHA
ALELVKIHLE EDEVASRWLE TSHAFHSPML APVAAELTAL VRTLRLQTPK IPYISNVTGT
WITDAEATDP GYWARHMVET VQFADGVGTL LADAQLTLLE VGPGQALGSF IRQHPACGRD
RFGQIVATLP GAAEATNDLV ALLNGLGRLW LAGVPIDWAG LHGKRPRRRV PLPTYPFERK
RFWIDAAIMP AALAAGERLR GRHADVTDWF YRPDWAPAAL GVPAAPGHWL ILPDAHGLGK
AVASSLRAAG HTVTLATAGP ADTTLMVPQI PIDPTDAAAY ELLLDTLRAD GGLPSHILWL
GGLTPLDSAL TGPARFQAAQ ATGYYDLLQL AQALVTRVID EAVQLLVVTA GMQAVGANAI
PVAEHATLLG LATVIGQENV TIRVRSVDLA MADDAAAELL AAECLASGDA LRVAYRDGQR
LEERYQPIRL ETPGSSVLQS GGVYVITGGL GGVGLVLAEH LARTAQAKLV LVGRQGLPER
AAWDAWLREH GADDATSQRI QRVRMIEAAG GTVEVMAADV ADPEQLRGVF AETEARFGTL
YGVLHAAGIS DSQAYLPLET IGPKECEWHF QPKAYGLYAL EAALDDRSLD FCVVFSSVSS
VLGGLGFAGY AAANSFMNAF TQRHNRTHAV PWVSVNWDTW HLRAGQHDVI GATVAQYEMS
PAEGADAFER AAATRNEPVI INSTGDLDAR IRQWVRLESV REQPEREREA AGSTQQAVSV
SVPLRSTSEY EQRITAVWQH VLGIETIGIH DNFFDLGGNS LIALQLIARL KKEFKTQVPA
VAIFEAPTIS ALVQYMLPDA PVVAPADALL VERRQRVRQT AEQDGIAIIG MVGRFPGAST
VDALWQNVAD GVEAFTRFTD EELRAAGVPA DLINDTNYVK VRPVLHNDIS LFDAAFFGYT
PREAEFLDPQ QRLFQECAWE ALEQAGYDTQ RYPGLVGVFG GTNVNAYLYR LVEDPELRDL
MSESITLQND KDALATYVSY KLNLRGPSFS IQTYCSTSLV ATHLACRSLR AGDCDIALAG
GVSIRVPVNT GYLFQEGDQG APDGRCRTFD ALAEGTNFGD GVAIVVLKRL ADALADGDTI
HAVIRGSAIN NDGGLKVGYT APSVVGQAAV VQAALADANL AADAISYVEA HGTATKLGDP
IEVAALTKAY RTMTDKVGFC AISSVKPNIG HLDRASGATG LIKTVMALKH NVIPPTLHFQ
APNPEIDFAS SPFFVPTALT PWTRNGTPRR AGVNSLGVGG TNAHVIVEEA PQVGPSGPGR
AAELLVLSAK TATALEAATT NLAAHLEEQP MVNLADVAHT LQVGRRVFEH RRVVVARNVA
DAVGLLRSGD ARRVLTLAQK PTSRGVAFVF PGVGDHYVGM AEGLYATEGV FRATVDRCCA
LLTPLLGSPI RKEIYPDGGA PVSAGIDLRA MLRENATPRS AGRLHQTAWA QPAVFVVEYA
LVQLLASWGI RPQALLGYSV GEYVAAAVAG VLSLEDALTV VAKRAQWIQA QPAGSMLAVS
LGADAIGAYV GGAVALAVVN SPMTCVLAGP QAALEAVKTR LDGDEVASRW LETSHAFHSP
MLAPVQAELT ALAGTLRLQA PRIPYISNVT GTWITDAEAT DPGYWARHMV ETVQFADGVG
TLLADAQLVV LEVGPGQALG SFIRQHPACG RDRFGQIVAT VRGMTDTSDD LEVLLSALGR
LWLHDVVVDW ASFRGSEVRQ RIPLPTYPFE RQRFWVEPRS YVRTPVQEAA VIGRKPNIAD
WFYTPVWEAQ PLPAKGSAQP AGPYLVFVDE QGFGAQVVGR LESNGVTVIK VRQGGAFAQL
DATSFAVRPD IRDDYAVLFS ALQTNRLLPQ AIVHLWNVAS NARVSADEAG FGACQVYGFY
SLLHLAQAVG GIDLESTLPI TVFSNSTQPV TGNERLYAEQ SPSAVTCRVI GQENPAVFCR
NVDICVPEQA GAEADQLAML LEQELLVPSQ DIAVAYREGR RFVQHYRADR LEPISRAVPA
PLRMGGVYLI TGGLGGIGTA IAGYLAEKAQ AKLVLLGRTP LPPREEWDGL AVARGPEDGL
VQKIAKIRAM EAHGAEVLTL SADVGDPAQL SAALAEIRRR FGALHGVVHG AGHLDQSGFQ
LIQDVGHEPC EAHFKPKVYA LYHLEAMLRD QELDFCLLLS SVSSVLGGLG YVGYTAANYF
MDIFTHRLRQ SPSNRWISVN WDTWHLKAGQ HDGATVAQYE LFPFEGVDAF GRILERAPVQ
IINSTGDLDT RIKQWVLLES IRSNAASSSP AASHERPAID TQYVPVNSEY ERRIAAVWQQ
VLGIGQIGID DNFFDLGGNS LTALQLISRL KKEFKTQISA VAIFEAPTIR AMAQYLMPDA
PPAVDLAETL LVQRRQRVRQ TAEQDGIAII GMAGRFPGAS NVDEFWDNLA NGVEAFTAFT
DAELLAAGVL YEQVHDLNYV KRRPILKEDV TLFDAAFFGY TPREAEFLDP QQRLFHECAW
EALEQAGYDT QRYPGLVGIY GGANLNTYLM QLAFDPDVAR NFTDSVFLEN DKDALTTNVS
YKLNLRGPSF AVQTYCSTSL VATHLACRSL RAGDCDIALA GGVSIRVPVN TGHLFQEGDQ
VSPDGSCRTF DAQAAGTTWA DGVAVLVLKR LADALADGDT IHAVIRGSAI NNDGGLKVGY
TAPSVVGQAA VVQAALADAN LAADAISYVE AHGTATKLGD PIEVASLTKA YRTMTDKVGF
CAISSVKPNV GHLDRAAGAT GLIKTVMALK HNVIPPTLHF QTPNPEIDFA SSPFFVPTAL
TPWTRNGTPR RAGVNSLGVG GTNAHVIVEE APQVGPSGPG RAVELLVLSA RTPSALETMT
VNLTAYLEGQ PTVNLADVAH TLQVGRRVFE HRRVVVARDA TSAAALLRSG DARRVLTLAQ
KPTSRGVAFV FPGVGDHYVG MAEGLYATEG VFRATVDRCC ALLTPLLGSP IRKEIYPDGG
APVSAGIDLR AMLREDATPR SAGRLHQTAW AQPAVFVVEY ALVQLLASWG IRPQALLGYS
VGEYVAATVA GVLSLEDALT LVAKRAQWIQ AQPAGSMLAV SLSAEAIGAY VGGAVALAVV
NSPMTCVLAG PQAALEAVKT RLDGDEVASR WLETSHAFHS PMLAPVQAEL TALAGTLRLQ
APRIPYISNV TGTWITDAEA TDPGYWARHM VETVQFADGV GTLLADAQLV VLEVGPGQAL
GSFIRQHPAC GRDRFGQIVA TVRGMTDTSD DLEVLLSALG RLWLHDVVVD WAGFRGSEVR
QRIPLPTYPF ERQRFWIEPN LNSRLAAAHR PIRRPDIGDW LAAPSWKRSI QAGSAGVAAR
LAEPHCWLML ADGEGLAAEL TAWLEQRGQT VITVMPGASF AALGQARYML RPTSREDFTA
LLQTLERQGH APSRVVHCWL FGAQEDADSL SDATLTATLD VGFYSLLALA QALGDQGVEW
CEINAVTAAM QEVTGQENLQ VAASTVIGPC KIIPQEYPNL TARSIDILLP AGAAERTALV
AQLGTELATP PTGDQVAFRG AHRWVQVFEP ITVPAAPASH PRLRTGGVYL LTGGLGGIAL
GLARDLAATL QAKLVLVNRS GLPDRATWPA LLERDGAEQG VGRRIQQVLD LEVLGAEVLV
IQADVTDTVA MARAVAEAQA RFGTIHGVLH TAGVPGVGLM QLKDAATAAA ELAPKVQGTL
ALTSALAGIP LDFLVLFSSV TSATGGGPGQ VAYCAANAFL DAYARKHATD HGQTVAVSWG
EWLWDAWSEG LQGFAPEVQA RFRAYRTTFG ITFDEGADIL RRILAEPLAH VFVTSEDLLP
MAERSRRESA ARGLEELQRQ QEARPTYPRP EVGTSFVEPQ SVIEQQIAGI WSTVLGIAPI
GLHDNFFDLG GNSLLGLDLF SRIRKALKVD KLPAYVLYEA PTVATQAAYL TPAPEAALVT
EAVPDLDVKI RQKVNRFKQQ SSLEDA