Gene BTH_II2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II2093 
Symbol 
ID3845771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2558633 
End bp2570779 
Gene Length12147 bp 
Protein Length4048 aa 
Translation table11 
GC content72% 
IMG OID637839394 
Productpolyketide synthase, putative 
Protein accessionYP_440281 
Protein GI83717907 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCAT CCCCTCCTTC CCGCGAACTC GCCACGGCCG TCGAGGCGGC CGTCCTGTCG 
CTTGCCGGCG ACGTCGCCGG CTGCACGTTC GACGCGTCGG CCGCGGAGCG CCCGCTGCAC
GCGCTCGGCT TCGATTCGGT TCAGTACGTC GAATTGTCCG GATGCCTGAA CGAATACTAC
GGGCTCGATC TCGCGCCGAC GCTGTTCTTC GACGTGCATG CGCCGCGCCG AATCGCCGCG
CACCTGCTCG CGCGGCATCC GCTGGAGGTC GCGCGCAAGC ATGGCGTCGC GTCCGGCGAT
GAATCGGATG CGCTCGCGCG CGCCGGCGCG GCTTCGGACG GCGGTCGCGA GCGCACCGGC
GCGCGCGACG AAACCGCCGC CCACGAACGC GAATCGGCCG GCGACATCGC GATCGTCGGC
ATGGCCGGCA TCTTCCCGCA ATCGGCCGAT CTCGACGCAT TCTGGCGGCA TCTCGCCGCG
GGCGACGATC TGATCGCCGA AGCGCCGGCC TCGCGCTGGG ACTGGCGCGC GGCCGACGGC
GAGTCCGCCT CGCGCTGGGG CGGCTTCATT CCGCGAATCG AGTATTTCGA CGCCGCGTTC
TTCGGCATCT CGCCGCGCGA GGCCGAGCAG ATGGACCCGC AGCAGCGCCT GCTGATGCAG
ACCGCGTGGG CCGCGCTCGA AGACGCGGCG GTCCGCCCGT CCGACCTGAT GGGCAGCGAC
ACTGCGGTGT TCGTCGGCGT CAGCACGTCC GACTATCTCG ACCTGCTGCC GGGCGCGGAC
GGCCATCTCG CGGTCGGCAA CGCGCACGCG ATGCTGCCGA ACCGGCTGTC GCACCTGCTC
GGCGCGCACG GGCCGAGCGA GGCCGTCGAT ACCGCGTGCT CGAGTTCGCT CGTCGCGCTG
CACCGCGCGG TGTGCGCGCT GCGGCGCGGC GAAAGCGGCG TCGCGATCGT CGGCGGCGTC
AACGTGATGC TGACGACGCG GCTGCACCGC GCGCTCGCCG CGGCCGGCAT GCTGAGCCCC
GACGGGCGTT GCAAGACGTT CGACGCGGCG GCGAACGGCT ACGTGCGCGG CGAGGGCATC
GCGGCGCTCG TGCTGATGCC GCTCGAGCGC GCGCGCGCGG GCGGGCATCC GGTGCACGCG
GTGATCAAGG GCAGCGCGGT CAATCACGGC GGCCGCGCGG CGTTCCTGAC CGCGCCGGAC
ATCAATGCGC AAGCCGCGCT GATCGAGGCC GCGTATCGCG ACGCGGGCGT CGACCCCGCC
ACCGTTTCGT ACATCGAAGC GCACGGCACC GGCACGTCGC TGGGCGATCC GATCGAAGTG
CAGGCGCTGC GCCAGGGCTT CGACGCATGC GCGCGCGCGC GCGGGCATGC CGATGCGCCC
GCGCCGGCGC GCTGCGGCCT CGGCTCGGTC AAGACCAATA TCGGACATCT CGAAGCGGCG
GCGGGCCTCG CGGGCGTCGT CAAGGTCGTG CTCGCGATGA ACCGGCGAAT GCTGCCGCCG
AGCCTGCATT GCCGCGAGCT GAATCCGTAT CTGAAGCTCG ACGGCAGCCG CTACCACGTC
GTCACGGAAC CTGTGCCGTG GCCGGGCGAT GCAACGCCGA CGCCGCTTCG CGCGGGCGTC
AGCTCGTTCG GCTTCGGCGG ATCGAACGCG CATGTCGTGC TGCAGTCCGC GGACGCTCGG
CCGAATGATC GGCCGAGCGC GCCCCGGCCG CCGATCGCGC ACGAGCAGGC CGAAGCCGGC
GCGGCCGATG CCGCCGGCCC GCTCGCGTGG TTCATCCCGT TGTCGGCGCG CACCGACACC
GCGTTGCGGG CGCGCGCCGC GCAGCTCGCG TGCTGGCTCG ACGCCGAGCG GGCCGACGAC
GCGTGGCTGC CCGCGCTCGC CAAAACGCTG TCGATCGGCC GCGAACCGAT GGCGTGCCGC
TTCGGCGTGA CCTGCGCGTC GCTCGACTCG CTGCGCGCGC AACTCGCGGT CGCGCTGAAC
GGCCCCGCCG CATCGCTCGC GCGCGACGAC GCTCGCCTGC AGCCGCATGC GAACGCGCAC
GCCGCGTGGC TCGCGGGCGG GGCCGATCCG CTGCCCGTCG CCTGGGACGA AGCGACGCCG
CGCCTGCGAT TGCCCGTCTA TCCGTTCGAA GGCGAGCGGC ACTGGCCAAC CGATGCAGTG
CCGCCGGCGC GCTTCACGCT CGCGCCCGAA GGCGACGGCG CGTACCGGAT GCACGTCGCG
CCCGACGCGC CGCTCGTCGC CGACCATCGG CTCGGCGGCG AGCCCGTGCT CGCGGCCGCC
GCGCAGATCG TGATCGCGTG GCGCGCGTTC GAAGCGGACG CGAACGCAGC GGACTCGAGC
CGCGCAAGCG AGGCCGGTGA GCCAAGTCAG CCGGATGAAC GGAATGAACC GAACGGTTCG
AGCCGCGCGA TCGGTTCGAA GGGGTCGAAT CCTGCCGGCG CTGCGATCGA TTCGACCGAC
GCCGGCGGCT CGCGCGTTTC ATCTAATACC GCCGATGCGA ACGCCGCGAC GCAAATCACG
CTGCGCGACA TCGAATGGCT TGCGCCGATC GCGATCGGCG CGCCGGCCGA TCTGCACGTC
ACGCTCGCGC GTGAAGAACG CAGCAACGCC AACGAAGACC GCCGCGGCAA CGCGCATCGC
CGCGAAATCG GCAACGCCGC GCGATTCGCG ATCGCCGTCG CCCCGGCCAT CGATGCGCCG
CTCGGCCGCG GATACGCGGC GCGAATCGCG CGCGCGCCGA CGAACGCGCC CGCGCTCGAC
GTCGATGCGA TCCGCGCGCG CTGCACGCAG CCGATCGCGG CCGATGCGTG CTATGACGCG
TTCGCCGCGA TCGGCATCGG CTACGGCCCG ACCTTTCGCC CGCTGCGCGC GATCGCGGTA
GGCCGCGACG AAGCGTTCGC CGAATTCGAT CCATCGGCGC TCGCGCGCAC GACGGGCGAC
GCACGCATCG TCGCGCTGCT CGACGGCGCA TTCCAGGCGA TCGCGGGCCT TCAGCTCGCG
GACGCAGGGC GTCTGGAAGG CGGCCTGCTG CCCGCGTCGC TCGCCCGCAT CGAATTCACG
GGACCGCTCG CGGACAGCAC CCATGCATGG ATTCGCGAGG CGCCGGGCGA GACCGGCCGC
CGCACGTTCG ACATCGATCT CGTGACCGCG CGCGGCGTGC CGTGCGCGTC GCTGCGCGGC
CTCGCGCTCG CGTCCGGACG CGGCGGCGCG TCGCGCGAAG CGCCGCGCGT CGCGACGCCG
GGCGACCATC TGCTCGCGCC GCAATGGCTG CCGTGCACGG CGAACGCACC GAGCGCGGCA
ACGCCGCCGC AGCGCGCCGG CGCGCCGGCC ATACTGGGCG GCACGCCGGC CCAGCGCGCC
GCGCTCGCGG CGACGCTCGC GACGCCGCCG CGCCTGATCG ACGACATCGC TGAACTCAAT
GCGCATGTCG ACCACCTGGT GTGGCTGCCG CCCGCGCCGG CGCATGCGCA TGCGCCGCTC
GCGCGCTGCG CCGGCCTCGA CGGCTTCCGT CTTGTCAAGC GGCTGCTCGC GCTCGGCGCG
GGCGAACGCG CCTTCGAGCT GACGGTGCTG ACCGTCCGCT CGTGGACGAT GCCGGGCGAC
GCGCCCGCCT TCCCCGCGCA TGCGGATCTC GCGGGGCTGT GCGGCTCCCT CGCGAACGAA
TACCCGCACT GGCGCGTGCG ACTCATCGAT CTGTCCGACG CCGATGCGCT GCCCGCCGAC
TGGCGCACGC AAGACACCGA AGGCGGCCAT CCGCTGCTGC ACCGGCACGG CCAATGGTTC
GCGCGCCGGC TCGTGCCGCT CGCCGCGTTG CCGTCGCCCG CGACGCCGCC GTACCGGCCG
GGCGGCGTCT ATGTCGCGAT CGGCGGCGCG GGCGGTCTCG GCCGCGTGTG GACCGGGCAC
GCGATCCGCG CATGCGGCGC GCAAGTCGTG TGGATCGGAC GCCGCCCGCT CGACGCGCAG
ATCGATGCGC ACTGCGACGC GCTGGCCGAG TTCGGCCCGC GCCCGTCGTA CCTGAGCGCC
GACGCGAGCG ACGTCGACAG CCTGCGCGAC GCGCGCGACG CGGTGCTCGC GCGTTTCGGC
CGGATCGACG GCGTCGTGCA CACGGCGATC GTGCTGCAGG ACGGCGGCCT CGCGCAGCTC
GACGAAGCGC AGTTCAGCGC GGCGCTGAAC GCGCAGGTCG CGACGACCGC GAACCTCGCG
CGCGTGTTCG GCGGCGATTC TCTCGACTTC ATCCTGTTCT TCTCGTCACT GCAAAGCGCG
TTCGTCGCGG CGGGCCAGAG CAATTACGCG GCGGGCTGCA CGTTCCGCGA CGCGTTCGCC
GACTGGTTGC GCACGCAGCT CCGATGCGTG GTCAAGGTCG TGAGCTGGGG CTACTGGGGG
CAAACGGGGG TCGTCGCATC CGAGCCGTAT CGCAAGCGGA TGGCCGCGCT CGGCATCGGG
TCGATCGAGC CGGCGGCGGC GATGGCTGTC GTCGATGCGC TGCTCGCCGC GCGCGTCGAT
CAGGTCGGCT ATCTGAAGAC GACCGCGCGC GCCGCGGTGC CCACGCTCGC GCCCGCGCTC
GCCGCGCGCA TCGCGCCGCA TACGAGCGCG CTCGCCGGCA AGCCGCCGCC GCGCATCGAC
GAAACGGGCG CGAGCGCGGC ATGGAACGAC GCGCTCGCGG CGCTCGATCG CGCGATCGCA
CGCCGGCTGT TCGCGGAGCT CGGCGCGCTG CGCGCATTCG GCGAACGCGA CGTAGCGGAC
GACGGCGCGC TCGGCAGCGT CGCGACCGGA AGCGATGCGC GCGGCAAGCG CTCGTCGGGC
GAGCGCACGT TCGAGCCCGC ATCGTTCGAC ATCGACGCCG CGCTGCGCTC GGGCCGCATC
GCCCCCGCGT ATCGGCGCTG GCTCGCGCAT GCGCTGGCGC TGATCGCGCA ACACGGCCAC
CTCGACTGGG ACGGCCGCGC GGGCCGCCTC GCCGAAGCGC CGCCGCCGCT GGACGCGGCG
CGCGCCGAAT GGGCGCATGC GCGCGCGCAG CTCGATCGAA CGGCGCTGCT CGACGCGCAC
CTCGCGCTCG CCGACGCGAC GCTCGACGCG CTGCCCGCGA TCCTGCAAGG CAGCGTGCCC
GCGACGTCGA TCCTGTTCCC GGACGGCGAC CTGAGCCGCG TCGAAGCCGT CTACCGGCGC
AACGAGCAGG CGGACCGCTG CAACCGCGCG CTCGCCGATG CGGTGCTGCA TCTCGTCGGC
GGCGCGTCGT CCGCGCAACC GGCGGCGCTC GCGGAAATCG GCGCGGGCAC GGGCGGCACG
ACGGTGCCGC TCCTCGCGGC GCTCGACGCG AGCGGCGCGC GGCTTGCCCA TTACGACTTC
ACCGACATCT CGAAGGCGTT CCTGCTGAAC GCCGAGCAAA CGTTCGGCCG CGGCCGCGAC
ACACTGCGCT ACCGGCTGTT CGACGTCGAG CGGCCGGTTG CCGGGCAGGC GCTCGATGCC
GGCGGCTACG ACATCGTGAT CGCGACGAAC GTGCTGCACG CGACGCAGGA CATCAGCGTG
ACGCTGCGCA ACGCGAAGGC GCTGCTGAAG ACGGGCGGCC ATCTGATCGT CAACGAGCTG
CTCGGCACGC ACGGCTTCGC GCACGCGACG TTCGGCCTGC TGCCCGGTTG GTGGCGGCAT
CGCGACAGCG CGCGCCGGCT GCCCGGCAGC CCGCTGCTGT CGCGCGACGG CTGGATGCGC
GCGCTGCGCG AAGCCGGCTT CGCGGTGCCC GACGGCGATT CGGCCGGCGC GGCGGCGGCC
GCGGGTCAGG GCGTGATCGT CGCGGTCAGC GACGGCGTGA TCGTTCAGCC GGCGATCGCC
GATGCCGGCC ACGCGGCGCA CGCGAACGCC GATGCGCAGG CAAGCGCCGC CCGGCCGGCC
TCGTTCGCCG CATCGGCCGC ACCCGCGCGC GCCGCTTCGT CGATCGCCGC CGCATCATCC
GGCGCCGAGC TGCGCGAGCG CTGCGTGCAA TGGCTCGCGC AGCTCGTCGC GCGGACGCTG
AAGATGCCCG CCGGCAGGCT CGCGCCCGAT CAACCGCTCG GCAGCTACGG CGTCGATTCG
ATTCTCGTGA TCGGCCTCAC GAAGACGCTG CGCGAAACGT TCGGCGTCGC GCTGTCGAAC
GCGACGCTGT TCGAGCACGC GACGCTGAGC GCGCTCGCCG ATTTCTTCGT CGCCGAGCAT
CGCGCCGCGT GCGAGCGCGT GCTCGGCGGC GACGCGGTCG CCGCCTCCGC CGCCTCCTCC
GCGTCGGCCG CATCTGCATC GGCGATCCCG AATCAGGCTG CCTCCAACCC GCTCACATCG
CACGCGCCGA TGCCGATGGC GCTGGCGACG CGAGCGACGC CTCCGGCATC ACCCGCATCG
CCCGCAACCG CAACCGCCGC CGACACCGCC ATCGCCGTCA TCGGCATGTC CGGCCGCTAC
GCGCAGGCGG ACAACCTGCG CGAGTTCTGG GCGAATCTCC GCGCGGGCCG CCACTGCATC
ACCGAAGTGC CCGCCGAGCG ATGGGACTGG CGCACGCACT TCGATGCGGA AAAAGGCGCG
CCGGGCCGAA CGTACAGCCG CTGGGGCGGC TTCCTGAAGC AGATCGACCG CTTCGACGCC
GCGTTCTTCC GGATCGCGCC GAGCGACGCG GAGCACATCG ATCCGCAAGG CCGCCTGTTC
CTCGAGGAAG CGTGGTCCGC GATCGAAGAC GCCGGCTACA CGCCGGCGAC GCTCAGCGCG
AACCGCCAGG TCGGCGTGTT CGTCGGCGTG ATGAACGGCG ACTACCCGAC GGGCGCGCAG
TTCTGGAGCA TCGCGAACCG CGTGTCGCAC GCGCTCGATC TGCACGGGCC GAGCCTCGCC
GTCGACACCG CGTGCTCGTC GTCGCTGACC GCGATCCATC TCGCGCTCGA CAGCCTGCGC
AGCGGCACCT GCGACTGCGC GCTCGCGGGC GGCGTCAATC TCATTCAGAG TCCGAAGCAT
CTGGTCGGGC TGTCGTCGCT GACGATGCTC TCGGCGGGCG ACGCGTGCCG CGCGTTCGGC
GCGGGTGCGG ACGGCTTCGT CGACGGCGAG GGCGTCGGCG TGCTCGTGCT CAAGCCGCTG
TCGCGCGCGC TCGCCGACGG CGACGCGATC CACGGCATCA TCCGCGGCAG CATGATCAAC
GCGGGCGGCA AGACGCACGG CCTCACGGTG CCGAACCCGC GCGCGCAGCA GGCGGTCGTC
GCCGCGGCGC TCGCGCGAAG CGGCGTGCCC GCGCGCGCGG TCGGCTACGT CGAGGCGCAC
GGCACCGGCA CCGCGCTCGG CGATCCGATC GAGCTCACGG GCCTCACGCG CGCGTTCGCC
GAAGCAACCG GCGATCGCGG CTTCTGCGCG CTCGGCTCGG TCAAGTCGAA CATCGGCCAT
TGCGAGAGCG CGGCGGGCGT CGCGGGCGTA ACGAAGGTGC TGCTGCAGAT GAAGCATCGC
GAGCTCGTGC CGACGCTGCA TGCGGACGAA CCGAATCCCG ACCTCGATTT CGCGTGCTCG
CCGTTCGTGC TGCAACGCGC GCTCGCGCCG TGGCCGAAAC CGGATCTCGA CGGATGGCCG
CGCATCGCGG GCGTGTCGTC GTTCGGCGCG GGCGGCGCGA ACGCGCATGT CGTGCTCGAA
GAGTTCGTCG ACACGCGCGT CGCCGCCCCC GACGATCGCG CCGGCCCCGC GATCGTCGTG
CTGTCCGCCG CGACCGACGA CGCACTGCGC CGCCGCGCGC GGCAATTGCA CGCCGCGCTC
GCCGACGGCG AAATCGACGA CGAACGCCTG CACGATCTCG CGTACACGCT GCAGATCGGC
CGCGATGCGA TGGCTTCGCG CTTCGGCTGC GTCGTGGGGA CCGTCGCCGA ACTGCAAGCG
GCGCTCGCCG CGTTCGTCGA AGGCGACGCA TCGCGCGGCT GGCACGCGCA CCGGCTCGCC
GCCGATCGCC ACGGCCTCGC CGAGCTCGAC GCCGATCCCG AGCTGCGCGC GTCGCTCGTC
GAGCAATGCA TCGCGGCCGG CAAGCTCGAC AGGCTCGCGG CGCTCTGGTG CCAGGGGCTC
GGCGTCGATT GGCCCGCGCT GCATCGCGGC CACGCGCGCC GGCGCGTGCA TCTGCCGACC
TATCCGTTCG ACGGCCCGCG CCATTGGCTG CGCGACGACG CGACGCCCGC CACCGAGCCC
GCGCGCGCGC CGGCCGATAT CGCGGACAGC CACGCCGCGC CGCCGATGCG CGGCGCAAGC
GCCGGCGCGC CGAACGTATC GACGCCCGAC GTCGCGGCGC TCGTTCGCCG AACGGTCGCG
CAGGTGCTCG GCTATCCGGA TGTCGACATG AACGAATCGT TCCTGTCGCT CGGCGGCGAT
TCGATCCGCG CGGCGCGCGC GCATCGGATG CTGCAACGGT CGCTCGACGT GAAGATTCCG
CTCAGCCTGA TGCTGGAGGC GAAGACGCTC GCCGAATGCG CGCGCGCGAT CGATGCGCTG
CCGCCGGCGG AACCGCCGAG CGCGGCCGGC ACGCCGGCGG CGGGCGCGCC CCCCGCCGAG
CCGCGAGCGC CCCGCGCATC GGCCTTCGCG CCGCGCGACG CCCGCCCGCG CGTGCACACG
CTGTCGTCGA ACCAGCGGCA GTTCTTTTTC CTCGACCGCC TGAATCCCGC GAATCCGGCG
TTCAACCTGC CGGGCGCGCT GCGCGTGCGC GGCGAATGGC ATGCCGATGC GCTCGCGGCC
GCGTATCAGG CGCTCGTCGA TACGCACGAC GTGCTGCGCA CCCGCTTCGT CGTACGCGGC
GGCGAACCGT GCGCGGAAGT CGCGCCGCGC CGCGCGGCCG CGATCCGCCA TCACGATCTG
TCGGCGCTGC TGCCGAAGCA CCAGGCCGCG CGCATCGCCG AATGCCTGAC CGGATCGAGC
CGCGAAGGCT TCGCGCTGGA ACAGGGCGAG CCGAGCCGGC TGACCGTGCT CGAACTGCGC
GACGACGACC ACGTGATTCT GCTGAACCTG CATCACATCG TCGGCGACGC CGTGTCCGTC
GTCGTGCTGC TCGACGTGCT CGCGCGCGCC GCGCTGACAG GCCGCGCGAG CGCGCCGAAC
CGTGCGCAGC CGCAATACGC GCAATGGGCG GCGGCAGAGC GCGATGCATT GCCGGCGACG
GTCGAGCGCG AACTGCCGTA CTGGCTCGAG CGCCTGCGCG ACGTGCCCCC GCCGCTGCCG
CTGCCGTGCG ACCGCGCGCG GCCGCCCGTG CCGAGCTATC GCGGGCGCAG CGTGCCGCTC
GCGTTCCCGT CCGCGCTCAC CGCGCAGCTC GACGCGTACT GCAAGGCGCA CGGGCTGTCG
CGCTTCGTCG TGATGCTCGC CGCGTTCAAG GTCGCGCTGC GCGTGCTGTC GGGCCGCGAC
GACATTGTCG TCGGTAGCCC GTACGCGAAC CGCGCCGACG ACGACACGGC CGACATGATC
GGCAGCCTCG CGTACGCGCT CGTGCTGCGC ACGCGGCTTG GCGAAGCCGA GACCTTCGCC
GACGCGGCCA CGCTCGTGAG GCGCACCGTG CATGGCGCGT TCGATCATCT CGGCGTGCCG
TATCCCCGGC TCGTCGAGGC GCTGAATCCC GCACGCCACG GCGGCGCGAA TCCGCTCTAC
CAGATCATGT TCAACGTGAT CCCGATGCCG GCGCTGCCCG ACGGCGTCGA GCCCGTCGAA
GTCGATTCCG GCTGGCTCGA TTACGATCTG TTCGTGCGGC TGCGCGCCTC GGGCAGCGCG
ATCGAGGGCG TGCTGCAATT CAGCGCCGAC CTCTTCGATC GCTCGACGGC CGAAGCGATC
GCCACGTACT ACGTCGAGCT GCTGCACACG CTGCTCGCGC ATCCGTCGCT GCCGCTCGCG
GGCCTCGCGC CGCCGCCCGA GCTCGCGCTC GAACGGACGA TCGCCGACGC AATGCCGCCG
CTGCGCATCG AAATCGCATC GACGTTCACC GATCGCCCGC TCGCCGGCAC GCTGCGCTAC
TGGGGCGTCG CGACCGGTCA GCCGATCGAG CCCAATTTCG CGCCATACGG GCAGCTGTTC
CAGACGCTTT ACGATCCGTC GACGCCATTC CATGCGAATC GGCACGGCAC GAACGTCGTG
CTCGTCAGGC CGCGCGACTG GCTGCGCTTC GGCCAAGCCG ATGCGAACGC CGAGACCACC
GACACCGCGG CAGACGCGGC GGCCGCGCAA ATCGCGCTTC ACGCCGAAGA ACTCGCCGAC
GCGCTCGCCG GCGCCGCACC GTCGCTCGCC GTGCCGGTAC TCGTGCTGGT GCTGCCGGAT
GACGCGTCGT CGCTGGCGGC GCACGGCGAA CACGGCGGCG AACGGGATGG CGAACCCGCG
ATCGACTCGT CGCTCGCCCC GTACCGCACG CTCGCCGCCG CGCTCGCGGA TCTGCCGTCG
GTGACGGTCG CGCACTGGCG CGACGTCGCC GCGATCTATC CGGTCGCCGA CGTGTTCGAT
CCGCACGCGG ATGCGGCGGG CCACGTGCCG TTCACGAGCG AGTACTACGC GGCGCTCGCG
AGCTACATCG CGCGCACCGC GTTCCAGCAC GCGTCGGTGC CGCTAGACGA CGCATGGAAC
CGGCTCGCCG CGCAGATCCG GGACGACGCC GAGCACCTGC TCGCCGCACC CGCCGACGGC
GCGCGCGCGC GCCGCGCGCC ATACGCGCCG CCGGCCAACG AAGCGCAGGC GACGCTCGCG
CCGATCTTCG CGGCCGCGCT GAAGCTCGCC GATCTCGGCA TCGACGATAA CTTCTTCGAC
TGCGGCGGCC ACTCGATTCT CGCGATCGGC GTCGTCTATC AGATCAACGA AGCATTCGGC
ACGTCGCTTT CGGTCGCGGA CATCTTCATG GCGCCGACCG TGCGCCGGCT CGCCGAGCGG
ATGCGCGACG CGCCGGACGG CCCCGAGTAC GTCGACCTCG CGAGCGCGGC CGTGCTGCCC
GACGATATCG CGCCGCTGCC CGGGCCCGTC GCCGGCACGC CGCGCGCGCT GCTGCTGACG
GGCGCGACGG GCTTCGTCGG CCGCCACCTG CTGCGCGAGC TGATCGACCG CACCGATGCG
ACGATCCACT GCCTCGTGCG CGCGCCGGAC GCCGCGCAGG GCCTCGCGCG GATTCGCGCG
ACGCTCGAGC GCTGGTCGCT GTGGCGCGAC GGCGACGACG CGCGCGTGAT CGCGGTGCCG
GGCGATCTCG GCCGCCCGCG GATCGGCCTC TCCGAGCCCG ATCGCGCGCG GCTCGTCGCC
GAGGTCGACG CGATCTATCA CAACGGCACC AGCATGAACC ATCTCGAATC GTTCGAGATG
GCGCGCGCCG CGAACGTCGG CGGCGTGATC GAGCTGCTGC GGATCGCAAC CGAAGGCCGG
CCGAAGACAT TCAACTACGT GTCGACGCTC GCGGTGTTCA GCATGCGCGA ACGCACAGGC
ACGCACGTAT TCGACGAGTC CGCGCCGATC GACGATGAAC GGCACCCGTC CGACCAGGGC
TACACGACGA GCAAATGGGC CGGCGAACAG TTGACGCATC TGGCCGCCGC GCGCGGCGTG
CCGTGCAACG TGTTCCGTCT CGGCCTCGTG ACGGGCGACG TGCGGCACGG CCGTTACGAC
GAACTTCAGG CGTACTACCG GCTGCTGAAG AGCTGCATCC TGATGGGCGC GGCATTCGAC
GATTTTCGCT ACGACCTCGT GATCACGCCC GTCGACTACG TCGCGCGCGC GCTCGCGCAT
CTCGGCGCGA AGCATCCGCA AGGCGGCCGC GTGTTTCATC TGTCGACGAT GCAGGTCACG
CCGATGCGCA CGGTGTTCGA GATGATGAAC GCGCATCTGC CCACGCCGAT GCGCATGCTC
ACTCACCGCG CATGGATCGA CGAGCTGCGC GTGCGCTACC GGCGCGGCGA CGTGCAATCG
ATCGTGCCCG TCGTGCAATG GATGATGAAC ATGAGCGACG CGGAGCTCGT GAAGCTCGCG
CGCGAACGCG AGGAAACGAC GTTCATCTAC GACTGCACGG CGACGCACCG CGAGCTCGAG
GAAGCCGGCA TCGTCGTGCC CGTGTTCGAC GACGCGCTGC TGCAGCGCTA TTTGCGCGGC
ATGTTCGACG AAGACGCGGA CCTGCGCGCG CTCGCCGCCC AGCCGGACGG CGGCGAGCGC
GCTTCTCCCC TTCACTCCCA CATGTGA
 
Protein sequence
MTASPPSREL ATAVEAAVLS LAGDVAGCTF DASAAERPLH ALGFDSVQYV ELSGCLNEYY 
GLDLAPTLFF DVHAPRRIAA HLLARHPLEV ARKHGVASGD ESDALARAGA ASDGGRERTG
ARDETAAHER ESAGDIAIVG MAGIFPQSAD LDAFWRHLAA GDDLIAEAPA SRWDWRAADG
ESASRWGGFI PRIEYFDAAF FGISPREAEQ MDPQQRLLMQ TAWAALEDAA VRPSDLMGSD
TAVFVGVSTS DYLDLLPGAD GHLAVGNAHA MLPNRLSHLL GAHGPSEAVD TACSSSLVAL
HRAVCALRRG ESGVAIVGGV NVMLTTRLHR ALAAAGMLSP DGRCKTFDAA ANGYVRGEGI
AALVLMPLER ARAGGHPVHA VIKGSAVNHG GRAAFLTAPD INAQAALIEA AYRDAGVDPA
TVSYIEAHGT GTSLGDPIEV QALRQGFDAC ARARGHADAP APARCGLGSV KTNIGHLEAA
AGLAGVVKVV LAMNRRMLPP SLHCRELNPY LKLDGSRYHV VTEPVPWPGD ATPTPLRAGV
SSFGFGGSNA HVVLQSADAR PNDRPSAPRP PIAHEQAEAG AADAAGPLAW FIPLSARTDT
ALRARAAQLA CWLDAERADD AWLPALAKTL SIGREPMACR FGVTCASLDS LRAQLAVALN
GPAASLARDD ARLQPHANAH AAWLAGGADP LPVAWDEATP RLRLPVYPFE GERHWPTDAV
PPARFTLAPE GDGAYRMHVA PDAPLVADHR LGGEPVLAAA AQIVIAWRAF EADANAADSS
RASEAGEPSQ PDERNEPNGS SRAIGSKGSN PAGAAIDSTD AGGSRVSSNT ADANAATQIT
LRDIEWLAPI AIGAPADLHV TLAREERSNA NEDRRGNAHR REIGNAARFA IAVAPAIDAP
LGRGYAARIA RAPTNAPALD VDAIRARCTQ PIAADACYDA FAAIGIGYGP TFRPLRAIAV
GRDEAFAEFD PSALARTTGD ARIVALLDGA FQAIAGLQLA DAGRLEGGLL PASLARIEFT
GPLADSTHAW IREAPGETGR RTFDIDLVTA RGVPCASLRG LALASGRGGA SREAPRVATP
GDHLLAPQWL PCTANAPSAA TPPQRAGAPA ILGGTPAQRA ALAATLATPP RLIDDIAELN
AHVDHLVWLP PAPAHAHAPL ARCAGLDGFR LVKRLLALGA GERAFELTVL TVRSWTMPGD
APAFPAHADL AGLCGSLANE YPHWRVRLID LSDADALPAD WRTQDTEGGH PLLHRHGQWF
ARRLVPLAAL PSPATPPYRP GGVYVAIGGA GGLGRVWTGH AIRACGAQVV WIGRRPLDAQ
IDAHCDALAE FGPRPSYLSA DASDVDSLRD ARDAVLARFG RIDGVVHTAI VLQDGGLAQL
DEAQFSAALN AQVATTANLA RVFGGDSLDF ILFFSSLQSA FVAAGQSNYA AGCTFRDAFA
DWLRTQLRCV VKVVSWGYWG QTGVVASEPY RKRMAALGIG SIEPAAAMAV VDALLAARVD
QVGYLKTTAR AAVPTLAPAL AARIAPHTSA LAGKPPPRID ETGASAAWND ALAALDRAIA
RRLFAELGAL RAFGERDVAD DGALGSVATG SDARGKRSSG ERTFEPASFD IDAALRSGRI
APAYRRWLAH ALALIAQHGH LDWDGRAGRL AEAPPPLDAA RAEWAHARAQ LDRTALLDAH
LALADATLDA LPAILQGSVP ATSILFPDGD LSRVEAVYRR NEQADRCNRA LADAVLHLVG
GASSAQPAAL AEIGAGTGGT TVPLLAALDA SGARLAHYDF TDISKAFLLN AEQTFGRGRD
TLRYRLFDVE RPVAGQALDA GGYDIVIATN VLHATQDISV TLRNAKALLK TGGHLIVNEL
LGTHGFAHAT FGLLPGWWRH RDSARRLPGS PLLSRDGWMR ALREAGFAVP DGDSAGAAAA
AGQGVIVAVS DGVIVQPAIA DAGHAAHANA DAQASAARPA SFAASAAPAR AASSIAAASS
GAELRERCVQ WLAQLVARTL KMPAGRLAPD QPLGSYGVDS ILVIGLTKTL RETFGVALSN
ATLFEHATLS ALADFFVAEH RAACERVLGG DAVAASAASS ASAASASAIP NQAASNPLTS
HAPMPMALAT RATPPASPAS PATATAADTA IAVIGMSGRY AQADNLREFW ANLRAGRHCI
TEVPAERWDW RTHFDAEKGA PGRTYSRWGG FLKQIDRFDA AFFRIAPSDA EHIDPQGRLF
LEEAWSAIED AGYTPATLSA NRQVGVFVGV MNGDYPTGAQ FWSIANRVSH ALDLHGPSLA
VDTACSSSLT AIHLALDSLR SGTCDCALAG GVNLIQSPKH LVGLSSLTML SAGDACRAFG
AGADGFVDGE GVGVLVLKPL SRALADGDAI HGIIRGSMIN AGGKTHGLTV PNPRAQQAVV
AAALARSGVP ARAVGYVEAH GTGTALGDPI ELTGLTRAFA EATGDRGFCA LGSVKSNIGH
CESAAGVAGV TKVLLQMKHR ELVPTLHADE PNPDLDFACS PFVLQRALAP WPKPDLDGWP
RIAGVSSFGA GGANAHVVLE EFVDTRVAAP DDRAGPAIVV LSAATDDALR RRARQLHAAL
ADGEIDDERL HDLAYTLQIG RDAMASRFGC VVGTVAELQA ALAAFVEGDA SRGWHAHRLA
ADRHGLAELD ADPELRASLV EQCIAAGKLD RLAALWCQGL GVDWPALHRG HARRRVHLPT
YPFDGPRHWL RDDATPATEP ARAPADIADS HAAPPMRGAS AGAPNVSTPD VAALVRRTVA
QVLGYPDVDM NESFLSLGGD SIRAARAHRM LQRSLDVKIP LSLMLEAKTL AECARAIDAL
PPAEPPSAAG TPAAGAPPAE PRAPRASAFA PRDARPRVHT LSSNQRQFFF LDRLNPANPA
FNLPGALRVR GEWHADALAA AYQALVDTHD VLRTRFVVRG GEPCAEVAPR RAAAIRHHDL
SALLPKHQAA RIAECLTGSS REGFALEQGE PSRLTVLELR DDDHVILLNL HHIVGDAVSV
VVLLDVLARA ALTGRASAPN RAQPQYAQWA AAERDALPAT VERELPYWLE RLRDVPPPLP
LPCDRARPPV PSYRGRSVPL AFPSALTAQL DAYCKAHGLS RFVVMLAAFK VALRVLSGRD
DIVVGSPYAN RADDDTADMI GSLAYALVLR TRLGEAETFA DAATLVRRTV HGAFDHLGVP
YPRLVEALNP ARHGGANPLY QIMFNVIPMP ALPDGVEPVE VDSGWLDYDL FVRLRASGSA
IEGVLQFSAD LFDRSTAEAI ATYYVELLHT LLAHPSLPLA GLAPPPELAL ERTIADAMPP
LRIEIASTFT DRPLAGTLRY WGVATGQPIE PNFAPYGQLF QTLYDPSTPF HANRHGTNVV
LVRPRDWLRF GQADANAETT DTAADAAAAQ IALHAEELAD ALAGAAPSLA VPVLVLVLPD
DASSLAAHGE HGGERDGEPA IDSSLAPYRT LAAALADLPS VTVAHWRDVA AIYPVADVFD
PHADAAGHVP FTSEYYAALA SYIARTAFQH ASVPLDDAWN RLAAQIRDDA EHLLAAPADG
ARARRAPYAP PANEAQATLA PIFAAALKLA DLGIDDNFFD CGGHSILAIG VVYQINEAFG
TSLSVADIFM APTVRRLAER MRDAPDGPEY VDLASAAVLP DDIAPLPGPV AGTPRALLLT
GATGFVGRHL LRELIDRTDA TIHCLVRAPD AAQGLARIRA TLERWSLWRD GDDARVIAVP
GDLGRPRIGL SEPDRARLVA EVDAIYHNGT SMNHLESFEM ARAANVGGVI ELLRIATEGR
PKTFNYVSTL AVFSMRERTG THVFDESAPI DDERHPSDQG YTTSKWAGEQ LTHLAAARGV
PCNVFRLGLV TGDVRHGRYD ELQAYYRLLK SCILMGAAFD DFRYDLVITP VDYVARALAH
LGAKHPQGGR VFHLSTMQVT PMRTVFEMMN AHLPTPMRML THRAWIDELR VRYRRGDVQS
IVPVVQWMMN MSDAELVKLA REREETTFIY DCTATHRELE EAGIVVPVFD DALLQRYLRG
MFDEDADLRA LAAQPDGGER ASPLHSHM