Gene BURPS1106A_A1709 details


Gene Information

Locus tag: BURPS1106A_A1709
Symbol:
ID: 4904477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1675155
End bp: 1687952
Gene Length: 12798 bp
Protein Length: 4265 aa
Translation table: 11
GC content: 72%
IMG OID: 640144815
Product: non-ribosomal peptide synthase
Protein accession: YP_001075743
Protein GI: 126456950
COG category: [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
    [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
    [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases
    [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCATC GTTTGCCCCA CGTCGAGCGC GGCGACGCCG ACGAACCGTC CATCGAAACC 
GCCGCCGGCG AGCAGCCGCC GCTCGCGCTG CCGGCGAGCG TCGCGCAACG CCGGCTCTGG
TTCGTCGAGA ACGGCGACGT GCGCGCGTCG ACTTACAACG TGCCGGCCGC CTTCACCTTG
ACGGGGCCGC TCGACGACGC CGTGCTCGAG CGGGCGTTGG CGTTCATGCA GCAACGGCAT
CCCGCGCTGC GCTCGCGGTT TCGCACGCGC GACGGCGAAT TGCGCATCGA GCTCGCGCCG
CAGCCGGCGC CGTTGGCGCG GCAGGACCTC GGCGCGCTCG ATGCCGACGT GCGCGCGCGG
ACGGCCGAGC GGCTCTGCGC GAACCACGCG AACCGCCGGT TCGATCTCGA GCGGGACGCG
CCGATCCGCT GCTTGCTGCT CAGGCTCGGC GAAAACGAGC ATGTGCTGGC GGTGAACGTG
CATCACATCG TGTTCGACGA CTGGTCGATC CGGATCTTCT TCCGGGAGCT CGGCGCCGTC
TATGGCGCGC TGCTCGCGGG CGCGACGCCC GATCTGCCGG CGCTCGACTA CGCGGCGGCC
GTCGCGGCGA GCGTGCCCGC GGCCGCCCGG CACGCCGCCG CGCGTCAATA CTGGGCGCGC
GCGATGTCGG GCGCGCCGAC GCTTCACAAG CTGCCCACCG ACCGGCCGCG TCCGGCCGAG
CCGCGCATGC GCGGCGCGGT GCACAAGCAC GTGTTCGCGC GCCGGCATGC CGAGGGCATC
CGCGCGCTTT GCCGCCGTGC CGGCGTGACG CCCTACATGC TGGGCGTCGC CGCGTTCGCC
GCGCTGCTGC ATCGCTATTC GGGCGAGGAC GAGATCGTGA TCGGCAGCCC GTTCGCGAAC
CGCGTGACGC AGGCGCAGCA GAGCCTGATC GGCTTCTTCA TCAACCTGAT CCCGTTGCGC
GTGCGTTTCG ATGCCGGCGT CAATTTTCTC GACCTGCTCG CGCAGGTGCG CGAGACGTCG
TTCGATGCAT TCGAGCACGC GGTGCTGCCG TTCGACCAGA TCGTCGATGC GATCCGTCCG
CCGCGTTCGT CGAGCCACGC GCCGGTGTTC CAGATCATGT TCGACTACCT GAAGAGCGGC
GGCATGCTCG AGCTCGACGG CGTCGGCGTG ACCGGCTCGC TCGTCCACAC GGGCACGGCG
AAGTACGACC TGACGGTCTC GATGGAGGAA GGCCCCGACG AGCTGGCGGC GATCGTCGAA
TACGACACGG ATCTGTTCGA CGCGGGCACG ATCGCCCGCC TGGGCGGGCA TTTCGAGCGA
TTGCTCGAGA ACGTGCTGGC TTCGCCCGCC GCGCCGATCG CCGAGGGCTC GCTGCTGCCC
GCCGACGAGT TGCGGCAGGT GCGCCGCTTC ACGCGGCCCG ACGAGCCGTA CGCGCACATT
CCGTTCTCGC CGATGCCGCA GCGGATTCGC GAAGCGGCGC GGCGCGCGCC GCACGCGGTG
GCGATCGTGC ACGGCGACGC GCGGATGACG TACGAGACGC TCGATCGCCG CTCGGATGCG
CTCGCGCGCG CGTTGCGGGC GCGCGGCGTC GGCAGGGGGA GCCGGGTGGC GTCGCTGCAA
TCGTATTCGG AGAAGATCGT CGTCGCCTAC CTGGGTATCC TGAAGGCGGG CGCCGCGTAT
TTGCCGCTCG ATCCGGCGGA CCCGAGACGG CTGGAAAAAA TCGAGGACGC CGCGCCCGCG
ATGATCGTGA CGGCGCGGCG CGATCTCGAG GACGTGCCGC AGGCGTTGCG CGCCCGCACG
CTGACGATCG ACGACCCGAT CGAATGCGGG AAAGCGCCCG ACGCGGTGAA CGACGTCGCG
ACCGACGTCG TGACCGACGC AACGGCGCGC GACGCCGAAC TCGACTTCGC GACGCTTGCC
GAAGCCGATC CGGCCTATGT GATCTATACG TCCGGATCGA CCGGCAAGCC GAAAGGCGTC
GAGGTCTCGC ACGGCAGCCT CAACGTGTCG TATCACGGCT GGCATCGCGC CTATCGGTTC
GGCAAGCCCG GCCATCCGGT CACGCTGCAG CTCGCCGGCA TGACGTTCGA TCTGGGCATC
GGCGACGTGA GCCGCACGCT CGCATGCGGC GGCACGCTCG TCATGCCGCC GCGCGACGGG
CTGCTCGACG CCGGCCGGCT GCACGCGCTG ATGTGCGCCG AACGCGTGTC GTTCGGCGAT
TTCCCGCCGG TGATCCTGCG CGAGCTGATT CGCCATTGCA ACGAGACGGG CGACCGGCTC
GACATGCTCG ACACGCTCGT GTGCGGCGCC GACGTATGGT TCGGGCACGA GCTGCACGCG
GCGCGCGCGC TGTGCGGGCC GCACGCGCGC GTGCTCGGTT CGTACGGCGT GACCGAGGCG
GCGATCGACA GTTCGTACTT CGATCCCGAT CTGCACGCGC TCGCGCCCGA TAGCGTCGTG
CCGCTCGGGC GGCCGTTGCC GAGCTGCGAA TTGCTGATCG TCGATCCGCT GCTGCAGATG
ACGCCGATCG GCGTGCCGGG CGAATTGCTC GTCGCGGGCC CTGCGGTGGC GACGCGCTAC
CTGAACAACG ACGCGCTGAC CGCGCAAAAA TTCCTCCGCG GCCGCGTGGA CGAGCACGGG
CGCGTGATCG CAGGCGACGG CCAAACGCGC TTTTATCGCA CCGGCGATAT CTGCCGGTTC
CTCGAAGACG GCACGATCGA TTTCCTGGGG CGCCGCGACA ACCAGATCAA GATCCGCGGC
TTTCGCGTCG AGCTGGGCGA GGTCGAGGGC GTGCTCGCCG CGCATCCGGA CGTGCGCCAA
TGCGCGGTCG TCGTGCGCGA CGAGGCGTCG GGCGACCCGT CGCTCGCGGC GTTCGTCGTG
AGCGACGCGC CGATCGCGGC GCTGCGCGGC TATCTGCGCG GGCGTCTGCC CGCGTACATG
CTGCCCGCCG CGATCGAACG GCTGGGCGAC ATGCCGCTCA CGGCGAGCGG CAAGATCGAC
CGGAACCGGC TCAAGGCCTG GCCGCTGAGC GCGCCGGACG TCCCGCCGCC CGACGCGGCG
ACCGACGTCG AGCGCCGCCT GCTCGCGCTG TGGGAAAACT TGCTGTCGGC GCGCGTGCCG
AGCGTGCACG AGAACTTCTT CCAGTGCGGC GGACATTCGC TCGCGGCCGC GCGCCTCGCG
TCGAGCATCA GTCAGGCGTT CGACATCTCG ATCGGCGTGT CGAGCGTCTT CAATCATCCG
AGCGTCGCCG AGCAGGCGCG GCTCGTCGAA GCGCTCGCGT CCGCGCGCGC GCCGCGCGAC
GTCCGGCAGA CGGCGCACGC GGACGCCGCG GAGCCGGCGG GCGACGACGG GCTGCTGTCC
TATTCGCAGC AGAGCCTGTG GCTGACGGCG AAGCGCACGC CGGACGATTT CAGCTACAAC
ATCCCGGTGA CGTGGCGCCT CGACGGCCCG CTCGACGCGC ACGCGCTGGA GCGGGCGATC
AACGATGTGG TCGCGCGCCA CGATGCGTTG CGCACGGTGT TTTCGTCGGA CGTGCGCACG
GTCGTCGGCC CGCATCGCGA ATCGTCGCAG GAGCCGACGC AGCGGGTGCT CGATACGCTG
ACGATCGCGT TGCGGCGGGT GGCCGTCGCA CCGGACGACG CGGCGAGCCT GCCCGCGCGG
CTGCGCGAGG CGCACAGCCG CGCGTTCGAT CTGAACGCGG GGCCGCTGCT GCGCGCGGTG
CTGTTCGAGA TCGCGCCGAC GCATCACGTG CTCGACGTGA CGATCCACCA TATCGTGATC
GACGGGCCGT CGTTCGGCCT GTTCTGGCGC GATCTGCAGA CCGCGTACCG TGCTCGCGTC
GCGGGCGAGG CGCCCGGCTG GCAGCGCCCG GCGCGGCGCC ATGCGGATTT CGTGAGCCGG
CAGCGGCAGG CGCTGCGCGG CGAAGCGGCC GCGCGGCAGC TCGCATATTG GCGCGAGCAA
CTGCGCGGCT TGCCCGCGGC GCTGCCGCTG CCCGACGCGG TGCTCGCGGC GCACGCGCCC
GGCAGCGCCC GATCGCTGAC GTTCGAGATG CCCGATGACG TCGCCGCGGG CCTCGCCGCG
CTCGCGCGGC GCACGAACGG CTCGCCGTTC ATCGTCTACC TCGCGCTTTT CGCCGCCGCG
CTGCGGCAGC AGACCGGCGA AGCGGACTTC GCGATCGGCA CGCCGCTCTC GCTGCGCCCG
CACGAAGGGT TCGCCGACGT GCTCGGCTTC TTCGCGAACA CGATGCCGCT GCGCATGCGC
CTGCACGGGC TCGACACCTT CGAGCGCGTG CTGCGGTACG TGCGCGAACA GTGCCTCGCG
CTCTACGAGA ACGGCGACGT GCCGTTCGAG TATCTCGTCC AGGCGCTGAA GCCCGCGCGG
GCCGCGCGGC GCAATCCGGT GTTCCAGACG ATCTTCTCGT GCGAATTCGA CGACGAGCGG
CTGCAGTTGA CGGGCGTCGA CGCGCATGCG CTCGCGCTCG ACGCGTACAC GGCGAAGCTC
GATCTCGAAA TGGCGATCAA CGTGAGCGGC GGCCGCGTGG TGTGCCATCT GATGTCGCGC
CCCGGATCGT TCGATGCCGA TGCGTTGAGC TCGATCCGGC ATCACTTCCT GCGGACGGCG
TGCAGCGCGA CGCGCGCGCA CGCGGAGCGC GAGGCGGCAG AGGCGCGCGC GACGCACGAG
GTGCATGAGG TGCATGAGGC GCACGAGGCG CATGAGGTGC ATGAGGCGCA CGAGGCGCAC
GAGGTGCACG AGGTGCACGA GGTGCATGAG GCGCATGAGG CGCATGAGGT GCACGAGGTG
CATGAGGTGC ATGAGGTGCA CGAGGTGCAT GAGATGCATG AGATGCATGA CGTGCATGAG
GCGCGCGCGC AACTCGCCGA TGACGCGCCG GGCGCGCGCC GGCCGTCGGC GGACGACCTG
TTCGGCCTCT TCGCGCGCAG CGCGGCGCGC CATGCGCAGC GCGTCGCGCT CGACAGCCCG
ATGCTGCGCG CGAGCTACGC ACAACTGGCC GAGCGCGTGT CGGCCGCCGC GCGCGCGCTC
GCGGCGCACG GCGTGCGGCG CGGCGATCGG GTCGGGATCT TCGTCGGCCA CCATCCGCAC
AACGTGACGG CGATGCTCGC GATCGCGCGC GTCGGCGCCG CGTTCGTGCC GATGGACCCC
GAGCACAAGC CGCAGTGGAA CCGGCATATC GTGGACGACG CGGCGCTGAC GGCGCTCGTC
GGCGGCGCGT GGACCGCGGA TGCGGCGCGC GGCTTCGGGC TGCCCGTCGT CGATCTCGAC
GCGCCGCCGC CGCCCGCATC GGAGCTCGCC GACGCGCCCG CGGCGGGCGG CGCGCACCCG
GACGATTGCG CGTACGTGAT CTACACGTCG GGCTCGACGG GCCGGCCGAA GGGCGTCGCC
GTCTCGCACG CGAGCGTGTG CCACAACGTG CGCGCGATGG CCGAGATCAT GCGCATCGGG
CCGCAGTCGA GAATGGCGCA GTACGTGTCG CCGGTGTTCG ACGTCGTGCT GGGCGAGATC
TTTCCGGCGC TCGCCGCAGG CGCGGCGATC GTGTTCGCCG AGCGCCGCCG GCCGCTGCCG
GGCCAGGCGC TGGTGGATTG GCTCGACGCG CAGCGCGTGT CGCATGTGTG GATCGTGCCG
TCCGCGCTGG CGATGGTGCC CGAGGCCGCG CTGCCGGCGC TCGAGGTGCT GATCGTGGCC
GGCGAAGCCT GCCCGCGCGA GCTCGCGCAG CGATGGGCGG CCGGGCGCCG GCTGCTGAAC
GGCTACGGGC CCACGGAGGC CGCGATCGTC GTGTCGCTGA CCGATTACCA CGCGCAGCGC
GAGCGCCTGA TCCTGAGGCC GATGGGCGGC GCGCGGCTGC ACGTGCTCGA CGAAGCGCTG
CGCCCGGCGC CCGCCGGCGC GGCGGGCGAG CTGTTCATCG GCGGCGCGTG CGTCGCGCAG
GGCTACCTCG GGCAGCCGGC GCGCACCGCG CAGGCGTTCG TCGCCGATCC GTTCGACGCC
GAGCCGGGCG CGCGCATGTA TCGCACGGGC GACGTGGTGC GCCGGCTCGA CGACGGCGCG
ATCCAGTTCA TCGGCCGCGT CGATCGCCAG GTGAAGATCC GGGGCTTTCG CATCGAGCTC
GACGCGGTGC GCGCCGCGCT GATGGAAGTG CCCGGCGTGC AGGCGGCGGA GGCGCTCGCG
CAGCCGGACG CGAGCGGGCA GCCGCTGCTC GTCGGGTATG TCGTCGCGCG CCGCGCGAAG
GCCGAGCTGC TCGACGCGCT GCGCGGCAAG GTGCCGGACG CGATGGTGCC CTCGACGCTC
GTGTTCCTCG ATGCGCTGCC GACCGGCAGC ACGGGCAAGA CGGATCTGAA GGCGCTGAAG
GCGCTGAAGA CGGGCGACGC GGCGCGCCCC GCCGCGGCGG CGGCCGACAT GCCGCGTGCC
GCGTCGCAGG GGCGCACGCT GCATCGCGTG CGCGAGATCT GGCGCACGCT GCTCGAACGC
GACGACATCG GCGACGACGA AAACTTCTTC GACGCGGGCG GGCATTCGCT GCGCGCGGTC
GCGCTGCACC AGCGCATCAC CGAAGCGTTC GGCGATGTGA TCGCGCTCAC CGATCTCTTC
GAGCATCCGA CGATCGGCGC GCTCGCCGCG CATCTCGACG CGTTCGCGCC GCGCGACGGC
GAAGCGGCCG ACGACGCCGC CGGCGCTGCC GCGCGCGCGC CCGCCGACGG TGTGCTCGAC
ACCGACGCGA TCGCCGTGAT CGGCCTCGCC GGCCGCTTTC CCGACGCGCC GGACCTCGAC
CGTTTCTGGG AGCGGCTGCT CGCCGGCTAC GAGGCGGGCC GCACGCTGAG CGACGCGGAA
CTCGACGCGC ACGGCGTGCC GGCCGAGCTG TATCGCAATC CGCATTTCGT CCGCCGCTTC
AAGGAGCTCG AAGGCAAGGC GGAGTTCGAC GCCGGCTTTT TCGGCTATTC GCCCCGCGAG
GCGCAGGTGA TGGACCCGCA GCAGCGGATC TTCCTCGAGC TCGCGTGGCA GGCGCTCGAG
CAAGCCGGCT ATGGCGATCG CGGCCGCGTG CGCTCGGTCG GCGTGTTCGC GAGCGCCGCG
TTCAACTATT ACCTCGTGCA GAACGTGATG CCGAATGCCG AGCGGCTGCG GCTCGAGCCG
GGCCAGTGGC TGATCGGCAA CGACAAGGAT TTCATCGCGA CCCGCACCGC GTACAAGCTC
AATCTGCTCG GGCCGGCGCT GAGCGTCGGC ACCGCGTGCT CGTCGTCGCT GATGGCGGTC
CATCTGGCTT GCGCGAGCTT GCGCAACGGC GAGGCGCAGA TGGCGCTCGC CGGCGCGGTC
GCGCTCGATC CCGATCAGGT CGGCTATCTG TACGCCGAAG GCGGGATCAT GTCGCCGGAC
GGGCGCTGCC GGCCGTTCGA CGCCGCCGCG GCCGGCACCG CAGGCGGCAG CGGCGGCGGC
GTGGTGCTGC TCAAGCGGCT CGACGCGGCG CTGCGCGACG GCGACACCGT GTACGCGGTG
ATCAAGGGCT CGGCGGCGAA CAACGACGGC GCGGACAAGG TGAGCTACAC CGCGCCGAGC
GTCGCCGGCC AGACGGCCGT GATCCGCGAC GCGCTGCGCG CGGCGCGCGT GTCGGCCGAC
AGCATCGGCT ACGTCGAAGC GCACGGCACG GGCACGCCGC TCGGCGATCC GATCGAGGTG
CGCGCGCTCG CGCAGGCCTT CGCCGAGGCG GCCGCGCCGG GCGCGCTGGC GAACGGGCGG
TGCGGGATCG GCTCGATCAA GGGCAACATC GGCCATCTCG ACGCGGCGGC CGGCATCGCG
GGGTTCATCA AGGCGGTTCT CGCGCTGCAT CGCGAAGCGA TTCCGCCGAG CATCAACTGC
GAGACGCCTA ACGCGCGAAT CGGCTTCGAC AAGACGCCGT TCAGCGTCGT GCGCGAAGCC
CGCGCGTGGC CGAGAACGGC GACGCCGCGC CGCGCGGGCG TCAGCTCGTT CGGCGTCGGC
GGCACGAACG TCCACGTGGT GCTGGAGGAG GCGCCGCGCG TGCGCGCGGG CGAATCGGCC
GAGCCTTCGC GCTGGCAGTT GCTGCCGGTT TCGGCGCGTT CGCCGAGTGC GCTGCGCGAG
CAATGGCGGC AACTGCGCGA CGCGCTCGCG CACGCGCGGC CGCGCGTGCA GGACGTCGCC
CATACGCTGC AGGTCGGGCG CACCGCGTTC GAGCATCGCG GCTTCGCCGT CGTCGACGCG
GCCGCCGACG CGCCCGCCCA GCTCGACGCC GCCGGCTCGC CGCCGGCGTT CGAGCGTCGC
GCTGCGCCGC CGGTGGTGTT CATGTTTCCC GGCCAAGGCA GCCAGTATCC CGGCATGGGC
GCGGCGCTGT ATCGGAGCGG CGGCGTATTC CAGGCCGAAG TCGATCGCTG CGCGCAGTTG
CTTCGCGCGC ATCTGGACCG GGACGTCCGC TCGCTGATGT TCGACGCGGG TGCGTCGCTG
CTGAGGGAAA CGCGCTATAC GCAGCCGGCG CTGTTTGCGA TCGAATACGC GCTCGCGCGG
CAATGGCTCG CATGGGGCGT CACGCCGCAC GCGATGATCG GCCATAGCGT CGGCGAGCTC
GTCGCGGCCG CCGTCGGCGA GACGCTCGCG CTGCCCGACG CGCTGGCGCT CGTCGTCGCG
CGGGCCGACG CGATGCAGCG CCAGCCGCCG GGCGCGATGC TCGCGGTGCT GGCCGATGCG
CGCGAGCTCG CGGCGCTCAA CGGCCCGGGC TGCGAGATCG CGGCGATCAA CGGCCCCGAG
CAATACGTGC TCGCCGGCGA CGCCGCCCGG ATCGCCGCGC TCGAAGACGC GTGCATCGCG
GCCGGCGTCG CGTGCCAGCG CCTCGCGACC TCGCATGCGT TCCATTCGTC GGCGATGGAC
GGCGCCGCGC GCGAGATCGA TCGCGCGAGC GAGCGCATCG TGCGCCGGGC CGCGCGCATT
CCGTTGATCT CGAATCGCAG CGGGCGCTGG CTGAACGAGC AAGACCTGCG GGACGCCGGT
TACTGGAGCG AGCACGTGCG CCAGCCCGTG CAATTTCACG CGGGGGTGCG CACGCTGCTC
GATGCGCTCG ACGCGCCGAT ATTCGTCGAA GTCGGGCCGG GGCGCGCGCT GGGCAATCTG
ATCGGCGGCT GGGCGGGGCT CGGGCCGCAG CGGATCGTCG CGACGCTGCC GCACGCGCGC
GAGCGCAGGG ACGACATGGC GGCGGCGTTG CGAGGCGTCG GCACGCTGTG GGCGCAGGGC
GTCGACGTGA ACTGGGCCGG CCTGCATGCG CCGGGCGCGG CGCGGCGCAT CGCGCTGCCG
ACCTATCCGT TCGAGCGTAC GCGCCACTGG ATCGAGCGGC CGGCGGGCGC GCGCGCGGCG
CCGGCGCGCG AAGCCGACGG CGTGCCGATG CGGCGCGGCG AGGACGCGGC CGACGGCAGC
CTCACCGTGT CGTTTGCGCT GCACGAGCGG CTCTGGTTTC TCGATGAACA CCGGATCTTC
GACGGCGCGC CGGTGCTGCC CGGCACCGCC TGCATCGAAC TCGTGCGGCG CGCGTATTCG
CTCGTGCGCC CGGGCGCGGC GGTGACGATG CGCGACGTCT ATTTTCCGAC GGCGCTGATC
CTGTCGACGG ACGAATCGCG CAACGTGCGC GTCGTGTTCC GGCCGCCGGA ACGCGTGTCG
GACGGCGCGG CCGCCGGGGG CGACCTCGCG TTCGTGCTCG AATCGAACGA CGCGAACGGG
CCCGCCGGAT GGACGCCGCA CGCGAGCGGG CGCATCGGGG ACGATCCGCC GGCGTGCGCC
GCGCCCGCTT CGCTCGCCGC GCTCGACGCT CCGGCCGCGC TGCGCGAGCA ATGGGGGCTC
ACGGCGCTGG ACGACGTCGC CGCGCTGTCC GCGCAGGCGT TCGCCGATTA CGGGGCGCGC
TGGCGCGGCG TCGACGCGCT CTGGCTCGGC GAACGGGCCG GGCTCGCGCG GCTCAGGCTG
CCGGCCGCCG GCAGCGGCGA CTTGCCCGAT TTCGCGCTGC ATCCGGCCAT GCTCGACGTC
GCCACGGCGT TCCTGCCCGC CTGCCTGCGC CCGCGCGACG CGTCGGTGCC GTTCCGCTAC
GAATCGATCC GCATGCACCG GCCGCTGCGC GCCGATTGCT ACAGCTTCGC CGTCGAGACC
GCGCCGAACG TGTACGACGT CACGCTGTTC GCATGGGACG AGGCCGCGCG GCGCGCCGAC
GTGCTCGTCG CGATCGGCGG CTTCGCGCGC CGCGAGCCGG CGCACCGGGC GCGCGACGTC
GCGCAGTGGT GCCGCACGGT GAGCTGGCGC GACGCGCCCG CGGCGCGCGC GTTGCCGCCC
GAGCGCTGGC TCGTGTTCGG CGACGAATGG TTCGCGCTCG CGCCCGCCGG CAGCGTGCTC
GTGCGCGAGG ACGACGCGTT TCGCGCGCAC GGCGACAACG GCTATGGCGT GCGCCCGGGC
GAGAAGGCCG ATTGCGACCG GCTGATCGCG CGGCTCGCCG AGCAGGGCGG CGTGCCGGCG
CACGTCGTCT ACGGCTGGGC GCAGACGGAC GTCGATCGCG CGTTCGCCGG GCTCGCCGCG
CTGCTGCAGG CGCTGGGCGC GCATCCCGCC GATTTTCGCG TGTCGCTCGT GACGAAGGGC
GCGCGCTCGG CGCGCACGCT GGACGCATGC GCGGCCGCCG CGCCGGCGGG CTTGCTCAAG
GCGGTGCGCT GGGAATATCC GCGCATCGTG TGCCGCCACA TCGATATCGA CGATGCGAGC
GACGCGACGA TCGACGCGTT GCGCGCGGAG CTGTCGTCCG AGCCCGCCAC GCCGCCCGGC
GCGCCGCCCG AGCTGCCGAG CAGCATCGCG CTCGCCGGCG CGCGCCGCGA GGCGCCCGGC
TTCGCGGCGC TGCCGGCCGT CGCGCGCGAC GACGTCCTGC GCGACGGCGG TGCGTATCTG
ATCACGGGCG GCGCGAGCGG CATCGGCCTC GAGCTGGCGG CGCACATCGC GTCGCGGCGG
CGGGACGTGA GGCTGGCGTT GCTGAGCCGC TCGCCGCATG ACGAAAACGC CGCGCGGTTC
GCCGCGCTGG ACGAGGCCGC CGCGAGCGTG CTGCGGTTGA CGGCCGACGT CGCGCACGCC
GCGCAGCTCG CCGACGCGCT GCGCACGGTG CGCGCGCGCT TCGGGCGCAT CGACGGCGTC
ATCCACGCGG CCGGCGTCGA GGCGAGCGGC CTGCTCGAAA CCGGCACGCC CGACGCATGG
CGGCGCGTGA TGGCGGCGAA GGTTCACGGC GCGCGACACC TGTTCGACCA ACTGGCCGGC
GATCCGCCCG ACTTCATCGT GCTCTGCTCG TCGCTCGCCG CGGTCGTCGG CGGCCTCGGG
CAGGCCGACT ACGCGGCGGC GAACGGTTAC ATGGACGCGC TCGCGCAGCA CTGGCGCCAA
CGCGGCGTCG CGGCCATCGC GATCGATTGG GATACGTGGT CGGACACGGG CATGGCGTTC
GACCACGCGG CGCGCACGCG CCGCTCGAAT GACCGCCCGG GCGCGCTGCC CGGCCTCGCG
AACCGCGAAG GGCGGGCGCT CTTCGATCTC GCGCTCGCGC ACGACGCGCC GCGCATCGTT
ATCAGCAAGC GGGGCTTCGA ACAGGACCGG CGCGACGCGC CCACGCGCGC GCGGCGCGCG
GCCGCCCCGG GCGACGCGCA GGCGGCGCTC GTCGCGCTCT GGCAGGAACT GCTGGGCGTC
GAGCAGGTGG GCGTCGACGA CGACTTCTTC GATCTGGGCG GCCATTCGCT GCTCGCGACG
CAGTTGATTT CCCGCGTGCG CGATCAGTAC GCGCGCAGCC CGACGCTCGG CGAATTCCTG
GAGGAGCCGA CGATCGCGCG GCTACTGCGC GCGATCGACC ATACGGGCGG CGACACGGGC
GGCGACATGA GCGGCGACAC CGCCGGCGAC GCGCCCGACG TCGACGAGAC GCTGCGCTAT
TGCGTGGTGC CGATGGTGAA GGCCGGCAGC GGCGCGCCGT TCTTCTGCAT TCCCGGCATG
GGCGGCAACA TCACGCAGTT GCTGCCGCTC GCGAACGCGC TCGGCGCGGA CCGGCCGGTG
ATCGGCCTGC AATACCTCGG CCTCGACGGC AAGCATGCGC CGCATGCGTC GGTCGAGGCA
ATCGCGGCGC ACTACGTGCG CTGCATCCGC AGCGTGCAGC CCGCGGGGCC GTACTTCCTC
GGCGGGCATT CGCTCGGCGG CAAGATCGCC TATGAAGTCG CACGCCGGCT GCATGCGCAG
GGCGACGCGA TCGGCCTCGT CGCGATGTTC GATTCCGCCG CGCCGCCGTA TTCGTTCGTC
GCGCATCAGG ACGATTTCGC GATCGCCAGC ATGATTCTCG GCGTCTTCGC GTACTACGCG
GGCAAGATGG AGATGATGGC CGGCATCGAC GACGCGCGCT TGCGCGACGC GCCGCGCGAG
CGGTTGCTCG CGTTCATGGG CGAGCGGCTC GCGCAGTTCG GCGTGATTCA GTCGCAAAGC
GATACGAGCG CGATTCGCGG GCTCTTCAAC GTGTATCGCG CGGCGGCCGA TTTTTCCGCG
CGCTATGCGC CGCCGCACGA GCACCTGCCG CTGCCGATAC TGCTCGTCAA GGCGACCGAG
CCGATGCCCG ACGGCATCAA GCTGCCCGAA ATCCGCGAGA CGCCGGCGTG GGGCTGGGAA
AACTTCACGC GGCTGCCGGT GCGCACGTGC GAAGTCGCGG GCAACCATTA CAGTTGCCTG
ATGGACGGCT ACGTCGAGCG CATCGCCGAT GCGCTGCGCG ATGCGCTGGC GTCGGCGCGG
CAAGCGATCG AGGCGTGA
 
Protein sequence
MNHRLPHVER GDADEPSIET AAGEQPPLAL PASVAQRRLW FVENGDVRAS TYNVPAAFTL 
TGPLDDAVLE RALAFMQQRH PALRSRFRTR DGELRIELAP QPAPLARQDL GALDADVRAR
TAERLCANHA NRRFDLERDA PIRCLLLRLG ENEHVLAVNV HHIVFDDWSI RIFFRELGAV
YGALLAGATP DLPALDYAAA VAASVPAAAR HAAARQYWAR AMSGAPTLHK LPTDRPRPAE
PRMRGAVHKH VFARRHAEGI RALCRRAGVT PYMLGVAAFA ALLHRYSGED EIVIGSPFAN
RVTQAQQSLI GFFINLIPLR VRFDAGVNFL DLLAQVRETS FDAFEHAVLP FDQIVDAIRP
PRSSSHAPVF QIMFDYLKSG GMLELDGVGV TGSLVHTGTA KYDLTVSMEE GPDELAAIVE
YDTDLFDAGT IARLGGHFER LLENVLASPA APIAEGSLLP ADELRQVRRF TRPDEPYAHI
PFSPMPQRIR EAARRAPHAV AIVHGDARMT YETLDRRSDA LARALRARGV GRGSRVASLQ
SYSEKIVVAY LGILKAGAAY LPLDPADPRR LEKIEDAAPA MIVTARRDLE DVPQALRART
LTIDDPIECG KAPDAVNDVA TDVVTDATAR DAELDFATLA EADPAYVIYT SGSTGKPKGV
EVSHGSLNVS YHGWHRAYRF GKPGHPVTLQ LAGMTFDLGI GDVSRTLACG GTLVMPPRDG
LLDAGRLHAL MCAERVSFGD FPPVILRELI RHCNETGDRL DMLDTLVCGA DVWFGHELHA
ARALCGPHAR VLGSYGVTEA AIDSSYFDPD LHALAPDSVV PLGRPLPSCE LLIVDPLLQM
TPIGVPGELL VAGPAVATRY LNNDALTAQK FLRGRVDEHG RVIAGDGQTR FYRTGDICRF
LEDGTIDFLG RRDNQIKIRG FRVELGEVEG VLAAHPDVRQ CAVVVRDEAS GDPSLAAFVV
SDAPIAALRG YLRGRLPAYM LPAAIERLGD MPLTASGKID RNRLKAWPLS APDVPPPDAA
TDVERRLLAL WENLLSARVP SVHENFFQCG GHSLAAARLA SSISQAFDIS IGVSSVFNHP
SVAEQARLVE ALASARAPRD VRQTAHADAA EPAGDDGLLS YSQQSLWLTA KRTPDDFSYN
IPVTWRLDGP LDAHALERAI NDVVARHDAL RTVFSSDVRT VVGPHRESSQ EPTQRVLDTL
TIALRRVAVA PDDAASLPAR LREAHSRAFD LNAGPLLRAV LFEIAPTHHV LDVTIHHIVI
DGPSFGLFWR DLQTAYRARV AGEAPGWQRP ARRHADFVSR QRQALRGEAA ARQLAYWREQ
LRGLPAALPL PDAVLAAHAP GSARSLTFEM PDDVAAGLAA LARRTNGSPF IVYLALFAAA
LRQQTGEADF AIGTPLSLRP HEGFADVLGF FANTMPLRMR LHGLDTFERV LRYVREQCLA
LYENGDVPFE YLVQALKPAR AARRNPVFQT IFSCEFDDER LQLTGVDAHA LALDAYTAKL
DLEMAINVSG GRVVCHLMSR PGSFDADALS SIRHHFLRTA CSATRAHAER EAAEARATHE
VHEVHEAHEA HEVHEAHEAH EVHEVHEVHE AHEAHEVHEV HEVHEVHEVH EMHEMHDVHE
ARAQLADDAP GARRPSADDL FGLFARSAAR HAQRVALDSP MLRASYAQLA ERVSAAARAL
AAHGVRRGDR VGIFVGHHPH NVTAMLAIAR VGAAFVPMDP EHKPQWNRHI VDDAALTALV
GGAWTADAAR GFGLPVVDLD APPPPASELA DAPAAGGAHP DDCAYVIYTS GSTGRPKGVA
VSHASVCHNV RAMAEIMRIG PQSRMAQYVS PVFDVVLGEI FPALAAGAAI VFAERRRPLP
GQALVDWLDA QRVSHVWIVP SALAMVPEAA LPALEVLIVA GEACPRELAQ RWAAGRRLLN
GYGPTEAAIV VSLTDYHAQR ERLILRPMGG ARLHVLDEAL RPAPAGAAGE LFIGGACVAQ
GYLGQPARTA QAFVADPFDA EPGARMYRTG DVVRRLDDGA IQFIGRVDRQ VKIRGFRIEL
DAVRAALMEV PGVQAAEALA QPDASGQPLL VGYVVARRAK AELLDALRGK VPDAMVPSTL
VFLDALPTGS TGKTDLKALK ALKTGDAARP AAAAADMPRA ASQGRTLHRV REIWRTLLER
DDIGDDENFF DAGGHSLRAV ALHQRITEAF GDVIALTDLF EHPTIGALAA HLDAFAPRDG
EAADDAAGAA ARAPADGVLD TDAIAVIGLA GRFPDAPDLD RFWERLLAGY EAGRTLSDAE
LDAHGVPAEL YRNPHFVRRF KELEGKAEFD AGFFGYSPRE AQVMDPQQRI FLELAWQALE
QAGYGDRGRV RSVGVFASAA FNYYLVQNVM PNAERLRLEP GQWLIGNDKD FIATRTAYKL
NLLGPALSVG TACSSSLMAV HLACASLRNG EAQMALAGAV ALDPDQVGYL YAEGGIMSPD
GRCRPFDAAA AGTAGGSGGG VVLLKRLDAA LRDGDTVYAV IKGSAANNDG ADKVSYTAPS
VAGQTAVIRD ALRAARVSAD SIGYVEAHGT GTPLGDPIEV RALAQAFAEA AAPGALANGR
CGIGSIKGNI GHLDAAAGIA GFIKAVLALH REAIPPSINC ETPNARIGFD KTPFSVVREA
RAWPRTATPR RAGVSSFGVG GTNVHVVLEE APRVRAGESA EPSRWQLLPV SARSPSALRE
QWRQLRDALA HARPRVQDVA HTLQVGRTAF EHRGFAVVDA AADAPAQLDA AGSPPAFERR
AAPPVVFMFP GQGSQYPGMG AALYRSGGVF QAEVDRCAQL LRAHLDRDVR SLMFDAGASL
LRETRYTQPA LFAIEYALAR QWLAWGVTPH AMIGHSVGEL VAAAVGETLA LPDALALVVA
RADAMQRQPP GAMLAVLADA RELAALNGPG CEIAAINGPE QYVLAGDAAR IAALEDACIA
AGVACQRLAT SHAFHSSAMD GAAREIDRAS ERIVRRAARI PLISNRSGRW LNEQDLRDAG
YWSEHVRQPV QFHAGVRTLL DALDAPIFVE VGPGRALGNL IGGWAGLGPQ RIVATLPHAR
ERRDDMAAAL RGVGTLWAQG VDVNWAGLHA PGAARRIALP TYPFERTRHW IERPAGARAA
PAREADGVPM RRGEDAADGS LTVSFALHER LWFLDEHRIF DGAPVLPGTA CIELVRRAYS
LVRPGAAVTM RDVYFPTALI LSTDESRNVR VVFRPPERVS DGAAAGGDLA FVLESNDANG
PAGWTPHASG RIGDDPPACA APASLAALDA PAALREQWGL TALDDVAALS AQAFADYGAR
WRGVDALWLG ERAGLARLRL PAAGSGDLPD FALHPAMLDV ATAFLPACLR PRDASVPFRY
ESIRMHRPLR ADCYSFAVET APNVYDVTLF AWDEAARRAD VLVAIGGFAR REPAHRARDV
AQWCRTVSWR DAPAARALPP ERWLVFGDEW FALAPAGSVL VREDDAFRAH GDNGYGVRPG
EKADCDRLIA RLAEQGGVPA HVVYGWAQTD VDRAFAGLAA LLQALGAHPA DFRVSLVTKG
ARSARTLDAC AAAAPAGLLK AVRWEYPRIV CRHIDIDDAS DATIDALRAE LSSEPATPPG
APPELPSSIA LAGARREAPG FAALPAVARD DVLRDGGAYL ITGGASGIGL ELAAHIASRR
RDVRLALLSR SPHDENAARF AALDEAAASV LRLTADVAHA AQLADALRTV RARFGRIDGV
IHAAGVEASG LLETGTPDAW RRVMAAKVHG ARHLFDQLAG DPPDFIVLCS SLAAVVGGLG
QADYAAANGY MDALAQHWRQ RGVAAIAIDW DTWSDTGMAF DHAARTRRSN DRPGALPGLA
NREGRALFDL ALAHDAPRIV ISKRGFEQDR RDAPTRARRA AAPGDAQAAL VALWQELLGV
EQVGVDDDFF DLGGHSLLAT QLISRVRDQY ARSPTLGEFL EEPTIARLLR AIDHTGGDTG
GDMSGDTAGD APDVDETLRY CVVPMVKAGS GAPFFCIPGM GGNITQLLPL ANALGADRPV
IGLQYLGLDG KHAPHASVEA IAAHYVRCIR SVQPAGPYFL GGHSLGGKIA YEVARRLHAQ
GDAIGLVAMF DSAAPPYSFV AHQDDFAIAS MILGVFAYYA GKMEMMAGID DARLRDAPRE
RLLAFMGERL AQFGVIQSQS DTSAIRGLFN VYRAAADFSA RYAPPHEHLP LPILLVKATE
PMPDGIKLPE IRETPAWGWE NFTRLPVRTC EVAGNHYSCL MDGYVERIAD ALRDALASAR
QAIEA