Gene BURPS668_A1795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1795 
Symbol 
ID4886086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1741812 
End bp1754609 
Gene Length12798 bp 
Protein Length4265 aa 
Translation table11 
GC content72% 
IMG OID640131733 
Productsyringomycin synthetase 
Protein accessionYP_001062790 
Protein GI126443126 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCATC GTTTGCCCAA CGTCGGGCGC GGCGACGCCG ACGAACCGTC CATCGAAACC 
GCCGCCGGCG AGCCGCCGCC GCTCGCGCTG CCGGCGAGCG TCGCGCAACG CCGGCTCTGG
TTCGTCGAGA ACGGCGACGT GCGCGCGTCG ACTTACAACG TGCCGGCCGC CTTCACCTTG
ACGGGGCCGC TCGACGACGC CGTGCTCGAG CGGGCGTTGG CGTTCATGCA GCAACGGCAT
CCCGCGCTGC GCTCGCGGTT TCGCACGCGC GACGGCGAAT TGCGCATCGA GCTCGCGCCG
CAGCCGGCGC CGTTGGCGCG GCAGGACCTC GGCGCGCTCG ATGCCGACGT GCGCGCGCGG
ACGGCCGAGC GGCTCTGCGC GAACCACGCG AACCGCCGGT TCGATCTCGA GCGGGACGCG
CCGATCCGCT GCTTGCTGCT CAGGCTCGGC GAAAACGAGC ATGTGCTGGC GGTGAACGTG
CATCACATCG TGTTCGACGA CTGGTCGATC CGGATCTTCT TCCGGGAGCT CGGCGCCGTC
TATGGCGCGC TGCTCGCGGG CGCGACGCCC GATCTGCCGG CGCTCGACTA CGCGGCGGCC
GTCGCGGCGA GCGTGCCCGC GGCCGCCCGG CACGCCGCCG CGCGTCAATA CTGGGCGCGC
GCGATGTCGG GCGCGCCGAC GCTTCACAAG CTGCCCACCG ACCGGCCGCG TCCGGCCGCG
CCGCGCATGC GCGGCGCGGT GCACAAACAC GTGTTCGCGC GCCGGCATGC CGAGGGCATC
CGCGCGCTTT GCCGCCGTGC CGGCGTGACG CCCTACATGC TGGGCGTCGC CGCGTTCGCC
GCGCTGCTGC ATCGCTATTC GGGCGAGGAC GAGATCGTGA TCGGCAGCCC GTTCGCGAAC
CGCGTGACGC AGGCGCAGCA GAGCCTGATC GGCTTCTTCA TCAACCTGAT CCCGTTGCGC
GTGCGTTTCG ATGCCGGCGT CAATTTTCTC GACCTGCTCG CGCAGGTGCG CGAGACGTCG
TTCGATGCAT TCGAGCACGC GGTGCTGCCG TTCGACCAGA TCGTCGATGC GATCCGTCCG
CCGCGTTCGT CGAGCCACGC GCCGGTGTTC CAGATCATGT TCGACTACCT GAAGAGCGGC
GGCATGCTCG AGCTCGACGG CGTCGGCGTG ACCGGCTCGC TCGTCCACAC GGGCACGGCG
AAGTACGACC TGACGGTCTC GATGGAGGAA GGCCCCGACG AGCTGGCGGC GATCGTCGAA
TACGACACGG ATCTGTTCGA CGCGGGCACG ATTGCCCGCC TGGGCGGGCA TTTCGAGCGA
TTGCTCGAGA ACGTGCTGGC TTCGCCCGCC GCGCCGATCG CCGAGGGCTC GCTGCTGCCC
GCCGACGAGC TGCGGCAGGT GCGCCGCTTC ACGCGGCCCG ACGAGCCGTA CGCGCACATT
CCGTTCTCGC CGATGCCGCA GCGGATTCGC GAAGCGGCGC GGCGCGCGCC GCACGCGGTG
GCGATCGTGC ACGGCGACGC GCGGATGACG TACGAGACGC TCGATCGCCG CTCGGATGCG
CTCGCGCGCG CGTTGCGGGC GCGCGGCGTC GGCAGGGGGA GCCGGGTGGC GTCGCTGCAA
TCGTATTCGG AGAAGATCGT CGTCGCCTAC CTGGGTATCC TGAAGGCGGG CGCCGCGTAT
TTGCCGCTCG ATCCGGCGGA CCCGAGACGG CTGGAAAAAA TCGAGGACGC CGCGCCCGCG
ATGATCGTGA CGGCGCGGCG CGATCTCGAG GACGTGCCGC AGGCGTTGCG CGCCCGCACG
CTGACGATCG ACGACCCGAT CGAATGCGGG AAAGCGCCCG ACGCGGTGAA CGACGCGGTG
AACGACGTCG CGACCGACGT CGTGACCGAC GCAACGGCGC GCGACGCCGA ACTCGACTTC
GCGACGCTTG CCGAAGCCGA TCCGGCCTAT GTGATCTATA CGTCCGGATC GACCGGCAAG
CCGAAAGGCG TCGAGGTCTC GCACGGCAGC CTCAACGTGT CGTATCACGG CTGGCATCGC
GCCTATCGGT TCGGCAAGCC CGGCCATCCG GTCACGCTGC AGCTCGCCGG CATGACGTTC
GATCTGGGCA TCGGCGACGT GAGCCGCACG CTCGCATGCG GCGGCACGCT CGTCATGCCG
CCGCGCGACG GGCTGCTCGA CGCCGGCCGG CTGCACGCGC TGATGCGCGC CGAACGTGTG
TCGTTCGGCG ATTTCCCGCC GGTGATCCTG CGCGAGCTGA TTCGCCATTG CAACGAGACG
GGCGAGCGGC TCGACATGCT CGACACGCTC GTGTGCGGCG CCGACGTATG GTTCGGGCAC
GAGCTGCACG CGGCGCGCGC GCTGTGCGGG CCGCGCGCGC GCGTGCTCGG TTCGTACGGC
GTGACCGAGG CGGCGATCGA CAGTTCGTAC TTCGATCCCG ATCTGCACGC GCTCGCGCCC
GATAGCGTCG TGCCGCTCGG GCGGCCGTTG CCGAGCTGCG AATTGCTGAT CGTCGATCCG
CTGCTGCAGA TGACGCCGAT CGGCGTGCCG GGCGAATTGC TCGTCGCGGG CCCCGCGGTG
GCGACGCGCT ACCTGAACAA CGACGCGCTG ACCGCGCAAA AATTCCTCCG CGGCCGGGTG
GACGAGCACG GGCGCGTGAT CGCAGGCGAC GGCCAAACGC GCTTTTATCG CACCGGCGAT
ATCTGCCGGT TCCTCGAAGA CGGCACGATC GATTTCCTGG GGCGCCGCGA CAACCAGATC
AAGATCCGCG GCTTTCGCGT CGAGCTGGGC GAGGTCGAGG GCGTGCTCGC CGCGCATCCG
GACGTGCGCC AATGCGCGGT CGTCGTGCGC GACGAGGCGT CGGGCGACCC GTCGCTCGCG
GCGTTCGTCG TGAGCGACGC GCCGATCGCG GCGCTGCGCG GCTATCTGCG CGGGCGTCTG
CCCGCGTACA TGCTGCCCGC CGCGATCGAA CGGCTGGGCG ACATGCCGCT CACGGCGAGC
GGCAAGATCG ACCGGAACCG GCTCAAGGCC TGGCCGCTGA GCGCGCCGGA CGTCCCGCCG
CCCGACGCGG CGACCGACGT CGAGCGCCGC CTGCTCGCGC TGTGGGAAAC CTTGCTGTCG
GCGCGCGTGC CGAGCGTGCA CGAGAACTTC TTCCAGTGCG GCGGACATTC GCTCGCGGCC
GCGCGCCTCG CGGCGAGCAT CAGTCAGGCG TTCGACGTCT CGATCGGCGT GTCGAGCGTC
TTCAATCATC CGAGCGTCGC CGAGCAGGCG CGGCTCGTCG AAGCGCTCGC GTCCGCGCGC
GCGCCGCGCG ACGTCCGGCA GACGGCGCAC GCGGACGCCG CGGAGCCGGC GGGCGACGAC
GGGCTGCTGT CCTATTCGCA GCAGAGCCTG TGGCTGACGG CGAAGCGCAC GCCGGACGAT
TTCAGCTACA ACATCCCGGT GACGTGGCGC CTCGACGGCC CGCTCGACGC GCACGCGCTG
GAGCGGGCGA TCAACGATGT GGTCGCGCGC CACGATGCGT TGCGCACGGT GTTTTCGTCG
GACGTGCGCA CGGTCGTCGG CCCGCATCGC GAATCGTCGC AGGAGCCGAC GCAGCGGGTG
CTCGATACGC TGACGATCGC GTTGCGGCGG GTGGCCGTCG CACCGGACGA CGCGGCGAGC
CTGCCCGCGC GGCTGCGCGA GGCGCACAGC CGCGCGTTCG ATCTGAACGC GGGGCCGCTG
CTGCGCGCGG TGCTGTTCGA GATCGCGCCG ACGCATCACG TGCTCGACGT GACGATCCAC
CATATCGTGA TCGATGGGCC GTCGTTCGGC CTGTTCTGGC GCGATCTGCA GACCGCGTAC
CGTGCTCGCG TCGCGGGCGA GGCGCCCGGC TGGCAGCGCC CGGCGCGGCG CCATGCGGAT
TTCGTGAGCC GGCAGCGGCA GGCGCTGCGC GGCGAAGCGG CCGCGCGGCA GCTCGCATAT
TGGCGCGAGC AACTGAGTGG CTTGCCCGCG GCGCTGCCGC TACCCGACGC GGTGCTCGCG
GCGCACGCGC CCGGCAGCGC CCGATCGCTG ACGTTCGAGA TGCCCGATGA CGTCGCCGCG
GGCCTTGCCG CGCTCGCGCG GCGCACGAAC GGCTCGCCGT TCATCGTCTA CCTCGCGCTT
TTCGCCGCCG CGCTGCGGCA GCAGACCGGC GAAGCGGACT TCGCGATCGG CACGCCGCTC
TCGCTGCGCC CGCACGAAGG GTTCGCCGAC GTGCTCGGCT TCTTCGCGAA CACGATGCCG
CTGCGCATGC GCCTGCACGG GCTCGACACC TTCGAGCGCG TGCTGCGGTA CGTGCGCGAA
CAGTGCCTCG CGCTCTACGA GAACGGCGAC GTGCCGTTCG AGTATCTCGT CCAGGCGCTG
AAGCCCGCGC GGGCCGCGCG GCGCAATCCG GTGTTCCAGA CGATCTTCTC GTGCGAATTC
GACGACGAGC GGCTGCAGTT GACGGGCGTC GACGCGCATG CGCTCGCGCT CGACGCGTAC
ACGGCGAAGC TCGATCTCGA AATGGCGATC AACGTGAGCG ACGGCCGCGT GGTGTGCCAT
CTGATGTCGC GCCCCGGATC GTTCGATGCC GATGCGTTGA GCTCGATCCG GCATCACTTC
CTGCGGACGG CGTGCAGCGC GACGCGCGCG CACGCGGAGC GCGAGGCGGC AGAGGCGCGC
GCGACGCACG AGGCGCACGA GGCGCACGAG GTGCACGAGG CGCACGAGGC GCATGAGGTG
CACGAGGCGC ACGAGGCGCA TGAGGTGCAC GAGGCGCACG AGGCGCATGA GATGCATGAG
ATGCATGAGA TGCATGAGAT GCATGACGTG CATGACGTGC ATGAGGCGCG CGCGCAACTC
GCCGATGACG CGCCGGGCGC GCGCCGGCCA TCGGCGGACG ACCTGTTCGG CCTCTTCGCG
CGCAGCGCGG CGCGCCATGC GCAGCGCGTC GCGCTCGACA GCCCGATGCT GCGCGCGAGC
TACGCGCAAC TGGCCGAGCG CGTGTCGGCC GCCGCGCGCG CGCTCGCGGC GCACGGCGTG
CGTCGCGGCG ATCGGGTCGG GATCTTCGTC GGCCACCATC CGCACAACGT GACGGCGATG
CTCGCGATCG CGCGCGTCGG CGCCGCGTTC GTGCCGATGG ACCCCGAGCA CAAGCCGCAG
TGGAACCGGC ATATCGTGGA CGACGCGGCG CTGACGGCGC TCGTCGGCGG CGCGTGGACC
GCGGATGCGG CGCGCGGCTT CGGGCTGCCC GTCGTCGATC TCGACGCGCC GCCGCCGCCC
GCATCGGAGC TCGCCGACGC GCCCGCGGCG GGCGGCGCGC ACCCGGACGA TTGCGCGTAC
GTGATCTACA CGTCGGGCTC GACGGGCCGG CCGAAGGGCG TCGCCGTCTC GCACGCGAGC
GTGTGCCACA ACGTGCGCGC GATGGCCGAG ATCATGCGCA TCGGGCCGCA GTCGAGAATG
GCGCAGTACG TGTCGCCGGT GTTCGACGTC GTGCTGGGCG AGATCTTTCC GGCGCTCGCC
GCAGGCGCGG CGATCGTGTT CGCCGAGCGC CGCCGGCCGC TGCCGGGCCA GGCGCTGGTG
GATTGGCTCG ACGCGCAACG CGTGTCGCAT GTGTGGATCG TGCCGTCCGC GCTGGCGATG
GTGCCCGAGG CCGCGCTGCC GGCGCTCGAG GTGCTGATCG TGGCCGGCGA AGCCTGCCCG
CGCGAGCTCG CGCAGCGATG GGCGGCCGGA CGCCGGCTGC TGAACGGCTA CGGGCCCACG
GAGGCCGCGA TCGTCGTGTC GCTGACCGAT TACCACGCGC AGCGCGAGCG CCTGATCCTG
AGGCCGATGG GCGGCGCGCG GCTGCACGTG CTCGACGAAG CGCTGCGCCC GGCGCCCGCC
GGCGCGGCGG GCGAGCTGTT CATCGGCGGC GCGTGCGTCG CGCAGGGCTA CCTCGGGCAG
CCGGCGCGCA CCGCGCAGGC GTTCGTCGCC GATCCGTTCG ACGCCGAGCC GGGCGCGCGC
ATGTATCGCA CGGGCGACGT GGTGCGCCGG CTCGACGACG GCGCGATCCA GTTCATCGGC
CGCGTCGATC GCCAGGTGAA GATCCGGGGC TTTCGCATCG AGCTCGACGC GGTGCGCGCC
GCGCTGATGG AAGTGCCCGG CGTGCAGGCG GCGGAGGCGC TCGCGCAGCC GGACGCGAGC
GGGCAGCCGC TGCTCGTCGG GTATGTCGTC GCGCGCCGCG CGAAGGCCGA GCTGCTCGAC
GCGCTGCGCG GCAAGGTGCC GGACGCGATG GTGCCCTCGA CGCTCGTGTT CCTCGATGCG
CTGCCGACCG GCAGCACGGG CAAGACGGAT CTGAAGGCGC TGAAGGCGCT GAAGACGGGC
GACGCGGCGC GCCCCGCCGC GGCGGCGGCC GACATGCCGC GTGCCGCGTC GCAGGGGCGC
ACGCTGCATC GCGTGCGCGA GATCTGGCGC ACGCTGCTCG AACGCGACGA CATCGGCGAC
GACGAAAACT TCTTCGACGC GGGCGGGCAT TCGCTGCGCG CGGTCGCGCT GCACCAGCGC
ATCACCGAAG CGTTCGGCGA TGTGATCGCG CTCACCGATC TCTTCGAGCA TCCGACGATC
GGCGCGCTCG CCGCGCATCT CGACGCGTTC GCGCCGCGCG ACGGCGAAGC GGCCGACGAC
GCCGCCGGCG CTGCCGCGCG CGCGCCCGCC GACGGTGTGC TCGACACCGA CGCGATCGCC
GTGATCGGCC TCGCCGGCCG CTTTCCCGAC GCGCCGGACC TCGACCGTTT CTGGGAGCGG
CTGCTCGCCG GCTACGAGGC GGGCCGCACG CTGAGCGACG CGGAACTCGA CGCGCACGGC
GTGCCGGCCG AGCTGTATCG CAATCCGCAT TTCGTCCGCC GCTTCAAGGA GCTCGAAGGC
AAGGCGGAGT TCGACGCCGG CTTTTTCGGC TATTCGCCCC GCGAGGCGCA GGTGATGGAC
CCGCAGCAGC GGATCTTCCT CGAGCTCGCG TGGCAGGCGC TCGAGCAAGC CGGCTATGGC
GATCGCGGCC GCGTGCGCTC GGTCGGCGTG TTCGCGAGCG CCGCGTTCAA CTATTACCTC
GTGCAGAACG TGATGCCGAA TGCCGAGCGG CTGCGGCTCG AGCCGGGCCA GTGGCTGATC
GGCAACGACA AGGATTTCAT CGCGACCCGC ACCGCGTACA AGCTCAATCT GCTCGGGCCG
GCGCTGAGCG TCGGCACCGC GTGCTCGTCG TCGCTGATGG CGGTCCATCT GGCTTGCGCG
AGCTTGCGCA ACGGCGAGGC GCAGATGGCG CTCGCCGGCG CGGTCGCGCT CGATCCCGAT
CAGGTCGGCT ATCTGTACGC CGAAGGCGGG ATCATGTCGC CGGACGGGCG CTGCCGGCCG
TTCGACGCCG CCGCGGCCGG CACCGCGGGC GGCAGCGGCG GCGGCGTGGT GCTGCTCAAG
CGGCTCGACG CGGCGCTGCG CGACGGCGAC ACCGTGTACG CGGTGATCAA GGGCTCGGCG
GCGAACAACG ACGGCGCGGA CAAGGTGAGC TACACCGCGC CGAGCGTCGC CGGCCAGACG
GCCGTGATCC GCGACGCGCT GCGCGCGGCG CGCGTGTCGG CCGACAGCAT CGGCTACGTC
GAAGCGCACG GCACGGGCAC GCCGCTCGGC GATCCGATCG AGGTGCGCGC GCTCGCGCAG
GCCTTCGCCG AGGCGGCCGC GCCGGGCGCG CTGGCGAACG GGCGGTGCGG GATCGGCTCG
ATCAAGGGCA ACATCGGCCA TCTCGACGCG GCGGCCGGCA TCGCGGGGTT CATCAAGGCG
GTTCTCGCGC TGCATCGCGA AGCGATTCCG CCGAGCATCA ACTGCGAGAC GCCTAACGCG
CGAATCGGCT TCGACAAGAC GCCGTTCAGC GTCGTGCGCG AAGCCCGCGC GTGGCCGAGA
ACGGCGACGC CGCGCCGCGC GGGCGTCAGC TCGTTCGGCG TCGGCGGCAC GAACGTCCAC
GTGGTGCTGG AGGAGGCGCC GCGCGTGCGC GCGGGCGAAT CGGCCGAGCC TTCGCGCTGG
CAGTTGCTGC CGGTTTCGGC GCGTTCGCCG AGCGCGCTGC GCGAGCAATG GCGGCAACTG
CGCGACGCGC TCGCGCACGC GCGGCCGCGC GTGCAGGACG TCGCCCATAC GCTGCAGGTC
GGGCGCACCG CGTTCGAGCA TCGCGGCTTC GCCGTCGTCG ACGCGGCCGC CGACGCGCCC
GCCCAGCTCG ACGCCGCCGG CTCGCCGCCG GCGTTCGAGC GTCGCGCTGC GCCGCCGGTG
GTGTTCATGT TTCCCGGCCA AGGCAGCCAG TATCCCGGCA TGGGCGCGGC GCTGTATCGA
AGCGGCGGCG TATTCCAGGC CGAAGTCGAT CGCTGCGCGC AGTTGCTTCG CGCGCATCTG
GACCGGGACG TCCGCTCGCT GATGTTCGAC GCGGATGCGT CGCTGCTGAG GGAAACGCGC
TATACGCAGC CGGCGCTGTT TGCGATCGAA TACGCGCTCG CGCGGCAATG GCTCGCATGG
GGCGTCACGC CGCACGCGAT GATCGGCCAT AGCGTCGGCG AGCTCGTCGC GGCTGCCGTC
GGCGAGACGC TCGCGCTGCC CGACGCGCTG GCGCTCGTCG TCGCGCGGGC CGACGCGATG
CAGCGCCAGC CGCCGGGCGC GATGCTCGCG GTGCTGGCCG ATGCGCGCGA GCTCGCGGCG
CTCAACGGCC CGGGCTGCGA GATCGCGGCG ATCAACGGCC CCGAGCAATA CGTGCTCGCC
GGCGACGCCG CCCGGATCGC CGCGCTCGAA GACGCGTGCA TCGCGGCCGG CGTCGCGTGC
CAGCGCCTCG CGACCTCGCA TGCGTTCCAT TCGTCGGCGA TGGACGGCGC CGCGCGCGAG
ATCGATCGCG CGAGCGAGCG CATCGTGCGC CGGGCCGCGC GCATTCCGTT GATCTCGAAT
CGCAGCGGGC GCTGGCTGAA CGAGCAAGAC CTGCGGGACG CCGGTTACTG GAGCGAGCAC
GTGCGCCAGC CCGTGCAATT TCACGCGGGG GTGCGCACGC TGCTCGATGC GCTCGACGCG
CCGATATTCG TCGAAGTCGG GCCGGGGCGC GCGCTGGGCA ATCTGATCGG CGGCTGGGCG
GGGCTCGGGC CGCAGCGGAT CGTCGCGACG CTGCCGCACG CGCGCGAGCG CAGGGACGAC
ATGGCGGCGG CGTTGCGAGG CGTCGGCACG CTGTGGGCGC AGGGCGTCGA CGTGAACTGG
GCCGGCCTGC ATGCGCCGGG CGCGGCGCGG CGCATCGCGC TGCCGACCTA TCCGTTCGAG
CGTACGCGCC ACTGGATCGA GCGGCCGGCG GGCGCGCGCG CGGCGCCGGC GCGCGAAGCC
GACGGCGTGC CGATGCGGCG CGGCGAGGAC GCGGCCGACG GCAGCCTCAC CGTGTCGTTT
GCGCTGCACG AGCGGCTCTG GTTTCTCGAT GAACACCGGA TCTTCGACGG CGCGCCGGTG
CTGCCCGGCA CCGCCTGCAT CGAACTCGTG CGGCGCGCGT ATTCGCTCGT GCGCCCGGGC
GCGGCGGTGA CGATGCGCGA CGTCTATTTT CCGACGGCGC TGATCCTGTC GACGGACGAA
TCGCGCAACG TGCGCGTCGT GTTCCGGCCG CCGGAACGGG TGTCGGACGG CGCGGCCGCC
GGGGGCGACC TCGCGTTCGT GCTCGAATCG AACGACGCGA ACGGGCCCGC CGGATGGACG
CCGCACGCGA GCGGGCGCAT CGGGGACGAT CCGCCGGCGT GCGCCGCGCC CGCTTCGCTC
GCCGCGCTCG CCGCGCTCGA CGCTCCGGCC GCGCTGCGCG AGCAATGGGG GCTCACGGCG
CTGGACGACG TCGCCGCGCT GTCCGCGCAG GCGTTCGCCG ATTACGGGGC GCGCTGGCGC
GGCGTCGACG CGCTCTGGCT CGGCGAACGG GCCGGGCTCG CGCGGCTCAG GCTGCCGGCC
GCCGGCAGCG GCGACTTGCC CGATTTCGCG CTGCATCCGG CCATGCTCGA CGTCGCCACG
GCGTTCCTGC CCGCCTGCCT GCGCCCGCGC GACGCGTCGG TGCCGTTCCG CTACGAATCG
ATCCGCATGC ACCGGCCGCT GCGCGCCGAT TGCTACAGCT TCGCCGTCGA GACCGCGCCG
AACGTGTACG ACGTCACGCT GTTCGCATGG GACGAGGCCG CGCGGCGCGC CGACGTGCTC
GTCGCGATCG GCGGCTTCGC GCGCCGCGAG CCGGCGCACC GGGCGCGCGA CGTCGCGCAG
TGGTGCCGCA CGGTGAGCTG GCGCGACGCG CCCGCGGCGC GCGCGTTGCC GCCCGAGCGC
TGGCTCGTGT TCGGCGACGA ATGGTTCGCG CTCGCGCCCG CCGGCAGCGT GCTCGTGCGC
GAGGACGACG CGTTTCGCGC GCACGGCGAC AACGGCTATG GCGTGCGCCC GGGCGAGAAG
GCCGATTGCG ACCGGCTGAT CGCGCGGCTC GCCGAGCAGG GCGGCGTGCC GGCGCACGTC
GTCTACGGCT GGGCGCAGAC GGACGTCGAT CGCGCGTTCG CCGGGCTCGC CGCGCTGCTG
CAGGCGCTGG GCGCGCATCC CGCCGATTTT CGCGTGTCGC TCGTGACGAA GGGCGCGCGC
TCGGCGCGCA CGCTGGACGC ATGCGCGGCC GCCGCGCCGG CGGGCTTGCT CAAGGCGGTG
CGCTGGGAAT ATCCGCGCAT CGTGTGCCGC CACATCGATA TCGACGATGC AAGCGACGCG
ACGATCGACG CGTTGCGCGC GGAGCTGTCG TCCGAGCCCG CCACGCCGCC CGGCGCGCCG
CCCGGGCTGC CGAGCAGCAT CGCGCTCGCC GGCGCGCGCC GCGAGGCGCC CGGCTTCGCG
GCGCTGCCGG CCGTCGCGCG CGACGACGTC CTGCGCGACG GCGGCGCGTA TCTGATCACG
GGCGGCGCGA GCGGCATCGG CCTCGAGCTG GCGGCGCACA TCGCGTCGCG GCGGCGGGAC
GTGAGGCTGG CGTTGCTGAG CCGCTCGCCG CATGACGAAA ACGCCGCGCG GTTCGCCGCG
CTGGACGAGG CCGCCGCGAG CGTGCTGCGG TTGACGGCCG ACGTCGCGCA CGCCGCGCAG
CTCGCCGACG CGCTGCGCAC GGTGCGCGCG CGCTTCGGGC GCATCGACGG CGTCATCCAC
GCGGCCGGCG TCGAGGCGAG CGGCCTGCTC GAAACCGGCA CGCCCGACGC ATGGCGGCGC
GTGATGGCGG CGAAGGTTCA CGGCGCGCGA CACCTGTTCG ACCAACTGGC CGGCGATCCG
CCCGATTTCA TCGTGCTCTG CTCGTCGCTC GCCGCGGTCG TCGGCGGCCT CGGGCAGGCC
GACTACGCGG CGGCGAACGG TTACATGGAC GCGCTCGCGC AGCACTGGCG CCAACGCGGC
GTCGCGGCCA TCGCGATCGA TTGGGATACG TGGTCGGACA CGGGCATGGC GTTCGACCAC
GCGGCGCGCA CGCGCCGCTC GAATGACCGC CCGGGCGCGC TGCCCGGCCT CGCGAACCGC
GAAGGGCGGG CGCTCTTCGA TCTCGCGCTC GCGCTCGCGC ACGACGCGCC GCGCATCGTT
ATCAGTAAGC GGGGCTTCGA ACAGGACCGG CGCGACGCGC CCACGCGCGC GCGGCGCGCG
GCCGCCCCGG GCGACGCGCA GGCGGCGCTC GTCGCGCTGT GGCAGGAACT GCTGGGCGTC
GAGCAGGTGG GCGTCGACGA CGACTTCTTC GATCTGGGCG GCCATTCGCT GCTCGCGACG
CAGTTGATTT CCCGCGTGCG CGATCAGTAC GCGCGCAGCC CGACGCTCGG CGAATTCCTG
GAGGAGCCGA CGATCGCGCG GCTACTGCGC GCGATCGACC ACACGGGCGG CGACACGGCC
GGCGACATGA GCGGCGACAC CGCCGGCGAC GCGCCCGACG TCGACGAGAC GCTGCGCTAT
TGCGTGGTGC CGATGGTGAA GGCCGGCAGC GGCGCGCCGT TCTTCTGCAT TCCCGGCATG
GGCGGCAACA TCACGCAGTT GCTGCCGCTC GCGAACGCGC TCGGCGCGGA CCGGCCGGTG
ATCGGCCTGC AATACCTCGG CCTCGACGGC AAGCACGCGC CGCATGCGTC GGTCGAGGCA
ATCGCGGCGC ACTACGTGCG CTGCATCCGC AGCGTGCAGC CCGCGGGGCC GTACTTCCTC
GGCGGGCATT CGCTCGGCGG CAAGATCGCC TATGAAGTCG CACGCCGGCT GCATGCGCAG
GGCGACGCGA TCGGCCTCGT CGCGATGTTC GATTCCGCCG CGCCGCCGTA TTCGTTCGTC
GCGCATCAGG ACGATTTCGC GATCGCCAGC ATGATTCTCG GCGTCTTCGC GTACTACGCG
GGCAAGATGG AGATGATGGC CGGCATCGAC GACGCGCGCT TGCGCGACGC GCCGCGCGAG
CGGTTGCTCG CGTTCATGGG CGAGCGGCTC GCGCAGTTCG GCGTGATTCA GTCGCAAAGC
GATACGAGCG CGATTCGCGG GCTCTTCAAC GTGTATCGCG CGGCGGCCGA TTTTTCCGCG
CGCTATGCGC CGCCGCACGA GCACCTGCCG CTGCCGATAC TGCTCGTCAA GGCGACCGAG
CCGATGCCCG ACGGCATCAA GCTGCCCGAA ATCCGCGAGA CGCCGGCGTG GGGCTGGGAA
AACTTCACGC GGCTGCCGGT GCGCACGTGC GAAGTCGCGG GCAACCATTA CAGTTGCCTG
ATGGACGGCT ACGTCGAGCG CATCGCCGAT GCGCTGCGCG ATGCGCTGGC GTCGGCGCGG
CAAGCGATCG AGGCGTGA
 
Protein sequence
MNHRLPNVGR GDADEPSIET AAGEPPPLAL PASVAQRRLW FVENGDVRAS TYNVPAAFTL 
TGPLDDAVLE RALAFMQQRH PALRSRFRTR DGELRIELAP QPAPLARQDL GALDADVRAR
TAERLCANHA NRRFDLERDA PIRCLLLRLG ENEHVLAVNV HHIVFDDWSI RIFFRELGAV
YGALLAGATP DLPALDYAAA VAASVPAAAR HAAARQYWAR AMSGAPTLHK LPTDRPRPAA
PRMRGAVHKH VFARRHAEGI RALCRRAGVT PYMLGVAAFA ALLHRYSGED EIVIGSPFAN
RVTQAQQSLI GFFINLIPLR VRFDAGVNFL DLLAQVRETS FDAFEHAVLP FDQIVDAIRP
PRSSSHAPVF QIMFDYLKSG GMLELDGVGV TGSLVHTGTA KYDLTVSMEE GPDELAAIVE
YDTDLFDAGT IARLGGHFER LLENVLASPA APIAEGSLLP ADELRQVRRF TRPDEPYAHI
PFSPMPQRIR EAARRAPHAV AIVHGDARMT YETLDRRSDA LARALRARGV GRGSRVASLQ
SYSEKIVVAY LGILKAGAAY LPLDPADPRR LEKIEDAAPA MIVTARRDLE DVPQALRART
LTIDDPIECG KAPDAVNDAV NDVATDVVTD ATARDAELDF ATLAEADPAY VIYTSGSTGK
PKGVEVSHGS LNVSYHGWHR AYRFGKPGHP VTLQLAGMTF DLGIGDVSRT LACGGTLVMP
PRDGLLDAGR LHALMRAERV SFGDFPPVIL RELIRHCNET GERLDMLDTL VCGADVWFGH
ELHAARALCG PRARVLGSYG VTEAAIDSSY FDPDLHALAP DSVVPLGRPL PSCELLIVDP
LLQMTPIGVP GELLVAGPAV ATRYLNNDAL TAQKFLRGRV DEHGRVIAGD GQTRFYRTGD
ICRFLEDGTI DFLGRRDNQI KIRGFRVELG EVEGVLAAHP DVRQCAVVVR DEASGDPSLA
AFVVSDAPIA ALRGYLRGRL PAYMLPAAIE RLGDMPLTAS GKIDRNRLKA WPLSAPDVPP
PDAATDVERR LLALWETLLS ARVPSVHENF FQCGGHSLAA ARLAASISQA FDVSIGVSSV
FNHPSVAEQA RLVEALASAR APRDVRQTAH ADAAEPAGDD GLLSYSQQSL WLTAKRTPDD
FSYNIPVTWR LDGPLDAHAL ERAINDVVAR HDALRTVFSS DVRTVVGPHR ESSQEPTQRV
LDTLTIALRR VAVAPDDAAS LPARLREAHS RAFDLNAGPL LRAVLFEIAP THHVLDVTIH
HIVIDGPSFG LFWRDLQTAY RARVAGEAPG WQRPARRHAD FVSRQRQALR GEAAARQLAY
WREQLSGLPA ALPLPDAVLA AHAPGSARSL TFEMPDDVAA GLAALARRTN GSPFIVYLAL
FAAALRQQTG EADFAIGTPL SLRPHEGFAD VLGFFANTMP LRMRLHGLDT FERVLRYVRE
QCLALYENGD VPFEYLVQAL KPARAARRNP VFQTIFSCEF DDERLQLTGV DAHALALDAY
TAKLDLEMAI NVSDGRVVCH LMSRPGSFDA DALSSIRHHF LRTACSATRA HAEREAAEAR
ATHEAHEAHE VHEAHEAHEV HEAHEAHEVH EAHEAHEMHE MHEMHEMHDV HDVHEARAQL
ADDAPGARRP SADDLFGLFA RSAARHAQRV ALDSPMLRAS YAQLAERVSA AARALAAHGV
RRGDRVGIFV GHHPHNVTAM LAIARVGAAF VPMDPEHKPQ WNRHIVDDAA LTALVGGAWT
ADAARGFGLP VVDLDAPPPP ASELADAPAA GGAHPDDCAY VIYTSGSTGR PKGVAVSHAS
VCHNVRAMAE IMRIGPQSRM AQYVSPVFDV VLGEIFPALA AGAAIVFAER RRPLPGQALV
DWLDAQRVSH VWIVPSALAM VPEAALPALE VLIVAGEACP RELAQRWAAG RRLLNGYGPT
EAAIVVSLTD YHAQRERLIL RPMGGARLHV LDEALRPAPA GAAGELFIGG ACVAQGYLGQ
PARTAQAFVA DPFDAEPGAR MYRTGDVVRR LDDGAIQFIG RVDRQVKIRG FRIELDAVRA
ALMEVPGVQA AEALAQPDAS GQPLLVGYVV ARRAKAELLD ALRGKVPDAM VPSTLVFLDA
LPTGSTGKTD LKALKALKTG DAARPAAAAA DMPRAASQGR TLHRVREIWR TLLERDDIGD
DENFFDAGGH SLRAVALHQR ITEAFGDVIA LTDLFEHPTI GALAAHLDAF APRDGEAADD
AAGAAARAPA DGVLDTDAIA VIGLAGRFPD APDLDRFWER LLAGYEAGRT LSDAELDAHG
VPAELYRNPH FVRRFKELEG KAEFDAGFFG YSPREAQVMD PQQRIFLELA WQALEQAGYG
DRGRVRSVGV FASAAFNYYL VQNVMPNAER LRLEPGQWLI GNDKDFIATR TAYKLNLLGP
ALSVGTACSS SLMAVHLACA SLRNGEAQMA LAGAVALDPD QVGYLYAEGG IMSPDGRCRP
FDAAAAGTAG GSGGGVVLLK RLDAALRDGD TVYAVIKGSA ANNDGADKVS YTAPSVAGQT
AVIRDALRAA RVSADSIGYV EAHGTGTPLG DPIEVRALAQ AFAEAAAPGA LANGRCGIGS
IKGNIGHLDA AAGIAGFIKA VLALHREAIP PSINCETPNA RIGFDKTPFS VVREARAWPR
TATPRRAGVS SFGVGGTNVH VVLEEAPRVR AGESAEPSRW QLLPVSARSP SALREQWRQL
RDALAHARPR VQDVAHTLQV GRTAFEHRGF AVVDAAADAP AQLDAAGSPP AFERRAAPPV
VFMFPGQGSQ YPGMGAALYR SGGVFQAEVD RCAQLLRAHL DRDVRSLMFD ADASLLRETR
YTQPALFAIE YALARQWLAW GVTPHAMIGH SVGELVAAAV GETLALPDAL ALVVARADAM
QRQPPGAMLA VLADARELAA LNGPGCEIAA INGPEQYVLA GDAARIAALE DACIAAGVAC
QRLATSHAFH SSAMDGAARE IDRASERIVR RAARIPLISN RSGRWLNEQD LRDAGYWSEH
VRQPVQFHAG VRTLLDALDA PIFVEVGPGR ALGNLIGGWA GLGPQRIVAT LPHARERRDD
MAAALRGVGT LWAQGVDVNW AGLHAPGAAR RIALPTYPFE RTRHWIERPA GARAAPAREA
DGVPMRRGED AADGSLTVSF ALHERLWFLD EHRIFDGAPV LPGTACIELV RRAYSLVRPG
AAVTMRDVYF PTALILSTDE SRNVRVVFRP PERVSDGAAA GGDLAFVLES NDANGPAGWT
PHASGRIGDD PPACAAPASL AALAALDAPA ALREQWGLTA LDDVAALSAQ AFADYGARWR
GVDALWLGER AGLARLRLPA AGSGDLPDFA LHPAMLDVAT AFLPACLRPR DASVPFRYES
IRMHRPLRAD CYSFAVETAP NVYDVTLFAW DEAARRADVL VAIGGFARRE PAHRARDVAQ
WCRTVSWRDA PAARALPPER WLVFGDEWFA LAPAGSVLVR EDDAFRAHGD NGYGVRPGEK
ADCDRLIARL AEQGGVPAHV VYGWAQTDVD RAFAGLAALL QALGAHPADF RVSLVTKGAR
SARTLDACAA AAPAGLLKAV RWEYPRIVCR HIDIDDASDA TIDALRAELS SEPATPPGAP
PGLPSSIALA GARREAPGFA ALPAVARDDV LRDGGAYLIT GGASGIGLEL AAHIASRRRD
VRLALLSRSP HDENAARFAA LDEAAASVLR LTADVAHAAQ LADALRTVRA RFGRIDGVIH
AAGVEASGLL ETGTPDAWRR VMAAKVHGAR HLFDQLAGDP PDFIVLCSSL AAVVGGLGQA
DYAAANGYMD ALAQHWRQRG VAAIAIDWDT WSDTGMAFDH AARTRRSNDR PGALPGLANR
EGRALFDLAL ALAHDAPRIV ISKRGFEQDR RDAPTRARRA AAPGDAQAAL VALWQELLGV
EQVGVDDDFF DLGGHSLLAT QLISRVRDQY ARSPTLGEFL EEPTIARLLR AIDHTGGDTA
GDMSGDTAGD APDVDETLRY CVVPMVKAGS GAPFFCIPGM GGNITQLLPL ANALGADRPV
IGLQYLGLDG KHAPHASVEA IAAHYVRCIR SVQPAGPYFL GGHSLGGKIA YEVARRLHAQ
GDAIGLVAMF DSAAPPYSFV AHQDDFAIAS MILGVFAYYA GKMEMMAGID DARLRDAPRE
RLLAFMGERL AQFGVIQSQS DTSAIRGLFN VYRAAADFSA RYAPPHEHLP LPILLVKATE
PMPDGIKLPE IRETPAWGWE NFTRLPVRTC EVAGNHYSCL MDGYVERIAD ALRDALASAR
QAIEA