Gene BURPS668_A1480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1480 
Symbol 
ID4887512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1406780 
End bp1421851 
Gene Length15072 bp 
Protein Length5023 aa 
Translation table11 
GC content74% 
IMG OID640131419 
Productputative polyketide synthase PksJ 
Protein accessionYP_001062476 
Protein GI126443134 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCA TCATCAGTGA TGTGATTCGT GCCTACCGCG AAGGGAAGCT GACGACCGCC 
GATCTGGCGA GCGAGCTTCG CCGGGGCGCG GGCGACGGCG CGGGTTTGCC GCTTTCGGAA
GGGCAGCGCG GGATCTGGGC GCTGCATGCG CTGCACGAGG ATCGCGGCGC CTATAACGTG
CCGCTTTGCT TCGCCGTGCG CGATCTGCGG GCGGACGCGT TTCGCCGCGC GCTGCGCTTC
GTCGCGCGGC AGTATCCGTC GCTGTGCGCG GCGATCCGCG TGATCGACGG CGAGCCGAGG
CGGGTGCAGC CGGCCGGCGC GACGCTCGAG CCGATCGAGG CGACGCTCGC CGACACGCTC
GGCCCCGACG CCGACGAGGC CGCGATCCTC GCGTGGCTGC GCGAGCAGGC GAGGCAGCCG
TTCTCGCTCG AGGACGGCCC GCTTTGCCGC GTGCATCTGC TCGATCTCGC CGGCTGGCGC
GTGCGCGACG CGGCGGCCGC GAGCCGGTTC GGCGCGGCGC ACACGATCGT GTCGCTGCAC
GTGCACCACC TGGTGCTCGA CGGGCAGTCG CTGCTGCTGC TGATCGGCAC GCTGCTCGAC
GCATACCGCG CGCTCGTCGA CGGCGTCGAG CCCGCGCCGC GCGCGCCGGC CGCCACGCAC
GACGATTTCG TCGCCGAGGA GCGCGCGCTT CTCGACAGCG ACGAGGGCGC GCGCCGCATC
GCGTACTGGC GGCGGCAGCT CGACGCGCTG CCGCCCGCGC TCGAGCTGCC CGCGTCGGCG
CCGGCCGCGG CCGAGCGCGC GGCGGGCGAC GCATGGCATG CGGTGCCGCT CGACGCGGCG
AGGTCGGCGC GCGTCGCGGC GTTCGTCCAG TCGAACCATC TCGGCGCCGC CGCGTTCTTC
CTCGGCATGT TCAAGCTGCT GCTGCATCGC TATACCGGCG AGCCCGACAT CGTCGTCGGC
ATGCCGGCCG ACGCGCGGCC GTCGCAGCGT TACCGCGACG CGCTCGGCTT CTTCGTCAAC
ATGCTGCCGC TGCGCACGCG CTTGGCCGGC GAGACCGCCG TCGTGGCGAT GCTCGAGCGC
GTGCAGCGCG AGCTCGTCGA CGCCATGGCG ATGCAGTATC CGTTCGGCGC GCTGGTTCGC
GAGCTCGGCC TGCAGGGCGC GGAGGACGGC GCGCCGATGT ACCGGATCGC GTTCATGTAT
CAGGATTTTC TCGCGCGCCT GCGCTTCGGC GACGACGTCG AACCGATCGG CGAGATTCGT
CAGGCGGGCG AGTATGAGCT CGTGCTCGAG GTGATCGAAG GCGCGGCGCC CGGCGGCCCC
GCGAGTTTCG CGCTGAACTG GAAGTACGAC GGCGCGCGGT ATCGCGCGGC CGCCGTGGAG
GCGATGGCGC GCCACTATCT GACGCTGCTC GACGGCGTGC TCGCGGCGCC CGCCGCGCGG
GTGGCCGATT GCCCGATGCT GCCCGCGGCC GAGCGCGAAC GGCTGCTCGC GCTTGGCCGC
GGCCCGCGCG CCGACCATGC GCGCGAGCGG CGCGTGCACG ACCTGATCGA TGCGCGCGCG
CAGCAGGCCC CGCACGCGAT CGCGGTGTCC TGCGGCGGCC GCTCGCTCGA CTATGCGCGA
CTGAAGGCCG ACAGCGACGC GCTCGCGCAG CGCCTGCGCG CGTGCGGCAT CGGCGCGGGC
GAGTTCGTCG CGGTGCGGCT CGACCGCTCG ACGGCGCTCG TCGTCGGCCT GCTCGCGGTG
CTGAAGGCGG GCGCGGCATA CGTGCCGCTC GATCCCGACT ATCCGGACGA CTGGGCGGCG
CAAATGCTCG GCGATTGCCG CCCGGCCGCG ATCCTGACCC GCGCCGCGCT CGCGGCGGGC
GCGCACGCGC TCGCGCGGCG CGTCGCGGCC GACGGCCCGC CCGCCGTCAT CGCGCTCGAC
GACGCCGCCG ACGCCGACAC CCACGCCGCC GACGGCGCAC GCGCGGCCGC GATCGCCGCC
GCGCGGCAGG CCGCGGCGAG CCGCGCGCAC GCCGCGCGGG CGGCCGATCT CGCCTACGTG
ATCTACACGT CGGGCAGCAC GGGCGCGCCG AAGGGCGTGA TGGTCACGCA TCGCGCGCTG
ACCAACTTCC TCGCGTCGAT GGCGCGCCGC CCGGGCCTGC ACGCGCGCGA CACGCTGCTC
GCCGTCACCA CGTACTGTTT CGATATCGCG GCCCTCGAGC TGTTCCTGCC GCTCGTGCAG
GGCGCGCACT GCGTGATCTG CGACAGCGCG TCGGCGCGCG ACGGCGGCCG GCTGCGCGAG
CTGATCGACG CGGCGCGCCC GACGGTGATG CAGGCGACGC CGTCCACGTG GGAGATGCTG
CTGCATGCCG GCTGGCGCAA CGCGCGGCGC ATGCGCGTGC TGTGCGGCGG CGACACGCTG
CCGGACGCCG TCAAGGCCCG GCTGCTCGAG GACGGCGGCG AAGTCTGGAA CCTGTACGGC
CCGACGGAGA CGACGATCTG GTCGATGGTC GCGCCGGTGA CGGCGGAACG GCCGACCTCG
ATCGGCGCGC CGATCGACAA CACGCGAATC CGGATCGTCG ATGCGTACGG CAATCCGGTG
CCGGTCGGCG TGCCGGGCGA GCTGTGCATC GCGGGCGACG GGCTCGCCGC GGGCTACCTG
AACCGGCCCG ACGAAACGGC GGCGCGGTTC GTCGACGCGC TGCCCGACGT GGACGGCCAG
GCGCGCGAGC GCCATTACCG CACCGGCGAC CTCGCGCGCT GGCGCGAGGA CGGCGAAGTC
GAGCATCTCG GGCGCATGGA TTTCCAGGTG AAGATTCGCG GCCATCGCGT CGAAGTGCAC
GACATCGAGC GGCATCTCGC GCGGCATCCG GCGATCCGGG CGGCCGCGGT GGTCGCGCGG
CGGCACGCGG GCGGCGATCA GCTCGTCGCC TACTACGTGC GCGGCGACGC CGCCGGGCAC
GGCGGCGCGG ACGACGCGCC GGCGCTGGCC GCCGAGCTGC GCGGCCATCT GGCCGGCGCG
CTGCCGGACT ACATGATTCC CGCGCTGTTC CTGCCGATCG ACGCGCTGCC GATGACGCAC
AACGGCAAGC TGAACCGCAA GGCGCTCGCG AGCCGCGGCA TCCGGCTGCG CGTCGCGTCG
TCGGGCGAGC GCCGCGCCGC GCCGCCGCGC GCGCCGGCCG CCGCCGATAT CGAGGCCCGC
CTGCTCGCGA TCTGCCGCGA GGTGCTGAAG ATCGACGACA TCGATCGCGC GGACGGCTTT
TTCGAGGTGG GCGGCAATTC GCTGTCGGTG GCGCTGATCG CCTCGCGCGT CGGCGCCGAG
TTCGGCCTCG CGCGGCTCGG CGCCGGCGCG TTCTTCCGCT ATCCGACGGT CGCCGCGCTG
GCCGCCCATC TGGGCGCGCG GCTGCGCGGC GACGCGGGCG CGGCCGAGGG CGCGGACGGC
GCGGACGCCG GCCCGGCCGG CGCCGACGCG CGCGCATCCC GCCCCGCGCA GCCGCGGGCG
GCCGGGCCCG CGGCGCGACT GCCCGCGGCG CTCGACGACG CGATCGCGAT CATCGGCATC
TCGTGCCAGT TTCCCGGCGC GCAAGACCAT CGCGCGTTCT GGCGCAATCT GCGCGACGGG
AAATCGGGCG CGCGGTTCTA TTCGGAAGAC GAACTGCGCG CGGCCGGCGT GCCGGACACG
CTGATCCGCG ACCGGCACTA CGTGCCGATG CAGCAGACGA TCGAAGGCAA GGACCTGTTC
GACCGGCACT TCTTCCGGCT GACGACGAAG GATGCGCAAC TGATGGACCC GCAATTCCGT
CTGTTGCTGC AGCACGCGTG GAAGGCGATC GAGGACGCCG GCTGCACGCG CGAGCGGATC
GCCGACGCCG GCGTATACAT GTCGGCGTCG AACAGCTACT ACCAGGCGAT GCTGCGCGCG
GCCGGCACGA TCGACGCGTC CGACGAGTAT CAGGCGTGGC TGCTCGCGCA GGGCGGCACG
ATTCCGACGC GCATCTCGTA CGAGCTCGGC CTGACGGGGC CCAGCCTCTT CATCCATTCG
AACTGCTCGT CCGGGCTCGT GTCGCTTTCC GTCGCGGCGA AGTCGCTGCT GCAGCGGGAA
AGCCGCTGCG CGCTCGTCGG CGCGGCGACG GTGCTGCCGG ATGCGGACAT CGGCTACGTG
TACCAGCCGG GGCTCAACCT GTCGAGCGAC GGCCGCTGCC GGACCTTCGA CGAAAACGCC
GACGGGCTCA CCTCCGGCGA AGGCGTCGCC GTGCTGCTCG TCAAGCGCGC GCGCGACGCG
ATCGACGACG GCGACCCGAT CTACGCGCTG CTGCGCGGCA TCGCCGTGAA CAACGACGGC
GCGGACAAGG TCGGCTTCTA CGCGCCGAGC GTCGGCGGCC AGGCCGACGT GATCCGCAAG
GTGCTCGATG CGACCGGCAT CCATCCCGAG ACGATCGGCT ACGTCGAGGC GCACGGCACC
GGCACGAAGC TCGGCGATCC GGTGGAGGTG GCGGCGCTCA CCGACGCGTA TCGCCGCCAC
ACCGCGCGCA CCGGATTCTG CGCGATCGGC TCGGTGAAGC CGAACATCGG CCACCTGGAT
ACCGTCGCCG GGCTGTCGGG GTGCATCAAG GTCGCGCTGA GCCTGCGGCA CGGCGAGATC
GCGCCGTCGA TCAACTACGA GAAGCCGAAC CGCGAGATCG ATTTCGCGCA CTCGCCGTTC
TACGTCGTCG ACCGATTGAC GCGCTGGCCC GCGCGCGAGC CGGGGGCGCC GCGCCGCGCG
GCGCTCAGCT CGTTCGGCAT CGGCGGCACC AACGCGCATT TGATCCTCGA GGCGTTCGAG
CGCGACGAGC CGCCCGCCGG GATGCGCGCG CCGGCCGCCC GCGCGGCGCG CGTGATCGCG
CTGTCGGCGC GCACCGAAGA GCGCGTGCGC GCGCAGGCGA GCCAGTTGCT CGCGTTCCTC
GAGCAGGAAG CCGGCGCGCT GCCGGACTTC GACGGTTTCG CGTTCACGCT GCAGGTGGGC
CGCGAGGCGA TGCGCGAGCG CGTCGCGTTC GTCGCCGACG GCTACGACGC GCTCGCCGCC
GCGCTCGCGC GTTTCCTGCG CGGCGAGCCG GACGCGGCCG CGTGTTTCAC CGGCGCGCGC
GGGGGCGATT CGACGCTCGC GGCGCTGCTC GACGATACCG GCGATACCGG CGATACGGCC
GCGCACGGGT TGATCGCCGC GTGGTGCGAG CAAGGCAAGA TCGCGAAGAT CGCGGCGCTC
TGGGCGCACG GTGTGAACGT CGATTGGCGC CGGCTTTACG GCGCGCGCGC GCCGGTGCGT
GTGAGTCTGC CCACCTATCC GTTCGCGCCG GAGCGTTGCG AGGGCGTCGC GCGCCGCCGC
GCCGCCGCGC CGGCGCCGCG CCGCGCGGGC GTCGAGACGG CCGCGGCGCG GCTGCATCCG
CTCGTTCACG ACGACCGCTC GGACGGGGCG CGCCGGCGGT TCGCCGCGAC GTATTCCGGC
GAGGAATTCT TCCTGGCCGA TCATCTGATC CGCGGCAAGC GGATCCTGCC CGGCGTCGCG
TATCTCGAGA TGGCGCGCAT GGCGGCCGTC CGGGCGCACG GCGACGGCGC GCTGAGCCTG
CACGACGTGG TCTGGATGAC GCCCATCGTC GTCGACGGGC CGTGCGAGGT CGAGCTGAGC
CTGGAGGCCG CCGAGCGTGC CGAGGCCGAG GGGGCCGCCG AGGCGGCCGC CGGCGTGCGC
ACGATGCGGT TCAACGTGAC CTCGGGCGGC GGCGCCGGCG CGCGCCGCAC GAACAGCCAG
GGGACGATTC GCCTCGCGCC CGGCGCGGCC GCGCCCGCCG CCGCGCGCGT CGATGTCGCG
GCGCTCCTCG CGCGCTGCAC GCGCGAGATC GGGGCGCAGC GGTTCTACAC GTTCCTCGAC
AGCGGCGGCG GCCATTACGG GCCGACGTTC CGGAGCGTCG CGGCGCTGCA TCAGGGCGAG
CGCGAGGTGC TCGCGCGGCT CGCGCTGCCG GAGTCCGTCG CGCACGCCGA TGCGTTCGTG
CTGCATCCGA GCATGATGGA CGCCGCGTTC CAGATCGCCG ACAGCCTGAT CCTGCAACCG
CGCGCGAACG GCGGCTGTCT GCCGTTCTTC GTGAAGGAGC TCGTCGTGCG ACGCCGGCCG
GGCCGCGACG CGTGGGTCCA CGTTCGCCTC GCCGGCGGCG ATGCGACGCT TGCCCGCTAC
GACATCGATC TGATCGACCC CGACGGCACC GTCTGCGTGT CGATGCGCGA ATTCAGCGCG
CGCGCGGAGA CGGCGGGCGG CAGCGGCCGG CCGAACACGT ACCGCGCCGC CGAATGGCGC
GCCGCGGAGC GCGACGGCGA GCGCGACGGG AACGAGCTGA ACGAGGGGAA CGAGCGGCGT
CGCGCCGCGC CGCGGGTGGC GGTGCTCGAC GCATCGCCGC GTCTTGCGCA CGCGCTGCGC
GGCATCGGCG TCGACGCGCT CTGGCTGCCG GCCGACGCGG CGCACGCGGC GCGCGGGCCG
GCGCTGCGGG ATCTCGACGC GGCGCTGCAC GCCGGCGCGG CGCGCGATCT GCTCGTGCTC
GCCGACGAAC GGCGCGAGCT CGACGACGAC GCGTTGCGCG CGTGGCTCGA CGGCGCGCCG
CACGCCGGGG GCGCGCGGCG GGCGCTCGTG TCGATCGCGG GGCTGGCCGA CGCCGACGCG
CGCGCCGTGG CGGACATCGT CGAGCGCGAG CGGCATGGCC GCGCCGCCGA CGTCCGCTAC
GACGCCGGCG GCGCGCGCAG CGTGCGCGGC TTCGCCGACG CGGCCGTCGC GCGCTGGCTG
CTCGACACGG ACGCGCTGCG CTCGGGCGGC GTGTACTGGA TCGCCGGCGC GAACGGCCCG
CTCGGCGCGA GCCTTGCCTG CCACCTCGCG ACCGTGGAGC GCGCGACCGT GGTGCTGACC
GACGCGCACG CGATCGATGC GGCCCGGCTC GCCTGCCTCG ACGGGTATCG CGCCGGCGGC
GCGCGCCTCG AGTTCATCGA AGGCGACGCC GCGCGAGACG GCGCGGCGCT CGCGCAGCGG
ATCCGCGCGC GTCACGGGCG CATCGACGGC GTGCTGCACT GTGCGCAACA CGCGTCGGCG
CCGACGCTCG CGGCGCTGGC CGCGCTCGAC CGCGCGACGC GCGCCGACGC GCTCGATTGC
TTCGTCGCCT GCGAGGCGCG GGATGCCGAT CCGGATCACG ATCCGGCGGC CGCGCTCGTG
GCGCGATTCG TCGAGCGGCG CCACGCGCGC GTGCAAGCGG GACTCGGCGG GGGGCGGACG
GTGGCGATCG CGGCGCACGC GGCGCTGCCG TGGCCGGACG ACGCGCCGCT GCTGCGCGCG
GGCGGTATCG CGAGCCAGCC CGCGCTGGCG ATCGTGCAGG CGCTGCATCA TGCGTTGCGC
TCGGACGAGG CGATGCTCGC CGTCGGCTGG GGGGCGTCGG CCGGCGGCGT CGATGCGGAC
GCATCGAACG CATCGAACGC ATCGAACGCA TCGAACGCAT CGAACGCATC AAACGCGTCG
AACGCGTCGA ACACATCGAA CACATCGAAC GCGCCGAGCG TCGCCGCGGA CCTTGCCGCG
CCCGCCGAGC CGAACGCGCG GATTCCCGCG CGCGCGCGGG CCACGCCGGA CACGATCGCG
GCATGCCTGA AGGCCGTGAT CGCCGACGTG ATCCGGGCGG ACGTCGACGA AATCGACGCG
CGCCAGCACT TCGGCGAATA CGGGCTCGAC TCGCTGTCGC TGACGTCGGT CAGCAACCGG
CTCAATGACG CATACCGCCT CGATGCTTCG CCGGCGGGCG CGCTGAATCC GACGCTGTTC
TTCGAATACC CGAGCGTCGA GCGGATGGCG GCTTATCTCG CCGAGCATCA CGCCGCGCGC
TTCGCCGACG CGTCGGCCGC ACCCGGCGCC GACGGGGCGG CCGAGTGCGC GCCGCGGCCC
GAGGCCGCGC TCAACGCCGA GGTCGAACCT GGGAACGGGG CCGCGCCCGC GCCCGAGCCC
GAGGTCGGGT TCAGGGCCGG GTTCGAGCCG GTCGCGCCGC CGATGCCGCC CATCGAGCCC
GCCGCATCGA CGCCGCCGGA CCAACCGGTG CCGCAACCCG GCGGTGCATG GCACGCCGGG
CGCGGCGCGC GCCCGGCGGC CGACGACGAC GTCGCGATCA TCGGCATCAG CGGCCGCTTT
CCCGGCGCGC GCGACGTGGC CGAATTCGGC CGCAATCTGT TCGACGGCCG GGACTGCATC
GGCGAGATTC CCGCGGACCG CTGGGACTGG CGCGCGTACC TCGGCGATCC GCAGCACGAG
GCCGGCAAGA CGAACAGCAA GTGGGGCGGC TTCATCGACG GCATCGCGGA ATTCGATCCG
CTGTTCTTCA GCCTCTCGCC GAAGGAGGCC TATCTGCTCG ATCCCGCGCA CCGGCTGCTG
CTGATGCACG CGTGGTGGGC GATCGAGGAC GCCGGCTACA ACCCCGCCGC GCTCGCCGGC
AGCCGGACCG CGCTGTTCGC GGGCATCGCG CAGAGCGGCT ACGCGGATTT GCGCAGGCAA
GCCGGCGAGG GGATCGAGGG CAACTCGTTC CTCGGGGTCG TGCCGTCGAT CGCGCTGAAC
CGGATCAGCC ACCTGCTCGA TCTGCACGGC CCGAGCGAGC CGGTCGAGAC GGCCTGTTCG
TCGTCGCTCG TCGCGATGCA CCGCGCGCTC GTCAGCCTGC GCTGCGGCGA CGCCGACATG
GCGCTCGTCG GCGGCGTGCA GACGATCCTG TCGCCGCACG CGCATATCGG GTTCGGCAAG
GCGGGCATGC TCGCGACCGA CGGCCGCTGC AAGGCGTTCT CGAGCCGCGC CGACGGCTTC
GTGCGCGGCG AGGGCATCGG CATGCTGTTC CTGAAGCGGC TCGGCGACGC GCGGCGCGAC
GGCGACGCGA TCTACGGCGT GATCCGCGGC AGCGCGGTCA ATCACGACGG CCGGTCGAGC
TCGCTCACCG CGCCGAACCC GGCCGCGCAG CGCGACGTGA TCGTGCAGGC GCACATGCGA
GCCGGCGTCG ACCCGCGCAG CATCGGTTAC ATCGAGGCGC ACGGCACCGG CACGAAGCTC
GGCGATCCGA TCGAGATCAA CGCGCTCACG CAGGCGCTCG ACACGCTGCT GCGCGCGCAG
CGCGAGGAAG GCGCCGCCTA CGTTCCCGGC GCGTGCGCGA TCGGCTCCGT GAAGAGCAAC
ATCGGCCATC TGGAGCTGGC CGCCGGCGTG TCCGGCGTGA TCAAGGTGCT GCTGCAGATG
GCGAACGGGC GGCTCGCGAA GAGCCTGCAT TGCGACGAGC TCAATCCGTA CATCACGCTC
GACGGCGGGC CGTTGCGCGT CGTCGGCGCG AACGCCGCGT GGCCGCGTCC CGTCGATCGC
GACGGCCGCG AGCAGCCGCG CCGCGCGGGC GTGAGCTCGT TCGGCATCGG CGGCGTGAAC
GCGCACGTCG TGCTCGAGGA GTATCCCGAG GCCGACGCGC GCGCGCGCGA CGACGGGCAG
CCGGCCGCCG TGCTGCTGTC CGCGCGGGAT TCGCAGCGGC TCGCCGATTA CGCGAGCGCA
TTGCTCGCGT TCGTGCGCGA GCGGCGCGAG GCGGCCGCGC ATGCGCCGCC GCCGCGGCTG
TCGGATCTCG CCTATACGCT GCAGGTGGGC CGCGAGGCGA TGCGCGAGCG TGTCGGCTTC
GTCGTCACGT CGCTCGCGCA ACTCGAGGCG CGGCTCGCCG CGTTCGTCGC GGGCGAGCCG
GCGGGCGACG GCGTCTACCG CGGCAGCGTC CGCCCGGCGC GCGGTGAACG CGCGGCCGAC
GCGGACGGCC TCGACAGGCT CGTCGACATC TGGCTCGCGA GCCGCAAGCA TGAGGCGCTG
CTCGGTGCGT GGGTGAAGGG CGCGGCGATC GACTGGGCGA GACTTCACGC GGGCGGCGCG
GCGCGCCGCG TCCATCTGCC CGGCTATCCG TTCGCGCGCG AGCGCTACTG GATCGCCGAG
CCCGCGCCGG CGACGGGCGC GCCCGGCGAG CCCGCGCCGC CGCGCATGCC GACGCAGCCG
CACGGGCCGA CGTCCGACGG CCGCGCCGAA TCGCGCCATC CGTTGCGGCG CGACGCCGCC
GACGGCCGGT TCCTGCTCGA TCTCGACGGC GACGAGGCCT TTCTCGCCGA CCATCGGGTG
GACGGACGCC GCGTGCTGCC GGGCGTCGCG CACCTGGAGA TCGCGTACGA GGCCGCGCGG
CGCACGTTCG GCCCGGCCGA TGCGATCCGG ATCCGGAACC TCGGCTGGAT CAGGCCGATC
GTCGCCGACG GCGCGCTGCG CATCGGCGTC GAACTGAGCA CGTCCGGCGC CGCCGAAGGC
GCGTTTCGCC TCTACACGAC GGATCCGCAA CATGGGCGGC TCACGCACAG CGAAGGCGCG
ATCGGCCGCG CCGACGTCGC GCAGTCGGCG CGCGCGCTCG ATCTCGGCGC GCTGCGCGAC
GCGTTCGCGA CGGCCGAGCG CGTCGATCCG GCCGTCTGGT ACGACGGCTT CTCGCGGGCC
GGCATCGATT ACGGCCCGAG CCACCGCTGC CTCGAAACAT GCGCCGTCGG CCCGGCCGGC
GTGCTCGCGC GGGTGCGCCT GCCGGCCGCC GAGGCGCGCG CGGCGCGGCC GTTCACCTTG
CATCCGGGCC TGATGGACGC GGTGCTGCAG GCGGCGATCG GCCTGCGCAA GCGCGCGGGC
GGCGCGCCGC GCGGCACGCC GTATCTGCCG TTCGCGCTCG ACACGGTCGA GATTCTCGGC
GGCTGCGGCG AGGCGGCGTG GGCATGGCTG CGCCCGTCGC CGCGCGACGC GGCCGACGCT
TCGGCGTCGC GCGGCGACGC GGGCAAGCCG GCCGCCGAGC GCATCGATAT CGATGTGTGC
GACGACGCGG GCCGGATCAG CGTGACGCTT CGCGGGCTCA CGTCGCGCCC GCTCGCGCGC
CGGACGGCGC CGGCTCCCGA GGCCGGGAAC CCGGCCGGTG AAGTCGGCGA GGTGGCCGAC
GCCACCGATG CCGACGCCGC TGAAGTCCGC GAAATCTCCG ACGTCTCCAA CGTCTCCAAC
GTCTCCAACG TCTCCAACGT CTCCGACGTC TCCGACGTCG CGCCGCTCGC CGACGGCGAC
GTCGGCCTGC TCGCGCGAAC CGCGGTGTGG AGCGCGCTGA CGCCGGCGCA GTGGCTCGCG
GATCCGGCGT CGCGCCCGCG CGCCGGCGCG CGCGTGTTCG TGCTCGGCGG CACCGCCGCG
CAGCGGCGCG AGATCGCGCG GATTCATCCC GGCTGCGAAC CGCTTGAGGC GAATGCGGCC
GACGACGGCG GCGACGGCGC GGACCAACAG GCGCACGTCG ACGCGCTGCG GCGGCGGCTC
GCCGAGGGCG CGCCGATCGA CCAGCTCGTC TGGATCGCGC CGCCGGAGCC GGCCGCCGAC
GCGCGCGCCG GGCTGCGCGG CGACGCGATC GTCGCCGCGC AGGAGCACGG GGTGCTGCAA
CTGTTCCGGA TCGTCAAGCT GCTGCTCGCG GCGGGCTACG GCGGCAAGCC GCTCGACTGG
ACGATCGTCA CGCGCGAAAC GCACGCGACG AGCGGCATCG ACGAGCCGTC GCCGACGCAC
GCGGGCGTGC ATGGGTTCGT CGGCTCGATG GCGAAGGAGT ACCGGAACTG GCGTGTCCGC
CTGCTCGACA TGCCCGCGCG CGAGGCGTGG CCGATCGACG CGATGTTCTC GACGCGCTTC
GATCCGCGCG GCGATGCGCT CGCCTATCGG CGCGGCCGCT GGCTCGCCCG CGAGCTGGCC
GCGATCGACG CGTTGCCCGA CGGCGGCTGC CATGTGAAGG CGGGCGGCGT CTACGTGGTG
ATCGGCGGCG CGGGCGGGAT CGGCGAAGTC TGGAGCCGCT GGATGATGGA GCGCTATCAG
GCGCGGATCG TCTGGATCGG GCGCCGCGAC GAGGACGAGC AAATCCGCCG CAAGCGCGAG
CGGCTCGCGC GCTACGGCAC GCCGCCCGTC TACCTGCGCG CGGACGCGAG CGAGCGCGCG
TCGCTCGCGG CGGCGCGCGA GCGGATCGCC GCGCTGCGCT GGGACGGCCG CGCGCTGCCG
ACGAGCGGCG TCGTGCATTC CGCGATCGTG CTGGCGGACG CGAGCCTCGC GACGATGGAC
GAGGCGCGCT TTCTGGCCGC GTGGCGATCG AAGGCGGATG TCAGCGTGCG CGTCGCCGAG
GTCTTCGGCG GCGATCCGCT CGATTTCATG CTGTTCTTCT CGTCGATCAC GTCGTTCGGC
AAGACGGCCG GACAGGCGAA CTACGCGGCG GGTTGCGCGT TCAAGGACGC GTTCGCCGCG
CATCTCGGCC GCACGCTGCC GTATCCCGTC AAGGTGATGA ACTGGGGCTA CTGGGGCAGC
GTCGGCGTGG TCAGCGACGA AACCTATCGC CGGCGCATGG CGAGCGCGGG CTTCGGCTCG
ATCGAGCCCG ACGAGGGCAT GTCGGCGCTG GAGCGGCTGC TCGCCAGCCG CGTCGGCCAG
ATCGCGGTGC TCAAGACGCT GCGGCCGAAC CTCGTCGGCG ACTCGCGCGC GGACCGGATC
CGGCATTACC CCGGCCGCGA CTGGCCGGAC GCGGCGCCCG CGCCGGCGAC GGCCGCGCTG
CAGGCGGCGC TCGCGGCGCG CGCCGGGCGC TGGCACGCGC AGGCGTCGGC GCTCGCGCTC
GGCAATCCCG AGCTGGAGAC GCTGATCGCG CGCGGCCTGC TCGCGGGCGT CCTTCCGTAT
CTCGACGCGC CGGGCTCGGT CGACGCGCGC CATGCGCGGT GGTTCGACGA AAGCCGGGCG
ATGCTGCACG GGTTCGGCTA TCTCGCGCGC GACGGCGCGG GCGACGCGCC TTCCTGGTCG
CTCACCGACG CCGGCCGCGC GGCGGCGCCG CACGTCTGGC AAGACTGGGA GCGGCACGCG
CTCGCGTGGC ACGACGACGA GCGGCGCGTG CCGATGCGGC TCGCGCACGT CTGCCTGCGC
GCGCTGCCCG AGCTTCTCGG CGGCAAGCGG CGCGCGACCG ACGTGATGTT CCCGGGCTCC
AGCATGGCGC TCGTCGAGGG GCTGTACAAG AGCAATCGCA AGGCCGATCT GTTCAACGAC
GTCGTGCACG ACGCGGTGCT GTCGTATGCG CGCGCGCTCG GGCGCGCGCT CGACATCGTC
GAGGTGGGCG CGGGCACGGG CGGAACGACG GACGGCCTGC TGCGCAAGCT CGTCGAGCAA
GGGATCGCGG TGCGCGAATA CCGGTATACG GATCTGTCGC ACGCTTTTCT GCTGCATGCG
CGCGAGCATT ACGCGCCGCG CGCGCCGTTC CTGACGACCG GGATCTTCGA CGTCGACAAG
CCGATCGCCG CGCAGCGCGT GCCGGGCGGC CGCTATGACG TCGCGGTCGC GACCAACGTG
CTGCACGCGA CGCGCGACGT CCGGCGCGCG CTGCGCAACG TGAAGGCGAC GCTGCGCGCG
GGCGGCCTGC TGATCCTGAA CGAGCTGAGC GTCAAGTCGC TGTTCAGCCA TGTGACGTTC
GGGCTGCTGG ACGGCTGGTG GATGTACGAG GACGCCGATT TGCGGATACC CGGCTCGCCC
GGCCTCGATT CGTCGACGTG GCGGCGCGTG CTGGCGGAAG AGGGCTTCGA GTATGTGTTC
TTCCCCGCGC AAGGGCTGCA TGCACACGGC CAGCAAGTCA TCGTCGCGCA GAGCGACGGC
GTGGTCCGGC AGCCGCGCGC GGCCGCCGCG CCGGGGGCCG GCGCGGCCGC GTCGCCTTCG
GGCGGCACGC AAGCGGCGGT GCCGGCGCGC CGGGCGGCCG CGGCATCCGG CGCGCCGCGC
GTGGAGGCGA TTCCGCCGGC GGCCGTTGCG CCCGCGGCCT TCGATGCCGC CACCGCGGCT
CCTCCCGGCA CCGCTGCCGC TGCCGCGACG GCGGTGCCGG CGGACGGCCG ATCCGCGCTC
GCCCACGCAA GTTCGCCGGC CGCCTCGCCG CCGCAGCCGG GCGACGCGCC CGCGCCCGAA
CGGATGCATG CGTATCTGCG CGACAAGCTC TCGCAAGTGC TGAAGCTGCC GCCGGAGCGC
ATCGAGACGG ACGCATCGTT CGCGAGCTAC GGCGTCGATT CGATCATGGC GATGGCGTTG
ATCGCGGCGC TCGAAAAGGA GCTGGGCAGT CTGCCGAAAA CGCTGTTCTT CGAGCACGAA
ACGATCGAGG AATTGGGCGC GTATCTGCTG GAGCGTTGCG AGCCGATGCC TTCGGGCGTG
GAGCCGGCGA CGGTGGGGGC GGACGATCGC GCCGCGTATT CCGGCGCGAG GCCGCACGCC
TGGCCCGCGT CGCCCACGGA GCCTGACGAG CCCACCGAGC CCACCGCATC GCCCGTCTCA
TCCGCCCCGC CGGCCGCCTC GCCGCCGCAG CCGGGCGACG CGCACGCGCC CGAACGAATG
CATGCGTATT TGCGCGACAA GCTCTCGCAA GTGCTGAAGC TGCCGCCGGA GCGCATCGAG
ACGGACGCAT CGTTCGCGAG CTACGGCGTC GATTCGATCA TGGCGATGGC GTTGATCACG
GCGCTCGAAA AGGAACTGGG CAGCCTGCCG AAGACGCTGT TCTTCGAGCA CGAAACGATC
GAGGAACTGG GCGCGTATCT GCTGGAGCGT TGCGAGCCGA TGCCTTCGGG CGTGGAGCCG
GCGACGGTGG GGGCGGACGA TCGCGCCGCG TATTCCGGCG CGAGGCCGCA CGCCTGGCCC
GCGTCGCCCA CGGAGCCCGA CGAGCCCACC GAGCCCACCG AGCCCACCGC ATCGCCCGTC
TCATCCGCCT CGCCGGCCGC CTCGCCGCCG CAGCCGGGCG ACGCGCCCGC GCCCGAACGG
ATGCATGCGT ATCTGCGCGA CAAGCTCTCG CAAGTGCTGA AGCTGCCGCC GGAGCGCATC
GAGACGGACG CATCGTTCGC GAGCTACGGC GTCGATTCGA TCATGGCGAT GGCGTTGATC
ACGGCGCTCG AAAAGGAGCT GGGCAGCTTG CCGAAGACGC TGTTCTTCGA GCACGAAACG
ATCGGGGAAC TGGGTGAGTA CCTGCTGGAG CGGCAAGGAC AAGAGAGGGC GTGCCATGCA
AGCAACGTTT AA
 
Protein sequence
MNAIISDVIR AYREGKLTTA DLASELRRGA GDGAGLPLSE GQRGIWALHA LHEDRGAYNV 
PLCFAVRDLR ADAFRRALRF VARQYPSLCA AIRVIDGEPR RVQPAGATLE PIEATLADTL
GPDADEAAIL AWLREQARQP FSLEDGPLCR VHLLDLAGWR VRDAAAASRF GAAHTIVSLH
VHHLVLDGQS LLLLIGTLLD AYRALVDGVE PAPRAPAATH DDFVAEERAL LDSDEGARRI
AYWRRQLDAL PPALELPASA PAAAERAAGD AWHAVPLDAA RSARVAAFVQ SNHLGAAAFF
LGMFKLLLHR YTGEPDIVVG MPADARPSQR YRDALGFFVN MLPLRTRLAG ETAVVAMLER
VQRELVDAMA MQYPFGALVR ELGLQGAEDG APMYRIAFMY QDFLARLRFG DDVEPIGEIR
QAGEYELVLE VIEGAAPGGP ASFALNWKYD GARYRAAAVE AMARHYLTLL DGVLAAPAAR
VADCPMLPAA ERERLLALGR GPRADHARER RVHDLIDARA QQAPHAIAVS CGGRSLDYAR
LKADSDALAQ RLRACGIGAG EFVAVRLDRS TALVVGLLAV LKAGAAYVPL DPDYPDDWAA
QMLGDCRPAA ILTRAALAAG AHALARRVAA DGPPAVIALD DAADADTHAA DGARAAAIAA
ARQAAASRAH AARAADLAYV IYTSGSTGAP KGVMVTHRAL TNFLASMARR PGLHARDTLL
AVTTYCFDIA ALELFLPLVQ GAHCVICDSA SARDGGRLRE LIDAARPTVM QATPSTWEML
LHAGWRNARR MRVLCGGDTL PDAVKARLLE DGGEVWNLYG PTETTIWSMV APVTAERPTS
IGAPIDNTRI RIVDAYGNPV PVGVPGELCI AGDGLAAGYL NRPDETAARF VDALPDVDGQ
ARERHYRTGD LARWREDGEV EHLGRMDFQV KIRGHRVEVH DIERHLARHP AIRAAAVVAR
RHAGGDQLVA YYVRGDAAGH GGADDAPALA AELRGHLAGA LPDYMIPALF LPIDALPMTH
NGKLNRKALA SRGIRLRVAS SGERRAAPPR APAAADIEAR LLAICREVLK IDDIDRADGF
FEVGGNSLSV ALIASRVGAE FGLARLGAGA FFRYPTVAAL AAHLGARLRG DAGAAEGADG
ADAGPAGADA RASRPAQPRA AGPAARLPAA LDDAIAIIGI SCQFPGAQDH RAFWRNLRDG
KSGARFYSED ELRAAGVPDT LIRDRHYVPM QQTIEGKDLF DRHFFRLTTK DAQLMDPQFR
LLLQHAWKAI EDAGCTRERI ADAGVYMSAS NSYYQAMLRA AGTIDASDEY QAWLLAQGGT
IPTRISYELG LTGPSLFIHS NCSSGLVSLS VAAKSLLQRE SRCALVGAAT VLPDADIGYV
YQPGLNLSSD GRCRTFDENA DGLTSGEGVA VLLVKRARDA IDDGDPIYAL LRGIAVNNDG
ADKVGFYAPS VGGQADVIRK VLDATGIHPE TIGYVEAHGT GTKLGDPVEV AALTDAYRRH
TARTGFCAIG SVKPNIGHLD TVAGLSGCIK VALSLRHGEI APSINYEKPN REIDFAHSPF
YVVDRLTRWP AREPGAPRRA ALSSFGIGGT NAHLILEAFE RDEPPAGMRA PAARAARVIA
LSARTEERVR AQASQLLAFL EQEAGALPDF DGFAFTLQVG REAMRERVAF VADGYDALAA
ALARFLRGEP DAAACFTGAR GGDSTLAALL DDTGDTGDTA AHGLIAAWCE QGKIAKIAAL
WAHGVNVDWR RLYGARAPVR VSLPTYPFAP ERCEGVARRR AAAPAPRRAG VETAAARLHP
LVHDDRSDGA RRRFAATYSG EEFFLADHLI RGKRILPGVA YLEMARMAAV RAHGDGALSL
HDVVWMTPIV VDGPCEVELS LEAAERAEAE GAAEAAAGVR TMRFNVTSGG GAGARRTNSQ
GTIRLAPGAA APAAARVDVA ALLARCTREI GAQRFYTFLD SGGGHYGPTF RSVAALHQGE
REVLARLALP ESVAHADAFV LHPSMMDAAF QIADSLILQP RANGGCLPFF VKELVVRRRP
GRDAWVHVRL AGGDATLARY DIDLIDPDGT VCVSMREFSA RAETAGGSGR PNTYRAAEWR
AAERDGERDG NELNEGNERR RAAPRVAVLD ASPRLAHALR GIGVDALWLP ADAAHAARGP
ALRDLDAALH AGAARDLLVL ADERRELDDD ALRAWLDGAP HAGGARRALV SIAGLADADA
RAVADIVERE RHGRAADVRY DAGGARSVRG FADAAVARWL LDTDALRSGG VYWIAGANGP
LGASLACHLA TVERATVVLT DAHAIDAARL ACLDGYRAGG ARLEFIEGDA ARDGAALAQR
IRARHGRIDG VLHCAQHASA PTLAALAALD RATRADALDC FVACEARDAD PDHDPAAALV
ARFVERRHAR VQAGLGGGRT VAIAAHAALP WPDDAPLLRA GGIASQPALA IVQALHHALR
SDEAMLAVGW GASAGGVDAD ASNASNASNA SNASNASNAS NASNTSNTSN APSVAADLAA
PAEPNARIPA RARATPDTIA ACLKAVIADV IRADVDEIDA RQHFGEYGLD SLSLTSVSNR
LNDAYRLDAS PAGALNPTLF FEYPSVERMA AYLAEHHAAR FADASAAPGA DGAAECAPRP
EAALNAEVEP GNGAAPAPEP EVGFRAGFEP VAPPMPPIEP AASTPPDQPV PQPGGAWHAG
RGARPAADDD VAIIGISGRF PGARDVAEFG RNLFDGRDCI GEIPADRWDW RAYLGDPQHE
AGKTNSKWGG FIDGIAEFDP LFFSLSPKEA YLLDPAHRLL LMHAWWAIED AGYNPAALAG
SRTALFAGIA QSGYADLRRQ AGEGIEGNSF LGVVPSIALN RISHLLDLHG PSEPVETACS
SSLVAMHRAL VSLRCGDADM ALVGGVQTIL SPHAHIGFGK AGMLATDGRC KAFSSRADGF
VRGEGIGMLF LKRLGDARRD GDAIYGVIRG SAVNHDGRSS SLTAPNPAAQ RDVIVQAHMR
AGVDPRSIGY IEAHGTGTKL GDPIEINALT QALDTLLRAQ REEGAAYVPG ACAIGSVKSN
IGHLELAAGV SGVIKVLLQM ANGRLAKSLH CDELNPYITL DGGPLRVVGA NAAWPRPVDR
DGREQPRRAG VSSFGIGGVN AHVVLEEYPE ADARARDDGQ PAAVLLSARD SQRLADYASA
LLAFVRERRE AAAHAPPPRL SDLAYTLQVG REAMRERVGF VVTSLAQLEA RLAAFVAGEP
AGDGVYRGSV RPARGERAAD ADGLDRLVDI WLASRKHEAL LGAWVKGAAI DWARLHAGGA
ARRVHLPGYP FARERYWIAE PAPATGAPGE PAPPRMPTQP HGPTSDGRAE SRHPLRRDAA
DGRFLLDLDG DEAFLADHRV DGRRVLPGVA HLEIAYEAAR RTFGPADAIR IRNLGWIRPI
VADGALRIGV ELSTSGAAEG AFRLYTTDPQ HGRLTHSEGA IGRADVAQSA RALDLGALRD
AFATAERVDP AVWYDGFSRA GIDYGPSHRC LETCAVGPAG VLARVRLPAA EARAARPFTL
HPGLMDAVLQ AAIGLRKRAG GAPRGTPYLP FALDTVEILG GCGEAAWAWL RPSPRDAADA
SASRGDAGKP AAERIDIDVC DDAGRISVTL RGLTSRPLAR RTAPAPEAGN PAGEVGEVAD
ATDADAAEVR EISDVSNVSN VSNVSNVSDV SDVAPLADGD VGLLARTAVW SALTPAQWLA
DPASRPRAGA RVFVLGGTAA QRREIARIHP GCEPLEANAA DDGGDGADQQ AHVDALRRRL
AEGAPIDQLV WIAPPEPAAD ARAGLRGDAI VAAQEHGVLQ LFRIVKLLLA AGYGGKPLDW
TIVTRETHAT SGIDEPSPTH AGVHGFVGSM AKEYRNWRVR LLDMPAREAW PIDAMFSTRF
DPRGDALAYR RGRWLARELA AIDALPDGGC HVKAGGVYVV IGGAGGIGEV WSRWMMERYQ
ARIVWIGRRD EDEQIRRKRE RLARYGTPPV YLRADASERA SLAAARERIA ALRWDGRALP
TSGVVHSAIV LADASLATMD EARFLAAWRS KADVSVRVAE VFGGDPLDFM LFFSSITSFG
KTAGQANYAA GCAFKDAFAA HLGRTLPYPV KVMNWGYWGS VGVVSDETYR RRMASAGFGS
IEPDEGMSAL ERLLASRVGQ IAVLKTLRPN LVGDSRADRI RHYPGRDWPD AAPAPATAAL
QAALAARAGR WHAQASALAL GNPELETLIA RGLLAGVLPY LDAPGSVDAR HARWFDESRA
MLHGFGYLAR DGAGDAPSWS LTDAGRAAAP HVWQDWERHA LAWHDDERRV PMRLAHVCLR
ALPELLGGKR RATDVMFPGS SMALVEGLYK SNRKADLFND VVHDAVLSYA RALGRALDIV
EVGAGTGGTT DGLLRKLVEQ GIAVREYRYT DLSHAFLLHA REHYAPRAPF LTTGIFDVDK
PIAAQRVPGG RYDVAVATNV LHATRDVRRA LRNVKATLRA GGLLILNELS VKSLFSHVTF
GLLDGWWMYE DADLRIPGSP GLDSSTWRRV LAEEGFEYVF FPAQGLHAHG QQVIVAQSDG
VVRQPRAAAA PGAGAAASPS GGTQAAVPAR RAAAASGAPR VEAIPPAAVA PAAFDAATAA
PPGTAAAAAT AVPADGRSAL AHASSPAASP PQPGDAPAPE RMHAYLRDKL SQVLKLPPER
IETDASFASY GVDSIMAMAL IAALEKELGS LPKTLFFEHE TIEELGAYLL ERCEPMPSGV
EPATVGADDR AAYSGARPHA WPASPTEPDE PTEPTASPVS SAPPAASPPQ PGDAHAPERM
HAYLRDKLSQ VLKLPPERIE TDASFASYGV DSIMAMALIT ALEKELGSLP KTLFFEHETI
EELGAYLLER CEPMPSGVEP ATVGADDRAA YSGARPHAWP ASPTEPDEPT EPTEPTASPV
SSASPAASPP QPGDAPAPER MHAYLRDKLS QVLKLPPERI ETDASFASYG VDSIMAMALI
TALEKELGSL PKTLFFEHET IGELGEYLLE RQGQERACHA SNV