Gene BURPS1106A_A1395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1395 
Symbol 
ID4904274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1342713 
End bp1357778 
Gene Length15066 bp 
Protein Length5021 aa 
Translation table11 
GC content74% 
IMG OID640144501 
Productputative polyketide synthase PksJ 
Protein accessionYP_001075429 
Protein GI126456322 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCA TCATCAGTGA TGTGATTCGT GCCTACCGCG AAGGGAAGTT GACGACCGCC 
GATCTGGCGA GCGAGCTTCG CCGGGGCGCG GGCGACGGCG CGGGTTTGCC GCTTTCGGAG
GGGCAGCGCG GGATCTGGGC GCTGCATGCG CTGCACGAGG ATCGCGGCGC CTATAACGTG
CCGCTTTGCT TCGCCGTGCG CGATCTGCGG GCGGACGCGT TTCGCCGCGC GCTGCGCTTC
GTCGCGCGGC AGTATCCGTC GCTGTGCGCG GCGATCCGCG TGATCGACGG CGAGCCGAGG
CGGGTGCAGC CGGCCGGCGC GACGCTCGAG CCCATCGAGG CGACGCTCGC CGACACGCTC
GGCCCCGACG CCGACGAGGC CGCGATCCTC GCGTGGCTGC GCGAGCAGGC GAGGCAGCCG
TTCTCGCTCG AGGACGGCCC GCTTTGCCGC GTGCATCTGC TCGATCTCGC CGGCTGGCGC
GCGCGCGACG CGGCGGCCGC GAGCCGGTTC GGCGCGGCGC ACACGATCGT GTCGCTGCAC
GTGCACCACC TGGTGCTCGA CGGGCAGTCG CTGCTGCTGC TGATCGGCAC GCTGCTCGAC
GCATACCGCG CGCTCGTCGA CGGCGTCGAG CCCGCGCCGC GCGCGCCGGC CGCCACGCAC
GACGATTTCG TCGCCGAGGA GCGCGCGCTT CTCGACAGCG ACGAGGGCGC GCGCCGCATC
GCGTACTGGC GGCGGCAGCT CGACGCGCTG CCGCCCGCGC TCGAGCTGCC CGCGTCGGCG
CCGGCCGCGG CCGAGCGCGC GGCGGGCGAC GCGTGGCATG CGGTGCCGCT CGACGCGGCG
AGGTCGGCGC GCGTCGCGGC GTTCGTCCAG TCGAACCATC TCGGCGCCGC CGCGTTCTTC
CTCGGCATGT TCAAGCTGCT GCTGCATCGC TATACCGGCG AGCCCGACAT CGTCGTCGGC
ATGCCGGCCG ACGCGCGGCC GTCGCAGCGT TACCGCGACG CGCTCGGCTT CTTCGTCAAC
ATGCTGCCGC TGCGCACGCG CTTGGCCGGC GAGACCGCCG TCGTGGCGAT GCTCGAGCGC
GTGCAGCGCG AGCTCGTCGA CGCCATGGCG ATGCAGTATC CGTTCGGCGC GCTGGTTCGC
GAGCTCGGCC TGCAGGGCGC GGAGGACGGC GCGCCGATGT ACCGGATCGC GTTCATGTAT
CAGGATTTTC TCGCGCGCCT GCGCTTCAGC GACGACGTCG AACCGATCGG CGAGATTCGT
CAGGCGGGCG AGTATGAGCT CGTGCTCGAG GTGATCGAAG GAGCGGCGCC CGGCGGCCCC
GCGCGTTTCG CGCTGAACTG GAAGTACGAC GGCGCGCGGT ATCGCGCGGC CGCCGTGGAG
GCGATGGCGC GCCACTATCT GACGCTGCTC GACGGCGTGC TCGCGGCGCC CGCCGCGCGG
GTGGCCGATT GCCCGATGCT GCCCGCGGCC GAGCGCGAAC GGCTGCTCGC GCTTGGCCGC
GGCCCGCGCG CCGACCATGC GCGCGAGCGG CGCGTGCACG ACCTGATCGA TGCGCGCGCG
CAGCAGGCCC CGCACGCGAT CGCGGTGTCC TGCGGCGGCC GCTCGCTCGA CTATGCGCGA
CTGAAGGCCG ACAGCGACGC GCTCGCGCAG CGCCTGCGCG CGTGCGGCAT CGGCGCGGGC
GACTTCGTCG CGGTGCGGCT CGACCGCTCG ACGGCGCTCG TCGTCGGCCT GCTCGCGGTG
CTGAAGGCGG GCGCGGCATA CGTGCCGCTC GATCCCGACT ATCCGGACGA CTGGGCGGCG
CAAATGCTCG GCGATTGCCG CCCGGCCGCG ATCCTGACCC GCGCCGCGCT CGCGGCGGGC
GCGCACGCGC TCGCGCGGCG CGTCGCGGCC GACGGCCCGC CCGCCGTCAT CGCGCTCGAC
GACGCCGCCG ACGCCGACAC CCACGCCGCC GACGGCGCAC GCGCGGCCGC GATCGCCGCC
GCGCGGCAGG CCGCGGCGAG CCGCGCGCAC GCCGCGCGGG CGGCCGATCT CGCCTACGTG
ATCTACACGT CGGGCAGCAC GGGCGCGCCG AAGGGCGTGA TGGTCACGCA TCGCGCGCTG
ACCAACTTCC TCGCGTCGAT GGCGCGCCGC CCGGGCCTGC ACGCGCGCGA CACGCTGCTC
GCCGTCACCA CGTACTGTTT CGATATCGCG GCCCTCGAGC TGTTCCTGCC GCTCGTGCAG
GGCGCGCACT GCGTGATCTG CGACAGCGCG TCGGCGCGCG ACGGCGGCCG GCTGCGCGAG
CTGATCGACG CGGCGCGCCC GACGGTGATG CAGGCGACGC CGTCCACGTG GGAGATGCTG
CTGCATGCCG GCTGGCGCAA CGCGCGGCGC ATGCGCGTGC TGTGCGGCGG CGACACGCTG
CCGGACGCCG TCAAGGCCCG GCTGCTCGAG GACGGCGGCG AAGTCTGGAA CCTGTACGGC
CCGACGGAGA CGACGATCTG GTCGATGGTC GCGCCGGTGA CGGCGGAACG GCCGACCTCG
ATCGGCGCGC CGATCGACAA CACGCGAATC CGGATCGTCG ATGCGTACGG CAATCCGGTG
CCGATCGGCG TGCCGGGCGA GCTGTGCATC GCGGGCGACG GGCTCGCCGC GGGCTACCTG
AACCGGCCCG ACGAAACGGC GGCGCGGTTC GTCGATGCGC TGCCCGACGT GGACGGCCAG
GCGCGCGAGC GCCATTACCG CACCGGCGAC CTCGCGCGCT GGCGCGAGGA CGGCGAAGTC
GAGCATCTCG GGCGCATGGA TTTCCAGGTG AAGATTCGCG GCCATCGCGT CGAAGTGCAC
GACATCGAGC GGCATCTCGC GCGGCATCCG GCGATCCGGG CGGCCGCGGT GGTCGCGCGG
CGGCACGCGG GCGGCGATCA GCTCGTCGCC TACTACGTGC GCGGCGACGC CGCCGGGCAC
GGCGGCGCGG ACGACGCGCC GGCGCTGGCC GCCGAGCTGC GCGGCCATCT GGCCGGCGCG
CTGCCGGACT ACATGATTCC CGCGCTGTTC CTGCCGATCG ACGCGCTGCC GATGACGCAC
AACGGCAAGC TGAACCGCAA GGCGCTCGCG AGCCGCGGCA TCCGGCTGCG CGTCGCGTCG
TCGGGCGAGC GCCGCGCCGC GCCGCCGCGC GCGCCGGCCG CCGCCGATAT CGAGGCCCGC
CTGCTCGCGA TCTGCCGCGA GGTGCTGAAG ATCGACGACA TCGATCGCGC GGACGGCTTT
TTCGAGGTGG GCGGCAATTC GCTGTCGGTG GCGCTGATCG CCTCGCGCGT CGGCGCCGAG
TTCGGCCTCG CGCGGCTCGG CGCCGGCGCG TTCTTCCGCT ATCCGACGGT CGCCGCGCTG
GCCGCCCATC TGGGCGCGCG GCTGCGCGGC GACGCGGGCG CGGCCGAGGG CGCGGACGGC
GCGGACGCCG GCCCGGCCGG CGCCGACGCG CGCGCATCCC GCCCCGCGCA GCCGCGGGCG
GCCGGGCCCG CGGCGCGACT GCCCGCGGCG CTCGACGACG CGATCGCGAT CATCGGCATC
TCGTGCCAGT TTCCCGGCGC GCAAGACCAT CGCGCGTTCT GGCGCAATCT GCGCGACGGG
AAATCGGGCG CGCGGTTCTA TTCGGAAGAC GAACTGCGCG CGGCCGGCGT GCCGGACACG
CTGATCCGCG ACCGGCACTA CGTGCCGATG CAGCAGACGA TCGAAGGCAA GGACCTGTTC
GACCGGCACT TCTTCCGGCT GACGACGAAG GATGCGCAAC TGATGGACCC GCAATTCCGT
CTGTTGCTGC AGCACGCGTG GAAGGCGATC GAGGACGCCG GCTGCACGCG CGAGCGGATC
GCCGACGCCG GCGTATACAT GTCGGCGTCG AACAGCTACT ACCAGGCGAT GCTGCGCGCG
GCCGGCACGA TCGACGCGTC CGACGAGTAT CAGGCGTGGC TGCTCGCGCA GGGCGGCACG
ATTCCGACGC GCATCTCGTA CGAACTCGGC CTGACGGGGC CCAGCCTCTT CATCCATTCG
AACTGCTCGT CCGGGCTCGT GTCGCTTTCC GTCGCGGCGA AGTCGCTGCT GCAGCGGGAA
AGCCGCTGCG CGCTCGTCGG CGCGGCGACG GTGCTGCCGG ATGCGGACAT CGGCTACGTG
TACCAGCCGG GGCTCAACCT GTCGAGCGAC GGCCGCTGCC GGACCTTCGA CGAAAACGCC
GACGGGCTCA CCTCCGGCGA AGGCGTCGCC GTGCTGCTCG TCAAGCGCGC GCGCGACGCG
ATCGACGACG GCGACCCGAT CTACGCGCTG CTGCGCGGCA TCGCCGTGAA CAACGACGGC
GCGGACAAGG TCGGCTTCTA CGCGCCGAGC GTCGGCGGCC AGGCCGACGT GATCCGCAAG
GTGCTCGATG CGACCGGCAT CCATCCCGAG ACGATCGGCT ACGTCGAGGC GCACGGCACC
GGCACGAAGC TCGGCGATCC GGTGGAGGTG GCGGCGCTCA CCGACGCGTA TCGCCGCCAT
ACCGCGCGCA CCGGATTCTG CGCGATCGGC TCGGTGAAGC CGAACATCGG CCACCTGGAT
ACCGTCGCCG GGCTGTCGGG GTGCATCAAG GTCGCGCTGA GCCTGCGGCA CGGCGAGATC
GCGCCGTCGA TCAACTACGA GAAGCCGAAC CGCGAGATCG ATTTCGCGCA CTCGCCGTTC
TACGTCGTCG ACCGATTGAC GCGCTGGCCC GCGCGCGAGC CGGGGGCGCC GCGCCGCGCG
GCGCTCAGCT CGTTCGGCAT CGGCGGCACC AACGCGCATT TGATCCTCGA GGCGTTCGAG
CGCGACGAGC CGCCCGCCGG AATGCGCGCG CCGGCCGCCC GCGCGGCGCG CGTGATCGCG
CTGTCGGCGC GCACCGAAGA GCGCGTGCGC GCGCAGGCGA GCCAGTTGCT CGCGTTCCTC
GAGCAGGAAG CCGGCGCGCT GCCGGACTTC GACGGTTTCG CGTTCACGCT GCAGGTGGGC
CGCGAGGCGA TGCGCGAGCG CGTCGCGTTC GTCGCCGACG GCTACGACGC GCTCGCCGCC
GCGCTCGCGC GTTTCCTGCG CGGCGAGCCG GACGCGGCCG CGTGTTTCAC CGGCGCGCGC
GGGGGCGATT CGACGCTCGC GGCGCTGCTC GACGATACCG GCGATACCGG CGATACGGCC
GCGCACGGGT TGATCGCCGC GTGGTGCGAG CAAGGCAAGG TCGCGAAGAT CGCGGCGCTC
TGGGCGCACG GTGTGAACGT CGATTGGCGC CGGCTTTACG GCGCGCGCGC GCCGGTGCGT
GTGAGTCTGC CCACCTATCC GTTCGCGCCG GAGCGTTGCG AGGGCGTCGC GCGCCGCCGC
GCCGCCGCGC CGGCGCCGCG CCGCGCGGGC GTCGAGACGG CCGCGGCGCG GCTGCATCCG
CTCGTTCACG ACGACCGCTC GGACGGGGCG CGCCGGCGGT TCGCCGCGAC GTATTCCGGC
GAGGAATTCT TCCTGGCCGA TCATCTGATC CGCGGCAAGC GGATCCTGCC CGGCGTCGCG
TATCTCGAGA TGGCGCGCAT GGCGGCCGTC CGGGCGCACG GCGACGGCGC GCTGAGCCTG
CACGACGTGG TCTGGATGAC GCCCATCGTC GTCGACGGGC CGTGCGAGGT CGAGCTGAGC
CTGGAGGCCG CCGAGCGTGC CGAGGCCGAG GGGGCCGCCG AGGCGGCCGC CGGCGTGCGC
ACGATGCGGT TCAACGTGAC CTCGGGCGGC GGCGCCGGCG CGCGCCGCAC GAACAGCCAG
GGGACGATTC GCCTCGCGCC CGGCGCGGCC GCGCCCGCCG CCGCGCGCGT CGATGTCGCG
GCGCTCCTCG CGCGCTGCAC GCGCGAGATC GGGGCGCAGC GGTTCTACAC GTTCCTCGAC
AGCGGCGGCG GCCATTACGG GCCGACGTTC CGGAGCGTCG CGGCGCTGCA TCAGGGCGAG
CGCGAGGTGC TCGCGCGGCT CGCGCTGCCG GAGTCCGTCG CGCACGCCGA TGCGTTCGTG
CTGCATCCGA GCATGATGGA CGCCGCGTTC CAGATCGCCG ACAGCCTGAT CCTGCAACCG
CGCGCGAACG GCGGCTGTCT GCCGTTCTTC GTGAAGGAGC TCGTCGTGCG ACGCCGGCCG
GGCCGCGACG CGTGGGTCCA CGTTCGCCTC GCCGGCGGCG ATGCGACGCT TGCCCGCTAC
GACATCGATC TGATCGACCC CGACGGCACC GTCTGCGTGT CGATGCGCGA ATTCAGCGCA
CGCGCGGAGA CGGCGGGCGG CAGCGGCCGG CCGAACACGT ACCGCGCCGC CGAATGGCGC
GCCGCGGAGT GCGACGGCGA GCGCGACGGG AACGAGCTGA GCGAGCTGAA CGAGCTGAAC
GAGGGGAACG AGCGGCGTCG CGCCGCGCCG CGGGTGGCGG TGCTCGACGC ATCGCCGCGT
CTTGCGCACG CGCTGCGCGG CATCGGCGTC GACGCGCTCT GGCTGCCGGC CGACGCGGCG
CACGCGGCGC GCGGGCCGGC GCTGCGGGAT CTCGACGCGG CGCTGCACGC CGGCGCGGCG
CGCGATCTGC TCGTGCTCGC CGACGAACGG CGCGAGCTCG ACGACGACGC GTTGCGCGCG
TGGCTCGACG GCGCGCCGCA CGCCGGGGGC GCGCGGCGGG CGCTCGTGTC GATCGCGGGG
CTGGCCGACG CCGACGCGCG CGCCGTGGCG GACATCGTCG AGCGCGAGCG GCATGGCCGC
GCCGCCGACG TCCGCTACGA CGCCGGCGGC GCGCGCAGCG TGCGCGGCTT CGCCGACGCG
GCCGTCGCGC GCTGGCTGCT CGACACGGAC GCGCTGCGCT CGGGCGGCGT GTACTGGATC
GCCGGCGCGA ACGGCCCGCT CGGCGCGAGC CTTGCCTGCC ACCTCGCGAC CGTGGAGCGC
GCGACCGTGG TGCTGACCGA CGCGCACGCG ATCGATGCGG CCCGGCTCGC CTGCCTCGAC
GGGTATCGCG CCGGCGGCGC GCGCCTCGAG TTCATCGAAG GCGACGCCGC GCGAGACGGC
GCGGCGCTCG CGCAGCGGAT CCGCGCGCGT CACGGGCGCA TCGACGGCGT GCTGCACTGT
GCGCAACACG CGTCGGCGCC GACGCTCGCG GCGCTGGCCG CGCTCGACCG CGCGACGCGC
GCCGACGCGC TCGATTGTTT CGTCGCCTGC GAGGCGCGGG ATGCCGATCC GGATCGCGAT
CCGGCGGCCG CGCTCGTGGC GCGATTCGTC GAGCGGCGCG ACGCGCGCGT GCAAGCGGGA
CTCGGCGGGG GGCGGACGGT GGCGATCGCG GCGCACGCGG CGCTGCCGTG GCCGGACGAC
GCGCCGCTGC TGCGCGCGGG CGGTATTGCG AGCCAGCCCG CGCTGGCGAT CGTGCAGGCG
CTGCATCATG CGTTGCGCTC GGACGAGGCG ATGCTCGCCG TCGGCTGGGG GGCGTCGGCC
GGCGGCGTCG ATGCGAACGC ATCGAACGCA TCGAACGCAT CGAACGCATC GAACACATCG
AACACATCGA ACACATCGAA CACATCGAAC GCGCCGAGCG TCGCCGCGGA TCTTGCCGCG
CCCGCCGAGC CGAACGCGCG GATTCCCGCG CGCGCGCGGG CCACGCCGGA CACGATCGCG
GCATGCCTGA AGGCCGTGAT CGCCGACGTG ATCCGGGCGG ACGTCGACGA AATCGACGCG
CGCCAGCACT TCGGCGAATA CGGACTCGAC TCGCTGTCGC TGACGTCGGT CAGCAACCGG
CTCAATGACG CATACCGCCT CGATGCTTCG CCGGCGGGCG CGCTGAATCC GACGCTGTTC
TTCGAATACC CGAGCGTCGA GCGGATGGCG GCTTATCTCG CCGAGCATCA CGCCGCGCGC
TTCGCCGACG CGTCGGCCGC ACCCGGCGCC GACGGGGCGG CCGAGTGCGC GCCGCGGCCC
GAGGCCGCGC TCAACGCCGA GGTCGAACCT GGGAACGGGG CCGCGCCCGC GCCCGCGCCC
GAGCCCGAGC CCGAGGTCGG GTTCAGGGCC GGGTTCGAGC CGGTCGCGCC GCCGATGCCG
CGCATCGAGC CCGCCGCATC GACGCCGCCG GACCAACCGG CGCCGCAACC CGGCGGTGCA
TGGCACGCCG GGCGCGGCGC GCGCCCGGCG GCCGACGACG ACGTCGCGAT CATCGGCATC
AGCGGCCGCT TTCCCGGCGC GCGCGACGTG GCCGAATTCG GCCGCAATCT GTTCGACGGC
CGGGACTGCA TCGGCGAGAT TCCCGCGGAC CGCTGGGACT GGCGCGCGTA CCTCGGCGAT
CCGCAGCACG AGGCGGGCAA GACGAACAGC AAGTGGGGCG GCTTCATCGA CGGCATCGCG
GAATTCGATC CGCTGTTCTT CAGCCTCTCG CCGAAGGAGG CCTATCTGCT CGATCCCGCG
CACCGGCTGC TGCTGATGCA CGCGTGGTGG GCGATCGAGG ACGCCGGCTA CAACCCCGCC
GCGCTCGCCG GCAGCCGGAC CGCGCTGTTC GCGGGCATCG CGCAGAGCGG CTACGCGGAT
TTGCGCAGGC AAGCCGGCGA GGGGATCGAG GGCAACTCGT TCCTCGGGGT CGTGCCGTCG
ATCGCGCTGA ACCGGATCAG CCACCTGCTC GATCTGCACG GCCCGAGCGA GCCGGTCGAG
ACGGCCTGTT CGTCGTCGCT CGTCGCGATG CACCGCGCGC TCGTCAGCCT GCGCTGCGGC
GACGCCGACA TGGCGCTCGT CGGCGGCGTG CAGACGATCC TGTCGCCGCA CGCGCATATC
GGGTTCGGCA AGGCGGGCAT GCTCGCGACC GACGGCCGCT GCAAGGCGTT CTCGAGCCGC
GCCGACGGCT TCGTGCGCGG CGAGGGCATC GGCATGCTGT TCCTGAAGCG GCTCGGCGAC
GCGCGGCGCG ACGGCGACGC GATCTACGGC GTGATCCGCG GCAGCGCGGT CAATCACGAC
GGCCGGTCGA GCTCGCTCAC CGCGCCGAAC CCGGCCGCGC AGCGCGACGT GATCGTGCAG
GCGCACATGC GAGCCGGCGT CGACCCGCGC AGCATCGGTT ACATCGAGGC GCACGGCACC
GGCACGAAGC TCGGCGATCC GATCGAGATC AACGCGCTCA CGCAGGCGCT CGACACGCTG
CTGCGCGCGC AGCGCGAGGA AGGCGCCGCC TACGTTCCCG GCGCGTGCGC GATCGGCTCC
GTGAAGAGCA ACATCGGCCA TCTGGAGCTG GCCGCCGGCG TGTCCGGCGT GATCAAGGTG
CTGCTGCAGA TGGCGAACGG GCGGCTCGCG AAGAGCCTGC ATTGCGACGA GCTCAATCCG
TACATCACGC TCGACGGCGG GCCGTTGCGC GTCGTCGGCG CGAACGCCGC GTGGCCGCGT
CCCGTCGATC GCGACGGCCG CGAGCAGCCG CGCCGCGCGG GCGTGAGCTC GTTCGGCATC
GGCGGCGTGA ACGCGCACGT CGTGCTCGAG GAGTATCCCG AGGCCGACGC GCGCGCGCGC
GACGACGGGC AGCCGGCCGC CGTGCTGCTG TCCGCGCGGG ATTCGCAGCG GCTCGCCGAT
TACGCGAGCG CATTGCTCGC GTTCGTGCGC GAGCGGCGCG AGGCGGCCGC GCATGCGCCG
CCGCCGCGGC TGTCGGATCT CGCCTATACG CTGCAGGTGG GCCGCGAGGC GATGCGCGAG
CGTGTCGGCT TCGTCGTCAC GTCGCTCGCG CAACTCGAGG TGCGGCTCGC CGCGTTCGTC
GCGGGCGAGC CGGCGGGCGA CGGCGTCTAC CGCGGCAGCG TCCGCCCGGC GCGCGGCGAA
CGCGCGGCCG ACGCGGACGG CCTCGACAGG CTCGTCGACA TCTGGCTCGC GAACCGCAAG
CATGAGGCGC TGCTCGGTGC GTGGGTGAAG GGCGCGGCGA TCGACTGGGC GAGACTTCAC
GCGGGCGGCG CGCCGCGCCG CGTCCATCTG CCCGGCTATC CGTTCGCGCG CGAGCGCTAC
TGGATCGCCG AGCCCGCGCC GGCGACGGGC GAGCCCGCGC CGCCGCGCAT GCCGACGCAG
CCGCACGGGC CGACGTCCGA CGGCCGCGCC GAATCGCGCC ATCCGTTGCG GCGCGACGCC
GCCGACGGCC GGTTCCTGCT CGATCTCGAC GGCGACGAGG CCTTTCTCGC CGACCATCGG
GTGGACGGAC GCCGCGTGCT GCCGGGCGTC GCGCACCTGG AGATCGCGTA CGAGGCCGCG
CGGCGCACGT TCGGCCCGGC CGATGCGATC CGGATCCGGA ACCTCGGCTG GATCAGGCCG
ATCGTCGCCG ACGGCGCGCT GCGCATCGGC GTCGAACTGA GCACGTCCGG CGCCGCCGAA
GGCGCGTTTC GCCTCTACAC GACGGATCCG CAACATGGGC GGCTCACGCA CAGCGAAGGC
GCGATCGGCC GCGCCGACGT CGCGCAGTCG GCGCGCGCGC TCGATCTCGG CGCGCTGCGC
GACGCGTTCG CGACGGCCGA GCGCGTCGAT CCGGCCGTCT GGTACGACGG CTTCTCGCGC
GCCGGCATCG ATTACGGCCC GAGCCACCGC TGCCTCGAAA CATGCGCCGT CGGCCCGGCC
GGCGTGCTCG CGCGGGTGCG CCTGCCGGCC GCCGAGGCGC GCGCGGCGCG GCCGTTCACC
TTGCATCCGG GCCTGATGGA CGCGGTGCTG CAGGCGGCGA TCGGCCTGCG CAAGCGCGCG
GGCGGCGCGC CGCGCGGCAC GCCGTATCTG CCGTTCGCGC TCGACACGGT CGAGATTCTC
GGCGGCTGCG GCGAGGCGGC GTGGGCATGG CTGCGCCCGT CGCCGCGCGA CGCGGCCGAC
GCTTCGGCGT CGCGCGGCGA CGCGGGCAAG CCGGCCGCCG AGCGCATCGA TATCGATGTG
TGCGACGACG CGGGCCGGAT CAGCGTGACG CTTCGCGGGC TCACGTCGCG CCCGCTCGCG
CGCCGGACGG CGCCGGCTCC CGAGGCCGGG AACCCGGCCG GTAAAGTCGG CGAGGTGGCC
GACGCCACCG ATGCCGACGC CGCTGAAGTC CGCGAAATCT CCAACGTCTC CAACGTCTCC
AACGTCTCCG ACGTCTCCGA CGTTTCCGAC GTTTCCGACG TCGCGCCGCT CGCCGACGGC
GACGTCGGCC TGCTCGCGCG AACCGCGGTG TGGAGCGCGC TGACGCCGGC GCAGTGGCTC
GCGGATCCGG CGTCGCGCCC GCGCGCCGGC GCGCGCGTGT TCGTGCTCGG CGGCACCGCC
GCGCAGCGGC GCGAGATCGC GCGGATTCAT CCCGGCTGCG AACCGCTTGA GGCGAATGCG
GCCGACGACG GCGGCGACGG CGCGGACCAA CAGGCGCACG TCGACGCGCT GCGGCGGCGG
CTCGCCGAGG GCGCGCCGAT CGACCAGCTC GTCTGGATCG CGCCGCCGGA GCCGGCCGCC
GACGCGCGCG CCGGGCTGCG CGGCGACGCG ATCGTCGCCG CGCAGGAGCA CGGGGTGCTG
CAACTGTTCC GGATCGTCAA GCTGCTGCTC GCGGCGGGCT ACGGCGGCAA GCCGCTCGAC
TGGACGATCG TCACGCGCGA AACGCACGCG ACGAGCGGCA TCGACGAGCC GTCGCCGACG
CACGCGGGCG TGCATGGGTT CGTCGGCTCG ATGGCGAAGG AGTACCGGAA CTGGCGTGTC
CGCCTGCTCG ACATGCCCGC GCGCGAGGCG TGGCCGATCG ACGCGATGTT CTCGACGCGC
TTCGATCCGC GCGGCGATGC GCTCGCCTAT CGGCGCGGCC GCTGGCTCGC CCGCGAGCTG
GCCGCGATCG ACGCGTTGCC CGACGGCGGC TGTCATGTGA AGACGGGCGG CGTCTACGTG
GTGATCGGCG GCGCGGGCGG GATCGGCGAA GTCTGGAGCC GCTGGATGAT GGAGCGCTAT
CAGGCGCGGA TCGTCTGGAT CGGGCGCCGC GACGAGGACG AGCAAATCCG CCGCAAGCGC
GAGCGGCTCG CGCGCTACGG CACGCCGCCC GTCTACCTGC GCGCGGACGC GAGCGAGCGC
GCGTCGCTCG CGGCGGCGCG CGAGCGGATC GCCGCGCTGC GCTGGGACGG CCGCGCGCTG
CCGACGAGCG GCGTCGTGCA TTCCGCGATC GTGCTGGCGG ATGCGAGCCT CGCGACGATG
GACGAGGCGC GCTTTCTGGC CGCGTGGCGA TCGAAGGCGG ATGTCAGCGT GCGCGTCGCC
GAGGTCTTCG GCGGCGATCC GCTCGATTTC ATGCTGTTCT TCTCGTCGAT CACGTCGTTC
GGCAAGACGG CCGGACAGGC GAACTACGCG GCGGGTTGCG CGTTCAAGGA CGCGTTCGCC
GCGCATCTCG GCCGCACGCT GCCGTATCCC GTCAAGGTGA TGAACTGGGG CTACTGGGGC
AGCGTCGGCG TGGTCAGCGA CGAAACCTAT CGCCGGCGCA TGGCGAGCGC GGGCTTCGGC
TCGATCGAGC CCGACGAGGG CATGTCGGCG CTGGAGCGGC TGCTCGCCAG CCGCGTCGGC
CAGATCGCGG TGCTCAAGAC GCTGCGGCCG AACCTCGTCG GCGACTCGCG CGCGGACCGG
ATCCGGCATT ACCCCGGCCG CGACTGGCCG GACGCGGCGC CCGCGCCGGC GACGGCCGCG
CTGCAGGCGG CGCTCGCGGC GCGCGCCGGG CGCTGGCACG CGCAGGCGTC GGCGCTCGCG
CTCGGCAATC CCGAGCTGGA GACGCTGATT GCGCGCGGCC TGCTCGCGGG CGTCCTTCCG
TATCTCGACG CGCCGGGCTC GGTCGACGCG CGCCATGCGC GGTGGTTCGA CGAAAGCCGG
GCGATGCTGC ACGGGTTCGG CTATCTCGCG CGCGACGGCG CGGGCGACGC GCCGTCCTGG
TCGCTCACCG ACGCCGGCCG CGCGGCGGCG CCGCACGTCT GGCAAGACTG GGAGCGGCAC
GCGCTCGCGT GGCACGACGA CGAGCGGCGC GTGCCGATGC GGCTCGCGCA CGTCTGCCTG
CGCGCGCTGC CCGAGCTTCT CGGCGGCAAG CGGCGCGCGA CCGACGTGAT GTTCCCGGGC
TCCAGCATGG CGCTCGTCGA GGGGCTGTAC AAGAGCAATC GCAAGGCCGA TCTGTTCAAC
GACGTCGTGC ACGACGCGGT GCTGTCGTAT GCGCGCGCGC TCGGGCGCGC GCTCGACATC
GTCGAGGTGG GCGCGGGCAC GGGCGGAACG ACGGACGGCC TGCTGCGCAA GCTCGTCGAG
CAAGGGATCG CGGTGCGCGA ATACCGGTAT ACGGATCTGT CGCACGCTTT TCTGCTGCAT
GCGCGCGAGC ATTACGCGCC GCGCGCGCCG TTCCTGACGA CCGGGATCTT CGACGTCGAC
AAGCCGATCG CCGCGCAGCG CGTGCCGGGC GGCCGCTATG ACGTCGCGGT CGCGACCAAC
GTGCTGCACG CGACGCGCGA CGTCCGGCGC GCGCTGCGCA ACGTGAAGGC GACGCTGCGC
GCGGGCGGCT TGCTGATCCT GAACGAACTG AGCGTCAAGT CGCTGTTCAG CCATGTGACG
TTCGGGCTGC TGGACGGCTG GTGGATGTAC GAGGACGCCG ATTTGCGGAT ACCCGGCTCG
CCCGGCATCG ATTCGTCGAC GTGGCGGCGC GTGCTGGCGG AAGAGGGCTT CGAGTATGTG
TTCTTCCCCG CGCAAGGGCT GCATGCACAT GGCCAGCAAG TCATCGTCGC GCAGAGCGAC
GGCGTGGTCC GGCAGCCGCG CGCGGCCGCC GCGCCGGGGG CCGGCGCGGC CGCGTCGCCT
TCGGGCGGCA CGCAAGCGGC GGTGCCGGCG CGCCGGGCGG CCGCGGCATC CGGCGCGCCG
CGCGTGGAGG CGATTCCGCC GGCGGCCGTT GCGCCCGCGG CCTTCGATGC CGCCACCGCG
GCTCCTCCCG GCACCGCTGC CGCTGCCGCG ACGGCGGTGC CGGCGGACGG CCGATCCGCG
CTCGCCCACG CAAGTTCGCC GGTCGCCTCG CCGCCGCAGC CGGGCGACGC GCCCGCGCTC
GAACGAATGC ATGCGTATTT GCGCGACAAG CTCTCGCAAG TGCTGAAGCT GCCGCCGGAG
CGCATCGAGC CAGACGCATC GTTCGCGAGC TACGGCGTCG ATTCGATCAT GGCGATGGCG
TTGATCACGG CGCTCGAAAA GGAGCTGGGC AGCTTGCCGA AGACGCTGTT CTTCGAGCAC
GAAACGATCG AGGAACTGGG CGCGTATCTG CTGGAGCGTT GCGAGCCGAT GCCTTCGGGC
GTGGAGCCGG CGACGGTGGG GGCGGACGAT CGCGCCGCGT ATTCCGGCGC GAGGCCGCAC
GCCTGGCCCG CGTCGCCCAC GGAGCCCGAC GAGCCCACCG AGCCCACCGC ATCGCCCGTC
TCATCCGCCC CGCCGGCCGC CTCGCCGCCG CAGCCGGGCG ACGCGCACGC GCCCGAACGA
ATGCATGCGT ATCTGCGCGA CAAGCTCTCG CAAGTGCTGA AGCTGCCGCC GGAGCGCATC
GAGCCAGATG CATCGTTCGC GAGCTACGGC GTCGATTCGA TCATGGCGAT GGCGTTGATC
ACGGCGCTCG AAAAGGAGCT GGGCAGTTTG CCGAAGACGC TGTTCTTCGA GCACGAAACG
ATCGAGGAAC TGGGCGAGTA CCTGCTGGAG CGTTGCGAGC CGATGCCTTC GGGCGTCGAG
CCGGCGACGG TGGGGGCCGA CGATCGCGCC GCGTATTCCG GCGCGAGGCC GCACGCCTGG
CCCGCGTCGC CCACGGAGCC CGACGAGCCC ACCGAGCCCA CCGCGTCGCC CGCCTCATCC
GCCCCGCCGG CCGCCTCGCC GCCGCAGCCG GGCGACGCGC CCGCGCCCGA ACGAATGCAT
GCGTATTTGC GCGACAAGCT CTCGCAAGTG CTGAAGCTGC CGCCGGAGCG CATCGAGACG
GACGCATCGT TCGCGAGCTA CGGCGTCGAT TCGATCATGG CGATGGCGTT GATCACGGCG
CTCGAAAAGG AGCTGGGCAG TTTGCCGAAG ACGCTGTTCT TCGAGCACGA AACGATCGAG
GAACTGGGCG AGTACCTGCT GGAGCGGCAA GGACAAGAGA GGGCGTGCCA TGCAAGCAAC
GTTTAA
 
Protein sequence
MNAIISDVIR AYREGKLTTA DLASELRRGA GDGAGLPLSE GQRGIWALHA LHEDRGAYNV 
PLCFAVRDLR ADAFRRALRF VARQYPSLCA AIRVIDGEPR RVQPAGATLE PIEATLADTL
GPDADEAAIL AWLREQARQP FSLEDGPLCR VHLLDLAGWR ARDAAAASRF GAAHTIVSLH
VHHLVLDGQS LLLLIGTLLD AYRALVDGVE PAPRAPAATH DDFVAEERAL LDSDEGARRI
AYWRRQLDAL PPALELPASA PAAAERAAGD AWHAVPLDAA RSARVAAFVQ SNHLGAAAFF
LGMFKLLLHR YTGEPDIVVG MPADARPSQR YRDALGFFVN MLPLRTRLAG ETAVVAMLER
VQRELVDAMA MQYPFGALVR ELGLQGAEDG APMYRIAFMY QDFLARLRFS DDVEPIGEIR
QAGEYELVLE VIEGAAPGGP ARFALNWKYD GARYRAAAVE AMARHYLTLL DGVLAAPAAR
VADCPMLPAA ERERLLALGR GPRADHARER RVHDLIDARA QQAPHAIAVS CGGRSLDYAR
LKADSDALAQ RLRACGIGAG DFVAVRLDRS TALVVGLLAV LKAGAAYVPL DPDYPDDWAA
QMLGDCRPAA ILTRAALAAG AHALARRVAA DGPPAVIALD DAADADTHAA DGARAAAIAA
ARQAAASRAH AARAADLAYV IYTSGSTGAP KGVMVTHRAL TNFLASMARR PGLHARDTLL
AVTTYCFDIA ALELFLPLVQ GAHCVICDSA SARDGGRLRE LIDAARPTVM QATPSTWEML
LHAGWRNARR MRVLCGGDTL PDAVKARLLE DGGEVWNLYG PTETTIWSMV APVTAERPTS
IGAPIDNTRI RIVDAYGNPV PIGVPGELCI AGDGLAAGYL NRPDETAARF VDALPDVDGQ
ARERHYRTGD LARWREDGEV EHLGRMDFQV KIRGHRVEVH DIERHLARHP AIRAAAVVAR
RHAGGDQLVA YYVRGDAAGH GGADDAPALA AELRGHLAGA LPDYMIPALF LPIDALPMTH
NGKLNRKALA SRGIRLRVAS SGERRAAPPR APAAADIEAR LLAICREVLK IDDIDRADGF
FEVGGNSLSV ALIASRVGAE FGLARLGAGA FFRYPTVAAL AAHLGARLRG DAGAAEGADG
ADAGPAGADA RASRPAQPRA AGPAARLPAA LDDAIAIIGI SCQFPGAQDH RAFWRNLRDG
KSGARFYSED ELRAAGVPDT LIRDRHYVPM QQTIEGKDLF DRHFFRLTTK DAQLMDPQFR
LLLQHAWKAI EDAGCTRERI ADAGVYMSAS NSYYQAMLRA AGTIDASDEY QAWLLAQGGT
IPTRISYELG LTGPSLFIHS NCSSGLVSLS VAAKSLLQRE SRCALVGAAT VLPDADIGYV
YQPGLNLSSD GRCRTFDENA DGLTSGEGVA VLLVKRARDA IDDGDPIYAL LRGIAVNNDG
ADKVGFYAPS VGGQADVIRK VLDATGIHPE TIGYVEAHGT GTKLGDPVEV AALTDAYRRH
TARTGFCAIG SVKPNIGHLD TVAGLSGCIK VALSLRHGEI APSINYEKPN REIDFAHSPF
YVVDRLTRWP AREPGAPRRA ALSSFGIGGT NAHLILEAFE RDEPPAGMRA PAARAARVIA
LSARTEERVR AQASQLLAFL EQEAGALPDF DGFAFTLQVG REAMRERVAF VADGYDALAA
ALARFLRGEP DAAACFTGAR GGDSTLAALL DDTGDTGDTA AHGLIAAWCE QGKVAKIAAL
WAHGVNVDWR RLYGARAPVR VSLPTYPFAP ERCEGVARRR AAAPAPRRAG VETAAARLHP
LVHDDRSDGA RRRFAATYSG EEFFLADHLI RGKRILPGVA YLEMARMAAV RAHGDGALSL
HDVVWMTPIV VDGPCEVELS LEAAERAEAE GAAEAAAGVR TMRFNVTSGG GAGARRTNSQ
GTIRLAPGAA APAAARVDVA ALLARCTREI GAQRFYTFLD SGGGHYGPTF RSVAALHQGE
REVLARLALP ESVAHADAFV LHPSMMDAAF QIADSLILQP RANGGCLPFF VKELVVRRRP
GRDAWVHVRL AGGDATLARY DIDLIDPDGT VCVSMREFSA RAETAGGSGR PNTYRAAEWR
AAECDGERDG NELSELNELN EGNERRRAAP RVAVLDASPR LAHALRGIGV DALWLPADAA
HAARGPALRD LDAALHAGAA RDLLVLADER RELDDDALRA WLDGAPHAGG ARRALVSIAG
LADADARAVA DIVERERHGR AADVRYDAGG ARSVRGFADA AVARWLLDTD ALRSGGVYWI
AGANGPLGAS LACHLATVER ATVVLTDAHA IDAARLACLD GYRAGGARLE FIEGDAARDG
AALAQRIRAR HGRIDGVLHC AQHASAPTLA ALAALDRATR ADALDCFVAC EARDADPDRD
PAAALVARFV ERRDARVQAG LGGGRTVAIA AHAALPWPDD APLLRAGGIA SQPALAIVQA
LHHALRSDEA MLAVGWGASA GGVDANASNA SNASNASNTS NTSNTSNTSN APSVAADLAA
PAEPNARIPA RARATPDTIA ACLKAVIADV IRADVDEIDA RQHFGEYGLD SLSLTSVSNR
LNDAYRLDAS PAGALNPTLF FEYPSVERMA AYLAEHHAAR FADASAAPGA DGAAECAPRP
EAALNAEVEP GNGAAPAPAP EPEPEVGFRA GFEPVAPPMP RIEPAASTPP DQPAPQPGGA
WHAGRGARPA ADDDVAIIGI SGRFPGARDV AEFGRNLFDG RDCIGEIPAD RWDWRAYLGD
PQHEAGKTNS KWGGFIDGIA EFDPLFFSLS PKEAYLLDPA HRLLLMHAWW AIEDAGYNPA
ALAGSRTALF AGIAQSGYAD LRRQAGEGIE GNSFLGVVPS IALNRISHLL DLHGPSEPVE
TACSSSLVAM HRALVSLRCG DADMALVGGV QTILSPHAHI GFGKAGMLAT DGRCKAFSSR
ADGFVRGEGI GMLFLKRLGD ARRDGDAIYG VIRGSAVNHD GRSSSLTAPN PAAQRDVIVQ
AHMRAGVDPR SIGYIEAHGT GTKLGDPIEI NALTQALDTL LRAQREEGAA YVPGACAIGS
VKSNIGHLEL AAGVSGVIKV LLQMANGRLA KSLHCDELNP YITLDGGPLR VVGANAAWPR
PVDRDGREQP RRAGVSSFGI GGVNAHVVLE EYPEADARAR DDGQPAAVLL SARDSQRLAD
YASALLAFVR ERREAAAHAP PPRLSDLAYT LQVGREAMRE RVGFVVTSLA QLEVRLAAFV
AGEPAGDGVY RGSVRPARGE RAADADGLDR LVDIWLANRK HEALLGAWVK GAAIDWARLH
AGGAPRRVHL PGYPFARERY WIAEPAPATG EPAPPRMPTQ PHGPTSDGRA ESRHPLRRDA
ADGRFLLDLD GDEAFLADHR VDGRRVLPGV AHLEIAYEAA RRTFGPADAI RIRNLGWIRP
IVADGALRIG VELSTSGAAE GAFRLYTTDP QHGRLTHSEG AIGRADVAQS ARALDLGALR
DAFATAERVD PAVWYDGFSR AGIDYGPSHR CLETCAVGPA GVLARVRLPA AEARAARPFT
LHPGLMDAVL QAAIGLRKRA GGAPRGTPYL PFALDTVEIL GGCGEAAWAW LRPSPRDAAD
ASASRGDAGK PAAERIDIDV CDDAGRISVT LRGLTSRPLA RRTAPAPEAG NPAGKVGEVA
DATDADAAEV REISNVSNVS NVSDVSDVSD VSDVAPLADG DVGLLARTAV WSALTPAQWL
ADPASRPRAG ARVFVLGGTA AQRREIARIH PGCEPLEANA ADDGGDGADQ QAHVDALRRR
LAEGAPIDQL VWIAPPEPAA DARAGLRGDA IVAAQEHGVL QLFRIVKLLL AAGYGGKPLD
WTIVTRETHA TSGIDEPSPT HAGVHGFVGS MAKEYRNWRV RLLDMPAREA WPIDAMFSTR
FDPRGDALAY RRGRWLAREL AAIDALPDGG CHVKTGGVYV VIGGAGGIGE VWSRWMMERY
QARIVWIGRR DEDEQIRRKR ERLARYGTPP VYLRADASER ASLAAARERI AALRWDGRAL
PTSGVVHSAI VLADASLATM DEARFLAAWR SKADVSVRVA EVFGGDPLDF MLFFSSITSF
GKTAGQANYA AGCAFKDAFA AHLGRTLPYP VKVMNWGYWG SVGVVSDETY RRRMASAGFG
SIEPDEGMSA LERLLASRVG QIAVLKTLRP NLVGDSRADR IRHYPGRDWP DAAPAPATAA
LQAALAARAG RWHAQASALA LGNPELETLI ARGLLAGVLP YLDAPGSVDA RHARWFDESR
AMLHGFGYLA RDGAGDAPSW SLTDAGRAAA PHVWQDWERH ALAWHDDERR VPMRLAHVCL
RALPELLGGK RRATDVMFPG SSMALVEGLY KSNRKADLFN DVVHDAVLSY ARALGRALDI
VEVGAGTGGT TDGLLRKLVE QGIAVREYRY TDLSHAFLLH AREHYAPRAP FLTTGIFDVD
KPIAAQRVPG GRYDVAVATN VLHATRDVRR ALRNVKATLR AGGLLILNEL SVKSLFSHVT
FGLLDGWWMY EDADLRIPGS PGIDSSTWRR VLAEEGFEYV FFPAQGLHAH GQQVIVAQSD
GVVRQPRAAA APGAGAAASP SGGTQAAVPA RRAAAASGAP RVEAIPPAAV APAAFDAATA
APPGTAAAAA TAVPADGRSA LAHASSPVAS PPQPGDAPAL ERMHAYLRDK LSQVLKLPPE
RIEPDASFAS YGVDSIMAMA LITALEKELG SLPKTLFFEH ETIEELGAYL LERCEPMPSG
VEPATVGADD RAAYSGARPH AWPASPTEPD EPTEPTASPV SSAPPAASPP QPGDAHAPER
MHAYLRDKLS QVLKLPPERI EPDASFASYG VDSIMAMALI TALEKELGSL PKTLFFEHET
IEELGEYLLE RCEPMPSGVE PATVGADDRA AYSGARPHAW PASPTEPDEP TEPTASPASS
APPAASPPQP GDAPAPERMH AYLRDKLSQV LKLPPERIET DASFASYGVD SIMAMALITA
LEKELGSLPK TLFFEHETIE ELGEYLLERQ GQERACHASN V