Gene BURPS1106A_A0556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0556 
Symbol 
ID4904084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp531626 
End bp543991 
Gene Length12366 bp 
Protein Length4121 aa 
Translation table11 
GC content66% 
IMG OID640143662 
ProductLysM domain-containing protein 
Protein accessionYP_001074592 
Protein GI126457557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCGCAGG GAACGCAGAA CGTCCTCACG GTTCATCTCG GTGCACGCAT CACGAAAGTC 
CGGGACTATC TGAATGCGCT TGGCGAGATA CGCGTTTTCA CCGCGAACAG TCAGAACTTC
GGACATCAAT CGTCGTCGGT CAATATCCTG CGCAACCTGA TTCGCATGGG TGCGCCGGGG
CCGTATACAT TCGCGCTGTC CGCATCCAAC TCCGCCGACT ACGCGGACCT GGAAGAAAAG
ATCCGCTTGC TGATTCCGCA GTTCCGGCAG GTCGGCGTCA CGTTTGAACT GGGTGCCGGC
GGGCGGACCG CCGATGTCAC CGTCGTGCGG CTCGATAAAG CGCTGGCGCC CGCGCAGTTC
GCGATCAGCG GCGGTTTTGA CGATCTCGAA AACAAGACGC CGCCGTATCA TCTGCTGAAT
GTGACCAACT ACGTTCAGCT TCAGCCGTAT GCGTGGAATC GCGGGACCAA TATGGTCCGG
ATCATGCCGC CCGGCGGCAC GGCGAGCGAA TACAACCTCG ATGAGCTGAA TCCGACCACG
CTGCTTGCGC GGCGCGCGTT CTATCTCGCC GATCCCGAGC TGACGCAGTC TGACCGAGAG
GCGATTGCGC AAACGCCGTA CGCGAACAAG GCGCGCGTGA TCGAGCGTCT GCTCGAACGA
CGGGAAGCTG GCGAAATCAC GCTTTTTCCC GTCTACGGCG TCACGACGAA AGGGAGCGCC
TATACGTCGC TGTACAACGC GGTCACCGGC GCGTTGATCG CGCAGAACAC ACATCCCGCC
GTCAAAAAGA CGGTGATGGT TCAGATCACG ACCCTCACGG CCTCCGAGTG GGAAGCGTTC
CTGTTTTTGA TGCGGGACCC GGCCGGCCAG ATGGTAAACA AGATCAGGAC GACGCCTGAT
TTCCGGGGCT GGAACGAGGA GAATAAGGTC AAAGACCGGG TCCAGGATCT CGGCAGCCCG
AATAGCGTAC CCACAGTTGA GCAGCTCGAC GAAGAGCTGT CGTATCTCAA GGACGATCAA
CTGCTGGTGG TCTACATCGG CAAAATTCCG GCGCCGCTGT TCGATGCGCT TTATGCGAGC
GCGACTTTGC CACCGGTGCT GGAAGGGCAG AATACCGCCG AACTGATGCT CAATCTCGGC
AAGCCGTACT TCAAGATCAC GAGCAATAAC AGTCGCGAAG CGGATGCGCG ATTCAGCTAC
GCGACGCTGC CGCTCAGTTC CACCGGCGCG GGCACCGACG CGACCAATTC GCTCGACGAG
TCGTTCAACG GCATTTACTT CACGCGTCCC GACAACTGGT ACAGAGATCG CCCGACCTAT
CCGCCGACGC AACTGCCGGC GATGATCAAC GCGTATGTGC AGCCGGCCGG AAACGCCCGT
GCGACTTACT TCGCGGCGCA GCGTACGTTT TTTCACGACG AGCTGAACGA CAAGCTGCTG
CGCGGCCTCG ATCTGTTCGT CAATCTGATC GGGCCGGCGG CGCTCGAGGA GCACCGACTC
GCGCTCGCGC ATGCCGCGCT GGACGACGCG AACGCGCCTG TTCACGAGGC CGGCGCCACC
GCACGGGCGC CCGCGCGCAT CGCGCACCGC AAGCCGCCGA CACGTGTCGC AGGCGGCGAC
GCGAACGGCG CAAGCGAGCT GCTCGAAGCG TTCTACGAGG ATCTGACGAG TCACACGGTC
GACGGTGTGC TCGATTTTCT GCTGGCGGTT ACCGACGGCA TCCTGAACGA GTTCTTCAGG
CAAGTCGTGA TCGACGTCGT GTTCACGATC ACCGACACGG TGACGGAAAT CAACGCCGAT
AAAACCGAAG TCACGTTGAC AGGCAAGTCG AAGGCGTTCG GCGCAGGCAA CCTGACGCTC
GCGTTCTCGT TTACCGACAG CGGCGGGACG ATCGCGGGCA AGATGTCCGG CGCGTTTACG
GACACGGTCT GGGCGTTCCC GGGCGCGCAG TGGCTCAGCG TCGCCAACCC GTCGCTTGCG
CTTGCCATCG ACAGCAACGC GGCCGTTCCC GTGACGGGCA CGGTCGGTGC GACGTTTACG
GCCGGGATCG CCGCGAAGGC GTCGCTGACG CTGCCGTCCG AACCGGGGCG CCTGCTGCTG
CAAGCGGAAT TTCTCGCGCC GCGACCGAGC ATCACGAATA TCTTTCAGAT GCTCGGGGGC
ATCAACATCC AGGCGCTGCT GCCATCGCAA ATCCAGTTCT TCAGCGACAT CGAGGTGCAG
AACCTTGCGC TGCGCTACAG CTACGCGAAC GGCGTGATGG AGTACATCGG CGTCACGCTC
GGCACACCCG AGAATCGAAG CTGGCAACTG GTGCCCGGCG TGACGGTTAC CGGGCTCAGC
TTCAGCGCGC TGACCGATTA TCCCGGCGAT CTGCAGCGGC GCAGCACACG CTACGTGATC
GGCGGCCGGT TCGACATCGC CGGCGGCCAC GCGCAACTGG AGGCGCGCGT GCCCGCGCTG
CGCGTGACCG GCGGCTTGAT CGACGGCAGC CCGCCGATCA CGCTCGCCGC TATCGTCACC
GAGTATCTGG GCGCCGATTT CGCCGCGGCC ATTCCGGCGA GCGTATCGAG CACCGCGATC
GAGCAATTGA GCTTCATGGT GGATCAGGCG CAGGGCGCGT ACAGCTTCTC GATGGACGTC
TCCGCGCAGT GGCCGGTGCC GTCTGCCGCC AATGCGCTGT TTACGATCAC CGGCCTGAAC
TTCGCGATCG ACGCCGTTTC ACGCGATATC AATCCGCCGA AAGCGGATGC CGGCGGCAAC
AACGGCGCCG GCGGCACGCA GACTGAAATT GAAGGAAGCT TCGGCGGCTC GCTGATCGTC
CTGCCGAACT CGGAGAGCCC GATCGGGCTG TCCACCACGG CTACGTACAA GACAGCCGCG
AAGGCGTGGA CCTTCGACGC GCAGCAAACG TCCGGAGTGG TGAGCCTCGG CGCGTTGCTC
GTTTATTACC TCGGCAATAC GTGGCAGGCC CCGCAAGGGC AGGAGTACGC GATCGACGGG
CTTGGGCTGA CGATCACGAG CTCGCCCACC GATTCGACGT GGGCGTTCAC GGGCAAGACC
GCCGACAACT GGGTTGTGCC GTTCCTGGAC GTGAGTCTCG CAGCGAAGCT GCGCATGGGC
GACGCGGGAG CGAAGGCAGA GGTGCCGGGG AAATTCGGCA GGCTCGACCT CGAAGTGATC
TGGCAGAACA TCGACCTCAC CGTCTGGTTC GACTACAACC CGAAGATCAA GCAATACGGG
ATCACGTGGG GCCTTCTCGA AGGTGTGGTC GACGGACCGG ACCCGACGAC CCAGGACTGG
ACCGCGACGC TCGGCTTCAA GCAGAACACG ACGCTCGGCT CGATGATCGA AACGATGGTG
TCGTGGGCGA CCGGCTCGAA GTTCGGGCTC GAGTCGCCAT GGAGTTTCCT GAACGCGATC
CCGCTGTCGA ACCTCGCGCT CAAATACACG TTCAACCAGA CCACGCCGAG TCGCAACAAG
GTCAGCTTCG CCGTGACGAT CGGCCCGATC AATCTGGGCT TCGCGCGCAT CGACAGCATC
GACGTCGGCT ATCAATCCAC CGGCGAAGAT CGCGGCGTGA TGGTGACGCT CAATGGGTCT
TTCTTCTGGC AGTCGGATCC GAGCACGCCG CTCGAGTGGG ATGCGAGCAA GCCGGGCACC
GCGCCCGCGC CTCCCGGCAA CGGCAACAAG TATCTCGACC TGCGGCTGCT GGCGATGGGC
CAGCACATCA CGCTGCCCTG CTTCGCGACC GCCGATACGG TGCAAAAGGC CATCGCCTGC
ATGGCCACGC TGCCCGATCC GAAGCCCGGC CAGATTCCGG CAGTGCGCTT CGATGCGCAA
AGCGCGTGGC TGATCGGCAC CGACTTCGGC GTGCTGAAGA TCGACAGCGG GCAAACCGGC
AATAACGCGA ACGCGCTGCG CGTGACAAAC GACGGCAACT CGCTTGCGGA ATCGTCCGGC
TATGTATTGA CGCTGCAGGC GGTGTTCAAC GACCCGCATC TGTACGGGCT GCGAATTGCG
CTCGACGGCG CGGCGGCCAA GGTATTCAAG GGCCTCGACT TCCAGATCAT GTACCGTCAG
GTGAGCGACA CCGTCGGCGT GTACCAGGCG GAGATCACGC TGCCCGACCT GATGCGCCAT
CTGACGGTCG GCGCGTATTC ACTCACGCTG CCCGTGTTCG GCATCGCCGT CTATACGAAC
GGCGACTTCC AGGTGGACAT CGGCTTCCCG TGGAACGAGA ATTTTTCGCG TTCGTTCACG
ATCGAGGCGA TCATCCCGCC CGGCATTCCG GTGCTGGGCT CGGCCGGTTT CTATTTCGGC
AAGCTCTCCA GCGCGAGCAC CAATCGCGTG CCCGCGTCGT CGTACGGCAC GTTCAATCCG
GTGCTCGTAT TCGGCTTCGG CATGCAGGTG GGCTTCGGCA AGTCGATCGA ATACGGCATC
CTGTCGGCCG GCTTCAGCGT GACCGTGGTC GGGATTCTCG AAGGCATCCT CGCGAAGTGG
AACCCGTATC AGCTCACCCA CTCGGGGCGC GAGCCGTCCA CCCAGTTGCA GGGCGACTAC
TACTTCTGGC TGCGCGGCAC GGTCGGCATC GTCGGCCGCG TGTACGGCAG CGTCGACTTC
GCGATCGTGA AGGCGAACGT CGACATCACG GTCAAGCTCC TGCTGCAACT CACGTACGAA
TCGTATGTGT CGATCACGAT CACAGTGATC GCCTCGGTCG ACGTGTCGGT TAGCGTGAAG
ATCAACCTCG GGTTGTTCAA GATCAGCATC TCGTTCTCGT TCTCGATGCG ACTGAAGGAG
ACCTTCACGA TCGATAACCG GGGCGCCGCG CCATGGCTCG GCGATGGCCG CAACGTGCGC
GGCGTGCTGC GTCTGCCGGT CGAGCGTCGG CTCTCCGGCT TCGCGCGCGC GCAAGCGCGC
GACAGCCTGC TGGTGAGCGC GCCGAACTGG GGCAACCTGC GGCCGGACGG CGTGACGGAC
CTGTCGGGCT ACCTCGTGCC GGGGCTCACC GCGGCGCGCG ACGAATGGAC GCCGCAGGGC
GAACCGGCGA ACCAGTTGTC GTGCTGGGTC GCGCTGCTGC TGATCGAATC CGTGCCGCCG
GCTGGGCAGG ATGCCGGCGC GAGCAAGCTC AAGGCGGCGG GAAGCGCGCC CGACAGCTCG
TTCGAGGCGC TCGCGAAAAT GGTGCTGCGC TGGGCCATCG CGGCCGTTCA GGGGCCGATG
ACGCCGGACG AGGTCGATCG ATGCCCGGTT CCCGCGACGC TACTGGACTG GCTCGCGGAC
GAAGTGCTCG TCAGCACCGG CGACGATCCG ACGCCGATCC CGCTCGACGC GGTGCAGGCG
TTCCTCGACA CGCATTTCCG ATTCAATCTG CGCGTGCCGC CGACCGATCA GGATGCGTCG
GCCGATACCG CGTATTTCCC GGCGCCGCCG CAACTGCGCG TGATGATTCC GCCGTACGGC
AACGACTACC CGGGCGTGCA GTACACCCTC GGCAGCTATA ACGCGCTCGG CGAAAACACG
CTGGCCGAAC TGCGCGCATG GTTCGACCAG CTCGCGGTGC AGGTGGAGCG CGAGCAAGCG
GCGAACGGCG CGGCGGCGCG GGCATTCGTG GAAGAAGCGC CGCTGTCGAT GGCCGGATGG
ATGTTCTCCG ACTACTTTCT GCTGCTCGCC CGGCAGATGG TCAAGGCCGC GCAAGACGCA
CTGCGCGACT TCAAGTACGC GCTCGACGCG AACGAGACAC CGGACGACGT CGTGAGCTGG
GTGAACACGA CCGGTCAGTT GAACGGGCTG TACACGCTGA ACGACGTGTT CGGTGCCAAT
GCGCTGCACG CGCTCGTCGC CGAAAAAACA CTGACGATCG GCGTCACGAG CTCGATCAGC
CTGGCCAAGA CCGGCCAGAC CTTCACATCG CTGGCCAAGG CCTTCGACGA CGCGCTGCCC
GCAAGTGCGA TCGCGTCGGC CAATGCGGCC GACGCTGCGC TGCTGCAGCC GGGCGCGACG
ATCACTTACC CGGGCTTCGA TCCGTACACG AGCGTTGCTG GCGACACGCT CGTCAGCATC
GCTGCGCATT ACCAGGCGAA GCTCAACGAT CTGCTGGCCG ACTCAGATGT GCTCGACGCG
GCCGGCATGC TGCGTATCGG CGCGAGCGCG CTCATGCCGT ACACGGCGTA CACGGCGCTC
GCGACCGACA CCTTCGCGTC GGTCGCCGCG CTGCCCGTGT ACGCCGGCGG TTTCGGCGCG
GCCGCACTGG CCACCGCGAA TGCGGGCCGC AGCGTGCTGC TCGAAGGCGT GAAGATCGAG
TATCCGGACA AGGACGCGTA CACCGTGCAG CCGCGCGACA CACTTGGCGA CGTTGCGAAC
GCGTTCGGCG TAACCGTGTC CGACCTGCTC GCGACCAGCG CGGTGCTGAC GCAACCCGGC
CTGCTCGCGC CGGTCGCGTC GCTTACGGTT CCGGCGTTCC GCTATACGAC GCAGCAAGGC
GACGATCTCG CGCAGGTCGC GGCGCGCTTC GGCGTGACCG TCTCGGTGCT GGCAGACCAG
CCCGCCAACG GCACGGTAGC CGGGTTGTTC GACACCGGCG ACACGCTCGA CTTGCCGCAC
CTGCCGCAAT TTCCGTTGGC CGAACTGCTC GCCGAAGCGC AACGCTCAGG CATGCTGCAG
CACCTGTCCG GCATCGCGAG CAGCTATACG ATGCATGGGC TGCGCTTCCC GACGTCCGGC
CCGACCGGCA CGGGCGGGCA ATGGTCGATC GTGCCCAACG AGATGGGCAT GTGGGTGCAT
GACGTGAACG GCACGCTGAA GCTGCCGCCG CAAGCCGGGC TCTATGCGCT GACCGGTCAG
CAATTCCCGC TGCCCGCACT CGGCGCCGAT CCGTTCGCGG CGACCTTCGA CAGCGTGGCC
GGCGCCGGTT CGTCGTGGCT GCGCTTCGTC GACGGCAACG GCGGCCCGAC CGACCGTCTG
ACACTGTCGG TCACGCCCGG CACGCCCGAC GCGACGCGTA TCGCGCAGGT GACCGCCGCC
GCGAAGACAC GGCTCGTCGT GCCGATGGAT ATGCTCGGCG CGGGCAAGAT GTACGATACC
GCCCTCGCGA CCTATCCGTT CACGTCCGCG TTGCAGTGGC TGAGCACGAA CACCGTCGCG
CTGCCTTATG GCCAGCCGCC GGCCGGCGTG CAGTCGCTGC GCGTGTGGCA ACTGCCGGGC
GCGCTCGCGG CGCTTCCCGA TCCGGCCACC CATGCGGTGA ATCCGCGCTT TGCACTGCGG
GTGGCCCGCT ACGACGATGC GACGGGCGCC ACCGAAACCA CCGGGGTGGA CTCGTACGGA
TGGGCGTCGA CCATCGGCTT CACGGTGCGG CGCATTCCGC CGGTAGCGGG CAGTCCCGCG
TCCGTCGACA CGTACGAGGT GGTCGGCGCG AGCGGCGCTG CAATCGTGGT GCTCGAACAA
CTGTTGAGCC AGGTGCAGGC GGACGATTCG GCCTACTTCG GCCTGAGCGT CGGTTTTGCA
CCCGATAGCG CAACGGGCGG CGGCGAAGGC GTGCAGACCG GCGGCGCGGC GAGCGTCGTG
TTCGGCATCG CGCAGGTGAA CCTGTCGACG GAAACCCGTC CGCCTGCCGG CGCCGCATTC
GCGGCACTGC GTGAAACGGC CGGCGAAACG CCGCCGCTCA CGCTGCTCAA TTCGCCGTCG
GAGTTCGTGC GGCTGCTGTG GGAAGCGAGC ATCACACGCT CGGGCGGCTT CTTCCTGTAC
TACTACGATC GCGCCGCTGG AGGCGGGCTG CCCGATCGCA TATTCAACGA CCGCAACGAG
GCATCGCTGA CGTTGATCGT GCTGTACGCG AAGCCCGCGG CCGTAGACGA TCAGGACCGC
GTCACGAATT ACATGAATGC GGTGGTGACG ACCGACGCGC TGGATACCGG CAACGCGGTG
CTGTTCGCGG AAGCGGCGCC GGTTCCCGCC ACCGTCACGA GCGGCGCTGG CGAGACGCTC
GCGTCGCTCG CCGCGCAATG GTATTCGGAC GAGGCGGATA TCGCGGAAGC CAACGCGAAC
GTCGCGCTTC GCGCAGGCGC GCTCGTGCGC GTGAGCGAAG GGGTCTACCA GGCGCCGCCG
GGCGGCATCG CGCTCGCGCA GGTCGCGAGC CGCTTCGGCA CCACGGTGCA GGCGCTGAAC
GACGCCAATC CGCTGTGGGG CGGCTTGCCC GACCCACTGC CGTTCCCGGC CGCGATCCGC
GTACCGGACC TCACGCTCAC TGCCGGCACG AGCGCGCACA CGGCGTCGCT CGCGGATATC
GCGGGCTGGT ACGGCGAGCC GGTCGACGCG CTTGCTTCCC ATAACGCGCG GGTGGCCCAG
CTGTTCGCGG CCGGCGTGCC GCTCGTGATT CCCGGCGGCC CGCGCGTGCG CTCGGCTGCG
GTGCAGCCGG GCGTGCAGGC GCTCGCCGCG CTGCGTCCGG CGCCGCCGCA CGTGGACGGT
ACATCGCCGG ACTACGGAAC GGAACTGCTG CTGAACAACT TCAGCCTGTT GAACCAGCAG
GTATACGGCA ACGTCGACTT CCGGCCGAGC GATCCGCCGG GGCTGCCCGC CGGACCGACA
ACCAAGGCTC CGGAAGAAAA CGGCAACGAC AAGGTTCGCA CCGTCGTTCC GGCAGATCAG
GTCGAAGCGT GGAACTTCAG TCAGGCGCTG CCTTACGCAC GCTTTGCGAA GCACGTGCCG
CAAGCGCCGC GCGCGGCGGT CGCGCTGCCG CCCGCAAGCG CCAGTCCGTA TTTTGGTGTC
GGCGGCATTC TGCAGATCTC GTTTGCGTGG CAGGACTACT ACGGCAACGT GCTGTCGACG
CCGCTGTCGG ACCCACTGGC CGGCGATGCG GCGCCGTACA ACGACGCGCC GCTGCTCACC
GGCTACACGG ATCCGCTGGT GTCGCTGTCG CAATGGCCGT CGATTGCTTC GAACTGGCAG
GTGCTGCCCG GCAGCGGCGG AGCGAACCCG CGGCTGAATA TCGAGTTGAG CTTCGATCCG
AGCCGTTACC AAGGCTTGCT GCAGGCATCG GCGGCGACGC AGACGACGAT CACCGTGGTA
TTCACGGACG CGCTCGACGC CGCATCGGTC GGCGAACTGT CGCGCTGGCA ACTGGTGCCG
GGAACCGTCG ATTCGGCGTC GCTGGCCGCG GATGGAAAAA CCGTCACGCT GACGGTGCCG
GCGCTCGACG ACGACCTGCG CTACACAGTG ATCGCCACTG ATATCAAAGC GCAAGCGAGC
GACATGCGCT ACAGCGGGCA GGCGTCGTTC GACTGGCCAG ACAATCCGGT CACGCGCAGC
AGCACCGTGC AACAGAACGC GTCGCAGGAC CTGCACGTCT ATACACAGCT GTACTACCAG
CTCACCGATC CCGCCGGAGT CGATCTGTCG GCGCAGTCTT CGTTGCTCGC GGACGCGCAC
GGAGCGCCGG GCAGCGTCGC GTATGCGCCG GCCGCCGTCG ACGAGCTGAT GGACTGGCTG
TTCGGCACAG CCGGCGCGGC CTCCAGTATC TATGCGTTCG TGCAGGACCG CTCGAAGTTC
CAGAGCGTTG CCGTGCCGCC CGCCGCCGGG TTGCCGCTCG ACGTGGACGT GCCACCGCAG
CAGGTGAACA CCGCGCAGAT CTTCCCGCTG TGGACATCAT TTACGATGAC GCGCGCGCAC
GGCCCGGTGC TGCCGGGGCT GGAGACGGTC GCCGGCATTC GCAGCGCGAG CACGCGCGTC
GCGCCGCTGC AGGATGCGCT CGGCGCCACC GGCGGCACGC TCGGGCTCGT TACGTTCGCG
ACCGGTTTCG AACAGGCGCT GTCGACGCCG GGCAGTGTCC GGCTGAAGGT GGCGACCGGC
GTCGATCGCA CCGCGCCGCC CGCCACTGGA GCGGCCAGCA CGGTCTGGGC GGTACGCGTC
GGGCTCGCTG CCGGCAAGGC GATCTCGTAT GCGATCGCCG ACGCGGGCAA CCCGGCCGTG
TTCGCGCCGC AGCCGGCGAG CAACCGGTTA ATCAGCCGCA CGCAAGTACC GATCTACGAC
TACACGACCG GCAAGGGCAT CTCGTCGACG CCGTCACGCA CCACCGATTT CACGGATGTC
GATCTCGACA CGTGGTGCGC GCAGGTGTTC GCCGCCGTCG ACGACGTGCT GACACCGCAA
TTCACCGCGC CGATGCAGAT CGTCGGCGAG TTGAAGTCCG CTGATTATCT GCAGTCGATC
CTCGACGGCA AGAAGGGACT CGCGACGGTC GCGAAGCTGT GGATGATTCC GGTCTTCGCG
GGGGAAACCT CCGACCCGTC CGCCGCGCGC GAAGCGTTTT ACCAGCAACT GCTGGTGCGA
TTGTCAGCCG CATACACGAC GCGCGCCGCG GTGGAGTTCC ACGCGAACGT GACCGCCGAC
GTGATCGAGC CCGCCGCGGA TCAGCCGCCG CGGCTGTTTG GGCCGGTCAC GCGAAACGGA
CCGGTGTTCG AAGCCGCCAC TGTCGACGGA CAGGCACTGA CCACCGTGTT CCTGCTGTTC
AGCGACCCGA TGGACCCGGT CACCGCCGGC AACATCGAAA ACTACGCATT GAGCAGCGGT
GCCGGCGTGC TGACGGCGAC GGTCGACCGC GGCACGGTGA CGCTCACGCT CGCGACGGAC
GTGCAGCCGG GACAGACGAC GGTCACCGTC AGCAATCTGA AGGACGCGAC GGGACGCGCG
GTGCGGCCGC CGCTCACGCG CACTATCACG ACTGGCTCGG CGAGCCTGCC GGCCAGCACG
CTCGCGTTCA GTTCGCCGAA GCTTACGTTG CAGGCCGGCG ATACGCGCGC GCTCACGTAT
CTCGTCAACG CTCCGGATTC GGTACGCGGC GCGGGCGGCG AGATAGTGTC GTATGTCGAG
CTGGACATGA CGTACCAGGG CAGCCAGATC GAGCATCAGA TCGGCGCGCT GCCGGGTATC
GAGGATTACC AGGCATCGAC CTGGCTCAGC TTCGTCGTGC CGGACACTGA CGGGCCGCTC
GCGGCGGACC TCGGGAACTT CGCGGTGCCG CTGGTGCTGC GCGCATTCCC GGCAAGCCCC
GCGATGACGG AGCAGAGCGG CACCCCGACC CATGACCTCG ACACCGCGAG CCTGCCCCTG
CTCAAGCAGT GGGACTATGC GTTCACCTAT TCGCTGCCGT TCCACTATCC GCAAGACCGC
ATCTACGGCG AGGTCGAATT CAATCTACGC ACCGCGCCGA CTTTGTTCGC GAGCTTCCCG
GATGCGTTCG CGCAGCTCGC GGAGTTCATT ACCGTGTTCC CGAAGGTGAA CGCGGATCTG
CAAACCATCC TCGCCGGCAT CGATGCGACG GTCGACCCGA TGACGGACCA GCAAAAGATC
GACGACGCAT CGATCGCTCT GCAATCGTTC ATCCAACTGG TCGACGAACT GGTCGAAGCG
GCAGGCGGAA ATACGCAAGG CAATGGCGAG CGCCGCGGCG GCACGGGTCT CACGTTCCAG
GCCCCGGCGC GACTGCTCAC CGGCGACCCG TCGCTTACCT TCGCGTTCTA TGAGGAAGAA
GGCTCGGCCG AGGTTGGCGA TACCGAAGGC GCGCTAGTGG TGACGCTGGT CGGCGCCGTC
CCGGCAGGAA TGGGTCAGCC GGTCGTCGAA ATCGATCCGG CGCTGTACGA CGCGCAGCCT
TGGCAGCCGC CTGGCGACAC GCAGAAGGCC GGCGATGTCT TCCACTACGT GTACAAGCGC
AAGGCCGGCC CCGGGCCGGA GGGTTCATAC CTGAGCGCCG CGAACGGGCA GAACATTCCG
GGCCGTACGG TGCGGCTGCC CGCGCTCGAC ATCCTGCAGC GCCAGGACGC GTGGTCTACC
GTCTGGGTGG AGCGCAACCG CGAACTCGTT CCGGGCAAAC CGTCGGCTGA CGCGTTTGTC
TACACGACGC CCGAAGTGCG CTTCGCCAGC CCGCTGTATC CGACCAACGA CGCCAATGCG
ATCATCGATG TGGCGGCGAT CCCGTCCGGT ACGCCGGTGA AGCGTTCGCT TCAGGAGCAT
TTCGACGCGC TCTTTGCCTA TCTGCTCGCC GGCGACACGC TGCCGCAGAT CGTCGCACAG
GTGGAGGTGA CATACGGATA TGCGCTGAAC GCCGCGCTCG ATAAGATCGT GTTGCCGGTG
TTGATGCAGG CGCCGCTGAC GGTCGACGTC GCCGGAACCG GCGCGGGCAC CATCGCGAAA
ATGACCGCTG ACTGGACTGC CGCGATCGAA ACGTGGTTCT CCACATACGA ACCGACGGGC
GGCGGAACAC TCTGGATGGA TCTGACGCTG ATGTCGAATC TTACCGGCCA GCCGATGCCG
TTGCTGCGGA TGCGGCGATT GATGCTGTCC ATCGCGCAGG TGGTCCCGCC GCTGCCATGC
CGCTAG
 
Protein sequence
MSQGTQNVLT VHLGARITKV RDYLNALGEI RVFTANSQNF GHQSSSVNIL RNLIRMGAPG 
PYTFALSASN SADYADLEEK IRLLIPQFRQ VGVTFELGAG GRTADVTVVR LDKALAPAQF
AISGGFDDLE NKTPPYHLLN VTNYVQLQPY AWNRGTNMVR IMPPGGTASE YNLDELNPTT
LLARRAFYLA DPELTQSDRE AIAQTPYANK ARVIERLLER REAGEITLFP VYGVTTKGSA
YTSLYNAVTG ALIAQNTHPA VKKTVMVQIT TLTASEWEAF LFLMRDPAGQ MVNKIRTTPD
FRGWNEENKV KDRVQDLGSP NSVPTVEQLD EELSYLKDDQ LLVVYIGKIP APLFDALYAS
ATLPPVLEGQ NTAELMLNLG KPYFKITSNN SREADARFSY ATLPLSSTGA GTDATNSLDE
SFNGIYFTRP DNWYRDRPTY PPTQLPAMIN AYVQPAGNAR ATYFAAQRTF FHDELNDKLL
RGLDLFVNLI GPAALEEHRL ALAHAALDDA NAPVHEAGAT ARAPARIAHR KPPTRVAGGD
ANGASELLEA FYEDLTSHTV DGVLDFLLAV TDGILNEFFR QVVIDVVFTI TDTVTEINAD
KTEVTLTGKS KAFGAGNLTL AFSFTDSGGT IAGKMSGAFT DTVWAFPGAQ WLSVANPSLA
LAIDSNAAVP VTGTVGATFT AGIAAKASLT LPSEPGRLLL QAEFLAPRPS ITNIFQMLGG
INIQALLPSQ IQFFSDIEVQ NLALRYSYAN GVMEYIGVTL GTPENRSWQL VPGVTVTGLS
FSALTDYPGD LQRRSTRYVI GGRFDIAGGH AQLEARVPAL RVTGGLIDGS PPITLAAIVT
EYLGADFAAA IPASVSSTAI EQLSFMVDQA QGAYSFSMDV SAQWPVPSAA NALFTITGLN
FAIDAVSRDI NPPKADAGGN NGAGGTQTEI EGSFGGSLIV LPNSESPIGL STTATYKTAA
KAWTFDAQQT SGVVSLGALL VYYLGNTWQA PQGQEYAIDG LGLTITSSPT DSTWAFTGKT
ADNWVVPFLD VSLAAKLRMG DAGAKAEVPG KFGRLDLEVI WQNIDLTVWF DYNPKIKQYG
ITWGLLEGVV DGPDPTTQDW TATLGFKQNT TLGSMIETMV SWATGSKFGL ESPWSFLNAI
PLSNLALKYT FNQTTPSRNK VSFAVTIGPI NLGFARIDSI DVGYQSTGED RGVMVTLNGS
FFWQSDPSTP LEWDASKPGT APAPPGNGNK YLDLRLLAMG QHITLPCFAT ADTVQKAIAC
MATLPDPKPG QIPAVRFDAQ SAWLIGTDFG VLKIDSGQTG NNANALRVTN DGNSLAESSG
YVLTLQAVFN DPHLYGLRIA LDGAAAKVFK GLDFQIMYRQ VSDTVGVYQA EITLPDLMRH
LTVGAYSLTL PVFGIAVYTN GDFQVDIGFP WNENFSRSFT IEAIIPPGIP VLGSAGFYFG
KLSSASTNRV PASSYGTFNP VLVFGFGMQV GFGKSIEYGI LSAGFSVTVV GILEGILAKW
NPYQLTHSGR EPSTQLQGDY YFWLRGTVGI VGRVYGSVDF AIVKANVDIT VKLLLQLTYE
SYVSITITVI ASVDVSVSVK INLGLFKISI SFSFSMRLKE TFTIDNRGAA PWLGDGRNVR
GVLRLPVERR LSGFARAQAR DSLLVSAPNW GNLRPDGVTD LSGYLVPGLT AARDEWTPQG
EPANQLSCWV ALLLIESVPP AGQDAGASKL KAAGSAPDSS FEALAKMVLR WAIAAVQGPM
TPDEVDRCPV PATLLDWLAD EVLVSTGDDP TPIPLDAVQA FLDTHFRFNL RVPPTDQDAS
ADTAYFPAPP QLRVMIPPYG NDYPGVQYTL GSYNALGENT LAELRAWFDQ LAVQVEREQA
ANGAAARAFV EEAPLSMAGW MFSDYFLLLA RQMVKAAQDA LRDFKYALDA NETPDDVVSW
VNTTGQLNGL YTLNDVFGAN ALHALVAEKT LTIGVTSSIS LAKTGQTFTS LAKAFDDALP
ASAIASANAA DAALLQPGAT ITYPGFDPYT SVAGDTLVSI AAHYQAKLND LLADSDVLDA
AGMLRIGASA LMPYTAYTAL ATDTFASVAA LPVYAGGFGA AALATANAGR SVLLEGVKIE
YPDKDAYTVQ PRDTLGDVAN AFGVTVSDLL ATSAVLTQPG LLAPVASLTV PAFRYTTQQG
DDLAQVAARF GVTVSVLADQ PANGTVAGLF DTGDTLDLPH LPQFPLAELL AEAQRSGMLQ
HLSGIASSYT MHGLRFPTSG PTGTGGQWSI VPNEMGMWVH DVNGTLKLPP QAGLYALTGQ
QFPLPALGAD PFAATFDSVA GAGSSWLRFV DGNGGPTDRL TLSVTPGTPD ATRIAQVTAA
AKTRLVVPMD MLGAGKMYDT ALATYPFTSA LQWLSTNTVA LPYGQPPAGV QSLRVWQLPG
ALAALPDPAT HAVNPRFALR VARYDDATGA TETTGVDSYG WASTIGFTVR RIPPVAGSPA
SVDTYEVVGA SGAAIVVLEQ LLSQVQADDS AYFGLSVGFA PDSATGGGEG VQTGGAASVV
FGIAQVNLST ETRPPAGAAF AALRETAGET PPLTLLNSPS EFVRLLWEAS ITRSGGFFLY
YYDRAAGGGL PDRIFNDRNE ASLTLIVLYA KPAAVDDQDR VTNYMNAVVT TDALDTGNAV
LFAEAAPVPA TVTSGAGETL ASLAAQWYSD EADIAEANAN VALRAGALVR VSEGVYQAPP
GGIALAQVAS RFGTTVQALN DANPLWGGLP DPLPFPAAIR VPDLTLTAGT SAHTASLADI
AGWYGEPVDA LASHNARVAQ LFAAGVPLVI PGGPRVRSAA VQPGVQALAA LRPAPPHVDG
TSPDYGTELL LNNFSLLNQQ VYGNVDFRPS DPPGLPAGPT TKAPEENGND KVRTVVPADQ
VEAWNFSQAL PYARFAKHVP QAPRAAVALP PASASPYFGV GGILQISFAW QDYYGNVLST
PLSDPLAGDA APYNDAPLLT GYTDPLVSLS QWPSIASNWQ VLPGSGGANP RLNIELSFDP
SRYQGLLQAS AATQTTITVV FTDALDAASV GELSRWQLVP GTVDSASLAA DGKTVTLTVP
ALDDDLRYTV IATDIKAQAS DMRYSGQASF DWPDNPVTRS STVQQNASQD LHVYTQLYYQ
LTDPAGVDLS AQSSLLADAH GAPGSVAYAP AAVDELMDWL FGTAGAASSI YAFVQDRSKF
QSVAVPPAAG LPLDVDVPPQ QVNTAQIFPL WTSFTMTRAH GPVLPGLETV AGIRSASTRV
APLQDALGAT GGTLGLVTFA TGFEQALSTP GSVRLKVATG VDRTAPPATG AASTVWAVRV
GLAAGKAISY AIADAGNPAV FAPQPASNRL ISRTQVPIYD YTTGKGISST PSRTTDFTDV
DLDTWCAQVF AAVDDVLTPQ FTAPMQIVGE LKSADYLQSI LDGKKGLATV AKLWMIPVFA
GETSDPSAAR EAFYQQLLVR LSAAYTTRAA VEFHANVTAD VIEPAADQPP RLFGPVTRNG
PVFEAATVDG QALTTVFLLF SDPMDPVTAG NIENYALSSG AGVLTATVDR GTVTLTLATD
VQPGQTTVTV SNLKDATGRA VRPPLTRTIT TGSASLPAST LAFSSPKLTL QAGDTRALTY
LVNAPDSVRG AGGEIVSYVE LDMTYQGSQI EHQIGALPGI EDYQASTWLS FVVPDTDGPL
AADLGNFAVP LVLRAFPASP AMTEQSGTPT HDLDTASLPL LKQWDYAFTY SLPFHYPQDR
IYGEVEFNLR TAPTLFASFP DAFAQLAEFI TVFPKVNADL QTILAGIDAT VDPMTDQQKI
DDASIALQSF IQLVDELVEA AGGNTQGNGE RRGGTGLTFQ APARLLTGDP SLTFAFYEEE
GSAEVGDTEG ALVVTLVGAV PAGMGQPVVE IDPALYDAQP WQPPGDTQKA GDVFHYVYKR
KAGPGPEGSY LSAANGQNIP GRTVRLPALD ILQRQDAWST VWVERNRELV PGKPSADAFV
YTTPEVRFAS PLYPTNDANA IIDVAAIPSG TPVKRSLQEH FDALFAYLLA GDTLPQIVAQ
VEVTYGYALN AALDKIVLPV LMQAPLTVDV AGTGAGTIAK MTADWTAAIE TWFSTYEPTG
GGTLWMDLTL MSNLTGQPMP LLRMRRLMLS IAQVVPPLPC R