Gene BURPS668_A0649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0649 
Symbol 
ID4885696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp605817 
End bp618212 
Gene Length12396 bp 
Protein Length4131 aa 
Translation table11 
GC content66% 
IMG OID640130589 
ProductLysM domain-containing protein 
Protein accessionYP_001061648 
Protein GI126445446 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGGAC TGAATGACGT CATCGAACGG TTGTCGCAGG GAACGCAGAA CGTCCTCACG 
GTTCATCTCG GTGCACGCAT CACGAAAGTC CGGGACTATC TGAATGCGCT TGACGAGATA
CGCGTTTTCA CCGCGAACAG TCAGAACTTC GGACATCAAT CGTCGTCGGT CAATATCCTG
CGCAACCTGA TTCGCATGGG TGCGCCGGGG CCGTATACAT TCGCGCTGTC CGCATCCAAC
TCCGCCGACT ACGCGGACCT GGAAGAAAAG ATCCGCTTGC TGATTCCGCA GTTCCGGCAG
GTCGGCGTCA CGTTTGAACT GGGTGCCGGC GGGCAGAGCG CCGATGTCAC CGTCGTGCGG
CTCGATAAGG CGCTGGCGCC CGCGCAATTC GCGATCAGCG GCGGTTTTGA CGATCTCGAA
AACAAGACGC CGCCGTATCA CCTGCTGAAT GTGACCAACT ACGTTCAGCT TCAGCCGTAT
GCGTGGAATC GCGGGACCAA TATGGTCCGG ATCAAGCCGC CCGGCGGCAC GGCGAGCGAA
TACAACCTCG ATGAGCTGAA TCCGACCACG CTGCTTGCGC GGCGCGCGTT CTATCTCGCC
GATCCCGAGC TGACGCAGTC TGACCGAGAG GCGATTGCGC AAACGCCGTA CGCGAACAAG
GCGCGCGTGA TCGAGCGTCT GCTCGAACGA CGGGAAGCTG GCGAAATCAC GCTTTTTCCC
GTCTACGGCG TCACGACGAA AGGGAGCGCC TATACGTCGC TGTACAACGC GGTCACCGGC
GCGTTGATCG CGCAGAACAC ACATCCCACC GTCAAAAAGA CGGTGATGGT TCAGATCACG
ACCCTCACGG CCTCCGAGTG GCAAGAGTTC CTGTTTTTGA TGCGGGACCC GGCCGGCCAG
ATGGTAAACA AGATCAGGAC GACGCCTGAT TTCCGGGGCT GGAACGAGGA GAATAAGGTC
AAAGACCGGG TCCAGGATCT CGGCAGCCCG AATAGCGAGC CCACGGTCGC GGAGCTCGAC
GAAGAGCTGT CGTATCTCGA GGACGATCAA CTGCTGGTGG TCTACATCGG CAAAATTCCG
GCGCCGCTGT TCGATGTGCT GTATGCGAGC GCGACTTTGC CGCCGGTGCT GGAAGGGCAG
AATACCGCCG AACTGATGCT CAATCTCGGC AAGCCGTACT TCAAGATCAC GAGCAACAAC
AGTCGCGAAG CGGATGCGCG CTTCAGCTAC GCGACGCTGC CGCTCAGTTC CACCGGCGCG
GGCACCGACG CGACCAATTC GCTCGACGAG TCGTTCAACG GCATTTACTT CACGCGTCCC
GACAACTGGT ACAGAGATCG CCCGACCTAT CCGCCGACGC AACTGCCGGC GATGATCAAC
GCGTATGTGC AGCCGGCTGG AAACGCCCGT GCGACTTACT TCGCGGCGCA GCGTACGTTT
TTTCACGACG AGCTGAACGA CAAGCTGCTG CGCGGCCTCG ATCTGTTCGT CAATCTGATC
GGGCCGGCGG CGCTCGAGGA GCACCGACTC GCGCTCGCGC ATGCCGAGCT GGACGACGCG
AACGCGCCTG TTCACGAGGC CGGCGCCACC GCACGGGCGC CCGCGCGCAT CGCGCACCGC
AAGCCGCCGA CACGTGTCGC AGGCGGCGAC GCGAACGGCG CAAGCGAGCT GCTCGAAGCG
TTCTACGAGG ATCTGACGAG TCACACGGTC GACGGTGTGC TCGATTTTCT GCTGGCGGTT
ACCGACGGCA TCCTGAACGA ATTCTTCAGG CAAGTCGTGA TCGACGTCGT GTTCACGATC
ACCGACACGG TGACGGAAAT CAACGCCGAT AAAACCGAAG TCACGTTGAC GGGCAAGTCG
AAGGCGTTCG GCGCAGGCAA CCTGACGCTC GCGTTCTCGT TTACCGACAG CGGCGGGACG
ATCGCGGGCA AGATGTCCGG CGCGTTTACG GACACGGTCT GGGCGTTCCC GGGCGCGCAG
TGGCTCAGCG TCGCCAACCC GTCGCTTGCG CTCGCGATCG ACAGCAACGC GGCCGTTCCC
GTGACGGGCA CGGTCGGTGC GACGTTTACG GCCGGGATCG CCGCGAAGGC GTCGCTGACG
CTGCCGTCCG AACCGGGGCG CCTGCTGCTG CAAGCGGAAT TTCTCGCGCC GCGACCGAGC
ATCACGAATA TCTTTCAGAT GCTCGGGGGC ATCAACATCC AGGCGCTGCT GCCATCGCAA
ATCCAGCTCT TCAGCGACAT CGAGGTGCAG AACCTTGCGC TGCGCTACAG CTACGCGAAC
GGCGTGATGG AGTACATCGG CGTCACGCTC GGCACACCCG AGAATCGAAG CTGGCAACTG
GTGCCCGGCG TGACGGTTAC CGGGCTCGGC TTCAGCGCGC TGACCGATTA TCCCGGCGAT
CTGCAGCGGC GCAGCACGCG CTACGTGATC GGCGGCCGGT TCGACATCGC CGGCGGCCAC
GCGCAACTGG AGGCGCGCGT GCCCGCGCTG CGCGTGACCG GCGGCTTGAT CGACGGCAGC
CCGCCGATCA CGCTCGCCGC TATCGTCACC GAGTATCTGG GCGCCGATTT CGCCGCGGCC
ATTCCGGCGA GCGTCTCGAG CACCGCGATC GAGCAATTGA GCTTCATCGT GGATCAGGCG
CAGGGCGCGT ACAGCTTCTC GATGGACGTC TCCGCGCAGT GGCCGGTGCC GTCTGCCGCC
AATGCGCTGT TTACGATCAC CGGCCTGAAC TTCGCGATCG ACGCCGTTTC ACGCGATATC
AATCCGCCGA AAGCGGATGC CGGCGGCAAC AACGGCGCCG GCGGCACGCA GACTGAAATT
GAAGGGAGCT TCGGCGGCTC GCTGATCGTC CTGCCGAACT CGGAGAGCCC GATCGGGCTC
TCCACCACGG CTACGTACAA GACAGCCGCG AAGGCGTGGA CCTTCGACGC GCAGCAAACG
TCCGGAGTGG TGAGCCTCGG CGCGTTGCTC GTTTATTACC TCGGCAATAC GTGGCAGGCC
CCGCAAGGGC AGGAGTACGC GATCGACGGG CTTGGGCTGA CGATCACGAG CTCGCCCACC
GATTCGACGT GGGCGTTCAC GGGCAAGACC GCCGACAACT GGGTTGTGCC GTTCCTGGAC
GTGAGTCTCG CAGCGAAGCT GCGCATGGGC GACGCGGGAG CGAAGGCAGA GGTGCCGGGG
AAATTCGGCA GGCTCGACCT CGAAGTGATC TGGCAGAACA TCGACCTCAC CGTCTGGTTC
GACTACAACC CGAAGGTCAA GCAATACGGG ATCACGTGGG GCCTGCTCGA AGGTGTGGTC
GACGGACCGG ACCCGACGAC CCAGGACTGG ACCGCGACAC TCGGCTTCAA GCAGAACACG
ACGCTCGGCT CGATGATCGA AACGATGGTG TCGTGGGCGA CCGGCTCGAA GTTCGGGCTC
GAGTCGCCAT GGAGTTTCCT GAACGCGATC CCGCTGTCGA ACCTCGCGCT CAAATACACG
TTCAACCAGA CCACGCCGAG TCGCAACAAG GTCAGCTTCG CCGTGACGAT CGGCCCGATC
AATCTGGGCT TCGCGCGCAT CGACAGCATC GACGTCGGCT ATCAATCCAC CGGCGAAGAT
CGCGGCGTGA TGGTGACGCT CAATGGGTCT TTCTTCTGGC AGTCGGATCC GAGCACGCCG
CTCGAGTGGG ATGCGAGCCA GCCGGGCACC GCGCCCGCGC CTCCCGGCAA CGGCAACAAG
TATCTCGACC TGCGGCTGCT GGCGATGGGC CAGCACATCA CGCTGCCCTG CTTCGCGACC
GCCGATACGG TGCAAAAGGC CATCGCCTGC ATGGCCACGC TGCCCGATCC GAAGCCCGGC
CAGATTCCGG CAGTGCGCTT CGATGCGCAA AGCGCGTGGC TGATCGGCAC CGACTTCGGC
GTGCTGAAGA TCGACAGCGG CCAAACTGGC AATAACGCGA ACGCGCTGCG CGTGACAAAC
GACGGCAGCT CGCTGGCGGA ATCGTCCGGC TATGTATTGA CGCTGCAGGC GGTGTTCAAC
GACCCGCATC TGTACGGGCT GCGAATTGCG CTCGACGGCG CGGCGGCCAA GGTATTCAAG
GGCCTCGACT TCCAGATCAT GTACCGTCAG GTGAGCGACA CCGTCGGCGT GTACCAGGCG
GAGATCACGC TGCCCGACCT GATGCGTCAT CTGACGGTCG GCGCGTATTC ACTCACGCTG
CCCGTGTTCG GCATCGCCGT CTATACGAAC GGCGACTTCC AGGTGGACAT CGGCTTCCCG
TGGAACGAGA ATTTTTCGCG TTCGTTCACG ATCGAGGCGA TCATCCCCCC CGGCATTCCG
GTGCTGGGCT CGGCCGGTTT CTATTTCGGC AAGCTCTCCA GCGCGAGCAC CAATCGCGTG
CCCGCGTCGT CGTACGGCAC GTTCAATCCG GTGCTCGTAT TCGGCTTCGG CATGCAGGTG
GGCTTCGGCA AGTCGATCGA ATACGGCATC CTGTCGGCCG GCTTCAGCGT GACCGTGGTC
GGGATTCTCG AAGGCATCCT CGCGAAGTGG AACCCGTATC AGCTCACCCA CTCGGGGCGC
GAGCCGTCCA CCCAGTTGCA GGGCGACTAC TACTTCTGGC TGCGCGGCAC GGTCGGCATC
GTCGGCCGCG TGTACGGCAG CGTCGACTTC GCGATCGTGA AGGCGAACGT CGACATCACG
GTCAAGCTGC TGCTGCAACT CACGTACGAA TCGTATGTGT CGATCACGAT CACAGTGATC
GCCTCGGTCG ACGTGTCGGT TAGCGTGAAG ATCAACCTCG GGTTGTTCAA GATCAGCATC
TCGTTCTCGT TCTCGATGCG ACTGAAGGAG ACCTTCACGA TCGATAACCG GGGCGCCGCG
CCATGGCTCG GCGATGGCCG CAACGTGCGC GGCGTGCTGC GTCTGCCGGT CGAGCGTCGG
CTCTCCGGCT TCGCGCGCGC GCAAGCGCGC GACAGCCTGC TGGTGAGCGC GCCGAACTGG
GGCAACCTGC GGCCGGACGG CGTGACGGAC CTGTCGGGCT ACCTCGTGCC GGGGCTCACC
GCGGCGCGCG ACGAATGGAC GCCGCAGGGC GAACCGGCGA ACCAGTTGTC GTGCTGGGTC
GCGCTGCTGC TGATCGAATC CGTGCCGCCG GCTGGGCAGG ATGCCGGCGC GAGCAAGCTC
AAGGCGGCGG GAAGCGCGCC CGACAGCTCG TTCGAGGCGC TCGCGAAAAT GGTGCTGCGC
TGGGCCATCG CGGCCGTTCA GGGGCCGATG ACGCCGAACG AGGTCGATCG ATACCCGGTT
CCCGCAACGC TACTGGACTG GCTCGCGGAC GACGTGCTCG TCAGCACCGG CGACGATCCG
ACGCCGATCC CGCTCGACGC GGTGCAGGCG TTCCTCGACA CGCATTTCCG ATTCAATCTG
CGCGTGCCGC CGACCGATCA GGATGCGTCG GCCGATACCG CGTATTTCCC GGCGCCGCCG
CAACTGCGCG TGATGATTCC GCCGTACGGC AACGACTACC CGGGCGTGCA GTACACCCTC
GGCAGCTATA ACGCGCTCGG CGAAAACACG CTGGCCGAAC TGCGCGCATG GTTCGACCAG
CTCGCGGTGC AGGTGGAGCG CGAGCAAGCG GCGAACGGCG CGGCGGCACG GGCATTCGTG
GAAGAAGCGC CGCTGTCGAT GGCCGGATGG ATGTTCTCCG ACTACTTCCT GCTGCTCGCC
CGGCAGATGG TCAAGGCCGC GCAAGACGCA CTGCGCGACT TCAAGTACGC GCTCGACGCG
AACGAGACAC CGGACGACGT CGTGAGCTGG GTGAACACGA CCGGTCAGTT GAACGGGCTG
TACACGCTGA ACGACGTGTT CGGTGCCAAT GCGCTGCACG CGCTCGTCGC CGAAAAAACA
CTGACGATCG GCGTCACGAG CTCGATCACC CTGGACAAGA CCGGCCAGAC CTTCACATCG
CTGGCCAAGG CCTTCGACGA CGCGCTGCCC GCAAGTGCGA TCGCGTCGGC CAATGCGGCC
GACGCTGCGC TGCTGCAGCC GGGCGCGACG ATCACTTACC CCGGCTTCGA TCCGTACACG
AGCGTTGCTG GCGACACGCT CGTCAGCATC GCCGCGCATT ACCAGGCGAA GCTCAACGAT
CTGCTGGCCG ACTCAGATGT GCTCGACGCG GCCGGCATGC TGCGTATCGG CGCGAGCGCG
CTCATGCCGT ACACGGCGTA CACGGCGCTC GCGACCGACA CCTTCGCGTC GGTCGCCGCG
CTGCCCGTGT ACGCCGGCGG TTTCGACGCG GCCGCACTGG CCACCGCGAA TGCGGGCCGC
AGCGTGCTGC TCGAAGGCGT GAAGATCGAG TATCCGGACA AGGACGCGTA CACCGTGCAG
CCGCGCGACA CACTTGGCGA CGTTGCGAAC GCGTTCGGCG TAACCGTGTC CGACCTGCTC
GCGACCAGCG CGGTGCTGAC GCAACCCGGC CTGCTCGCGC CGGTCGCGTC GCTTACGGTT
CCGGCGTTCC GCTACACGAC GCAGCAAGGC GACGATCTCG CGCAAGTCGC GGCGCGCTTC
GGCGTGGCCG TCTCGGTGCT GGCAGACCAG CCCGCCAACG GCACGGTAGC CGGGTTGTTC
GACACCGGCG ACACGCTCGA CTTGCCGCAC CTGCCGCAAT TTCCGTTGGC CGAACTGCTC
GCCGAAGCGC AACGCTCGGG CATGCTGCAG CACCTGTCCG GCATCGCGAG CAGCTATACG
ATGCACGGGC TGCGCTTCCC GACGTCCGGC CCGACCGGCA CGGGCGGGCA ATGGTCGATC
GTGCCCAACG AGATGGGCAT GTGGGTGCAT GACGTGAACG GCACGCTGAA GCTGCCGCCG
CAAGCCGGGC TCTATGCGCT GACCGGTCAG CAATTCCCGC TGCCCGCACT CGGCGCCGAT
CCGTTCGCGG CGACCTTCGA CAGCGTGGCC GGCGCCGGTT CGTCGTGGCT GCGCTTCGTC
GACGGCAACG GCGGTCCGAC CGACCGTCTG ACACTGTCGG TCACGCCCGG CACGCCCGAC
GCGACCCGTA TCGCGCAGGT GACCGCCGCC GCGAAGACAC GGCTCGTCGT GCCGATGGAT
AGGCTCGGCG CGGGCAAGAT GTACGATACC GCCCTCGCGA CCTATCCGTT CACGTCCGCG
TTGCAGTGGC TGAGCACGCA CACCGTCGCG CTGCCTTATG GCCAGCCGCC GGCCGGCGTG
CAGTCGCTGC GCGTGTGGCA ACTGCCCGGC GCGCTCGCGG CGCTTCCCGA TCCGGCCATC
CATGCGGTGA ATCCGCGCTT TGCACTGCGG GTGGCCCGCT ACGACGATGC GACGGGCGCC
ACCGAAACCA CCGGGGTAGA CTCGTACGGA TGGGCGTCGA CCATCGGCTT CACGGTGCGG
CGCATTCCGC CGGTAGCGGG CAGTCCCGCG TCCGTCGACA CGTACGAGGT GGTCGGCGCG
AGCGGCGCTG CAATCGTGGT GCTCGAACAA CTGTTGAGCC AAGTACAGGC GGACGATTCG
GCCTACTTCG GCCTGAGCGT CGGTTTTGCA CCCGATAGCG CAACGGGCGG CGGCGAAGGC
GTGCAGACCG GCGGCGCGGC GAGCGTCGTG TTCGGCATCG CGCAGGTGAA CCTGTCGACG
GAAACCCGTC CGCCTGCCGG CGCCGCATTC GCGGCACTGC GTGAAACGGC CGGCGAAACG
CCGCAGCTCA CGCTGCTCAA TTCGCCGTCG GAGTTCGTGC GGCTGCTGTG GGAAGCGAGC
ATCACACGCT CGGGCGGCTT CTTCCTGTAC TACTACGATC GCGCCGCTGG AGGCGGGCTG
CCCGATCGCA TATTCAACGA TCGCAACGAG GCATCGCTGA CGTTGATCGT GCTGTACGCG
AAGCCCGCGG CCGTAGACGA TCAGGACCGC GTCACGAATT ACATGAATGC GGTAGTGACG
ACCGACGCGC TGGATACCGG CAACGCGGTG CTGTTCGCGG AAGCGGCGCC GGTTCCAGCC
ACCGTCACGA GCGGCGCTGG CGAGACGCTC GCGTCGCTCG CCGCGCAATG GTATTCGGAC
GAGGCGGATA TCGCGGAAGC CAACGCGAAC GTCGCGCTTC GCGCAGGCGC GCTCGTGCGC
GTGAGCGAAG GGGTCTACCA GGCGCCGCCG GGCGGCATCG CGCTCGCGCA GGTCGCGAGC
CGCTTCGGCA CCACGGTGCA GGCGCTGAAC GACGCCAATC CGCTGTGGGG CGGCTTGCCC
GACCCACTGC CGTTCCCGGC CGCGATCCGC GTACCGGACC TCACGCTCAC TGCCGGCACG
AGCGCGCACA CGGCGTCGCT CGCGGATATC GCGGGCTGGT ATGGCGAGCC GGTCGACGCG
CTTGCTTCCC ATAACGCGCG GGTGGCCCAG CTGTTCGCGG CCGGCGTGCC GCTCGTGATT
CCCGGCGGCC CGCGCGTGCG CTCGGCTGCG GTGCAGCCGG GCGTGCAGGC GCTCGCCGCG
CTGCGTCCGG CGCCGCCGCA AGTGGACGGT ACATCGCCGG ACTACGGAAC GGAACTGCTG
CTGAACAACT TCAGCCTGTT GAACCAGCAG GTATACGGCA ACGTCGACTT CCGGCCGAGC
GATCCGCCGG GGCTGCCCGC CGGACCGACA ACCAAGGCTC CGGAAGAAAA CGGCAACGAC
AAGGTTCGCA CCGTCGTTCC GGCAGATCAG GTCGAAGCGT GGAACTTCAG TCAAGCGCTG
CCTTACGCAC GCTTTGCGAA GCACGTGCCG CAAGCGCCGC GCGCGGCGGT CGCGCTGCCG
CCCGCAAGCG CCAGTCCGTA TTTTGGTGTC GGCGGCATTC TGCAGATCTC GTTTGCGTGG
CAGGACTACT ACGGCAACGT GCTGTCGACG CCGCTGTCGG ACCCCCTGGC CGGCGATGCG
GCGCCGTACA ACGACGCGCC GCTGCTCACC GGCTACACGG ATCCGCTGGT GTCGCTGTCG
CAATGGCCGT CGATTGCTTC GAACTGGCAG GTGCTGCCCG GCAGCGGCGG AGCGAACCCG
CGGCTGAATA TCGAGTTGAG CTTCGATCCG AGCCGTTACC AAGGCTTGCT GCAGGCATCG
GCGGCGACGC AGACGACGAT CACCGTGGTA TTCACGGACG CGCTCGACGC CGCATCGGTC
GGCGAACTGT CGCGCTGGCA ACTGGTGCCG GGAACCGTCG ATTCGGCGTC GCTGGCCGCG
GATGGAAAAA CCGTCACGCT GACGGTGCCG GCGCTCGACG ACGACCTGCG CTACACAGTG
ATCGCCACTG ATATCAAAGC GCAAGCGAGC GACATGCGCT ACAGCGGGCA GGCGTCGTTC
GACTGGCCAG ACAATCCGGT CACGCGCAGC AGCACCGTGC AACAGAACGC GTCGCAGGAC
CTGCACGTCT ATACACAGCT GTACTACCAG CTCACCGATC CCGCCGGAGT CGATCTGTCG
GCACAGTCTT CGTTGCTCGC GGGCGCGCAC GGAGCGCCGG GCAGCGTCGC GTATGCGCCG
GCCGCCGTCG ACGAGCTGAT GGACTGGCTG TTCGGCACAG CCGGCGCGGC CTCCAGTATC
TATGCGTTCG TGCTGGACCG CTCGAAGTTC CAGAGCGTTG CCGTGCCGCC CGCCGCCGGG
TTGCCGCTCG ACGTGGACGT GCCACCGCAG CAGGTGAACA CCGCGCAGAT CTTCCCGCTG
TGGACATCAT TTACGATGAC GCGCGCGCAC GGCCCGGTGC TGCCGGGGCT GGAGACGGTC
GCCGGCATTC GCAGCGCGAG CACGCGCGTC GCGCCGCTGC AGGATGCGCT CGGCGCCACC
GGCGGCACGC TCGGGCTCGT TACGTTCGCG ACCGGTTTCG AACAGGCGCT GTCGACGCCG
GGCAGTGTCC GGCTGAAGGT GGCGACCGGC GTCGATCGCA CCGCGCCACC CGCCACTGGA
GCGGCCAGCA CGGTCTGGGC GGTACGCGTC GGGCTCGCTG CCGGCAAGGC GATCTCGTAT
GCGATCGCCG ACGCGGGCAA CCCGGCCGTG TTCGCGCCGC AGCCGGCGAG CAACCGGTTA
ATCAGCCGCA CGCAAGTACC GATCTACGAC TACACGACCG GCAAGGGCAT CTCGTCGACG
CCGTCACGCA CCACCGATTT CACGGATGTC GATCTCGACA CGTGGTGCGC GCAGGTGTTC
GCCGCCGTCG ACGACGTGCT GACACCGCAA TTCACCGCGC CGATGCAGAT CGTCGGCGAG
TTGAAGTCCG CTGATTATCT GCAGTCGATC CTCGACGGCA AGAAGGGACT CGCGACGGTC
GCGAAGCTGT GGATGATTCC GGTCTTCGCG GGTGAAACCT CCGACCCGTC CGCCGCGCGC
GAAGCGTTTT ACCAGCAACT GCTGGTGCGA TTGTCAGCCG CATACACGAC GCGCGCCGCG
GTGGAGTTCC ATGCGAACGT GACCGCCGAC GTGATCGAGC CCGCCGCGGA TCAGCCGCCG
CGGCTGTTTG GGCCGGTCAC GCGGAACGGA CCGGTGTTCG AGGCCGCCAC TGTCGACGGA
CAGGCACTGA CCACCGTGTT CCTGCTGTTC AGCGACCCGA TGGACCCGGC CACCGCCGGC
AACATCGAAA ACTACGCATT GAGCAGCGGT GCCGGCGTGC TGACGGCGAC GGTCGACCGC
GGCACGGTGA CGCTCACGCT CGCGACGGAC GTGCAGCCGG GACAGACGAC GGTCACCGTC
AGCAATCTGA AGGACGCGAC GGGACGCGCG GTGCGGCCGC CGCTCACGCG CACTATCACG
ACTGGCTCGG CGAGCCTGCC GGCCAGCACG CTCGCGTTCA GTTCGCCGAA GCTTACGTTG
CAGGCCGGCG ATACGCGCGC GCTCACGTAT CTCGTCAACG CTCCGGATTC GGTACGCGGC
GCGGGCGGCG AGATAGTGTC GTATGTCGAG CTGGACATGA CGTACCAAGG CAGCCAGATC
GAGCACCAGA TCGGCGCGCT GCCGGGTATC GAGGATTACC AGGCATCGAC CTGGCTCAGC
TTCGTCGTGC CGGACACTGA CGGGCCGCTC GCGGCGGACC TCGGGAACTT CGCGGTGCCG
CTGGTGCTGC GCGCATTCCC GGCAAGCCCC GCGATGACGG AGCAGAGCGG CACCCCGACC
CATGACCTCG ACACCGCGAG CCTGCCCCTG CTCAAGCAGT GGGACTATGC GTTCACCTAT
TCGCTGCCGT TCCACTATCC GCAAGACCGC ATCTACGGCG AGGTCGAATT CAATCTACGC
ACCGCGCCGA CTTTGTTCGC GAGCTTCCCG GATGCGTTCG CGCAGCTCGC GGAGTTCATT
ACCGTGTTCC CGAAGGTGAA CGCGGATCTG CAAACCATCC TCGCCGGCAT CGATGCGACG
GTCGACCCGA TGACGGACCA GCAAAAGATC GACGACGCAT CGATCGCTCT GCAATCGTTC
ATCCAACTGG TCGACGAACT GGTCGACGCG GCAGGCGGAA ATACGCAAGG CAATGGCGAG
CGCCGCGGCG GCACGGGTCT CACGTTCCAG GCCCCGGCGC GACTGCTCAC CGGCGACCCG
TCGCTTACCT TCGCGTTCTA TGAGGAAGAA GGCTCGGCCG AGGTTGGCGA TACCGAAGGC
GCGCTAGTGG TGACGCTGGT CGGCGCCGTC CCGGCAGGAA TGGGTCAGCC GGTCGTCGAA
ATCGATCCGG CGCTGTACGA CGCGCAGCCT TGGCAGCCGC CTGGCGACAC GCAGAAGGCC
GGCGATGTCT TCCACTACGT GTACAAGCGC AAGGCCGGCC CCGGGCCGGA GGGTTCATAC
CTGAGCGCCG CGAACGGGCA GAACATTCCG GGCCGTACGG TGCGGCTGCC CGCGCTCGAC
ATCCTGCAGC GCCAGGACGC GTGGTCTACC GTCTGGGTGG AGCGCAACCG CGAACTCGTT
CCGGGCAAAC CGTCGGCTGA CGCGTTTGTC TACACGACGC CCGAAGTGCG CTTCGCCAGC
CCGCTGTATC CGACCAACGA CGCCAATGCG ATCATCGATG TGGCGGCGAT CCCGTCCGGT
ACGCCGGTGA CGCGTTCGCT TCAGGAGCAT TTCGACGCGC TCTTTGCCTA TCTGCTCGCC
GGCGACACGC TGCCGCAGAT CGTCGCACAG GTGGAGGTGA CATACGGATA TGCGCTGAAC
GCCGCGCTCG ATAAGATCGT GTTGCCGGTG TTGATGCAGG CGCCGCTGAC GGTCGACGTC
GCCGGAACCG GCGCGGGCAC CATCGCGAAA ATGACCGCTG ACTGGACTGC CGCGATCGAA
ACGTGGTTCT CCACATACGA ACCGACGGGC GGCGGAACAC TCTGGATGGA TCTGACGCTG
ATGTCGAATC TTACCGGCCA GCCGATGCCG TTGCTGCGGA TGCGGCGATT GATGCTGTCC
ATCGCGCAGG TGGTCCCGCC GCTGCCATGC CGCTAG
 
Protein sequence
MAGLNDVIER LSQGTQNVLT VHLGARITKV RDYLNALDEI RVFTANSQNF GHQSSSVNIL 
RNLIRMGAPG PYTFALSASN SADYADLEEK IRLLIPQFRQ VGVTFELGAG GQSADVTVVR
LDKALAPAQF AISGGFDDLE NKTPPYHLLN VTNYVQLQPY AWNRGTNMVR IKPPGGTASE
YNLDELNPTT LLARRAFYLA DPELTQSDRE AIAQTPYANK ARVIERLLER REAGEITLFP
VYGVTTKGSA YTSLYNAVTG ALIAQNTHPT VKKTVMVQIT TLTASEWQEF LFLMRDPAGQ
MVNKIRTTPD FRGWNEENKV KDRVQDLGSP NSEPTVAELD EELSYLEDDQ LLVVYIGKIP
APLFDVLYAS ATLPPVLEGQ NTAELMLNLG KPYFKITSNN SREADARFSY ATLPLSSTGA
GTDATNSLDE SFNGIYFTRP DNWYRDRPTY PPTQLPAMIN AYVQPAGNAR ATYFAAQRTF
FHDELNDKLL RGLDLFVNLI GPAALEEHRL ALAHAELDDA NAPVHEAGAT ARAPARIAHR
KPPTRVAGGD ANGASELLEA FYEDLTSHTV DGVLDFLLAV TDGILNEFFR QVVIDVVFTI
TDTVTEINAD KTEVTLTGKS KAFGAGNLTL AFSFTDSGGT IAGKMSGAFT DTVWAFPGAQ
WLSVANPSLA LAIDSNAAVP VTGTVGATFT AGIAAKASLT LPSEPGRLLL QAEFLAPRPS
ITNIFQMLGG INIQALLPSQ IQLFSDIEVQ NLALRYSYAN GVMEYIGVTL GTPENRSWQL
VPGVTVTGLG FSALTDYPGD LQRRSTRYVI GGRFDIAGGH AQLEARVPAL RVTGGLIDGS
PPITLAAIVT EYLGADFAAA IPASVSSTAI EQLSFIVDQA QGAYSFSMDV SAQWPVPSAA
NALFTITGLN FAIDAVSRDI NPPKADAGGN NGAGGTQTEI EGSFGGSLIV LPNSESPIGL
STTATYKTAA KAWTFDAQQT SGVVSLGALL VYYLGNTWQA PQGQEYAIDG LGLTITSSPT
DSTWAFTGKT ADNWVVPFLD VSLAAKLRMG DAGAKAEVPG KFGRLDLEVI WQNIDLTVWF
DYNPKVKQYG ITWGLLEGVV DGPDPTTQDW TATLGFKQNT TLGSMIETMV SWATGSKFGL
ESPWSFLNAI PLSNLALKYT FNQTTPSRNK VSFAVTIGPI NLGFARIDSI DVGYQSTGED
RGVMVTLNGS FFWQSDPSTP LEWDASQPGT APAPPGNGNK YLDLRLLAMG QHITLPCFAT
ADTVQKAIAC MATLPDPKPG QIPAVRFDAQ SAWLIGTDFG VLKIDSGQTG NNANALRVTN
DGSSLAESSG YVLTLQAVFN DPHLYGLRIA LDGAAAKVFK GLDFQIMYRQ VSDTVGVYQA
EITLPDLMRH LTVGAYSLTL PVFGIAVYTN GDFQVDIGFP WNENFSRSFT IEAIIPPGIP
VLGSAGFYFG KLSSASTNRV PASSYGTFNP VLVFGFGMQV GFGKSIEYGI LSAGFSVTVV
GILEGILAKW NPYQLTHSGR EPSTQLQGDY YFWLRGTVGI VGRVYGSVDF AIVKANVDIT
VKLLLQLTYE SYVSITITVI ASVDVSVSVK INLGLFKISI SFSFSMRLKE TFTIDNRGAA
PWLGDGRNVR GVLRLPVERR LSGFARAQAR DSLLVSAPNW GNLRPDGVTD LSGYLVPGLT
AARDEWTPQG EPANQLSCWV ALLLIESVPP AGQDAGASKL KAAGSAPDSS FEALAKMVLR
WAIAAVQGPM TPNEVDRYPV PATLLDWLAD DVLVSTGDDP TPIPLDAVQA FLDTHFRFNL
RVPPTDQDAS ADTAYFPAPP QLRVMIPPYG NDYPGVQYTL GSYNALGENT LAELRAWFDQ
LAVQVEREQA ANGAAARAFV EEAPLSMAGW MFSDYFLLLA RQMVKAAQDA LRDFKYALDA
NETPDDVVSW VNTTGQLNGL YTLNDVFGAN ALHALVAEKT LTIGVTSSIT LDKTGQTFTS
LAKAFDDALP ASAIASANAA DAALLQPGAT ITYPGFDPYT SVAGDTLVSI AAHYQAKLND
LLADSDVLDA AGMLRIGASA LMPYTAYTAL ATDTFASVAA LPVYAGGFDA AALATANAGR
SVLLEGVKIE YPDKDAYTVQ PRDTLGDVAN AFGVTVSDLL ATSAVLTQPG LLAPVASLTV
PAFRYTTQQG DDLAQVAARF GVAVSVLADQ PANGTVAGLF DTGDTLDLPH LPQFPLAELL
AEAQRSGMLQ HLSGIASSYT MHGLRFPTSG PTGTGGQWSI VPNEMGMWVH DVNGTLKLPP
QAGLYALTGQ QFPLPALGAD PFAATFDSVA GAGSSWLRFV DGNGGPTDRL TLSVTPGTPD
ATRIAQVTAA AKTRLVVPMD RLGAGKMYDT ALATYPFTSA LQWLSTHTVA LPYGQPPAGV
QSLRVWQLPG ALAALPDPAI HAVNPRFALR VARYDDATGA TETTGVDSYG WASTIGFTVR
RIPPVAGSPA SVDTYEVVGA SGAAIVVLEQ LLSQVQADDS AYFGLSVGFA PDSATGGGEG
VQTGGAASVV FGIAQVNLST ETRPPAGAAF AALRETAGET PQLTLLNSPS EFVRLLWEAS
ITRSGGFFLY YYDRAAGGGL PDRIFNDRNE ASLTLIVLYA KPAAVDDQDR VTNYMNAVVT
TDALDTGNAV LFAEAAPVPA TVTSGAGETL ASLAAQWYSD EADIAEANAN VALRAGALVR
VSEGVYQAPP GGIALAQVAS RFGTTVQALN DANPLWGGLP DPLPFPAAIR VPDLTLTAGT
SAHTASLADI AGWYGEPVDA LASHNARVAQ LFAAGVPLVI PGGPRVRSAA VQPGVQALAA
LRPAPPQVDG TSPDYGTELL LNNFSLLNQQ VYGNVDFRPS DPPGLPAGPT TKAPEENGND
KVRTVVPADQ VEAWNFSQAL PYARFAKHVP QAPRAAVALP PASASPYFGV GGILQISFAW
QDYYGNVLST PLSDPLAGDA APYNDAPLLT GYTDPLVSLS QWPSIASNWQ VLPGSGGANP
RLNIELSFDP SRYQGLLQAS AATQTTITVV FTDALDAASV GELSRWQLVP GTVDSASLAA
DGKTVTLTVP ALDDDLRYTV IATDIKAQAS DMRYSGQASF DWPDNPVTRS STVQQNASQD
LHVYTQLYYQ LTDPAGVDLS AQSSLLAGAH GAPGSVAYAP AAVDELMDWL FGTAGAASSI
YAFVLDRSKF QSVAVPPAAG LPLDVDVPPQ QVNTAQIFPL WTSFTMTRAH GPVLPGLETV
AGIRSASTRV APLQDALGAT GGTLGLVTFA TGFEQALSTP GSVRLKVATG VDRTAPPATG
AASTVWAVRV GLAAGKAISY AIADAGNPAV FAPQPASNRL ISRTQVPIYD YTTGKGISST
PSRTTDFTDV DLDTWCAQVF AAVDDVLTPQ FTAPMQIVGE LKSADYLQSI LDGKKGLATV
AKLWMIPVFA GETSDPSAAR EAFYQQLLVR LSAAYTTRAA VEFHANVTAD VIEPAADQPP
RLFGPVTRNG PVFEAATVDG QALTTVFLLF SDPMDPATAG NIENYALSSG AGVLTATVDR
GTVTLTLATD VQPGQTTVTV SNLKDATGRA VRPPLTRTIT TGSASLPAST LAFSSPKLTL
QAGDTRALTY LVNAPDSVRG AGGEIVSYVE LDMTYQGSQI EHQIGALPGI EDYQASTWLS
FVVPDTDGPL AADLGNFAVP LVLRAFPASP AMTEQSGTPT HDLDTASLPL LKQWDYAFTY
SLPFHYPQDR IYGEVEFNLR TAPTLFASFP DAFAQLAEFI TVFPKVNADL QTILAGIDAT
VDPMTDQQKI DDASIALQSF IQLVDELVDA AGGNTQGNGE RRGGTGLTFQ APARLLTGDP
SLTFAFYEEE GSAEVGDTEG ALVVTLVGAV PAGMGQPVVE IDPALYDAQP WQPPGDTQKA
GDVFHYVYKR KAGPGPEGSY LSAANGQNIP GRTVRLPALD ILQRQDAWST VWVERNRELV
PGKPSADAFV YTTPEVRFAS PLYPTNDANA IIDVAAIPSG TPVTRSLQEH FDALFAYLLA
GDTLPQIVAQ VEVTYGYALN AALDKIVLPV LMQAPLTVDV AGTGAGTIAK MTADWTAAIE
TWFSTYEPTG GGTLWMDLTL MSNLTGQPMP LLRMRRLMLS IAQVVPPLPC R