Gene BTH_II1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1994 
Symbol 
ID3845889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2429307 
End bp2441705 
Gene Length12399 bp 
Protein Length4132 aa 
Translation table11 
GC content66% 
IMG OID637839295 
ProductLysM domain-containing protein 
Protein accessionYP_440188 
Protein GI83716175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGGAC TGAATGACGT CATCGAACGG TTGTCGCAGG GAACGCAGAA CGTCCTCACG 
GTTCATCTCG GTGCACGCAT CACGAAAGTC CGGGACTATC TGAATTCGCT TGGCGAGATA
CGCGTTTTCA CCGCGAACAG TCAGAACTTC GGACATCAAT CGTCGTCGGT CAATATCCTG
CGCAACCTGA TTCGCATGGG TGCGCCGGGG CCGTATACAT TCGCGCTGTC CGCATCCAAC
TCCGCCGACT ACGCAGACCT GGAAGAAAAG ATCCGCTTGC TGATTCCGCA GTTCCGGCAG
GTCGGCGTCA CGTTTGAACT GGGTGCCGGC GGGCGGACCG CCGATGTCAC CGTCGTGCGG
CTCGATAAAG CGCTTGCGCC CGCGCAGTTC GCGATCAGCG GCGGTTTTGA CGATCTAGAG
AACAAGACGC CGCCGTATCA CCTGCTGAAT GTGACCAACT ACGTTCAGCT TCAGCCGTAT
GCGTGGAATC GCGGGACCAA TATGGCCCGG ATCATGCCGC CCGGCGGCAC GGCGAGCGAA
TACAACCTCG ATGAGCTGAA TCCGACCACG CTGCTTGCGC GGCGCGCGTT CTATCTCGCC
GATCCCGAGC TGACGCAGTC TGACCGAGAG GCAATTGCGC AAACGCCGTA CGCGAACAAG
GCGCGCGTGA TCGAGCGTCT GCTCGAACGA CGGGAAGCTG GCGAAATCAC GCTTTTTCCC
GTCTACGGCG TCACGACGAA AGGGAGCGCC TATACGTCGC TGTACAACGC GGTCACCGGC
GCGTTGATCG CGCAGAACAC ACATCCCACC GTCAAAAAGA CGGTGATGGT TCAGATCACG
ACCCTCACGG CCTCCGAGTG GCAAGAGTTC CTGTTTTTGA TGCGGGACCC GGCCGGCGAG
ATGGTAAACA AGATCAGGAC GACGCCTGAT TTCCGGGGCT GGAACGAGGA GAATAAGGTC
AAAGACCGGG TCCAGGATCT CGGCAGCCCG AATAGCGTAC CCACAGTTGA GCAGCTCGAC
GAAGAGCTGT CGTATCTCAA GGACGATCAA CTGCTGGTGG TCTACATCGG CAAAATTCCG
GCGCCGCTGT TCGATGCGCT TTATGCGAGC GCGACTTTGC CACCGGTGCT GGAAGGGCAG
AATACCGCCG AACTGATGCT CAATCTCGGC AAGCCGTACT TCAAGATCAC GAGCAATAAC
AGTCGCGAAG CGGATGCGCG ATTCAGCTAC GCGACGCTGC CGCTCAGTTC CGCCGGCGCG
GGCACCGACG CGACCAATTC GCTCGACGAG TCGTTCAACG GCATTTACTT CACGCGTCCC
GACAACTGGT ACAGAGATCG CCCGACCTAT CCGCCGACGC AACTGCCGGC GATGATCAAC
GCGTATGTGC AGCCGGCCGG AAACGCCCGT GCGACTTACT TCGCGGCGCA GCGTACGTTT
TTTCACGACG AGCTGAACGA CAAGCTGCTG CGCGGCCTCG ATCTGTTCGT CAATCTGATC
GGGCCGGCGG CGCTCGAGGA GCACCGACTC GCGCTCGCGC ATGCCGAGCT GGACGACGCG
AACGCGCCTG TTCACGAGGC CGGCGCCACC GCACGGGCGC CCGCGCGCAT CGCGCACCGC
AAGCCGCCGA CACGTGTCGC AGGCGGCGAC GCGAACGGCG CAAGCGAGCT GCTCGAAGCG
TTCTACGAGG ATCTGACGAG TCACACGGTC GACGGTGTGC TCGATTTTCT GCTGGCGGTT
ACCGACGGCA TCCTGAACGA GTTCTTCAGG CAAGTCGTGA TCGACGTCGT GTTCACGATC
ACCGACACGG TGACAGAAAT CAACGCCGAT AAAACCGAAG TCACGTTGAC CGGCAAGTCG
AAGGCGTTCG GCGCAGGCAA CCTGACGCTC GCGTTCTCGT TTACCGACAA CGGCGGGACG
ATCGCGGGCA AGATGTCCGG CGCGTTTACG GACACGGTCT GGGCGTTCCC GGGCGCGCAG
TGGCTCAGCG TCGCAAACCC GTCGCTTGCG CTTGCCATCG ACAGCAACGC GGCCGTCCCC
GTGACGGGCA CGGTCGGTGC GACGTTTACG GCCGGGATCG CCGCGAAGGC GTCGCTGACG
CTGCCGTCCG AACCGGGGCG CCTGCTGCTG CAAGCGGAAT TTCTCGCGCC GCGACCGAGC
ATCACGAATA TCTTTCAGAT GCTCGGGGGC ATCAACATCC AGGCGCTGCT GCCATCGCAA
ATCCAGCTCT TCAGCGACAT CGAGGTGCAG AACCTTGCGC TGCGCTACAG CTACGCGAAC
GGCGTGATGG AGTACATCGG CGTCACGCTC GGCACACCCG AGAATCGAAG CTGGCAACTG
GTGCCCGGCG TGACGGTTAC CGGGCTCAGC TTCAGCGCGC TGACCGATTA TCCCGGCGAT
CTGCAGCGGC GCAGCACGCG CTACGTGATC GGCGGCCGGT TCGACATCGC CGGCGGCAAC
GCGCAACTGG AGGCGCGCGT GCCCGCGCTG CGCGTGACCG GCGGCCTGAT CGACGGCAGC
CCGCCGATCA CGCTTGCCGC TATCGTCACC GCGTATCTGG GCGCCGATTT CGCCGCGGCC
ATTCCGGCGA GCGTCTCGAG CACCGCGATC GAGCAATTGA GCTTCATGGT GGATCAGGCG
CAGGGCGCGT ACAGCTTCTC GATGGACGTC TCCGCGCAGT GGCCGGTGCC GTCTGCCGCC
AATGCGCTGT TTACGATCAC CGGCCTGAAC TTCGCGATCG ACGCCGTTTC ACGCGATATC
AATCCGCCGA AAGCGGATGC CGGCGGCAAC AACGGCGCCG GCGGCACGCA GACTGAAATT
GAAGGGAGCT TCGGCGGCTC GCTGATCGTC CTGCCGAACT CGGAGAGCCC GATCGGGCTC
TCCACCACGG CTACGTACAA GACAGCCGCG AAGGCATGGA CCTTCGACGC GCAGCAAACG
TCCGGAGCGG TGAGCCTCAG CGCGTTGCTC GTTTATTACC TCGGCAACAC GTGGCAGGCC
CCGCAAGGGC AGGAGTACGC GATCGACGGG CTTGGGCTGA CAATCACGAG TTCGCCCACC
GATTCGACGT GGGCGTTCAC GGGCAAGACC GCCGACAACT GGGTTGTGCC GTTCCTGGAC
GTGAGCCTCG CAGCGAAGCT GCGCATGGGC GACGCGGGAG CGAAGGCAGA GGTGCCGGGG
AAATTCGGCA GGCTCGACCT CGAAGTGATC TGGCAGAACA TCGACCTCAC CGTCTGGTTC
GACTACAACC CGAAGGTCAA GCAATACGGG ATCACGTGGG GCCTGCTCGA AGGTGTGGTC
GACGGACCGG ACCCGACGAC CCAGGACTGG ACCGCGACGC TCGGCTTCAA GCAGAACACG
ACGCTCGGCT CGATGATCGA AACGATGGTG TCGTGGGCGA CCGGCTCGAA GTTCGGGCTC
GAGTCGCCAT GGAGTTTCCT GAACGCGATC CCGCTGTCGA ACCTCGCGCT CAAATACACG
TTCAACCAGA CCACGCCGAG TCGCAACAAG GTCAGCTTCG CCGTGACAAT CGGCCCGATC
AATCTGGGCT TCGCGCGCAT CGACAGCATC GACGTCGGCT ATCAATCCAC CGGCAAAGAT
CGCGGCGTGA TGGTAACGCT CAATGGGTCG TTCTTCTGGC AGTCGGATCC GAGCACGCCG
CTCGCATGGG ATGCGAGCAA GCCGGGCACG GCGCCTGCGC CTCCCGGCAA CGGCAACAAG
TATCTCGACC TGCGGCTGCT GGCGATGGGC CAGCACATCA CGCTGCCCTG CTTCGCGACC
GCCGATACGG TGCAAAAGGC CATCGCCTGC ATGGCCACGC TGCCCGATCC GAAGCCCGGC
CAGATTCCGG CCGTGCGCTT CGATTCGCAA AGCGCGTGGC TGATCGGCAC CGACTTCGGC
GTGCTGAAGA TCGACAGCGG GCAAGCCGGC AATCACGCGA ACGCGCTGCG CGTGACAAAC
GACGGCGATT CGCTCGCGGA ATCGGCCGGC TATGTATTGA CGCTGCAGGC GGTGTTCAAC
GACCCGCATC TGTACGGGCT GCGAATCGCG CTCGACGGCG CGGCGGCCAA GGTATTCAAG
GGCCTCGACT TCCAGATCAT GTACCGCCAG GTGAGCGACA CCGTCGGCGT GTACCAGGCG
GAGATCACGC TGCCCGACCT GATGCGTCAT TTGACGGTCG GCGCGTATTC ACTCACGCTG
CCCGTGTTCG GCATCGCCAT CTATACGAAC GGCGACTTCC AGGTGGACAT CGGCTTCCCG
TGGAACGAGA ATTTTTCGCG TTCGTTCACG ATCGAGGCGA TCATCCCGCC CGGCATTCCG
GTGCTGGGCT CGGCCGGTTT CTATTTCGGC AAGCTCTCCA GCGCGAGCAC CAATCGCGTG
CCCGCGTCGT CGTACGGCAC GTTCAATCCG GTGCTCGTAT TCGGCTTCGG CATGCAGGTG
GGCTTCGGCA AGTCGATCGA ATACGGCATC CTGTCGGCCG GCTTCAGCGT GACCGTGGTC
GGCATTCTCG AAGGCATCCT TGCGAAGTGG AACCCGTATC AGCTCACCCA CTCGGGGCGC
GAGCCGTCCA CCCAGTTGCA GGGCGACTAC TACTTCTGGC TGCGCGGCAC GGTCGGCATC
GTCGGCCGCG TGTACGGCAG CGTCGACTTC GCGATCGTGA AGGCGAACGT CGACATCACG
GTCAAGCTGC TGCTGCAACT CACGTACGAA TCGTATGTGT CGATCACGAT CACGGTGATC
GCCTCGGTCG ACGTGTCGGT TAGCGTGAAG ATCAACCTCG GGTTGTTCAA GATCAGCATC
TCGTTCTCGT TCTCGATGCG GCTGAAGGAG ACCTTCACGA TCGATAACCG GGGCGCCGCG
CCATGGCTCG GCGATGGCCG CAACGTGCGC GGCGTGCTGC GTCTGCCGGT CGAGCGTCGG
CTCTCCGGCT TCGCGCGCGC GCAAGCGCGC GATAGCCTGC GGGTGAGCGC GCCGAACTGG
GGCAACCTGC GGCCGGACGG CGTGACGGAC CTGTCGGGCT ACCTCGTGCC GGGGCTCACC
GCGGCGCGCG ACGAATGGAC GCCGCAGGGC GAACCGGCGA ACCCGTTGTC GTGCTGGGTC
GCGCTGCTGC TGATCGAATC CGTGCCGCCG GCTGGGCAGG ATGCCGGCGC GAGCAAGCTC
AAGGCGGCGG GAAGCGCGCC CGACAGCTCG TTCGAGGCGC TCGCGAAAAT GGTGCTGCGC
TGGGCCATCG CGGCCGTTCA GGGGCCGATG ACGCCGGACG AAGTCGACCG ATACCCGGTT
CCCGCGACGC TACTGGACTG GCTCGCGGAC GAAGTGCTCG TCAGCACCGG CGACGATCCG
ACGCCGATCC CGCTTGACGC GGTGCAGGCG TTCCTCGACA CGCATTTCCG ATTCAATCTG
CGCGTGCCGC CGACCGACCA GGATGCGTCG GCCGATACCG CGTATTTCCC CGCGCCGCCG
CAACTGCGCG TGACGATTCC GCCGTACGGC AAAGACTACC CGGGCGTGCA GTACGATCTC
GGCAGCTATA ACGCGCTCGG CGAAAACACG CTGGCCGAAC TGCGCGCATG GTTCGACCAG
CTTGCGGTGC AGGTGGAGCG CGAGCAGGCG GCGAACGGCG CGGCGGCGGC GCGGGCATTC
GTGGAAGAAG CGCCGCTGTC GATGGCCGCA TGGATGTTCT CCGACTACTT CCTGCTGCTC
GCCCGGCAGA TGGTCAAGGC CGCGCAAGAC GCACTGCGCG ACTTCAAGTA CGAGCTTGAC
GCGAACGAGA CACCGGACGA CATCGTGAAC TGGGTGAACA CGACCGGTCA GTTGAACGGG
CTGTACACGC TGAACGACGT GTTCGGCGCC AATACGGTGC ACGCGCTCGT CACCGAAAAA
ACACTGACGA TCGGCGTCAC GAGCTCGATC ACCCTGGACA AGACCGGCCA GACCTTCACA
TCGCTGGCCA AGGCCTTCGA CGACGCGCTG CCCGCAAGTG CGATCGCGGC GGCCAATGCG
GCCGACGCTG CGCTGCTGCA ACCGGGTGCG ACGATCACTT ACCCGGGCTT CGATCCGTAC
ACGAGCGTTG CTGGCGACAC GCTCGTCAGC ATCGCCGCGC ATTACAAGGC GAAGCTCAAC
GATCTGCTGG CCGATTCGGA TGTGCTCGAC GCGGCCGGCA TGCTGCGTAT CGGCGCGAGC
GCGCTCATAC CGTACACGTC GTACACGGCG CTCGCGACCG ACACCTTCGC GTCGGTCGCC
GCGCTGCCCG TGTACGCCGG CGGTTTCAGC GCGGCCGCGC TGGCCACCGC GAATGCGGGC
AGCAGCGTGC TGCTCGCCGG CGTGAAGATC GCGTATCCGG ACAAGGACGC GTACACCGTG
CAGCCGCGCG ACACACTTGG CGACGTTGCG AACGCGTTCG GCGTGACCGT GTCCGACCTG
CTCGCGAGCA GCGCGGTGCT GACGCAACCC GGGCTGCTCG CGCCGGTCGC GTTGCTTACG
GTTCCGGCGT TCCGCTACAC GACGCAGCCA GGCGACGATC TCGCGCAGGT CGCGGCGCGC
TTCGGCGTGG CCGTCTCGGT GCTGGCCGAC CAGCCCGCCA ACGGCACGGT GGCCGGGTTG
TTCGACGCCG GCGACACGCT CGACTTGCCG CACCTGCCGC AATTTCCGTT GGCCGAACTG
CTCGCCGAAG CGCAACGCTC GGGCATGCTG CAGCACCTGT CCGGCATCGC GAGCAGCTAT
ACGATGCACG GGCTGCGCTT CCCGACGTCC GGCCCGACCG GCACGGGCGG ACAATGGTCG
ATCGTGCCCA ACGAGATGGG CATGTGGGTG CATGACGTGA ACGGCACGCT GAAGCTGCCG
CCGCAAGCCG GGCTCTATGC GCTGACCGGC CAGCAATTCC CGCTGCCCGC ACTCGGCGCC
GATCCGTTCG CGGCGACCTT CGACAGCGTG GCCGGCGCCG GTTCGTCGTG GCTGCGCTTC
GTCGACGGCA ACGGCGCCCC GGCCGACCAT CTGACACTGT CGGTCACGCC CGGCACGCCC
GACGCGACAC GTATCGCGCA GGTGACCGCC GCCGCGAAGA CACGGCTCGA CGTGCCGATG
GATAGGCTCG GCGCGGGCAA GATGTACGAT ACCGCCCTCG CGACCTATCC GTTCACGTCC
GCGTTGCAGT GGCTGAGCAC GAACACCGTC GCGCTGCCTT ATGGCCAGCC GCCGGCCGGC
GTGCAGTCGC TGCGCGTGTG GCAACTGCCT AGCGCGCTCG CGGCGCTTCC CGATCCGGCC
ACCCATGCGG TGAATCCGCG CTTTGCACTG CGGGTGGCCC GCTACGACGA TGCGACGGGC
GCCACCGAAA CCACCGGGGT GGACTCGTAC GGATGGGCGT CGACCATCGG CTTCACGGTG
CGGCGCATTC CGCCCGTAGC GGGCAGTCCC GCGTCCGTCG ACACGTACGA GGTGGTCGGC
GCGGGCGGCG CCGCAATCGT GGTGCTCGAA CAACTGTTGA ACCAGGTGCA GGCGAACGAT
TCGGCCTACT TCGGCCTGAG CGTCGGTTTT GCGCCCGATA GCGCAACGGG CGGCGGCGAA
GGCGTGCAGA CCGGCGGCGC GGCGAGCGTC GTGTTCGGCA TCGCGCAGGT GAATCTGTCG
ACGGAAACCC GTCCGCCTGC CGGCGCCGCA TTCGCGGCAC TGCGTGAAAC GGCCGGCGAA
ACGCCGCAGC TCACGCTTCT CAATCCGCCG TCGGAGTTCG TGCGGCTGCT ATGGGAAGCG
AGCATCACAC GCTCGGGCGG CTTCTTCCTG TACTACTACG ATCGCGCCGC TGGAGGCGGG
CTGCCCGATC GCATATTCAA CGACCGCAAC GAGGCATCAC TGACGTTGAT CGTGCTGTAC
GCGAAGCCCG CGGCCGTAGA CGACCAGGAC CGCGTCGCGA ATTACATGAA CGCGGTAGTG
ACGACCGACG CGCTGGATAC CGGCAACGCG GTGCTGTTCG CGGAAGCGGC GCCGGTTCCA
GCCACCGTCA CGAGCGGCGC CGGCGAGACG CTCGCGTCGC TCGCCGAGCA ATGGTATTCG
GACGAGGCGG ATATCGCGGA AGCCAACGCG GACGTCGCGC TTCGCGCAGG CGCGCTCGTG
CGCGTGAGCG AAGGGGTCTA CCAGGCGCCG CCGGGCGGCA TCGCGCTCGC GCAGGTCGCG
AGCCGCTTCG GCACCACGGT GCAGGCGCTG AACGACGCCA ATCCGCTGTG GGGCGGCTTG
CCCGACCCGC TGCCGTTCCC GGCCGCGATC CGCGTACCGG ACCTCACGCT CACCGCCGGC
ACGAGCGCGC ACACGGCGTC GCTCGCGGAT ATCGCGGGCT GGTACGGCGA GCCGGTCGAT
GCGCTCGCTT CCCATAACGC GCGGGTGGCC CAGCTGTTCG CGGCCGGCGT GCCGCTCGTG
ATTCCCGGCG GCCCGCGCGT GCGCTCGGCG GCCGTGCAGC CGGGCGTGCA GGCGCTCGCC
GCGCTGCGTC CGGCGCCGCC GCAAGTGGAC GGTACGTCGC CGGACTACGG AACGGAACTG
CTGCTGAACA ACTTCAGCCT GTTGAACCAG CAGGTGTACG GCAACGTCGA CTTCCGGCCG
AGCGATCCGC CGGGGCTGCC CGCCGGACCG ACAACCAAGG CTCCGGAAGA AAACGGCAAC
GACAAGGTTC GCACCGTCGT TCCGGAGGAT CAGGTCGAAG CATGGAACTT CAGTCAGGCG
CTGCCTTACG CACGCTTTGC GAAGCACGTG CCGCAAGCGC CGCGCGCGGC GCTCGCGCTG
CCGCCCGCGA GCGCCAGTCC GTATTTTGGT GTCGGCGGCA TTCTGCAGAT CTCGTTTGCG
TGGCAGGACT ACTACGGCAA CGTACTGTCG ACGCCGCTGT CGGACCCACA GGCCGGCGAT
GCGGCGCCGT ACAACGACGC GCCGCTGCTC ACCGGCTACA CGGACCCGCT GGTGTCGCTG
TCGCAATGGC CGTCGATTGC TTCGAACTGG CAGGTGCTGC CCGGCAGCGG CGGAGCGAAC
CCACGGCTGA ATATCGAGTT GAGCTTCGAT CCGAGCCGCT ACCAAGGCTT GCTGCAGGCA
TCGGCGGCAA CGCAGACGAC GATCACCGTG GTATTCACGG ACGCGCTCGA CGCCGCATCG
GTCGGCGAAC TGTCGCGCTG GCAACTGGTG CCGGGAACCA TCGATTCGGC GTCGCTGGCC
GCGGATGGAA AAACCGTCAC GCTGACGGTG CCGGCGCTCG ACGGCAACCT GCGCTACACA
GTGATCGCCA CTGATATCAA AGCGCAAGCG AGCGACACGC GCTACAGCGG GCAGGCGTCG
TTCGACTGGC CAGACAATCC GGTCACGCGC AGCAGCACCG TGCAACAGAA CGCGTCGCAG
GACCTGCACG TCTATACACA GCTGTACTAC CAGCTCACCG ATCCCGCCGG AGTCGATCTG
TCGGCGCAGT CTTCGTTGCT CGCGGGCGCG GACGGAGCGC CGGGCAGCGT CGCGTATGCG
CCGGCCGCCG TCGGCAAGCT GATGGACTGG CTGTTCGGCA CCGCCGGCGC GGCCGCCAGC
GTCTATGCGT TCGTGCTGGA CCGCTCGAAG TTCCAGAGCG TTGCCGTGCC GCCCGCCGCC
GGGTTGCCGC TCGACGTGGA CGTGCCACCG CAGCAGGTGA ACACCGCGCA GATCTTCCCG
CTGTGGACAT CATTTACGAT GACGCGCGCG CACGGCCCGG TGCTGCCGGG GCTGGAAACG
GTCGCCGGCA TTCGCAGCGC GATCACGCGC GTCGCGCCGC TGCAGGATGC GCTCGGCGCC
ACCGGCGGCA CGCTCGGGCT CGTCACGTTC GCGACCGGTT TCGAACAGGC GCTGTCGACG
CCGGGCAGTG AGCGGCTGAA GGTGGCGACC GGCGTCGATC GCACCGCGCC GCCCGCCACT
GGAGCGGCCA GCACGGTCTG GGCGGTACGC GTCGGGCTCG CGGCCGGCAA GGCGATCTCG
TATGCGATCG CCGACGCGGG CAACCCGGCC GTGTTCGCGC CGCAGCCGGC GAGCAACCAG
TTGATCAGCC GCACGCAAGT GCCGATTTAC GACTACACGA CCGGCAAGGG CATCTCGTCG
ACGCCGTCAC GCACCACCGA TTTCACGGAT GTCGATCTCG ACACGTGGTG CGCGCAGGTG
TTCGCCGCCG TCGACGACGT GCTGACGCCG CAATTCACCG CGCCGATGCA GATCGTCGGC
GAGTTGAAGT CCGCCGATTA TCTGCAGTCG ATCCTCGACG GCAAGAAGGG GCTCGCGACG
GTCGCGAAGC TGTGGATGAT TCCGGTCTTC GCGGGCGAAA CCTCCGACCC GTCTGCCGCG
CGCGAAGCGT TTTACCAGCA ACTGCTGGTG CGATTGTCAG CGGCGTACAC GACGCGCGCC
GCGGTGGAAT TCCATGCGAA CGTGACCGCC GACGTGATCG AGCCCGCCGC GGATCAGCCG
CCGCGGCTGT TCGGGCCGGT CACGCGGAAC GGACCGGTGT TCGAGGCCGC CAATGTCGAC
GGACAGGCAC GGACCACCGT GTTCCTGCTG TTCAGCGACC CGATGGACCC GGTCACCGCC
GGCAACATCG AAAACTACGC ATTGAGCAGC GGTGCCGGCG TGCTGACGGC GACGGTCGAC
CGCGGCACGG TGACGCTCAC GCTCGCGACG GACGTGCAGC CGGGACAGAC GACGGTCACC
GTCAGCAATC TGAAGGACGC GACGGGACGC GCGGTGCGTC CGCCGCTCAC GCGCACTGTC
ACGACCGGCT CGGCGAGTCT GCCGGCCAGC ACGCTCGCGT TCAGTTCGCC GAAGCTTACG
TTGCAGACCG GCGATACGCG CGCGCTCACG TATCTCGTCA ACGCTCCGGA TTCGGTACGC
GGCGCGGGCG GCGAGATAGT GTCGTATGTC GAGCTGGACA TGACGTACCA GGGCAGCCAG
ATCGAGCATC AGATCGGCTC GCTGCCGGGT ATCGAGGATT ACCAGGCATC GACCTGGCTC
AGCTTCGTCG TGCCGGACAC TGACGGGCCG CTTGCAGCGG ACCTCGGGAA TTTCGCGGTG
CCGCTGGTGC TACGCGCATT CCCGGCGAGC CCCGCGATGA CGGAGCAGAG CGGCACGCCG
ACCCATGACC TCGACACCGC AAGCCTGCCC CTGCTCAAGC AGTGGGACTA TGCGTTCACC
TATTCGCTGC CGTTCCACTA TCCGCAAGAC CGCATCTACG GCGAGGTCGA ATTCAATCTG
CGCACCGCGC CGACATTGTT CGCGAGCTTC CCGGATGCAT TCGCGCAGCT CGCGGAGTTC
ATTACCGTGT TCCCGCAGGT GAACGCGGAT CTGCAGACCA TCCTTGCCGG CATCGATGCC
ACGATCGACC CGATGACGGA CCAGCAAAAG ATCGACGACG CATCGATCGC GCTGCAATCG
TTCATCCGAC TGGTCGACGA ACTGGTCGAA GCGGCCGGCG GAAATACGCA AGGCAATGGC
GAGCGCCGCA GCGGCACGGG TCTCGCGTTC CAGGCCCCGG CGCGACTGCT CACCGGCGAC
CCGTCGCTCA CCTTCGCGTT CTATGAGGAA GAAGGCTCGG CCGAGGTTGG CGATACCGAA
GGCGCGCTAG TGGTGACGCT GGTCGGCGCC GTCCCGGCGG GAATGGGTCG GCCGGTCGTC
GAAATCGATC CGTCGCTGTA TGACGCGCAG CCTTGGCAGC CGCCTGGCGA CACGCAGAAG
GCCGGCGATG TCTTCCACTA CGTGTACAAG CTCAAGGCCG GCCCCGGGCC GGAGGGTTCA
TACCTGAGCG CTGCGAAAGG GCAGAACATT CCAGGCCGTA CGGTGTGGCT GCCTGCGCTC
GACATCCTGC AGCGTCAGGA CGCGTGGTCT ACCGTCTGGG TGGAGCGCAA CCGCGAACTC
GTTCCGGGCA AACCGTCGGC TGACGCGTTT GTCTACACGA CGCCCGAAGT GCGCTTCGCC
AGCCCGCTGT ATCCGACCAA CGACGCCAAT GCGATCATCG ATGTGGCGGC GATCCCGTCC
GGAACGCCGG TGACGCGTTC GCTTCAGGAG CATTTCGACG CGCTCTTTGC CTATCTGCTC
GCCGGCGACA CGCTGCCTCA GATCGTCGCG CAGGTGGAGG TGACATACGG ATATGCGCTG
AACGCCGCGC TCGATAAGAT CGTGTTGCCG GTGCTGATGC AAGCGCCGCT GACGATCGAC
ATCGCCGGAA CCGGCGCGGG CACCATCGCG CGAATGACCG CCGACTGGAC TGCCGCGATC
GAAACGTGGT TCTCCACATA CGAACCGACG GGCGGCGGGA CACTCTGGAT GGATCTGACG
CTGATGTCGA ATCTTACCGG CCAGCCGATG CCGTTGCTGC GGATGCGGCG ATTGATGCTG
TCCATCGCGC AGGTCGTCCC GCCGCTGCCA TGCCGCTAG
 
Protein sequence
MAGLNDVIER LSQGTQNVLT VHLGARITKV RDYLNSLGEI RVFTANSQNF GHQSSSVNIL 
RNLIRMGAPG PYTFALSASN SADYADLEEK IRLLIPQFRQ VGVTFELGAG GRTADVTVVR
LDKALAPAQF AISGGFDDLE NKTPPYHLLN VTNYVQLQPY AWNRGTNMAR IMPPGGTASE
YNLDELNPTT LLARRAFYLA DPELTQSDRE AIAQTPYANK ARVIERLLER REAGEITLFP
VYGVTTKGSA YTSLYNAVTG ALIAQNTHPT VKKTVMVQIT TLTASEWQEF LFLMRDPAGE
MVNKIRTTPD FRGWNEENKV KDRVQDLGSP NSVPTVEQLD EELSYLKDDQ LLVVYIGKIP
APLFDALYAS ATLPPVLEGQ NTAELMLNLG KPYFKITSNN SREADARFSY ATLPLSSAGA
GTDATNSLDE SFNGIYFTRP DNWYRDRPTY PPTQLPAMIN AYVQPAGNAR ATYFAAQRTF
FHDELNDKLL RGLDLFVNLI GPAALEEHRL ALAHAELDDA NAPVHEAGAT ARAPARIAHR
KPPTRVAGGD ANGASELLEA FYEDLTSHTV DGVLDFLLAV TDGILNEFFR QVVIDVVFTI
TDTVTEINAD KTEVTLTGKS KAFGAGNLTL AFSFTDNGGT IAGKMSGAFT DTVWAFPGAQ
WLSVANPSLA LAIDSNAAVP VTGTVGATFT AGIAAKASLT LPSEPGRLLL QAEFLAPRPS
ITNIFQMLGG INIQALLPSQ IQLFSDIEVQ NLALRYSYAN GVMEYIGVTL GTPENRSWQL
VPGVTVTGLS FSALTDYPGD LQRRSTRYVI GGRFDIAGGN AQLEARVPAL RVTGGLIDGS
PPITLAAIVT AYLGADFAAA IPASVSSTAI EQLSFMVDQA QGAYSFSMDV SAQWPVPSAA
NALFTITGLN FAIDAVSRDI NPPKADAGGN NGAGGTQTEI EGSFGGSLIV LPNSESPIGL
STTATYKTAA KAWTFDAQQT SGAVSLSALL VYYLGNTWQA PQGQEYAIDG LGLTITSSPT
DSTWAFTGKT ADNWVVPFLD VSLAAKLRMG DAGAKAEVPG KFGRLDLEVI WQNIDLTVWF
DYNPKVKQYG ITWGLLEGVV DGPDPTTQDW TATLGFKQNT TLGSMIETMV SWATGSKFGL
ESPWSFLNAI PLSNLALKYT FNQTTPSRNK VSFAVTIGPI NLGFARIDSI DVGYQSTGKD
RGVMVTLNGS FFWQSDPSTP LAWDASKPGT APAPPGNGNK YLDLRLLAMG QHITLPCFAT
ADTVQKAIAC MATLPDPKPG QIPAVRFDSQ SAWLIGTDFG VLKIDSGQAG NHANALRVTN
DGDSLAESAG YVLTLQAVFN DPHLYGLRIA LDGAAAKVFK GLDFQIMYRQ VSDTVGVYQA
EITLPDLMRH LTVGAYSLTL PVFGIAIYTN GDFQVDIGFP WNENFSRSFT IEAIIPPGIP
VLGSAGFYFG KLSSASTNRV PASSYGTFNP VLVFGFGMQV GFGKSIEYGI LSAGFSVTVV
GILEGILAKW NPYQLTHSGR EPSTQLQGDY YFWLRGTVGI VGRVYGSVDF AIVKANVDIT
VKLLLQLTYE SYVSITITVI ASVDVSVSVK INLGLFKISI SFSFSMRLKE TFTIDNRGAA
PWLGDGRNVR GVLRLPVERR LSGFARAQAR DSLRVSAPNW GNLRPDGVTD LSGYLVPGLT
AARDEWTPQG EPANPLSCWV ALLLIESVPP AGQDAGASKL KAAGSAPDSS FEALAKMVLR
WAIAAVQGPM TPDEVDRYPV PATLLDWLAD EVLVSTGDDP TPIPLDAVQA FLDTHFRFNL
RVPPTDQDAS ADTAYFPAPP QLRVTIPPYG KDYPGVQYDL GSYNALGENT LAELRAWFDQ
LAVQVEREQA ANGAAAARAF VEEAPLSMAA WMFSDYFLLL ARQMVKAAQD ALRDFKYELD
ANETPDDIVN WVNTTGQLNG LYTLNDVFGA NTVHALVTEK TLTIGVTSSI TLDKTGQTFT
SLAKAFDDAL PASAIAAANA ADAALLQPGA TITYPGFDPY TSVAGDTLVS IAAHYKAKLN
DLLADSDVLD AAGMLRIGAS ALIPYTSYTA LATDTFASVA ALPVYAGGFS AAALATANAG
SSVLLAGVKI AYPDKDAYTV QPRDTLGDVA NAFGVTVSDL LASSAVLTQP GLLAPVALLT
VPAFRYTTQP GDDLAQVAAR FGVAVSVLAD QPANGTVAGL FDAGDTLDLP HLPQFPLAEL
LAEAQRSGML QHLSGIASSY TMHGLRFPTS GPTGTGGQWS IVPNEMGMWV HDVNGTLKLP
PQAGLYALTG QQFPLPALGA DPFAATFDSV AGAGSSWLRF VDGNGAPADH LTLSVTPGTP
DATRIAQVTA AAKTRLDVPM DRLGAGKMYD TALATYPFTS ALQWLSTNTV ALPYGQPPAG
VQSLRVWQLP SALAALPDPA THAVNPRFAL RVARYDDATG ATETTGVDSY GWASTIGFTV
RRIPPVAGSP ASVDTYEVVG AGGAAIVVLE QLLNQVQAND SAYFGLSVGF APDSATGGGE
GVQTGGAASV VFGIAQVNLS TETRPPAGAA FAALRETAGE TPQLTLLNPP SEFVRLLWEA
SITRSGGFFL YYYDRAAGGG LPDRIFNDRN EASLTLIVLY AKPAAVDDQD RVANYMNAVV
TTDALDTGNA VLFAEAAPVP ATVTSGAGET LASLAEQWYS DEADIAEANA DVALRAGALV
RVSEGVYQAP PGGIALAQVA SRFGTTVQAL NDANPLWGGL PDPLPFPAAI RVPDLTLTAG
TSAHTASLAD IAGWYGEPVD ALASHNARVA QLFAAGVPLV IPGGPRVRSA AVQPGVQALA
ALRPAPPQVD GTSPDYGTEL LLNNFSLLNQ QVYGNVDFRP SDPPGLPAGP TTKAPEENGN
DKVRTVVPED QVEAWNFSQA LPYARFAKHV PQAPRAALAL PPASASPYFG VGGILQISFA
WQDYYGNVLS TPLSDPQAGD AAPYNDAPLL TGYTDPLVSL SQWPSIASNW QVLPGSGGAN
PRLNIELSFD PSRYQGLLQA SAATQTTITV VFTDALDAAS VGELSRWQLV PGTIDSASLA
ADGKTVTLTV PALDGNLRYT VIATDIKAQA SDTRYSGQAS FDWPDNPVTR SSTVQQNASQ
DLHVYTQLYY QLTDPAGVDL SAQSSLLAGA DGAPGSVAYA PAAVGKLMDW LFGTAGAAAS
VYAFVLDRSK FQSVAVPPAA GLPLDVDVPP QQVNTAQIFP LWTSFTMTRA HGPVLPGLET
VAGIRSAITR VAPLQDALGA TGGTLGLVTF ATGFEQALST PGSERLKVAT GVDRTAPPAT
GAASTVWAVR VGLAAGKAIS YAIADAGNPA VFAPQPASNQ LISRTQVPIY DYTTGKGISS
TPSRTTDFTD VDLDTWCAQV FAAVDDVLTP QFTAPMQIVG ELKSADYLQS ILDGKKGLAT
VAKLWMIPVF AGETSDPSAA REAFYQQLLV RLSAAYTTRA AVEFHANVTA DVIEPAADQP
PRLFGPVTRN GPVFEAANVD GQARTTVFLL FSDPMDPVTA GNIENYALSS GAGVLTATVD
RGTVTLTLAT DVQPGQTTVT VSNLKDATGR AVRPPLTRTV TTGSASLPAS TLAFSSPKLT
LQTGDTRALT YLVNAPDSVR GAGGEIVSYV ELDMTYQGSQ IEHQIGSLPG IEDYQASTWL
SFVVPDTDGP LAADLGNFAV PLVLRAFPAS PAMTEQSGTP THDLDTASLP LLKQWDYAFT
YSLPFHYPQD RIYGEVEFNL RTAPTLFASF PDAFAQLAEF ITVFPQVNAD LQTILAGIDA
TIDPMTDQQK IDDASIALQS FIRLVDELVE AAGGNTQGNG ERRSGTGLAF QAPARLLTGD
PSLTFAFYEE EGSAEVGDTE GALVVTLVGA VPAGMGRPVV EIDPSLYDAQ PWQPPGDTQK
AGDVFHYVYK LKAGPGPEGS YLSAAKGQNI PGRTVWLPAL DILQRQDAWS TVWVERNREL
VPGKPSADAF VYTTPEVRFA SPLYPTNDAN AIIDVAAIPS GTPVTRSLQE HFDALFAYLL
AGDTLPQIVA QVEVTYGYAL NAALDKIVLP VLMQAPLTID IAGTGAGTIA RMTADWTAAI
ETWFSTYEPT GGGTLWMDLT LMSNLTGQPM PLLRMRRLML SIAQVVPPLP CR