Gene BURPS1106A_A2213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2213 
Symbol 
ID4905062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2178186 
End bp2191592 
Gene Length13407 bp 
Protein Length4468 aa 
Translation table11 
GC content66% 
IMG OID640145318 
Productnon-ribosomal peptide synthase 
Protein accessionYP_001076246 
Protein GI126458543 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGTAA ACAGCCTGTT GGATCTTCTG CAACGAAAGA ACATTCGGAT TCACGTCGAT 
CGCGACGAGT TGGTGGTGCG TGCGCCGCGC GGCGCGCTGA ATGCCGAGCT CACGCAAGCG
CTCAAGAAAA GCAAGGCGGA ACTGATCGAC GTGCTTCGCC GGCGCGGCGC GCAAGCGTCG
CCGGATCCGG TTCGCATTAC GCCGGCCCAA CTGACGCTGG TGGCGCTGAG CCAGGAGTCG
ATCGACGCGT TGGTGACGAA GGTGGAAGGC GGTGCGGCGA ACGTGCAGGA CATCTATCCG
TTGGCGCCGT TGCAGGAAGG CATTCTGTTT CATCACCTGA TGTCGGGGGA AAGCGATCCG
TACGTGCTGT CGGGCGTGCT GGCGTTCCGC AGCCGCGAGG TGATGGAGCG GTTTGTGTCG
GCATTGCAGC AGGTAATCGA TCGGCACGAC ATCCTGCGCA CCGGCTTCTT CTGGGAGGGA
CTCGAGCAAC CGGTGCAGGT CGTGCAGCGC CGGGCGACGT TGCCGGTTAG TGTCGTCGAG
TTCGATGCAC GCGAAGGCGA CATCGTGCGG CAACTGGAGG CCCGCTTCGA TTCGCGCGGC
TATCGGATGG ACGTGAGCCG CGCGCCGTTG ATGCACGTTC ACGCGGCTTG TGACGGCGAG
CACGAACGCT GGGTGGCGCG CGTGCTGTTT CACCATCTGT CGATCGATCA TACGACGCTC
GAGCGTGTGA TCGAGGAGGC GCGCGCGATC GGGCAGGGCC GAGCGGAGGA CTTGCCGCGG
CCGGAGCCGT TCCGGAATTT CGTCGCGCAG GCGAGGCTGG GGGTGAGCGA GGCGGACCAC
GAGGCGTACT TCAGGGCGAA GCTGGGGGAC ATCGATGAGC CGACGGCGCC GTTCGGTCTG
CTGAGCGTTC AGGGAGACGG GCGTGAGATA GCGGAGGCGG CGCGGACGTT GAAGCCGGAG
CTGTCGGGGG CGCTTCGCGG ACATGCGCGC CGGCTGGGGG TGAGCGCGGC GAGCATGATG
CATGTGGCGT GGGGGCTGGT GCTGTCGCGC ACGACGGGGC GGCAGGACGT GGTGTTCGGC
ACGGTGCTGT TCGGGCGGAT GCAGGGAGGC GCGCAATCGG ATCGAGCGCT GGGCTTGTTC
ATCAACACGT TGCCGGTGCG GATGAGGGTA GCGCAGACGG GCGTGGAGGC GAGCGTGAAG
GGGACGCATG CGCAGCTGGC GGAATTGATG CGTCACGAGC ATGCGCCGCT GGTGCTGGCG
CAGCGTTGCA GCGGGGTGCC GGCGCAGACG CCGCTGTTCA CGTCGCTGCT GAACTATCGG
TACAGCAAGC CGAAGGTCGC GGCCGCACAT ATTGCCGACG GCATCGAATT GCTCGACGGG
CACGAGCGAA CCAACTACCC CGTCAGCGTG GACATCGACG ATCACGGCGA CGATTTCAAG
ATTCGTGCGC AGGCGGCTGC AAGCGTCGAT CCGGCGCGCG TTTGCGATTT CCTCGAGGTC
GCGCTGACAC GGTTGGTCGA TGCGCTGGAG CGCGATCCGC ACGGGGCGCT GCGGCAATTG
GACATCCTGC CGGAAGTGGA GCGTGAAGAA GTCGTTCGTC GTTGGAATGC CGGCGAAAAG
GCACGTCCAT CGCGGCTCTG CCTGCACGAA CTGTTCGAAA GGCAGGCCGC GCGCGCGCCT
GACGCGATTG CGGTGATTCA GGACGAGCGT GCTCTGACCT ACGCGGAACT CAATCGGTGT
GCGAACCGCT TGGCCCACTA TCTGCGTGCT CGGGGCGTGC GTGGGGGCGA TCGCGTCGCA
CTGTATGCGC GGCGCAGTCC CGAACTGCTG ATCGGCATGC TGGCGACGCT CAAGGCGGGC
GGCGCATATG TGCCGCTCGA TCCGGGATAC CCGGCCGAGC GCTTGACGCA CATATTGCTC
GACAGTGCGC CCGTAGTCGT GCTTCGCGAC GCCGCGGCCT CGGACGACGT TCTGGTTCGA
TTGAACGCGG GTACGCTGAT CCTGGATCTG CATGCCGACG ACGAACGCTG GAGCGCGCAG
CCGTCGGGCA ATCTGAAGCT ATGCGGATCT CACGAGCCAG ACGTCGGTGC GCGGCGCCTT
GCGTACGTGA TCTACACGTC GGGATCGACG GGGGCGCCGA AGGGCGTGAT GGTCGAGCAC
GCGAGCGTGG TCAATCAGAT CGGTGCGTTG ACCGAGTATC TCGAACTCGA TGCATCCGAT
CGTGTGCTGC AGTTTTCCAA CATCGCTTTC GATGCTTCGG TCGAGGAGAT CTTCGCGACG
CTGACTTGCG GCGCGACGCT CGTGTTGCGG ACCGATCGGT GGCTGGCGGA TGCCGAGACG
TTCTGGGCGC TGTGCGGGGC GCAGCGGATC AGTATCGTCG ATTTGCCCGC GCAGTTTTTT
GGCCAGCTCG CCCTGAGTGG GCGTCGGGCG GTCCCGACCG GCGTACGCTG CGTGGTGATC
GGCGGCGAAG CGGTCGGTGC GTCGGCGCTC GATGCATGGT TTGCCGAGGA AGGCCGTCGG
CCGCGCCTTT TCAACACGTA CGGGCCGACG GAAACCACGG TCAGCGTGAC GGTGCACGAG
GTTCGGGGGC GACACGACGA TGCGAATGTC ATTGGGCGTC CAATTGCGAA TACGCGGGTT
TATGTGCTGG ATGCGTGGCT GCGTCCGGCG CCGATCGGTG TGGCGGGGGA GTTGTATATC
GGTGGCGTGC AGGTGGCGAG GGGGTACTTG AACCGGCCGG AGCTGACGCG GGAGCGGTTC
ATCGACGATC CGTTCGTGGC GGGCGGGCGG CTGTATAGGA CGGGGGATCT GGCGCGTTGG
CGCACGGACG GGAGGCTGGA ATATCTGGGT CGAAACGACT TCCAGGTGAA GATACGCGGT
TTCCGGATCG AGTTGGGGGA GATCGAGGCG CAGTTGGCGA AGGTGGCGGG GGTGCGCGAG
GTGGTCGTGC TTGCGCGAGA TTCGGCATCG GAGGTGCACG ATAGCGCGAC GGAACACGCA
ACTCCGGATG CGCTTTCGCC CTCGCCCGAG ACCTCAACCG CGACAGCAAC GGCAACAGCG
ACCGAGAAAC GCCTCGTCGC GTACTACACG GGTGACGCCG ATGTCGTGGC ATTGAGAGCG
CAAGCCGCGC AGCACTTGCC GAGCTACATG GTGCCGTCGG CGTACGTGCG GCTGGACGCG
TGGCCGCTGA CGCCGAACGG CAAGCTGGAC CGGCGCGCGC TACCCGCGCC GGCGGACGAC
GCATACGCTC GCGCCGAATA CGAAGCGCCG CGGGGCGCAA AAGAAGAAGC ACTGGCCGCG
ATCTGGCGGG AGCTGCTGCA TGTGGAGCGC GTCAGCCGCC ACGACAACTT CTTCGAACTC
GGCGGCCACT CGCTGCTGGC GGTGCAACTG GTATCGCGCC TGCGGCAGGC GTTGTCGGTG
GAGGTGGCGC TGAGCACGGT GTTCGATGCG CCGGTGCTGT CGGCTTTGGC CGAGCGGCTC
GAAGCGGGGA ACACGGAGGT CCTGCCGCCG ATACCGCTGG CGCCACGCGA CGGACGAATC
GCGCTGTCGC TGGCGCAGCA ACGGCTATGG TTCCTGACGC AACTGGAAGG CGTCAGCGAG
GCGTACCACA TGAGCGGCGC GGTGCGGCTT GATGGGCCGT TGAATCGAGA GGTGCTGCAA
CGTGCGCTGA ACCGTATCGT GATGCGCCAC GAAGCACTGC GGACGTGCTT CGTTCGGGAG
GAGGGTGAAC CGATCCAGGT GATCCAGCCG CATGCCGATC TGACGGTGAG CTACCACGAC
CTGCGCGAAG CGGAGTCGAT CCGACATGAA GCCGGGAACC GCGAACAGCG GGCAAAGGAC
CTGAGCCAAG CCCACGCATC GGCGCCATTC GACCTGAGCC GAGACCTGCC GGTGCGAGTA
CTGCTGCTGC AATTGGCGGA TGAAGCCCAC GTCGTGCAGG TGGTGATGCA TCACATCGCA
TCGGACGGCT GGTCGGTCGG GGTGTTCCTG CAAGAGCTGA GCGCGCTGTA CGGGTCGTTC
ATCGCGGAGC AGGGCGATCC GCTGGCGCCG CTGCCGTTGC AATACGCGGA CTACGCCGCG
TGGCAACGCA GGTGGCTGGC GAGCGGCCAG TTGGAGAAGC AAGGCGCGTT CTGGCAAACG
AACCTGTCCG GCGCGCCGAC GCTGCTGGAG CTGCCGACGG ACCGTCCGCG TCCGCCGAAG
CAATCGCACG CGGGCGCGAG CGTCGAGGTG AAGCTGGGCG CGGCGTTGAG CGAGCGGGTG
AAGCGCCTGA GCCAACGTCA CGGGGTGACG CCGTACATGA CGCTGCTGTC GAGCTGGGCG
GCGGTGCTGA GCCGCTTGAG CGGACAGGAG GAGGTGGTGA TCGGCAGCCC GGTCGCGGGG
CGGAACCGAA CGGAGGTTGA AGCGCTGATC GGCTTCTTCG TGAACACGCT GGCGTTGCGG
CTGGATCTGT CGTCGGAGCC GACGGTGGGC GAGTTGCTGA AGCGGACGAA GGCGCAGGTG
CTATCGGCGC AGGCGCATCA GGACTTGCCG TTCGATCAGG TGGTGGAGCG GGTGAAGCCG
CCGCGCAGTA CCGCGCATCC GCCGCTGTTC CAGGTGATGT TCGTCTGGCA GAACATGCCG
GCGGGGGAAC TGACGATACC GGGGCTGACG ATCCGCGCCG TGGAGACGCC GCTGCAGACG
GCGCAGTTCG AACTGACGCT GTCGTTGCAG GAGGCGGGCG ACGACATCGT CGGCCACCTG
AATTACGCGA GCGCGCTGTT CGACGAATCG ACGGTGAGGC GTTACGTGAC CTATTGGTGC
CGTTTGCTGG AAGGCATGAC AGCGGGCGCC GCGGACCAGA CGATCGTGGG CTTGCCGTTG
CTCGACGAAG CCGAACGCAA GCAGGTGGTG TACGCGTGGA ACGCAACAGA GCGTGACTAT
CCGATCGAGC AATGCATTCA TCAGTTGTTC GAAGCGCAGG TGGATCGGAA GCCGGAGGCG
ATTGCGCTGA CGTTCGAGGG ACAGCGACTG AGCTACGCGG AACTCAACGC CCGAGCGAAC
CGGCTTGCGC ACTATCTGCA GGGGCGCGGC GTGGGCCCGG GCCGACTGGT TGCGCTTTGC
GCGGAACGCG GAATCGAGAT GGTGGTGGGG CTGCTGGCGA TCCTGAAGGC GGGCGGCGCG
TATGTGCCGC TGGATCCGGC ATATGCGTCG GATCGCCTGC GCGGGATCGT GGAGGACAGC
CAGCCGGCCT TGGTGCTGGC GGACGCGGTG GGGCGCGCGG CGTTGGGCGA GTTGGATGGT
GCGCTGCCGG TGATCGATCT GGAAACGGAT GCGCTGCGCT GGCGTGAGAT GCCGGCGACC
AATCCGGAGG TGGCATCGCA GCATGTGCAC CACCTTGCCT ATGTGATCTA CACGTCGGGC
TCGACGGGCC GGCCGAAGGG GGTGATGGTC GAGCACGCGC AGGTTGTGCG CCTGTTCGGC
GCGACGCAGG CATGGTTCGG CTTCGACGAG CGGGACGTGT GGACGCTGTT CCATTCGTAC
GGCTTCGACT TCTCGGTATG GGAGATGTGG GGCGCGCTGC TGCACGGTGG TCGGCTGGTG
ATCGTGCCGA CGGAGGTAAC GCGCACGCCG TCGGCGTTCT TTGCGCTGCT GTGCGCGGAA
GGCGTGACGG TGCTGAATCA GACGCCGAGC GCGTTTCAGG CGCTGATGTC GGCGCAGGAG
GAGCGGGAGG AGGCGGCCGG GAATATCGAG CGCGCAAACG TAGTCGCCCA CCGGCTGCGC
TATGTCATCT TCGGTGGAGA GGCGCTGGAG CCGAGGACGC TCGCGTCGTG GTATGCCCGT
CACGGCGAGC GTACGCAGTT GGTGAACATG TATGGAATCA CCGAGACGAC GGTGCACGTG
ACGTATTATG CGCTGCGAGC GGAAGACGCC ATGCGTTTGG GTGCGAGTCC GATCGGCGTG
CGGATTCCGG ATTTGCAGCT GTACGTGCTG GACGCTCGTC GCGAGCCGGT GCCGATGGGC
GTGACGGGAG AGCTGTATGT GGGCGGGGCG GGTGTCGCAC GCGGGTACTT GAACCGGCCG
GAGCTGACGC GGGAGCGGTT CATCGACGAT CCGTTCGTGG CGGGCGGGCG GCTGTATAAG
ACGGGGGATC TGGCGCGTTG GCGCACGGAC GGGAGCCTGG AATATCTGGG TCGAAACGAC
TTCCAGGTGA AGATACGCGG ATTCCGGATC GAGTTGGGGG AGATCGAGGC GCAGCTGGCG
AAGGTGGCGG GGGTGCGCGA AGTGGTCGTG CTTGCGCGAG ATTCGGCAGC GGAGGTGCGC
GATAGCGCGA CGGAACACGC AACTCCGAAT GCGCTTTCGC CCTCGCCCGA GACCTCAACC
GCGACAGCAG CGGCAACAGC AACGGCAACA GCAACAGCGA CCGAGAAACG CCTCGTCGCG
TACTACACGG GCGACGCCGA TGTCGCGGCA TTGAGAGCGC AAGCCGCGCA GCACTTGCCG
AGCTACATGG TGCCGTCGGC GTACGTGCGG CTGGACGCGT GGCCGCTGAC GCCGAACGGC
AAGCTGGACC GGCGCGCATT GCCCGCGCCG GCAGACGACG CATACGCCCG CGCCGAATAC
GAAGCGCCGC AGGGCGCAAA AGAAGAAGCA CTGGCCGCGA TCTGGCGGGA GCTGCTGCAT
GTGGAGCGCG TCAGCCGCCA CGACAACTTC TTCGAACTCG GCGGCCACTC GCTGCTGGCT
ATCGGCGTGA TCGAACGCAT GCGCCGCGAG GGGCTCCATA CCGACGTACG CAGCATATTC
AACGCGCAGA CGTTGTCCGA CCTTGCGGCG CGTGCGCAAA CCGACGATCG ATCGATCCAG
GCACCGCCGA ACCTGATTCC CGCGCGCGCC ACGCGCATCA CGCCCGACAT GCTGCCGCTC
GTCGCGCTGA CCCAGACGCA GATCGACATG CTCGCGGTGC AGGTCGAAGG CGGCGCCGCC
AACATACAGG ACATCTATCC GCTCGCGCCA TTGCAGGAAG GCATGGTATT CCACCATCTG
CTGCATGCCG AAAGCGATGC GTACATGGAA GCCTATTTCG TCGGCTTCCG TACGCGGGCG
CGTCTCGATC GCTTCCTCGA CGCGCTGCGG ATGATCGTCG ATCGGCACGA CATCCTGCGC
ACCGGTTTTT TCTGGGAGGG ACTCGAGCAG CCGGTGCAGA TCGTGCAGCG TCGCGTGCGC
CTGCCGATCG AATTCGTCGA TCTCGATCCG GCTGACGGCG ACGTCCTGCG GCAACTGGAG
GCGCGACACG ATCCCCGCGC GCACCGCCTC GACATCCGTC GGCCTGCGCT GCTGAGCTGC
CACGCGGCTC ATGATCCCGC CGCCGGCCGC TGGCTGCTGT GCGTGATGGC TCATCATCTG
GCGATCGACA ACACGTCGCT GAAGCTGCTC GTCGCCGAGG AGCAGGCGAT CGAGCAAGGC
GGATTCGACG CGTTGCCGCC GGCCCCGTCG TTTCGCAACT TCATCGCGCA GATCGCATCC
GGCATCGATC GACGGGAGCA TGAAGCATTC TTCAGCGCAA TGCTCGGCGA CATCGACAGC
CCCACGCATC CGTTCGGACT GCAGGACGTA CAGGGCGACG GCCGAGAGAT CGCGGAGTTT
CAGCAAAGGT TGTCGCCCGA ATTGTCGAAG GCGATTCGCG TCTGCACGCG GCGGCTCGGC
GTCAGTCCGG CAAGCCTGAT GCACCTGGCA TGGGCGATGG TGCTCTCGCG CGCCACCGGC
CGACGGGAAG CGGTGTTCGG CACGGTGCTG TTCGGCCGCA TGCAGGGCGG CGAGCGCGGC
ATGGGCATGT TCATCAATAC GCTGCCGATC CGGATCGACG TCGATGAGCG GTATGTTGCC
GAGTGTCTGG CGCACACGCA TGAGCGGGTG GTGCAGCTCA TTTATCACGA ACATGCGCCG
CTCGCGCTCG CCCTGCGATG CAGCGGATTG CCGGCCCAGC AGGCGCTGTT CTCGTCGCTG
CTCAACTACC GGCATAGCGA GCAGGCAGCC AGGCCACCGC GGGACGACGA CGATATCCAG
TATCTGGACG GCAACGAGCG CACCAATTAT CCGCTGACCG TATCGATCGA CGATCTCGGC
GAAGCGTTTT CGGTGACCGT TCAGGCGCGC CATCCCGCGT CGCCCGAGCG CATTCGTGCT
TTCATGGAGA CCGCGCTCGA ACAGCTCGTG CGCGCACTCG ACGGTACTTC GGGCGTTGCC
GCGCCGGGCG TTGTTATGCC GCGCATCGCC GTGTGCGACA TCGACGTGTT GCCGAGCGAA
GAGCGTCACC GTCTGCTCGT CGAATGGAAC GACACGGCCG CCGACTATCC GCAGGATCAG
TGTCTGCATC GCTTGTTCGA GGCGCAGGCC GCGCGGCATC CGGACACGAT CGCGCTGATC
GCGGACGGCG AGCCTGTCGG TTATGCCGAA CTGAATCGCC GTGCGAACCG GCTCGCCCGT
CACCTGAGCG CGCGAGGGCT GCAACCGGAC CAGCGCGTGG CGATCTGCAT CGATCGCGGC
ATCGACATGG TCGTCGCGAT GCTGGCCGTG CTCAAGGCGG GCGGCGCATA CGTGCCGCTC
GATCCGGCTT ATCCGTCGGA GCGGCTCGAT TATCTGTTGC GCGACTGCGC GCCCGTTGCA
CTGCTCACGC ATGCGCGTCT CGGCGCGTCG ATGCAGACGC GGCTCGTGCT CGCGCTCGCG
AGGCTGGACA CCGGATGTGC GTTGATCGAT CTCGAATCGG ATGCCGGCGC ATGGCGGCAC
GAGCGCGACG ACGATCCGCC GCCGAGCGGC TTGACGCCGC GCCATCTCGC TTACGTGATC
TATACGTCGG GCTCGACGGG GCAACCGAAG GGGGTGATGG TCGAGCATCG CAGCGTCTGT
AATCTGGTGG CGTGGCACGC GGGCGCGTTC GATGTCGGCA CGGGCTGCCG CAGCGCGAGC
GTGGCGGGCG TCGCGTTCGA TGCGACGACG TGGGAGGTTT GGGCGGCGCT GTGCAACGGC
GGCTGCCTGT CGCTCGCGCC CGGCGACGCC GCCTCCGATC CGCAGGCGTT GTTGCGCTGG
TGGCGGGCGC AGGAGCTGGA CGTCGGCTTT CTCGTGACCC CGCTTGCCGA ACTCGCGTAT
GCGACGGGAC AGAGCAATGC CGGCATGCGG ACATTGCTGA TCGGCGGAGA CAGGCTTAGC
CGCTGGCCCG ATTCGATGCC GCCCGGGCAG ATGCTCGTCA ACAACTACGG GCCTACCGAG
GCGACGGTGG TGGCGACTTC GGGGCGCCTG CAGCCGGGCG AGGCCACGCC GCCCATCGGC
CGTCCGATCG CGAATACGCG GGTGTATGTG CTGGATGCGT GGTTGCGTCC GGCGCCGATC
GGTGTGGCGG GGGAGTTGTA TATCGGTGGC GTGCAGGTGG CGAGGGGGTA CTTGAACCGG
CCGGAGCTGA CGCGAGAGCG GTTCATCGAC GATCCGTTCG TGGCGGGCGG GCGGTTGTAT
AAGACGGGGG ATCTGGCGCG TTGGCGCACG GACGGGAGGC TGGAATATCT GGGTCGAAAC
GACTTCCAGG TGAAGATACG CGGATTCCGG ATCGAGTTGG GGGAGATCGA GGCGCAACTG
GCGAAGGTGG CGGATGTGCG CGAGGTGGTC GTGCTTGCGC GAGATTCGGC AGCGGAGGTG
CACGATAGCG CGACGGAACA CGCAACTCCG AATGCGCTTT CGCCCTCGCC CGAGACCTCA
ACCGCGACAG CAGCGGCAAC AGCAACGGCA ACAGCAACAG CGACCGAGAA ACGCCTCGTC
GCGTACTACA CGGGCGACGC CGATGTCGCG GCATTGAGAG CGCAAGCCGC GCAGCACTTG
CCGAGCTACA TGGTGCCGTC GGCGTACGTG CGGCTGGACG CGTGGCCGCT AACACCGAAC
GGCAAGCTGG ACCGGCGCGC ATTGCCCGCG CCGGCGGACG ATGCATACGC CCGCGCCGAA
TACGAAGCGC CGCAGGGCGC AAAAGAAGAA GCACTGGCCG CGATCTGGCG GGAGCTGCTG
CATGTGGAGC GCGTCAGCCG CCACGACAAC TTCTTCGAAC TCGGCGGCCA CTCGCTGCTG
GCGGTGCAAC TGGTATCACG CCTGCGGCAG GCGCTGTCGG TGGAGGTGGC GCTGAGCACG
GTGTTCGACG CGCCGGTGCT GTCGGCTTTG GCATCCCGAT TGGACGATAA CACCGCGGCG
GTCCTGCCGC CGATACCACT GGCGCCACGC GACGGAAGAA TCGCGCTGTC GCTGGCGCAG
CAACGGCTAT GGTTCCTGAC GCAACTGGAA GGCGTCAGCG AGGCGTACCA CATGAGCGGT
GCGGTGCGGC TTGATGGGCC GTTGAATCGA GAGGTGCTGC AACGTGCGCT GAACCGTATC
GTGATGCGCC ACGAAGCATT GCGGACGTGC TTCGTTCGGG AGGAGGGTGA ACCGATCCAG
GTGATCCAGC CGCATGCCGA TCTGACGATG AGCTATCACG ACCTGCGCGA AGCGGAGTCG
ATCCGACATG AAGCCGGGAA CCGCGAACAG CGGGCAAAGA ACCTGAGCCA AGCCCACGCA
TCGGCGCCAT TCGACCTGAG CCGAGACCTG CCGGTGCGAG TGCTGCTGCT GCAATTGGCG
GATGAAGCCC ACGTCGTGCA GGTGGTGATG CATCACATCG CATCGGACGG CTGGTCGGTC
GGGGTGTTCC TGCAAGAGCT GAGCGCGCTG TACGGGTCGT TCATCGCGGA GCAGGGCGAT
CCGCTGGCGC CGCTGCCGTT GCAATACGCG GACTACGCCG CATGGCAACG CAGGTGGCTG
GCGAGCGGCC AGTTGGAGAA GCAAGGCGCG TTCTGGCAAA CGAACCTGTC CGGCGCGCCG
ACGCTGCTGG AACTGCCGAC GGACCGTCCG CGTCCGCCGA AGCAATCGCA CGCGGGCGCG
AGCGTCGAGG TGAAGCTGGG CGCGGCGTTG AGCGAACGGG TGAAGCGCCT GAGCCAACGC
CACGGGGTGA CGCCGTACAT GACGCTGCTG TCGAGCTGGG CGGCGGTGCT GAGCCGCTTG
AGCGGACAGG AGGAGGTGGT GATCGGCAGC CCGGTCGCGG GGCGGAACCG AACGGAGGTC
GAAGCGCTGA TCGGTTTCTT CGTGAACACG CTGGCGTTGC GGCTGGATCT GTCGTCGGAG
CCGACGGTGG GCGAGTTGCT GAAGCGGACG AAGGCGCAGG TGCTATCGGC GCAGGCGCAT
CAGGACTTGC CGTTCGATCA GGTGGTGGAA CGGGTGAAGC CGCCGCGCAG CACCGCGCAT
CCGCCGCTGT TCCAGGTGAT GTTCGATTGG CACAACACGC CCGCCCGCGC CTTGACGATG
CCCGGCCTGA CCGTGAGTGT GGCGAGCACG GAGACGACGA CGTCGCAATA CGACCTCGTG
CTGTCGATGC AGGAACGCAA CGGCGACATC GTCGGGCACC TGAATTATGC GACGGCGCTG
TTCGACGAAC AGACGGCGCG TCGCTACGCG CGCTACTGGC GCCGCCTGCT GGAAGGCATG
ACGGCCGGAT CGGCGAACGT GTCCGTCGCC CGTTTGCCGT TGCTCGACGA AGCCGAACGC
GAACAGGTGG TGCACGAGTG GAACGCAACG GAGCGCGCCT ACCCGATCCG GCAATGCATC
CATCAGTTGT TCGAAGCGCA GGCGGCGCGC ACCCCGAACG CAATCGCGAT CGGCGATGAG
CGCGTGACCT ACGCCGCATT GAACGCATCC GCGAATCGTC TCGCACGCCA TCTGCGGGCG
CTCGGCGTGG TCGCCGACAC GCGCGTGGCG GTCTGTATCG AGCGCGGCGC GCCGATGGTG
ATCGCACTGC TCGCGATCTG GAAGGCGGGC GGCGCATATG TGCCGCTGGA CCCGGCGTAC
CCGCGCGAGC GCATCGCGTA CATGCTGCGA GACAGCGCGC CCATCGCGGT GCTGACCTCG
CGCGCGAGCC GCGATCTCGT TGCATCGCAC CTTCCGGACC GCGCGCCGCT CGTAGTGATC
GACGCCGCCG CATGCCCGTG GGACGCATTG TCCGGCGACG ATCTCGATCC GAACGACATC
GAGCTGAACG CGACGCATCT TTGCTACGTG ATCTATACGT CAGGTTCGAC CGGACAGCCG
AAGGGCGTGA TGATCGAGCA CCGCAATCTC GTGAACTACA CGCTCGATGC GATTCGCTGG
TTCGGACTCG GGCCGGGCGA AACGGTGCTG CAGCAAAACT CGCTGAACTT CGATCTTTCG
CTGGAGGAAA TCGTGCCGGC GCTGTCGTCC GGCGCGGCGC TCGCGCCGGC CGTCGAGCTG
TTCGGCGCGG GCGGCAGCGC GCGCGGCCAT TCGGCCCGGC CGACGATGAT CCATCTGACG
GCCGCGCATT GGCAGCAACT GGTCGGCGAG TGGCATCGCG CCGGCGCGCG TCCGGCGGCG
GCGCTCGAAG GCGTGCGGCT CGTCAACGTG ACGGGCGATG CGTTGTCGCC GCATAAGCTC
GAGCAGTGGG ACGCGATCCG GCCGGCGCAC ACGCGGCTCA TCAATACATA CGGGCCGACG
GAGATCACGA TCTCGTGCAG CGCGGCCTAC GTGCGCCATG CGCCGGGGAT GAGCCGCGTG
AGCATCGGGC GGCCGTTCGC GAACAGCCGA ATGTATCTGC TCGACGCTCG CGGCGAGCCC
GTTCCGGTGG GTGTCACCGG GGAGTTGTAC ATCGGCGGCG ACGGCGTCGC GCGCGGCTAT
CTGAATCGGC CCGAGTTGAG CGCGGAGCGC TTCGTCGACG ATCCGTTCCG CCCCGGCTCG
CGGATGTACA AGACGGGCGA CCTCGCTTGC CGGCGCGGCG ACGGAGAGAT CGAGTTCGTC
GGCCGAAACG ATTTTCAGGT GAAGGTGCGC GGATTCCGCG TCGAGTTGAG CGAAGTGGAG
ACGCGGCTCG CAGCGGTGGA CGGCGTGCAG GAAATCGCGG TGCTGGCCCG CGAGGACGCG
CCAGGCGAGA AGCGGCTCGT CGCCTACTAC ACCGGTGCGG CCGAGATGGC CGCGCTGCGC
GAATGCGCGG CGCGCGACTT GCCCGCCTAC ATGATGCCGG CAGCTTACGT CTGCCTGCCG
GCGTTGCCGC TCACGCCGAA CGGCAAGCTG GACCGCAATG CACTGCCGCC GCCGGCGCAC
GATGCGGATT CGAATCGCGG CTACGAAGCC CCACAGGGCG ACATCGAGGA GACGCTCGCG
CGTATATGGG AACAATTGCT CGAGCGCGAG CGTGTGGGGC GCCATGACAA CTTTTTCGAT
CTGGGCGGCC ATTCGCTGCT GACGGTGAGC CTGATCGAGC GAATGCGCCA GGCCGACTTG
CACGCGGATG TCGCTGCGCT GTTTACGACA TCGACGCTCG CCGAACTTGC CGCTTGCACG
ACCAAATTGA AGGAGATTCT GCTGTGA
 
Protein sequence
MNVNSLLDLL QRKNIRIHVD RDELVVRAPR GALNAELTQA LKKSKAELID VLRRRGAQAS 
PDPVRITPAQ LTLVALSQES IDALVTKVEG GAANVQDIYP LAPLQEGILF HHLMSGESDP
YVLSGVLAFR SREVMERFVS ALQQVIDRHD ILRTGFFWEG LEQPVQVVQR RATLPVSVVE
FDAREGDIVR QLEARFDSRG YRMDVSRAPL MHVHAACDGE HERWVARVLF HHLSIDHTTL
ERVIEEARAI GQGRAEDLPR PEPFRNFVAQ ARLGVSEADH EAYFRAKLGD IDEPTAPFGL
LSVQGDGREI AEAARTLKPE LSGALRGHAR RLGVSAASMM HVAWGLVLSR TTGRQDVVFG
TVLFGRMQGG AQSDRALGLF INTLPVRMRV AQTGVEASVK GTHAQLAELM RHEHAPLVLA
QRCSGVPAQT PLFTSLLNYR YSKPKVAAAH IADGIELLDG HERTNYPVSV DIDDHGDDFK
IRAQAAASVD PARVCDFLEV ALTRLVDALE RDPHGALRQL DILPEVEREE VVRRWNAGEK
ARPSRLCLHE LFERQAARAP DAIAVIQDER ALTYAELNRC ANRLAHYLRA RGVRGGDRVA
LYARRSPELL IGMLATLKAG GAYVPLDPGY PAERLTHILL DSAPVVVLRD AAASDDVLVR
LNAGTLILDL HADDERWSAQ PSGNLKLCGS HEPDVGARRL AYVIYTSGST GAPKGVMVEH
ASVVNQIGAL TEYLELDASD RVLQFSNIAF DASVEEIFAT LTCGATLVLR TDRWLADAET
FWALCGAQRI SIVDLPAQFF GQLALSGRRA VPTGVRCVVI GGEAVGASAL DAWFAEEGRR
PRLFNTYGPT ETTVSVTVHE VRGRHDDANV IGRPIANTRV YVLDAWLRPA PIGVAGELYI
GGVQVARGYL NRPELTRERF IDDPFVAGGR LYRTGDLARW RTDGRLEYLG RNDFQVKIRG
FRIELGEIEA QLAKVAGVRE VVVLARDSAS EVHDSATEHA TPDALSPSPE TSTATATATA
TEKRLVAYYT GDADVVALRA QAAQHLPSYM VPSAYVRLDA WPLTPNGKLD RRALPAPADD
AYARAEYEAP RGAKEEALAA IWRELLHVER VSRHDNFFEL GGHSLLAVQL VSRLRQALSV
EVALSTVFDA PVLSALAERL EAGNTEVLPP IPLAPRDGRI ALSLAQQRLW FLTQLEGVSE
AYHMSGAVRL DGPLNREVLQ RALNRIVMRH EALRTCFVRE EGEPIQVIQP HADLTVSYHD
LREAESIRHE AGNREQRAKD LSQAHASAPF DLSRDLPVRV LLLQLADEAH VVQVVMHHIA
SDGWSVGVFL QELSALYGSF IAEQGDPLAP LPLQYADYAA WQRRWLASGQ LEKQGAFWQT
NLSGAPTLLE LPTDRPRPPK QSHAGASVEV KLGAALSERV KRLSQRHGVT PYMTLLSSWA
AVLSRLSGQE EVVIGSPVAG RNRTEVEALI GFFVNTLALR LDLSSEPTVG ELLKRTKAQV
LSAQAHQDLP FDQVVERVKP PRSTAHPPLF QVMFVWQNMP AGELTIPGLT IRAVETPLQT
AQFELTLSLQ EAGDDIVGHL NYASALFDES TVRRYVTYWC RLLEGMTAGA ADQTIVGLPL
LDEAERKQVV YAWNATERDY PIEQCIHQLF EAQVDRKPEA IALTFEGQRL SYAELNARAN
RLAHYLQGRG VGPGRLVALC AERGIEMVVG LLAILKAGGA YVPLDPAYAS DRLRGIVEDS
QPALVLADAV GRAALGELDG ALPVIDLETD ALRWREMPAT NPEVASQHVH HLAYVIYTSG
STGRPKGVMV EHAQVVRLFG ATQAWFGFDE RDVWTLFHSY GFDFSVWEMW GALLHGGRLV
IVPTEVTRTP SAFFALLCAE GVTVLNQTPS AFQALMSAQE EREEAAGNIE RANVVAHRLR
YVIFGGEALE PRTLASWYAR HGERTQLVNM YGITETTVHV TYYALRAEDA MRLGASPIGV
RIPDLQLYVL DARREPVPMG VTGELYVGGA GVARGYLNRP ELTRERFIDD PFVAGGRLYK
TGDLARWRTD GSLEYLGRND FQVKIRGFRI ELGEIEAQLA KVAGVREVVV LARDSAAEVR
DSATEHATPN ALSPSPETST ATAAATATAT ATATEKRLVA YYTGDADVAA LRAQAAQHLP
SYMVPSAYVR LDAWPLTPNG KLDRRALPAP ADDAYARAEY EAPQGAKEEA LAAIWRELLH
VERVSRHDNF FELGGHSLLA IGVIERMRRE GLHTDVRSIF NAQTLSDLAA RAQTDDRSIQ
APPNLIPARA TRITPDMLPL VALTQTQIDM LAVQVEGGAA NIQDIYPLAP LQEGMVFHHL
LHAESDAYME AYFVGFRTRA RLDRFLDALR MIVDRHDILR TGFFWEGLEQ PVQIVQRRVR
LPIEFVDLDP ADGDVLRQLE ARHDPRAHRL DIRRPALLSC HAAHDPAAGR WLLCVMAHHL
AIDNTSLKLL VAEEQAIEQG GFDALPPAPS FRNFIAQIAS GIDRREHEAF FSAMLGDIDS
PTHPFGLQDV QGDGREIAEF QQRLSPELSK AIRVCTRRLG VSPASLMHLA WAMVLSRATG
RREAVFGTVL FGRMQGGERG MGMFINTLPI RIDVDERYVA ECLAHTHERV VQLIYHEHAP
LALALRCSGL PAQQALFSSL LNYRHSEQAA RPPRDDDDIQ YLDGNERTNY PLTVSIDDLG
EAFSVTVQAR HPASPERIRA FMETALEQLV RALDGTSGVA APGVVMPRIA VCDIDVLPSE
ERHRLLVEWN DTAADYPQDQ CLHRLFEAQA ARHPDTIALI ADGEPVGYAE LNRRANRLAR
HLSARGLQPD QRVAICIDRG IDMVVAMLAV LKAGGAYVPL DPAYPSERLD YLLRDCAPVA
LLTHARLGAS MQTRLVLALA RLDTGCALID LESDAGAWRH ERDDDPPPSG LTPRHLAYVI
YTSGSTGQPK GVMVEHRSVC NLVAWHAGAF DVGTGCRSAS VAGVAFDATT WEVWAALCNG
GCLSLAPGDA ASDPQALLRW WRAQELDVGF LVTPLAELAY ATGQSNAGMR TLLIGGDRLS
RWPDSMPPGQ MLVNNYGPTE ATVVATSGRL QPGEATPPIG RPIANTRVYV LDAWLRPAPI
GVAGELYIGG VQVARGYLNR PELTRERFID DPFVAGGRLY KTGDLARWRT DGRLEYLGRN
DFQVKIRGFR IELGEIEAQL AKVADVREVV VLARDSAAEV HDSATEHATP NALSPSPETS
TATAAATATA TATATEKRLV AYYTGDADVA ALRAQAAQHL PSYMVPSAYV RLDAWPLTPN
GKLDRRALPA PADDAYARAE YEAPQGAKEE ALAAIWRELL HVERVSRHDN FFELGGHSLL
AVQLVSRLRQ ALSVEVALST VFDAPVLSAL ASRLDDNTAA VLPPIPLAPR DGRIALSLAQ
QRLWFLTQLE GVSEAYHMSG AVRLDGPLNR EVLQRALNRI VMRHEALRTC FVREEGEPIQ
VIQPHADLTM SYHDLREAES IRHEAGNREQ RAKNLSQAHA SAPFDLSRDL PVRVLLLQLA
DEAHVVQVVM HHIASDGWSV GVFLQELSAL YGSFIAEQGD PLAPLPLQYA DYAAWQRRWL
ASGQLEKQGA FWQTNLSGAP TLLELPTDRP RPPKQSHAGA SVEVKLGAAL SERVKRLSQR
HGVTPYMTLL SSWAAVLSRL SGQEEVVIGS PVAGRNRTEV EALIGFFVNT LALRLDLSSE
PTVGELLKRT KAQVLSAQAH QDLPFDQVVE RVKPPRSTAH PPLFQVMFDW HNTPARALTM
PGLTVSVAST ETTTSQYDLV LSMQERNGDI VGHLNYATAL FDEQTARRYA RYWRRLLEGM
TAGSANVSVA RLPLLDEAER EQVVHEWNAT ERAYPIRQCI HQLFEAQAAR TPNAIAIGDE
RVTYAALNAS ANRLARHLRA LGVVADTRVA VCIERGAPMV IALLAIWKAG GAYVPLDPAY
PRERIAYMLR DSAPIAVLTS RASRDLVASH LPDRAPLVVI DAAACPWDAL SGDDLDPNDI
ELNATHLCYV IYTSGSTGQP KGVMIEHRNL VNYTLDAIRW FGLGPGETVL QQNSLNFDLS
LEEIVPALSS GAALAPAVEL FGAGGSARGH SARPTMIHLT AAHWQQLVGE WHRAGARPAA
ALEGVRLVNV TGDALSPHKL EQWDAIRPAH TRLINTYGPT EITISCSAAY VRHAPGMSRV
SIGRPFANSR MYLLDARGEP VPVGVTGELY IGGDGVARGY LNRPELSAER FVDDPFRPGS
RMYKTGDLAC RRGDGEIEFV GRNDFQVKVR GFRVELSEVE TRLAAVDGVQ EIAVLAREDA
PGEKRLVAYY TGAAEMAALR ECAARDLPAY MMPAAYVCLP ALPLTPNGKL DRNALPPPAH
DADSNRGYEA PQGDIEETLA RIWEQLLERE RVGRHDNFFD LGGHSLLTVS LIERMRQADL
HADVAALFTT STLAELAACT TKLKEILL