Gene BBta_3701 details

Gene Information

Locus tag: BBta_3701
Symbol:
ID: 5155231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 3863604
End bp: 3875492
Gene Length: 11889 bp
Protein Length: 3962 aa
Translation table: 11
GC content: 68%
IMG OID: 640558540
Product: non-ribosomal peptide synthase
Protein accession: YP_001239686
Protein GI: 148255101
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
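
The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship. It assumes the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; neither is stated in the record, although both are consistent with its values. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3863604, 3875492

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 11889
print(protein_length_aa(length))  # 3962
```

Both results agree with the Gene Length and Protein Length fields of the record.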


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGATCTC GCGCAGCACA TCAAATCGGA TGCGGCACGA CCGGCCGGCC GTCAGATCGT 
GATGGAGCGG CTGTAATGTC CCTGATCGAA TCCGATCCGT CGGAGCTCGT CGACCGGCTG
GCCAGGGAAG GCTTCCTGCT TCGCCGCAAC GGCGAACAGC TCAGCCTGAT CAATCTGGCG
AACCGGCCGC TGCCCGACAC GCTGCGCGGC GAGATCGGCT CGCAGCGCGA GTCGATCATG
CTGCATCTGG CCGACCGGAT GTGGCCGCTG TCCTTCAACC AGCAGCGGAT GTGGTTCATC
GACCGACTGC AGCACGGGCA CAGCGTCGCC TACAACGTGC CGCTCGCGAC GCATATTCGA
GGTCCGCTCG ACGTTGGCGC GCTCAGCCAG GCGCTGACGA TGCTGGCGGC ACGCCACGCG
ATCCTGCGCA CCACCTTCAT CGAGCATGAG GGCGGGTCGG CTCAGCGCAT CGTCGCCCCG
GCTCCCGTCC AGCTCCACGT CGAGCCGATC AGTGAAAGCG CGTTGCCGCA GCGGCTCGAT
GCCGAAGCGC GGCGGCCGTT CGACCTCTCA ACCGGCCCAT GCTGGCGCGT CAATCTATTC
GCCCTGTCGC ACGACCATCA TGTGCTGCTG ATCAATCAGC ACCACATCAT CAATGACGGC
TGGTCGCTGG TCGTGATGCA GCGTGACCTC GCCGCCCTGT ATGATGGCGC GCGGCAGGGC
CGGCCGTCTG ATGCCGCCGA CCCGCCGCTG CAATTTGCCG AGTTCGCGCT GTGGCAGCGC
CGCGCGCTCC CGGCCGTGAT CGAGACCGCG CTGGACTATT GGCAGCGGCG GCTCGATGGA
TTCGCTCCGT TCGATCTGCC GGCCGATCGC CCGCGGCCGC AGCGGCTGAG TGGCCATGGC
CAGACTTACC GCTTCGCCTT GCCGCCGGCG CTCACCGATG CGCTCCGCGA CGCCGCGCTC
GGCCAATCGG TCACCCCCTG TGCCGGTTGG CTTGCGATCT TCGCGCTGCT GCTGTCGCGC
CTCGGCGGCC AGGACGACAT CGTCGTCGGC ATGCCGTTTG CCAATCGCCG CGACCCCGCG
GTCGCCGATC TGATCGGCTA TGTCGCCAAC ACGTTGCCGC TGCGCGTCGA TCTCACCGGG
AATCCCAGCT TTGCGGCCCT GATGACACGG GTCGCATCGG AATTGCTCGA CGCCGACGCA
CATCAGGAGG CGCCATTCGA CCTGATCGTC AACCGCATGG CTGGCGCGCG CGATCCCAGC
CGAAACCCGA TCTTCCAGGT CGCCTTCGTC TACCAGACCT TCGCCGACGG CCTTGCCAAT
CTGCGCCTGC CCGGCCTGAC CTGCGAGACT GCCTTCATCG ATACCGGCAC GGCCAAGTTC
GATCTCACGC TCACGCTCTC CGAGACGGCC GAGGGCATGG TCGCCGAGAT CGAATACAGC
ACCGATCTGT TCGACTCGTC GACGATCGAG CGCCTGACGG CGGAACTCCT GCATCTGGCA
GAGATTGCGA CGCGGGATTG CGACGCGGCG ATCGAGACGC TCGAGATCCT GCCGCCATGC
GAGCGCGCGG CGCTCATCGC CTTCGGCGCG TCGGAAACCG CGGCTCCGGA TGAGAGCACG
CTGGTGGATC TGTTTGCGCA GACAGTGGCT GCGCATGGCG AGCTCCTGGC GATTGCGGGC
GATCAGGCCC TGACGTATGC CGCGCTCGAC CGCCGATCCG ACGCGTTGGC CGCTCACATC
AGGGCGGGCA TGACCGCATC TGGCCTCGCC ATCGCCGATG GCGTGATCGG GGTCGCGGTT
CCGAAGAGCG CCGACCTGAT CGTTGCCTTC CTCGCGGTGC TTAAGGCAGG GGCTGCCTAT
CTGCCGCTCG CGCCCGATCT TCCGTCCGAG CGCCTGCGCT TCATGGTGGA GGACGCCGCG
CCACGCCTGA TCATCGTCAC GGATCAGACA TCCGGCCTCT TCGACAGCAT GCCGGTGCCT
CAGCTGCTGG TGGACATGCA GCATGAGGAC GCGCACGCCG ATGCGGCGGC CAGACCGCCG
CGCCGGCACG ATCTGGCTTA TGTCATCTAC ACGTCCGGTA CGACCGGGCG CCCGAAGGGC
GCATTGATCG AGCACGGCTC GCTGGCGAAT CTGGCGCGCG CACAGCGCGA TCTCTTCGGT
CTCGAGCCGG GCGCCCGGGT CCTGCTTTAC GTCGCCATGA GCTTCGACGT CTCGATCGGC
GCCATCGCCA CCGCGCTTGC TGCCGGTGCC ACGCTGCACC TCGTGCCACA ACGGCTGATG
GCCGAACCCG AGGCGATCGC GGCCATGATC CGCGATCACG CCATCGATCT CGTCGAGTTG
CCGGCCACCA TTGCCCAGCA GCTTCCAAGG CGGCAGGACG CTGCGCCCCG GACATTGGTG
ATCGGCGGCG AGGTCTGTCC GCAGGACGTG CTGGCCTATT GGTCCGGACA GTGCCGCGTC
ATCAATGCGT ATGGACCGAG CGAGACCGCG GTGCTCGCGA CCACCGACGC GATCTGCAAG
CCGATCACCC CGCATGTGAT CGGCCGGCCG ATCGCGAATG TGCAGGTCCG GATCCTGGAC
GGCAACGATC GGTTTTGTCC GATCGGCGTG CCCGGCGAGA TCTGCATTGG CGGCGCTGGC
GTGGCGCGTG GCTATCTCGG ACGCCCGGAC CTGACGGCGC GCCAGTTCAT GGCCGACCCG
ACGGGCCCAG GCCGGCTGTA TCGCAGCGGC GACATCGGGC GCTGGCGCAG CGATGGACGC
CTTGAATGGC TCGGCCGCGT CGATGAGCAG ATCAAGCTCA ACGGTCTGCG GATCGAACCC
GGCGAGATCG CGCGCGTGAT GGAGCAGCAC GAAGGTGTCA CCTCCGCCCA CGTCCTGGTT
GACCAGGACG GCCGCTCGTC GCGCTTGGTC GGCTTCTACG CCGCGGCCCC GGAGTTCGAC
GAACCGGCGC TGCGACATCA TCTGCGGGCA AGGCTGCCCG CTTATATGGT GCCGTCGCTG
CTGATGCGCC TCGACGCGCT GCCGGTCGGC CCCAATGGCA AGCTCGACCC GCGCGCGCTG
CCGCGCCCGC AGGCGGAGGT GATGCCGGAG GGGCGGACGC CGCGTACGCC GCTTGAGACG
ACGTTGGCGA GCATTTGGTG CAGCGTGCTG CGGCTCGAAC GTATCAGCAT CGATGACGAG
TTCTTCGCGC TCGGCGGCGA CTCCATCATG ACTATCCAGG TGTGCGCCGC CGCCCGCGCC
GTCGGCATCG CCCTGACAGC GCAGGCGATG TTTGAAAATC CCACCATCGC CCGGCTGGCC
GAGGCGGTCT CGCGCACGCC GGCAGCAATT GTCGTGCCGC CCGCTGAGCC CGAGCAGACG
GATGCGTCGA TCGACATCCC CCCTGACGTG CTCGACACCT TGTGGAGCGT CGGGCGCGTC
GAAGCCGTCT ACGGACTGTC GCCGCTGCAG GAGGGGCTGC TGTTCCATGC GCTCTATGCG
CCAGGCTCCG ACCAATATTG CGTGCAGCTG TCGTGGCGCC ATCAAGGACC GCTGGATCCC
GCGGCGCTGC GTCTCGCTTG GGAAGGCGTC ATCGCGCGGC ACGGCATCCT GCGCACCGGC
TTCGTGTGGG AGGGCGTGAG TCGGCCGTTG CAGGTGGTCT ATGCAACTGC TCCGCTCGAT
TGGAACCAAG CCGATTGGCG CGGCGACGAT GCCGCCAGGC TGGACACGTT CTTGTCCGAA
GATCGTCGCC GCGGTTTCGC ACTCGGTCGG CCGGGGCTGA TGCGGCTGCG ACTGATGCAG
CTGACTGACG ACAGCTGGGC CATCGTTTGG ACGACGCATC ATCTGCTGAT GGACGGCTGG
TGCCTGCCGC TGATCCTGCA GGAGGTGGCG CTGCGCTACC GCCAGGCCTG CGGCGAGGCC
GTCGCGCTGC CGCCAGCGCC GCCGCCGTTC GAGCGCCATA TCGCCTGGCT GGCGCGGCAG
GACCGCGACC TTGCACAGGC GTTCTGGCGC GCGCATCTCG CCGGCATCGA GGCGCCGACA
CCGCTGGCGA TCAACCACCG GCCGCTCGAC ATCCGTCGCC CGATCGAACG GCTGCAGCGA
GAGTCCGTGC TGCTCGACAA GACGCGCAGC GCGGCGCTGG CCGCGTTCGC GCGCCAACAC
CGGATCACGC TCAACACGCT GATCGAGCTG GCCTGGGCTG CCCTGCTGTC GCGCTATAGC
GGGCATGACG ATGTCGTGTT CGGCATTGCC GTGTCGGGCC GGCCGCCCGA ACTGCCGCAG
GTCGAACGGA TGATCGGCCT GTTCATCAAT GCCGTGCCGT TGCGGCTTTC GCTCGACCAG
GACCGCAGCA CCCTCGATCA TCTGCATGCA GTACAGGCCG CGACCCGCGG TGCCGAGGCG
CATGCGACGG TCGGACTCGC CACGATTCAA GGCTGGAGCA AGCTCGAATC CGGCAGCGCC
TTGTTCCAGT CGCTGCTGGT GTTCGAAAGC TATCCCGGCG GTCTTCCAGC CGAAGTAACC
GAGCTGCGCA TCGACGAGAA GACCAACTAC CCGCTCACTT TGGCCGTGCT CCCGGGCACC
GCGATCGAGC TGCGCGTGCT GTACGATGCC GACAGTTTTA CGGCCGCTGC GGTCGCTCAA
TTGTGCGCGC ATCTCGACCG CACCATCACC TGGCTGACGC AGCATGCGGC ACGGCCGCTC
GGCGAGCTCG ACTTCCTCTC CGTGGCCGAG CGCCACGATT TGCTCATGGC CTGGAACGAC
CGCTGGGTCC CCTACCCGCG CGATGCCACC TTGCATGGAT TGTTTGCCGA GATCGCGCGC
CGCCACCCGT CGGCGACCGC AGTGGTCGAA GGCCATCGCC ACATCGACTT CGGCACGCTC
GATCGAACTG CCAACCGCTT GGCGCACCGC ATCGTCGCGA GCCCCGCCCC ATCCGGTGGG
ATCGCGCCGG GCCGCCCAAT CGCGCTGTGC TGTGGCCGCA CCATCGAGAT GGTGATCGCC
ATTCTCGCGA TCCTCAAGGC CGGCGGCGCC TGGGTGCCGC TCGACCCGGA TTATCCGGCC
GAGCGGCTGA GATTCATGAT CGAGGACAGC GCCGCCGAGC TCGTGCTGGC GAGCCCCAAG
GCGGCCCGCG ACGTCGCCGT CCTGCAGTCG CCGCAGCGCC TGCTGCTGAT CGTCGAGCCG
ACCGATGGGA GCGGCGATGA TCGTCCGCCT CCCGCCACAA CGGGACCTGC TGACGCGGCT
TACGTGATCT ACACCTCGGG CTCGACCGGG CGTCCCAAGG GTGTCGCCTG CGTGCACCGC
GCCGTCATCA ACTTCTGCCA CGAATGGCAG AGCAAGCGCG CGATCGCGCC AGGCGATGCG
GGAACGCTGA CCAGCAGCCT CAGCTTCGAC GTGTCCGTCT ATGAGATCTT CTCCAACCTG
TTGTTCGGCG CCGCGGTCCA TCTGCTCGAC AAGGACACCA TTCTCGATGC TGATCGCTTC
GCGCGCTATC TGCGCGACCA GCGGATCCAG AACTGCTATC TGCCGCCGCA TCTGCTGACC
GCCGTCGCGT CGCTGGTCGC GGCGGACGGG GCGAACTATG CCCTCAAGCG GCTGATGGTC
GGCGTCGAGC CGCCGCTCGA ACAGGCGATG TGGCGCATCA AGCAGGCCGT GCCTGGAGCT
GCCGTCGTGA ACGGATACGG CACCACCGAG ACGACGATCG GCTCGATCGC CTATTATGTC
GAGCGCGACA CCGGGCGCAG CGGCAACGCG CCGATCGGCG TGCCCTTCCA GAACCAGACC
GCCTATCTGC TCGATAAGCG GCTGCGGCCG GTGCCGCTCG GCGCCATCGG CGAGATCTAT
ATCGGCGGCG ACGGTGTCTC GGCCGGCTAC CTCAACCGCC CGGAGCTCAC CGCCGAGCGG
TTCATGGACA ATCCATTCCA GTCGCTCCCT GACCGGCTGG CCGGGCGCAA CGCCAGGATC
TATCGCACCG GCGATCTCGG CCGGATGCTG CCCGACCGCC AGCTCGAATG CCTCGGCCGC
ATCGATACCC AGATCAAGAT CCGCGGCTAT CGGGTCGAAC CCAGCGAAAT CGAGGCGGTC
ATCGCCGCCT GCCCCGGCGT GACACAGAGC GCGGTCATCG TCGTCGACAG CGGCGCGGCA
AGGCGGCTCG TCGGCTATTA CGCGGCCCCG TCCGGACAGC CTGATGAGCA GACGGTGCGG
GCGCGGCTCG CTGCGCTGCT GCCGCCCTAT ATGGTGCCTG CGGTTCTGAT GCGACTCGAC
CGGCTCCAAC TGTCGCCGAA CGGCAAGATC GACCGCAGGG CGCTTCCGCT GCCCGCGAGC
GCTACGCCTG ACAAGCGCGA CCTCAGCGCA CCGCGCGACG ACACCGAGGC CGCCCTCGCC
GCGGTGTGGC GCACCGTGCT GAAGCTCGAC CAGATCGGCA TCGACGAGGA TTTCTTCGCG
CTGGGCGGCG ACTCCATTCT CACCATCCAG GTGATCGCCC GGGCGCGCCA GATGGGATTG
GAGCTGAGCG CCAGTCAGCT GTTTGCCGCC CCGACCATCC GCGCGCTCGC CGCCAGCCTG
CGGCGCGAGG CGCCGGAGGC CCTGGTTGTC ATTGATGACG ACGACGCCGA TGCCAGCCGC
GCACCGCTCG CGCCGATCCA GCAATGGTTC TTCGCGATCG GCCATTCCGA CCTCAACCAT
TGGAATCAGG CGTTCCTGTT CCACGTCGAT GCCAGCGTCA CAATCGCGCG CATCGCTCAC
GCGCTGTCCC AGCTTGGCGA ACTGCACCCG GCGTTCGCGA GCCGCTTCGT CAGCCATGCC
GACAAGGGTT GGCGCGAGAC GACCGGCCCC GCGGCGTGGC CGGTCGAGGC GTTCGACCTC
ACCGGTTGCG CCGACGCTGC GGCGGCACTT CGGGCGGAAG CCGATCGCGT TCAGGCGCGG
CTCGACATCA GCCACGGGCC GCTGGCGCGC GCGACGCTGT TCACCGGCCA TCCCGACCGG
CGCCCGCGGC TGCTGCTCGC CGTCCACCAT CTCATCATCG ACGGCGTATC CTGGCGCATC
CTGCTCGAAG ACCTTGAATG CCTGTTGGCC GGCGGAACGC CGGCCAGACC TGCAGTGCGC
TTTACATCCT GGCGCAGCGC GCTGGCCCGT TACGCCCGCA CCGCTGCGGT CGCGCAATGG
GATTACTGGC TCGCCGTCGG CGCCAACGCC TCTGCGCTGC CGCTCGACAG CAGCGAGGCC
GATCCAGGCT GTGGCGAAGA CGCTGACCGC ATCTGCTTCA CGCTCGGCAC CGACACCACG
GGACGGCTGC TGACCCGGGC CGGCCCAGCC TATCGCACCC AGATCAACGA TCTGCTGCTG
ACCGCACTGG CGCTGGCAGT GCGCAGCTCC ACCGGACTCG CCGATGTGAT CGTCGATCTC
GAAGGTCACG GGCGCGAGGA GTGCGTGAGC GCACGCGACG TGTCGCGCAC GGTCGGCTGG
TTCACCTCGA TGTTCCCCGT CCATCTCACC CTGCCGGCCG GCAGCGACAT CGACGCGGCC
ATCAAGGCAA TCAAGGAAAC CTTGCGCGCC GTTCCCGACA AGGGCGTCGG CTATGGCGCG
CTGCGTTTTC TGTCTGACGA TCCCCGTCAT GCGGCGCTCG CTGCCGGTGC CAAGGCGCGC
ATCGGCTTCA ACTATCTCGG CCGCTTCGAT GCGCTGAGCG GGCGCTACAT CACCCTGTCT
GAGGAGTCCG CCGGCACCGC GGTCTCGCCG CGCAACAGGC TGATCCATCC GATCGAGATC
AGTGCCTATG TGTCCGATGG CGCGCTCAAA GTGACCATCG AATACAGCGG CAGGTGCTGT
GCAAGCAGCC GCGCCCAGCG CCTTGCGGAT GCGTTTGCGA CCGCCCTCGG CGATGTCGTC
AAGCATTGTG TGAGCGGCGC TGGCGGCCTG ACGCCGAGCG ACTGCCCGCT CGCGCCGGAT
GTGAGCCAGG CCACGCTCGA CGGGCTCGCG CGCGCCGGCC GGATCGACAC CGTCTACGGC
CTCGCGCCAT TGCAGGAAGG CTTGTTGTTC CACGCACTCT ACGCACCGGA CTCCGATCAG
TATTGCGTGC AACTGTCGTG GCGCCATCAG GGTCAACTCG ACACCGCTGC GCTGCGCCGC
GCCTGGCAAG GGATCATCGA TCGGCATGGC ATCCTGCGCG CCGCCTTCAT CTGGGAAGGC
GTGACGCGGC CGCTGCAGGC GATCTATGAA AACGTTCCGC TCGACTGGGA GGAAGCGGAC
TGGCGCGGCG GCCCAGACGA TCAGCTCGAC GCCTATCTGG CCGAGGACCG CCGCCGCGGC
TTCAGGCTCG ACCGACCCGG CCTGACACGG TTGCGGCTGA TGCGGCTGCG GGACGATGGC
TGGGCGGTGG TTTGGACGAC GCATCATCTG CTGATGGACG GCTGGTGCCT GCCGCTGATC
CTGCAGGAGG TCGCGCTGCG CTATCGCCAG GCCTGTGGCG AGGCGATCGA GCTGCCGCCC
GCGCCGCCGC CATTCGAGCG CCACATCGCC TGGCTCGCCC GGCAGGACCG CAACCAGGCG
CAGGCGTTCT GGCGCGCCCA TCTCGCCGGC ATCGAGGCGC CGACTCCGCT CGCGATCAAT
CACCGGCCGC TCGACATGCG ACGCCCGATC GAGCGCATGG AGCGACGCCG GCTGCTGCTC
GATCGCACGG ACAGCGCGGC GCTTGCAACA TTCGCACGGC AGCACAAGGT CACGCTCAAC
ACGCTGGTCG AGCTCGCTCT GGCCGCCGTG TTGTCCCGTT ACAGCGGCCA GGATGAGGTC
GTGCTCGGCA TCGCCGTCGC CGGGCGACCA GGCGAGCTGC CGCAGGTCGA GCGCATGATC
GGCCTGTTCA TCAACACCGT CCCGCTGCGG CTGTCGCTCG ATGCCGAACG CAGCGTGCTC
GAGAACCTGC ATGCGGTCCA AGCCGCCACG CGCAGCGTGA TCCAGCACGG CTATCTGTCG
CTGACCGAGA TCCAGGCGCA GAGCGCGGTG CCGAACGGCA CGCCGTTGTT CCAGCAATTG
CTGGTGTTCG AAAACTATCC CGACGACGGC ATCGGCGCCG CGCCGGCCGG CGGCTATCTC
GACATGCGCG GCGACATCAA GACCAGCTTC CCGCTGACGC TGGTGGTCGT GCCCGCTGCC
GAGCTTCTGG TGCAGGCCTC CTACGACGCG GCCTGTTTCG ACGAAACGGT GATCGCGCGA
CTGCTCGGCC ATCTCTCCGA GACGCTGCGA TGGCTGGCCG CCAATCCCGA GCGGCCGCTC
GCGGACGCCG CCCTGCTGAC CGAAGCGGAA CGGCGCGCGG CCCTTGCGGC GGCCGTGACG
GCGGTGCCCT ATCCGCGCGA CATCTCGCTT GCGGACATGT TCGAGGCCGT CGCGCGCCGC
CAGCCGCAAC GCGCGGCCGT GATGCACGGC GACAGCGCCA TCGCGTTCGG CGAACTCAAC
GTCCAGGCCA ATCGGCTCGC GCACCGTCTC CGCAAGCTCG GCGTGCGCGC GGAGACCGCG
GTCGGCATCA GCATCGAACG CTCGATCCCG CTGATCGTCG GCCTGATGGG CATCCTCAAG
GCCGGGGGCG CCTATGTGCC GCTCGAACCC GACGTGCCCG ATGACCGGCT GCAATTCATG
CTCGCGGATT CGCAGGCGCC GGTGCTGGTC ACCACGGCGG CCCTGGCCAA CAAGTTTCCG
CAGTTCACGG GCGAGGTGAT CGCGCTCGAC GATCCCATGC TCGACAGCGA GACGGCATGC
GATCCCACGC GGGAGGCCGT CGCGGATCCC CTGCTCTACA TCGCCTATAC GTCCGGCTCG
ACCGGACGGC CGAAGGGCGT GATGGTCCAG CAATCGACGG TGCTCAACCG CTTTCACTGG
CTGTGGCGGT CCTTGCCGCT CGCCGACGAC GAGGTCGGCT CGCAGATCAG CTCGATCAAC
TTCGTCGACG CGGTCTGGGA GGTCTTCAGC CGGCTCGCGC GCGGCATTCC CTTCGTCGTG
TGCTCCGACG AGGTGGTCCG CGATCCCCAG CGCATGGTCG ACGCGCTCGC TCGCCATCGC
GTGACCCGGC TCGAGCCGGT GCCGTCGCTG CTGGCCTCGC TGCTCGACAA CGTGCCTGAC
ATCGCCGAGC GGCTGCCGCA TCTGCGCTAT TGCATCTGCT CCGGCGAGAT CCTGCCAGTC
GAGCTCGCCC GGCGCTTCCG TGCGACCATG CCTGCGGTGC GGCTGTTCAA CCGGTACGGT
TCGACCGAGG CGACCTCGGT GCTGTGGCAG GAGGTCGTGA ACACCGAGGC CTATGGCGCC
AACGTGCCGG TCGGCCATCC GGTGCAGAAC GTCGGCATCT GCATCCTCGA CCGCCGACGG
CGGCCGCTGC CGCATGGCAT CGCGGGATCG CTCTATGTCT ACGGCGATGC CGTCGCGCGC
GGCTATCATG GCCGTCCTGA TCTCACCGCT GAGCGCTTTG TCACGCTGCC GCTGATCACG
CAGGAGGTGA GAGCCTACTA CACCGGCGAC CTCGCCCGTC AGCGCGCCGA CGGCAGCATC
GAGGTGCTCG GCCGCGACGA CAATCAGCTG TCGATCCATG GCTACCGCAT CGAGCCGGGC
GAGATCGAGA CCGCGCTCGG CCGCCTCGCC GGCATCCGCG ACTGCGTAGC CGTGGTGCGC
GACATCGGCG GCAGCCGCCA GCTCGTCGCC TTCTACGCCG AGGCCGACGA CGCCGGGACC
GCACTGAGCC CGCAGGCATT GCGCAACCAC TTGGCCGGGC AATTGCCGGC CTACATGGTG
CCGTCGCTGT TCGTGAAACT CGCGGCGCTG CCGCTGACCA TCAACGGCAA GGTCGATCGC
AAATCGCTGG TCGCGCGCGA CCTCGAGATC GGCCCGCTCG ACAGCGATGC GCTGCTGCCA
CGCGACGCGA CCGAGCGCCG GCTGCATGAC CTCTGGGCCC AGGTCATCGG CATCGACCAT
TTCGGGGTCG AGGACGACTT CTTCGCCGTC GGCGGTCACT CCCTGCTCGC CGTCCGCCTT
GCTACGCGCA TCAACACGAC GTTCCGTCGC AGCTTCCCCG TCGCGTGGAT ATTCAGCACG
CGCACCATCG CCGCGCAGGC CGCGGCGCTG CGCAAGGACG GCGCCGGCGC CGATTTCGAG
CCCACCGTGA TGCTCAAGCG CGGCCGCCGC ATCCTGTTCC TTGTCCATCC TGGGCATGCC
GGCGCCGAGG CCTATTCGGC GCTGGCGCCG CTGCTGCAGG ATGATCTTGC GATCTGCGCG
GTCGAGTCCT GGAACCTGTA TGGCAACGAT CAGCCCGTCA CCGGCATTCC CGCGCTGGCG
CGGCGCTATC TCGCAGCGAT CAGGACCGTG CAGCCGACGG GACCTTATCT GCTCGGCGGC
TGGTCGTTCG GCGGCAGCGT CGCCTATGAG ATCAGCTGCC AGCTCGCCGC CGCCGGCGAG
CGCGTCGAGC AATTGATCCT GCTCGACTCC TTCGGCCCGC ATGACGGACG CCAGGCCTTC
GTGGCCGCGT TCGACGCAGC CGCGCGTGCG GCCTTTCTCG ACGTGCCGTT TTATCGCGAC
ATGCCGGACG AGATTCGCGA CCGCATGGCG CGCGTGATCC GCATGGAGAA CGCCATGCTC
GCCGACTTCC GCCCGCAGCG ATATGACGGC CCGGTGCTGC TGCTGAAGGC CAACGAAGCC
GAGACGCCGC CGGAGCCGTG CGTCGCGGCG TATCCCTCCG GCTTTCTCGA CATGCTGCAA
GCCACTCAGG CGGCGCCCGA CAATTTCTGG GGACGCGTGG CCAGCCGGCT CATCGTGCGG
CCGGTCGCCG GCACGCATGG CGGGCTGATG AGCGGCGCTG CCGTGGCCGA GATCGCCGCG
ATCCTTTCGC AGACCTGTCT TGAACCATCA AACATAGAGG ATTCGTCCAT CATGGGATCA
GCGCCATGA
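
The gene sequence is printed in space-separated blocks of ten bases, so the blocks have to be joined before the sequence can be analysed. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first line of the sequence as sample input, and the helper names are hypothetical.

```python
def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used here as a small sample.
first_line = "GTGCGATCTC GCGCAGCACA TCAAATCGGA TGCGGCACGA CCGGCCGGCC GTCAGATCGT"

seq = parse_blocks(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.2f}")  # 0.65
```

The 0.65 printed here applies to the 60-base sample only. The 68% GC content in the record refers to the gene as a whole, which could be checked by passing the complete sequence text to the same two functions.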
 
Protein sequence
MRSRAAHQIG CGTTGRPSDR DGAAVMSLIE SDPSELVDRL AREGFLLRRN GEQLSLINLA 
NRPLPDTLRG EIGSQRESIM LHLADRMWPL SFNQQRMWFI DRLQHGHSVA YNVPLATHIR
GPLDVGALSQ ALTMLAARHA ILRTTFIEHE GGSAQRIVAP APVQLHVEPI SESALPQRLD
AEARRPFDLS TGPCWRVNLF ALSHDHHVLL INQHHIINDG WSLVVMQRDL AALYDGARQG
RPSDAADPPL QFAEFALWQR RALPAVIETA LDYWQRRLDG FAPFDLPADR PRPQRLSGHG
QTYRFALPPA LTDALRDAAL GQSVTPCAGW LAIFALLLSR LGGQDDIVVG MPFANRRDPA
VADLIGYVAN TLPLRVDLTG NPSFAALMTR VASELLDADA HQEAPFDLIV NRMAGARDPS
RNPIFQVAFV YQTFADGLAN LRLPGLTCET AFIDTGTAKF DLTLTLSETA EGMVAEIEYS
TDLFDSSTIE RLTAELLHLA EIATRDCDAA IETLEILPPC ERAALIAFGA SETAAPDEST
LVDLFAQTVA AHGELLAIAG DQALTYAALD RRSDALAAHI RAGMTASGLA IADGVIGVAV
PKSADLIVAF LAVLKAGAAY LPLAPDLPSE RLRFMVEDAA PRLIIVTDQT SGLFDSMPVP
QLLVDMQHED AHADAAARPP RRHDLAYVIY TSGTTGRPKG ALIEHGSLAN LARAQRDLFG
LEPGARVLLY VAMSFDVSIG AIATALAAGA TLHLVPQRLM AEPEAIAAMI RDHAIDLVEL
PATIAQQLPR RQDAAPRTLV IGGEVCPQDV LAYWSGQCRV INAYGPSETA VLATTDAICK
PITPHVIGRP IANVQVRILD GNDRFCPIGV PGEICIGGAG VARGYLGRPD LTARQFMADP
TGPGRLYRSG DIGRWRSDGR LEWLGRVDEQ IKLNGLRIEP GEIARVMEQH EGVTSAHVLV
DQDGRSSRLV GFYAAAPEFD EPALRHHLRA RLPAYMVPSL LMRLDALPVG PNGKLDPRAL
PRPQAEVMPE GRTPRTPLET TLASIWCSVL RLERISIDDE FFALGGDSIM TIQVCAAARA
VGIALTAQAM FENPTIARLA EAVSRTPAAI VVPPAEPEQT DASIDIPPDV LDTLWSVGRV
EAVYGLSPLQ EGLLFHALYA PGSDQYCVQL SWRHQGPLDP AALRLAWEGV IARHGILRTG
FVWEGVSRPL QVVYATAPLD WNQADWRGDD AARLDTFLSE DRRRGFALGR PGLMRLRLMQ
LTDDSWAIVW TTHHLLMDGW CLPLILQEVA LRYRQACGEA VALPPAPPPF ERHIAWLARQ
DRDLAQAFWR AHLAGIEAPT PLAINHRPLD IRRPIERLQR ESVLLDKTRS AALAAFARQH
RITLNTLIEL AWAALLSRYS GHDDVVFGIA VSGRPPELPQ VERMIGLFIN AVPLRLSLDQ
DRSTLDHLHA VQAATRGAEA HATVGLATIQ GWSKLESGSA LFQSLLVFES YPGGLPAEVT
ELRIDEKTNY PLTLAVLPGT AIELRVLYDA DSFTAAAVAQ LCAHLDRTIT WLTQHAARPL
GELDFLSVAE RHDLLMAWND RWVPYPRDAT LHGLFAEIAR RHPSATAVVE GHRHIDFGTL
DRTANRLAHR IVASPAPSGG IAPGRPIALC CGRTIEMVIA ILAILKAGGA WVPLDPDYPA
ERLRFMIEDS AAELVLASPK AARDVAVLQS PQRLLLIVEP TDGSGDDRPP PATTGPADAA
YVIYTSGSTG RPKGVACVHR AVINFCHEWQ SKRAIAPGDA GTLTSSLSFD VSVYEIFSNL
LFGAAVHLLD KDTILDADRF ARYLRDQRIQ NCYLPPHLLT AVASLVAADG ANYALKRLMV
GVEPPLEQAM WRIKQAVPGA AVVNGYGTTE TTIGSIAYYV ERDTGRSGNA PIGVPFQNQT
AYLLDKRLRP VPLGAIGEIY IGGDGVSAGY LNRPELTAER FMDNPFQSLP DRLAGRNARI
YRTGDLGRML PDRQLECLGR IDTQIKIRGY RVEPSEIEAV IAACPGVTQS AVIVVDSGAA
RRLVGYYAAP SGQPDEQTVR ARLAALLPPY MVPAVLMRLD RLQLSPNGKI DRRALPLPAS
ATPDKRDLSA PRDDTEAALA AVWRTVLKLD QIGIDEDFFA LGGDSILTIQ VIARARQMGL
ELSASQLFAA PTIRALAASL RREAPEALVV IDDDDADASR APLAPIQQWF FAIGHSDLNH
WNQAFLFHVD ASVTIARIAH ALSQLGELHP AFASRFVSHA DKGWRETTGP AAWPVEAFDL
TGCADAAAAL RAEADRVQAR LDISHGPLAR ATLFTGHPDR RPRLLLAVHH LIIDGVSWRI
LLEDLECLLA GGTPARPAVR FTSWRSALAR YARTAAVAQW DYWLAVGANA SALPLDSSEA
DPGCGEDADR ICFTLGTDTT GRLLTRAGPA YRTQINDLLL TALALAVRSS TGLADVIVDL
EGHGREECVS ARDVSRTVGW FTSMFPVHLT LPAGSDIDAA IKAIKETLRA VPDKGVGYGA
LRFLSDDPRH AALAAGAKAR IGFNYLGRFD ALSGRYITLS EESAGTAVSP RNRLIHPIEI
SAYVSDGALK VTIEYSGRCC ASSRAQRLAD AFATALGDVV KHCVSGAGGL TPSDCPLAPD
VSQATLDGLA RAGRIDTVYG LAPLQEGLLF HALYAPDSDQ YCVQLSWRHQ GQLDTAALRR
AWQGIIDRHG ILRAAFIWEG VTRPLQAIYE NVPLDWEEAD WRGGPDDQLD AYLAEDRRRG
FRLDRPGLTR LRLMRLRDDG WAVVWTTHHL LMDGWCLPLI LQEVALRYRQ ACGEAIELPP
APPPFERHIA WLARQDRNQA QAFWRAHLAG IEAPTPLAIN HRPLDMRRPI ERMERRRLLL
DRTDSAALAT FARQHKVTLN TLVELALAAV LSRYSGQDEV VLGIAVAGRP GELPQVERMI
GLFINTVPLR LSLDAERSVL ENLHAVQAAT RSVIQHGYLS LTEIQAQSAV PNGTPLFQQL
LVFENYPDDG IGAAPAGGYL DMRGDIKTSF PLTLVVVPAA ELLVQASYDA ACFDETVIAR
LLGHLSETLR WLAANPERPL ADAALLTEAE RRAALAAAVT AVPYPRDISL ADMFEAVARR
QPQRAAVMHG DSAIAFGELN VQANRLAHRL RKLGVRAETA VGISIERSIP LIVGLMGILK
AGGAYVPLEP DVPDDRLQFM LADSQAPVLV TTAALANKFP QFTGEVIALD DPMLDSETAC
DPTREAVADP LLYIAYTSGS TGRPKGVMVQ QSTVLNRFHW LWRSLPLADD EVGSQISSIN
FVDAVWEVFS RLARGIPFVV CSDEVVRDPQ RMVDALARHR VTRLEPVPSL LASLLDNVPD
IAERLPHLRY CICSGEILPV ELARRFRATM PAVRLFNRYG STEATSVLWQ EVVNTEAYGA
NVPVGHPVQN VGICILDRRR RPLPHGIAGS LYVYGDAVAR GYHGRPDLTA ERFVTLPLIT
QEVRAYYTGD LARQRADGSI EVLGRDDNQL SIHGYRIEPG EIETALGRLA GIRDCVAVVR
DIGGSRQLVA FYAEADDAGT ALSPQALRNH LAGQLPAYMV PSLFVKLAAL PLTINGKVDR
KSLVARDLEI GPLDSDALLP RDATERRLHD LWAQVIGIDH FGVEDDFFAV GGHSLLAVRL
ATRINTTFRR SFPVAWIFST RTIAAQAAAL RKDGAGADFE PTVMLKRGRR ILFLVHPGHA
GAEAYSALAP LLQDDLAICA VESWNLYGND QPVTGIPALA RRYLAAIRTV QPTGPYLLGG
WSFGGSVAYE ISCQLAAAGE RVEQLILLDS FGPHDGRQAF VAAFDAAARA AFLDVPFYRD
MPDEIRDRMA RVIRMENAML ADFRPQRYDG PVLLLKANEA ETPPEPCVAA YPSGFLDMLQ
ATQAAPDNFW GRVASRLIVR PVAGTHGGLM SGAAVAEIAA ILSQTCLEPS NIEDSSIMGS
AP