Gene BBta_6813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_6813 
SymbolsypC 
ID5152868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp7099581 
End bp7112732 
Gene Length13152 bp 
Protein Length4383 aa 
Translation table11 
GC content68% 
IMG OID640561496 
Productarthrofactin synthetase/syringopeptin synthetase C-related non-ribosomal peptide synthetase 
Protein accessionYP_001242608 
Protein GI148258023 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTCC ACACGACAGA GCTATATCGG TCGAGCCTTT CGCAGCGCGA ACAGTTGGTT 
CGGCTGGCAC GGTCAAAAGG CTTAAAGAGT CGAAATGCAC TGCCGCCGAT AATTTCGACC
GAACGAATTG GAGTTATCCC TCTGTCGTTT GCGCAGCAGC GGCTGTGGTT TCTGTCTCGT
TTCGAGGAAG CGAGCACGGC CTATCACATA GCCGCTGGAT TGCGGCTTAT CGGGAGGCTT
GAGCGAGAGC CGCTGGTGCG CGCGCTGGAC CGGATCGTGG CGCGGCATGA GGCGCTGCGA
ACGGTGTTTG TGCAAGGTGA TGATGGCGTG CCGGTGCAGC AGGTTGCTGC GGCGGAGGCG
GGCTTTGCCT TGAGCGAGCA CGACCTCCGG AGCATGGCGG ATGCGGAAGG CGAGTTGCAG
CGACTGGTGG CTGTGGAAGC GACCGCGCCG TTCGATCTTG AGCGTGGTCC GCCGATCCGG
GGCCGGCTGG TGCGATTGGA TGCCGACGAG CACGTGCTGC TGGTGACGAT GCACCACATC
GTGTCGGATG GCTGGTCGAT GAGCGTGCTA ACGCGTGAGC TGAGCCTGCT GTATGCGGCG
TTCGCGCGTG GCGAGACCGA TCCGCTGCCG GCGTTGGCGA TCCAATACGG CGACTACGCG
GTCTGGCAGC GGCGGTGGCT GTCGGGTGAG GGGCTGGCGC GCCAGGGCGC TTATTGGAAG
GAGGCGCTTG CGGGCGCGCC TGCGCTTCTG GAACTGCCCT GGGATCGTCC GCGTCCGCCG
GAGCAGGACC ATGCCGGGGC CGTGGTGCCG GTGCGGCTGG ATGCGAGGCT GACCAGCGCA
TTGAAGGCGC TGAGCCATCA CCACGGCACG ACGTTATACA TGACGCTGCT GGCAGGTTGG
GCAGCACTGC TGTCGCGGTT GTCGGGGCAG GAGGATGTGG TGATCGGCAG CCCGGTTGCG
AACCGTGGAC GAGCCGAGAT CGAGGCGCTG ATCGGATTCT TCGTGAACAC GCTGGCGCTG
CGTGTCGATC TGTCCGGCTC GCCGTCGGTG AGTGAGCTTC TGGCACGGGT GAAGGCGCGG
ACGGTGGCGG CGCAAGAGCA TCAGGATCTG CCGTTCGAGC AGGTGGTGGA GCTGTTGCAG
CCGTCGCGGA GCCTGAGCCA CGCGCCGCTG TTCCAGGTGG CGCTGGCTTG GCAGGACATG
TTGGCAGGAC GTCTCGACCT CGGGGAACTG CGGCTGGAAG CGGTGGAGGT GCCGCGTGTG
AGCGCGCAGT TCGACCTGAC GCTGAGCTTG GCGGAGGCAG GTGGGACGAT CAGCGGCGGG
CTGGAATATG CGACCGCGCT GTTCGATCGC GAGACGATCG AGCGCTGGGT CGGGTACCTT
GTCCGGCTGC TGGACGGGAT GGTGACGGAC GAGCATGCGG CGGTGGATCG TTTGCCGTTG
CAGGATGACG CGGAGCGTCA TCGTGTCGTG GTGGAGTGGA ACGCGACGGC GGCGGATTAT
CCGCAGGATG TGTGCGTGCA CGAGCTGTTC GAGGCGCAGG CCGAGCGGAC GCCGGATGCG
ATTGCGTTGG TGCATGAGGA CGAACGGCTG AGCTATGCGG AGCTGAACAC CAAGGCGAAC
CGGCTGGCGC ATCATCTGTG CAAGCTCGGC GTCAGGCCGG ACGATCGTGT GGCGAGCTGC
ATCGCGCGCA GCCCGGAGAT GATCGTGGGT CTGCTGGCGA TCCTGAAGGC GGGCGGGGCC
TATGTGCCGC TGGACCCGGC CTATCCGCCG GAGCGGCTGG CGTTCATGGT TCATGACAGT
GCGCCGGCGG CGCTGCTGGT GGGTGGCGGC GCGCTGGATG TGCTGCCGGT CGTGGAGGCC
GAGCTTGCGG CCAGCGGCGT GCCGGTGCTC GACATCGGCG CGGATGCGGC GCAGTGGGTC
GATGCGCCCG CGCGCAATCC GGAGCGGAGC GAAGTGGGCC TTGCGCCTGA TCATCTCGCT
TATGTGATCT ACACCTCCGG ATCGACCGGT CAGCCCAAGG GGGTGATGGT CGAGCACCGC
GGTCTCGCCA ACCTAGTGCA TTGGCACTGC GAGGCCTTCG CGCTGCATCC GGGGACGCAC
AGCTCCTTGG TAGCTGGTTT GAGCTTTGAC GCTTCGAGCT GGGAAGTGTG GCCGGCGCTG
TGTTGCGGCG GCGTGCTGGT GGTGCCGCGG CCTGAGACGG CGCGGGATCC CGAAGCGCTG
ATGGCCTGGT GGCAGACGCA GCCGCTGGAC GTGAGCTTCC TGCCGACACC GATGGCGGAG
TTTGTGCTGT CGCAGGGGCT CGTGAACCGG CATCTGCGTG TTCTGCTGAC GGGGGGCGAC
CGTTTGCGGA AGCTGCCCAA GGCGCTGCCA TTCGCGCTGG TCAACAATTA CGGTCCCACG
GAAACGACGG TGGTGGCCAC GTCCGGTTTG CTCGGGGCTG ATGAGGCGGT GCTGCATATC
GGCCGTCCGA TCTCGAACAC GCAGATCTAC ATCCTGGATG GGCATGGCGA GCCGGTACCG
ATCGGTGTTG CCGGCGAGAT CTACATCGGC GGAGCCGGCG TTGCGCGCGG CTATCTGAAC
CGTCCCGAGC TGACGGCGGA GCGGTTCGTG GAGGACCGGT TCTCGAGCGA GGCGGGCGCG
CGGCTGTACC GGACCGGGGA TCTCGGGCGC TGGCTTAGCG ACGGGACGAT CGCCTATCTC
GGGCGCAATG ACTTCCAGGT GAAGATCCGC GGCTTCCGGA TCGAGCTCGG GGAGATCGAG
GCGCGGCTGT CGGAGCATAC GGGTGTGCGC GATGTCGCGG TGATCGCGCG TGAGGATGTC
GCGGGCGACC AGCGTCTGGT GGCCTACTAT GTGAGCGACG CGGCGATCGG AGCCGAGCAG
CTGCGGGGCC ATCTTGCGGC GCGGCTGCCG GACTACATGG TGCCGTCGGC CTATGTGCAT
CTGGCGCGTC TGCCGCTGAC GCCGAACGGC AAGCTCGACC GCAAGGCGCT GCCGGCCCCG
GAGGGCGCTG CGTTCGCGGT GCAGGTCTAT GCGCCGCCGC AAGGAGAGAC CGAGGCGGCG
ATCGCCCGGA TCTGGTCGGA GCTGCTCGGT GTCGAGCGGA TCGGTCGTCA CGACAATTTC
TTCGCGCTCG GCGGGCACTC GCTGCTGGCG GTGACCTTGG TGGAGCGGAT GCGGCGGGCG
GGGATCGCGG CAGATATCCG CACCTTGTTC GCGACGCCGA GCCTTGCGGC GCTGGCGGCC
GCAGGCGGGG GTGGCGCGGT TACGGTTGCG GTGCCGGCCA ACCTGATCCC CGCGGGCTGC
GGCCAGATCA CGCCGGAGAT GCTGCCGCTG GTGACGCTGA GCCAGGCGGA GATCGATCGC
ATCGCAGCGG CTGTGCCGGG CGGCAGCGCC AACATCCAGG ACATCTATCC GCTGGCGCCG
CTGCAGGAGG GCATCCTGTT CCACCATCTG CTGGGGGGAG AGGGGGACGT CTATCTGTTG
TCCGGCTTGC TCGGCTTCGA CAGCCGGGCG CGGCTGGACG GTTTTCTGGC GGCGTTCGAT
GCGGTGATCG GCCGTCACGA CATCCTGCGG ACGGCGGTGC TGTGGGAGGA TCTGCCGGAG
CCGGTGCAGG TGGTGCTGCG GCATGCGCCG CTGCCGGTGG AAGAGGTCGA GCTCGATGTG
GGCGGCGGGG ACAGGGCGGC GCAGCTGCGC GCGCGGTTCG ATGCGCGGCG TTGCCGGCTG
GATGTGCGGC GAGCGCCGTT GCTGCGGGGC GTGATCGCGC GTGATCCGCG GAGCGAAGGC
TGGCTGCTGC TGCTGCAGTT CCATCATCTG GTGATGGACC ATACCGGACT GGAGATTTTG
TCGGAGGAGA TTGCGTCGCA TCTGCGTGGC GAGGCGGAGC GGCTTGCGGC ACCGCTGCCG
TTCCGGAGTT TTGTGGCGCA GGCACGGCTG GGGATGAGCC GTGAGCAGCA CGAGACGTTC
TTCCGCGAGA TGCTGGGCAA GGTCGAGGAG CCGACGGCCG CGTTCGGGCT GCTGAACGTT
CAGGGCGACG GCTCGTCGGT GGAAGAGGCT CGTCAGGTCT TGTCGAGCGA GCTCGCGGAG
CGGCTGCGGC GGCAGGCGCG GGTACTGGGC GTGAGCGCGG CGAGCCTGTT CCACGTGGCA
TTTGCGCAGC TGCTTGCGCG GAGCTGCGGG CGCAGCGAGG TGGTGTTCGG GACGGTGCTG
TTCGGGCGGC TGCAGGGCGG AGAGGGTGCG GAGCGTACGC CCGGAGTGTT CATCAACACG
TTGCCGGTGC GGATCTGCAT CGACGAGACA GGAGCTGCGG ATGCGACGCG AGCCGTGCAG
ATCCGGCTGG CGGAGCTGCT GCGTCACGAG CATGCGCCGC TGGCGCTGGC GCAGCGCTGC
AGCAGGGTTA CGCCGCCGGC GCCGTTGTTC TCGGCGCTGC TGAACTACCG GCACAGCGGA
TCGGGTTCGG GGACCGGATC GGGTGGTTTG GAAGGCATCA AGGGGCTCGG GGGTGAGGAG
CGGACGAGCT ACCCGCTGAC GGTGTCGGTG GACGATCTGG GCGCGGGCTT TGCGTTGACG
GCGCAGGTCG AGGCGGCGGT CGGTGCGCAG CGGGTCTGTG CGTTCATGGC GACGGCGCTG
GAGGGACTGG TCGAAGCGCT GGAGCGGACA CCGGAGCGGC CGTTGCGGCA GATCGATGTG
CTGCCCGAGG CAGAGCGTCA TCGTGTCCTG GTGGAATGGA ACGCGACGGC TGCGGCTTAT
CCGCAGGATG TGTGCGTGCA CGAGCTGTTC GAGGCGCAGG CGGAGCGGAC GCCGGATGCG
GTGGGGGTGG TGCATGAGGA GCGCCGGCTG AGCTATGCGG AGCTGAACAT CCAGGCGAAC
CGGTTGGCGC ATCATCTGCG CAAGCTCGGC GTGAAGCCGG ATGACCGTGT GGCGATCTGC
ATCGCGCGCA GCCCGGAGAT GATCGTAGGT CTGCTGGCGA TCCTGAAGGC GGGCGGGGCC
TATGTGCCGC TGGATCCGGC CTATCCGCCG GAGCGGCTGG CGTTCATGCT GCAGGACAGC
GCGCCGGTGG CGCTGCTGGT GGGCGGCAGC GCGCAGGCAG CGTTGTCGGT GGTGGAGGCC
GAGCTTGCGG CCAGCGGCGT GCCGGTGCTC GACATCGGCG CGGATGCGGC GCAGTGGGCG
GAGGCGCCCG CGCGCAATCC GGAGCGGAGC GACATCGGGC TCGCGGCGAG TCACCTCGCT
TATGTGATCT ACACCTCCGG ATCGACCGGT CAGCCCAAGG GGGTGATGGT CGAGCATCGG
AACGTTGCCC GGCTCTTCCA CGCGACGGAG CATTGGTACC AGTTCGGCCC GGCCGATGTC
TGGACCTTGT TTCATTCCTA TGCATTCGAC TTCTCGGTGT GGGAGATCTG GGGAGCGCTG
CTTTATGGCG GTCGGCTGGT GGTGGTGCCG CAGCTGACGG CGCGATCTCC GGACGATTTT
TATCATCTCC TCTGCCGCGA GCGGGTGACG GTCCTGAACC AGACGCCGAG CGCGTTCCGG
CAATTGATCG CCGCTCAAGC CGAGGCGGTG GAGACACATC ATCTGCGCAC TGTGATCTTT
GGCGGTGAGG CGCTGGAGCC TGCGGCCCTG AAGCCTTGGT ACCGCCGCGA GGCCAATCAG
GCGACGAGCC TGATCAACAT GTACGGGATC ACCGAGACGA CGGTCCATGT CACATACTAT
GCGCTCCAGG CAGCCGATGC GGACAGGTAC GGCGCGAGCC CGATCGGGCA ACCGATCCGG
GATCTCAAGG TCTACATTCT TGATGCATAT GGTCAGCCTG CGCCGATCGG TGTTGCGGGC
GAGATCTGCG TCGGTGGCGC CGGCGTTGCG CGGGGCTATC TGAACCGTCC TGAGCTGACG
GCGGAGCGGT TCGTGGAGGA CCGGTTCTCT GGCGAGGCGG GCGCGCGGCT ATATCGGACC
GGGGATCTCG GGCGGTGGCT TGAAGACGGA ACGATCGCGT ATCTCGGGCG CAACGACTTC
CAGGTGAAGA TCCGCGGCTT CCGGATCGAG CTCGGCGAGA TCGAGGCGCG GCTGTCGGAG
CATGCGGCAG TGCGCGATGC CGCGGTGATC GCGCGGGAGG ACGTCGCGGG CGACAAGCGT
CTGGTGGCCT ACTATGTGAG CGACGCGGCG ATCGGGGCGG AGGAGCTGCG GGGCCATCTT
GCGGCGCGGT TGCCGGACTA CATGGTGCCG TCGGCCTATG TGCATCTTCA GCGCCTGCCG
CTGACGCCGA ACGGCAAGCT GGACCGCAAG GCGCTGCCGG CGCCGGAGGG CGCTGCGTTT
GCGGTGCAGG CCTACGAGCC GCCGCACGGC GAGACCGAGG TGGCGATCGC CCGGATCTGG
GCGGAGCTGC TCGGTGTCGA GCGGATCGGG CGCCACGACA ATTTCTTCGC GCTCGGCGGA
CACTCGCTGC TGGCGGTGAC CTTGGTGGAG CGGATGCGGC GGGCGGGGAT CGCGGCTGAT
ATCCGCACCC TGTTCGCGAC GGCGAGCCTT GCGGCGCTGG CGGCAGCAGG CGGTGGTGGC
GCGGTTACGG TTGCGGTGCC GGCCAACCTG ATCCCGGCGG GCTGCGAGCG GATCACGCCG
GAGATGCTGC CGCTGGTGGC GCTGAGCCAG GCGGAGATCG ATCGCATCGC GGCGGCTGTG
CCGGGCGGCA GCGCCAACAT CCAGGACATC TATCCGCTGG CGCCGCTGCA GAAGGGCATC
CTGTTCCACC ATCTGCTGGG CGGGGAGGGG GACGTCTATC TGTTGTCCGG CTTACTCGGA
TTCGACAGCC GGGCGCGGCT GGACGGTTTC CTGGCGGCGT TCGATGCGGT GATCGGCCGT
CACGACATCC TGCGGACGGC GGTGCTGTGG GAGGAGCTGC CGGAGCCGGT GCAGGTGGTG
TTGCGGCATG CGCCGCTGCC GGTGGAAGAG GTCGAGCTGG ATGCGGGCGG CGGGGACGGG
GTAGCGCAAC TGCAAGCGCG GTTCGATCCC CGGCATGACC GGCTGGATGT GCGTCAGGCG
CCGCTGCTGC GGGTCGTGAT CGCGCGTGAT CCGCGGAGCG GAGGCTGGTT GCTGCTGCTG
CAGTTCCATC ATCTGGTGAT GGACCATACC ACGCTGGAGA TCGTGCTGGA GGAGATCCAG
GCGCATCTCG CGGGAGAGGA AGCGTCGCTT GCGGCGCCGC TGCCGTTCCG GAGTTTTGTG
GCGCAGGCGC GGCTGGGCGT GAGCCGTGAG CAGCACGAGA CGTTCTTCCG CGAGATGCTG
GGCAAGGTCG AGGAGCCGAC CGCGCCGTTC GGGCTGCTGG ACGTGCAGGG CGACGGGTCT
GAGATCGCGG AGGCACATCT CGAACTGGCG CCTGAGCTTG CGGAGCGGCT GCGGTCGCAG
GCGCGCGCGC TGGGTGTGAG CGCGGCGAGC CTGTTCCATG TGGCGTTTGC GCAACTGCTG
GCGCGCAGCT GTGGACGCAG CGATGTGGTG TTCGGCACCG TGCTGTTCGG GCGGCTGCAG
GGCGGCGAAG GGGTGGACCG TGCGCTCGGC CTGTTCATCA ACACGCTGCC GCTGCGGGTG
GAGGTCGGCG AGCAGGGTGT TGCAGCGAGC GTGCGCGAGG TGCAGCGGCA TCTTGCGGAA
CTGCTGCGTC ATGAGCATGC GCCGCTGGCG CTGGCACAGC GTTGCAGCGG GGTGGCGGCC
CCGGCGCCGC TGTTCTCGGC ACTGCTGAAC TACCGGCACA GCGCGGTGGA CGGATCTGCG
GATCGCAGGG CGTGGGAGGG GATCGAGGTC CTGTACGCGG AGGAGCGGAC GAACTATCCG
CTGACGCTGT CGGTGGACGA TCTGGGCGCG GGCTTTGCGC TGACGGCGCA GGTGGCGGCA
TCGGTCGCGG CGGCGCGGGT CTGTGCGTTC ATGGCGACGG CGCTGGAGGG GCTGGTCGAG
GCGCTGGAGC GGACACCGGA GCGGCCGCTG CGGCAGATCG ACGTGCTGCC CGAAGCGGAG
CGTCATCGCG TCCTGGTGGA GTGGAACGCG ACGGCGGCGG ATTATCCGCA GGATGTGTGC
GTGCACGAGT TGTTCGAGGC GCAGGCGGAG CGGACGCCGG ATGCGGTGGC GGTGGTGCAT
GAGGAGCGCC GGCTGAGCTA TGCGGAGCTG AACACCCAGG CGAACCGGCT GGCGCATCAT
CTGCGCAAGC TCGGCGTCAA GCCGGATGAC CGTGTTGCGA TCTGCATCGC GCGCAGCCCG
GAGATGATCG TAGGTCTGCT GGCGATCCTG AAGGCGGGCG GGGCCTATGT GCCGCTGGAC
CCGGCCTATC CGCCGGAGCG GCTGGCGTTC ATGCTTCAGG ACAGCGCGCC GGTGGCGCTG
CTGGTGGGCG GCGGCGCGCT GGATGTGCTG CCGGTCGTGG AGGCCGAGCT TGCGGCCAGC
GGCGTGCCGG TGCTCGACAT CGGCGCGGAT GCGGCGCAGT GGGCGGAGGC GCCCGCGCGC
AATCCGGCGC GGAGCGAGGT CGGTCTCACG CCGGACCATC TGGCTTATGT GATCTACACC
TCTGGATCCA CCGGCACGCC CAAGGGCGCG ATGAACGGTC ATCGCGCGGT GGTGAACCGC
CTGCTGTGGA TGCAGGACGC CTACGCTCTC GATGGTGGCG ATGCGGTGCT ACAGAAGACG
CCGTTCAGCT TCGATGTGTC GGTGTGGGAG TTCTTCTGGC CGTTGCTCGC GGGAGCGCGG
CTGGTGATGG CGCGGCCGGA GGGCCACAAG GATCCAGCCT ATCTGGTGGA GGTGATCCGG
CGGGAGCGGA TCACGACGCT GCACTTCGTG CCATCGATGC TGCAGATATT CGTGGAGTAT
GCGGAGGCCG GTAGCTGCAC GAGCGTGAAG CGGGTGATGT GCAGCGGCGA GGCGCTGTCC
CCGGTCTTGG CGGCGAGGTT GCTGGAGCGG CTTGAAGGAA CAGAGCTGCA TAACCTTTAC
GGTCCGACGG AAGCTGCGGT GGACGTGACG GCGTGGCGCT GTGCAAAGGA AGCGTCGGAT
GCGAGCGTTC CAATCGGCCG TCCGATCTCG AACACGCAGA TCTACATCCT GGATCAGCAT
GGCGATCCCG TGCCGATCGG TGTTGCGGGC GAGATCCATA TCGGTGGCGT CCAGGTGGGG
CGGGGCTATC TGAACCGTCC TGAGCTGACG GCGGAGCGGT TCGTGGAGGA CCGGTTTTCG
GGAGAAGCCG GAGCGCGGCT ATATCGGACC GGGGATCTCG GGCGCTGGCT TGAAGACGGA
ACGATCGCGT ATCTCGGGCG CAACGACTTC CAGGTGAAGA TCCGCGGCTT CCGGATCGAG
CTCGGAGAGA TCGAGGCGCG GCTGTCGGAG CATGCGGATG TGCGCGATGC CGCGGTGATC
GCGCGGGAGG ATGTCGCGGG CGACAAGCGC CTGGTGGCCT ACTATGTGAG CGACGCGGCG
ATCGGGGCGG AGGAGCTGCG GGGCCATCTT GCGGCGCGGC TGCCGGACTA CATGGTGCCG
TCGGCCTATG TGCATCTCGA GCGCCTGCCG CTGACGCCGA ACGGCAAGCT GGACCGCAAG
GCGCTGCCGG CCCCGGAGGG CGCTGCGTTT GCGGTGCAGG CCTACGAGCC GCCGCAGGGC
GAGACCGAGG AGGCGATCGC CCGGATCTGG TCGGAGCTGC TCGGTGTCGA GCGGATCGGG
CGCCACGACA ATTTCTTCGC GCTCGGTGGG CACTCGCTGC TGGCGGTGAC CTTGGTGGAG
CGGATGCGGC GGGCGGGGAT CGCGGCTGAT ATCCGCACCT TGTTCGCGAC GCCGAGCCTT
GCGGCGCTGG CGGCCGCAGG CGGGGGTGGC GCGGTTACGG TTGCGGTGCC GGCCAACCTG
ATCCCCGCGG GCTGCGAGCG GATCACGCCG GAGATGCTGC CGCTGGTGAC GCTGAGCCAG
GTGGAGATCG ATCGCATCGC GGCTGCGGTG CCGGGCGGCT GTGCCAACAT CCAGGACATC
TATCCGCTGG CGCCGCTGCA GGAAGGCATC CTGTTCCACC ATCTGCTGGG GGGAGAGGGC
GACGTCTATC TGTTGTCCGG CTTACTCGGA TTCGACAGCC GGGCGCGGCT GGACGGTTTT
CTGGCGGCGT TCGATGCGGT GATCGGCCGT CACGACATCC TGCGGACGGC GGTGCTGTGG
GAGGAGCTGC CGGAGCCGGT GCAGGTGGTG TTGCGGCATG CGCCGCTGCC GGTGGAAGAG
GTCGAGCTCG ATCCGGCTGG CGACGACGGG GTAGCGCAAC TGCAAGCGCG GTTCGATCCC
CGGCATGACC GGCTGGATGT GCGTCAGGCG CCGCTGCTGC GGGTCGTGAT CGCGCGTGAT
CCGCGAAGCG GAGGCTGGCT GCTGCTGCTG CAGTTCCATC ATCTGGTGAT GGACCATACC
ACGCTGGAGA TCGTGCTGGA GGAGATCGCG TCGCATCTCG CGGGAGAGGA AGCGTCGCTT
GCGGCGCCGC TGCCGTTCCG GAATTTTGTG GCGCAGGCGC GGCTGGGCGT GAGCCGTGAG
CAGCACGAGA CGTTCTTCCG CGAGATGCTG GGCAAGGTCG AGGAGCCGAC CGCGCCGTTC
GGGCTGCTGG ACGTGCAGGG CGACGGGTCT GAGATCGCGG AGGCACATCT CGAACTGGCG
CCTGAGCTTG CGGAGCGGCT GCGGTCGCAG GCGCGCGCGC TGGGTGTGAG TGCGGCGAGC
CTGTTCCATG TGGCGTTTGC GCAGCTGCTG GCACGCAGCT GTGGACGCAG CGAGGTGGTG
TTCGGCACGG TGCTGTTCGG GCGGCTGCAG GGCGGCGAAG GGGTGGACCG TGCGCTCGGC
CTGTTCATCA ACACGCTGCC GCTGCGGGTG GAGGTTGGCG AGCAGGGCGT TGCGGCGGGC
GTGCGCGAGG TGCAGCGGCA TCTTGCGGAG TTGCTGCGTC ACGAGCATGC GCCGCTGGCG
CTGGCACAGC GCTGCAGCGG GGTGGCGGCC CCGGCGCCGC TGTTCTCGGC GCTGTTGAAC
TACCGGCACA GCGCGGTGGA CGGATCTGCG GATCGCAGGG CGTGGGAGGG GATCGAGGTC
CTGCACGCGG AGGAGCGGAC GAACTATCCG CTGACGCTGT CGGTGGACGA TCTGGGCGCG
GGCTTCACGC TGACGGCGCA GGTGGCGGCA TCGGTCGCGC CGGAGCGGAT CTGTGCGTTC
ATGGCGACGG CGCTGGAGGG ACTGGTCGAG GCGCTGGAGC GGACACCGGA GCGGCCGCTG
CGGCAGATCG ACGTGCTGCC CGAAGCGGAG CGTCATCGTG TCCTGGTGGA GTGGAACGCG
ACGGCGGCGG CGTATCCGCA GGATGTGTGC GTGCACGAGC TGTTCGAGAC GCAGGCCGAG
CGGACGCCGG ATGCGGTGGC GGTGGTGCAT GAAGACGAAC GGCTGAGCTA TGCGGAGCTG
AACATCCAGG CGAACCGGTT GGCGCATCAT CTGCGCGGAC TTGGTGTCAG GCCCGATGAC
CGTGTTGCAA TCTGCATCGC GCGCAGCCCG GAGATGATCG TGGGTCTGCT GGCGATCCTG
AAGGCGGGCG GGGCCTATGT GCCGCTGGAC CCGGCCTATC CGCCGGAGCG GCTGGCGTTC
ATGCTTCAGG ACAGCGCGCC GGTGGCGCTG CTGGTGGGCG GCGGCGCGCT GGATGTGCTG
CCGGTGGTGC AGGCCGAGCT TGCGGCCAGC GGTGTGCCGG TGCTCGACAT CGGCGCGGAT
GCGGCGCAGT GGGCGGAGGC GCCGGCGCGC AATCCGGAGC GGAGCGAAGT GGGCCTTGCG
CCGGACCATC TGGCTTATGT GATCTACACC TCTGGATCCA CCGGCACGCC CAAGGGCGCG
ATGAACGGTC ATCGCGCGGT GGTGAACCGC CTGCTGTGGA TGCAGGACGC CTACGCTCTC
GATGGTGGCG ATGCGGTGCT ACAGAAGACG CCGTTCAGCT TCGACGTGTC GGTGTGGGAG
TTCTTCTGGC CGTTGCTCGC GGGAGCGCGC CTGGTGATGG CGCGGCCGGA GGGCCACAAG
GATCCAGCCT ATCTGGTGGA GGTGATCCGG CGGGAGCGGA TCACGACGCT GCACTTCGTG
CCATCGATGC TGCAGATATT CGTGAAGTAT GCGGAGGCCG GTAGCTGCAC GAGCGTGAAG
CGGGTGATGT GCAGCGGCGA GGCGCTGTCC CCGGTCTTGG CGGCGAGGTT GCTGGAGCGG
CTTGAAGGAA CAGAGCTGCA TAACCTTTAC GGTCCGACGG AAGCTGCGGT GGACGTGACG
GCGTGGCGCT GTGCAAAGGA AGCGTCGGAT GCGAGCGTTC CAATCGGCCG TCCGATCTCG
AACACGCAGA TCTACATTCT GGATGAGCAT GGCGAGCCGG CACCGATCGG TGTTGCGGGC
GAGATCCATA TCGGTGGCGT CCAGGTGGGG CGGGGCTATC TGAACCGTCC TGAGCTGACC
GCGGAGCGGT TCGTGGAGGA CCGGTTTTCG GGAGAAGCGG GAGCGCGGCT GTACCGGACC
GGGGATCTCG GGCGCTGGCT TGGCGACGGG ACGATCGCGT ATCTCGGGCG CAATGACTTC
CAGGTGAAGA TCCGCGGCTT CCGGATCGAG CTCGGAGAGA TCGAGGCGCG GCTGTCGGAG
CATGCGGGTG TGCGCGATGC CGCGGTGATC GCGCGGGAGG ATGTCGCGGG CGACAAGCGT
CTTGTGGCCT ATTATGTGAG CGACGGGGCG ATCGGGGCGG AGGAGCTGCG GGGCCATCTT
GCGGCGCGGC TGCCGGACTA CATGGTGCCG TCGGCCTATG TGCATCTTCA GCGTCTGCCG
CTGACGCCGA ACGGCAAGCT GGACCGCAAG GCGCTGCCGG CCCCGGAGGG CGCTGCGTTT
GCTGTGCAGG CCTACGAGCC GCCGCAGGGC AAGATCGAAG AAGAGTTGGC TCGTATATGG
GAGGAGCTGC TGGGCGTGGC GCGCGTCGGG CGCCACGACA ATTTCTTCGC GCTCGGCGGT
CATTCGCTGC TGGCGGTGGC CTTGGTGGAG CGGATAGATC GACAATTCGA TCTTCGCATA
AGGCTGTCTG CGGTATTTTC GAATGAGAGG CTCCATCAGT TAGCAGAGCT GATTCTAAAT
AGTCAGTTGA GTCAGTTCGA CGTTGCGGAA CTGTTGGCGC TTAAAAAATT AGGCCGTGGC
TCTTCGGAAT GA
 
Protein sequence
MSVHTTELYR SSLSQREQLV RLARSKGLKS RNALPPIIST ERIGVIPLSF AQQRLWFLSR 
FEEASTAYHI AAGLRLIGRL EREPLVRALD RIVARHEALR TVFVQGDDGV PVQQVAAAEA
GFALSEHDLR SMADAEGELQ RLVAVEATAP FDLERGPPIR GRLVRLDADE HVLLVTMHHI
VSDGWSMSVL TRELSLLYAA FARGETDPLP ALAIQYGDYA VWQRRWLSGE GLARQGAYWK
EALAGAPALL ELPWDRPRPP EQDHAGAVVP VRLDARLTSA LKALSHHHGT TLYMTLLAGW
AALLSRLSGQ EDVVIGSPVA NRGRAEIEAL IGFFVNTLAL RVDLSGSPSV SELLARVKAR
TVAAQEHQDL PFEQVVELLQ PSRSLSHAPL FQVALAWQDM LAGRLDLGEL RLEAVEVPRV
SAQFDLTLSL AEAGGTISGG LEYATALFDR ETIERWVGYL VRLLDGMVTD EHAAVDRLPL
QDDAERHRVV VEWNATAADY PQDVCVHELF EAQAERTPDA IALVHEDERL SYAELNTKAN
RLAHHLCKLG VRPDDRVASC IARSPEMIVG LLAILKAGGA YVPLDPAYPP ERLAFMVHDS
APAALLVGGG ALDVLPVVEA ELAASGVPVL DIGADAAQWV DAPARNPERS EVGLAPDHLA
YVIYTSGSTG QPKGVMVEHR GLANLVHWHC EAFALHPGTH SSLVAGLSFD ASSWEVWPAL
CCGGVLVVPR PETARDPEAL MAWWQTQPLD VSFLPTPMAE FVLSQGLVNR HLRVLLTGGD
RLRKLPKALP FALVNNYGPT ETTVVATSGL LGADEAVLHI GRPISNTQIY ILDGHGEPVP
IGVAGEIYIG GAGVARGYLN RPELTAERFV EDRFSSEAGA RLYRTGDLGR WLSDGTIAYL
GRNDFQVKIR GFRIELGEIE ARLSEHTGVR DVAVIAREDV AGDQRLVAYY VSDAAIGAEQ
LRGHLAARLP DYMVPSAYVH LARLPLTPNG KLDRKALPAP EGAAFAVQVY APPQGETEAA
IARIWSELLG VERIGRHDNF FALGGHSLLA VTLVERMRRA GIAADIRTLF ATPSLAALAA
AGGGGAVTVA VPANLIPAGC GQITPEMLPL VTLSQAEIDR IAAAVPGGSA NIQDIYPLAP
LQEGILFHHL LGGEGDVYLL SGLLGFDSRA RLDGFLAAFD AVIGRHDILR TAVLWEDLPE
PVQVVLRHAP LPVEEVELDV GGGDRAAQLR ARFDARRCRL DVRRAPLLRG VIARDPRSEG
WLLLLQFHHL VMDHTGLEIL SEEIASHLRG EAERLAAPLP FRSFVAQARL GMSREQHETF
FREMLGKVEE PTAAFGLLNV QGDGSSVEEA RQVLSSELAE RLRRQARVLG VSAASLFHVA
FAQLLARSCG RSEVVFGTVL FGRLQGGEGA ERTPGVFINT LPVRICIDET GAADATRAVQ
IRLAELLRHE HAPLALAQRC SRVTPPAPLF SALLNYRHSG SGSGTGSGGL EGIKGLGGEE
RTSYPLTVSV DDLGAGFALT AQVEAAVGAQ RVCAFMATAL EGLVEALERT PERPLRQIDV
LPEAERHRVL VEWNATAAAY PQDVCVHELF EAQAERTPDA VGVVHEERRL SYAELNIQAN
RLAHHLRKLG VKPDDRVAIC IARSPEMIVG LLAILKAGGA YVPLDPAYPP ERLAFMLQDS
APVALLVGGS AQAALSVVEA ELAASGVPVL DIGADAAQWA EAPARNPERS DIGLAASHLA
YVIYTSGSTG QPKGVMVEHR NVARLFHATE HWYQFGPADV WTLFHSYAFD FSVWEIWGAL
LYGGRLVVVP QLTARSPDDF YHLLCRERVT VLNQTPSAFR QLIAAQAEAV ETHHLRTVIF
GGEALEPAAL KPWYRREANQ ATSLINMYGI TETTVHVTYY ALQAADADRY GASPIGQPIR
DLKVYILDAY GQPAPIGVAG EICVGGAGVA RGYLNRPELT AERFVEDRFS GEAGARLYRT
GDLGRWLEDG TIAYLGRNDF QVKIRGFRIE LGEIEARLSE HAAVRDAAVI AREDVAGDKR
LVAYYVSDAA IGAEELRGHL AARLPDYMVP SAYVHLQRLP LTPNGKLDRK ALPAPEGAAF
AVQAYEPPHG ETEVAIARIW AELLGVERIG RHDNFFALGG HSLLAVTLVE RMRRAGIAAD
IRTLFATASL AALAAAGGGG AVTVAVPANL IPAGCERITP EMLPLVALSQ AEIDRIAAAV
PGGSANIQDI YPLAPLQKGI LFHHLLGGEG DVYLLSGLLG FDSRARLDGF LAAFDAVIGR
HDILRTAVLW EELPEPVQVV LRHAPLPVEE VELDAGGGDG VAQLQARFDP RHDRLDVRQA
PLLRVVIARD PRSGGWLLLL QFHHLVMDHT TLEIVLEEIQ AHLAGEEASL AAPLPFRSFV
AQARLGVSRE QHETFFREML GKVEEPTAPF GLLDVQGDGS EIAEAHLELA PELAERLRSQ
ARALGVSAAS LFHVAFAQLL ARSCGRSDVV FGTVLFGRLQ GGEGVDRALG LFINTLPLRV
EVGEQGVAAS VREVQRHLAE LLRHEHAPLA LAQRCSGVAA PAPLFSALLN YRHSAVDGSA
DRRAWEGIEV LYAEERTNYP LTLSVDDLGA GFALTAQVAA SVAAARVCAF MATALEGLVE
ALERTPERPL RQIDVLPEAE RHRVLVEWNA TAADYPQDVC VHELFEAQAE RTPDAVAVVH
EERRLSYAEL NTQANRLAHH LRKLGVKPDD RVAICIARSP EMIVGLLAIL KAGGAYVPLD
PAYPPERLAF MLQDSAPVAL LVGGGALDVL PVVEAELAAS GVPVLDIGAD AAQWAEAPAR
NPARSEVGLT PDHLAYVIYT SGSTGTPKGA MNGHRAVVNR LLWMQDAYAL DGGDAVLQKT
PFSFDVSVWE FFWPLLAGAR LVMARPEGHK DPAYLVEVIR RERITTLHFV PSMLQIFVEY
AEAGSCTSVK RVMCSGEALS PVLAARLLER LEGTELHNLY GPTEAAVDVT AWRCAKEASD
ASVPIGRPIS NTQIYILDQH GDPVPIGVAG EIHIGGVQVG RGYLNRPELT AERFVEDRFS
GEAGARLYRT GDLGRWLEDG TIAYLGRNDF QVKIRGFRIE LGEIEARLSE HADVRDAAVI
AREDVAGDKR LVAYYVSDAA IGAEELRGHL AARLPDYMVP SAYVHLERLP LTPNGKLDRK
ALPAPEGAAF AVQAYEPPQG ETEEAIARIW SELLGVERIG RHDNFFALGG HSLLAVTLVE
RMRRAGIAAD IRTLFATPSL AALAAAGGGG AVTVAVPANL IPAGCERITP EMLPLVTLSQ
VEIDRIAAAV PGGCANIQDI YPLAPLQEGI LFHHLLGGEG DVYLLSGLLG FDSRARLDGF
LAAFDAVIGR HDILRTAVLW EELPEPVQVV LRHAPLPVEE VELDPAGDDG VAQLQARFDP
RHDRLDVRQA PLLRVVIARD PRSGGWLLLL QFHHLVMDHT TLEIVLEEIA SHLAGEEASL
AAPLPFRNFV AQARLGVSRE QHETFFREML GKVEEPTAPF GLLDVQGDGS EIAEAHLELA
PELAERLRSQ ARALGVSAAS LFHVAFAQLL ARSCGRSEVV FGTVLFGRLQ GGEGVDRALG
LFINTLPLRV EVGEQGVAAG VREVQRHLAE LLRHEHAPLA LAQRCSGVAA PAPLFSALLN
YRHSAVDGSA DRRAWEGIEV LHAEERTNYP LTLSVDDLGA GFTLTAQVAA SVAPERICAF
MATALEGLVE ALERTPERPL RQIDVLPEAE RHRVLVEWNA TAAAYPQDVC VHELFETQAE
RTPDAVAVVH EDERLSYAEL NIQANRLAHH LRGLGVRPDD RVAICIARSP EMIVGLLAIL
KAGGAYVPLD PAYPPERLAF MLQDSAPVAL LVGGGALDVL PVVQAELAAS GVPVLDIGAD
AAQWAEAPAR NPERSEVGLA PDHLAYVIYT SGSTGTPKGA MNGHRAVVNR LLWMQDAYAL
DGGDAVLQKT PFSFDVSVWE FFWPLLAGAR LVMARPEGHK DPAYLVEVIR RERITTLHFV
PSMLQIFVKY AEAGSCTSVK RVMCSGEALS PVLAARLLER LEGTELHNLY GPTEAAVDVT
AWRCAKEASD ASVPIGRPIS NTQIYILDEH GEPAPIGVAG EIHIGGVQVG RGYLNRPELT
AERFVEDRFS GEAGARLYRT GDLGRWLGDG TIAYLGRNDF QVKIRGFRIE LGEIEARLSE
HAGVRDAAVI AREDVAGDKR LVAYYVSDGA IGAEELRGHL AARLPDYMVP SAYVHLQRLP
LTPNGKLDRK ALPAPEGAAF AVQAYEPPQG KIEEELARIW EELLGVARVG RHDNFFALGG
HSLLAVALVE RIDRQFDLRI RLSAVFSNER LHQLAELILN SQLSQFDVAE LLALKKLGRG
SSE