Gene PputGB1_4083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPputGB1_4083 
Symbol 
ID5871883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas putida GB-1 
KingdomBacteria 
Replicon accessionNC_010322 
Strand
Start bp4538849 
End bp4550677 
Gene Length11829 bp 
Protein Length3942 aa 
Translation table11 
GC content62% 
IMG OID641549211 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001670309 
Protein GI167035078 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.223479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAGAGC TGCTTGAGTC TGTAAAGCTA CTGTCCACCA AAGAGCGCCA GGCCCTGGCA 
GCCTTGCTCA AGCAGCAGGG GGTGAACCTT TATGGCGTCA CCCCGATCTT CAAGCGTGAG
CCTGAAGCCT CCCTACCGCT GTCGTATGCC CAGCAGCGGC AGTGGTTTCT GTGGCAACTT
GACCCTGGTG CACCGACGTA CAACGTCCCG GCTGCGCTGC GTCTGCGCGG CCGCTTGGAT
GCCGAGGCTT TGCAGCGCAG TTTCCAAGCC TTGATCGCCC GCCACGAGAC CCTGCGTACA
CTCTTCAGGG AAGAGGCGCA GGGTACCGTC CAGGTCATTC ACCCCGAAAT CACCTTGGTG
CTCGACATCC TGATGCTGCA GAGCGCCGCT GAGCGTGACG TGCTGGAGCA GGTGCAGGCC
GAGGCGCGTC GCCCATTCGA CCTGCAGCAG GGGCCGCTGC TACGGGTCAA GTTGCTGCGC
CTTGCGCAGG ACGAGCATGT GCTGGTGGTG ACAATGCATC ACATCGTCTC CGATGGCTGG
TCGATGCCAC TGATGATCGA CGAGCTGATG CAGCTGTATG CGGGATACAG CCAAGGTGAA
GACGTGCAGC TGCCAACGTT GCCAGTGCAA TATGCCGACT ATGCGATCTG GCAGCGTCAG
TGGATGGAGG CGGGTGAGCA AGAGCGCCAA CTGGCGTACT GGAAGGCGCA GCTTGGCGAT
GACCAGCCGG TGCTTGAGTT GCCGACCGAC CGCCCACGTG CTGCCGCCAA TAGCCACCGT
GGCGCCCGGC ACGAGGTTCG GCTGCCGGAC GCGTTGGTGC ATTCACTCAA GCAGCTTGCC
CAACAGCAGG GTGCGACCTT GTTCATGCTG CTGCTGGCCA GCTTCCAGAC ATTGTTGCAC
CGCTATAGCG GGCAGCCGGA CATTCGCGTC GGGGTGCCGG TTGCCAACCG TAATCGGGTA
GAAACCGAGC GGCTGATCGG CTTTTTCGTC AACACCCAAG TGTTGCGCGC TGAGTTCGAT
CTGCAGATGA CCTTCAGCGA GTTGCTGGAG CAGGTCAAGC AGCGAGCGTT GGGGGCGCAA
AGCCATCAGG ACTTGCCCTT CGAGCAACTG GTCGAGGTTT TGCAACCAGA GCGTAGCCTG
AGCCACAGCC CGCTGTTCCA GGTGATGTAC AACCACCAGA TGCAGGCCAA GGGCAAGCAA
CGCAGCGTGC AAGGGTTGCA CGTGGAGGGG CTGAGCTGGG ATCACGACAC GGCCAAGTTC
GACCTGACCC TGGATACCTT CGAATACGAA CACAGCCTTG GTGCCGCGTT GAGCTACGCG
GTCGACCTGT TCGATGCTGC CACCATCGAG CAGATGGCAC GGCACTGGGT GAACCTGCTG
GAGGCCATCG TTCGCCAGCC TGGGCAGCGG ATCGTCGAGC TGCCGTTGAT CGGCCAGCAG
GAGCAGCAGC AGATCGAGCG CGCCTGGAAC CCGGCGCTGG CCCACTACCC GGCAGAGCGG
CCACTTCACC AACTGATCGA AGCCCAGGTC GAGGCCGCCC CAGATGCCGT GGCGCTGGTG
TTCGGTGAGC GTTCCCTGAG TTACGCCGAG CTCAACCGCC GCGCCAATCA ACTGGCACAC
AAACTACTGG AACTGGGCGC CGGGCCGGAT GTGCTGGTTG GCCTGGCTGT GGAGCGCAGC
CTGGAAATGG TCATCGGCCT GCTGGGCATA CTCAAGGCTG GCGCTGCCTA TGTGCCGTTG
GACCCAGAGT ATCCACAGGA CCGTCTCAGC TACATGTTCC AGGACAGCGG CATTCATCTG
TTGCTGACCC AGCAGCATTT GCGTGATGCG CTGCCAGTAC CGGCAGGGGT GAAAACCCTT
GTGCTGGATG GCCACGCCGG CCTCGCGGGC TACAGCGACG CCAACCCCGT GTGCCGAGTG
ACCCCGGACA ACCTGGCCTA TGTGATCTAT ACCTCCGGCT CAACCGGCAA GCCCAAAGGC
ACGCTGTTGC CACACCGCAA TGTGGTACGC CTGTTTGCGG CCACCCAGCA CTGGTTCAAC
TTCGATGCCA GCGATGTGTG GACGGTGTTC CATTCCTACG CCTTCGACTT CTCGGTGTGG
GAGTTGTACG GCGCGTTGCT GTATGGCGGT AAGGCAATCA TCGTACCCAA GGATGTGGCG
CGCTCCAGTG AAGACTTCCA CGCATTGCTG GTTCGCGAGC AGGTCACCGT ACTCAACCAG
ACACCTTCTG CGTTCAAGCC GCTGATCCCG GTGGCCTGCG AGGCGATGAA AGCGGGGCAG
GGCCTGGCCC TGCGGCACGT GGTGTTCGGT GGCGAAGCCC TGGAAGTGAG CAGCCTGAAG
CCTTGGTTCG AGGTATTCGG CGACCGTCGG CCGCGCATGA TCAACATGTA TGGCATCACC
GAAACCACCG TGCACGTGAC GTATCGGCCG GTCACTTTCG AAGACTTGCA CAAAGGTGCC
AGCAGCCCGA TTGGCGAGGT GATTCCCGAC CTGTCGTGGT ACTTGCTCGA TGCAGCATTG
AACCCGATTC TGCCGGGTTG CACGGGGGAA ATGTTGATTG GCCAGGCGGG CCTGGCGCGT
GGCTATCATG GCCGCCCCGG GCTGACAGCC GAGCGCTTCG TACCGAACCC GTTCGATGGC
AATGGCGGGC GCCTGTACCG CTCGGGCGAC CTGGCGCGTT ACCGCAGCGA TGGCGTCATC
GAGTACATCG GGCGAATCGA TCATCAAGTG AAGATTCGCG GTTTCCGTAT CGAGCTGGGC
GAGATCGAAG CACGCCTGAT CGAACAACCA GCGGTACGGC AAGTGGCGGT GCTGGCGGTG
GATGGCGCCA GTGGCAAGCA ACTGGTGGGT TACGTGGTGC CAGTTGAGTC TGAGGTGCTG
CAGGACACCG AGCAGCAGGC CAGGCTACGT GACAGCCTGC GAAGCGAACT GAAGGCAAGC
CTGCCCGAGC ATATGGTGCC CGCGCATTTG CTGTTCCTGG AGCAGCTGCC GCTGACCGGC
AACGGCAAGC TCGACCGCAA GGCCTTGCCT GCGCCTGATG CGAGCTTGTT GCAGGGCGAG
TACGTGGCAC CGCAAACCGA GCTGGAACGA GAGATCGCGG CCATCTGGGC CGACGTGCTC
AAGCTGGAAA GAGTCGGCCT TGCCGACAAC TTCTTCGAGC TGGGCGGCGA TTCGATCATG
TCGCTGCAGG TGGTCAGCCG TGCTCGCCAG GTGGACATAC AGTTCACGCC CAAAGACCTG
TTCCAGCACC AGACCGTGCA AGGTTTGGCT GCGGTTGCTC GGCGTGAGGT GAGTACGTTG
ATCGACCAGG GACGGGTAGT GGGGAGCATG CCGCTGACGC CGATCCAGCA CTGGTTCTTC
GAGTCAGATA TCGTCGAGCG CCACCACTGG AACCAATCGG TGATGCTGCA AACCAGCGAG
GTGTTGGACG AACATCACCT ACAAGCGGCC GTGCAGGCGT TGGTGGCGCA CCATGATGCC
TTGCGCTTGC AGTTTACCCA GGCGCATGGC CGCTGGCAGG CCGAGTTCGG CGATGCAGAG
CACGCCCTGT TATGGCATCG TACGGTGGTA GATTCCGAGG CGTTGACGCA GCTGGCTGAC
GAGGCGCAAC GCAGCTTGAG TCTGACGCAT GGGCCGCTGC TGCGCGCCGT TCTGGCCGAT
TTGCCAGACG GCGGCCAGCG CTTGCTGCTG GTCATCCATC ATCTGGTGGT GGACGGTGTG
TCGTGGCGTG TGCTGCTGGA AGATCTGCAG CAGGCTTATG AGGCGCTGCG CGATGAGCAG
CCGTTGAAAC TGCAGCCCAA GACCACCTCG TTCAAGGCCT GGGCCGAGCA GCTGCAGGAC
TATGCAGCCA GCGCGCAGCT GCAAGCGGAA CTGGGGTATT GGCAGGCGCA ACTGCAAGGC
GCCAGTGATG CCTTGCCGTG CGACCGCCCG CAAGGCAGCA ACCTCGAACG CCATGCCGCC
TCGGTCAGCA CCCAGCTGGA CCGCGAACTG ACCCGACAAC TGCTGCAGGA AGCGCCGGCC
GCCTACCGAA CGCAGATCAA TGACTTGTTG CTGACAGCCC TGGCGCGAGT GATCAGCCGC
TGGACCGGTC GTGCCGAGGT ACTGGTACGC CTGGAAGGCC ATGGCCGTGA AGACCTGTTC
GAGGGCGGCG ACCTGAGCCG CAGTGTGGGC TGGTTCACCA GCATGTACCC GGTCAGGCTC
AGCCCACAGG CCGGGCTGGT CAACTCGCTC AAGACCATCA AGGAGCAGTT GCGCGCGGTA
CCCAACAAAG GCCTTGGCCA TGGCCTGCTG CGCTACTTGG GTAGCCCTGA AGCGCAAGCG
ACCCTGGCGG CATTGCCACG GGGTGAGGTC GTGTTCAACT ACCTCGGCCA GTTCGACGCC
AGCTTCGACC AGCAGGCAGG GTTGTTCGTG CCGGCCAAGG AGTACGGCGG TGCCACCCAA
GATGAAAGTG CCCCGTTGGG CAGCCTCTTG GCACTGAACG GGCAGGTGTA CGGTGGGGAG
CTGAAACTGG GTTGGCGCTT CAGCCGGGAC ATTTTCGATA CAGCCACCAT CCAGCACCTG
GCCGACGATT ATGCGCAAGA GCTGGCGCTA CTGATCGATC ATTGCCGCCA ACCGCAGCAC
CGTGGCGTGA CGCCTTCGGA CTTCCCGTTG GCTGACCTTA CCCAGGCGCA ACTGGACAAC
CTGCCGATAG AGGCCGAACA GATAGCCGAT GTGTACCCGC TGTCGCCGAT GCAGCAGGGC
ATGTTGTTCC ATACGCTTTA CGAACAGCAG TCCGGCGACT ACATCAACCA GTTGCGCGTG
GATGTTCAGG GGCTGGACGT GGAACGCTTC CGCCAGGCCT GGCAGGCGGC GGTGGATAAG
CACGCGATCC TGCGCAGTGG CTTTATCTGG CAGGGAGAGC TCGACCAGGC TGTGCAAGTG
GTACACAAGC AGGTCGTGCA GGCCTTGGCC GAGCACGATT GGCGTGGTCA GCCAGCGTCG
GAGCAGCTGC AACGTCTGGC CGAGAGCGAC CGACAAACCG GCTTCGCGCT GGACCAGGTT
CCGTTGCAGC GGCTGACCCT GGTGCGCACG GCAGATGACA GCCACCACCT GATCTATACC
AACCATCACA TCCTGATGGA TGGCTGGAGC ACTTCGCGCC TGCTGGGCGA GGTGTTGCAG
CATTACGCCG GGCAGCAAGC AACGGCGCCG GCCGCACCTT ATCGTGACTA TATCGCCTGG
CTCCAACGCC AGGACGCGGC AGCAGCCGAA GCGTTCTGGA AGGACCAACT GGCACCGCTG
GAGGAGTCGA CCCGGCTGGC TCAGGTAGTA CGCCGAAGCA CTGCGCCGCA GTCAGGCCAG
GGTGAGCACT ATCAGTTGTT CGACCACTCG ACGACCCAGC GCATCGAAAA TTTTGCACGT
GCCAACCGGG TCACCGTCAA TACCTTGGTG CAATCAGCCT GGTTGCTGCT GCTGCAACGC
TATACCGGTC AATCTGCCGT TTGCTTTGGC GCCACTGTGG CTGGGCGTCC GGCGGATCTG
CCAGGGGTGG AAGAACACAT CGGCCTGTTC ATCAATACCC TCCCAGTCGT CGGTGCGCCG
CGCAGCGAAC AGACCGTTGC CGAATGGGTT GCCCAGGTGC AAGCCTGCAA CCTGGCCTTG
CGTGACTTCG AGCATACCCC GTTGAATGAG GTACAACGCT GGGCCGGCTT GGGTGGCGAA
GCGCTGTTCG ACAGCTTGCT GGTGTTCGAG AACTACCCGG TTTCCCAGGC CTTGCAGGAA
GGTGCACCCG ATGGCTTGCG CTTTGGTCCG GTGGCCAACC TGGAGCAGAC CAACTACCCG
CTGACCCTGG CGGCCACCTT AAGTGACACC TTGGCCATTC AGTACAGTCA CAACCGCGGC
AGTTGGGACG ACGAGGCTAT CCAGCGTCTG GCCGAGCACT TCGGCAACCT GCTGCGGGCG
CTGGTAAATG ATGCCTCGGT GGCGATCGGC GAGCTGCCGA TGCTGGGAGA TGCAGAGCGC
CAACTTTTGC TGCATGACTG GAACCAAAGC GAGGCAGTTA GCCCGTCGGG CCTGTGCGCT
CACCAGTTGT TCCAGTTGCA GGCCCGAGAG CGCCCCGGTG CGACAGCGCT GGTGTTCGGC
GAGCAACAGC TGACTTACCG AGAGTTGGAC CTGCGCACCA ATCGTCTGGC GCATCTGCTG
ACTGCGCACG GTGTTACCGC CAACAGCCTG GTTGGGGTTG CCGCCGAGCG TGGGTTGGCG
CTGGCCGTGG CCTTGATTGC CATCCACAAA GCCGGGGCGG CATACGTCCC GCTGGATCCG
GATTACCCGC AGGACCGCCT GACCTACCTG ATCGAGGACA GTGGCATCGG CCTGCTGCTG
GGTGACGCCG AGGCGATGGC GCGCATGCCG GTCCCTGCTG AGTTGCCATG TATCGAGCTA
CAGTCAGGTG AGGACTGGCT GCAAGACTGC AGTGAGCAGC CGCTGTTATG TGAGGTCTCG
CTCGACAGCC TGGCCTATGT CATCTACACC TCCGGCTCCA CGGGCATGCC GAAAGGGGTG
GCCATTGCTC ACCAGGCACT GTCAGTCTTC TGTGAAGTTG CCAGTGGTTA TTCGCGACTG
ACGCCGGATG ACCGGGTGCT ACAGTTCGCC ACGTTCAGTT TTGACGGGTT CATCGAGCAG
TTCTTCCCGC CATTGTCACG GGGCGCCTGC GTGGTGATGC GCGATCAGCA GCTGTGGGAC
ACAGACACAT TCAGCACGCA GGTCATCCGC CACGGCATTA CGGTGGCTGA CTTGCCGGCA
GCCTACTGGC GCTTGTTGGC GCTGGACCGT CGCGCTGCAG TTGCCTATGG CCAGCTGAAG
CAGATTCATG TGGGGGGTGA GGCGGTGGCC CTGGACGGGT TGCAGGCCTG GCTGGAGGAT
GGGCCGGCGC AGGTGCGTCT GCTCAACACC TACGGCCCTA CCGAGGCCAC CGTGGTGGCG
ACCACGTACG ATTGCTCACG GCTTGCGCAG GTGCCTGCGG CACATAGCGG TGTACCGATT
GGCCGCGCGT TGGCCGGGCG TACCCTGCGT GCCTTGGACG ACGGCCTGTT GCCTACACCG
ATTGGCGTGC CGGGTGAGTT GTACATCGGT GGTGACGGCT GCCTGGCACG CTGTTATCAC
CAGCGCCCGT CGTTGACGGC GGAGCGCTTC ATCCCTGACC CGCTGGCCGA AACAGCGGGG
GCTCGCTTGT ACCGCACCGG CGATTTGGGT TACTTCGATG AGCAGGGTGA ACTTGCCTAC
CGTGGCCGGG CCGACCATCA GGTGAAAGTT CGCGGTTTCC GCATCGAGCT GGGGGAAATC
GAGCAGCACT TGCAGGCGCA CCCACAGGTT CGCCAGGCTG CAGTGATCGT TGTCGACCAC
GCCGGCGTCA AGCAGCTGTG CGGCTATGTG GTCGCGGTTG ACCAGGGCGC GGATCAGGCG
GAGCTGCGGG CAACGCTGAA GCAGTCTCTG AAGGCCGGCT TGCCGGATTA CATGGTGCCG
AGCTACCTGA TGCTGCTCGA GCACATGCCC ATGACGCCAA GTGGCAAGCT CGACCGCAAA
CGCCTCCCGG ATATCGATCA GACGCAGTCG CAGGGCGAGC ATGTGGCTCC GCGCAACGCG
CTCGAGCGGC AGTTGGCGGA TATCTGGGGT GCCGTATTGA AAGTGGCGCA AGTCGGGGTA
ACGGACAACT TCTTCGAGCT CGGTGGTGAC TCGATCATTT CGATACAAGT GGTGAGTCGC
GCGCGCCAGG CGGGTATCCG CTTCACACCC AAGCAATTGT TCCAGTATCA GACCGTTGAG
CAATTGGCCG CTGTAGCCGA GGTGGGTGAG CAGGTTGCCG ATGACCCGGT GACAAGTAGC
CAACAACCTT CGCTGGCCGG CCTGACCCAG ACGCAGTTGC AGCACTTGCC GGTGCCGGTG
GCGCAGGTTG ACGCTATCTA CCCTCTGTCG CCGATGCAGC AAGGCATGTT GTTCTACAGC
GACCAGGGAG GGGACGCTGC GCTCTATCTC AACCAGACAT CGGTTGCGGT CGACGGCCTG
GATATCGAAC GCTTCACCCA GGCATGGAAG CAGGTGGTGG CGCGTCATGA CATTCTGCGC
ACCAGCTTCT GGTCAGACGC GCAACTGGCA CAGCCACTGC AAATCGTCCA TCGCCATGTC
GAGCTGCCGA TCCGTGTGCT GGACTGGCGT GCGCGTGATG ACCAGGCCGA CGCTTTGCAG
GCACTGGTCG ATGCTGATGC CGAGCAGGCC TTCGACCTGT CGCAGGCACC GTTGATGCGG
GTAACGCTGG TGCGCCTGGA TGAACAGCGC ATGCAACTGA TCTGGACCCG CCATCACATC
CTGATGGATG GCTGGAGCAA CTCGCGGCTG TTGGGTGAAG TGTTCCAGGC TTATCACGGG
CAAGCGCTGG ATACCCAGGT ACCGCGCTTC GGAGACTATA TCCGCTGGCT CGAGGCGCAG
CCCCAGGATG AGCTGGAGGC GTTCTGGACG CGCAAGCTTG GCAGCCTTGA AAGCTCGACC
CTGCTGGAGC AGACGTTGCT GCCACGACCG GATGCCAACT TGCCTGGGCA TGCCGCGCTG
TACCTGCATT GGGATGCCAG GCGAACCGAG CGCCTGCGCG CCCAGGCTCA ACGCTTGCGG
GTAACGCCCA ATACGCTGGT ACAGGCTGCC TGGTTGCTGC TGCTGCAGCG CTATAGCGGG
CAGCAGGCTG TCTGTTTTGG CGCCACCGTG GCTGGCCGGC CGGCGAATTT GCCGGGGGTG
GACAACATGC TCGGTTTGTT CATCAACACA CTGCCTGTCG TCCAGGCGCC TGCTGCACAT
TGGCGTGTCA ACGACTGGTT GCAGCAACTG CAAGCCTACA ACCTTGAGCT GCGTGACCAT
GAACATGCCA CGCTGTCGAA TGTTCAGCGC TGGGCGGGGC GCCCGGGGCA ACCGTTGTTC
GATAGCATTA TCGTGTTCGA GAACTACCCG CTGGACGAAC GCCTGAACGA CAGTGGCGAC
AGTGCGTTGC AGTTTGGTGC AGTGAGCGAG CGCGGCGTCA CCAACTATGC GATGGACCTG
GCCGTGCACT TGAGCGAGAC ACTTTCGGTA GAGTTCATGT ACCTGCGTGG CAGCTTCAGC
GAGGCCGCAG TGGCTGTTAT CCGCGGCAGC TTCGAGCGGC TGCTCGAGGG CATGCTGGAT
AACCCGGATG CCACGCTGGG CAGCCTCGAC ATGCTGACAC TGGAGCAGAG CCAGCAGACC
CGTCAGCGCA ACACGCTGGC GGCAGCTACG ACGCATGTTG CGCATCTGGC CCGGAGCATT
GCCGAGCACG CTCGGTTGCG GCCTGACGCC CTGGCCGTCG TATGTGGTGA TCAGCAGTTG
AGCTATGCGC AACTCGACCA GCGCGCCAAC CGCCTGGCGC ACCATCTGAT CGCCTTGGGC
ACCAAGCCGG AGAGTACGGT CGGTATTGCC CTGGAGCGTT CGGTCGAGGT GATCGTTGCC
TTCCTTGCCG TGATGAAGAC CGGTGCTGCC TATGTACCGC TGGACATTGA TTATCCGCAG
GACCGTTTGC AATGGATTGT CGAAGACTCG GCCATGCACC TGCTGCTGAC CAACAGTGCC
TTGAGCCAGC GTTTCGACAC GGTTGGGCGA ATCGTAGAGC TTGACCGCCT GGCGCTGGCG
GGCCTGCCCG ACGGTGTGCC ACGCGCGCGG GTGGAAGATG ACAACCTGGC GTACCTGATC
TACACCTCGG GTTCGACTGG CAAGCCCAAG GGGGTGGCCG TCAGCCATGG GCAGATCCGC
ATGCACTGCC AAGCCATTGC CGAACTTTAT GAGATGGACG AAAGCACCCG TGAGCTGCTG
TTCATGTCAT TCGCTTTCGA CGGCGCGCAA GAGCGCTGGC TGTCGACGCT GTCGTCGGGT
GGCTGCCTGG TGATCCGTGG CAATCGCCTG TGGACGGCAG AGGAAACCTG GCAGGTGCTG
CATGAGCAAC GCATCGATAT CGCCTGCTTC CCTCCGGCGT ATCTGCAGCA GTTGGCCGAG
TTCGGCGAGA GCCAGCAACA GGTTGCACCG CCAGTGCGTA TCTACTGCTT TGGTGGCGAT
GCGGTGCCCG ATGCGTTGTT CGAGCTGGTC AAGCGCACGC TAAGGCCACA GTACCTGACC
AACGGGTACG GGCCTACCGA GACGGTGGTG ACACCCCTGC TGTGGAAGGT ATCGGCCGAT
CAGTCCTGTC AGGCGGTGTA TGCGCCGATT GGCGATCGTG TGGGCCTGCG GACCTTGCAG
GTACTCGACC AGGACCTCAA CCCGTTGCCT GATGGCGTGG CGGGTGAGCT CTATATCGGC
GGTGAAGGGC TGGCGCGGGG GTATCACCAG CGTGCAGCGC TTACCGCCGA GCGTTTTGTC
GCGGACCCGT TCGCGGAAGG CGCGCGTCTT TATCGCACTG GTGACCGCGT ACGCCGCCGT
GCCGATGGCA CCTTGGACTT CATCGGCAGG CTGGACAACC AGTTGAAAAT CCGCGGGTTC
CGTATCGAGC CGGGCGAGAT CGAGGCGCGC TTGCGCAACT TGGCCGATGT CCGTGATGCG
GTTGTGGTGG CGCGTGAAGG CGCCACCGGC AAGCAGCTGG TGGGTTATGT GGTCAGTGGC
AGCGAGAATA CGAACCCGGC ACAACTGCGT GAAGCCCTGC GTACCGAGCT GCCGGACTAC
ATGGTCCCGG CGCAACTGGT GGTGCTGGAG GCGATGCCGT TGACGCCCAA TGGCAAGGTC
GACCGCAAGG CGCTGCCGGC GCCTGATTTT GCCGCGCATC GGCAGCACCG GGCACCGCGC
AATGAAGCCG AGTTGGCCTT GGCGCACATC TGGCAAGACG TGCTGGGTGT TGAGTCGATT
GGAGTCGACG ACAACTTCTT CGAGCTGGGG GGGGATTCGC TTCGGGTGCT GAAAATGCTC
TCCAAAGTGC GTGCGCACAG CGATTTGGCG CTGGATCTGA AGTTGCGTGA CGTGATGGCC
AAACCGACCA TTGGTGAGCT GTCAGGTTAT TCGGCGGATG AAGCCTGCCT GGATCCTGTG
TTGTTGCTCA ACACCCGCGT GGCCCATGCC ACACCGTTGT TCTGCCTGCA TGCAGGTTTC
GGCACGGTAT TCGATTATGA GCCGCTGGCG CGTCGCCTGG AGGGGCATTG CAGTGTTTAT
GGCGTGCAGT GCCGAATGCT GCTCGACCGC ACGTGGCAAG ATGAATCGTT GCAGGCGATG
GCCATCGACT ATGCCCAATA CATCCGCCAG AAGCAGCCGG CAGGCCCATA CCGCTTGGCA
GGCTGGTCGT TGGGGGGAAC CCTGGCGGTG CTGGTAACCA AGGAGCTGGA GCGTCAGGGC
CAGACGGTAG CATTGCTCGG CCTGGTGGAC AGCTTCGTGC CCTGTGCATT GCATACCGAG
GTGGCCGAGG ACTGGACGGA GGATTTGCAG GGCTTCCTCA GTGTATTGCT GGGTGTGCCA
AAAGACCGCC TGGCAGTACC TGAGCTGGCC GTGGGGACGG ACGTTTCGAC GTTGCAGGGT
GCCATCGAGG CCGTTCGTGC TGCGCAGACC GAGCCATCGC TTTATGCCGA TATCGACAGT
TCGGAGCTGG CCCATACCTT TGCGGTGGCG ATGCGTCTGA AAGCTTTGTC ATTGAACCTG
ACCAGCCTGC CTCCCACCCA AGCCTTGGCG ACATGCTGGT GGGCGGGCCG CAATGGCGAC
GCGGCCTGGA CTTTCGCAAC GAAGGCAAGC CATGCAGTGG AGGCAGGGCA CTACGACATT
CTCAGGCAGC GTGAAGTTAT CGACGGTCTG GCCATGCACT GGTTGAAAGA GGGCTCCATG
GTCGAGTAA
 
Protein sequence
MQELLESVKL LSTKERQALA ALLKQQGVNL YGVTPIFKRE PEASLPLSYA QQRQWFLWQL 
DPGAPTYNVP AALRLRGRLD AEALQRSFQA LIARHETLRT LFREEAQGTV QVIHPEITLV
LDILMLQSAA ERDVLEQVQA EARRPFDLQQ GPLLRVKLLR LAQDEHVLVV TMHHIVSDGW
SMPLMIDELM QLYAGYSQGE DVQLPTLPVQ YADYAIWQRQ WMEAGEQERQ LAYWKAQLGD
DQPVLELPTD RPRAAANSHR GARHEVRLPD ALVHSLKQLA QQQGATLFML LLASFQTLLH
RYSGQPDIRV GVPVANRNRV ETERLIGFFV NTQVLRAEFD LQMTFSELLE QVKQRALGAQ
SHQDLPFEQL VEVLQPERSL SHSPLFQVMY NHQMQAKGKQ RSVQGLHVEG LSWDHDTAKF
DLTLDTFEYE HSLGAALSYA VDLFDAATIE QMARHWVNLL EAIVRQPGQR IVELPLIGQQ
EQQQIERAWN PALAHYPAER PLHQLIEAQV EAAPDAVALV FGERSLSYAE LNRRANQLAH
KLLELGAGPD VLVGLAVERS LEMVIGLLGI LKAGAAYVPL DPEYPQDRLS YMFQDSGIHL
LLTQQHLRDA LPVPAGVKTL VLDGHAGLAG YSDANPVCRV TPDNLAYVIY TSGSTGKPKG
TLLPHRNVVR LFAATQHWFN FDASDVWTVF HSYAFDFSVW ELYGALLYGG KAIIVPKDVA
RSSEDFHALL VREQVTVLNQ TPSAFKPLIP VACEAMKAGQ GLALRHVVFG GEALEVSSLK
PWFEVFGDRR PRMINMYGIT ETTVHVTYRP VTFEDLHKGA SSPIGEVIPD LSWYLLDAAL
NPILPGCTGE MLIGQAGLAR GYHGRPGLTA ERFVPNPFDG NGGRLYRSGD LARYRSDGVI
EYIGRIDHQV KIRGFRIELG EIEARLIEQP AVRQVAVLAV DGASGKQLVG YVVPVESEVL
QDTEQQARLR DSLRSELKAS LPEHMVPAHL LFLEQLPLTG NGKLDRKALP APDASLLQGE
YVAPQTELER EIAAIWADVL KLERVGLADN FFELGGDSIM SLQVVSRARQ VDIQFTPKDL
FQHQTVQGLA AVARREVSTL IDQGRVVGSM PLTPIQHWFF ESDIVERHHW NQSVMLQTSE
VLDEHHLQAA VQALVAHHDA LRLQFTQAHG RWQAEFGDAE HALLWHRTVV DSEALTQLAD
EAQRSLSLTH GPLLRAVLAD LPDGGQRLLL VIHHLVVDGV SWRVLLEDLQ QAYEALRDEQ
PLKLQPKTTS FKAWAEQLQD YAASAQLQAE LGYWQAQLQG ASDALPCDRP QGSNLERHAA
SVSTQLDREL TRQLLQEAPA AYRTQINDLL LTALARVISR WTGRAEVLVR LEGHGREDLF
EGGDLSRSVG WFTSMYPVRL SPQAGLVNSL KTIKEQLRAV PNKGLGHGLL RYLGSPEAQA
TLAALPRGEV VFNYLGQFDA SFDQQAGLFV PAKEYGGATQ DESAPLGSLL ALNGQVYGGE
LKLGWRFSRD IFDTATIQHL ADDYAQELAL LIDHCRQPQH RGVTPSDFPL ADLTQAQLDN
LPIEAEQIAD VYPLSPMQQG MLFHTLYEQQ SGDYINQLRV DVQGLDVERF RQAWQAAVDK
HAILRSGFIW QGELDQAVQV VHKQVVQALA EHDWRGQPAS EQLQRLAESD RQTGFALDQV
PLQRLTLVRT ADDSHHLIYT NHHILMDGWS TSRLLGEVLQ HYAGQQATAP AAPYRDYIAW
LQRQDAAAAE AFWKDQLAPL EESTRLAQVV RRSTAPQSGQ GEHYQLFDHS TTQRIENFAR
ANRVTVNTLV QSAWLLLLQR YTGQSAVCFG ATVAGRPADL PGVEEHIGLF INTLPVVGAP
RSEQTVAEWV AQVQACNLAL RDFEHTPLNE VQRWAGLGGE ALFDSLLVFE NYPVSQALQE
GAPDGLRFGP VANLEQTNYP LTLAATLSDT LAIQYSHNRG SWDDEAIQRL AEHFGNLLRA
LVNDASVAIG ELPMLGDAER QLLLHDWNQS EAVSPSGLCA HQLFQLQARE RPGATALVFG
EQQLTYRELD LRTNRLAHLL TAHGVTANSL VGVAAERGLA LAVALIAIHK AGAAYVPLDP
DYPQDRLTYL IEDSGIGLLL GDAEAMARMP VPAELPCIEL QSGEDWLQDC SEQPLLCEVS
LDSLAYVIYT SGSTGMPKGV AIAHQALSVF CEVASGYSRL TPDDRVLQFA TFSFDGFIEQ
FFPPLSRGAC VVMRDQQLWD TDTFSTQVIR HGITVADLPA AYWRLLALDR RAAVAYGQLK
QIHVGGEAVA LDGLQAWLED GPAQVRLLNT YGPTEATVVA TTYDCSRLAQ VPAAHSGVPI
GRALAGRTLR ALDDGLLPTP IGVPGELYIG GDGCLARCYH QRPSLTAERF IPDPLAETAG
ARLYRTGDLG YFDEQGELAY RGRADHQVKV RGFRIELGEI EQHLQAHPQV RQAAVIVVDH
AGVKQLCGYV VAVDQGADQA ELRATLKQSL KAGLPDYMVP SYLMLLEHMP MTPSGKLDRK
RLPDIDQTQS QGEHVAPRNA LERQLADIWG AVLKVAQVGV TDNFFELGGD SIISIQVVSR
ARQAGIRFTP KQLFQYQTVE QLAAVAEVGE QVADDPVTSS QQPSLAGLTQ TQLQHLPVPV
AQVDAIYPLS PMQQGMLFYS DQGGDAALYL NQTSVAVDGL DIERFTQAWK QVVARHDILR
TSFWSDAQLA QPLQIVHRHV ELPIRVLDWR ARDDQADALQ ALVDADAEQA FDLSQAPLMR
VTLVRLDEQR MQLIWTRHHI LMDGWSNSRL LGEVFQAYHG QALDTQVPRF GDYIRWLEAQ
PQDELEAFWT RKLGSLESST LLEQTLLPRP DANLPGHAAL YLHWDARRTE RLRAQAQRLR
VTPNTLVQAA WLLLLQRYSG QQAVCFGATV AGRPANLPGV DNMLGLFINT LPVVQAPAAH
WRVNDWLQQL QAYNLELRDH EHATLSNVQR WAGRPGQPLF DSIIVFENYP LDERLNDSGD
SALQFGAVSE RGVTNYAMDL AVHLSETLSV EFMYLRGSFS EAAVAVIRGS FERLLEGMLD
NPDATLGSLD MLTLEQSQQT RQRNTLAAAT THVAHLARSI AEHARLRPDA LAVVCGDQQL
SYAQLDQRAN RLAHHLIALG TKPESTVGIA LERSVEVIVA FLAVMKTGAA YVPLDIDYPQ
DRLQWIVEDS AMHLLLTNSA LSQRFDTVGR IVELDRLALA GLPDGVPRAR VEDDNLAYLI
YTSGSTGKPK GVAVSHGQIR MHCQAIAELY EMDESTRELL FMSFAFDGAQ ERWLSTLSSG
GCLVIRGNRL WTAEETWQVL HEQRIDIACF PPAYLQQLAE FGESQQQVAP PVRIYCFGGD
AVPDALFELV KRTLRPQYLT NGYGPTETVV TPLLWKVSAD QSCQAVYAPI GDRVGLRTLQ
VLDQDLNPLP DGVAGELYIG GEGLARGYHQ RAALTAERFV ADPFAEGARL YRTGDRVRRR
ADGTLDFIGR LDNQLKIRGF RIEPGEIEAR LRNLADVRDA VVVAREGATG KQLVGYVVSG
SENTNPAQLR EALRTELPDY MVPAQLVVLE AMPLTPNGKV DRKALPAPDF AAHRQHRAPR
NEAELALAHI WQDVLGVESI GVDDNFFELG GDSLRVLKML SKVRAHSDLA LDLKLRDVMA
KPTIGELSGY SADEACLDPV LLLNTRVAHA TPLFCLHAGF GTVFDYEPLA RRLEGHCSVY
GVQCRMLLDR TWQDESLQAM AIDYAQYIRQ KQPAGPYRLA GWSLGGTLAV LVTKELERQG
QTVALLGLVD SFVPCALHTE VAEDWTEDLQ GFLSVLLGVP KDRLAVPELA VGTDVSTLQG
AIEAVRAAQT EPSLYADIDS SELAHTFAVA MRLKALSLNL TSLPPTQALA TCWWAGRNGD
AAWTFATKAS HAVEAGHYDI LRQREVIDGL AMHWLKEGSM VE