Gene RPC_3036 details


Gene Information

Locus tag: RPC_3036
Symbol:
ID: 3973489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3347539
End bp: 3360036
Gene Length: 12498 bp
Protein Length: 4165 aa
Translation table: 11
GC content: 64%
IMG OID: 637926147
Product: amino acid adenylation
Protein accession: YP_532900
Protein GI: 90424530
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
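
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3347539, 3360036)
print(length_bp, "bp")                  # gene length from the coordinates
print(protein_length(length_bp), "aa")  # protein length from the gene length
```

With the coordinates of this record, the sketch reproduces the listed 12498 bp and 4165 aa.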


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.585184
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTACCG ACGCGGATCC GGGTCGTTCC AGCCGAGATA CTCTGGTTTC CCGTCTGGAG 
AAGTATGCGC AGCTCGAGGG TGGCAGGCCG GCGTTGGATT TCGTCGACCG GTCCACGGGT
ACACGAACGC AATTGTCTTA TGGTGAGCTC TCGCATCGGG TAAAGGCCGT TGCCGTAAAC
GTGCAGGATG CGCTAGTGCC TGGAGGCCGC GCGTTGCTGC TGTTGCCGTC GGGGCACGAC
TACGTCGTCG CGCTTCTTGC GTGCCTGTAC GCCGGCGCTG TCGCGGTGCC CGTCAATTTG
CCGGGGGCAT CGCGGGTCGC CCGCGTGCTG GGGCGAGTCG AGCACATCTC ACGCGATTGC
GGTGCGACAG CAATTTTGAC CACTCGCGCC ATCGCCGACC AGTCGCGCGA AGCGTTGACG
TCGTTCGTGG CGGCTCATCG GCTGCGCCTT ATCCTCATCG ACGATGCTAA ATCCGGTCGC
GCATGGTCCG GATATAGTCC GTCAGAAACC GACATTGCCT TCATTCAGTA CACGTCGGGC
TCGACCGCGG AGCCGAAGGG AGTGATCAAC CGCCACGACA CGCTGATCAG CAACGTGTCG
TTTCTTCGTT GTTTGCTATG GCCGAAGGAT GCGCCGGTCG TGGCGAGCTG GCTGCCGCTG
TTCCACGACA TGGGCCTCAT CATGGGGGTT CTTGCGCCGC TCGCGCTTGG CGGCCGGGTC
GTCTACATGG CGCCGGGAGC GTTCGTCAGC GATCCGTTGA TGTGGCTGGA GTTGGCCGCC
CGAGAGCGTG CAGCGGTGCT GCCATGTCCC GCGTTTGCGC TCGACGCTTG TGTCGAGCAC
TACGATGCCG ATCGGCTCCG CGATCTGGAT CTGAGCTGCG TCGAAAGCCT CGTTCCGGCT
GCGGAACCGG TTCATCTGCG GCAGGTACGT GCGTTCTTCG ATCTGTACAG CCGTCACGGC
CTGCATTGGG GCGCGATACG TCCTTCTTAC GGCCTCGCCG AGGCAACGTT GATCGCGTCC
GGATCGAGTC ACGACGGCGG GCCGGTCGCG GTCAGCGTCG ATGCGGCATC GATCGCCCGC
GGCACGGCGC TCGTCGTCAC CGACGGTGCG CCGGACAGCC GCGTGTACCT TTCGAACGGT
GCTGATTTCG GCGGGCAGGA CCTGCGTATC GTCGATCCGG AAACGCGGCG GACAAAGCCG
GCCGGGGACG TCGGTGAGAT CTGGATTTCG GGGGCGGCGA TAGCGGCGGG CTATTGGGGG
CGTTCCGACG CTACAGAAGA GACATTCGCT GCGCATCTGT CGGATGACGG TGCGAGCGAC
GCCACGAATT ATCTGCGCAC AGGCGATCTC GGATTCCTGC ACGGCGGTCA TCTCTACATC
ACCGGACGTT CGAAGGATGT GATGATCTTC CGGGGACAAT GCCACTATCC GAACGACATC
GAAGCGAGCC TGGCAAATCT CCACGACGAT ATTATTGCGG GAGGGGCTGC TGCTTTCGCG
ATTCCGGGGG ATCAAGGCGT CGAGCGGTTG GTGGTCGTCC AGGAGGTGCG CCGCCATAGC
GATCTCGATG CGGCGGCACT CGAAGCGTCA ATCCGCGAAA CCATTGCGCG TGAACACGGA
CTTGCGGCGC ACGACGTCGT CTTGATTCGC CGTGGTACGC TCAAACGGAC GACGAGCGGT
AAAGTTCGCC GGGCCGAAAT GCGTCGCCTT TACATTTCCG GCGGCCTCAC GATCGTTGGC
AGCGATCGGC AGCCCGCTGG TCAGGACACG CGTTCTCCTT CGAGGGAGCA ACGGCGTGAC
GCCGTTCGCG GCAAGGTTCT GGATTGTGTC AGGCGAGCAC TCGGTCCGTC GAGCCCTCGT
GTGATCGATC CCGCCAGAAG CCTGTTCGCA CTCGGTCTCG ACTCGCTCGC TGCGACACAG
GCGGTCGCGG CTCTCGAAAG GGATCTGGGG CGTTCGTTGC CCGAAGGGGT GTTGTTTGAT
TACCCGACGG TGGATGCGCT CAGCGATTGG CTGGTATCCC GCGCCGAGGA GGCGCCTTGT
TTGCCGCCCG ACCAGCCGCT GTCACGTCGA GAGGCAGGCG AGCCGCTCGC GATCGTCGGT
CTTTCTTGTT TGTTCCCCGC CGGGGCCAAC GATATCGAGG ATCCCGCCGA ATTCTGGCGC
TGGCTGATGA CCGGCGGCGA TGCCGTGCGG GGGCTCGCGG CCGACCGCTT CCGTCAGGAT
CTCGACATTC CGGGATACGG AGCGTGCCTT CGTCGCGTCG ACGGCTTCGA TGCCGCGTTC
TTCGGTGTCG GGCCGCGCGA GGCGATGAAC ATGGACCCGC AGCAGCGCCT TTTGCTTGAA
GCGACCTGGC ATGCGCTAGA GGACGCCGGC TTCGTGCCGC AATCGCTGCG CGGCAGCGAC
ACCGGCGTCT TCGTCGGCGT TGGCACTGGT GACTATGGGC ATTTGCCGTT CGTCACGCGC
GATCCCGCCC ACCTCGATCC GTACTACGGC ACAGGAAACG CATTCGCCGC CGTTGCGGGG
CGGATTTCCT ACGTGTTCGA CTGGTCGGGG CCGAGCATCG CCGTCGATAC CGCATGTTCC
GCGTCGCATG CCGCGGTTCA TCTCGCCTGC CAGTCGCTTC GTACCGGCGA GAGCTCGCTT
GCCGTCGCCG CTGGCGTCAA GCTTCAAATC CTTCCGGAGA TCGATCTGGT GCTGGCGCGC
GCTGGCATGC TGGCCGCTGA CGGACGGTGC AAGACTTTCG ACGCAGCCGC CGACGGTTAT
GTGCGGGGTG AGGGTGCCGG CGTCGTCGTG ATCAAACGTC TGGCGGACGC GCTCCGCGAC
GGCGATCCAA TTCGCGCGGT CATTCGCGAA AGCGTTCTGA GCCAGGACGG CGCGAGCGCC
AGCCTCTCCG CTCCCAATTC CGAGGCGCAG CGGCGAATGC TCTCCAAGGC GCTGGCCCGA
GCGGAATGGA CGCCAGCCGA TGTCGACTAT GTCGAACTCC ACGGCACTGG GACGCGACTT
GGTGACCCCA TTGAGTTCGA GGCTCTGGCA TCGGTGTTCT TCGGTCGGGA CGCGACCGAC
CCACTTTATC TCGGCTCGGT GAAGACCAAT ATCGGGCATC TTGAGGCGGC CGCAGGAATC
GCCGGGCTCA TCAAGGTGGT ACTCGCTTTG CAGGAGGGGC GAATTCCTCC TAACCTGCAT
TACAATCGAC CGAATCCGGC AATTGACCTT GCGCGCATAC CCGCCGTGAT ACCGACGGCG
CCGATCGATT GGCCGCTGCG GGGAGATCGG CGCCGCGCGG GCGTCACGTC GTTTGGGTTC
GCCGGCACGA TCGGCCATAT TCTGCTCGAG CAAGCTCCGT TGCGGGAGGC TCCCGCATGC
TGTCGTTCCG ATCGGGTGTC GCTGTTACTA CTCTCAGCGA GAAGCACAGG TTCGTTGGAG
GAACTACGTC GACGATACCT GGAATGTCTT CGCGATTTGG ACGGTCCCTT GTCGGCGTTC
GTCAATGCTG CAGCTCGCCA GCGCCAGCAT TTTCTGGACC ATCGGTTGAT TGCGATCGGT
GCTGATGCAA AAGGCCTGGC GGGTGCCTTG GTCGCCGCAA ATCCCGTGCA ACAACAGCGC
CCGGCACGTA TTTGCTTTCT GTTCACAGGG CAAGGTGCGC AGCATGCCGG GATGGGCCGG
GCGCTCTACG ACGCCGAGCC GGCCTTCCGT CGCGCAATCG ATCGTGTCGA TGCGGCGATG
TCTCCGTTCC TCGGCGGATC GATCCGCGAT CTGATGTTCG CCGCGGATGC TCTGGAACTC
CACGAGACAC GTTATACCCA GCCGGCGATG TTTGCTTTCG GCTACGCATT GGCGCAGCTC
TGGCGCGGCT GGGGCGTCGA GCCGGACACG ATCATCGGGC ACTCCATCGG CGAAGTTGCG
GCCCTCGTGC ATGCCGGTTC GCTGACTCTC GACGCCGCCG CATCCTTCAT CGTGCGGCGT
GCCGGGTTGA TGCAGTCCTT AAAGCAGCGT GGCGGCATGA TGGCTGTTAG GCTAAGTGCC
GCAAACGTAT CGGAGCGGAT TGCCGACACA TCGATCTCGA TCGCAGCGAT CAATGGTCGA
GAAGACGTCG TCGTTGCAGG CCCGGACGCG GACATTGATC GACTATCGGC GGAATTCTCC
GCTCAGGGCA TCAGTGCGCG GCGGCTGAAG GTCTCGCACG CTTTTCATTC GGCACTGATG
GATCCGCTTC TCGAGGAACT GGAGGCTGTG GCCGAGAATT GTGAGGCTCG CGCGCCCCGC
GTGCGCTTCA TTTCGACCCT GTCGGGGGAT TTGCTGTCGG GCGCGCCCGA TCCTGCCTAC
TGGCGCCGTC ACGCGAGAGA GCCGGTCCGT TTCGCATCGG CGCTGGGTGT CGCGATCGCC
AATGGCTGCG ACTGCTTCAT CGAGATCGGC CCGCGCCCCC TTCTGGTCTC GCTTGCTGGA
CGCGAGGCTC ATGAGTCCGG TCTTGCGGAC GGGCTGTTCC TCGCCAGCGC GAGGGAGGGA
GAATCCCATC CGAACAGTCT GATGGAGTGC CTCGGAGCTT TGTATCTGCG CGGCGTAGAC
TTCAATCTGG AGGCGGCGTT TCGGGGACCT GCAGCATTGC CAGCGGCGCT CCCGTCGTAT
CCGTTCGATC GTCGCTCATA CTGGCTCGAA TATCGTGATC AGCACGAGGG ACCGTCCCCT
CAGTTGCCTC AGATGCGCAA GCAGGACGAG TCGGCCGAAC TTTCATTGAG CCGCCTAACA
TGGACCGAAG CCGTCCTCCC TCAGAACGAT CCGGCGCCCG CGCCGCGACT GTATCTCATC
GAGGGGGGGG GTGCCGCAGG GCGAGCAGTC GCCGAGAGCT GCGCTGAAGT GGGGGTGATC
GCATCCGATG CCGAAGCGCT GCCCGCTGCA GCCTCGGGCG ATTCGATCCT GGTTTGGCTC
GGTGCGCTCG CCGCGCACGA GGCCCCGCAG CCGGACGCGC TATGGCGGTA CATTGCATAT
TGCCAGGCGC TGTACCGAGC CCGCCTGACG GCACGTATCG TGGTGGTTAC CAGAGGAGGC
CAGCGAGACG GTACTGCCGC GCCCGCACAG GCAGCTTTCT GGGGAGCGAC CCGGGCGCTT
GCGATCGAGT GCCCCGATCT GCAGTTTCTC CTCGTCGACG TCAACGCGGC AGACGATCCG
GCCCGCGTTC TCGCGACGAT TGCGCCTCGT CTGAGCGACA TCATTCCGCA GGAGGACATG
CTGGCGTGGG ATGGGGCACA ATGGCTCTCC CCGAGGCTGG AAGCGGTGCC GGCGTTCGAA
GCCCAACCGG GCCCGGTCGG TACGGACGGT ACCTACCTGA TCTGCGGCGG CCTTGGGGCG
CTGGGTGGAC ATGTGCTGGA TTGGCTGGTC GCCTGCGGCG TACGTGACAT CATCGTTACA
GGTCGCGGCG AACCGGGATC CCGGGCGCGG TCGACATTCC AGCGCTACGG TCAGGCGGGA
GTTGATATCC GATACGTCCG CGCCGACGTC GCGAACGAGA CGGACATGCG CAATCTGTTT
CGAGAGATCG ACGAGTCGGC GCGCGCGCTA CGTGGTGTCT TCCACTGTGC GGGTATCGGC
CGCTTCGATA CCATCGATGC GATCGATGAG GCCGCGTTCC GTGAGGCCAC GCAAGCGAAG
GCGGATGGCA GTTGGATCCT GCACTGTCTC ACATGCGATC GGAGCGACGT CGAGCACTTC
GTCGTCTTCA CGTCGATCGC GGGGATCTGG GGGTCGCGCT TCCAAATCCA CTATGGCGCC
GCCAACGCCT ACCAGGACGC GCTGGCCCGG TTGCGGCGAG CGCGCGGATT GCCGGCCCTT
GCGATCGCCT GGGGAGCATG GGGCGGCGGT GCCGGGCTGT CGGAGGTGGA CGACAGCCTT
CTACAATATC TGCATCGTGC GGGGATATCG CGCTTCGAGC CGCAACGCGC CATCACCACA
TTGGCCGGCC TGATAGCGAC CCCCGGCAAT TGGATCGCTG CGGAGGTCGA TTGGCGCAGA
TTCGCTCCGC TCTACCGGAC GTTCGGCCGC AGTGATCTTC TCGCACGGCT TGCGCCGGCG
ACGGAGGCGA CGGCGAGCCG TAGCGAGGAT TGCCCGGATT GGACGCGCTT GTCCGCCGCA
GACCGGCGTT GCGTGTTGGA GAGCTTCGTC CGCACGACCA TCGCCACGGT GCTGCGGATC
GACGCCGGGG AGATGCACGA CGACGTCGAA CTCATCAAGC ATGGACTCGA TTCGATTCTG
GTGATGGATT TCGCCCGCGC GTGTCGGCAG AAACTCGGTG TCGACTGTGC GCTTCGCGCG
ATCTTCGAAT CCGCGACGCC GGGCGGCCTA GTCGATTATC TCGAAGGGCT CGCCGCCTCG
TGTGAGCGAA CGCACGAGGA GGCGACCGTC GATCAGATCG TTCCCGATCT CGCGAGTCGG
CACGAACCGT TTCCGTTGAC GGATTTGCAG TACGCCTATT GGGCAGGACG TGATCCGCAG
TTCACGCTCG GCAACGTCTC TTGTCATGCC TATCTGGAAT CCGAAATCGT CGGCGCGTTC
GATCTCGCGC GCCTGGAGGC CGCGTGGAAT CTCCTGATCG CCCGCCACGA CGCGTTGCGG
CTCGTCATCG ACGATCATGG CATGCAGCGG ATCCTCGCCG AAGTACCGCG GTATCAATTC
AGGTTGGTCG ACCTCACCAC GGCTGACGCG GCCGTCGTCG ACGACCATCT TTCGGCCTGG
CGGGAGGAGA TGTCGCACCA GGTCCTCGAT TCCACCTCGT GGCCGATGTT CGATCTCCGG
GCGTCGAAAC TGCTGGACGG ACGAACCCGT CTCCACTTCA GCATCGACAT GCTGATCAAC
GACGTCGCCA GCAGCCAGAC CCTATGGAGC GAGCTCGGGC GCGTGTACCG CGCCGGATCG
ATCGAAGCGG CAGGTCTCGA GCCGTTCACC ATCTCGTTCC GCGACTACGT CGTCGCCAAG
GCGAATCCAT CGCCGTCGCG GCAGGCGATC CGTCAGCGGG ATTGGGCCTA CTGGATGGAA
CGGCTGCCAT TGCTGCCGCC GCCGCCGCAA CTGCCTCTCG CGGTGAATCC GGAACAGGTC
GCCCGTCCGC GATTCGTCCG TCACGCGGCG CGTTGCGATC AGGCCCAATG GTCGGAATTG
CGTCGGCGGG CACGGTCGTT CGGGGTCACG CCCGCGACCC TGCTGATCGG CGTGTTCGGG
GAAGTACTGG CCGCTTGGAG CGATCAGCTT GATTTCACCC TCAACCTCAC GATCTTCGAT
CGTCTGCAGA ACCATCCGGA TGTTCCGCGT CTCGTCGGAG ACTTCACGTG CGTCACGCTG
CTTGCGGTAG ATTGCCGAGA ACCGATGCCG CTCGCCGCGC GTCTGCAGAC GATCCAGAAG
CGGATGCTCG AGGATCTCGA ACATCGAAGC GTCAGTGCCG TCGAGGTTCT GCGCGAGAAG
AATCGCGGGA ACGATCGCTT GGTCGGAGCG CCGGTGGTAT TCACCAGCCA ACTCGGGATG
CACGATCCCA CGAAGGGAAC CTCCGACGGT GATCCGCTCG GACAGGTCGT CTACGGGATA
ACCCAAACTC CGCAGGTATG GCTCGACTAC CAGGCGGCCG AGCTCGACGA CGGACTGCTG
TTGAATTGGG ACGTGGTCGA AGGGCTGTTC CCCGAGGGCG TGATCGAAGC GATGTTCAAG
GCCAATGCGG ACCTGCTCGC CGCGCTGGCT ACTGCCGACG ACGCCTGGAC CCGCGCGGCG
GGGGCCCTGC TGCCGCAGTC GCAGCGCGAG GTCCGCCGAC ACGTCAATGC GACGAAGAGC
GAGCTGCCTC TCGATACGCT CGACAATCTG TTCTTCGAGA CCGCGGCACG CGAGCCCGAC
CGCATCGCGG TGATTGCCGG TGATACGATG GTCAGCTATG GCGAGCTTGC GAGCTGGAGC
CGACGGCTGG CAACACGACT GCGAGCCGAA GGCATCAGGC CGGGCGACCG TGTGGCGGTC
GTGATAAGCA AGGGGCCGGA ACAGGCCGCT GCATGCCTCG CGATCCTCTC ACAAGGCGGC
GTCTACGTCC CGCTCGATCC GGCTATGCCG ACCGCCCGGA TGGCCAAGGT CGTCGCGGGC
AGCGGCATCG GCATCGTACT CGTTCAACAG TATCGGGATG ATTGCGTCGC TGAACTTGGC
GTTCGTGTTC TCGTCGCCGA CCTCGTCGAA TGTCGAGGAT GCGAAGAAAC CGAAGCGGCG
CCGGGCCGAT CGCTGAATGA CGAGGCCTAT GTGATCTACA CCTCGGGCTC CACGGGAACC
CCGAAAGGCG TGGTCATCGA TCACCGTGGC GCGGCCAATA CTGTTCTCGA CGTCAACCGT
CGCTTTGGTG TCGGCCCCGA CGATCGGGTG TTCGGGTTCT CAGCGCTCGG GTTCGATCTG
TCGGTGTACG ATCTGTTCGG AACGTTCGCC GCCGGAGCGA CCCTCGTGCT TCCGGAGGCC
GACGGCACTC ACGATCCGCG CCATTGGTCC GATCTGGTGC AACGTTACGG CGTCAGCGTA
TGGAATTCCG TTCCGGCGGT CTTCGACCTG TTGCTCGACG AAACGAACGC CGATCTCGCG
AGCTTGCGAC TGGTGCTGCT GTCGGGCGAT TGGATTCCGC TGAAATTGCC GACCAGGCTC
CGCGACCGTG TCCAAACCGC AAGGTTGATC GCGCTCGGTG GCGCGACGGA AGCATCGATC
TGGTCGAACT GGTTCGAGGT TCGCCGCGTC GAGCCGCACT GGCGTTCGAT ACCATACGGA
TTCCCGCTCT CCAACCAGTC GTATCGCGTG CTCGATCCTG CACTGCGCGA TCGTCCTGAC
TGGGTGGTAG GAGACCTTTA CATCGGTGGT GTCGGCGTCG CGCTGGGCTA TGACGGCGAT
GCCGAGCGTA CCGCTGACGC CTTCATTGTT CATCCCGATG GCGAACGCCT CTATCGCACC
GGAGATCTGG CCCGCTACTG GCCCGACGGC ACGATCGAAT TTCTCGGTCG CAGAGACGGC
CAGGTGAAGA TTGCCGGGCA CCGCATCGAA CTCGGCGAGA TCGAGAGCGC ACTCACTTCG
CACCACGAAG TGTTGGACGC GGTCGTCGAT GTGGTGGGAG CCGCTGAAGG ATCTCGACGG
CTCGTGGCGT GGGTCTCGCT GGGGGACGAC GGCGACGATC TGCAGACCAC TGTGGTCGCG
GAAGCGGCCA CCGTCGAGCG GAACGGCAAG GATCTCGGCC GCGCATTCGC GACGTCGTGG
GAAGCCTCGC CTTGGGGTAA CGACGAACTC GCCGACTTCT GGGGATGGCA GGAGCTCATT
GCGCGGCAAT GCGTGCGCGA TCTCCTCGCG TCCCAAAGCG CGTTCGGGAA CAGCGATCGG
CCGTATACGC TTCCAGGCCT GGAGCGAAGC TTGGGGTTGA CCGAGAAATA CCGGGCCCTG
CTGCCACGCT GGCTTTCGCT GCTCGAGACG TCTGGTGAGT TGCGCCGGAA CGGGGGCGAC
TGGTTCGGGC AGCTCGCCGG GTCGGACTGG GACCTGATCG CTCTGCAGGC CGGCCGGTTC
GGCGTGCCAC CGGCGGTCAT CGCGCGTCTG CGCGACAGCG CAGACCGGCG CCATGCGGTG
TTGCGCGGGG AGGAGAGCGC GCTCGCGGTG TTCTATGATG AAGGTTCGGG ACTCAGCCCC
GAACAGCTTG CCAAGCTCCA TCCCTGTGCA ACTCGGATCT ACGCGGATGT CGGCTGCCAG
CTCGCGCGTA TCGCCGCCGA ATCCGATCGA CCGCTGCGGA TTCTGGAGCT CGGGGCGCGG
GCAGGGGAAG CGACGCGGGA ATGGCTCGCC GCCGCCGCCG CTGCGCCGAT CGAGGTGACG
ATCACCGACC CGTCCTCGCT GCTGCTCGAC GACGCTCGCG CACGCCAACC GGATGCCGCC
GCTTCGATCT GGCGCGTGTT CGATCCCGAT CGAAGTCCGA CGGCCCAGGG ATTCGTCGAA
CACGAGTTCG ACGTCATCGT CGCATTCAAT GCACTCCATC GCAGCGACGA CGTCAATCGC
GTACTGAGCA ATTGCCGGCG GCTGTTGCGG CCGGCCGGCT TGCTGATCGG TGTCGAACTG
ACCATCAACA GCCCACTCCT CGATGTGACC GTGGCTCTCA TTGAGAGTGG ATTCGATCAA
TTGCAGGATC TGCGCCGTGG CCGCGGAGCT CCTCTTCTGA GTGGCGAGGA GTGGCGTGGC
TGCCTGCAGG CGGCGGGTTT CGCCGATGTC GCCGCAGTGA CGCCTGCGGC AGATGCCGGG
CTGTACGTTC TCGCCGCGCG TAACAGCGAC CGGGTCAAGG TATTTGAGCC GGACAAGGCG
GTCGCGTTCT TGGAGCGACA TCTTCCGGCG TACATGGTCC CCCGTCAGAT CGTCCGCCTG
GATGGCATGC CGTTGTCGCC GAACGGGAAG GTGGATCGCA AACGGTTGCC GCGACCCGAT
CAGACCGGAC AGATGCCGAA GCACGGCCAA ACGGAGGCGC CAAGGACATC CACGGAATGC
AAGCTCGCGG AGATCTGGTC CGAACTACTC GGAACAAAAG CGGTCGGCAG AGACAGCAGT
TTCTTCGAGG GCGGTGGCGA TAGCCTCATC GCTGTACGTA TGGTGGAGCG TGTCCGCTCG
CGTCTGGGAC GCAGGTTGGC CCTGCGTGAC GTATTCGCGG CCCCGGTTTT GTATCAGCTC
GCCGAACGCC TGGATGGCGA TGCCGACGTC GCGGACGTCG TGAGCAGGAA ACGCCTGGCG
TCGGATATCG CCGCGCGCCA CGATCCCTTC CCCTTGACCG ACGTGCAGCA GGCCTACTGG
ATCGGACGAC AGGGGTTGTT CCCGCTCGGG GGCGTTTCGA CTCACCTTTA CGTCGAGATC
GACGTAAAGA AGCTGCCGCT CGGGCGCCTC GAGAAAGCGT GGAACCGGCT GGTGAGGCGG
CATGATATGT TGCGTGCCGT GATCGATGAA CGGGGGATGC AGCGCGTCCT GCAACACACG
CCGGTCTATC ACTTCCTCGG AGCCGATCTC AGCGACGCCG ATCAGTGCGA GATACAGAAC
TGGCTTGAGC GTCAACGCGC GGATATGAGC CATCGCGTGT ACGACGCATC GGTGTGGCCG
CTGTTCGAAA TTCGTGCGGC GCGCCTGTCG GAAAGCGTAC GTCTGCTGAT CAGCATCGAC
AATCTCGTTT GCGACGGCAG GAGCATGGTT CTGCTGCTGC ACGAATGGGC GATGTTGGCA
CGCGATCCGG ACCGCGACCT CGCGCCGCTC GAAATCGGCT TCCGCGATGT GGTCTTGCAT
CTGGTGAGCG AACAGGACGG CGCGCAGGCC CGGCGGGCTC TCGACTATTG GATCGATCGG
ATCGGCTCGC TTCCGTCGAG TCCGAATTTG CCGTTGATCG GCGATCCCGC TGCGTTGACC
CCACCGCGGT TTCGTCGCCT CGAAGCCGTG CTCGATGCCC CGATGTGGCA GGCGCTCAGA
TCGCGCACGG TCGCGGCCGG GCTGACGCCC AACGCCGTTC TCCTCACGGC CTACGGAATG
TCGCTTCGAG CCGGCGGCGG GGGCGATCGC TTCACTCTCA ACCTGACGCT GTTCCAGCGA
CCGGAGCTTC ATCCGCAGAT CGACGATATC GTCGGCGACT TCACCTCGCT GCTTCTCGTC
GCGTTCGAGG AACGGTCCGG CGACACCTTC ACCGAGCAAG CACGGCGGCT GCAAGAGCGT
CTCTGGATCG ATCTGGATCA TGCGGACGTG TCGGCCGTGC GCGTCATTCG TGAAGCCGCG
CGGCGAGGGG GTAATATACA AGCGCTGGCA GCCCCGGTCG TTTTCACCAG CGGCATCGGA
GTGGACGGGG CGGCTTCGGG CCTCGGGGGC GCTGCGCTCG GGGAACTCAC ATGGGGGATC
ACCCAGACGC CGCAAGTGTG GATCGATCAT CAGGTGGTCG AACGCGACGG TCGACTGGTG
TTCAATTGGG ACTACGTCGA CGGCCTGTTT GCGCAGCCGT GGATCGAGGC CGTGTTCGGC
GGGTATCGAG ACCTGCTCGT TGACTTGGCG CAGCAGGCCG AGGCGTGGGA GATCCCGACC
TCCCGCCTCG TACCCGGTCT CGACACGACG TTGGTCGGCC ACGCTACGCG AGCGCACGGA
CCGCAGATCG GGCAACCGGC CGCAAAGGCT GTCGTGATAT CCGTTGGTGC CGGCCGCGAT
ATGGAAGGCT GCGTATCCGA GGCCTTTGCG CGCGAGCTCT CGCGTGACGA CATCGATCCG
GCGCGTAACG TCTTCGAGCT TGGTGTATCC TCATTGGTTT TGATCCGCAT TCATCAGCGG
CTTCGTCGCG ATCTCGGCTG CGATTTTCCG GTCGTGACCA TGTTCGAGCA CCCGACGATA
TGCGGACTGG CGCGCTATCT CTCAGGCGCA GCGGCCTCGG CTGAGCGCCA TCAGGTCGAT
GAGCGCCTTG CTGCGCGGCA GAGAAACGCC AAGCGGCGTC GGCCCTCCAT GCATGATGGC
TCCGCGTCGA TCTATTGA
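
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is measured or analysed. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence dump."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTGTACCG ACGCGGATCC GGGTCGTTCC AGCCGAGATA CTCTGGTTTC CCGTCTGGAG"
seq = clean_sequence(first_line)
print(len(seq), "bases")
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Applied to the full gene sequence, the same two calls would be expected to give a length matching the listed gene length and a GC fraction close to the listed GC content.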
 
Protein sequence
MCTDADPGRS SRDTLVSRLE KYAQLEGGRP ALDFVDRSTG TRTQLSYGEL SHRVKAVAVN 
VQDALVPGGR ALLLLPSGHD YVVALLACLY AGAVAVPVNL PGASRVARVL GRVEHISRDC
GATAILTTRA IADQSREALT SFVAAHRLRL ILIDDAKSGR AWSGYSPSET DIAFIQYTSG
STAEPKGVIN RHDTLISNVS FLRCLLWPKD APVVASWLPL FHDMGLIMGV LAPLALGGRV
VYMAPGAFVS DPLMWLELAA RERAAVLPCP AFALDACVEH YDADRLRDLD LSCVESLVPA
AEPVHLRQVR AFFDLYSRHG LHWGAIRPSY GLAEATLIAS GSSHDGGPVA VSVDAASIAR
GTALVVTDGA PDSRVYLSNG ADFGGQDLRI VDPETRRTKP AGDVGEIWIS GAAIAAGYWG
RSDATEETFA AHLSDDGASD ATNYLRTGDL GFLHGGHLYI TGRSKDVMIF RGQCHYPNDI
EASLANLHDD IIAGGAAAFA IPGDQGVERL VVVQEVRRHS DLDAAALEAS IRETIAREHG
LAAHDVVLIR RGTLKRTTSG KVRRAEMRRL YISGGLTIVG SDRQPAGQDT RSPSREQRRD
AVRGKVLDCV RRALGPSSPR VIDPARSLFA LGLDSLAATQ AVAALERDLG RSLPEGVLFD
YPTVDALSDW LVSRAEEAPC LPPDQPLSRR EAGEPLAIVG LSCLFPAGAN DIEDPAEFWR
WLMTGGDAVR GLAADRFRQD LDIPGYGACL RRVDGFDAAF FGVGPREAMN MDPQQRLLLE
ATWHALEDAG FVPQSLRGSD TGVFVGVGTG DYGHLPFVTR DPAHLDPYYG TGNAFAAVAG
RISYVFDWSG PSIAVDTACS ASHAAVHLAC QSLRTGESSL AVAAGVKLQI LPEIDLVLAR
AGMLAADGRC KTFDAAADGY VRGEGAGVVV IKRLADALRD GDPIRAVIRE SVLSQDGASA
SLSAPNSEAQ RRMLSKALAR AEWTPADVDY VELHGTGTRL GDPIEFEALA SVFFGRDATD
PLYLGSVKTN IGHLEAAAGI AGLIKVVLAL QEGRIPPNLH YNRPNPAIDL ARIPAVIPTA
PIDWPLRGDR RRAGVTSFGF AGTIGHILLE QAPLREAPAC CRSDRVSLLL LSARSTGSLE
ELRRRYLECL RDLDGPLSAF VNAAARQRQH FLDHRLIAIG ADAKGLAGAL VAANPVQQQR
PARICFLFTG QGAQHAGMGR ALYDAEPAFR RAIDRVDAAM SPFLGGSIRD LMFAADALEL
HETRYTQPAM FAFGYALAQL WRGWGVEPDT IIGHSIGEVA ALVHAGSLTL DAAASFIVRR
AGLMQSLKQR GGMMAVRLSA ANVSERIADT SISIAAINGR EDVVVAGPDA DIDRLSAEFS
AQGISARRLK VSHAFHSALM DPLLEELEAV AENCEARAPR VRFISTLSGD LLSGAPDPAY
WRRHAREPVR FASALGVAIA NGCDCFIEIG PRPLLVSLAG REAHESGLAD GLFLASAREG
ESHPNSLMEC LGALYLRGVD FNLEAAFRGP AALPAALPSY PFDRRSYWLE YRDQHEGPSP
QLPQMRKQDE SAELSLSRLT WTEAVLPQND PAPAPRLYLI EGGGAAGRAV AESCAEVGVI
ASDAEALPAA ASGDSILVWL GALAAHEAPQ PDALWRYIAY CQALYRARLT ARIVVVTRGG
QRDGTAAPAQ AAFWGATRAL AIECPDLQFL LVDVNAADDP ARVLATIAPR LSDIIPQEDM
LAWDGAQWLS PRLEAVPAFE AQPGPVGTDG TYLICGGLGA LGGHVLDWLV ACGVRDIIVT
GRGEPGSRAR STFQRYGQAG VDIRYVRADV ANETDMRNLF REIDESARAL RGVFHCAGIG
RFDTIDAIDE AAFREATQAK ADGSWILHCL TCDRSDVEHF VVFTSIAGIW GSRFQIHYGA
ANAYQDALAR LRRARGLPAL AIAWGAWGGG AGLSEVDDSL LQYLHRAGIS RFEPQRAITT
LAGLIATPGN WIAAEVDWRR FAPLYRTFGR SDLLARLAPA TEATASRSED CPDWTRLSAA
DRRCVLESFV RTTIATVLRI DAGEMHDDVE LIKHGLDSIL VMDFARACRQ KLGVDCALRA
IFESATPGGL VDYLEGLAAS CERTHEEATV DQIVPDLASR HEPFPLTDLQ YAYWAGRDPQ
FTLGNVSCHA YLESEIVGAF DLARLEAAWN LLIARHDALR LVIDDHGMQR ILAEVPRYQF
RLVDLTTADA AVVDDHLSAW REEMSHQVLD STSWPMFDLR ASKLLDGRTR LHFSIDMLIN
DVASSQTLWS ELGRVYRAGS IEAAGLEPFT ISFRDYVVAK ANPSPSRQAI RQRDWAYWME
RLPLLPPPPQ LPLAVNPEQV ARPRFVRHAA RCDQAQWSEL RRRARSFGVT PATLLIGVFG
EVLAAWSDQL DFTLNLTIFD RLQNHPDVPR LVGDFTCVTL LAVDCREPMP LAARLQTIQK
RMLEDLEHRS VSAVEVLREK NRGNDRLVGA PVVFTSQLGM HDPTKGTSDG DPLGQVVYGI
TQTPQVWLDY QAAELDDGLL LNWDVVEGLF PEGVIEAMFK ANADLLAALA TADDAWTRAA
GALLPQSQRE VRRHVNATKS ELPLDTLDNL FFETAAREPD RIAVIAGDTM VSYGELASWS
RRLATRLRAE GIRPGDRVAV VISKGPEQAA ACLAILSQGG VYVPLDPAMP TARMAKVVAG
SGIGIVLVQQ YRDDCVAELG VRVLVADLVE CRGCEETEAA PGRSLNDEAY VIYTSGSTGT
PKGVVIDHRG AANTVLDVNR RFGVGPDDRV FGFSALGFDL SVYDLFGTFA AGATLVLPEA
DGTHDPRHWS DLVQRYGVSV WNSVPAVFDL LLDETNADLA SLRLVLLSGD WIPLKLPTRL
RDRVQTARLI ALGGATEASI WSNWFEVRRV EPHWRSIPYG FPLSNQSYRV LDPALRDRPD
WVVGDLYIGG VGVALGYDGD AERTADAFIV HPDGERLYRT GDLARYWPDG TIEFLGRRDG
QVKIAGHRIE LGEIESALTS HHEVLDAVVD VVGAAEGSRR LVAWVSLGDD GDDLQTTVVA
EAATVERNGK DLGRAFATSW EASPWGNDEL ADFWGWQELI ARQCVRDLLA SQSAFGNSDR
PYTLPGLERS LGLTEKYRAL LPRWLSLLET SGELRRNGGD WFGQLAGSDW DLIALQAGRF
GVPPAVIARL RDSADRRHAV LRGEESALAV FYDEGSGLSP EQLAKLHPCA TRIYADVGCQ
LARIAAESDR PLRILELGAR AGEATREWLA AAAAAPIEVT ITDPSSLLLD DARARQPDAA
ASIWRVFDPD RSPTAQGFVE HEFDVIVAFN ALHRSDDVNR VLSNCRRLLR PAGLLIGVEL
TINSPLLDVT VALIESGFDQ LQDLRRGRGA PLLSGEEWRG CLQAAGFADV AAVTPAADAG
LYVLAARNSD RVKVFEPDKA VAFLERHLPA YMVPRQIVRL DGMPLSPNGK VDRKRLPRPD
QTGQMPKHGQ TEAPRTSTEC KLAEIWSELL GTKAVGRDSS FFEGGGDSLI AVRMVERVRS
RLGRRLALRD VFAAPVLYQL AERLDGDADV ADVVSRKRLA SDIAARHDPF PLTDVQQAYW
IGRQGLFPLG GVSTHLYVEI DVKKLPLGRL EKAWNRLVRR HDMLRAVIDE RGMQRVLQHT
PVYHFLGADL SDADQCEIQN WLERQRADMS HRVYDASVWP LFEIRAARLS ESVRLLISID
NLVCDGRSMV LLLHEWAMLA RDPDRDLAPL EIGFRDVVLH LVSEQDGAQA RRALDYWIDR
IGSLPSSPNL PLIGDPAALT PPRFRRLEAV LDAPMWQALR SRTVAAGLTP NAVLLTAYGM
SLRAGGGGDR FTLNLTLFQR PELHPQIDDI VGDFTSLLLV AFEERSGDTF TEQARRLQER
LWIDLDHADV SAVRVIREAA RRGGNIQALA APVVFTSGIG VDGAASGLGG AALGELTWGI
TQTPQVWIDH QVVERDGRLV FNWDYVDGLF AQPWIEAVFG GYRDLLVDLA QQAEAWEIPT
SRLVPGLDTT LVGHATRAHG PQIGQPAAKA VVISVGAGRD MEGCVSEAFA RELSRDDIDP
ARNVFELGVS SLVLIRIHQR LRRDLGCDFP VVTMFEHPTI CGLARYLSGA AASAERHQVD
ERLAARQRNA KRRRPSMHDG SASIY
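
The protein sequence above corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, assuming the input is the coding strand read 5' to 3'. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. This gene begins with ATG, so the sketch does not handle alternative starts. The name `translate` and the table construction are illustrative and not taken from the source database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the blocked layout
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGTGTACCG ACGCGGATCC GGGTCGTTCC"))  # MCTDADPGRS
print(translate("TCCGCGTCGA TCTATTGA"))               # SASIY
```

The two fragments translate to the first ten and last five residues of the listed protein sequence.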