Gene RPC_2997 details


Gene Information

Locus tag: RPC_2997
Symbol:
ID: 3973239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3279845
End bp: 3293152
Gene Length: 13308 bp
Protein Length: 4435 aa
Translation table: 11
GC content: 64%
IMG OID: 637926108
Product: filamentous haemagglutinin-like protein
Protein accession: YP_532861
Protein GI: 90424491
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
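
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
START_BP = 3279845
END_BP = 3293152
GENE_LENGTH_BP = 13308
PROTEIN_LENGTH_AA = 4435


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons agree with the values listed for this gene.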


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.157653
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGATC GCGAAATACT CATTCGTTCT GGCCGTTTGC GACGGGCTCT CGACGCTGCG 
CTACTCGGCG GCGTCAGCCT GTTGGCGCTG ATGCTCGGCA GCAATCCGGG CGCGGCGCGC
GGCCTCACCC CCGGCGGATC TGTCGGCTCG ACAGTGGCGG CGCAGCAGGC GGTCGCGGCG
GCGGCCCAGC AAGCCTCGGC AGCGGCGACG CAGGCACAGG CCGCGCTGGC CCGCGCCGCG
GCGGCGCTGG CGGCGGCGCG AAAATTCCAG GCGGACGCTG CCGCGGCGGC CCGGGCTTCG
TCCTCATCGG TAGCGAACGG CCTGGCCACC GGGGGGCTGA TGCCGTTCGG CGGCACCGCC
GCCGATCCGT CTGCCGGGAT CAGGGCCGGC ACCACCGCCT GGACGCTCTC TGGCGATCTG
CCGGTGCAGA CCAGCAACGG GTCCCAGGTC ACGGTCGATA TCAACCAGAC CCAGGCCAAG
GCGATTTTTT ACTGGGATAC GTTCAACGTC GGCGCCAAGA CCACGGTGAA CTTCAACCAG
AGGGCGTCGG ACTGGATCGC GTTGAATCGG GTGCTCGATC CCTCCGGGGC GCCGAGCAAG
ATCCTGGGCC AGATCAACGC GCTCGGCGCG GTCTATCTGA TCAACCGCAA CGGCATCGTC
TTCGGCGCCG GCGCGCAGGT CAACGTCCAC ACTTTGGTGG CCTCCAGCCT CGACATCGGC
AAACTCGGCA CCGATCTCAC CGCGCGCGAC CAGTACTTTT TCAACACCGG CATCGGAAAT
CTGAATTCCT TCTCGGTGCT GGATCCGGAT GCGAGCGTTG GCGCCGCAAC GAAATTCCTG
CCCGGCGATG TCACGGTCGA ACGCGGCGCC TCGATCACCG CCAATATCCG CACCGACATT
GCCGCGCTCG GTTCGCCGGG CTCGATCTAT CTATTCGGCA AGAACGTCAG CAATTCCGGT
CTGCTCACCG TGCCCACCGG CGAGGTCGGG ATGGTGGCGG CGCGCATCAT CGATCTGATC
CCGAACGGCT ACTCAGCGCT GCCGACCGCG GTGCTGGGCA CCGATGCGGA CGGCAATGCG
CTGACATTCC GTGGCACCGA ATTCATGCTT TCCCAGTTCG GTTCGGCTTA TGACTCCTCA
GGGTACCCCA CGGTTCAAGG CGGTTTTAAC TACTTGGCCG GGACCGGCGC GGTTACCCAT
GACGGCCTGA TCGAGGCCTC GCGCGGCATC GTGATCATGA ACGGCGACAA AATCGCCATC
AACAATCCCG GCGGCGGCTC GCTGCGAGAT GCCTCAGGCA CCCTCATTCA GGGCGTGATC
TCGGTCGATA CCAGCATCGA CCGCAACAGC ATGGTGCTGC TGCGCGCGGC CACCAGCGTC
ACCCTGAACG GCGTGATCTC CAGCCTGCCA TTCGACGACG GCGCCGATCC GCTGCCGAGC
GGATCGAGCA GCGGCAGCAC GGTGCAAAGC TTTACGCCGG CCTATATCGA ATTGAGCGCC
AAGGGTGAGA TGACGCTCGG ATCAAGCGGG CTGGTCTCGG CGCCGTCGGC CCAGGTGGCG
GTTAAGACGC TTGGGAGTGG TATCGGCTCG CTGTTCGCCC AGGGCGGCAC CCAAGGCAAC
ATTGCTTCCA ATGGCAGCAA CGCCGGGGTG TCAATCCTGC TCAATCCCGG CGCGACGATC
GACGTCGCCG GGCTGCAGAA TGTCGAACTG CCGGCGAGCT ACAATTTCAT TTCGTTCCAG
CCGCGCGCCG AATTCGCCGA CATGCCGCTG CAGCGCGATG GCGTGCTATA CGGTAAGACC
TTGTGGATCG ACATCCGCGC CTCCGGCACC CGCGCCGACG GCACCAGCTG GGTCGGCACG
CCGCTGGCGG ACGCCAGCGG CTATGTCAAC GCGGTCGGCC GGAGCATCTA CCAGCTGATG
ACGGTGGGCG GCCAGGTCAG CCTGAGCACT GGCGCATCTG CCGGCACGGT CCAGACCACC
GGCGCGGTGA TCAACGTGGC CGGCGGCAGC GTCACATTCC TGCCCGGCGT GGTGCCGGTG
ACGCGGCTGC TCGGGATCGA CGGCCGCATC TACAACATGA CCAACGCCGA TCCGAACATG
ACCTATGTCG GATTCGCCGG CCAGTTCACT GTCGACCATG CGCGCTGGGG CGTCACCGAA
ACCTGGTCCG TCGGAACGCA AACCTATGTC TCCGGCTATA CCGAGGGCCG CAACGCCGGC
GGCGTCGCGG TGACGACGCT GACGCCGACG CTGGGAGCGA TGTATTTCGG CTCGGTGGCG
GGCGAGCGGC AGATCGCGGC GGGCAGCCTG CCGTTGCAGG GATCGCTGAC GCTGACCACG
ACTTCGGATG TGCAGATCGG CACGGCGCCT TCGGACAATT TCACGGCGAA TCCGCTGTAT
TCCGTCACCC TGTCGGCCGA CACGCTGTCG AGCTACGGGC TGAGCACGCT CGCGATCACG
GCCTACGACC TCGTGGTATC GAGCGGCAGC ACGCTGTCGC TGGCGGCCGG CGGCAGCTTC
TCGGTCAAAG CTGCGGGCAT CGTCGATATC GCAGGCACGG TCTCGGCCGC GGGCGGCGCG
ATCAGTCTCG AGACCGATTA CTATGGTCTC CATAATGTTA GTGGACTGTT CCAGTCGTCG
CCGATTAATA ATTCCAAGGT GGCAGCGAAT ATCTTCGTCG AAGGCACGCT GGACGTCAGC
GGCCGCTTCG TCAACGACAC CGGCCGCTTC GGCACGGATG CATCCGGGCC GGCCTTCATC
AACGGCGGCA CCATCTCGAT TAGCACGAAC CTGAACAGTA GTGCTATTGA CTCGAAACAG
GTTGACACCA CGGGCAGCAT CTTGCTGGCC AAGACCAGCG TGCTCGATGT GTCGAGCGGC
GGATATATAT CGGCTCAAGG AAAGCCGAAA ACCGTCGCCG GTGGCGGAAT GGCCGGCAAG
GCCGGTAGTG TTTCCCTTGC TATCGTCCAG TCAAACAGTT GGAGACCACG GGAGAGTGGT
AGTCCTGTAC CCGAATTCCC CGAATCCGGT ACTATAGCCG TACTTCAGCT CGACGGCAGT
TTGCGCGGCT ATGGATTTGA GAGCAACGGC GGCCTGAGAC TGGTCGCCTT CGATACGATC
CGGATCGGCG GGACGCTGCA GCCCGGCGAG ACGTCGTCGA TTCGCGTCAA CGGCGTGGCC
ACTACGCTGC CGGTCGCGCT GCTCACCGGC GGCGGGTTCG GCGACTATAC GATCGAGTCG
CTCGCGGACG GCTGGACCGG CGCCACCGCC AACATCATTG TCTCGGCCGG CGTCAGCCTG
GCCCTGCAGC AGCAGAACCT GTCGAGCGTC GCCGATTACA GCGACACCGC GACCGGCACC
AAGCTCGGCC AGCAGGCGCA ACTGGCGACC CTTCCGGACG ACCAGCGCAA GCCGGTCAAC
CTCACGCTCA GGGCTGACAA CATCCTGCTC GACACCGGCT CGAAGATCGT GACGGACCCG
AAATCCGCCA TTGCCTTCGG TGGTATCGAT GGCTTGTTGA AAGTCACCGA TCCGCTGCGC
AATGCGGCGG CGAAAAGCGT CCAATTTCTC GGCAGCATCG TCGATCCCAG CGGCGCCGTG
CTGGTCAACG CGCTGAAGAC CTATCTCGGG CCGCAGGCGG TGGTCGACCT GTCGGGCATC
TTCGTCGCCA ATTCCAGGTT CGGCGAGCCG AAAGGGCCTT CGACCGGCGG CAGCTATTTG
TCGGGTGGCA GTTTTACGGT CGAAGCGGCC GGCCTTAGAA CGATCAATGT TCTCATCGAT
ACGTCGGATG TTACAAAGCA CTATCTCTAC TACACCTATG GCCTTCCGGG CACAGGCTAT
CTGGTTGCCG ATCGCGGAGC GATCGTCGAT GTCTCCGGCG CCGCCGGCTT GGTTGAGGTC
GCCGGCGGGC ACGGCACTTC GACGTCGGTG TGGTCGTGGA GCGACGCCGG CACCATCAGC
GCCGACGTCA CTGGCTTCGC CTGGGGCGGC AGCTTTGTCG CCACCGGCGG TCGATATCTC
GGCGCCGACG GCATTGTCCA TGCCGATTTA CGCGCCAACA ACGGCACCTT CATTCTCGGC
GGCGGTTCCA TCTTGCTGCG GCAGGATATG ACGGACGTCA ACGCTCATTT GTCTGCTCCG
CAGTCGCTCT ATGTCTCCGC CGATCAGCTC GCGCCGTTTG ATAATATCTA CCTTTACAGC
GGCGCCGCTG TTGGCGGCGC GGCCCGGCTG TTCGATGAGC TGCCGGCGTA TAACCGGCTG
GCCGGGCGCA ATGTCGTCGC CGGGACCTAT GAGGTGGCCG GAAGCACCTA CAACATTCCC
TGGACGTTCA CTGGTAGGCC AAACATGGTC TTCAACCCGC TGACCATCTC GGGTTCGCTG
GACTGGCACG TGCAAACTGG GCTCCGCATT GCGGCCGGCA CGATCGCGTC GTCGACGACC
ACGGACAATA GCACCGTCAC GCTGGATGCG CCCTATCTGC TGCTCACTGG CGGCAATGGC
ACGGCGGCGT CCGGCCGCAA CACGTTGACC ATCAGCGGCC AGGCCATCGA CATTTTTGGC
GCGGTCAGCC TATCGGGTTT CGCGCAGGCC AACTTCGTCA GTTCGGGCGA CATCCGCCTC
GGTACTAGGA AAGTCTCGAA TACGATTGCG ACTGGCACGG GGGCATCGAC CGCCGCCTCC
AGCTTCACGG GCAGCCTGGA TTCCAAGGGC GATGTTCTGC TCAAGGCGCA GCGGATCTAT
CCGGTTTCCG CGGTCGACTT CACCATCACG ACGCCGGGCA AGGTGAAATT CCTGGCGCCG
GCCGGCAGCG ACGCCGCCAT TCCGCTGGCC GCCGGGGGCG GCATGACGGT GATCGCAAGC
GCGATCGAGC AGGCGGGCAA TCTGTTCGCG CCGCTCGGCA AGATCACGCT CGGCAACACC
GACAGCACGG TTTCGAGCAT CATCACCCAG AGCGTCACCC TTGAGCCCGG CAGCCTGACC
TCGGTCACGC TCGCCGACAC GGTGGTGCCC TACGGCGCGA CTTTGGACGG CACCAACTGG
TACTACAACG CCAATCTCGA TCCGATGTCG CAGCCGCCCA CCAAGGGTCT GGTGCTCGAC
GGCGCCAACG TGGTGCGCGC GGACGGCTCG ACCATCGACC TGCGCGGCGG CGGCGACCTG
CAGGCGATCG AATGGATTCA GGGCAATGGC GGCTCGCGCG ACACGCTGAA TAAAACGACG
CTTGGACAGA CGGTCTACGC GCTGGTGCCG TCGCAGAGCA CGGCGGTCGC GGCGTTCGAT
ATCCATTTCA CCACGTTGCG CACCCTCACA TTGGACACCG GCAAAACCAT CATGGTCGGC
GACGCCGATC CGCTGGTCGG CACCCAGATC ACCATCGCCG GCGGCAACGG CATTGCGGCC
GGAACCTACA CGCTCTATCC CGCCCATTAC GCCACCCTGC CGGGCGCGAT GCGCGTGGTC
TATTACGGCA GCAATATCGG CCGCAACGTC GCATCAGGCA CCACGCTGCC GGACGGCACC
GTGCTGGTCA CCGGCTACAC CACGCAGTCG ACCGCGTCGG CGACGCAATC CTCGGGCCAG
AGCCTGTTCG CGGTGCAGAC CAGCGCGGTC TGGCGGAAGT ACAGCGAATA CAATCTGAGC
AGCGCCAACA GCTACTACAG AGCGTTGGCG ACCAAGAACG GCAGTCTCGT GCCACGGCTG
CCGATGGATG CCGGGCGCCT TGCGGTCCTG GCGCAACAGT CGATCCTGCT CGCCGGCGTC
GCCCTGACCC AGCCGGGACA AGACAGCCTC GGCACCACCG GGCGCGGCGG CGAACTGGAC
ATCAGCGCGC CGCAACTCGC GGTGATCGGC CACGCCCAAT ATGTCAACAA CGACATTGCG
GCCGGCTATG TCGGCCTCGA TCTCACCCAG GTCGAGGGCT TCGAGAGCGT GCTGATCGGC
GGCCTGCGCA GCGACACCAC GACGGGGACG CTGATCACCG CAAGGGCGAC CAATGTGCTG
GTCGACACCC AGGGCGAGAC GTTCACGGCG CCGGAAATCC TGTTGGTCGC CAAGACCGCC
GGGGAGTGGC AACGGCTCAG GCAATCGGTG CAAGTCGGTT CCGATCCGGC AAATACAGTG
CTGATCGAAT TGCCGGTCTA TGTGCCGATC GCCGGCACCG GGGTGGTCAC GGTCAAGGCC
GGAAGCGTCA TCGAAACCAC CGGCGCGGTC CATAACGGGT ACGGACGCAG GTTCTACTTC
CCCAGCAATT CCGGCACGAC CCCGGCGCAG CAGATCGCCA CGGCCTTGGG CGGAACGGTG
GCTTCATCGG GAACCGCGAT CACCGGGGTC GATCTCGCTA AATTGCAGGC AGCCCTCGCT
GCGCAAACCA GCACCGGCTT CAATAGTCTG GCCTATTATG CCTATGGCGG CGGCGCCATG
GACCCGCTCG GCGCGCTGTT CGCCGCCAGC AACGACCCCG ATCTGGTGTT GTCCGGACCG
GCGGGCGCTT CGGTGTCTGC ACTTACCTTG CAGTTCGCGG ATGTGACCAA GGGGAAGGTG
ACCGGCCTGC CGGCCGCGGT CGAAGGTGCG GTGACCGGGA CGGTCACCCT GCAAGGCGGC
GATAGCGGTC GAGTCTCGAT TGAAAATGGC ACAAGGATTG CGACCGACAC CCTGACTTTG
CAGGCGACCG CAGCGACCAA CGCAATCAAT CTGGAGACCA ACGATCTGCA CGCCGACCAG
GTCAACCTGA CGGCGCACAG CATTGCCATT GGCTCCAACC CGCCGGCCGC CAGCGGCAAG
AGCCTGATCT TGGCGGCGAA CAGCGGCCGG TTCGGCGACG TGCACGGTCT GACGCTCAGG
GCGCTGACCG GCGGCATCAG CGTCTATGGC AATTTCGATG CCGGCGCGGT GATGGAACGG
CTCACGCTGG ACGCAGCCGT GGTGGCCCGG GGCGATAGCA GCGGCGCCGC CAAGGTCTCC
GCCAAACGCG GCACCATCAC CCTGGCCAAC AGCGGTGCCG ACGCCTCGAC CGGTTCGGCC
TCGACCGCCG GTGATCTCGG GCTCACGCTC GAGGCGGCCG ACATCGTGCT CGGCGGCGGC
GGTAAACAAG CAATTCTGGG CTACGCTCGC CTGAACTGGC TCGCCAGCGA TCGCGTACTG
GTTGCCGGCT CCGGAGCGCT TACGCTCGGC ACCGCGACCG ACAAGGTCGA TTTCACGGTG
ACCACGCCGA CCATCCTGGT CGCAAGCGCC ACCGTGAGCG GCACCAAGAG CTTCGGCATC
ACCACGCTCG GCAACGTCAT CCTGGCCGAC ACCACGCCGC GCAGCGGCGC GGTCGATCGT
CCGACCGACA GCGCGGAGAC CAACGGCGTC TTTGCGATCA CCGCGGCGAG CATCAGCGTC
GGCAACACCA TCCAGGCCCA GGCCGGCACC ATCACGCTCG ACGCGACGGC GGGCGACGTG
ACGCTGCAGA CCGGCGCCTA TCTCGCCGCC GGCGGCTACA AGAAAACTCT GATCGACGTC
GACACCTATG TCGCCGGCGG CAAGGTGGTG CTGAAGGCCG ACGCCGGCAA CGTCGTTACG
GCGTCGGGCA GCGTCATCGA CGTGGCGCAG CCGCAGGACG GTTTGGGCTA CGCCGGCGAA
ATCGACGTCA CCGCGCTCAA CGGCGGCGCT ACGCTTCATG GCACGCTGCG CGGCAGCGGC
GGCCCCGGTC TCGGCGGCCG GTTTTATCTC GACATCAAGG GCGCGGCCGA TCTGACCCGC
CTCGCGGACA CCTTGCTGGA CGGCGGGATC ACGGGAGTCA TCGAGATCCA GACCCGGACC
GGCAATCTCG AATTGCAGGC GGGACACACG CTGCAAGCCA ATGCCGTCTC GCTGACGGCC
GATGACCCGA CCTGGGACAA TACCGACACC TCAAAGCAGC TCGGCCAGCT CACCATCGCC
GGCACGATCA ACGCCGACGG TTACTCCGGC TACACCGCCG ACGGACTATG GCAGGCAGGC
GGGCAGGTCG GCCTGTTCGG CCACAACCGA GTCACTTTGA AATCCACCGC CCGGATCAGC
GCCACGACCA GCCACGCCGA CGAGCGCGGC GGCGACGTGA TGATCGGCAC CGCCTGGGCT
GCCGGCGGCA ACGCCGACCC ATCGCAGAAT TCCACCGCCG GCTATATCGA CCTGCAGGCC
GGATCGGTGA TCGACGTCAG CGGCGGCACC AAGGGGGGAC TCAGCGGCGG CACCGTGACG
CTGCGGGCGC CGCTCGACGG CGCCAACGAC GTCAAGATCG CCGGGTTGGA CTCGACCATC
CGCGGCGCAC GCGCGGTCTA CATCCAGGGC TTCATTACCA TCAATACCGA AGCCACCCCG
AACAATGTCA GCGGCATCGA TGGCAGCAAG CTCACCACCA AGGACGGCAC CGCGGTGACG
TGGGACGGCT ATATCGATCC GGCGGGAGCG GTGACATCCT CTGGCGCGGC GATCGATTTC
GGGGTCTGGA CCGGGGTGAG TGGCGTCAAA CTCACCTACA GCGGCGGCTC CGGATATACC
TCGGTTCCGA ATATCTCGAT TGGGACGCTC AATGGCAGCC TGGTGGGTAA TTATTACCAG
GTCATAGATC CCGTCACCGG TGCCATCTAT CGATCTAGCC TCCGGATCGC CTCCGTTCAA
GTTACCAACG GCGGTAGTTA CACCACCGCG CCGACCGTTG TCGTCGCCGC GCCGAACGAG
GGAAGTAGCG CAACTTTCAA TCTCCAGATG AGGTACACAA ACAACACGCT GACATTGAGC
ACCTCAAGCG CGGTACCTTA TACCAACGTC GCCTTCTATA GCAGCGCCGG AGTGATGATG
GCGAGGGGAA AAATTGCGCC GGACGCAACG ACGCCCGGAA GGTACACTGT CACGATTACG
ACTTACCCAT CGAATGTCGC GATAACTAAT CTAAATCCAT CTTCGATCAA GCTATGTTCA
TCTCTAGCCT GCCTCACGGG CACTTTCGTC GATATCACCA ACACCAGCAG TGTGGTCTCC
GGTAGCCTTT ACGTCTATGC GATCGTCCCC GTTGATGGCG GCTCAGGTTA CGTCAGCAGC
AGCCAAGCGA GCCTGACTTT ACAGGGCGGC GTGGGGACAT CGCTGGCGAC CGCGAGTGTC
AGCATGGGCG CTGCGGTGAC CATTCTTGGA GTTTCCAAAG GCTATACGGC CGCCGCGATA
GCGCCCAAAA TAGTGGCCGA CGCCACCAGT TGCCCGACGG CCACTTGTAC CGCCACCACG
GTCGCGTTGG CGACGACCAC GACCACCGGC CTGACCGGGG CGACGAAACC GCAAGGGACC
AGTAACGTCT TCGTGCCCGG CTCGAACTAC CTTCCGCTCG CCACCAGCAA GGTCTTTGCC
GGCACCGCGG ACAGCGCCGT CCTCACCATT TCGCCATACG CAGACCATCA GCTGCTCTAT
ACCGACGTGC TCGCCAATTT CGTCGAGGGC AACGGCATCC AGGGCGTGGG CCTGACCGGC
AGCTATGGCT TCAGCAACCT GTTCGCCCGG CTGCAGAACG GACTGGTCGG ACAACTCGGC
GCCAGCATCG TGCACGTGCA GCCTGGCATC GAGCTGGTCA ACACCAGCAC CAGCAAGAAT
TCCGGCGACA TCACGGTGGC CAGCAACTGG AACCTGGCGG CGGTGAGCGC GGTCGGCAAT
CTGAAGACCG TCACCACACC AGGCTCGGTG ACGTACAACT ATTTCGATCC GGCGACGGGC
TACGTCAACT TCATCTACCG GCTGGCAACG CCGTGGGGCG GGCTCGACGC CGGCGCGCTG
ACGCTGCGCG CGGTCCGGAA CGTCAACGTC AACGCCTCGA TCTCCGACGG GTTTTTCCAG
GCAGGGAATT ACAGTGACGC CGACTACGTT AACTGGCTCT ATACCTACGT GCAAAGCACG
AGAATGATTA ACGCAATTAG CTCATTACAA CATGGTAGTT TAAACTACTA CTACCTTAAT
AAATATTCGA GTGGCGCGGT TCCAATCGCT CCCTATAAAG ACGCCGCCAA TTCGGTCAGC
CCGACCGCCC AGGATCTCGC GGGTGCGGAC CTGTTCCCGA ATTCGCTGAA TGTCTGCACC
GTCGATTGCA GCGCGGCGAA CATCAAGCAG GTGACCGATC CATCCTCCTG GTCCTACAGC
TTGACTGCCG GCGCCGACGT CGCCAGCGCC AATCCCACGG CCATGATTTC GCTCGCCAAT
GCAGGCGGCA AGGGCGACGT GATCATCGAC AACCACACCA ACTATTCGCA GACAGAGTTT
TATACGCCCG CTGGTAATAA CTATACGAGC AAGAACGTCG ACGTCAGTCT TGCGACCATG
GTGCGGACCG GGACCGGCAA TATCGCAATC ACGGCGGCGC AAGACGTGAT CCTGCGGGAT
AAGGTGGCCC CTGGGGTGAT CTACGCGGCC GGCGTCAACA GTGCAAAGCT GGCCGATGCG
AATTACAGCG GCACATCGAG CGTCGTCGCA GGCAATGCGG AGGGCTTCTA CGAGCCCAAG
GTGCTGGCCT ATGGCTACAG CGAAGGTCTC CTGTATTACG GGCCGCCCAC GGCGGCGGCA
TTCCCCGAAC AAGGCGGCGA TGTCACTGTC GAGGCGCAAC GCGATATCAT CGGCTACAGC
GGTAGCGGCA ACAAGACCTT GCAGTATTAT CAACCTTGGC TGTTGTCCGA CGCCGGTGTG
TCGCCCGCCA CCACGCAGGC CAGCGGGCTT TCGATTAGCC TGGTCGGGCA GGGCGTGTTC
GCGCCGCTCG GCAGCCAGAT CGCCTCGCAG ACCGCCTGGT GGATCCAATA TGGCAGCTTC
CAGCAGGGCA TCCTCAGCGC CGGCGGCAAT GTCACCGTCA CCGCCGGGCG CGATCTGATC
GACGTGTCGG TGTCGCTGCC GACCACCGGC CGGGTCAGCG GCGGCCTGTC GGCGACCAGC
ACGCCGGTCA CCCACCTCTA CGGCAGCGGC AACATGCTAG TGCGGGCCGG GCGCGACATT
CTCGGCGGCG CGTTCTACGA AGGCTCCGGC CATGCCAGCA TCATCGCCGG GGGGGCGGTC
GGCCAGAACG GCACGATGAC GAGTAAGCTG AAGCTTGCCG ACGTGCCGCT GCTCGCCGTC
GATACCGGCC AGATCGCGAT GACGGCGGGT GGCGCACTCA CGATCGCCGG CGTGATCAAT
CCTGCGGAGC TGCACCTCCA GACGCCGAGC TGGGCGAATC CACTTGAAGT TAGCTGGCAA
CTTTCTGGAT ACCTCCACAT CGATAGCTAT GGTCCGGACA GCAAGGTGCG GCTGGTGGCC
GCGACCGGCG ATCTCACGAT CGATTACTCG ACGCAAAACC TATATCCTGC GAGCTTCGAA
GCGCTTGCGC TGAAAGGCAG CCTGATCACC AGTGGAAAAA CTGGGTTTGG TATCGTGCTG
AGCCCGTCGG AGCATGGCAG CTTCCTGTTG CTGGCGCAGG GCGACGTCGA TCTCACCTTC
GGCAGTTCCG GGGGAATCCC GATCTCCGCC GGCGCGGCGC TGCTCGAAAC CGCCTTCGAT
CCGTTCCAAC CTAACAACGG CTATGACGGC GCATTCAGCA AGGGGACCCT GGCGCAGCAG
GATTACAGCT CGGCGCAGAT CGCGCGGATC TACGCGGTGA CCGGAGACAT CACCGCGACC
GGAGGCTACG TTGAAGCCAA CCCTAACGCG GGTATTAGGA TCTCGTCCTA CAAACGCATC
GAAATCAACC GGCCGACCAA GATCTATGCC GGAGGCGACA TCGTCGATCT CAACCTGGTG
GTGCAGAACA TCTACCAGAC CGACGTCAGC ACCATCGAGG CCGGCGGCAG CATCTATTAC
ACCGGCACCA ACAATGGCGG CGGCCTGCAG GTGGCCGGAC CGGGCTTCTT CGTGGTGCAG
GCGGGCGGGG ATATCGGCCC GTTCCTTCCC GCGGCCTATG ATCTCGCATC GACAGCGAGG
GTCCAGGAGG GCATCACATC GGTCGGCAAC GCGACGATGA CAGCGGTTGG CAATTCCGGA
TTTGTCGGCA TCTACAATGC GTCGTTGCTC GGTCCTTATG AGAATCCCCG TCGCAATGCG
CTGCTGACGG AAGCGGCCGG CACCGCCCAG GGCGCCGACA TCGTCGTGCT GTTCGGCGCC
AAATACGGCG CCGACTACCA GGCTGTGGTC GACACCTATA TCAACCCCGC CAACGCCGCC
AAGGTCGCAT ACAACTACCT CGGCGAGCTG CGCGACTTCC TCGCCCGCGT CGGCATCGCC
ACCACCAGCA CGCAAGACGC CTGGAACAAG TTCACCAATG CCGACAAGCT GGCGGTCCCG
CCGGTGAGCC AGGACCTGCG CCAGATCTTC GTCGACAAGG TGTTCTTCGC CGAGCTGAAG
GCGGTGGGGA TCAGCGAGCA AGCGGGCGTC ACGCAGCACC AGCGCGGCTA CGAAGTGATC
AACACCCTGT TTCCGGCCAG CCTTGGCTAT ACCGCCAACG CGCTCGGCGA CGGCATCAGT
GGCACCAGCG AGCGGGTCTG GACCGGCGAT CTCAATCTGT TGCATTCCAC CATCCAGACC
AAGCTCGGCG GCGATATCTC GATCTTCGGG CCGGGCGGCA ATGTCATCGT CGGCTCGCTC
GCCGCCGAGT CCAACAAGAA CCTGAAGCTG CGCGATCTCG GCATCCTCAC GCTGGGCGGC
GGTGCCATCA ACATCTTCAC GGATCAGAGC GCGTTGGTGA ATTCGAGCCG CGTGCTCACC
ACCCAGGGCG GCGACGTGCT GATGTGGAGC TCCAACGGCG ACCTCGACGC CGGGCGCGGC
TCCAAGACCA CGCTGTCGGC GCCGGCGTTG CAGGTGGAGT TCGACCAGGA CGACTATCAA
ACGATCGATC TCGGCGGCTT CGTCACCGGC GCCGGTATCG GCACCTTGAA GGCCTCACGG
GTCGCCAAGA AAAGCTCGGT CTATCTGATG GCGCCGCGCG GCAAGATCGA TTTCGGCACC
GCCGGAGCCC GGTCTTCGGA CAACCTGGTG GTGATCGCTC CGGTTGTCGC CAATGCCAGC
AACGTCAGCG TGGCCGGTGC TACGACCGGA ATTCCGATGA TCTCGGTCCC CAATGTCGGC
GCGTTGACCG CGGGTTCGAA CGCCGCGGGC GCCGCCGCCA AATCCGCCGA AACGCCTTCC
GCTTCCGGCA ACCGGGATCG GGCGTCGATC TTCATCGTAG AGGTCGTCGG CTACGGCGGC
GGCGACGGGC AGAGCCGGTC TGCCCCGAGC GGCGCAGACA GCGAGAGCGA GCCGCAGGCG
CGCGGTACCG ACCAACAGCC TGATCGCAAA GACAGGCGGC AACAATAG
 
Protein sequence
MADREILIRS GRLRRALDAA LLGGVSLLAL MLGSNPGAAR GLTPGGSVGS TVAAQQAVAA 
AAQQASAAAT QAQAALARAA AALAAARKFQ ADAAAAARAS SSSVANGLAT GGLMPFGGTA
ADPSAGIRAG TTAWTLSGDL PVQTSNGSQV TVDINQTQAK AIFYWDTFNV GAKTTVNFNQ
RASDWIALNR VLDPSGAPSK ILGQINALGA VYLINRNGIV FGAGAQVNVH TLVASSLDIG
KLGTDLTARD QYFFNTGIGN LNSFSVLDPD ASVGAATKFL PGDVTVERGA SITANIRTDI
AALGSPGSIY LFGKNVSNSG LLTVPTGEVG MVAARIIDLI PNGYSALPTA VLGTDADGNA
LTFRGTEFML SQFGSAYDSS GYPTVQGGFN YLAGTGAVTH DGLIEASRGI VIMNGDKIAI
NNPGGGSLRD ASGTLIQGVI SVDTSIDRNS MVLLRAATSV TLNGVISSLP FDDGADPLPS
GSSSGSTVQS FTPAYIELSA KGEMTLGSSG LVSAPSAQVA VKTLGSGIGS LFAQGGTQGN
IASNGSNAGV SILLNPGATI DVAGLQNVEL PASYNFISFQ PRAEFADMPL QRDGVLYGKT
LWIDIRASGT RADGTSWVGT PLADASGYVN AVGRSIYQLM TVGGQVSLST GASAGTVQTT
GAVINVAGGS VTFLPGVVPV TRLLGIDGRI YNMTNADPNM TYVGFAGQFT VDHARWGVTE
TWSVGTQTYV SGYTEGRNAG GVAVTTLTPT LGAMYFGSVA GERQIAAGSL PLQGSLTLTT
TSDVQIGTAP SDNFTANPLY SVTLSADTLS SYGLSTLAIT AYDLVVSSGS TLSLAAGGSF
SVKAAGIVDI AGTVSAAGGA ISLETDYYGL HNVSGLFQSS PINNSKVAAN IFVEGTLDVS
GRFVNDTGRF GTDASGPAFI NGGTISISTN LNSSAIDSKQ VDTTGSILLA KTSVLDVSSG
GYISAQGKPK TVAGGGMAGK AGSVSLAIVQ SNSWRPRESG SPVPEFPESG TIAVLQLDGS
LRGYGFESNG GLRLVAFDTI RIGGTLQPGE TSSIRVNGVA TTLPVALLTG GGFGDYTIES
LADGWTGATA NIIVSAGVSL ALQQQNLSSV ADYSDTATGT KLGQQAQLAT LPDDQRKPVN
LTLRADNILL DTGSKIVTDP KSAIAFGGID GLLKVTDPLR NAAAKSVQFL GSIVDPSGAV
LVNALKTYLG PQAVVDLSGI FVANSRFGEP KGPSTGGSYL SGGSFTVEAA GLRTINVLID
TSDVTKHYLY YTYGLPGTGY LVADRGAIVD VSGAAGLVEV AGGHGTSTSV WSWSDAGTIS
ADVTGFAWGG SFVATGGRYL GADGIVHADL RANNGTFILG GGSILLRQDM TDVNAHLSAP
QSLYVSADQL APFDNIYLYS GAAVGGAARL FDELPAYNRL AGRNVVAGTY EVAGSTYNIP
WTFTGRPNMV FNPLTISGSL DWHVQTGLRI AAGTIASSTT TDNSTVTLDA PYLLLTGGNG
TAASGRNTLT ISGQAIDIFG AVSLSGFAQA NFVSSGDIRL GTRKVSNTIA TGTGASTAAS
SFTGSLDSKG DVLLKAQRIY PVSAVDFTIT TPGKVKFLAP AGSDAAIPLA AGGGMTVIAS
AIEQAGNLFA PLGKITLGNT DSTVSSIITQ SVTLEPGSLT SVTLADTVVP YGATLDGTNW
YYNANLDPMS QPPTKGLVLD GANVVRADGS TIDLRGGGDL QAIEWIQGNG GSRDTLNKTT
LGQTVYALVP SQSTAVAAFD IHFTTLRTLT LDTGKTIMVG DADPLVGTQI TIAGGNGIAA
GTYTLYPAHY ATLPGAMRVV YYGSNIGRNV ASGTTLPDGT VLVTGYTTQS TASATQSSGQ
SLFAVQTSAV WRKYSEYNLS SANSYYRALA TKNGSLVPRL PMDAGRLAVL AQQSILLAGV
ALTQPGQDSL GTTGRGGELD ISAPQLAVIG HAQYVNNDIA AGYVGLDLTQ VEGFESVLIG
GLRSDTTTGT LITARATNVL VDTQGETFTA PEILLVAKTA GEWQRLRQSV QVGSDPANTV
LIELPVYVPI AGTGVVTVKA GSVIETTGAV HNGYGRRFYF PSNSGTTPAQ QIATALGGTV
ASSGTAITGV DLAKLQAALA AQTSTGFNSL AYYAYGGGAM DPLGALFAAS NDPDLVLSGP
AGASVSALTL QFADVTKGKV TGLPAAVEGA VTGTVTLQGG DSGRVSIENG TRIATDTLTL
QATAATNAIN LETNDLHADQ VNLTAHSIAI GSNPPAASGK SLILAANSGR FGDVHGLTLR
ALTGGISVYG NFDAGAVMER LTLDAAVVAR GDSSGAAKVS AKRGTITLAN SGADASTGSA
STAGDLGLTL EAADIVLGGG GKQAILGYAR LNWLASDRVL VAGSGALTLG TATDKVDFTV
TTPTILVASA TVSGTKSFGI TTLGNVILAD TTPRSGAVDR PTDSAETNGV FAITAASISV
GNTIQAQAGT ITLDATAGDV TLQTGAYLAA GGYKKTLIDV DTYVAGGKVV LKADAGNVVT
ASGSVIDVAQ PQDGLGYAGE IDVTALNGGA TLHGTLRGSG GPGLGGRFYL DIKGAADLTR
LADTLLDGGI TGVIEIQTRT GNLELQAGHT LQANAVSLTA DDPTWDNTDT SKQLGQLTIA
GTINADGYSG YTADGLWQAG GQVGLFGHNR VTLKSTARIS ATTSHADERG GDVMIGTAWA
AGGNADPSQN STAGYIDLQA GSVIDVSGGT KGGLSGGTVT LRAPLDGAND VKIAGLDSTI
RGARAVYIQG FITINTEATP NNVSGIDGSK LTTKDGTAVT WDGYIDPAGA VTSSGAAIDF
GVWTGVSGVK LTYSGGSGYT SVPNISIGTL NGSLVGNYYQ VIDPVTGAIY RSSLRIASVQ
VTNGGSYTTA PTVVVAAPNE GSSATFNLQM RYTNNTLTLS TSSAVPYTNV AFYSSAGVMM
ARGKIAPDAT TPGRYTVTIT TYPSNVAITN LNPSSIKLCS SLACLTGTFV DITNTSSVVS
GSLYVYAIVP VDGGSGYVSS SQASLTLQGG VGTSLATASV SMGAAVTILG VSKGYTAAAI
APKIVADATS CPTATCTATT VALATTTTTG LTGATKPQGT SNVFVPGSNY LPLATSKVFA
GTADSAVLTI SPYADHQLLY TDVLANFVEG NGIQGVGLTG SYGFSNLFAR LQNGLVGQLG
ASIVHVQPGI ELVNTSTSKN SGDITVASNW NLAAVSAVGN LKTVTTPGSV TYNYFDPATG
YVNFIYRLAT PWGGLDAGAL TLRAVRNVNV NASISDGFFQ AGNYSDADYV NWLYTYVQST
RMINAISSLQ HGSLNYYYLN KYSSGAVPIA PYKDAANSVS PTAQDLAGAD LFPNSLNVCT
VDCSAANIKQ VTDPSSWSYS LTAGADVASA NPTAMISLAN AGGKGDVIID NHTNYSQTEF
YTPAGNNYTS KNVDVSLATM VRTGTGNIAI TAAQDVILRD KVAPGVIYAA GVNSAKLADA
NYSGTSSVVA GNAEGFYEPK VLAYGYSEGL LYYGPPTAAA FPEQGGDVTV EAQRDIIGYS
GSGNKTLQYY QPWLLSDAGV SPATTQASGL SISLVGQGVF APLGSQIASQ TAWWIQYGSF
QQGILSAGGN VTVTAGRDLI DVSVSLPTTG RVSGGLSATS TPVTHLYGSG NMLVRAGRDI
LGGAFYEGSG HASIIAGGAV GQNGTMTSKL KLADVPLLAV DTGQIAMTAG GALTIAGVIN
PAELHLQTPS WANPLEVSWQ LSGYLHIDSY GPDSKVRLVA ATGDLTIDYS TQNLYPASFE
ALALKGSLIT SGKTGFGIVL SPSEHGSFLL LAQGDVDLTF GSSGGIPISA GAALLETAFD
PFQPNNGYDG AFSKGTLAQQ DYSSAQIARI YAVTGDITAT GGYVEANPNA GIRISSYKRI
EINRPTKIYA GGDIVDLNLV VQNIYQTDVS TIEAGGSIYY TGTNNGGGLQ VAGPGFFVVQ
AGGDIGPFLP AAYDLASTAR VQEGITSVGN ATMTAVGNSG FVGIYNASLL GPYENPRRNA
LLTEAAGTAQ GADIVVLFGA KYGADYQAVV DTYINPANAA KVAYNYLGEL RDFLARVGIA
TTSTQDAWNK FTNADKLAVP PVSQDLRQIF VDKVFFAELK AVGISEQAGV TQHQRGYEVI
NTLFPASLGY TANALGDGIS GTSERVWTGD LNLLHSTIQT KLGGDISIFG PGGNVIVGSL
AAESNKNLKL RDLGILTLGG GAINIFTDQS ALVNSSRVLT TQGGDVLMWS SNGDLDAGRG
SKTTLSAPAL QVEFDQDDYQ TIDLGGFVTG AGIGTLKASR VAKKSSVYLM APRGKIDFGT
AGARSSDNLV VIAPVVANAS NVSVAGATTG IPMISVPNVG ALTAGSNAAG AAAKSAETPS
ASGNRDRASI FIVEVVGYGG GDGQSRSAPS GADSESEPQA RGTDQQPDRK DRRQQ
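
Both sequences are displayed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a simple GC fraction; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the last row of each sequence is used here as sample input.

```python
def join_blocks(lines):
    """Remove the display spacing and line breaks from blocked sequence rows."""
    return "".join("".join(line.split()) for line in lines)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Last row of the gene sequence and of the protein sequence, as displayed.
gene_tail = join_blocks(["CGCGGTACCG ACCAACAGCC TGATCGCAAA GACAGGCGGC AACAATAG"])
protein_tail = join_blocks(
    ["ASGNRDRASI FIVEVVGYGG GDGQSRSAPS GADSESEPQA RGTDQQPDRK DRRQQ"]
)

if __name__ == "__main__":
    print(len(gene_tail), gene_tail[-3:])  # row length and final codon
    print(len(protein_tail))
```

Applying `join_blocks` to all rows of the gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content listed in the Gene Information table.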