Gene RPB_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3126 
Symbol 
ID3910927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3560155 
End bp3573198 
Gene Length13044 bp 
Protein Length4347 aa 
Translation table11 
GC content67% 
IMG OID637885028 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_486733 
Protein GI86750237 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.588086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGGCA TCCGATCGGT GCTCCGGGCT GCCGGAGGTT CCGGTCGGTC TGCAATGACT 
ACTCGCCGAA AGAGCAAACT GATGTCCGTT CATGCCGGTT CGACGCCTTC TTCGTTCTCC
CGTTGGTCTT TCTCCCATCG ATATCGTCGT CCGGCATTGT TGGCGGGAGC GAGTGCGCTG
GCTTTGATGC TAGCCATGCC TGCTCACGCA CGGTCGTTGA ACGGCGCCGC CTCCACAGTG
TCGGCGCCGA ACATCGCCTC GGACGCCGCG GCGCAAGCGG CACAGCAAGC CGCGGCCGCT
GCGCGGCAGA CCCAGGACTC GCTGACGCGC GCCGCACGTG CCGTGCAGGA CATGCAGGTT
GTGCAGGCGG CGGCGCGCGC CGCGGCGGCC GCTCGGCAGA CATCGGCGAC GTCGCCCATC
AGTGTGCCGA ACGGACTCGC GGTCGGCGGC CTCGATCCGA ATCTGGCGGC GGGCTGGAGC
GGCGCCAATG TGCCGACCGA GGGCGTCAAT GCGTCGGGCC AGACCCAGGT CGGCATTCGC
CAGACCTCGG CGCAGGCGAT CTTGAACTGG AACACGTTCA ACGTCGGCGC CAAGACGACG
CTGACCTTCG ACCAGCAGGG CAACAGTTCG TGGGTCGCGC TCAACCGCGT CACGGCCGCC
ACCGCGCCGA GTCAGATCCT CGGCCAGATC AAGGCCGATG GCCAAGTCTA TGTCATCAAC
CAGAGCGGCA TCATCTTCGG CGGCAACAGC CAGATCAATG TCGGATCGCT GATCGCCTCG
ACCGCCAATA TCACCGACAC CCAGTTCAGC GCCAACGGCA TCTATTCGAC GCAATCCGGC
AGCAGTTACA CGCCGAGCTT CACTGCGGCT GGCGGCAAGG TCGTGGTCGA AGCCGGAGCC
TCGATCTCCA CGTCTGCGCC AGCGTCGGTG ACCTCGGGCG GCGGCTTCGT GCTGATGATC
GGCAGCGAGG TCAGCAATGC CGGAAGTATC TCCACGCCGA AGGGCCAGAC GATGCTGGCG
GCGGGCGATA ACTTCATTTT GCGCAAGGGC TACGGCACCG ACGCCAACCA GTTCTCGACC
ACCAATGGCA ACGAAATCGC GACGGTGATC GCTGCCGGCA GCTCCGCGGG CCGCGTGACC
AATAGCGGCA TCGTGCTGTC GCAGCAGGGC GATATCACGC TCGCCGGCCG GACCGTGACG
CAGGACGGGG TGTTGCTCTC CACGACCTCG GTCAACCAGC GCGGCACCAT CCATCTGTTG
AACGCGGCCA GCGACGCCAG CGGCAGCGTC ACGATGACCG GCAACAGCGT CAGCGCGATC
CTGCCCGAGC TGGATTCGGA CGACACGGCC TTGAACTCAC AGCGCGATGC GCTGATCGCC
GCGTCCGGGT TGAATCCGCT GGCCGCGCAG TTCAACAATC TGTCGCCGTT GACCGACCGT
CGCGACCAGT CGCGGATCGA GATCGTCACC GGCGGTCTCG TCAATTTCCA GAACGGTTCC
TACACCGCGG CGCAAGGCGG CCAGATTGCG GTCAGCGCCG GCACGCGCGT GTTCGCCGAG
ACCGGCGCAA CGCTCGACGT TTCGGGTGTG CGGGACGTAT TGCTGCCGAT GTCGGCCAAC
CGGGTGGAGG TCAATGTCCA GGGCAATGAA CTCAGGGATT CCCCGGTCAA TCGCGACAAC
GCCGCGCTGA TCGGCAAGAA CGTCTGGATC GATGTCCGCG ATCTCGTTCT GGTGGCGGCG
GGCACCGGCG GCTATGCCTC CGACCGATAC TACACCAGTG GCGGATTGCT CGAAATCAGC
GGCTATCTCG CCAATACCGG CCACACCATC GGCGAATGGG CGGCTGTGGG CGGCACCATC
ACGCTGTCGG CGCCTCAAGT CATCGCGCAG CAGGGCGCAA AGTTCGATAT CTCCGGTGGT
TCGGTCAGCT ATCAAGGCGG CTGGATCTAT TCGACCGTGC TGATCGGGAG CGACGGTCGC
AGGTATACGG TCGACACTGC CCCGGCCGAT ATGACTTTCA TTGCGGCCGG CGGCAGCTTC
GTGCGTACGC ATATTATTCA AGGCGAGGTG GCCGAGTCGC TGACTGAAGT GTGGGCCAGC
CCCGCCGGTC GCGACATCAT AGCGAGCTAT GAGGCCGGCT ACACCGTCGG CCGCGACGCC
GGCCGGCTCA ATCTGTCGAC GCCGACGGCG ATCTTCGAGG CGGATATCAT CGCCGACATC
ATCACCGGCG AACGCCAGGC CCGCGCGCGC GCCGAGGGCG TCACCGACGG CTACAAGCAG
GTGCAGAATG CCGCGCCGCT GGAAGGCACG CTCGGCCTCG CCCGTTATGA CGCCACGGCA
AATCTGGTTG CCGTCTACGG CTCGGATGTC CGCTTCAGCG ATGTCGCCGA CATCACCACG
GGCTTGTCCG CCACCGCCGT GCTGCCGTCG GTCCGCACCA ACACCGCCTG GATCGACGCC
GACCGAATCA ATGAAGCCCA TCTCGGCGGG CTCGATATCG GCACCTCCGG CACCATCACG
ATCGATCGCG CGATCACGCT CGCCGATGGC GGCAAGATCA ATCTCAACGC GGCGGTGGTC
GACATCAAGG CCGATGTCAC CGCGCGGAGC GGCTCGATCG TGGTCGACAA CGTGCTGGCA
GGCGCCGCTT CGGGCGGCCG CGGCGCCTAT GCGGTGCTGC TGAAGAATGG CCTCTCCTCG
ATCACGCTCT ATGACGGCGC GACGCTCGAT CTGCGCGGGC TTTGGGTCAA CGCGGCGCAG
GCGGATTCGG ACGATCCGAA GCAGGCCTTC ATCGACGGCG GTTCGGTCAC GCTGCGTTCG
ACACATGACG TGACGTTGCA GGAGGGCAGC GTCATCGACG TCTCGTCGGG CGCGGCGATC
CTCGCCACCG GCAAGACCAA AGGCGGCCGC GGCGGCGATG TGACGCTGAT TGCGGACCAG
CAGAATTCCA CCGTGACGGC CAACGGCCTG TTGACCCTCG ACGGCACGAT CCGCGCCTAT
GGCGTCAGCG GCGGCGGCAC GCTCAAGCTC GAATCCGGCA CGGCCATCGC CATCGGCGGC
AAGGTGCTGG CGACCGACGG CGTGCTCGGC GCCGGCGAGG CGGCGCCGGC CGATCTGGTG
CTGCTGGAGG ACTATCAGGT CAAGGCCGGC GAGGTGCTGC CGGCGAGCTA CTCCTATCAG
GCCACCTGGT TCGAGCCCGG CGATACATTG ACCGCGGATG CGCGCATCCA ATACGTGACG
CTGGCGGCGG ACTGGACGCC ACCGGCGCCA AATCTAACCT TTTACAGTTA CACAATCAAA
ACGGGCGCGG ACCAAAAAGG CCCATCTTAC AACATCTACG GTTCCAACGC CGTGACTCTC
CCGGCGGGGA CGGTCATCAG CATCCATGCA AACTCTATGC AATATTTGGC GGGCTACACC
TTGCCGGCTG CGGTGTTCCC GAACGGCCTG CCGTCGGTCG CCCCGTTCAC CAAGACAGCG
GCGGCCGGGA CGCTGGCGCC GTCGGACGGC ACGATCGCGG CCGGAACGCT GATCAATGCG
GGCTCGGTGC TCCAGCGTGC GGCGGCCGTC AAAGAGATCT TGCAGGTCGA CGCGTCGCTG
TTGCAGTCGG GGTTCTCCAG CTATGATATC AATGGCCGGC AAGGTGTCGT GGTCGCGGCC
GGCGCCCAGC TCGATGTCGC GATGCCGGTC TATCGCTTGA CGGACGCGGC CTTCTCGGTC
GCCACCGGCG AGGATCCCTC TCGCGCGCTC AGCGTCTGGA CGCCGCCCGA ATGGACCGAA
GACGCCGGCA AAAGCAGCCT CACCCAGCGC GGCGGCGCCA GCCTGACCTT GCGCTCCAAT
GTCGGCGAGG GGACCCTGAC GACGGCGAGC GGCCCGATCA GCATTCAGAC CGGCGCGGTG
ATCCGCGTCG ATACCGGCCA GTCGATCAGC CTGCAGGCCA AGGATTTCAC CATCGACGGC
ACGCTGACCG CGCCGGGCGG CACCATCAGC CTGACCCAGG CGTCGGCGAG CCTCAATCAG
GGCATCGGCG ACAACAAACC CGGGCTGGTC TGGATCGGCG ATGACGCCGT GCTCGATGTC
GCGGCCCGCG CGGTCACGGC GACCAATGCG CGCGGCGAGA CCTATGGCGT GGTCGGCAAT
GGCGGCTCGA TCCTGATCGG CGGCGCGATG GATTGGGAGA CGACCGGAGA ATCCTCCACG
TTGAATGCCT TCGTGGTGAT CCGGCCCGGC GCGCTGCTCG ATGCCTCCGG CACCAGCGCG
GTGCTCGACA TCTCAGGAAC CGGGCTGGCG GGAACCAGCG CGCCGCTCGA GGTTGCCAGC
AATGGCGGCA GCATCGTTAT CAAATCGAGC AACGGGATCT ATCTCGACGG CACGCTGCGC
GCGGCAGCCG GCGGCGCGAA CGCCGCCGGC GGGACGCTGG CGCTGGCGCT GGAGGCGCCG
AACTATTTGC GCTCGTCGAC ATCCGGCGAT GTGTTGCGGC ATCGCGAATT GGTCATCGCC
GACATCCAGG GCGACAGCGC CATCGCCGAT GCCGACTCGA TGGCGGAAGC CAAAGCGGCC
CTGGTGACCG GCACCGCGCG GCTCGGCGTC GATCGCATCA AGGCCGGCGG CTTCGGCACG
CTGTCGCTGC TGTCGGATGG TCTGATTTCG TTCGATGGCA GCGTCACGCT GGCCATGAGC
CAGAGCCTGA GCCTCTATGC CGGCGCCTTC GCGCTTGGCG CCAATGCCGC CGCGGATTCG
CGGGTATCGC TGTCGGGGCC TTATGTGCGG CTGGCCGGCG TGACCCGCAT CGCCAAAGAT
CTGTACACGT TGCCTGCCGT GCGGTGGGAC GAGGGCTATG GCACACCGTC GCAACAGTCG
AGCAGCGCGG TCTTCTCCGT CGCCGCCGAT CTGCTCGATA TCCGCGACCG CGTGGTGTTC
GGCATCCACC AGTCCATCAA CACCCAGGCC GAGACCTACA CACTCGATCG CCGCGGCTTC
GCGCTGGTCG ATCTTCTGAG CCGCGGCGAT GTCCGCATGC TCGGTGGAAC CGGGACGAAG
GGTGCCCAGC TGCAGACGCC GGGCAACGTC ACCGTGACGG CCGCGCAGAT CTATCCGGCG
ACCAAGACCA CCGGGTCGAT CACCGCCGGC TACATCCCCG GCAGCGACGC TCTGCTTCTC
GCCGGCAGCG TGCTGAATAT CCTGCGCTAC GGCGACACCG ATCCCGACGT GCCCTATTCC
GCCTTCGGCG CGCTCACCCT GGCCGCCGAT ACGATCAACC AGGGCGGCGT GGTGCGCGCG
CCGTTCGGGC GAATCGTGCT CGGCAGCCTC TCGAGCGGGA GCGTCCCACA GGCGGATACT
GTCCATCTGC TGGCGGGCAG CATCACCTCG GTCAGCGGCG TCGGCCTGGT GATGCCCTAT
GGCGGCACGG CCGACGGCAC CAGCTACACC TATAACGGCG CGACGGTCGC CGCGACCAGT
CCGACCATCA CCCTCAACGG CCTGCATATC GAGGCCGATG CCGGTTCACT GATCGACCTG
TCGGGCGGCG GCGAACTGAC CGGCGCCGGC TTCGTCTCGG GGCGTGGAGG CTCGGTGAAC
ATCCTGACCA CGCCGCTGGT CAATGCCAAT CCGGGCTACA GCTACAGCGC CAAGGGCAAT
CAGGTCTACG CGATCGTGCC GAGCAGCTCC GTCGCCTATG CGCCGGTGGT GCAGGAAGCG
GGCTATGGCC TGCCGGCGGT GGGCCGCCAG ATCACCATTC CCGAGGGCGT GCCGGGTCTT
GCCGCCGGCA CCTACACGCT GATGCCCGCG ACCTATGCGC TGCTGCCAGG CGCCTATCGG
GTGGAACTGG GCAGTACCGT CAGCCCCGCC GTCACCGGCG TCGCTGCGAC CGGCGGCGGC
TCGTACATCG TCAGCGCCTA TCTCGGCGTC GCCAATACGG CGATCCGCGC GTCGCTGCCG
AACCGGGTCA TCATCACGGC GGCGGATCAG GTCCGCAAGC ATTCGTCCTT CAACGAGACC
ACCTATAATG CCTACGTGCT GGCTAACGCC GAGACCAACA GCGTCGCGCG CGGCTGGATG
ACGGTCGATG CCGGCGGGCT GTACATGCAG CTCGCCAAGG CGCGCGTCGC CGATGATCGG
ATGCAATTGA TGTTCGATGG CGCGCTGCGC ATCCGGGCCG AGGCCGGCAG CGACGGCTAC
AGCGGCGCAG TCTCGATCAC CGGCATCAGC GAGATCCTCG CGACCGGGCA GGGCGCCACC
GCCGGCATGG TGGCGGCCTC GGTCTCCGCC GACGAACTCA GCAAGCTCGA CGCGGCGCGT
CTGATCCTGA ACGCCGGCTA TTTCGGCGAT ATCGTGCTGC GCAGCGGCGC TCGACTGTCG
GCGGCTGAGA TCGTCTTCTA CTCGAGATCA AGGTCGTGGG CACAGGAGAG GGGCGCCATC
ACGATCGAGG AGGGCGCGAC GATCAGCACC ATCGGTCGCG GATCGACTGG CACCGACACG
AGTGCTCCCT ACCTTGTGGA TTCCGGCATG CTGATCGTGT CCAATGGCGT GATCACCTTG
CTGCAGGGCG AGACAACGGC GGCCGACGTC GATATCACGA TCGGCGCCTG CGTCACCGCT
GGCTGCAACC TGACCACGAC GATCGCCTCC GAAGGCACCA TCGGCCTGAT GACGTCGGGC
AGCGTCAGCT TCGCCGACAA CGTCTCATAC GGCACCCGCA ATCTCGTGCT GGGATTGTCG
GCGATCAATC TCGGCTCCGA TGCCAGCATC GCTGCAGCTT CGGCCGCCGG CCGGCTGCCG
ACCGGGCTGA CCCTCAACCA GGCCGTGCTG GCGCGATTGC TGGCGGGCAA CACCGCGATC
GGCGCGCCGG CGCTGGAGAC GCTGGCGCTC AGTGCGCGTG ACGCGGTCAA CGTGTTCGGC
GACGTCACGC TCGACGCATC GACATTGGAG CGTCTGGTGC TGGCGACGCC GGCGATCTAC
GGCTACGGCG CGGCCGGCGA CACCGCCACG ATCCGGGCCG GCGAATTCGT CTGGACCGGC
TCGACCCTGG CGCCGGGTGC GGCGATGGCC GATAGGCTCG GCGACAGCAC GCTCGACATC
GTGGCGAACC GGGTCGTGCT CGGCTACGGC CTCAACACCC AACCCAGCAC CACGACAGTC
GACAACCGCA TCGCGCTCGG CTTCGCCAAT GTCAACATTA CGGCCACCGA CTATGTGACC
TCGACCAACG AGAGCACGCT CGGCGTCTAT GCCCGGCAGG GTGCCTATGA CGCGACGACC
GGCTATCAAT ACAGCGGCGG CAACCTCACC ATCACGGCGC CGCTGTTCAC CGGCGCGGCC
GGCTCGGTCA ACACTATCAC GGCCGGCGGC GATATCCGCG TCGTCGGATC GGGCGGCACG
GCGGGGAGCG TCGATGAACT CGGCGCGACG CTGAAGCTGA CCGGCGCCAC CATCACCGTC
GATACCAGCG TGGTGCTGCC GTCCGGGCGG CTGGAGCTGA CCGCGGCGGG CGATATCGTG
CTCGGCGACA ATTCTCGGCT CGATCTGTCG GGCCGCGCCG TGGTGTTCTT CGATGTCACC
AAATACAGCT GGGGCGGCGA TCTGGTCATG ACCAGCACCG CCGGCAACAT CAGCCAGGCG
GCGGGATCGG TGATCGATCT GTCGGCGCAG TACAACAGCG GTGGAACCGC GACGGTCACG
GCACTCGGCG CGAGTGCCGG CCATGTCGAT CTCGCCGGCA CGTTCCGCGG CGGCGCGACG
GGCACTTACG ACGCCGGCGG CACCTATGTG CCCTACGATG CGGCCGAGCT CAGCGTGTAT
GCGCAGACGC TCGCCGATTT CGCGGGGCTG AATGCCCGGC TCAACAGCAG CGAGCTGTTC
GGCGCCCGCC GCTTCCAGAT CAAGCAGGGC AGCCTCGTCG TCGGCAACGA GGTCAAGGCG
CGCGAGGTCG AAATCACGCT CGACGGAGGC AGCCTGACCG TCAACGGCAC CATCGATGCC
AGCGGCTATC AGGTCGGCAC GATCCGGCTG GCGGCGATGG GCGATCTGAC CATCAACGGC
ACGCTCGATG CGCACGGCAC CGGGATGCGC TTCGACAGCT ATGGCAAGAT CATCGAATCG
CCGAACCGCG CCATTGTCGA TCTAACGACG CGCATGGGCA CGCTGACGCT GACCGGCAAC
GCGGCCGTCG ACCTGCGCGC CGGCACCAAT GTGATGTTCG GTTCAGGCGA ATATCTCAAT
GACGGCGTGG CGCGCGGCAC CCTGACGCTC AACGCACCAC GCCTCGGCGG CTCGGGCGTT
GCCGCCGGCA CGCGCGGCAA CGACGGCGCC AATGACGTCG CCGTCAATGT GCAGGGCACG
CCGCTGATCC GGGGCGCCAA GACCATCGCG GTCAATGCGT TCCGCATCTA TGACGACGCG
CCGCTGGCCG CTGCGCCCGA TGTCACCGGC TACAAGCCGC AGGAGATCAC GCAGAGCTAT
CTCGCTGATC TCGACAACGA CAGCGTCGCC TTCATCAATG CCGCACTCGG CAATGCCTCG
CTGAGCGCCA GGCTCGCAGG TCTCGGCAGC TATCACCTGC GGCCCGGCGT CGATATCGTC
AGCAAGGTCA GCGCCGACAA TCCGAACGGC GACCTGACGG TGGCCGGCGA TCTCGATCTG
TCCGGCTATC GCTACGGGCC GAATGCCGAT CGCAACGTGA ACTCGGCGAC CTACGGCTTC
GGCGAGCCCG GCGCCTACAA CATCCGCGCC GTCGGCAATC TCAATATCCA CGGCAGCATC
AATGACGGCT TTGCGCCGCC GCCGTCGACG CCCGACGACG TCAAGGGCTG GCTGCTCGAG
GAGGGCGTGG TGCCCTATGG CGGCGATCTG GTGCTGGCCA CCGCGGTCAC GCTCGAGACC
GGAACGGTGT TCAAAAAGGG CGTCACGCTG AATTACGACC TTCCGGCCTC TTTCGGCACC
CTGCCGTCCG GCACGGTGCT GCCGGTGCGC GCCACCCTGG CAAGTTCGCT CGCGCTGTCG
GCGGGCACCG TGGTGCAGGC GACCATCTAC AATGCGGACG GCTCGGTGGC CTACGCGGCC
GGCATGGTCC TGCCGAACGC GGTGACGCTC ACCGCCGGCA TGCAGCTCGG CGCCGGCACG
GTCCTGAAGA GCGCCGCCAG CTTTGCGGCG CTGGTCTGGC CGAAGGGCGT GCCGCTGCCG
GTGGACATGA CCTCCACGGC ACAGATGACG CTCGCGGCCG GCTCGCTGAT CCCGGCCCAG
ACCAACGTCA AGCTCGTCGG CGGCACTGCG GTGGATCTGC GCGGGACGAC CGGCGGCATC
CAGGGCCGCA ACTGGGCGCT CGCCTCGATG CTCGCCGCCG GCGCGAAATC GTCGGACCTG
ACGCTGGTGG CCGGCGCCGA TCTCGGCTCG GCCAATGTGC GGGCGCGCAA CGCGCTGGGC
AAGGGCGACA TCATTCTGGC CGACACCCAC TATGTCTCGC GGTACGCCTC GGCTGGCGAT
GTCCTCAATT TGAGCCGGGC GGGAGCGGAG GCTCTCTGCG CGGCAGTCTG TAACGACTAC
GGTTATACGG TGGACGACTT TATCGGAAAG TCGGAGGCCG AAATCTCTGC GCTCTTGTGG
GGTACATGGG AGGAAATCGT TTACTACTAC GGGATGCCGG CCAATTTCTG GGATCCTGCG
CAGGGCAATA TCGAACTGGG CCTGACCCAG AAGGGACTCG ATGCCCTGAT GGTCGTTCTG
GGCGGTTACC TTCCCGAGGG TATAACCAAT CCGTCCGCAC TGCTCAACAA GACGTGGGTT
CAGATCGCCA CGATCTACGG CGATCCCAAC TACACCATGT TTGATTTCGG CTTGCCCGAC
GATTTCGCGG ATCCTGCTCA AAATTACCTC GGCACCATCA GCCTGCCGGC GACGACCACG
GTGGTGGCGC CCTCGTTCAG CGTGGTCCGC ACCGGCACCG GCGACCTGTC GCTGGTCGCC
GCCGGCAGCA TCAAGATGCA GTCGCTCTAC GGCGTCTACA CCGCCGGCAC GGCGAACGCG
GTCGATCCGA GCTACAATCT CGCCCGCAGT GTCAATGTCG ATGGCACGCT GCTCGGCAGC
ACCAATGCCG ATTACGCGAC CGCCGCGCTG GCATCCTATC AGGCGTGGTA CCCGGATCAC
GGCGGCAATG TCCTGATCGC GGCGGGCGGC GATCTGATCG GCGACATCTA CGGCCAGTCC
GCCCGGCAAA GCCCATCGAG CGTGCTGACC GGCAACTGGC TGTGGCGCCA GGGCAGCGGC
ACCGCGGCGG TCGACCAGAC GATCGCGACC TCGTGGTGGA TCAATTTCGG CACCTACGTC
GAGAATACAA GCTATAGCAA TCCGTCCGAC ATCCCCGATC TCATCGGCTT CACCGGCATC
GGCGCGCTGG GCGGCGGCAA TATCACCATC CGGGTCGGCG GCGATGCCGG CGCGATCACG
CAGCGCAGCA GCGCGGGCGG CCAGACCAGC GGCGTGGATC GAAGCCAGGG CCTGGTGGTT
GCCGTGGGCA GCACCGGACG CGTCGGCGCC GACGGATCGC TCACCTTGAC CGGCGGCGGC
GACATCGACA TGCGGATCGC CGGCGCGCTC AATCCGAATC TCGCAATCAC CACCTCGAAT
GACAAGCAGG CCCTCGGCGG CTCGCTGATC AACCTGCGCG GCACGCTCGC CGTGACCGCG
GCGTCGATCG GCGGCATCGA GCTGCTCTAC GGATTCAGGG ACAGCTACGA TACCCGCGGC
GCCGATCCGT ATGAGGCGAC GCGGTCCGAA GCGCGATCGG GGATCACGGT GGTCCCCGGC
GATTCCGCGG TCTATCTGCA AACCGCGGGC GATCTCGTGC TCGGCGGCGT CGGCGATGCC
GGCCGGTCGG CGACGCCGAG CAGTTCCGCC TTCTCGGTCG ATGGCGTCGA CTACGCCAGC
GGCGGCGGCA GCTGGTTCAC GCTGTGGACC GATCACACCG CGATCAACCT GATCTCGGCG
GGCGGCAACC TCACGCCCAC CACCTCGACG ACGGAGACGT GGAGCATCGG CTCTTCGAAT
TACAACAGCG ACCAGAACGC CGCCGACGGA TGGTTCACCT ATCCATCGAT CCTGCGGGCG
GCGGCGCTCG GCGGCAGCAT CTATTACGGC ATCGATGCGC TGTCCTTCAT GCGCTACGAT
GCTTCGGCCG TCTATCCCAC CATCACCCTG GCGCCCTCGG CCACCGGCTC GCTGGAGATG
CTGGCCGCCG ATTCGATCTA TGCCGGGCAC TATGCGTTCA GCCTGTCCGG CACCGGCACG
GCATTGCCGA CCCCGTTCAA TCCGGCCTTC GCAGGCTACC CATCGGACGG TTCAATCGAT
GTCATCGTGA CCAACGCATC GCCGAACGGC AGCCACAGCT CGGGCAGCAG CAACCCGATC
AACGGGTCGC TGTTCGTGTT CGGCCCGAAC GATGCGAAAG TCGCGCTCGA CCGCGCTGAT
GATGCCGATC CCGTCCGCTT CTACGCCCGC GAAGGCGACA TCGTCGGCCT CATGACCGGT
GAGACGGTGA CGTTTGGGAG CGGCACGACC TGGCTCAATG CATCGGCTCC GGTCGTCGTG
CGCGCCGGCC GCGACATCGT CGCGGCCGGC CTTGCGCCCG GCGTCACGGT GTATGGCACC
GCCCTCTACG GCTATTCGGA CGGCAACCTG ATCGTGCACA GCGATGCCGA CGACGTCTCG
ATCTTTTCCG CCGGACGCGA CATCCTCTAC GCCAATATCG ACATCGCCGG CCCCGGCGCG
CTCGAAATCT CGGCGGGGCG CAACCTGTAC CAGGCCGACA AGGGCGTGAT CACCAGTCTC
GGCGCGATCG CGTCGGGCGA TACCCGCCCG GGCGCCAGCA TCGCGCTGCT CGCCGGCGTC
GGCGAGGCGG GACCGGATTA CGAGAACCTG GCGGCGCTCT ATCTCGATCC GGCCCGGCTG
GCGGTGGCAG GGGTGCCGCT GGAGGACCAG TTCGGCAAGG TCGCCAAGAC CTATGAGAAG
GAGCTCGCGG CCTGGCTGAA GGAGCGCTAC GGCTTCACCG GCACCGACGC GGAGGCGCTG
GCCTATTTCG GCACCCTGGC GCCCGAGCGG CAGCGCATTT TCCTGCGCCA GGTCTACTTC
GCCGAACTCA CTGCCGGCGG CCGTGAATAC AACGACAGCA CCAGCTCGCG TTACGGCAGC
TATCTGCGCG GACGCAACGT GATCGCCGCG CTGTTCCCGG ACCAGGATGC GAATGGACGG
CCTGTCGTGC GGGCGGGCGA CATCACCCTG TATGGCGCCT CCGGCGTCCG CACCCAGAAG
GGCGGCGACA TCCAGACGCT CACACCGGGC GGCCGCACCA TCATCGGCAT TGAGGGACAA
GTGCCGCCGG CATCGGCGGG TCTGGTCACG CAGGGCCAAG GCGATATCCA GCTTTACAGC
AAGGGCAGCA TCCTGCTCGG CCTGTCGCGC ATCATGACCA CCTTCGGCGG TGACATCATT
GCATGGTCTG CGGAAGGCGA CATCAACGCC GGCCGCGGCT CCAAGACCAC CGTGGTCTAC
ACCCCGCCGC GGCGCGTCTA CGACAACTAC GGCAATGTCA GCCTGTCGTC GCAGGTGCCG
TCTTCCGGCG CCGGCATCGC GACGCTCAAT CCGATCCCGG AAGTGCCAGC CGGCGACGTC
GACCTGATCG CGCCGCTCGG CACCATCGAC GCCGGCGAGG CGGGTATCCG CGTCTCCGGA
AACGTCAACC TGGCGGCGCT GCAGATACTC AATGCCGCAA ACATCCAGGT GCAGGGCAGC
TCCACGGGCA TTCCGACCGT GCAGGCGCCG AACATGAGCG CGGGCCTCGC GGCATCCAAC
GCGACCGCCG CCACCCAGCA GACGGCCGCG CCGAACACTG GCGCCGGGAA CGACCGGCCG
TCCGTCATCA TCGTCGAATT TCTCGGCTTT GGCGGCGGCG ATGGCGAACC TGCGCAGGAG
AACAAGCGTC GCAAGGATAG CGAGCGACAG ACCTACAATC AGAACAGCGC GGTGCAATTC
GTCGAGTTCG GCGCCAAACC ATAG
 
Protein sequence
MRGIRSVLRA AGGSGRSAMT TRRKSKLMSV HAGSTPSSFS RWSFSHRYRR PALLAGASAL 
ALMLAMPAHA RSLNGAASTV SAPNIASDAA AQAAQQAAAA ARQTQDSLTR AARAVQDMQV
VQAAARAAAA ARQTSATSPI SVPNGLAVGG LDPNLAAGWS GANVPTEGVN ASGQTQVGIR
QTSAQAILNW NTFNVGAKTT LTFDQQGNSS WVALNRVTAA TAPSQILGQI KADGQVYVIN
QSGIIFGGNS QINVGSLIAS TANITDTQFS ANGIYSTQSG SSYTPSFTAA GGKVVVEAGA
SISTSAPASV TSGGGFVLMI GSEVSNAGSI STPKGQTMLA AGDNFILRKG YGTDANQFST
TNGNEIATVI AAGSSAGRVT NSGIVLSQQG DITLAGRTVT QDGVLLSTTS VNQRGTIHLL
NAASDASGSV TMTGNSVSAI LPELDSDDTA LNSQRDALIA ASGLNPLAAQ FNNLSPLTDR
RDQSRIEIVT GGLVNFQNGS YTAAQGGQIA VSAGTRVFAE TGATLDVSGV RDVLLPMSAN
RVEVNVQGNE LRDSPVNRDN AALIGKNVWI DVRDLVLVAA GTGGYASDRY YTSGGLLEIS
GYLANTGHTI GEWAAVGGTI TLSAPQVIAQ QGAKFDISGG SVSYQGGWIY STVLIGSDGR
RYTVDTAPAD MTFIAAGGSF VRTHIIQGEV AESLTEVWAS PAGRDIIASY EAGYTVGRDA
GRLNLSTPTA IFEADIIADI ITGERQARAR AEGVTDGYKQ VQNAAPLEGT LGLARYDATA
NLVAVYGSDV RFSDVADITT GLSATAVLPS VRTNTAWIDA DRINEAHLGG LDIGTSGTIT
IDRAITLADG GKINLNAAVV DIKADVTARS GSIVVDNVLA GAASGGRGAY AVLLKNGLSS
ITLYDGATLD LRGLWVNAAQ ADSDDPKQAF IDGGSVTLRS THDVTLQEGS VIDVSSGAAI
LATGKTKGGR GGDVTLIADQ QNSTVTANGL LTLDGTIRAY GVSGGGTLKL ESGTAIAIGG
KVLATDGVLG AGEAAPADLV LLEDYQVKAG EVLPASYSYQ ATWFEPGDTL TADARIQYVT
LAADWTPPAP NLTFYSYTIK TGADQKGPSY NIYGSNAVTL PAGTVISIHA NSMQYLAGYT
LPAAVFPNGL PSVAPFTKTA AAGTLAPSDG TIAAGTLINA GSVLQRAAAV KEILQVDASL
LQSGFSSYDI NGRQGVVVAA GAQLDVAMPV YRLTDAAFSV ATGEDPSRAL SVWTPPEWTE
DAGKSSLTQR GGASLTLRSN VGEGTLTTAS GPISIQTGAV IRVDTGQSIS LQAKDFTIDG
TLTAPGGTIS LTQASASLNQ GIGDNKPGLV WIGDDAVLDV AARAVTATNA RGETYGVVGN
GGSILIGGAM DWETTGESST LNAFVVIRPG ALLDASGTSA VLDISGTGLA GTSAPLEVAS
NGGSIVIKSS NGIYLDGTLR AAAGGANAAG GTLALALEAP NYLRSSTSGD VLRHRELVIA
DIQGDSAIAD ADSMAEAKAA LVTGTARLGV DRIKAGGFGT LSLLSDGLIS FDGSVTLAMS
QSLSLYAGAF ALGANAAADS RVSLSGPYVR LAGVTRIAKD LYTLPAVRWD EGYGTPSQQS
SSAVFSVAAD LLDIRDRVVF GIHQSINTQA ETYTLDRRGF ALVDLLSRGD VRMLGGTGTK
GAQLQTPGNV TVTAAQIYPA TKTTGSITAG YIPGSDALLL AGSVLNILRY GDTDPDVPYS
AFGALTLAAD TINQGGVVRA PFGRIVLGSL SSGSVPQADT VHLLAGSITS VSGVGLVMPY
GGTADGTSYT YNGATVAATS PTITLNGLHI EADAGSLIDL SGGGELTGAG FVSGRGGSVN
ILTTPLVNAN PGYSYSAKGN QVYAIVPSSS VAYAPVVQEA GYGLPAVGRQ ITIPEGVPGL
AAGTYTLMPA TYALLPGAYR VELGSTVSPA VTGVAATGGG SYIVSAYLGV ANTAIRASLP
NRVIITAADQ VRKHSSFNET TYNAYVLANA ETNSVARGWM TVDAGGLYMQ LAKARVADDR
MQLMFDGALR IRAEAGSDGY SGAVSITGIS EILATGQGAT AGMVAASVSA DELSKLDAAR
LILNAGYFGD IVLRSGARLS AAEIVFYSRS RSWAQERGAI TIEEGATIST IGRGSTGTDT
SAPYLVDSGM LIVSNGVITL LQGETTAADV DITIGACVTA GCNLTTTIAS EGTIGLMTSG
SVSFADNVSY GTRNLVLGLS AINLGSDASI AAASAAGRLP TGLTLNQAVL ARLLAGNTAI
GAPALETLAL SARDAVNVFG DVTLDASTLE RLVLATPAIY GYGAAGDTAT IRAGEFVWTG
STLAPGAAMA DRLGDSTLDI VANRVVLGYG LNTQPSTTTV DNRIALGFAN VNITATDYVT
STNESTLGVY ARQGAYDATT GYQYSGGNLT ITAPLFTGAA GSVNTITAGG DIRVVGSGGT
AGSVDELGAT LKLTGATITV DTSVVLPSGR LELTAAGDIV LGDNSRLDLS GRAVVFFDVT
KYSWGGDLVM TSTAGNISQA AGSVIDLSAQ YNSGGTATVT ALGASAGHVD LAGTFRGGAT
GTYDAGGTYV PYDAAELSVY AQTLADFAGL NARLNSSELF GARRFQIKQG SLVVGNEVKA
REVEITLDGG SLTVNGTIDA SGYQVGTIRL AAMGDLTING TLDAHGTGMR FDSYGKIIES
PNRAIVDLTT RMGTLTLTGN AAVDLRAGTN VMFGSGEYLN DGVARGTLTL NAPRLGGSGV
AAGTRGNDGA NDVAVNVQGT PLIRGAKTIA VNAFRIYDDA PLAAAPDVTG YKPQEITQSY
LADLDNDSVA FINAALGNAS LSARLAGLGS YHLRPGVDIV SKVSADNPNG DLTVAGDLDL
SGYRYGPNAD RNVNSATYGF GEPGAYNIRA VGNLNIHGSI NDGFAPPPST PDDVKGWLLE
EGVVPYGGDL VLATAVTLET GTVFKKGVTL NYDLPASFGT LPSGTVLPVR ATLASSLALS
AGTVVQATIY NADGSVAYAA GMVLPNAVTL TAGMQLGAGT VLKSAASFAA LVWPKGVPLP
VDMTSTAQMT LAAGSLIPAQ TNVKLVGGTA VDLRGTTGGI QGRNWALASM LAAGAKSSDL
TLVAGADLGS ANVRARNALG KGDIILADTH YVSRYASAGD VLNLSRAGAE ALCAAVCNDY
GYTVDDFIGK SEAEISALLW GTWEEIVYYY GMPANFWDPA QGNIELGLTQ KGLDALMVVL
GGYLPEGITN PSALLNKTWV QIATIYGDPN YTMFDFGLPD DFADPAQNYL GTISLPATTT
VVAPSFSVVR TGTGDLSLVA AGSIKMQSLY GVYTAGTANA VDPSYNLARS VNVDGTLLGS
TNADYATAAL ASYQAWYPDH GGNVLIAAGG DLIGDIYGQS ARQSPSSVLT GNWLWRQGSG
TAAVDQTIAT SWWINFGTYV ENTSYSNPSD IPDLIGFTGI GALGGGNITI RVGGDAGAIT
QRSSAGGQTS GVDRSQGLVV AVGSTGRVGA DGSLTLTGGG DIDMRIAGAL NPNLAITTSN
DKQALGGSLI NLRGTLAVTA ASIGGIELLY GFRDSYDTRG ADPYEATRSE ARSGITVVPG
DSAVYLQTAG DLVLGGVGDA GRSATPSSSA FSVDGVDYAS GGGSWFTLWT DHTAINLISA
GGNLTPTTST TETWSIGSSN YNSDQNAADG WFTYPSILRA AALGGSIYYG IDALSFMRYD
ASAVYPTITL APSATGSLEM LAADSIYAGH YAFSLSGTGT ALPTPFNPAF AGYPSDGSID
VIVTNASPNG SHSSGSSNPI NGSLFVFGPN DAKVALDRAD DADPVRFYAR EGDIVGLMTG
ETVTFGSGTT WLNASAPVVV RAGRDIVAAG LAPGVTVYGT ALYGYSDGNL IVHSDADDVS
IFSAGRDILY ANIDIAGPGA LEISAGRNLY QADKGVITSL GAIASGDTRP GASIALLAGV
GEAGPDYENL AALYLDPARL AVAGVPLEDQ FGKVAKTYEK ELAAWLKERY GFTGTDAEAL
AYFGTLAPER QRIFLRQVYF AELTAGGREY NDSTSSRYGS YLRGRNVIAA LFPDQDANGR
PVVRAGDITL YGASGVRTQK GGDIQTLTPG GRTIIGIEGQ VPPASAGLVT QGQGDIQLYS
KGSILLGLSR IMTTFGGDII AWSAEGDINA GRGSKTTVVY TPPRRVYDNY GNVSLSSQVP
SSGAGIATLN PIPEVPAGDV DLIAPLGTID AGEAGIRVSG NVNLAALQIL NAANIQVQGS
STGIPTVQAP NMSAGLAASN ATAATQQTAA PNTGAGNDRP SVIIVEFLGF GGGDGEPAQE
NKRRKDSERQ TYNQNSAVQF VEFGAKP