Gene RPC_2911 details


Gene Information

Locus tag: RPC_2911
Symbol:
ID: 3970005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3164785
End bp: 3177192
Gene Length: 12408 bp
Protein Length: 4135 aa
Translation table: 11
GC content: 66%
IMG OID: 637926024
Product: filamentous haemagglutinin-like protein
Protein accession: YP_532778
Protein GI: 90424408
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
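
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3164785, 3177192)
print(length_bp)                  # 12408
print(protein_length(length_bp))  # 4135
```

Both values agree with the Gene Length and Protein Length fields of this record.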


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGGCCG GCTGCAGCGC GCTTGCCATC ATGCTGGCCG CGCCGAACGC CCATGCCGTT 
GTGGTCGGCG GGGGAAGTGG CGAAGGCACG ATCGTCTCGG CGCCCAACCT CGCGACGGAT
GCGGCGCGGG CGGCCGCGCA GCAGGCCCAG CAGGCGGCGC AGAAGGCACA GCAATCGCTG
GAGCGCGCCA CCCAAGCCCT GCAATCGATG CAGTCAATCC AGGCCGCAGC CCGCGCCGCC
GCCCAGGCTG CGCCGGGCGC GGTGCCGAAC GGGCTTGCCC CCGGTGGCCT GCAGGAGGCG
CCGGGGGTGA GGCAGGACCC CTCGCTGTGG CAGGGCGCGC GGTTGCCGAC GGAGACCCGA
GCGGGCGGTC AGGTCCAGGT CACCGTCGAG CAGACGGTGT CCAGGGCTAT TCTCAATTGG
CGCAGCTTCA ATGTGGGCAA GGAGACGGCG CTTACTTTCA ACCAGCAGGG CAACCGCGAC
TGGGTGGCGC TGAACCGGGT GACGGGCGAC ACCGGCCCGA GCCAGATCCT GGGCTCGATC
AATGCCGATG GCCAAGTCTA TGTCATCAAC CAGAACGGCA TCCTTTTCGG CGGATCGTCG
CAGGTCAATG TCGGCGCGCT GGTCGCCTCG GCAGCGAAGA TCACCGACGA GCAGTTCCGC
AACAGAGGCC TCTATTCGGC CCAGACCGGT GGCAGCTATC TGCCGAGTTT CACCGACGCC
GGCGGCAAGG TGATCGTCGA GGCGGGCGCG CGGATCGCGA CCCATGCACC GAAATCGGTC
ACCGCGGGCG GTGGCTTCGT CATGCTGCTG GGCCAGGAGG TGAAGAATGC CGGCCAGGTG
CTGCTGCCGA AGGGCCAGGC GCTGCTCGCG GCCGGCGACG ATTTTGTCCT GCGTCGCGGC
TACGGCACCG ATGGCAATCA GGCCTCCACC ACCCAGGGCA GCGAGATCGC GCCGATTATC
CGCGCTGGAA GCCTCAGTGG CAGCGTCACC AATAACGGCC TAATCTTCTC CCAGCAGGGC
GACATCACGC TGGCCGGCCG CACCATCGCG CAGCACGGCG TCCTGGTATC GACCACCTCG
GTCAACACCC GAGGCACTAT CCATCTGCTG AACTCCGCGA CAGATGGGCT GGGCAGCGTC
ACCTTGGGGG CTTCCAGCCT CACCGACATC CTCCCCGAAC TCGACAGCAG CGAGACCGCG
CTGAACGCGC AGCGCGACGC CTTGATCAAG GCCTCCGGTG AGGTCGCGCA TCTGCCGGTG
CGCAGCGCGC CGTTCGACAA CCTGTCGCTG CTATCCGACC GGCTCGACCA GTCGCGCATC
GAGATCGTCA CCGGTGGGAA TGTGCTGTTC AAGTCCGGCT CGCTGACCAT CGCACAGGGC
GGCCAGGTCG CGGTCTCGGC GGGCCAACGC ATCTTCACCG AGAGTGGATC GACCATCGAC
GTCTCTGGCG TGCGCGGCGT GCTGCTGCCG GTGACAGCCA ACAACATCAA GGTCAACATC
CAGGGCAATG AGCTGCGCGA CAGCCCGGTC AACCGCGACA GCACGTATTT GAAGAATGCC
AACGTCTGGA TTGATTTGCG CGACCTGGTC TTGGTGCCGA AGGAGACCGG CGGCTACGCC
AGCGACCGCT ATTACACGCC GGGCGGGCTG CTGGAGGTCT CCGGCTATCT CAACACCACC
GCGCATAAGA TCGGCGAATG GACCGCCCTT GGCGGCACCA TCACGCTGTC GGCCCCGGAG
GTGGTGGCGC AGCGGGGATC GGTGTTCAAC GTCTCTGGCG GCTCGATCAC CTATGAAGCC
GGCAACATCC GCACCAGCAA CTTCCTCGGG GCCGACGGCC GGCTCTACAA CATCGCCAGT
GCCCCCGCCG ACATGCTGTA TTATGGCTTG GGCGAGGGCT TCATCCGCAA GCACGAGCGC
TGGGGCATTA CCGAGGTGTG GATGAGCCCA CTCGGGCGTG GCCGGGAAAG TTTGCATTGG
GATCCGGGCT ACACGGTCGG CCGCGACGCC GGTCGCCTCA ATCTGTCCAC GCCGACTGCG
ATGTTCGAAG CCGAGATCAT CGCCGACGTC ATAGCCGGCG AGCGCCAGAA CAGCCCACGC
GCGGCCAATG TCGCCGACGG CTACAAGCAG GTGCAGAACG CCGCAGCGGT GGCCGGTACG
GTGGGGATCG GGCAATATAC CGCGCTCGGG CAGCCGGATC TCTATAACAG CGACGTTCGG
ATCGGAGATG TCTCCTTCGT CACGGCGGCG CTCGGTGCTG CCGATGTTCT GCCGAGCGGC
CGCGCCAATA CCGTGTGGCT CGATGCCCGG CATCTTTCCG AACAGGGGCT CGGCGGCCTG
CAGATCGGCA CCCGCGGCAC CATTACCATC GACGGGGATC TCACGCTCGC CAAGGGCGGC
GCGCTCGGTC TCATCGCGCC GATCGTCGAG ATCAAGGCCG ACATCACGGC CCAGGGCGGT
TCGATCGACG CGACCAATGT GTTCACGTCC GCGTTGCTGG GCACTGTCAA CCTTGCATCG
AACGGGATGT CGCGGATCGC GGTGCGCGCG GGTGCCGTGC TCGATCTCAG CGGCGTGCAA
GGCCTCGACG CGCCGGGGCA GGATCCGAAT GTGCTGGCTT ATCTCAATGG CGGTTCGGTC
TCGCTGCGTT CGAGCCGCGA TGTTGTCGTC GAAGACAACG CGGCGATCGA CGTGTCGTCG
GGCGCTGCAA TGCTCAGCAC CGGCAAGATC CAGGGCGGCC GGGCCGGCTC CGTCACGCTC
AAGGCCAATC TCAACGGAAC GAATTCCACG GGTGACCTGA CCCTTGCCGG CGACGTGCGC
GGCTATGGCA TGAACGGCGG CGGCACGTTG ACTCTGCAGA CCGATCGCGT CGCGATCGGG
GCCGCGTCCG ACATCACCGG CGCGGGTACG CTGGCGCTGG CCGCGAATTT CTTTTCCAAG
GGGTTCGGCT CTTACACGAT CATCGGCAAC CGCGGCGTGA CGGTGGCGGA CGGTGCGATC
GTGGATGTCA CGATGCCGGT CTATCGCCTT GCGAATTCGA CGGCGGCGGG CCTCGAAGCC
TGGACGCCGC CGGTTTACAC GGCCGACAGC CTGACCGGCG TGCTGACGCA GCGCGGCGGG
GCCAGTCTGA CGCTGCAGGC CGGCAACAGC CAGTTGCCGG CAAGCGAGTT GCCGTCGGTC
GGTCTCACCG TCGGCACGAA GGCCGCGATC AGTGTCGATC CCGGCAAGTC GATCGCCCTG
AGCAGTGTCG GGCAACTGAC GGTCGACGGC ACGCTGCACG CGGCAGGCGG CAACATCTCG
CTCAAGACCA TCGATCTGGT GGGGTCCCAG GTCGATAAGG TTGCAGCCAC GACCAACAAC
CGTTCGATCT GGATCGGCGA GCGCGCCCGG CTCGATGTGG CAGCACAGGC CGCCACCGCG
CTCGATGCAG GCGGAGGACG CTATGGCGTT GTCGCCAATG GCGGCAGCAT CGTGATCGGT
GGCGAGATCG ATAGCGCGCT GGGGCAGGCG ACGAGCTCCA ACAACTTCAT CGTGGTGCGG
CCCGGCGCGG TGCTCGACGC CTCCGGCACC GCGGCGACGC TGGACGTCGC CGGTCTCGGC
AGCGTCGACG TCGCCAGCAA TGGCGGCAGC ATCTCGCTGG TCTCCAATAA CGGATTGTTC
CTCGACGGTA CGATGATCGC CCGCGCCGGC GGAGCCGGGG CGGCAGGAGG CAGCCTGAAC
GTCGCGCTGC AGATGCCGGA ATACCGCATC ACCAGCGGTG ACCTGGTGAA TCTTCCGCTG
GCCGATCCGG CCTATCGGGC ATTGCGTCAG ATCGTTCTCG CCGATCGGCA GGGTAGCTCG
GTGCTGGGTG CCAACCTGCA GGCCGGTGCG GCCGATCCGG CCCTCGCCTA TGGTTACGGG
CGCCTCGGTG CCGACACCGT GGCGCGTGGC GGCTTCGACA ATCTGTCGGT GATGAGCAGC
GCCATCGGTT TCGACGGCAA TGTCAGCCTG CGTGTCGGGC AGAGCCTGAA CCTCTACAGC
GATATGATGA TGCCGGTGAC CGGCAGTACG CAGGATGCCC GGATCACGCT GGCCGCCGCC
TATGCCCGGC TCGGCCAGAT CAACTACGTC TACAAGGAAC CGGACCCCCA AAAATCGCGC
ACACCGGTCT TCACTGCCGA CCCGATACGC TACGACGTGC AGCTCGACGT CGCCGCGCAA
CTGATCGACA TTCGCGACAC CGTCAATCTG ACCTTCGGCA AGACGCGGCT GGCAAGCGCC
GGCGATCTGC GCTTCGTACA GGGCGTGCGG CCCATGGGCG GTCTGATCAG CACGTCGCTG
CTCAGTCCCG GCAGCGTCAC GCTGCGCGCG GCGCAGGTCT ATCCGACGAC GGGCACGGTC
GCGCAGGTGC AGGTCGGGCA AAACACCGTG AACGCTGTGA TCGGCTATGA TCCGGCGCAG
ACACTGCGCA TCGAGGCAAT TTCGGCCGAG ACGCCGGCGG TGCCCTATTC GGTGTTCGGC
GACCTGACGC TGGCCGCGGC CGTGGTCGAG CAGGCCGGCG TCGTTCGTGC GCCATTGGGC
AAGCTGACGA TCGGAGCGGG CGGCTATGAC AACAACAAAT CCACCCGTGT CACCCTGCTG
GCGGGCAGCC TGACCTCGGT CAGCGGTCGC GGGCTGGTGA TGCCCTATGG CGGCACCATC
GACGGTCTGA GCTACAGCTA CAACGGCAAG AACGTGACGC TGATCGGCGC CGGCGGTGCG
AGCTTCGTCG GCGGCAGCGG CGGGCTGACA GTCGGTGTTG CTTTGGGCGG ACAGAGCGCG
GTGGTCGAAA CCGGCGCGGT GCTGGATCTG TCCGGCGGTG GCGATCTCAC CGGCGCGGGC
TTCATCAGCG GCCGCGGTGG CTCGGTCGAT ATTCGCACGA CGGCATTCGC CAACGCCAAT
CCCGGCAACA GATTCAGCGC GTCCGGCAAT ACGGTCTATG CGATCGTGCC GGGCTTCAAG
GGCAACTACG CCCCGGTGGG GTCGGAAAAC GGCGCGAACG ATCCGGTTGT CGGGCGCCAG
ATCACCCTGG ACGGCAGCGT GCCTGGGCTG CCGGCCGGCA CCTATACGTT GATGCCCGCG
ACCTATGCGC TGTTGCCCGG CGCGTTCCGC ATCGAGATCG GCGCGCAGCA ATCGCGCGCT
GTTGTCGCCG GCGCCCATGC TGTCGGCAAC GGTTCCTATC TCGCCTCGGG CCAGCTCGGC
ATTGCCAATA CGACGATCAA AGACAGTCTC CTCAGCCGGA TCGTGGTGAC GCCGGGCGCC
GTGCTGCGCA GCTATTCGAA CTACAACGAA ACCAGCTTCG CCAATTATGT GGTCGCCGAT
GCCGTCAAGC GCGGCGTGCC GCGGGCCATG TTGCCGATCG ATGCGCGCAG CCTGCTGCTG
AATCTCAATC CCGGCGCCGG CATCAATGCC TTCCGTTTCA ACGGCAGCGC GCGTTTCGAT
GCGGCGAAGG GCGGCCATGG CGGCGCCGTG ATGGTCGCGA CCAGTTCCGG CGGCGGCTCC
GATATCGAGA TCGTGGCGGA AGGCGGTACG GCGACGGCCG GCTTCGCCGG ATTGTCGATC
GATTCATCGA TGCTCAATTC GCTCGGCGCG TCGCGCATCG TGGTCGGCGG CATTCAATAT
GTCACTTACG GGCAGGGCGG CCGGCTCGTC GATAGCGCCG ACTCCACGCA ATTCATCTAT
CTGCGTTCGG GCTCGCTGCT GGCGGCGCCC GAAGTCTTTC TGATGGCCAA CAGCAACAAC
ACCCGAACCG GAAGCGGCAT CGTCATCGAG CAGGGCGCAC GTATCAACAC CATCGGCAAA
GGCGCTGCTG CCTATGACTC GCGGGACGGC TATGTCTATG CCGCGACCGG CAACCAGGTG
ATCGTGTCCA ACGGCCTGCT CACTCTGGCG CCGAACACCG CTGCGGCGAA TGCCGTCACG
GCCAGCGGCG TCCGCATCGG CGTCTGTGAC AGCGGCAACT GCAGCGGCCA GACCGAAATC
TATTCCGAAG GCACCATCGC GCTGGCCACG CCGAACCAGT TCCAGATGAA CGATGCCGTG
GCCTACGGCA CCCGCAATCT CACGATGTCG GTCAGCCGGA TCAATCTCGG CGACAGCGCG
GTGCTGATTG ACGCGGCAGC GCGCAACATC CTGCCGAGCG GGCTGTCGAT GAACCAGGCG
CTGCTCGACC GCCTGCTGCG CGGCGATACG CGCTACGGCG CGCCCGGGCT CGAGACGCTG
ACGCTGTCGG CGTCCAATGC CGTCAACATC TTCGGCAGCG TGACGCTCGA CACCGTCGAT
CCCGCGACCG GAAAATCGAC GCTGGCCAAT CTGGTGCTGG GCACCCCGGC AATCTACGGC
GCCGGCGCCG CCGGCGATGT CGCCACCATC CGCACCGGCA ATTTCATCTG GCGCGGATCG
ACGCTGGCGC CGGGCGAACT CGTCGCCGGC GGCGCCGGGA CCGGCAGCGG CACGTTGAAC
ATCGATGCCG AGCGCATCGT GTTCGGTTAC GGCGCCAATT CGCGCCCGAC CGGAAATGAT
GTCGATGGCC GGCTGGCGCT CGGCTTCGCC AATGTCAATC TGTCGGCGAC CGACCGCGTC
AGCGCCAACC AGCAGGGCAA CCTGTCGGTC TATCAGTCGC GCGGCGCTTA TGTCGCCGGC
CAGGGCTACA GCTATAGCGG CGGCAACCTC ACCATCGCGG CGCCTTTGGT GACCGGCGAG
GGCGGCTCGG TCAACAGCAT TACCGCAGGC GGCGCGCTGC GGCTGCTCGG AGCGGGCACG
CCCGGCGTGG TCGCCACCGA CACGCTGGGC GCGCAGCTGA CGTTGAAGGG CGACAGCATC
GCGCTCGACA CCGCGATCGT ATTGCCGAGC GGCAAGCTGA CGCTGAAAGC GACCAATGAC
ATTTCCCTGA CCGGGGCGTC GCGGATCGAT ATGAGCGGCC GCGAGGTGGT GTTCAACGAT
GTCAGGAAAT ACAGCTGGGG CGGCGATGTC ACGCTCGACA GCAGCAACGG CAACATCCGC
CAGGCGGCCG GCTCGGTGAT CGACCTGTCG GCGCGCTTCA ACAATGCCGG CAGCCTCACC
GCGATCGCGC TCGATGCCGC GGCGGGCACG GTCGATCTGC AGGGCGCGAT CCTCGGCACC
ACCGTCGGGC GCTATGACGC CGGCGGCACG CTGGTGCCGT ATCGCGCCGG CAGCGCCACC
GTGCGCGCGC AAAACTTCGG CGATTTCGCC GCGCTCAATG CGCGGCTCAG TGCCGGCGGC
GTGTTTGGCG CGCGCAGCTT CCAGATCAAG CAGGGCGACC TCACCATCGG CAACGAGCTG
AAGGCCAGCG AGATCAACGT GTCGCTCGAC AACGGGCGTC TCACCGTGGC CGGGACCATC
GACGCCAGCG GCGAGAGCGT CGGCACTATC AGGCTTGCCG CGAAGAACGG CCTGACCCTG
ACGGGATCGG CGGTGCTCGA TGCTCACGGC ACCGTGCTGC GCGTCGACAG CTACGGCAAG
ATCATCGATA GTCCGAACCG TGCGATCGTC GAACTCAGTT CGGGCAATGG CCGGCTCACG
CTGGCGCAGG GCGCGCGGAT CGATCTGCGG CACGGCACCA GTGCGTCCGT TGGTTCGGCC
GCCGGACAGA ATGACGGCCG CGCCCGCGGC ACGCTGGAGC TCAACGCGCC GCGGATCGGC
AGCAGCGGCA GCGTCACCGA CCTCGATGCC GTGGCTTATG GCGACATCGA TATCGATGTC
GGCGGCATGC CGAACATCCA GGGTGCCCGC TCGATCGCCG TCAATGCGAT GCGCAGCTAC
GACGACGCGC CATACGCGGC GGATCCGGCG GCCAATGGCC GACGCTATCA GTACATCGAT
CAGGCCTGGC TGAAACACCG TCACGACGAA AGCACCGACT TCATCAATGC GGCGCGCGTC
AACGCCACGC TGGTCAACGG CAAGCTCGCC GGCCTCAACA ATGCGACCTA TCGCGACGTC
TTCCACCTGC GGCCGGGCGT CGAGGTGGTC AGCAAGACGG CGGACGGCGA TCTCGTGGTG
CAGGGCGATC TCGATCTGTC CGGCTACCGC TACGCCAGCC TCAATCCGCA TTCCCAGTTC
ACCGGCATCG CCGGAGATGG ATCTGGCGAA GCGGGCGCGC TGACGCTGCG CGCCGGCGGC
GATCTCAATA TCTATGGCAG CATCAGCGAC GGCTTCGCCG CGCCGCCGGC GACGCAGGAT
GACACTGGCT GGCTGCTGCT GCCCGGCAAG GAATTCACCG GCGCCGATAT CGTCATTCCG
CGCACCGGCG TGATGCTCGC CACCGGCACC GTTTTCTTCG GCGCCAAGGC GCTGAACTTC
GATCTGCCGA TCCGGGCGCG TCTGTTTTCC GCCGGCATGG TGATCCCCGT CGCCAGCGCG
CTGAATGCCG TGATGAACTT GACGCCGGGG ATGGTCATAA CCGCCAATAT TCGCAACGCC
GGCGGCACGA TCCTCTACGC TGCGGGCACG ATCATCGGGA CCGCCGTGAA GCTGCCGGTC
GGGACTCGGT TCGATGCCGG CTGGAAGATG CCGGTGAACG CCAGCCTGGC GGCGATGGTC
TGGCCGAAAG GCGTTCCGCT GCCGGGCGTA TCCACTGCGC AGTACCTTCT GAACGGCAAC
CTCACGCTTG CGATGGGGGC GGTATTGCCG GCCGGGACCG ATATCAAACT GCCGGACGGC
GTGGCGTCGG TCGAGTTGCG GCCCGGCGCC GCAGGCAAAA TGTATGGCGT CGCGGCGATG
CTGCCGGAGG GCTCGCAGTC GTGGTCGATG CGGCTGGTGG CCGGCGCCGA TACCCAGGCG
GCCGACAATC GCCTGACCGA TCCGCATGCG ACGTATGGCG ATCTGCGATT GGCCGACCAT
CACTACGGCC TCTACGGCAA GTCCAAGCCG GGGGGCGCCC TGGTCTGGAC GCAGGAAGGC
GTCGACGGCT GGGGTGACCC CAACAACGCG TCGATTTTTG TGGGGGCGCC GGTCGACGAG
GCCGCGCTCG GCTATATCGG CATGTGCACC GACAATCCCG GCTGGTGCGT GGTCAAGCCG
GTCTTTACCT GGACCCAGTA CGCAGCCGAC GATCTCGCGA ATCAAGGAAT CCCCGGCATC
GTCGTGGGCG CGGTGATCAC CGACAAGTTT CTGGACAGTT TGATCCCCGG CCTCACCGTC
GATTCCCTGT GCGCCGGCAC GCCCTCCTAC TGCCTGAACA CGGCCAAGGA CACGTTCGAC
TTGCTTCCCG GCAGCACGCG CTTCAGCGTG GTCCGCACCG GTGCCGCCGA TCTCGATCTG
CTCTCGGCGC GCAATCTGCA GATGAACTCG CTGTTCGGCG TCTACACCGC CGGCACGTCA
TCGACCACGA CCAGCGGCAG CGATCCCTAC AATCTGCCGC GGGCCAAGAC CGCGTCGGGC
ACGGTGCTTG GCGACGCCAA TCATGGCTAC GATAAATTCG TCGATGGCGG GGCGGACAGC
CTGGCGCGCG CCTGGTATCC GACCGGCGGC GGCAATCTCA CCATCAAGGC GGGCGGCGAT
CTGACCGGCA ATCTGATGAC CCCGCCGGTC TACACCAGTG GTGTTGGTCG GCCCAATCCT
AAGGACGTCG GTTACAACAG TGCTTCCATC GGCAATTGGC TGTGGCGGCA GGGCACCGGC
ACGACGCTCG GCGGCGGCGC CGATCAGGCC ACCGCCTGGT GGATCAATTT CGGCAGTTAC
GTCGCGCGGA ATGGCTGGGC CGACACCTTG ATCGGCTTCA CCGGTTTCGG CACGCTGGGC
GGCGGTGATC TGCGCGTCGA TGTCGGCGGC AATGCCGGCA TCGTCACGCT GATGGGCAGG
AACGTCGTCG CATCGGATGG TGTATCCGCG CAGGATCAGA CCAGCAACCA GCGCACCCAA
GGTCTCGTGC TGGCCGTGGG CAGCACCGGC CGTGTCGGCG CCGATGGCAG CCTGACGCTG
ACCGGCGGCG GCGACCTCGA TCTGCGGGTC GGCGGCAAGC TCAATCCGGC CTCGCTGTAC
AAGGACGCCA GCTTCAACGG CACCGTCACC AATTTGCGCG GCGACACCAG CATCGCCGCA
GCCTCGCTGG GCATCGTGAG CCCGACCTAT GGCGTGCAGC CGATCGATCA GTCGCCGCGC
GAAAGCCGAG CCTTTGACGT ATTCAATGCG TCGCGGGGCT TGGCCTTCGG CGGTTTGCTC
CTGGCGCCCG GCGATTCCAC CTTCACCCTG AACAGCGGCA GCGATCTGGT GGTGTCCGGC
GCCAGCGATC CCGGCCGCGT CACCACGATG ATCGATGCAC CCTATGGGCC AGACTATGGG
CGCAACGGCG CGGCCGGCGG CAAAACCTGG TTCTCGCTCT GGACCGACCA CAGCGCCATC
CACCTGTTCT CGGCGGGTGG CGACCTGGTG CCGATCAGCC TCGGCACGTT TGTGCCTTCC
ACCGACTCGG CGATGATGTA TCCGTCGATC CTCACCGCCG TCGCGGCCGG CGGCAGCTTC
TACTACGGCA ATGCGATCTC CGATCGCTCC GCCCTGATCT CGGACGTTTA CGCGCCGCTG
CTGCTGGCGC CTTCGGCCAA CGCCAAGCTC GAATTCCTGG CCTCGGGCTC GATCTATGCG
GGCGGCTACA CCGTGGCCCG GACCAGCGCC TCGGCGGCGT CGCTGGCGAC CCCGTTCCAT
CCGGCTTTCA CCGTGATCGA CCCGACAACA GGCGCCAGCG CATTCAGCAA TCTGTCTGCG
GCCGGAAACG TGGCCGATTG GAGCCAGGGC ATCTATCCGC TGTTCGCCTT CGGGCCCAAC
ACCGCCAGCA CGGAATGGGG CTCCCTCGAC CCGGCACGGT TCTACGCCGT CAACGGCGAT
CTCGTCGGCG TCAGCAGCGG CCGCATCGTC AGCTTCGCCG CGAATGATCC GACGCGGGCC
GGACAGGTCT GGTATCAGGG CGCAGGCCCG GTGCGCATGA TGGCCGGCCG CGACATCGTC
GGCAGCGGCA CCGCGATCGG GACCACCGAG GACGGGCTGG GGTCCAATTC CAATACCTAC
TTCTCCTCAA CCGGCAATCT GTTCGTGCAC AATGCCGAGA CCGATCTGTC GCTGGTGCAG
GCCGGGCACG ACATCATCTA CAGCAGCTTC AATGTTGCCG GTCCCGGCGC GCTGCAGATC
ACCGCGGGCC GCAACATCCA GATGGACAAC AAGGCTGGCG TCGTCTCCGT CGGCCCGGTG
GTCGCCGGCG ATCACCGTCC AGGCGCCAGC ATCACGATGC TGGCGGGTAC CGGCGCGACA
GAGCCGAACT ACCAGGCGTT GCTACGCTAT CTCGATCCGG CCAACCTGCT GCCGAAGGGG
ACGCCGCTCG ACGGCTCCGG CAAGGTCGCC AAGACTTACG AGAAGGAGTT GGTCGCCTGG
CTCAAAGATC GTGCCTTCAC CGATAAGGAC GTTGACGAGG TCGGCGCGAA GTTTGCCAAA
CTGTCGCCAG AGCAGGTCGC CAAGACTATC GGCGAGGCCC GCGCGAAGTT CGCCGATCTG
ACGTCCGAGC AACAGGCCAT CTTCCTGCGT AAGGTCTATT TCGCCGAGTT GACGGCGGGC
GGCCGCGAAT ACAACGACGC CACCAGCACG CGTTACGGCA GCTACCTGCG CGGTCGCGAG
GCGATCGCGG CGTTGTTCCC GGATGCTTCG GCGACGTCGG GCGGCATCAC CATGTTCGGC
GCATCCGGCG TGCAGACTTT GTTCGGTGGC GACATCCAGA TGTTCACGCC ATCAGGCAAA
CTGGTGGTCG GCGTCGAAGG TCTTGCGCCG CCGGCAACCG CAGGCCTGGT GACGCAGGGG
GCGGGCAATA TCCAGATCTA CAGCCAGGGC AGCGTGCTGC TCGGCCTGTC GCGCATCATG
ACCACCTTCG GCGGCAACAT CGTCGCCTGG TCTGCCACCG GCGATATCAA CGCCGGCCGT
GGCGCCAAGA CCACGGTGGT CTATACGCCG CCGAAGCGCG TCTACGACAT CTACGGCAAC
GTTGCCTTGT CGCCGCAGGT GCCGTCGGCC GGTGCCGGCA TCGCCACGCT GAATCCGATC
CCGGAAGTCA AGGCCGGCGA TATCGACCTG ATCGCGCCGC TCGGTACCAT TGATGCGGGC
GAGGCCGGCA TCCGCGTCTC CGGCAACATC AACCTCGCAG CGCTGCAGAT CGTCAATGCG
GCCAACATCG CGGTGCAGGG CACGTCGTCC GGCATCCAGA CCGTGCAGGC GCCGCCCGTC
GCGGCGTTGA CGGCCGCGGG CAACATCGCC GGCGCCACCC AGCAGCCCCC TCTGGCGGGG
CAGTCGAACA GCAGCGGGCA AAGCTCGGTC ATGATCGTCG AGATCATTGG TTACGGAGGG
GCACAAGGCA CGGACGACGA CGAGGAGCGC CGGCGTCGTG GACAATAA
 
Protein sequence
MLAGCSALAI MLAAPNAHAV VVGGGSGEGT IVSAPNLATD AARAAAQQAQ QAAQKAQQSL 
ERATQALQSM QSIQAAARAA AQAAPGAVPN GLAPGGLQEA PGVRQDPSLW QGARLPTETR
AGGQVQVTVE QTVSRAILNW RSFNVGKETA LTFNQQGNRD WVALNRVTGD TGPSQILGSI
NADGQVYVIN QNGILFGGSS QVNVGALVAS AAKITDEQFR NRGLYSAQTG GSYLPSFTDA
GGKVIVEAGA RIATHAPKSV TAGGGFVMLL GQEVKNAGQV LLPKGQALLA AGDDFVLRRG
YGTDGNQAST TQGSEIAPII RAGSLSGSVT NNGLIFSQQG DITLAGRTIA QHGVLVSTTS
VNTRGTIHLL NSATDGLGSV TLGASSLTDI LPELDSSETA LNAQRDALIK ASGEVAHLPV
RSAPFDNLSL LSDRLDQSRI EIVTGGNVLF KSGSLTIAQG GQVAVSAGQR IFTESGSTID
VSGVRGVLLP VTANNIKVNI QGNELRDSPV NRDSTYLKNA NVWIDLRDLV LVPKETGGYA
SDRYYTPGGL LEVSGYLNTT AHKIGEWTAL GGTITLSAPE VVAQRGSVFN VSGGSITYEA
GNIRTSNFLG ADGRLYNIAS APADMLYYGL GEGFIRKHER WGITEVWMSP LGRGRESLHW
DPGYTVGRDA GRLNLSTPTA MFEAEIIADV IAGERQNSPR AANVADGYKQ VQNAAAVAGT
VGIGQYTALG QPDLYNSDVR IGDVSFVTAA LGAADVLPSG RANTVWLDAR HLSEQGLGGL
QIGTRGTITI DGDLTLAKGG ALGLIAPIVE IKADITAQGG SIDATNVFTS ALLGTVNLAS
NGMSRIAVRA GAVLDLSGVQ GLDAPGQDPN VLAYLNGGSV SLRSSRDVVV EDNAAIDVSS
GAAMLSTGKI QGGRAGSVTL KANLNGTNST GDLTLAGDVR GYGMNGGGTL TLQTDRVAIG
AASDITGAGT LALAANFFSK GFGSYTIIGN RGVTVADGAI VDVTMPVYRL ANSTAAGLEA
WTPPVYTADS LTGVLTQRGG ASLTLQAGNS QLPASELPSV GLTVGTKAAI SVDPGKSIAL
SSVGQLTVDG TLHAAGGNIS LKTIDLVGSQ VDKVAATTNN RSIWIGERAR LDVAAQAATA
LDAGGGRYGV VANGGSIVIG GEIDSALGQA TSSNNFIVVR PGAVLDASGT AATLDVAGLG
SVDVASNGGS ISLVSNNGLF LDGTMIARAG GAGAAGGSLN VALQMPEYRI TSGDLVNLPL
ADPAYRALRQ IVLADRQGSS VLGANLQAGA ADPALAYGYG RLGADTVARG GFDNLSVMSS
AIGFDGNVSL RVGQSLNLYS DMMMPVTGST QDARITLAAA YARLGQINYV YKEPDPQKSR
TPVFTADPIR YDVQLDVAAQ LIDIRDTVNL TFGKTRLASA GDLRFVQGVR PMGGLISTSL
LSPGSVTLRA AQVYPTTGTV AQVQVGQNTV NAVIGYDPAQ TLRIEAISAE TPAVPYSVFG
DLTLAAAVVE QAGVVRAPLG KLTIGAGGYD NNKSTRVTLL AGSLTSVSGR GLVMPYGGTI
DGLSYSYNGK NVTLIGAGGA SFVGGSGGLT VGVALGGQSA VVETGAVLDL SGGGDLTGAG
FISGRGGSVD IRTTAFANAN PGNRFSASGN TVYAIVPGFK GNYAPVGSEN GANDPVVGRQ
ITLDGSVPGL PAGTYTLMPA TYALLPGAFR IEIGAQQSRA VVAGAHAVGN GSYLASGQLG
IANTTIKDSL LSRIVVTPGA VLRSYSNYNE TSFANYVVAD AVKRGVPRAM LPIDARSLLL
NLNPGAGINA FRFNGSARFD AAKGGHGGAV MVATSSGGGS DIEIVAEGGT ATAGFAGLSI
DSSMLNSLGA SRIVVGGIQY VTYGQGGRLV DSADSTQFIY LRSGSLLAAP EVFLMANSNN
TRTGSGIVIE QGARINTIGK GAAAYDSRDG YVYAATGNQV IVSNGLLTLA PNTAAANAVT
ASGVRIGVCD SGNCSGQTEI YSEGTIALAT PNQFQMNDAV AYGTRNLTMS VSRINLGDSA
VLIDAAARNI LPSGLSMNQA LLDRLLRGDT RYGAPGLETL TLSASNAVNI FGSVTLDTVD
PATGKSTLAN LVLGTPAIYG AGAAGDVATI RTGNFIWRGS TLAPGELVAG GAGTGSGTLN
IDAERIVFGY GANSRPTGND VDGRLALGFA NVNLSATDRV SANQQGNLSV YQSRGAYVAG
QGYSYSGGNL TIAAPLVTGE GGSVNSITAG GALRLLGAGT PGVVATDTLG AQLTLKGDSI
ALDTAIVLPS GKLTLKATND ISLTGASRID MSGREVVFND VRKYSWGGDV TLDSSNGNIR
QAAGSVIDLS ARFNNAGSLT AIALDAAAGT VDLQGAILGT TVGRYDAGGT LVPYRAGSAT
VRAQNFGDFA ALNARLSAGG VFGARSFQIK QGDLTIGNEL KASEINVSLD NGRLTVAGTI
DASGESVGTI RLAAKNGLTL TGSAVLDAHG TVLRVDSYGK IIDSPNRAIV ELSSGNGRLT
LAQGARIDLR HGTSASVGSA AGQNDGRARG TLELNAPRIG SSGSVTDLDA VAYGDIDIDV
GGMPNIQGAR SIAVNAMRSY DDAPYAADPA ANGRRYQYID QAWLKHRHDE STDFINAARV
NATLVNGKLA GLNNATYRDV FHLRPGVEVV SKTADGDLVV QGDLDLSGYR YASLNPHSQF
TGIAGDGSGE AGALTLRAGG DLNIYGSISD GFAAPPATQD DTGWLLLPGK EFTGADIVIP
RTGVMLATGT VFFGAKALNF DLPIRARLFS AGMVIPVASA LNAVMNLTPG MVITANIRNA
GGTILYAAGT IIGTAVKLPV GTRFDAGWKM PVNASLAAMV WPKGVPLPGV STAQYLLNGN
LTLAMGAVLP AGTDIKLPDG VASVELRPGA AGKMYGVAAM LPEGSQSWSM RLVAGADTQA
ADNRLTDPHA TYGDLRLADH HYGLYGKSKP GGALVWTQEG VDGWGDPNNA SIFVGAPVDE
AALGYIGMCT DNPGWCVVKP VFTWTQYAAD DLANQGIPGI VVGAVITDKF LDSLIPGLTV
DSLCAGTPSY CLNTAKDTFD LLPGSTRFSV VRTGAADLDL LSARNLQMNS LFGVYTAGTS
STTTSGSDPY NLPRAKTASG TVLGDANHGY DKFVDGGADS LARAWYPTGG GNLTIKAGGD
LTGNLMTPPV YTSGVGRPNP KDVGYNSASI GNWLWRQGTG TTLGGGADQA TAWWINFGSY
VARNGWADTL IGFTGFGTLG GGDLRVDVGG NAGIVTLMGR NVVASDGVSA QDQTSNQRTQ
GLVLAVGSTG RVGADGSLTL TGGGDLDLRV GGKLNPASLY KDASFNGTVT NLRGDTSIAA
ASLGIVSPTY GVQPIDQSPR ESRAFDVFNA SRGLAFGGLL LAPGDSTFTL NSGSDLVVSG
ASDPGRVTTM IDAPYGPDYG RNGAAGGKTW FSLWTDHSAI HLFSAGGDLV PISLGTFVPS
TDSAMMYPSI LTAVAAGGSF YYGNAISDRS ALISDVYAPL LLAPSANAKL EFLASGSIYA
GGYTVARTSA SAASLATPFH PAFTVIDPTT GASAFSNLSA AGNVADWSQG IYPLFAFGPN
TASTEWGSLD PARFYAVNGD LVGVSSGRIV SFAANDPTRA GQVWYQGAGP VRMMAGRDIV
GSGTAIGTTE DGLGSNSNTY FSSTGNLFVH NAETDLSLVQ AGHDIIYSSF NVAGPGALQI
TAGRNIQMDN KAGVVSVGPV VAGDHRPGAS ITMLAGTGAT EPNYQALLRY LDPANLLPKG
TPLDGSGKVA KTYEKELVAW LKDRAFTDKD VDEVGAKFAK LSPEQVAKTI GEARAKFADL
TSEQQAIFLR KVYFAELTAG GREYNDATST RYGSYLRGRE AIAALFPDAS ATSGGITMFG
ASGVQTLFGG DIQMFTPSGK LVVGVEGLAP PATAGLVTQG AGNIQIYSQG SVLLGLSRIM
TTFGGNIVAW SATGDINAGR GAKTTVVYTP PKRVYDIYGN VALSPQVPSA GAGIATLNPI
PEVKAGDIDL IAPLGTIDAG EAGIRVSGNI NLAALQIVNA ANIAVQGTSS GIQTVQAPPV
AALTAAGNIA GATQQPPLAG QSNSSGQSSV MIVEIIGYGG AQGTDDDEER RRRGQ
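
The gene sequence begins with TTG while the protein begins with M, which is expected when an alternative initiation codon is used in a bacterial CDS. The sketch below illustrates this on the first 60 nucleotides of the gene sequence. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments and that ATG, GTG, and TTG are treated as start codons (a deliberately reduced set chosen for illustration). The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments, ordered by first, second, third base over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a reduced set of bacterial initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment in frame 0; a recognised start codon is read as M."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_line = "TTGCTGGCCG GCTGCAGCGC GCTTGCCATC ATGCTGGCCG CGCCGAACGC CCATGCCGTT"
print(translate_cds(first_line))  # MLAGCSALAIMLAAPNAHAV
```

The printed string matches the first 20 residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence would allow a comparison with the GC content field of this record.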