Gene RPB_2207 details


Gene Information

Locus tag: RPB_2207
Symbol:
ID: 3907949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2499290
End bp: 2511439
Gene Length: 12150 bp
Protein Length: 4049 aa
Translation table: 11
GC content: 67%
IMG OID: 637884102
Product: filamentous haemagglutinin-like protein
Protein accession: YP_485823
Protein GI: 86749327
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
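
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2499290, 2511439)
print(length_bp)                           # 12150
print(expected_protein_length(length_bp))  # 4049
```

Both results agree with the Gene Length and Protein Length fields listed in the record.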


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.370432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTTCA ACCGCTCGTC CCGCCCGCGC CGTTCCCGTC AGATGTCGAA GCCTCGGCCG 
CTCGCCGTGC TGCCGCTGTC GCGCAAGACG CTGTTGCACG GCACGAGTCT GCTGGCTCTG
TTGCTGTTGC CGACGATTGC GGCCGAAGCC CGCTCGCTCG GTGCGAGCGC GACGTCGTCG
GCGACGGCGA TTGCGGCGGA TGCAGCGGCG CAGGCCGCGG CGCAGGCCGC ATCCGCGGCG
GCGCAGGGCT CGGCGTCGCT ATCTCGGGCG ACGGCAGCGC TCCAGGCCAT GCAGGCGGCG
CAAGCCGCGG CGCGCGCGGC GGCTGGTAGT GCGGCCGGCG TGCCGAACGG CCTGACTGTC
GGCGGTCTGG TGCCGGATTC CGGCCTCGCG GCGGCCGGTG TCGCGCGTCC GGTGGTGAGC
TGGACCAACG CGCGCACGCC GGTGCAGAGC GGACCGACGG ATTCGCCGAC CGTCAAGGTG
CAGCAGACCG GCGCGCAGGC GATCCTCAAC TGGTCGAGCT TCAATGTCGG GGCGAGCACG
AGCCTCGTGT TCGACCAGCA GGGCAATTCG AGCTGGGTCG CGCTCAACCG CGTCGGCGCC
TCGTCGTCGC CGAGCCAGAT CCTCGGCAAC ATCAAGGCCG ACGGTTCGGT CTACATCATC
AACCAGAGCG GCATCATCTT CGGCGGCGGC AGCCAGATCA ATGTCGGCAG CCTGATCGCC
TCGGCGGCCG GGATCACCGA CGCTCAGTTC CTGACCTACG GAATCTTCTC GCCGCAGAGC
GGCAGCTACA CGCCGTCGTT CACTGCGGCG GGCGGCAAGA TCGTCGTCGA GTCCGGAGTT
CTGATCACCA CGGCTGCGCC CCTGGCGGTG ACCAGCGGCG GCGGTTCGGT GATCCTGCTC
GGCACCGAGG TGCTGAATGC CGGCACCATC ACCACGCCGA AGGGCCAGAC CGTGCTGGCG
GCCGGCGACA GCTTCATTCT GCGCGCAGGC TACAGCACCG ACGGCAACGC CACGTCGACC
ACCCGCGGCG TCGAAGTCGC GCCGGTACGA GCCGCCAACA GCCCCAACGG GGCGGTGAGC
AACAGCGGTC TCGTTTTCGC TCAGCAGGGC GACATCACGC TGGCCGGCCA TGCGGTGACG
CAGAACGGCC TCCTGATCTC GACCATCTCG GTCGATCAGC GCGGCACGAT CCATCTTTTG
AACTCGCGCA GCGATGCGAC CGGCAGCGTG ACGCTGACCG GCGACGCCAT CACCGCCGTG
TTGCCCGAGC TCGACAACAC GGCGACTGCC TTCAACAGCC AGCGCGACGC TTTGATCGCC
GACTCGGCGA AGCAGAACGC CAACCGCTTC AACGCGGGGC TGCCGCAGTT CGACAATCTC
AGCCCGATGA ACGACCGGCA GGATCAGTCG CGGATCGAGA TCGTCTCGGG CGGCTTGGTC
GACTTCAAGC CGAATTCGCT GACGCTCGCC CAGGGCGGCC AGATCTCGAC CTACTCGACC
ACGCGCATCC TGGTCGAAAC CGGCGCTACG CTCGACGTGT CCGGTACCGC CGGCACGGTG
CTGCCGGCCT CGGCCAACAG CCTCAAGATC AACGTCCAGC CCAACGAGCT GCGCGACAGC
CCGATCAATC GCGACAGCGG CGTCCTGATC AACAAGGACG TCTGGATCGA CGCCCGCGAT
CTGGTGCTGG TGCCGGCCGG CACCGGCGGC TACGCGTCTG ATCGCTATTA CACCAAGGGT
GGGCTGCTCG AAGTCGCCGG CTACGTCGCC AACACCGGCC ACACCATCGG CGAATGGACC
GCGGTCGGCG GTACCATCAC GCTGACTGCG CCGAACGTCA CGGCGCGCAA GGGCGCGTTG
TTCGACCTGT CGGGCGGCAT GGTGACCTAC GCGGCGGGCG ACATTCTCAC CGCCATTGTG
ATCGGCGCCG ACGGGCGCGC CTACAGCATC GGCGACGCGC CGGCGAATTT GCCGATCATC
AGCTACGGCA GCGGATTCAC CCGCGAACAT CCGCGCTGGG GTCAGAGCGA GACCTGGACC
AGCCCCAATC GCGGCGCGAC GCGCATCGTG CATCAAGAAT CCTACAGCGT CGGCCGCGAC
GCCGGCCTGC TGCAGATCTA TGCGGGCACG TCGGAGATCG AGGCGACGAT CGAGGCGCAG
GTCTATGACG GCGAGCGCCA GACTGCGGCG CGGCCCGCGT CGACCGCCGA CGGTTACAAG
CTTGGCCAGA CCCAGGTGCC GCTGGCGGGC CGGCTAGGGC TCGGATTCTA CGAGACCGCC
ACGGGCATCT ATGGCGACGC GCGGTCTTCC GTCGTATTCA CCGATGCGCC GCAATCGGCT
GGCGCGCAAG GTGCCGGCCA TGTGTTCGAC AGCGCAGCGT TGTCGAGCTA TCAGCTCGGC
GGCATCAGTG TTTCGACGAC GTCGACCATC GATATCAAGG GCGACCTGAC GCTGGCGCCG
GGCGGCGTGC TCGATCTCAA CGCCCAGGGT AAGATCGACG TCGCCGGCAA TCTCACCGCG
CCGGCCGGCC GCATCGCCCT CACATCGTCG GGCGGCAGCC TGACCGTCGA AAGTGGCGTG
GCGATCGACA CCCGCGGTCT GTGGACCAAC CTGTCGCTCG ATCCGCTCAA GTCCTACGGT
CTCGGCTATC TCAACGGCGG CGCGATCTCG CTGTCCGCAA CCAACCTGAC GCTTGCGGAC
GGATCGTCGC TCGACGCCTC CGCCGGCGGC GCCATCCTGG CCAACGGCAA GACCAAGGGC
GGCGACGGCG GCGACATCAC GCTGCGCGCC ACCGGCGCAA TGACGCTCGA CGGGGAGTTC
GCGTCCTACG GCTTCGGTAA GGGTGGCAAA CTCAATATCA TCACCCAGGG GCCGATCTCG
ATTGGCGGAA GACTGGCGGA CGGCGGCGTC GTCGCCGCGA ACCAGAGCTT CCTGATGGAC
CTCAAACTGG CCAAGGACAC CGTGTTTCCG GCGGGCAGCA TATTGCCGTT CGCGACGACC
ATAACGACGC CAAGCTTGTA TTTGCCTGGG CAGACGGTGG TCGGCACTGC CGAATACAGG
GGAGTCGATC GGTACGAATC CGTCCTCATC GGTCCAGGCG GCTGGACGGT CCCGGCCGGA
AGCAGAGCGA ACTACACGGA CCCGGATACG TACGACACCA CGTGGTACCA AGCCGGCGCC
ATCATCCCAG CTGGAACCCA GATCAACTAT ATTTATCTGA ACGCTCCGTC CGGATTTCTC
GTCAGCGCGA GTGCTTTCCC GAACGGACTT CCGGCCGAGC CGTCGACGCA GAGCGTCGCT
GCCGGCACAC CGATTCCCAG CGCCTACACG GCGCTCGCCG GCACCGTCAT TCCCAAGGAC
ACGGTGCTGC AGCAGGCCGT GGCGGTTCTG ACGGGGCCGC AGATCGCCGA CGCGGCGACG
TTCTTCAGCC GCGGTTTCTC GTCTTACGGC CTGACCAGCC CGGGCGGCGT GTACGTGCTG
CCGGGTGCAT CGATTCAGGT GACGCAGCCG GTGTTCCAGT TCACCACGGC GAGCCTGACG
GCGCCTACGG GCAGCAATCC GGCCATCGTC GCGCCCGTGG TCCTGCCGCC ATTGTTCACC
GAGGATGCCG TTCGCGGCAC GATCACGCAG CGCGCCGGCG CCGATCTGGT GATCGGCAAT
CCCAACGACA TCAATGTAAC GCCGGGCTCG ATGAACAGTT CGAGCGCGAG CCAGGGCGAC
TTCGTGCTCG GCCGCGGCGC GTCGATCACG GTCGATCCCG GCCGCTCGGT CAGCGTCGGC
AGCTCCGGCC AGATCACCAT CGACGGCAGC ATCACCGCCC GCGGCGGCGC GATCACCCTC
GTCAACAATG CGTGGCAGCC ACAGGTGCTG TCGACTGCGC CCTATGTGTT GAGCGGCCGC
TCGGTGTGGA TCGGCAGCGA GGCGCGGCTC GACGTCTCCG GTCTCGCCTA TACGGCGTTC
GATCTCAAAG GCCGTGCATA CGGCTCTCTG AGCGCCGGCG GCTCGGTGGT GCTGGGCGCG
CTCAACGTTA CGCCGGCGAG CTATGGGCTG CGGTCCGGCA ACGCCTTCGT GGTGATCCGC
CCCGGCGCGG TGATCGACGC CTCCGGCGCC AGCGCCACAC TCGACATGCT GTCGTCCGAC
AACCCGGCCG ACGCCCGTCT CGTCGCCAGC GACGGCGGCC TGATCGCGCT GGCATCGCAA
TGGGGCATCT ACAACGACGG CCGCCTCAAT GCCCGCGCGG GCGGTGCAGG GGCCAGCGGC
GGCACGCTGG TCTATACGCT CGAGACGCCG CCGATCGCTT TGGAATCCTT TCAGGACCCA
ATCGCCCTGC TGCCGGTCGA TGCGCGCGCC GCGCGGGTGA TCACGGTGGC GCAGACCTAT
GGCGCGAGCA GCCTGTCGAG CAACCTCGTC GCCGGTGCGA CCGATACGGG CCTTCGGCTC
GGCCAGGCCC GGCTCGGCGT CGATCAGATC CAGGCCGGCG GCTTCGGCTC GCTGTCGCTG
TGGGCGCGGG GCGCCATCCT GTTCGACGGC AATGTCAGCC TGGCGATGTC GCGCAGCGTG
ACGCTGCGGC AGGGTCCGCT GATGAACAGT TCGGCGACCG GCCAGGCGTC GGTGTCGGCG
CCGTACGTTC TGCTCGACGG CATGACGGTG ATGGGGGATG ACAGCTACCA GTCTGTCTAT
CTCCCGAACG AACGCGCGGC CTATCCGACC GGCGCCGCGC TGACCATCTC GGGCGATCTC
GTCGATGCCA GAAACCGGGT GCGGACCACT TATCGCGATA CAAACGTGAT CAGCTCCGGC
GATCTGCGCT TCCTTGCCGC CACTGCCGCG ACGGAAGTCG GCCAGAACGG CACCGCCACC
TCCGGCCGGG TCACCACGCT CGGCGCCAGC GGCAATCTGA CGCTCACCGC CGCGCAGGTC
TATCCGGCCT CCGGCGCGCG CGCGGTCGCT GCCGCTGGCG TGACGCAGAC CACTTGGCAG
GGCGCCCTGA CCTATGGCGC AACCGGCTCG GTGCTGACCA TCCGCCGCAA CGGCGACACG
CCGGCGATGC CGTATTCGCT GTTCGGCTCG CTCGAACTCT ATGCGGACAA TATATATCAG
GGTGGTGTCG TTCGCGCGCC GTTCGGCTCG ATCATCGTCG GCCGCAGCGA TTTGAATGGG
GGGAAGGCTC CGACCCAGGT CGAATTGCTG TCCGGCAGCA TCACCTCGGT GAGCACCGAC
GGGCTGGTGA TGCCCTATGG CGGCTCAAGC GACGGCGTCA ACTATCTCGT AGACGGCGCC
GCGGCGGTGT TCGCGGATCT GGGAGGCGCC TTCGCGCTCA AGGACGGCAC GCGGGCGAGT
ATCGGCATAA CGCTTAATGG CCGGCGGCAG ACGGTCGATG CGGGCGCGCT GCTCGATCTG
TCGGGTGGCG GTACGCTCAC CGGAGCGGCC TTCCTCACCG GCCGCGGCGG CTCGGTCGAC
GTGCTCATGA CGCCGCTCGC CAACGCCAAT CCCGGCAACA CTTACAGCTC GGCCCGCAAT
TCCGTTTATG CGATCGTTCC TGGCGTGGTG ACGGCACCGG TCACTGGCGG CTACAACACG
GCGTGGACCG GATCGACGCC CGGCATCGGT CAACAGATCA CCGTTCCGGC CGGTGTTCCC
GGCCTGCCGG CCGGCACCTA CACGCTCTTG CCGGCAAGCT ACGCGCTGTT GCCCGGCGCC
TTCCGTGTCG AACTCGCGGC CACGACGACC ACGCGGCTGC CGAACGTGGC GACGCTTGCC
AACGGCTCGA CCATCGTCCC GGTCTATCGG GGTATCGCCG ACACCTCGAT CGTCAATGCG
CTGCCGACCC AGGCCATCGT CACTCCGGCG CCGACGGTGC GGAAATATTC GCAATACCAG
GAGCAGAGCT ATACCGGCTT CGTGCTGGCC CAGGCCGCTC AGTTCGGCAA CATCCGCTAC
ATGATCGAGG CGGACGCTCA CACCTTGACC GCCAGCGTGC GCTCGACCGC GGCGGATGCC
GTCAACAGCG CCTTTATGTT CAGCGGCAAG GCGGATTTCA CCGCGGACAC CGGTGGGCTG
TCCGGCGTGT TTGTGCTGGC GCCGGCGCGC GGGATCTACG ACAACGCCAC CACGGATCTC
GTGATCACGG GCGATGCTTC TCAGGCGGTC AACAGCGCCA ATGTAACGAC GGTCTCGGCG
GCGGCGATCG CGGCCGTCGA CGCACCGAAC GTCTCGCTTG GCAGCAGTGT CTACAGCTTG
GGCAGTCTCA ACGGCAACAC TGACCGCAAT GTCTATATCA CCGCCGCCGC CCGCAGCCTC
ACGCTCGACG CCAACGTCCA CCTGACGGCG GACTCCATCT TCCTCGGCGC GGTCAATACC
GTCACGCTGT CGGAGGGGGT GATCGTCTCG ACGCTAGGCC ACGGCTACTC CTGGCTCAAT
ACCGCCACCG GCCTCAAGTT GGGCTACATC AACGGAGGCG CTGTGCTTGG CGCCTCCAAT
GACACATTCA TCATCGGCAC GCCCACCGGC TCGACCACCT CGATCCTGAT GAAGAACGGC
AGCGGCCTCT ATTCGGAGGG CAGCATCGGA TTCTTTGCCG ATCGCGGGCT GTCAACGCAG
GGCAGCATCG GCCTCGGCAC CCGGCGGCTG GCGCTCAACG CGGCGTCGCT CAACATTGGC
ACCGAGGCTT CGCTCGCCGC GGTTTCGAGC GTGCTGCCGG CCGGGCTCAA TCTCACCCAG
ACATTCCTCA ACACGCTGTT CGCCGGCGGC GGCGTTCCCG GCGCGCCGGC GGTCGAACAA
TTGACGCTCG GTGCCCGCAA CGGCGTCAAT TTCTTCGGCG ACGTCACCCT CAGCACCATC
GATCCCGCGA CCGGCAAGTC CCGTCTCGCC GAGCTGGTGC TGGCCGGTCC TTCGATCTAC
GGCTACGGCA CCTCGGCTGA CACCGCCCGG ATCGTCACCG ACACGCTGGT CTGGCAGGGC
AATTCGCAGC CCGTGGCGCG CATCGCCGGC CTCGGCAACG CGGGCGCATT CGCGGTCGAC
GCCCGGCAGA TCCTGTTCGG CACGCCGAGC AACGTCGCGC CGGTGAGCGG CGCGAACTAC
AATCGCCTGA TGCTCGGCTT CGACCGCGTC CAGTTCGGCG CCTCCGAGCG CATCAGCGCC
AATGGCAGCG GCACGTTGTC GGTCTATGGC AGCGGGCCGG CGGTGACGTC GAGCTTCGAT
CCCGCGACCT ATCGCGGCAC CGGCAATGCG CTGAGCTTGC AGACGCCGCT GATGACGACC
GAATCCGGCG CGACCTTCAG CCTCTACAGC AACGCCATCG ACGTGACGGC GCCTGCGGGC
AGTTCGGCCG GCGCAGCCGG CAGCCTCGGC GGCACACTGT CGATCAATGG CGACAGCCTC
ACGGTGGCCT CGGCGATCGT GCTGCCGGCG GGCCGGCTCA GCCTGACCGC GCGCAACGGT
GATCTCGTTC TCACCGATGC GGCGCGGCTC GACGTCGCCG GTCATGCTAT CGATTTCAAC
GGCGTGCTCA AACAGGCTTG GGCGGGCGAC ATCTTGCTGA CCAGCCAGAC CGGCAACGTG
ATAACGGGGC AGGGCTCGGT GATCGATCTG TCGTCGGAGG GCAGCGACGC CGGCAATCTG
ACCATCACGG CGGCGGGCGC CGCGCGTCTT GCCGGCGCCC TGAAGGGCTC GGGCGGCGGC
AACGGCGCGC GCGACGGCGC GTTTAACCTC ACCGCCGACT CGCTCGGCGG CGATCTCACC
GCATCGTTCG CGGCGCTGAA CCAGGCGCTG AACGGCAGCG GTTTTGGCTA CAGCCGCGCC
TTCACCTTCG CCAGCGGCAG CCTTGTGGTC GGCAACGACG TCCGCGCCCG CAACGTGGCG
ATCGCGGTCA ACGGCGGCAG CCTGACCGTC GCCGGCCGCA TCGACGCCGC AGGCCGCGGC
ACCGGTTCGG TCCGGCTGTC GGCGCGCGAC GACGTGCGGC TGACCGGGAG CGCCGTGCTT
GACGTCCGTG GCACGTCGCT GGTGCTGGAC AGCTATGGTC AGGTGATCGA GGCCTCCAAC
CGCAACAGCG TCGAGATCAA CACCGGCGCG GCCGGCTGGC TGCGCCTCGA TGCCGGCGCC
ACCATCGATA CCTCGTCGCC CGACGGCGTC GCGCGCGGCC GTGTCGACCT CAACGCCCGC
CGCAGCACGG AGACCGGCGG CGACATCCAG ATCGACGCCT CCGGTCCGCT CAATCTGCGC
GGCATCGGCA GCCTCGCCGT GAATGCCTAC TGGACCTACA GCCCGAGCGA TGCCAACGGC
TCGATCGTGC AGGACAACAC CACGACGGGA GTTCCGACCG GCGCTGTCGG CTTGAAGCAG
ATCGACGCCA CCAACCAAAC CTTCTACGCC AATGCGCTCG GCAATAGCGG GCTGCAGACC
CGTCTCACCG GCCTGAAGAC TGCGGCGGGC GCCGCCTATC ACCTGCGGCC GGGCGTCGAG
ATCGTCTCCA GCGTTGCCTC GGGCGGCAAG CTGACGGTGC TCGGCGATCT CGATCTCGCC
GGCTATCGCT ACGGGGCGGA CGCCGACCGC AACTCGGCTT CGGTCACCTA CGGCTTCGGC
GAGCCGCTCG CGCTCGCCAT CCGCGCCAGC GGCAATCTCG ATATCAAGGG CTCGATCACC
GACGGCTTCG GCGTGCCGAA GACCAGCCCT GACGAGAACA ACTTCCGCGA CCGGACGATC
TCGACGACCT CTACGATAGA CATGGGATTA TTTCCCGGCG GGTTTACCGT GGACGATGGC
GACACATCGC ACGACACGAC AGCCGGCTAT TTCGACGCCG ACAATTGGTG GGCCTATGTG
TACCCGCAGC CCATGACCCT CGTCCACGAT TGGACGAACA CCGAATGGAC AGCGATGAAA
GACAGCACCG GCCGCACTTA CGCGCCGGGG ACTGTGGTGC CGGCAGGCAC CACATTGGTC
GGATATGTCG GAGGCTATCC ACTCCATTTC AACGTCGGCA CAGTCGTTCC CGACTTCCGG
TTGATCGTCA CCACCCTCTC CAAGGGCAAG ATCTACGCGA TCGCGCCGAT GCTGGCGCCG
GGGGCGATGT CGGCTTCGAT CCGGCTGGTC AGCGGCGCCG ATCTCGCCTC GGCCGACAGC
CGTGCCTTGC TCTCCGCCAC CGCTCTGGCG GGCGTGGGCA ACATGACTCT CGACGACTAC
CACACCCACA CGCCGTCCGG GTCCGGCGGC GACAATCAGA CGTCGCAGAT ATCGAGCGTG
ATCCGCACTG GCACCGGCGA TCTCGATCTG CTGGCCGGTG GCTCGCTGGT CACCAAGTCG
CTGTTCGGCA TCTACACCGC CGGCACCCAG AGCGCGCCCC GCTCCGACGA TAGCGACTTC
ACCACCAAAC TCCCGGCCGG GTCGTTGTCG ACCCCCGCTT TGATGGCGAC GCTGAACGAC
CGCATCAGCT ACTATCCGGA GCATGGCGGC GACCTGCGCG TCGTCGTGCA GGGCGATTAC
ACCGGCTACC AGACCTTGCA GACGCTGAAT GCCGGCAACT GGCTGATTCG CCAGGGGGCC
AACGAGATCG GCCAGAAGAC CGCCTGGTCG GTCAATTTCG GCAAATACTA TAATAGCGGC
TCATCGATCT CCGTCGATGC GATCAACGGC TTCGGCACCT TCGGCGGCGG CAACGCGGCC
GTTACGATCG GCGGCAACGC CGGCGGCATC GCGCCTTATG ATTATTTTCT TCAGAAATAC
ACCGGCCTGA TCGTCACCGT CGCGTCCACC GGCCGGGTAC TCGGCGGCGG CACACTGGTC
GAGACCGGCG GCGGGACGGT CGATATCGCC GTCGCCGGCA AGCTCAATGA GAAGTCGAGA
GGCGGCTCGA TCGTCGATCT GCGCGGCGCC GTCACGCTCA CCGCCGGCCA GATCGGCGCC
CTGTCCAAGA CCTATAACAC AGTGACGATC GGCGATCCAC GTCCGAGCTC GCCACTGGTG
GCTGGCGGGT TGGACAACAA CGGTACCTTT GCTGGCCCTG GCTTGGGCCT CGGCGATGCC
ATGGCCACCG TGCGGTCGCG CGGCGATATG AATATGAGCG GCGCCGGCGA TCTTGGCATC
CTCGACAGCG CGACGACGGG CGACATCGCC AGCCCGACCA CCGGCAGCAC GGTCGCGAGC
TGGTTCTCGC TGATGCGGGC GTCCACCGCG GTCGACATCA TGTCGCTCGG CGGCGATCTC
GTCACATCCA TTGGGATGTC CCCGATCTAC ACGGCGGCGG CGGCCAGCGG CAGCATCTAC
TACATCAACG GGAACGGGAT AAGTCAGGAG CCGGCACCTT TGGCGCAGAT CGAGCTGCTC
GCCCGCAATT CGATTTACGC CTCCGGACAA TCTATTGGCG GGATGTCGCC GGCCGATCCG
GCGCTGCTGC CGACGCCGTT CCACCCGGCC TTCGTCGAAT GGACGAATAG CGTGATGTCG
CGCACCAATA CGCTGACGAC GTACGATGCC TTCACCGATA TCTACACTCC AACCTTCGGT
CTCTACGCAC TGGCTCCGAA CGTCGCCACC AGCGATCTGC ACGCCGGCGA TCCAAACCCG
GTACGCATCT ATGCGGTGAA CGGGGATATC GTTGGGCTGC AGCTTGCGAC CGCCAAGCGG
CTCGAGATGA TCGCGGCCGG CGACATCGTG CTTCCCGGCA CCAGTTTCGC CGCCGGAATA
TCCATCCTCC ATGCCGACGC CAAGGACGTC TCCGTGATCC AGGCCGGTGG CAAGATCATC
TACCCGAACG TCCGCATCGC CGGCCCCGGA ACACTGTCGA TGACCGCCGG CGGCGACATC
AACCTCGCCG ACAAGGGCAA CATCGTCAGC ACCGGCCCGG CCTATGCGGC CGATCTGCGG
CCGGGGGCCT CGGTCGCGCT GCTCGCCGGC GCCGGCAGTA CCGGCGCCGA CTATGCCGCG
GTGGCGTCGC TCTATCTCGA CCCTGCCAAT CGCGCCGTCA GCGGTACGCC GCTGGCCGAT
CAGCCCGGCA AGGTCGCCAA GACCTACGAG GCCGAGCTGA AGGATTGGCT GGCGCAGCGT
TACGGCTACG CCACCCGCGA CGACGCCGAC GCCCGCGCTT ACTTCGGCGG GCTCAAACCG
GAGCAGCAGA ACATCTTCCT GCGCCAGGTC TACTTTGCCG AGTTGAAGGC CGGCGGGCGC
GAGTACAACG ACGCCTCCAG CCCGCGCTAC GGATCCTATC TGCGCGGGCG CCAGATGATC
GCCACGCTGT TCCCAGACCG TGACGCGCTC GGCGGGCCGA TCACCTACGA CGGCGACATC
TCGATCTTCT CCTCCGTCGC AAGCAACGGA TTCCAGAACG GCGGCGGTGT GCGCACTATC
GGCGGCGGCG ACATCGCGCT GCTGACGCCC GGCGGCCGCC AGGTGATCGG CGTCGAAGGC
GTGACGCCGC CAGCCGCCTC CGGCCTGATC ACGCAAGGCG CCGGCGACAT CAACCTCTAT
AGCAAGGGCT CGATCCTGCT CGGCCTGTCG CGCATCATGA CCACCTTCGG CGGCAGCATC
CTGGCCTGGT CGGCGGAGGG CGACATCAAC GCCGGCCGCG GCTCCAAGAG CACGCAGATC
TACACGCCGC CGAAGCGGAT CTATGACAAA TACGGCCATG TCGTGCTGTC GCCGCAGGCG
CCCTCGACCG GCGCCGGCAT CGCCACGCTG AATCCGGTGC CGGGCACCAC CGCCGGCGAC
GTCGACCTGA TCGCGCCTGA AGGCACCATC GACGCCGGCG AGGCGGGCAT CCGGGTCTCG
GGCAACATCA ACCTCGCGGC GCTGCAGATC GTCAATGCCG CCAACATCCA GGTTCAGGGG
ACATCGACAG GCTTGCCGAC CGTGCAGGGC CCGCCAGTCG CGGCGCTGAC GGCTGCAAAT
AACGTCGCAG GTGCCGCGAC GCCAGCCGCC TCCGGCCCGT CGCAGAACAA CGACCGACCG
TCGATCATTC TGGTTGAATT CCTCGGCTTC GGCGGAGGCG ACGGCAGCGG TCAGCCTGGG
TCTGGCAATC AGGGCCCAAA AGACAACTCA CGCGAGCAGC ACAGCTATAA CGTCAACAGC
CCGGTTCAAC TCGTCGGCAA TGGCCCGCTG ACTGAGGCGC AGATGAGCGC GCTGACCGCC
GAAGAGAAAA GGAACCTCGA GAGTCGCTGA
 
Protein sequence
MSFNRSSRPR RSRQMSKPRP LAVLPLSRKT LLHGTSLLAL LLLPTIAAEA RSLGASATSS 
ATAIAADAAA QAAAQAASAA AQGSASLSRA TAALQAMQAA QAAARAAAGS AAGVPNGLTV
GGLVPDSGLA AAGVARPVVS WTNARTPVQS GPTDSPTVKV QQTGAQAILN WSSFNVGAST
SLVFDQQGNS SWVALNRVGA SSSPSQILGN IKADGSVYII NQSGIIFGGG SQINVGSLIA
SAAGITDAQF LTYGIFSPQS GSYTPSFTAA GGKIVVESGV LITTAAPLAV TSGGGSVILL
GTEVLNAGTI TTPKGQTVLA AGDSFILRAG YSTDGNATST TRGVEVAPVR AANSPNGAVS
NSGLVFAQQG DITLAGHAVT QNGLLISTIS VDQRGTIHLL NSRSDATGSV TLTGDAITAV
LPELDNTATA FNSQRDALIA DSAKQNANRF NAGLPQFDNL SPMNDRQDQS RIEIVSGGLV
DFKPNSLTLA QGGQISTYST TRILVETGAT LDVSGTAGTV LPASANSLKI NVQPNELRDS
PINRDSGVLI NKDVWIDARD LVLVPAGTGG YASDRYYTKG GLLEVAGYVA NTGHTIGEWT
AVGGTITLTA PNVTARKGAL FDLSGGMVTY AAGDILTAIV IGADGRAYSI GDAPANLPII
SYGSGFTREH PRWGQSETWT SPNRGATRIV HQESYSVGRD AGLLQIYAGT SEIEATIEAQ
VYDGERQTAA RPASTADGYK LGQTQVPLAG RLGLGFYETA TGIYGDARSS VVFTDAPQSA
GAQGAGHVFD SAALSSYQLG GISVSTTSTI DIKGDLTLAP GGVLDLNAQG KIDVAGNLTA
PAGRIALTSS GGSLTVESGV AIDTRGLWTN LSLDPLKSYG LGYLNGGAIS LSATNLTLAD
GSSLDASAGG AILANGKTKG GDGGDITLRA TGAMTLDGEF ASYGFGKGGK LNIITQGPIS
IGGRLADGGV VAANQSFLMD LKLAKDTVFP AGSILPFATT ITTPSLYLPG QTVVGTAEYR
GVDRYESVLI GPGGWTVPAG SRANYTDPDT YDTTWYQAGA IIPAGTQINY IYLNAPSGFL
VSASAFPNGL PAEPSTQSVA AGTPIPSAYT ALAGTVIPKD TVLQQAVAVL TGPQIADAAT
FFSRGFSSYG LTSPGGVYVL PGASIQVTQP VFQFTTASLT APTGSNPAIV APVVLPPLFT
EDAVRGTITQ RAGADLVIGN PNDINVTPGS MNSSSASQGD FVLGRGASIT VDPGRSVSVG
SSGQITIDGS ITARGGAITL VNNAWQPQVL STAPYVLSGR SVWIGSEARL DVSGLAYTAF
DLKGRAYGSL SAGGSVVLGA LNVTPASYGL RSGNAFVVIR PGAVIDASGA SATLDMLSSD
NPADARLVAS DGGLIALASQ WGIYNDGRLN ARAGGAGASG GTLVYTLETP PIALESFQDP
IALLPVDARA ARVITVAQTY GASSLSSNLV AGATDTGLRL GQARLGVDQI QAGGFGSLSL
WARGAILFDG NVSLAMSRSV TLRQGPLMNS SATGQASVSA PYVLLDGMTV MGDDSYQSVY
LPNERAAYPT GAALTISGDL VDARNRVRTT YRDTNVISSG DLRFLAATAA TEVGQNGTAT
SGRVTTLGAS GNLTLTAAQV YPASGARAVA AAGVTQTTWQ GALTYGATGS VLTIRRNGDT
PAMPYSLFGS LELYADNIYQ GGVVRAPFGS IIVGRSDLNG GKAPTQVELL SGSITSVSTD
GLVMPYGGSS DGVNYLVDGA AAVFADLGGA FALKDGTRAS IGITLNGRRQ TVDAGALLDL
SGGGTLTGAA FLTGRGGSVD VLMTPLANAN PGNTYSSARN SVYAIVPGVV TAPVTGGYNT
AWTGSTPGIG QQITVPAGVP GLPAGTYTLL PASYALLPGA FRVELAATTT TRLPNVATLA
NGSTIVPVYR GIADTSIVNA LPTQAIVTPA PTVRKYSQYQ EQSYTGFVLA QAAQFGNIRY
MIEADAHTLT ASVRSTAADA VNSAFMFSGK ADFTADTGGL SGVFVLAPAR GIYDNATTDL
VITGDASQAV NSANVTTVSA AAIAAVDAPN VSLGSSVYSL GSLNGNTDRN VYITAAARSL
TLDANVHLTA DSIFLGAVNT VTLSEGVIVS TLGHGYSWLN TATGLKLGYI NGGAVLGASN
DTFIIGTPTG STTSILMKNG SGLYSEGSIG FFADRGLSTQ GSIGLGTRRL ALNAASLNIG
TEASLAAVSS VLPAGLNLTQ TFLNTLFAGG GVPGAPAVEQ LTLGARNGVN FFGDVTLSTI
DPATGKSRLA ELVLAGPSIY GYGTSADTAR IVTDTLVWQG NSQPVARIAG LGNAGAFAVD
ARQILFGTPS NVAPVSGANY NRLMLGFDRV QFGASERISA NGSGTLSVYG SGPAVTSSFD
PATYRGTGNA LSLQTPLMTT ESGATFSLYS NAIDVTAPAG SSAGAAGSLG GTLSINGDSL
TVASAIVLPA GRLSLTARNG DLVLTDAARL DVAGHAIDFN GVLKQAWAGD ILLTSQTGNV
ITGQGSVIDL SSEGSDAGNL TITAAGAARL AGALKGSGGG NGARDGAFNL TADSLGGDLT
ASFAALNQAL NGSGFGYSRA FTFASGSLVV GNDVRARNVA IAVNGGSLTV AGRIDAAGRG
TGSVRLSARD DVRLTGSAVL DVRGTSLVLD SYGQVIEASN RNSVEINTGA AGWLRLDAGA
TIDTSSPDGV ARGRVDLNAR RSTETGGDIQ IDASGPLNLR GIGSLAVNAY WTYSPSDANG
SIVQDNTTTG VPTGAVGLKQ IDATNQTFYA NALGNSGLQT RLTGLKTAAG AAYHLRPGVE
IVSSVASGGK LTVLGDLDLA GYRYGADADR NSASVTYGFG EPLALAIRAS GNLDIKGSIT
DGFGVPKTSP DENNFRDRTI STTSTIDMGL FPGGFTVDDG DTSHDTTAGY FDADNWWAYV
YPQPMTLVHD WTNTEWTAMK DSTGRTYAPG TVVPAGTTLV GYVGGYPLHF NVGTVVPDFR
LIVTTLSKGK IYAIAPMLAP GAMSASIRLV SGADLASADS RALLSATALA GVGNMTLDDY
HTHTPSGSGG DNQTSQISSV IRTGTGDLDL LAGGSLVTKS LFGIYTAGTQ SAPRSDDSDF
TTKLPAGSLS TPALMATLND RISYYPEHGG DLRVVVQGDY TGYQTLQTLN AGNWLIRQGA
NEIGQKTAWS VNFGKYYNSG SSISVDAING FGTFGGGNAA VTIGGNAGGI APYDYFLQKY
TGLIVTVAST GRVLGGGTLV ETGGGTVDIA VAGKLNEKSR GGSIVDLRGA VTLTAGQIGA
LSKTYNTVTI GDPRPSSPLV AGGLDNNGTF AGPGLGLGDA MATVRSRGDM NMSGAGDLGI
LDSATTGDIA SPTTGSTVAS WFSLMRASTA VDIMSLGGDL VTSIGMSPIY TAAAASGSIY
YINGNGISQE PAPLAQIELL ARNSIYASGQ SIGGMSPADP ALLPTPFHPA FVEWTNSVMS
RTNTLTTYDA FTDIYTPTFG LYALAPNVAT SDLHAGDPNP VRIYAVNGDI VGLQLATAKR
LEMIAAGDIV LPGTSFAAGI SILHADAKDV SVIQAGGKII YPNVRIAGPG TLSMTAGGDI
NLADKGNIVS TGPAYAADLR PGASVALLAG AGSTGADYAA VASLYLDPAN RAVSGTPLAD
QPGKVAKTYE AELKDWLAQR YGYATRDDAD ARAYFGGLKP EQQNIFLRQV YFAELKAGGR
EYNDASSPRY GSYLRGRQMI ATLFPDRDAL GGPITYDGDI SIFSSVASNG FQNGGGVRTI
GGGDIALLTP GGRQVIGVEG VTPPAASGLI TQGAGDINLY SKGSILLGLS RIMTTFGGSI
LAWSAEGDIN AGRGSKSTQI YTPPKRIYDK YGHVVLSPQA PSTGAGIATL NPVPGTTAGD
VDLIAPEGTI DAGEAGIRVS GNINLAALQI VNAANIQVQG TSTGLPTVQG PPVAALTAAN
NVAGAATPAA SGPSQNNDRP SIILVEFLGF GGGDGSGQPG SGNQGPKDNS REQHSYNVNS
PVQLVGNGPL TEAQMSALTA EEKRNLESR
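
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a listing might be processed, the code below strips the block spacing, computes the GC fraction, and translates the DNA with the codon assignments of NCBI translation table 11. The helper names are hypothetical, and the sketch assumes the sequence begins with ATG (table 11 also permits alternative start codons, which are not handled here).

```python
from itertools import product

# Codon assignments of NCBI translation table 11, in the usual TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, then upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing in this record.
first_line = "ATGTCCTTCA ACCGCTCGTC CCGCCCGCGC CGTTCCCGTC AGATGTCGAA GCCTCGGCCG"
dna = clean_sequence(first_line)
print(translate(dna))  # MSFNRSSRPRRSRQMSKPRP
print(f"GC fraction of this fragment: {gc_fraction(dna):.2f}")
```

The translated fragment matches the first twenty residues of the protein sequence listing. Applying the same functions to the full gene sequence would give a whole-gene GC fraction to compare with the GC content field.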