Gene RPB_2234 details

Gene Information

Locus tag: RPB_2234
Symbol:
ID: 3909017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2545909
End bp: 2557938
Gene Length: 12030 bp
Protein Length: 4009 aa
Translation table: 11
GC content: 66%
IMG OID: 637884129
Product: filamentous haemagglutinin-like protein
Protein accession: YP_485850
Protein GI: 86749354
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
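
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon (the record lists the gene as not spliced).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2545909, 2557938)
print(length)                     # 12030
print(protein_length_aa(length))  # 4009
```

Both values agree with the Gene Length and Protein Length fields of this record.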


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATGGCGT CGAACGGCAA CCTGCGTAGG ACTTTCTCGC GTCGCGGAGT ATGGCTCACC 
ACGGTCAGCG CGGTCGCGAT GTTGCTGCTT CCCGCGGCCG GTCAGGCCCG ATCGCTGAAT
GGCGCGGGTT CCGAAGTCCA ATCGGCGCCG AATGTCGCAG CCGACGCGGC GTCGCAGGCG
GCACAGCAGG CGGCGGCGGC TGCGCGGCAG ACCGGGGACT CGCTGGCGCG TGCCGCACGG
GCTGTCCAGG AAATGCAGGC AGCGCAGGCG GCGGCTCGCG CGGCTGCAGC CGTCGCACAG
GCATCTGCGG TCGTCCCGAA CGGGCTTGGC GTGGGAGGTC TGCTGCCGAA CGTCTCGGCC
GGATGGAGCG GAGCCAAGAC GCCGACGCAG AGCGTCGACG CAGATGGCCG GACCCAGGTC
GGCATCGAAC AGACGTCGCA GCGAGCGATC TTGAATTGGC AAAGCTTCAA CGTCGGCGCG
CGGACGACGT TGACGTTCGA CCAGAAGGGC AACGCGAATT GGGTCGCGCT CAATCGCGTC
GACAGCGCGA CGGCGCCCAG CTTGATCCTC GGCAATATCC GGGCCGACGG CCAAGTCTAT
GTGATCAATC AGAGCGGCAT CATTTTCGGC GGTGGCAGCC AGGTCAACGT CGGGGCGTTG
ATCGCGTCGG CTGCCGGCAT CACCGACAGC CAGTTTCTCA CCAAGGGCAT CTTCAGTAGC
CAGAGCGGCG GAGCTTACGC GCCGAGCTTC ACGGCCTCCG GCGGCAAGGT GGTGCTGGAA
ACTGGCGCGA CGATTAGCAC ACATGCACCG GCCTCGGTGA CCTCCGGCGG TGGCTTCGTA
CTGCTGATCG GCTCCGAGGT CGCCAATGCC GGCATGATCG GCACGCCGAA CGGTCAGGCT
CTGCTCGCCG CAGGCGACAG CTTCATCCTG CGCCCCGGCT TCGGCACCGA CAGTAACGTC
ACATCGACGA CACGCGGCAT CGAGATCGCA CCGGTGATCG CACAGGGCGG CACCGCTGGC
GGCGTCGTCA ATAGCGGGCT GATCGTGGCC CAACAAGGCG ACATCACGCT GACCGGTCGC
ACCATTACCC AGGCCGGGGC GCTGGTGTCG ACGACCTCGG TGAACAGTCG CGGAACCATT
CATCTGCTCA ATTCGGCCAG CGACGATACT GGAACTGTTA CGCTCGCGGG CGGCAGCCTC
AGCGCGATCC TGCCGGAGCT CGTCAGCACC GACACGGCGC TGGACGCCCA GCGCGACGCG
CTGATCGCCG CATCCGCCGC AAACCTCGCG CGGCCGAGCG GCGCCACTGG CGTGTTCGAC
AATCTTTCCA GTCTCGCCGA CCGTCAGGAT CAGTCCCGGA TCGAGATCGT CACCGGCGGG
ACGATCAACT TCAAGAACGG CTCCTACACG GCGGCGCAAG GCGGCCAGAT CGCTGCGAGC
GCGGGAAAGC GCATCTTCGT CGAGGACGGC GCCGCGCTCG ACGTCTCCGG CGTCCGCCAC
GTCGCGGTGG CGATGGCGTC GAACAATATC AAGGTCAACG TCCAGGGCAA CGAGCTGCGC
GACAGTCCGC AGAATCGCGA CTCTGAAGTG CTCAAGAACA ACGACGTCTG GATCGACATC
CGAGACCTCA CGCTGGTGCC GGCGGGCACC GGCGGCTATG CGTCGGACCG TTACTACACG
GCCGGCGGCC TGCTCGAAGT CGGCGGTTAT CTCGGCACCA CCGCCCACAG CATCGGCGAG
TGGTCGGCGC TCGGCGGCAG CATCACGCTG GCCTCCGCTG AAGTGATCAC ACAAAGAGGA
TCGTTGGTCG ATATTTCGGG CGGTTCGCTG GACTACGCGG CCGGCTGGAT CCGTTCGACA
AACTTGATCG GAAGCGACGG CCGCAGCTAC AGCATCGATC AGGCGCCCAG CGACCTGACG
TTCACGAGCT TCGGCGGAAG TTTCTCGCGC CGCCATAGCA TCCAGGGTAA AACGGACAAC
CGCCTGACCG AAATCTGGGG CTCCGTCTCG GGGCGCGGGC GTGATTCGTA TCGTTGGGAG
GAAGGATACA GCGTCGGCCG CGATGCAGGC CGGCTGACCG TTTTCTCGCC CACCGTGCTG
CTGGATGGCG ACATCGTTGC CGATACCATC GTCGGCGACC GGCAGGTGAG CAAACGTAGC
GCCGGCGTCG CCGATGGCTA CAAGGCGGCG CAGACCACCG TTGCGCAGGC GGGCGGCCTG
GTGATCGGCC GCAGCAACGG GATCGATGAA GACGGTGCTT TCGGCACGCC GATCGTTCTC
GCCCAGGTCG GCCCGACCAC CTCGGAGCTG ACCGCCGATG CCAGCCTTTC GACTCAGGCG
ACCAATCAAA TCCAGCTCGA TGCCGCCCGG CTCAGCAGTT TTGGGCTCGG CTTGCTGAGG
CTGACCTCGT CCAGCTCCAT AACGGTTGCA GCTCCACTAA CTCTCGCCGA TGGCGGATCG
TTGCAACTGA TCGCGCCGGA TGTCGGCATC AACGCCAACG TGACGGCGCG CAGCGGTTCG
GTGTCTATCG GCAACGTCGC CCCGGTGTCC AAGTCGGCTT CGAGCGCTGC GACGCCGCTG
TTCGTCGATG GCACCGCGAA CTTCACTTTG GCCAATGCGG CCACGATCGA CCTCCGCGGC
GTTTGGACCA ACAGCTCGCT CGAGCGTGAC GAGCCCGATC AGGCCGCTTA TGTCGATGGC
GGTAGCCTTG ACGTGCGCAT GACGCACGGC TCCGTCGTCG TCGAATCCGG GACTACGATC
GACGTCACCT CCGGCGCAAC GCTGACGTCG AAGGGCAGAC TGATCGGTGG TCGTGGCGGC
AGCGTAAGCT TGGTCGCCGG AGCCGATATC GTCGAGGGCG TCTCCGACTC AATTCCGGCA
TCGGCAAAGC TCGTGCTCGA CGGAACGATC CGGGCCGAGG GCGTCGTTGG TGGCGGGACG
TTGACTCTGC GCGCGCCGCA GGCCGTGACG TTCGGCGGAA ATGCGATCCT CGCCTCGGGT
GTGCTGCCGC CCGGCGTGCC GTTGCCGGCC TCGGTGCAAT TGACTGAAGG CTTTGTGCTG
CCCGCCGGAA CGGTGATGAC CTTCGCGGCG ACGCGGACGC TCGACGTGTT TGCGCCGGGC
GCGCCGATTC CGATGGGCGC ACGGCCGCTG AACGGCGTTC CGACGATCCT CGCGGATAGC
TGGACCGTTC CGGTCGGCGT GGGCGGCACC GCGAACGGCA ACACTCCGCT CGTCGCCGGA
CGGGTGATGC CGGCGGGGAC GTCGGTCGTG CTGTACAGCA TGCCGGGCGG CTATGTGCTG
CCGGCCTCGG CGTTTCCGAA TGGACTGCCG GCACAGCCTT ACACGGCGAC CCTGATGCCG
GGCGATCGCT TGTTGAAGGC GGTGCGTTTC GAAGCCGGCG AGATTCTGCC GCAAGGCGCG
GTGCTGACAC AGGCGATCTC GTTCAAACCG GCGCTGACGC TGGATCCGGC GATCCTGGCG
ACGGGATTCT CGAATTACAG CATCGCCAGT CTCGGCGGCA TCGCGATCGG CGACGGCGTC
GCGCTGCGGC CGATCGTTCC GACACTGCGG CTGTCAGAGG GCAGCGCGGC TGCAGCCAGC
GGAACCGATC CGGCTCGCGC GCTCGAGAGC TGGCAGCCGT CGGTTTATCT CGACTCCCCG
AACGACGCGG TCATCACGCA GCGCCCCGGT GCCAGCATCG CACTGGCGGG GGCCAGCATG
GCGCTCGGCC GGGGGGCGGT CATCGAGGTC GATCCGCTGC AATCCATCAG CCTGACGACC
AACCAAGCAA CCGTCGACGG CTCCTTGATC GCGCATGGCG GCCGGATCGC GCTGCTGCCC
GATCTGCAGC GTCGGTTCTT GATCGGCTCG CTTTGGATCG GAGGCGATGC GGTTCTCGAC
GTTTCTGGTG AAGCCGTCAC CGCGCGCGAC TCGCGAGGTT TCCGCTATGG CCTGGTGCAG
GACGGCGGAT CGCTGTTGAT CGGGCTGGCC GACGCGACGC CGGCCAGCAA CGGCTATCTG
GCCGCCACGG CATCGCCCGT CGTCATCCGC CCGGGCGCGC GATTGCTCGC CAATGGCGCG
AGCGCGATCA TCGATGCCGA TCGCGCTGAC GGCCCAGACT TCTCGCGGTC GGCCACCCTG
GTGGGGGGCG ACGGCGGATT GATCCGGATC GGATCGCTTG CGAGCATTCT GCTCGATGGA
ACGCTGCAGG CGCGCGCCGG CAGCCTACAG GCCGCAGGCG GAACCTTGAA CATCGCACTC
GAAAACTCGA CGCTCGCCGA TCAGGCGAGT GTTCTGCGCA CCATCACTCT GGCGCAGGAA
CACGTCGGGT CGGGACTGCC GTCGGACGCG ACCGCCACCT CCATGGGCCG ACTGCCGGTC
GGCGTCGCGC GGCTGTCCGC GGCGGACATC ACGGCCGGTG GTTTCGGTAC ACTCGATCTG
TGGGCGCGCG ATGTTCTCGC CTTCGGCAGC GACGTCTCGC TCCAGATGAG CGAGGCGCTG
CTGATCCATC GCGGTGTCTT CACGGTCGCC GCTGCGGCGG GCGCGCCCGA CATTCGGCTT
TCGGCGCCCT ATCTGTTGCT CGACGGCAAG ACGCCGAACC TGGTCGATGG CGGCGCGATC
CCGCCCGGCT TGGGACTGAA TGATTATCTG GGTGACTTCG CCGCCACCAA TGTCGGCCAT
TTGACGCTCA CGGCTGATCT CGTCGATGTC CGTAATTCGG TTCAGTTCGG CATGGTCGGC
AGAGTGTTCA CGCTGAGCGC CTCGGAGACC GTCGAGGTTC CGGGGTTCGC CCATGTCGCG
GTGAACAGCT CGGGTGACGT TCGCTTTACC AACGGTGAGC TGATCTCGAA CGGGCCGGAC
CTGACCATCA CCGCGTCGCA GCTCTATCCG ACCACAGGCG CCACAGGAAA CGTGCGCGTG
GGCCCGCTGC GCAGTCCGCT GCCGGGCGAT CCGGACTGGA CCCTGACCAT TCGCGGCCTC
GGCGATGCGC CGGCGGTTCC ACTCTCGGTA TTCGGCTCGC TCACGCTGAC AGCTCCGACC
ATCGATCAGG GCGGCATCGT GCGTGCGCCG CTCGGTCTCA TCACGTTCGG CGTCGTGCCT
CGCAGTTTCG GGACCACCGC TCAGTATCTT TCTGAGGTGT CGCTGAAACC GGGAAGTATC
ACGTCGGTGA GCGCCGATGG GCTGACGATG CCCTATGGCG GCACGTCGGA CGGGCTGGCC
TATCTCTACA ACGGACACAG CGTTGCGTTT GTCGATCTCG CCGATTTCAA CAGCAACAAC
AATTTCGTCG AGACGACGAT CAATCGTGGC ATCGTGTTCG GACAGGCGAC TCTGACTGCC
GATCAGGGGG CGCGGCTGGA TGCGTCCGGC GGCGGCCGTT TGACGGGCGC CGGCTTCTTC
ACCGGGCGTG GCGGATCCAT CGACGTGCTC CGCACCGCGC TGGCCAATGC CAATCCGGCT
AACATCTTCA GCAGTAGCGG CGCGCAGGTC TATGCGCTGG TACCGGGCGC TGCGGCGGCC
GGCTACGCAC CGGTGACGCC CGATGTCAGT GCTCCCGCCA TCGGCCAACA GGTCACGCTC
GATCGAGCAG TCGGTGATCT CCCGGCCGGA ACCTACACAC TGATGCCGGC AACCTACGCG
TTGCTGCCGG GAGCGTACCG GATCGAGCTA GGCGCCGAGA CGAGCATCGG CGTCAATGTG
ACCCGGCTGG AGAACGGTTC GTACCGTGCG AGCGGCTATC TCGGGACCGC CAACACCGCG
GTTCGCGATG CGTTACCGAC GGTGCTGACG ATCACGCCGG CGGCGACCGT TCGCACCTAT
TCACAGTACA ACGAAACGAG CTACGCTGAT TTCGCGATCG CCAATGCCAA TCTGTTCGGT
GCGGTGCGGC CGCGTCTCGA ACGTGACGCT TCCGCGATCC ATTTCGATTT CGGCCCGGCA
ACAGGCCAGG TGCTCGATTT CCGCGGCGTC GCCGATCTCG CCCCCGCTGC CGGAGGCAGG
TCCGGAACGC TGTTCGTCTC GAGCCGTGCC AATATCGAAG TTCGTGCAAT CAGCTCTGAT
GCGGCGGCGG CCGGCTACGC ATCCATCGTT GCCGACGACC TGAACCGATT CAACGCCGGC
GCGCTGGTCA TCGGCGGCCT CTATAGCCTC GTTCAGGCGC TGGACGGTGC CGGTGTCGCG
CTCGGCCCCC AGGTGGCTTT CTTAAGTGCG GGCAACGACA CGGTGGTGCG AAGCGGCGCC
GTCATCAGCG CCGGTCAGGT GTTTCTGGTC GGACAGTCAG TCCGGGTCGA GGGCGGCGGC
GTGGTCGATA CGCGAGCGGG CAGCAACGAT GTGATCGATT CAGCGCTCGG CTATGTCTTC
GCAGCAGGCT CTGGCGCCGT GCTCGGCGTC GGCAATGGAC GCCTGGATTT TCTCGGACCC
ACCGGCGGCT CGAACCAGCC CGCGAACACG ATCGTGGTCG GTGATGGCGC GTCGTTGCTC
ACGCAGGGCA CGATCAGCCT GTCCAGCAGC GGCGCGCCGC AGCTCGGCGA CGTCAACCTT
GCGGCGCGCT ACGTCGCGCT CGCGGCGTCG ACCTTCGATG TCGGAACGGA CGCGGATATC
GTCGCCGGCG GTGGCTCAGG CGTGCGTTTG ACCCAGCATC TGATCGATCG GCTCCTCACC
TCGGCGACAC CCGTCGAGCG GGTGACGCTG GCTGGCGGCA GCTCGATCAA TCTGTTCGGC
AACGCCACGC TGAACCTGCT CGGGCAGGTT GGTGCCACTC CGACTCTGGT GCTCAACACC
CCCGCGATCT ACGGCTGGGG CGCGTCGACG GATCGCGCAA CGATCTCCGC CGACACGGTG
GTGTGGAACG GTCTCGCCAC GGGCGTGGGC TCGGCGAGCA GTCCCTACGT CAGCGTTGCT
CCCGGGGCTG TGGCGCCGGG CGGCGCGGGG ACCGGTTCCG GCAATCTGAC GATCAGCGCC
AGCCACATCG AGTTCGGCTA TGGCCCGGGC ACACGACCGC AAACCCAGAC CGCGCTCGAC
CGATTGGCGC TCGGCTTCAC GACCGTTGAT CTCGTCGCAA GTGATCGGAT CACGGCCAAC
AGCCGTGGCA CGCTGTCTGT CTATGCGTCC GGGACATCCT CTCAAACCTA CGCCGGCGGC
GATCTCAACA TGTTGACGCC GCTGCTGACC GGGGAGGCCG GCTCGGCGAT GAGCTACACC
GCCGGTGGCC GCATCACATT GACGGCTCCT GCGAGCGGGG CAACGAACAC TGCTGCGGTC
GCGGCGCTCG GCGCTGAGAT CAAGCTCAAC GGTGCCAGCG TGATGATCGA CACCGCGGTG
GCGCTTCCGA GCGGCCGCTT CGCGATCACC GCGGCGCAAT CGATCGTACT CGACCAACAC
GCGGTGGTTG ATCTGTCGGG TCGCGCCGTC CAGTTCTTCG ACGTGACGAA GTACAGCTGG
GGCGGCGACC TTGTGATGGA GGTGACCGAT GGCGCCATCG TGCAGCGGCC GAATGCGCTG
ATCGATGTTT CGGCGACCCA CGCGAATGCC GGCACGATCA AAGCGACCGC GACCGGTGCG
AGCGGGCTTG TCGATCTTCG CGGCAGGTTG CTGGGCCAGG GCGGCGACGG CTTCGTCGCC
GGTGGCTTCG ATGTTCGCGT CACCAGCTTG CCGGATTTCG CCGGCCTCAA CGCCAGTCTG
AACGCCGGCG GCTTCTTCGA CGCGCGCAGC TTCGTCATCA AGACCGGCGA CCTCGTGATC
GGCAACGAAG TGCGCGCCAA CCGCATCTCG GTTTCGTTGG ACGGCGGTGC GCTGACCATC
GACGGCAGGC TCGATGCGTC CGGCGCCAGC GTCGGAACGA TCTCGCTGGC CGCCCGCGAC
GATCTGATCC TCACCAACAA CGCCGTTCTG GATGCGCACG GCAGTAGCGT CGTCCGCGAC
GGCCGCGGTG TCGCGATCGA CGCGTCGAAC CGGGCCGTCG TCGAACTGAC CAGCGTCGCC
GGCGTGGTCC GCATCGGCAG CGGCAGCATC ATCGACCTTC GGGCGGGCGA CGGCGTCGCC
CGAGGGCAAT TGGAAATCAA CGCGCCGCGG GTCGGCAGCG ACGACGTCGG CATCGACGCT
GCCGCGGGCC TGACGGTGCG CGGCGCTGCC AGCATTTCTC TCAACGCCTT CACGAGCTAC
ATGCCGAGCG GGGGGATCAT CGATCAGGGG TTGCTCGATC TGATCCATGG CGACAGTACG
GCGTTCATGG CGGCGGCCGG CGGCAACGCC GCGCTGGCGG CGCGGTTGAC CGGGCTGTCG
AATTATGGCG ATGCGTTCCG GCTGCGACCC GGCGTTGAAA TCCGCAGCGC GACGTCGGAC
GGAAATCTGA TCATCACCGG CGATCTCGAT CTGTCCGGCT ATCGCTACGG CAACATCACG
ACGGGCGTCC GAGGATCGGG CGACGCAGGC ACGCTGGTCA TCCGCGCCGG CGGGTCGCTC
ACCGTCAACG GAAGTATCAA TGACGGCTTC GCGCCGCCCT CGGTCACGCC CGACGACGGG
AACTGGTACA CGCCAAACAC CGTTGTGCGC CCGGGCCTGG CCGCAACGGC AGACGTGACC
TTCACGCCGC CGTTCGATCC GAATTATTTC GACAACGTCT ACGTGTTCCC GAGCGCCAAT
GAATTCAACA ATCCGGATTG GCCGGTCGTC GTCAGCGGAT CGATCACCGA TTCGACGAGG
ACGTACTATC AGGGCGACGT CATTGAATTC GGCGTCCTGT ACGGCCAGAT CACGATCACA
CAGGGCACCA CGCTGTCGGC GCGGAATCCT GCCAACGCGA CGATCACCCA GCGAGAGCCG
CGCGCGGGCA GCAATTGGGC AGTGGCTCCG ATGCTCGATC CGGGCATGCA GTCGTGGTCG
ATCCGGCTGG TCAGCGGCGC GGACCTCGGG TCGGCGAGCA GCAGAACGTT GAGCGCGGCC
TCCGCGCTGC ACGGCCGCGG CGACATGGTG CTGGATGCGC CGGGACTCGC CGGTCCCGAG
ATGGCCAGCC CGCTGATCGC GGTCATCCGC ACCGGCACTG GTTCGCTCGA GCTGCTCGCC
GGCAGTGACT TCAAGCAGCA GTCACTGTTC GGCATCTACA CGGCGGGGAC ATCGGTAGCC
GGGACGGACG CGTACAATCT GGACCGCGCG CCGAGCAGCG ACGGCACCGT TCTCGGCGCC
GGCAATTCGG CCTACGAAGA CACGCTGAAT CCGCAGCGGA TGTACTTCGC CGATCACGGC
GGCGATCTGA CGCTGACGAC GCAGGGCGAG CTGCGCGGCT TCACCAGCTA CGGCAATAGC
TCGTCGAGCG GGGAGATCGG CAACTGGCTG TGGTTGCAGG GCAACGCGTC GCGCGGCGAG
ACGGCGGCGT GGGGGATCAA TTTCGGTCAG TATCGTTTCG ACGGGGCGCT GAACTACGGA
TCCGGCGGCG TGACGATGTC GGGCTTCGCG GGCATCGGCA CGTTGGGCGG CGGCAATGTG
CGGGTGACGG CCGGCGGTGA CGCCGGCTCG ACCACGAACT TCAACCTCCT GGCCGATCCG
GCGCTGACGT CGAACCAGTC GTTTTCCGTT GTGGTCGGCA GCAGCGGCTA TGTGACCGCC
GACGGCAATC TCGTGCAGTT CGGCGGCGGC GCATTGCGGC TCGATGTCGG CGGATGGATC
AACACTGGAT TGTCGAACGA CACCGAGCTT GCGACAGGAA GCGTGGTCAA TCTGCGCGGC
GACACGCGGG TCACGGCAGG ATCGATCGGT CAGGTGGGCG AGACCGGCTA CGGGGTGGTC
GCTGCCGGCG ATCCGCGCGC GCCCGACGTG ACGCGACCTC GCGACCGCGT GGTGTTCTCG
CCGCTCGGAT TGGCGGTCGG CGACGGCGCG ATCTCGTTGA CCACGCGCGG AGACCTTGGG
CTGTTGACTG CGACCGATCC GGGACGGCAG ACCCCGCTGG GCTCCGGCAC CGCGACCAAC
GATGCCGGCG ATCCGACGCT CGGCAGCAAC ACGGCATTCT CGCTGTGGAC CGACCGCAGC
GGCTATACGG TGTTCTCGGG CGGCGGCGAC ATCGTGCAGG TGCCGCGCAC CTTGTCGTCG
ACCACGACGG CCTTCCGACT GTACGATCCC GGTCAGTTCA CAGCGATCGC GCCCGCGGGG
TCGATCACGA CGCAGATCGT TCTTGCCCCG AGCCGCGCCG GAACGCTCGA ACTGTTTGCC
GGCGGCAGCC TGTTCGGACA GGCTTCGATG TCCAGCGGAG CGGCAAGTTC GCTGGCGATG
CCGTTCCAGC CGCTGTGGGT CGAAGGCGAT CTGGCATGGG CCAGCTCGCC TGCCGCGACT
ACAAGCAACG GCTTCCAGAT CACGAACGTC TGGGGTGGCA CGGTCGGATT CTCGCTATTC
GCATTCGCGT TTGATGCGGC CACCAATCTC CACGCCGACG CGTCCGACCC GATCCGCGCC
TACGCACGCG GCGACGTCAT GATGCAGATC GGCAGCGTCA TCCAGACCGA TCCGTACAGC
AATCCGGATC TGTTCACGCT GGTGGCCGCC AAACCGGTCG ACATGCGGGC GGGGCGCGAC
GTCCTCGCCA GCGGCTTCAT CCTGAACACC TCGCCCAACG ATATTTCCAA CGTCGCGGCC
GGTCGCGACC TGCTGAACAC GACGCTGAAG ATCTGGGGGC CGGGACTTCT GCAGGTCTCT
GCCGGACGCA ACATCTATCA GGCGCTCGGC AGCAAGATCG TTCGGATCGA TCAGCTCGAC
AGCACGATCA ACAGCTATGG GCCGATCGTG TCCGGCGACC GGCAGCCTGG CGCCGGCATT
TTGCTGACCG CAGGCGCGGG TCCGCAGGGG CCGTCTTATG CCGATTTCGC GGCGCGCTAT
CTCGATCCAC GCAATCTCGC CGATCCGGCC TTGCCGTTGC TCAGTTCAGC CAATGCCGGA
AAGGTGGTGA AAACCTACGA CGCGGAGCTG CAAGTGTGGC TGCGCGAGCG CTTCGGCTAT
CAGGGAAGCG CGGCTGATGC GCTCGCGTAC TTCTGGGCGC TGCCGATCGA CCAGCAGAAC
GTCTTTGTGA GGATGGTGTA TTTCGATGAG CTGAAGGCGG GTGGTCGCGA ATACACAGAT
CCGACGGGGC CGCGGTCCGG CAGCTATGTC AGGGGCCGTG CAGCGATCGC AGCGCTGTTT
CCGGCGCAGG CAACCGGGGC CGGTTCGGTG ACGTTGTTGA ACAGTGCCGG CATCCACACC
GAACGCGGCG GCGATGTCCA GGTGCTGGCG CCGGCCGGCG GCCTGACGCT CGGGGTCGAG
GGTGTGGTGC CGCCGTCGAC CACCGGTTTG CTGACTCAAG GTGCCGGCGA CATCCAGGTC
TTCACCCGCG ACAGCGTGTT GCTCGGCTTG AGCCGCGTGT TCACCACGTT TGGCGGCGAC
ATCCTGGTCT GGTCCGAAGT CGGCGACATC AACGCCGGCC GCGGTGCGAA AACCAGCTTG
GTGTTCACGC CGCCCCTGCG TGTCTACGAC GATTACGGCA ACGTGACCTT GTCGCCGCAG
ACGCCGTCGT CCGGGGCGGG TATCGCCACC TTGAGCCCGA TTCCCCAGGT TCCCGCGGGC
GATATCGACC TGATAGCTCC GCTAGGCACG ATCGATGCGG GCGAAGCGGG CATCCGCGCG
TCCGGTAACA TCAACCTGGC CGCCTTGCAG ATCGTCAATG CGGCGAACAT CAATGTGCAG
GGCACCGCGA CCGGCCTTCC AACCGTGCAG GCACCGAATA TGACGGCAGG ACTCGCGAGC
ACGAATGCAA CGTCGGCTAC GCAGCAATCG GCAGCGCCGA CGAGCGCCGG CAACGAGCCG
CGGTCGGTCA TCATCGTCGA ATTCCTCGGC TTTGGAGGCG GCGATCGAGA TGGCGACGAT
GACAAGGAGC GTCGCCGATC CCAAACCAAG CCTGACGAAC GGGCGACGCA GGATCCGAGC
AGCCCCGTTC AGGTCATCGG CGCCGGTTCA CTCGATCAGG CGGCGATGGA ACGGCTGAGG
CCGGAGGAGC GTCAACGACT GGTGCGTTAG
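
The gene sequence above is printed in blocks of 10 bases, 60 per line, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names are hypothetical, and only the first line of the sequence is used here as sample input; the whole block would need to be pasted in to compare against the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGATGGCGT CGAACGGCAA CCTGCGTAGG ACTTTCTCGC GTCGCGGAGT ATGGCTCACC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of sample: {gc_fraction(seq):.2f}")
```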
 
Protein sequence
MMASNGNLRR TFSRRGVWLT TVSAVAMLLL PAAGQARSLN GAGSEVQSAP NVAADAASQA 
AQQAAAAARQ TGDSLARAAR AVQEMQAAQA AARAAAAVAQ ASAVVPNGLG VGGLLPNVSA
GWSGAKTPTQ SVDADGRTQV GIEQTSQRAI LNWQSFNVGA RTTLTFDQKG NANWVALNRV
DSATAPSLIL GNIRADGQVY VINQSGIIFG GGSQVNVGAL IASAAGITDS QFLTKGIFSS
QSGGAYAPSF TASGGKVVLE TGATISTHAP ASVTSGGGFV LLIGSEVANA GMIGTPNGQA
LLAAGDSFIL RPGFGTDSNV TSTTRGIEIA PVIAQGGTAG GVVNSGLIVA QQGDITLTGR
TITQAGALVS TTSVNSRGTI HLLNSASDDT GTVTLAGGSL SAILPELVST DTALDAQRDA
LIAASAANLA RPSGATGVFD NLSSLADRQD QSRIEIVTGG TINFKNGSYT AAQGGQIAAS
AGKRIFVEDG AALDVSGVRH VAVAMASNNI KVNVQGNELR DSPQNRDSEV LKNNDVWIDI
RDLTLVPAGT GGYASDRYYT AGGLLEVGGY LGTTAHSIGE WSALGGSITL ASAEVITQRG
SLVDISGGSL DYAAGWIRST NLIGSDGRSY SIDQAPSDLT FTSFGGSFSR RHSIQGKTDN
RLTEIWGSVS GRGRDSYRWE EGYSVGRDAG RLTVFSPTVL LDGDIVADTI VGDRQVSKRS
AGVADGYKAA QTTVAQAGGL VIGRSNGIDE DGAFGTPIVL AQVGPTTSEL TADASLSTQA
TNQIQLDAAR LSSFGLGLLR LTSSSSITVA APLTLADGGS LQLIAPDVGI NANVTARSGS
VSIGNVAPVS KSASSAATPL FVDGTANFTL ANAATIDLRG VWTNSSLERD EPDQAAYVDG
GSLDVRMTHG SVVVESGTTI DVTSGATLTS KGRLIGGRGG SVSLVAGADI VEGVSDSIPA
SAKLVLDGTI RAEGVVGGGT LTLRAPQAVT FGGNAILASG VLPPGVPLPA SVQLTEGFVL
PAGTVMTFAA TRTLDVFAPG APIPMGARPL NGVPTILADS WTVPVGVGGT ANGNTPLVAG
RVMPAGTSVV LYSMPGGYVL PASAFPNGLP AQPYTATLMP GDRLLKAVRF EAGEILPQGA
VLTQAISFKP ALTLDPAILA TGFSNYSIAS LGGIAIGDGV ALRPIVPTLR LSEGSAAAAS
GTDPARALES WQPSVYLDSP NDAVITQRPG ASIALAGASM ALGRGAVIEV DPLQSISLTT
NQATVDGSLI AHGGRIALLP DLQRRFLIGS LWIGGDAVLD VSGEAVTARD SRGFRYGLVQ
DGGSLLIGLA DATPASNGYL AATASPVVIR PGARLLANGA SAIIDADRAD GPDFSRSATL
VGGDGGLIRI GSLASILLDG TLQARAGSLQ AAGGTLNIAL ENSTLADQAS VLRTITLAQE
HVGSGLPSDA TATSMGRLPV GVARLSAADI TAGGFGTLDL WARDVLAFGS DVSLQMSEAL
LIHRGVFTVA AAAGAPDIRL SAPYLLLDGK TPNLVDGGAI PPGLGLNDYL GDFAATNVGH
LTLTADLVDV RNSVQFGMVG RVFTLSASET VEVPGFAHVA VNSSGDVRFT NGELISNGPD
LTITASQLYP TTGATGNVRV GPLRSPLPGD PDWTLTIRGL GDAPAVPLSV FGSLTLTAPT
IDQGGIVRAP LGLITFGVVP RSFGTTAQYL SEVSLKPGSI TSVSADGLTM PYGGTSDGLA
YLYNGHSVAF VDLADFNSNN NFVETTINRG IVFGQATLTA DQGARLDASG GGRLTGAGFF
TGRGGSIDVL RTALANANPA NIFSSSGAQV YALVPGAAAA GYAPVTPDVS APAIGQQVTL
DRAVGDLPAG TYTLMPATYA LLPGAYRIEL GAETSIGVNV TRLENGSYRA SGYLGTANTA
VRDALPTVLT ITPAATVRTY SQYNETSYAD FAIANANLFG AVRPRLERDA SAIHFDFGPA
TGQVLDFRGV ADLAPAAGGR SGTLFVSSRA NIEVRAISSD AAAAGYASIV ADDLNRFNAG
ALVIGGLYSL VQALDGAGVA LGPQVAFLSA GNDTVVRSGA VISAGQVFLV GQSVRVEGGG
VVDTRAGSND VIDSALGYVF AAGSGAVLGV GNGRLDFLGP TGGSNQPANT IVVGDGASLL
TQGTISLSSS GAPQLGDVNL AARYVALAAS TFDVGTDADI VAGGGSGVRL TQHLIDRLLT
SATPVERVTL AGGSSINLFG NATLNLLGQV GATPTLVLNT PAIYGWGAST DRATISADTV
VWNGLATGVG SASSPYVSVA PGAVAPGGAG TGSGNLTISA SHIEFGYGPG TRPQTQTALD
RLALGFTTVD LVASDRITAN SRGTLSVYAS GTSSQTYAGG DLNMLTPLLT GEAGSAMSYT
AGGRITLTAP ASGATNTAAV AALGAEIKLN GASVMIDTAV ALPSGRFAIT AAQSIVLDQH
AVVDLSGRAV QFFDVTKYSW GGDLVMEVTD GAIVQRPNAL IDVSATHANA GTIKATATGA
SGLVDLRGRL LGQGGDGFVA GGFDVRVTSL PDFAGLNASL NAGGFFDARS FVIKTGDLVI
GNEVRANRIS VSLDGGALTI DGRLDASGAS VGTISLAARD DLILTNNAVL DAHGSSVVRD
GRGVAIDASN RAVVELTSVA GVVRIGSGSI IDLRAGDGVA RGQLEINAPR VGSDDVGIDA
AAGLTVRGAA SISLNAFTSY MPSGGIIDQG LLDLIHGDST AFMAAAGGNA ALAARLTGLS
NYGDAFRLRP GVEIRSATSD GNLIITGDLD LSGYRYGNIT TGVRGSGDAG TLVIRAGGSL
TVNGSINDGF APPSVTPDDG NWYTPNTVVR PGLAATADVT FTPPFDPNYF DNVYVFPSAN
EFNNPDWPVV VSGSITDSTR TYYQGDVIEF GVLYGQITIT QGTTLSARNP ANATITQREP
RAGSNWAVAP MLDPGMQSWS IRLVSGADLG SASSRTLSAA SALHGRGDMV LDAPGLAGPE
MASPLIAVIR TGTGSLELLA GSDFKQQSLF GIYTAGTSVA GTDAYNLDRA PSSDGTVLGA
GNSAYEDTLN PQRMYFADHG GDLTLTTQGE LRGFTSYGNS SSSGEIGNWL WLQGNASRGE
TAAWGINFGQ YRFDGALNYG SGGVTMSGFA GIGTLGGGNV RVTAGGDAGS TTNFNLLADP
ALTSNQSFSV VVGSSGYVTA DGNLVQFGGG ALRLDVGGWI NTGLSNDTEL ATGSVVNLRG
DTRVTAGSIG QVGETGYGVV AAGDPRAPDV TRPRDRVVFS PLGLAVGDGA ISLTTRGDLG
LLTATDPGRQ TPLGSGTATN DAGDPTLGSN TAFSLWTDRS GYTVFSGGGD IVQVPRTLSS
TTTAFRLYDP GQFTAIAPAG SITTQIVLAP SRAGTLELFA GGSLFGQASM SSGAASSLAM
PFQPLWVEGD LAWASSPAAT TSNGFQITNV WGGTVGFSLF AFAFDAATNL HADASDPIRA
YARGDVMMQI GSVIQTDPYS NPDLFTLVAA KPVDMRAGRD VLASGFILNT SPNDISNVAA
GRDLLNTTLK IWGPGLLQVS AGRNIYQALG SKIVRIDQLD STINSYGPIV SGDRQPGAGI
LLTAGAGPQG PSYADFAARY LDPRNLADPA LPLLSSANAG KVVKTYDAEL QVWLRERFGY
QGSAADALAY FWALPIDQQN VFVRMVYFDE LKAGGREYTD PTGPRSGSYV RGRAAIAALF
PAQATGAGSV TLLNSAGIHT ERGGDVQVLA PAGGLTLGVE GVVPPSTTGL LTQGAGDIQV
FTRDSVLLGL SRVFTTFGGD ILVWSEVGDI NAGRGAKTSL VFTPPLRVYD DYGNVTLSPQ
TPSSGAGIAT LSPIPQVPAG DIDLIAPLGT IDAGEAGIRA SGNINLAALQ IVNAANINVQ
GTATGLPTVQ APNMTAGLAS TNATSATQQS AAPTSAGNEP RSVIIVEFLG FGGGDRDGDD
DKERRRSQTK PDERATQDPS SPVQVIGAGS LDQAAMERLR PEERQRLVR
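
As a consistency check, the protein sequence can be reproduced from the gene sequence. The sketch below is illustrative rather than a reference implementation: the names are hypothetical, it relies on translation table 11 sharing its amino-acid assignments with the standard genetic code, and it does not handle alternative start codons (this gene begins with ATG).

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATGGCGT CGAACGGCAA CCTGCGTAGG"))  # MMASNGNLRR
```

The output matches the first block of the protein sequence, and translating the final nine bases (GTGCGTTAG) gives VR followed by the stop codon, consistent with the last residues listed.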