Gene RPB_0482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0482 
Symbol 
ID3909827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp530172 
End bp542195 
Gene Length12024 bp 
Protein Length4007 aa 
Translation table11 
GC content68% 
IMG OID637882369 
Producthaemagglutinin-like protein 
Protein accessionYP_484104 
Protein GI86747608 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCG CGCTTCACCA CGACGTCGGC CGCCGGCACT CCCGCGTCAC CGCCGAAGAC 
GGCGGGCAGC TTCGCGTCAC GGCGAAGCGT CGGGCGTTGT CGCGCAGGCT GATGCTCACC
AGCACCAGCG CGCTGGCCCT GACGGTGCTG CTCACCGCGG CGCCGGCGAT GGCGCGCTCG
CTCGGCGGCG CGGCGCCGGA TTATTCGGCG GCGGCGGTGG CGGCGGCTGC GGCCACGGCG
GCGGCGCAGC AGGGTAGTGC GGCGGCGCAG CGGTCGGTGG AGTCGCTGAC CCAGGCCGCG
CAGGCGCTGC AGGGCATCCA GGCGGCGCAA GCCGCGGCAC GGGCGGCCGC GCAGGCGGCC
GGCGTGTCGG TGACGGCGCC GGTGGTGGTG CCGAACGGTC TGGCAGCGGG TGGGCTGGTG
CCGGATTCCG GGCTTGCGGC GCCCGGGGCC GCGCGCCCGG TGCTGAGCTG GACCAATGCC
AGGACGCCGG TACAGAGCGG GCCGGCCGAT GCGCCGACCG TCACTGTGCA GCAAACCGGC
GCGCAGGCGA TCCTCAACTG GGCGAGCTTC AATGTCGGGG CGAGCACCAG TGTCGTGTTC
GATCAGCAGG GCAACAGCAA CTGGGTCGCG CTCAACCGCG TCGTCGGCGC CGCGTCGCCG
AGCCAGATCC TGGGCAGCAT CGGCGCCGAT GGGTCGGTGT ACATCATCAA CCAGAACGGC
ATCATTTTCG GCGGCGCCAG CCAGGTCAAT GTCGGCAGCC TGATCGCCTC GGCGGCTCAG
GTCACCGACG CGCAGTTCAA GGCGAACGGC ATCTATTCGA GCTTGTCGGG CTCATCGTAT
TTGCCGAGCT TCAGCGGCGC GACCGGCGCC ATCACGGTGG AGCAGGGCGC GCTGATCAAC
ACCAGCGCCC CCGGCGCCGT GACGCAAGGC GGCGGCTTCG TCATGCTGCT CGGCACCAGC
GTCTCCAATG CCGGCACCAT CACCACGCCC AAGGGCCAGA CCGTGCTGGC CGCGGGCCGC
GACTTCGTCG TCCGCAAGGG CTACGCCACC GACGCCAACA GCTTCTCGAC CACGCGCGGC
AACGAGATCG CGCCCGGGCT CTGGAATGCG AGCAGCAGTA GCTGGACGCC CGGCGGTGGT
GCGGTGACCA ACAGCGGTCT CGTGTTCGCC CAGCAGGGCG ACATCACGCT GGCCGGCCAT
GCGGTGGCGC AGAACGGCGT CGCCATCGCC ACCACCTCGG TCGACATCCG TGGCAGCATT
CATCTGCTCA ATGCGGCTTC GGATGCGAGC GGCAGCGTCA CGCTCGGCGC GGCGTCGCTG
ACCCAGATCA TGCCCGAGCT CGGCAGCGCC GACACCGCGC TCAACTCCAA GCGCGACGCG
CTGATCGCCG ATTCCGCGAC ACAGAATGCC GCCCGCGCCA CGGCCTCCTT CGGCCAGTTC
GGCAATCTGT CCAGCCTCGC CGACCGTCAG GATCTCGGCC GGGTCGAGAT CGTCAGCGGC
GGCGTGGTGA CGTTCGACGG CGGCTCGCTG ACGCTGGCGC AAGGCGGCCA GATCGCCGTC
AGCGCGCGGG CGCGGGTGTT TGCGGCCGGG GATGCCGTGC TCGACGTCTC CGGTACCTCG
AGCGTACTGC CGATGTCGGC CAACAGCCTG AAGATCAACG TTCAGCCCAA CGAGATGCGC
GACGCCCCGG TCAATCGCGA GTCCGGCGTG CTGACCAATC AGGACGTCTG GATCGACGCG
CGCGATCTGG TGCTGGTGCC GGCCGGCACC GGCGGCTACG CGTCCGATCG TTACTACACC
AAAGGCGGCC TGCTCGAAGT CTCGGGCTAT CTGGCCAACA CCGGTCACAG CGTCGGCGAG
TGGACCGCGG TCGGCGGCAT CATCACGCTG AGCGCGGCCG AACTGGTGGC GCAGGCGGGC
TCGGTGTTCA ACCTGTCGGG CGGCACGATC AGCTATCAGG CCGGGCAGGT GCTCACCAGC
AACGTGATGG GCGCCGACGG GCGGCTGTAT CCGATCGGCA GCGTGCCGGC CGACATGCCG
ATCATCGCGC TCGGCCGGAG CTTCACGGTC GATCATTCGC GCTGGGGCGT CAAGGACGTC
TGGGGCAGCC CGAGCCACGG CGCAACGACG TCGCGCTGGG ACGACGGCTA CACGGTCGGC
CGCGACGCCG GGCAACTCGT CCTGTCGGCG CCGACGAGCG TGTTCGAAGG CAGCATCGTC
GCCGAAGTAA TCACCGGCCT GCGCCAGACC GCGGCGCGGC CCACCGGAGT CAGCGACGGC
TACAAGCTGA CGCAGACGAC GGTGCCGCTG GCCGGGGCGC TGATGCTCGG CCAATACGGC
GCCATCGGCC GCACCGGCGC GGTGGCGAGC GAGGTGATCA TCGGCGACGT CGCCGACATC
ACCTCCGGCC TCGGCGCCGA GGCCGCGCTG CCCAGCGGCC GCGCCAATAC GGCGTGGTTC
GATGCCGGCG CGCTCAACGA AGCCGGCCTC GGCGGGCTGT CGATCGCCTC CAGCAACAGC
ATCCGCGTCA GCTCCGACCT GACGCTGGCC CGCGGAGGCA TCCTCGAACT GGTGGCGCCG
ACCATCGACG TCGCCGCCAC ACTCACCGCG CGCGGCGGCG CGGTGACGCT GAGCACGCTG
CTGAGCCGGG TCTATCCGAA CGGCGACAGC ATCGGCAGGG GTACGCTGCC GCTGACTCCT
GCGCTCGGCC GCGCCAGCAT CACGCTGGAG GAAAATGCGG TCATCGACAC GCGGGGCCTG
TGGACCAACG CGCTGCTCGA CCCGGCGGGC GACGCTTCCG GCCAGGCCTT CGCCGACGGC
GGCGCGGTGA CGCTGCGCTC CAGCGGCGAC GTCACGCTAG GCACCGGCTC CATCATCGAC
GCCTCCGCCG GCGCCGCGCT GCTCGCGGGC GGCAAGCTCA AGGGCGGCAA TGGCGGGAAT
ATCTCGCTGA TCGCCAGCGA GTCCCTGACC GACGTCAACG GCACCACCTC GGGCAATACG
ATCCACGGCG ATCTGACGCT GCGCGGCACC CTGATCTCGT ATGGCTTCGG CAAAGGCGGC
AAGCTGACGC TGCAGACCGG GGGGAGCGTC AGCATCGGCG GCACAAGCGC GTCGCTGAAC
CTGTCGCCGG AGCTGTTCGG GTCGGGCTTC TCGGCCTATT CGATCAACGG CCTCACTCGT
CTGATGGTGG CGGATGGCGT CGCGATCACT GCGACGATGC CGGTGCTGAG GGCAGACCCC
AATCGCGCCT TCTCAGCGCC GAGCGGCGGC GCGGCCACGG CGGTGGCCAG CCTGTGGACG
CCGCCGCTCT ATCTCGAAAA CCCCGCCAAG GGCACGCTCG CGCAGCGGGC CGGCGCGAGC
GTCGCGCTGC TGTCCGGCGC CGGGGCCGAC AGCAGTGGCT ACATCACGAC CGAGGGCGCG
CTCAGCATCG GCACGGGCGC CTCGATCACG GTCGATCCGG GCCAGAGCGT GCGCCTCGCC
GCCAGCGGCC AGGTGACGGT GGACGGCACC ATCACGGCGC CGGGCGGCAG CATCGCCATC
ACCAATAATT ACACTCAGCT GCACTATGTC GACCCGGCGG CGATGTCGGT GTGGATCGGC
GAGCACGCGG TGCTCGACGT TGCGGCGCGG GCCTATATCG CGGCCGACGT CGCCGGCCGC
CGCTACGGCG TGATCCCGGA CGGCGGCGCC ATCACGCTCG GCAGCACCGG CGGCGCTTTC
AAGGTCATGG GCAGCTCCAC GAATGAGGTC TACGAACCCG GCAACGAGGC CTCGACCCAC
GCCTATGTGG TGATCCGGCC CGGCGCCGTT CTCGACGCCT CCGGCACCAG CGGCGAGCTC
GATATCGCGA CCGGCGGCTC TCTGCGCAGT GCCTTCGCGC CGGTGACGGT GGCGAGCGAC
GGCGGCACCA TCGCGCTGCG CAGCTGGTCC GGCATCCATG CCGAGGGCAC GATGCAGGCG
TTCGCGGGCG GGGCGGGCGC GGCCGGCGGC ACGCTGATCG TCAGCCTGGA AGCGCCGATC
CAGACCGCCG TGGCCAGCAT CGTCGTGCCC ACCAACCTGA AGGTGGGACG GGTGCTGACG
GTGACGCAGG ACATTGCCGG CACCCTGGCC GCGGGGCTTC AGCCGGGCCA GAGCGACGCC
ACCCTGGCAC TCGGCCAGGG CACGATCGCG GTGTCGCAGA TCGCGGCGGG CGGCTTCGAT
GATGTGTCGC TGCTGGGGCG CGGCGCCATC CTGTTCGACG GCGACGTGTC GCTCGAGGCA
GGGCGCAGCC TGAGCCTGGA AGCCGGGCAG TACGCCGATC TCGGCGGCGG TCGCGCGGTC
GTGTCGGCCC CCTATGTGCG GTTCGGCGGG CAGGACGTCT TCATCTTCGA TTACAACACG
GTGCCCTCGA CGGTGCCGGC CTATGCGGCG GTCGGGACGC TCGCGGTCGA TGGCGACCTG
ATCGAGCTCG GCGGCAACGT CGCCACCACC TTTGCCACCA CGAGCTTCAC CAGCCACGGC
GACCTGCGCT TCGTGACCAG CGCCTCCGTC CCCAACCCGG GTCTCTATGC GGCCCACGAC
CTCACCCTGT CGGCCGTGCG AATTTATCCG GAAAGCGGCA CGATGGCGAC CGTCGCGGCC
GGCTATACCA AGAGCAGCTA TTCCCCTGGC TCGTCGCAAT CCTGGATGTT TACCGATCCG
GCTGCCGTGC TGACGCTGGT GCGACCCGAT GCCGGGGCGG CGGTCGAGGC GCCGTATTCG
GCGTTCGGGG AGCTGAGCCT GACCGCCGGC ACCATCCGCC ACGGCGGCGC GCTGTTCGCG
CCGTTCGGCG GCATCACGCT GACCGCCACG CGTGTCGGCT CCAACGTCGT CAATACCGCG
CCGGCCGTGG TGGAATTCCT GCCCGGCAGC ATCACCTCGG TCAGCGGCGC CGGGCTCGTG
ATGCCCTATG GCGGCACCAC CGACGGTGTC GGCTGGACGG TGAACGGAAC CGATGGAGCG
ACCGTCCACC TGATCACCGG GCAGTACATG AAAGCCGACG GCACGGTGAT CACCTTGGGC
GCGAGGAGGA GCACCGGCAT CTCGATTCGG GCCACCACCA TCGTCGGCGA CGCCGGAGCG
ACGCTCGACC TCTCGGGCGG CGGCGCGCTT ACGGGCGCGG CCTTCATCTC GGGACGTGGC
GGCTCGGTCG ACACGCTGCT CGCATCGCTG ACCAAGGGCG GCACGGTCTA TGCCATCGTG
CCGGGCGTCG TCACGACGCC GGTGGCCGGC AACTACTACT CGGCCTGGAC CGGCGCCGTG
CCGGCCGTGG GCCAGACCAT CACGATTCCG GCCGGTGTTC CGGGTCTACC GGCCGGTACC
TACACGCTGT TGCCGGCCAA CTACGCGCTG CTGCCCGGCG CGTTCCGGGT CGAGCTCGGC
GGCACCGGCA CCACGGCGTT CGCCGGCACC ATCGGGCTGA CCAACGGCTC CTGGCTGACC
TCCGGCTATC AAGGCATCGC CCACACGGCG ATTCGAGATG TGCTGCCGAC CTCGGTGACC
ATCACGCCGG CCGCGACGGT GCGGACATAT TCGCAGTACA ACGAGACCAG CTACGACGCC
TTCCAGATTG CGCAGGCGGA CACGTTCGAC ATGGTGCGGC CGCGCCTCGA GGCCGATGCC
GGCAACCTGA CTTTCTTGTT CAGCGCCCTG AACGCCAGTG GGGCAGCGGC GGCCGTGCCG
GCGCTGGTGT GGGACGGCAC CACCGATTTG ACCCCTGCCG CGGGCGGCTA TGGCGGCACC
GTCTCGGTGA TGGTGGGCGG GAGCCAGGAC CTGATCATCG CAGCCGACGG CAGCACGACC
GAGCGCTCGG CCACCAAGAC GGTCCTGTCC GCCTCGGCCA TCAATGCCTT GAACGCGCCG
CACCTGTTCA TCGGCGGCAA GCCGATGATC AGCTCCAACA TCTCGATCAT CGGCGGCGAT
CCGTTCACCC AAAACAACAG CGCGAACGCG ATCACGCTGG AGAGCGGCGT GGTGCTGACC
GGCGGCCAGA TCGTCATGAA CGTGCGACAG GGCGGCACCA TCCGGCTGGA CCCCGGGGCG
ATGATCGACA CCCGCGGCTT CGCCGACGTG GCCTTGCTCG ACTCCTCGCT CGGCTATTAT
CTCGGCGGCG GCGTGCCGAT GCTGGTCGCC TCCAACGGAG CCGTCGCGCT CTACACCAAC
AACGCGGCAA ACGCGACCGG TACCATCAGC ATCGGCGCCG GGGCCGGGAT CTTCACCCGC
AATGCCATCG GCTTCGTCTC GGCGATGGGG GTCTCGTTCG ACGGCACGCC GCTGCTCGGC
GCGGAGAACC TCGAGCTGGT GGCGGCGTCG ATCAATCTCG GCACCGCCGC CAATCTCGCT
GCGGCGCGGG CCGCCGGAAC GCTGCCGGTC GGCTTCGACC TGACGCAGGA GCTGTTCGCA
TCGCTGGTCG CCGGCGACCC GAGCCGCAAC ATCACCGGAG TCCGCAAGCT CGGGCTCGGC
ACCGGCAACT CGGTCAATAT CTACGGTTCG GTGAACCTGG ACCTGTCGGC GCTGGACAGC
CTCACCCTCA CCACCGCCGC CATTTACGGC GCGGGCGGCG GCAACGATAG TGCGGTCTTG
AAGGCCGGTT CCCTTAATTG GAATGTCGGC CAGCGGACGG TATCCACCGA CGGCAATGGC
ACGCCCAACT ATGGCAGCGC CCTGCCGGGG GCAGTAACCG CCAATGGCCC AGGCACCGGC
CGAGGTTCGC TGACGATCCA GGCACGCGAC ATCACGCTCG GCCACGCCGA CATGCTGGGG
GGTGGGTCGA CCGTGACGTT CGACCGGCTG ATCCTCGGCT TCCAGGATGT CACGCTCTCA
GCGTCCAACT CCATCACCTC CGAAGATCAT AATACGCTGT CGTTCTGGCG CACGGGCGCA
AGCCCCACCT CCACCTTCGA TGCGAAGACC TACACCGGCG AAGTCGGCCA CAGGCTGAGC
CTGATCACGC CCCTGCTCAC GGGCGAGTCT GGCTCGGTGC TTTCGGTCTA CTCCGGCAGC
AGCATCTTCG TGACGGCCCC CGAAGGCAGC ACGCCCGTCG ACACCAGCAC GGTGAGCAGT
CTCGGCGCGA CCATCAACCT GCATTCCATC TATAACGGCG TGGTGCTCGA TACCGCTGTC
GCATTGCCAT CCGGCCGTTT CACCGCGACC GGCGGCGGCA TGAATGTCGA ATTGCGCAAT
CGCGCCCATC TCGATCTGTC GGGCCGCGCG GTCTCGTTCG GTGACGTTAC CAGATATTCC
TGGGGCGGCG ACATCGGGCT CGAAGCCTCG ACCTCTGGCG CTGTTATTTT CTTCCAGGGA
GCCGTGCTCG ACGTTTCGGC CGTGAACAAT GATGCCGGCT CGATCTCCAT CGCTGCGCCC
GAATCTTATG GGGGTGGCTT CGTCGTGTTC GTCGACGGCC AAGGCCGCAC CTTGAGCTCC
CTCGACGGCA TGTTCAAAGG CGCAGCCACC GGCGGCTACG AGGGAGGCTC GTTCAAGCTC
GAGACGTATC ACCTCGACAG CGGTCCCGGC TGGTCGAACA CGACGAGCTT TGCCCGTCTC
AACCGTGCGC TCAACGCGAG CGGCTTCTTC GGCTCGCGCG AGTTCAACCT GAAGGACGGC
AACCTCACCA TCGGCGACGA GCTGAAGGCG CATCATGTCG TCGTCTCGGT CGACGACGGC
AGCCTCACGG TGAACGGCCA TATCGATGCC AGCGGGGCGA CGCCCGGAAC CATCCGGCTC
GCTGCCAGTG ACAATCTGAC GATTGCTTCG TCCGCCGTGC TCGACGTTCA CGGCACGGAG
CTGCAAGTGG ACAGCTACGG CCAGCCGATC GAAGCGAAGA ACCGCGGGCA CATCGAGTTG
ACGTCAAGCG ACGGCACGCT CAGCCTCGGC GCGGGCTCGG TGATGGATCT TTCGACGCCG
AGCGGCGCTT ATGGGCAGGT GGTGCTGAAT GCGCGGCGCA CCAGCGAGAC GAGCGGCGAT
ATCAGGATCA GCGCGGCAGG CTCGCTCGCC ATCCGTGGCG CAAGCAGCAT TGCGCTGAAT
GCCTTCTGGC GATACGGCAA TTACGCATCG AACACGGTCG TCACCCAGGC GATGCTGGAC
ACCTTCGACA TCGCCAGCCA GAGCTTCATC AAGGCCGCCT ACGACAACGA TCTCGCCACT
GGCCAGCTCA CGGCGGGGCT GAAAGGAAAG CTCGCAGGGC TGACCGCTCA TGGCAGCGCC
TTCCATCTGC GCCCGGGCGT CGAGATTTCC TCCGGCGGCG ATCTTGCCAC CTCGGGCGAT
CTCGATCTGG CCAAATATCG CTACGGCCCG AATGCCGACC GCTACACCAC ATCCGCGAGC
TTTGGCGCGG GCGAGCCCGG CGCGCTCGTG ATCCGCGCCG GCGGTAACCT CACTATCGGC
GGCAGCATCT CGGACGGGTT CGGCACACCG GTGTCGGTCT ATTTGGGCGA GGCCGCCGGA
GCCCAAGAGA CCCTGAACGC GACGAAGACC CTGACGGCCG CCACCGGAAT CAGTACCGCA
AGCATTCCGA TCGCGAGTTT GCCGACGACA CTGAACTTCG ATGCGGTTCT CAACACGAGC
AATTCTCTGC GGCGCGGTAT CGCCATCCCG TTCGCGTTCA CTAGCGCGGC GAATGTGGCG
CAAACGGGCT GGACAGCGAC CGCGAACATC TACAACGCCA GCGGCACCCT TCTATACGCG
ACCGGAAGTA TGGTAACGAC TACGCTGCCG AGCGGGACAC AGTTTGCGGC CGGGAGCACG
CTGCCAGGCA GCGGAGCGTT GAGGATTCAA GCTGGCGCGG TGATTCCGGC GAATACGGTC
ATCAAGACCT ACATTTCCCA GACGACATCG CTCTTTCTCG GTAGCCAGAC GCTGCCGCTT
GGCGCGACCT TGCCGAGTGG GACCGCGATC AATTACGCGA CGACACCGAT GTATGGCCAA
ACCTCGCGGA TGCTCGCAGC CGGAAGCCAA TCCTGGTCGA TCCGCCTGGT TGCGGGCTCC
GACCTGTCGG CCGCCTCGAC GCTCGTGCTG CGGGCCGCGA GCGGCCTGGC CGGGTCCGGC
AACATCGTGC TGAACAACGC CGGGGCGGTC GATTTCAGAG GTATTCCCAT CCCGAGCGTC
ATCCGCACCG GCACGGGCGA TCTCGAGCTC CTCGCCGGCG GCGATCTCAC CATGCGCTCG
CTCTACGGCG TCTACACGGC TGGGACCGAC GCCGGCGCAA CCGGGCTTCC CGCAGGCACC
TACATGCCCG ATCACGGCGG CGATTTGACC GTGACGGCGC AAGGAGACGT CACCGGCTAC
AGCTACAATT GCACGTACAG CTCCTGCGGC AATGCGAAGT CCTGGTCGCC TGCGATTTGG
TTGATCCGCA GTGGCGACAG CACCACCAAC GCTGCGTGGT CCATAGCCTT TGACCGCACG
ATCTCCGGTA GCAGCGCCTC CTACATCACG GGCTTCGCCG GCTTCGGCAC GCTCGGCGGC
GGCGACGTTA CCCTGACGGC GGGTGGCGAT GCGGGCGCCA TGACGCGGAC CTACACTGGA
AGTGGTGTCA CCGCCCAGAC CTACGGCGCC CTGCAAGTCG CAGTCGCCGC CAGCGGCAAG
GTGCTGTCCG TCACCAAGGA TGGCGCCATC GTCACGGGCG GCACGCTGGT GCAGGCCGGC
GGCGGCGATC TCGTCGTCAA GGTCGGCGGC TCGCTCAATA CAGGCTATAA TTATTCGAAT
TCCGAGCAGG CAGCCGACGG CTCGCTCAGC ATCTTCACCA ACCTTCGCGG CGACACCACG
ATCACGGCGG GTGCGATCGG CGCAATCGGC GAGATCTATG GCTTCAAGGA AACCGGCGAT
CCGCGCGGTC TCGACGCCAA CGAGGCCAAC CTCGCCAGTG CCCGCGGCGG CATCGCGCTC
ATCCCTGGCG ACGGGACCAT CACCGTGCGA ACGGCGGGAG ACCTCGTGAT CGGCTCCTAT
GCCGACGCCG GGCAGCTCGG TGGGACAGCG TCGAGCACGT CCTTCTCACT CTGGCAGCCG
CAAACGACCG CGATCTCGCT GTTCTCCGCC GGCGGCAATC TCGTGCCGGT GACGAATATG
GGGATCGGCA ATCCGTGGTC GATGAAGACC CTCTCGACCT CGGTCGTGAT GGCTCCAGGG
TTGTTCGATG CCGTCGCCGC CGACGGCAGC ATTTATCTCG GCGGCAATTA TGCCTTCACG
TTCGAGCTGG CGCCTTCGCC GAACGGAAGG CTGAACTTGC TCGCGGGGCA ATCGATCTAC
GGCGCGGGCT TGTGGGTCGG AGGTCTGGTG AATGTGACCG CCGCTGGCAG TTCGACGCGG
ATCGTCGTTT CCAGCGCTGC GTCTGGGGTC AACGACATCC CCAATCCGTT CCGCCCCTCG
ACCGCGACCG GCAGCTTCTT TAGCTTTCAG GACGACCGCG TGTCAGCGTC GACGACGGAC
GACACCACGC CGAACCGCAT CGTCGCGCTC ACCGGCGACA TCATCAACCT CGCGCTCGGA
GAAACGCTCA CGGACGTCTA CTACGACAAC GCGTATCACG TCATCAGCCA CGGTACCGGC
AAGTCCGCTC AGGTGATCGC CGGCGGCGAC ATCGTCAATT TCGGGGCCGG CAGCGCCGGC
GCCACGGCGG GCCTGATCCT CAACACCCGG CCCACCGACG TCTCGGTGAT CCGGGCCGGC
GGCAGCATCT TCTATCTCAA CATGAACATC GCCGGCCCCG GCGCGCTCGA CATCAGTGCC
GTCGATACAA TTTACCAGGG CAATCTCGGC GCCATCACCT CGATCGGGCC GCTGGCATCC
GGCGACACGC GCCCCGGCGC CTCGATCGTC GTCGCCGCGG GGCTCGGGAG CGACGGGGCG
GACTACGCGG CGCTGACCAA ATATCTCGAC GCCGCCAATC TCGCCGCCAC CGGCATCCCG
CTCGCCGATC AGTCCGGCAA GGTCGCCCGG AGCTACGATG CCGAGCTGAT CGCCTGGCTG
TTCGATTATT ACGGCTACCG GGCGCAGAAC GCCGCTGACG CCCGCGCGGT GTTCGCCGCC
AAGCCGGCCG AACAGCAGAA CATCTTCCTG CGCAGCGTGT ACTTTGCCGA ACTGAAGGCC
GGCGGCCGCG AGTACAACGA TCCGTCGTCG AGCCGCTACG GCTCGTATCT GCGCGGCCGT
CGAGCGATCG CCGCGCTGTT CCCGGAGGAT CGCAGCTACG CGGGCGATCT GATCATGTTC
GGCGGTTCGG GTATCCGCAC GCTTTATGGC GGGGACATCG CGATCCTCAC GCCCGGCGGC
CGCCAGGTGC TCGGCGTCGA GGGCACGGTG CCGCCGGCCT CGTCTGGGGT GATCACCCAA
GGCGCCGGCG ACATCGGCCT CTATTCCAAA GGCTCGATCC TGATCGGCCT GTCGCGCATC
ATGACGACGT TCGGCGGCGG CATCCTCGGC TGGTCGGCCG AAGGCGACAT CAACGCCGGC
CGCGGCTCCA AGACCACGCA GGTCTATACG CCGCCCAAAC GCATCTATGA TTCCTACGGC
AACGTGACTC TGTCGCCCGC CGCGCCGGCG ACCGGCGCCG GCATCGCCAC GCTCAATCCG
GTGCCCGGCA CACCGGCCGG CGACGTCGAC CTGATCGCGC CACTCGGGAC CATCGACGCC
GGCGAGGCCG GCATCCGGGT CGCGGGCAAC GCCAACCTCG CGGCGCGCCA GATCCTCAAC
GCCGCCAACA TCCAGGTGCA GGGCGCGAGC AGCGGCCTGC CGACCGTGCA GGCGCCGCCG
ACCGCGGCGC TCACCGCCGC CAACAACGTC GCCGGCGCCG CGACCCCGAC CGCCGCGGGT
CCTGCGCAAG ACAACGATCG GCCGTCGATC ATCCTGGTCG AGTTCCTAGG CTTCGGCGGA
GGAGACGGCG GTGACACCAA GCCGCGAAGC CCCGACCGTG ACGACCGCCG CCGCACCGAG
CAGCAAGGCT ACAACTTCAA CAGCGCCTTT CAAATTATTC AGCTCGGCGA TCTGGCCGGG
CCGGCGCGGA GCAACAGTCG CTGA
 
Protein sequence
MSPALHHDVG RRHSRVTAED GGQLRVTAKR RALSRRLMLT STSALALTVL LTAAPAMARS 
LGGAAPDYSA AAVAAAAATA AAQQGSAAAQ RSVESLTQAA QALQGIQAAQ AAARAAAQAA
GVSVTAPVVV PNGLAAGGLV PDSGLAAPGA ARPVLSWTNA RTPVQSGPAD APTVTVQQTG
AQAILNWASF NVGASTSVVF DQQGNSNWVA LNRVVGAASP SQILGSIGAD GSVYIINQNG
IIFGGASQVN VGSLIASAAQ VTDAQFKANG IYSSLSGSSY LPSFSGATGA ITVEQGALIN
TSAPGAVTQG GGFVMLLGTS VSNAGTITTP KGQTVLAAGR DFVVRKGYAT DANSFSTTRG
NEIAPGLWNA SSSSWTPGGG AVTNSGLVFA QQGDITLAGH AVAQNGVAIA TTSVDIRGSI
HLLNAASDAS GSVTLGAASL TQIMPELGSA DTALNSKRDA LIADSATQNA ARATASFGQF
GNLSSLADRQ DLGRVEIVSG GVVTFDGGSL TLAQGGQIAV SARARVFAAG DAVLDVSGTS
SVLPMSANSL KINVQPNEMR DAPVNRESGV LTNQDVWIDA RDLVLVPAGT GGYASDRYYT
KGGLLEVSGY LANTGHSVGE WTAVGGIITL SAAELVAQAG SVFNLSGGTI SYQAGQVLTS
NVMGADGRLY PIGSVPADMP IIALGRSFTV DHSRWGVKDV WGSPSHGATT SRWDDGYTVG
RDAGQLVLSA PTSVFEGSIV AEVITGLRQT AARPTGVSDG YKLTQTTVPL AGALMLGQYG
AIGRTGAVAS EVIIGDVADI TSGLGAEAAL PSGRANTAWF DAGALNEAGL GGLSIASSNS
IRVSSDLTLA RGGILELVAP TIDVAATLTA RGGAVTLSTL LSRVYPNGDS IGRGTLPLTP
ALGRASITLE ENAVIDTRGL WTNALLDPAG DASGQAFADG GAVTLRSSGD VTLGTGSIID
ASAGAALLAG GKLKGGNGGN ISLIASESLT DVNGTTSGNT IHGDLTLRGT LISYGFGKGG
KLTLQTGGSV SIGGTSASLN LSPELFGSGF SAYSINGLTR LMVADGVAIT ATMPVLRADP
NRAFSAPSGG AATAVASLWT PPLYLENPAK GTLAQRAGAS VALLSGAGAD SSGYITTEGA
LSIGTGASIT VDPGQSVRLA ASGQVTVDGT ITAPGGSIAI TNNYTQLHYV DPAAMSVWIG
EHAVLDVAAR AYIAADVAGR RYGVIPDGGA ITLGSTGGAF KVMGSSTNEV YEPGNEASTH
AYVVIRPGAV LDASGTSGEL DIATGGSLRS AFAPVTVASD GGTIALRSWS GIHAEGTMQA
FAGGAGAAGG TLIVSLEAPI QTAVASIVVP TNLKVGRVLT VTQDIAGTLA AGLQPGQSDA
TLALGQGTIA VSQIAAGGFD DVSLLGRGAI LFDGDVSLEA GRSLSLEAGQ YADLGGGRAV
VSAPYVRFGG QDVFIFDYNT VPSTVPAYAA VGTLAVDGDL IELGGNVATT FATTSFTSHG
DLRFVTSASV PNPGLYAAHD LTLSAVRIYP ESGTMATVAA GYTKSSYSPG SSQSWMFTDP
AAVLTLVRPD AGAAVEAPYS AFGELSLTAG TIRHGGALFA PFGGITLTAT RVGSNVVNTA
PAVVEFLPGS ITSVSGAGLV MPYGGTTDGV GWTVNGTDGA TVHLITGQYM KADGTVITLG
ARRSTGISIR ATTIVGDAGA TLDLSGGGAL TGAAFISGRG GSVDTLLASL TKGGTVYAIV
PGVVTTPVAG NYYSAWTGAV PAVGQTITIP AGVPGLPAGT YTLLPANYAL LPGAFRVELG
GTGTTAFAGT IGLTNGSWLT SGYQGIAHTA IRDVLPTSVT ITPAATVRTY SQYNETSYDA
FQIAQADTFD MVRPRLEADA GNLTFLFSAL NASGAAAAVP ALVWDGTTDL TPAAGGYGGT
VSVMVGGSQD LIIAADGSTT ERSATKTVLS ASAINALNAP HLFIGGKPMI SSNISIIGGD
PFTQNNSANA ITLESGVVLT GGQIVMNVRQ GGTIRLDPGA MIDTRGFADV ALLDSSLGYY
LGGGVPMLVA SNGAVALYTN NAANATGTIS IGAGAGIFTR NAIGFVSAMG VSFDGTPLLG
AENLELVAAS INLGTAANLA AARAAGTLPV GFDLTQELFA SLVAGDPSRN ITGVRKLGLG
TGNSVNIYGS VNLDLSALDS LTLTTAAIYG AGGGNDSAVL KAGSLNWNVG QRTVSTDGNG
TPNYGSALPG AVTANGPGTG RGSLTIQARD ITLGHADMLG GGSTVTFDRL ILGFQDVTLS
ASNSITSEDH NTLSFWRTGA SPTSTFDAKT YTGEVGHRLS LITPLLTGES GSVLSVYSGS
SIFVTAPEGS TPVDTSTVSS LGATINLHSI YNGVVLDTAV ALPSGRFTAT GGGMNVELRN
RAHLDLSGRA VSFGDVTRYS WGGDIGLEAS TSGAVIFFQG AVLDVSAVNN DAGSISIAAP
ESYGGGFVVF VDGQGRTLSS LDGMFKGAAT GGYEGGSFKL ETYHLDSGPG WSNTTSFARL
NRALNASGFF GSREFNLKDG NLTIGDELKA HHVVVSVDDG SLTVNGHIDA SGATPGTIRL
AASDNLTIAS SAVLDVHGTE LQVDSYGQPI EAKNRGHIEL TSSDGTLSLG AGSVMDLSTP
SGAYGQVVLN ARRTSETSGD IRISAAGSLA IRGASSIALN AFWRYGNYAS NTVVTQAMLD
TFDIASQSFI KAAYDNDLAT GQLTAGLKGK LAGLTAHGSA FHLRPGVEIS SGGDLATSGD
LDLAKYRYGP NADRYTTSAS FGAGEPGALV IRAGGNLTIG GSISDGFGTP VSVYLGEAAG
AQETLNATKT LTAATGISTA SIPIASLPTT LNFDAVLNTS NSLRRGIAIP FAFTSAANVA
QTGWTATANI YNASGTLLYA TGSMVTTTLP SGTQFAAGST LPGSGALRIQ AGAVIPANTV
IKTYISQTTS LFLGSQTLPL GATLPSGTAI NYATTPMYGQ TSRMLAAGSQ SWSIRLVAGS
DLSAASTLVL RAASGLAGSG NIVLNNAGAV DFRGIPIPSV IRTGTGDLEL LAGGDLTMRS
LYGVYTAGTD AGATGLPAGT YMPDHGGDLT VTAQGDVTGY SYNCTYSSCG NAKSWSPAIW
LIRSGDSTTN AAWSIAFDRT ISGSSASYIT GFAGFGTLGG GDVTLTAGGD AGAMTRTYTG
SGVTAQTYGA LQVAVAASGK VLSVTKDGAI VTGGTLVQAG GGDLVVKVGG SLNTGYNYSN
SEQAADGSLS IFTNLRGDTT ITAGAIGAIG EIYGFKETGD PRGLDANEAN LASARGGIAL
IPGDGTITVR TAGDLVIGSY ADAGQLGGTA SSTSFSLWQP QTTAISLFSA GGNLVPVTNM
GIGNPWSMKT LSTSVVMAPG LFDAVAADGS IYLGGNYAFT FELAPSPNGR LNLLAGQSIY
GAGLWVGGLV NVTAAGSSTR IVVSSAASGV NDIPNPFRPS TATGSFFSFQ DDRVSASTTD
DTTPNRIVAL TGDIINLALG ETLTDVYYDN AYHVISHGTG KSAQVIAGGD IVNFGAGSAG
ATAGLILNTR PTDVSVIRAG GSIFYLNMNI AGPGALDISA VDTIYQGNLG AITSIGPLAS
GDTRPGASIV VAAGLGSDGA DYAALTKYLD AANLAATGIP LADQSGKVAR SYDAELIAWL
FDYYGYRAQN AADARAVFAA KPAEQQNIFL RSVYFAELKA GGREYNDPSS SRYGSYLRGR
RAIAALFPED RSYAGDLIMF GGSGIRTLYG GDIAILTPGG RQVLGVEGTV PPASSGVITQ
GAGDIGLYSK GSILIGLSRI MTTFGGGILG WSAEGDINAG RGSKTTQVYT PPKRIYDSYG
NVTLSPAAPA TGAGIATLNP VPGTPAGDVD LIAPLGTIDA GEAGIRVAGN ANLAARQILN
AANIQVQGAS SGLPTVQAPP TAALTAANNV AGAATPTAAG PAQDNDRPSI ILVEFLGFGG
GDGGDTKPRS PDRDDRRRTE QQGYNFNSAF QIIQLGDLAG PARSNSR