Gene RPB_2214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2214 
Symbol 
ID3907956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2515895 
End bp2527987 
Gene Length12093 bp 
Protein Length4030 aa 
Translation table11 
GC content67% 
IMG OID637884109 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_485830 
Protein GI86749334 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.699209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCCG TGTCGCCGAT CGAACTGAGA GAGACCATGC CCGTTGCCAA TGCCGCCTCC 
CGTATGCTCT TTGGTTCCCC CGGCGCAACC CGTCGACTCG CCGCGCATGC CGCGTTGCTG
GTCACGGTAA GCGCGAGCGC TCTGCTGCTT TGCGCGCCGC CCGCGTCGGC GCGTTCGTTG
GGCGGCGGCG CAACGCTGTC GGCGCCGACT CTCGCCGTAG ACGCGGCGGC GCAAGCCGCC
GCGCAGGCGG CCTCGGCTGC GGCGCAGGGG TCGGCGTCGC TGACGCGGGC AACCACTGCG
ATCCAGGCCA TGCAGGCGGC GCAAGCAGCC GCACGTGCCG CCGCGAGCAG CTCTGGCGGC
GTGCCGAACG GCCTGACCGT CGGCGGACTG GTGCCTGATT CCGGCCTCGC GGCGGGTGGT
ATCGCGCGTC CGGTGGTGAG TTGGACCAAC GCGCGCACGC CGATGCAAAG CGGTCCGAGC
GATGCGCCGA CCGTCACGGT GCAGCAGACC GGCGCGCAGG CGATCCTCAA TTGGTCGAGC
TTCAACATCG GCGCGAACAC CAGCCTTGTG TTTGATCAGC AAGGAAATTC AAGCTGGGTC
GCGCTCAACC GCGTCGGCGC TTCAAGCTCG CCCAGTCGCA TTCTCGGCAG CATCCGGGCT
GACGGCTCGA TCTACATCAT CAACCAGAAC GGCATCATTT TCGGCGGCGC CAGCCAGATC
ACTGTCGGCA GCCTGATCGC CTCGGCGGCG AGCATCACCG ACAATCAGTT CCTCACTTAC
GGCATCTATT CGACCGCCAT CAACAAAGTG TACAACCCGT CGTTCACGGC GGCCGGCGGC
ACCATCGTGG TCGAGGCCGG CGCGTCGATC GCCACCGCGG CGCCCAAGAC CGTCACCAGC
GGCGGCGGCT CGGTGATCCT GCTCGGCACC GAAGTGCGCA ATGCCGGCTC CATCTCCACG
CCCAAGGGCC AGACCGTGCT GGCCGCCGGC GACAGCTTTA TTCTTCGCTC GGGCTACGCC
ACCGACGGCA ACACCACGTC GACGACGCGT GGTGTCGAAG TCGCTCCGGT TCGGGCCAGC
AACAGTCAGA GCGGCGCCGT GGGCAACAGC GGCCTCGTCT TCGCGCAGCA GGGCGACATC
ACGCTTGCGG GTCACAGCGT CACCCAGGAC GGCATCCTGG TCGCGACCAC TTCGGTCGAT
CAGCGCGGCA CCGTGCACTT GCTCAATTCA ATCACCGACC CCACCGGCAG TGTGACTGTG
GCGGATGGCG CGCTCACTGC CGTTCTGCCC GAGCTCGACA GCAAGGTCAC CGCGCTCGAC
AGCCAGCGCG ACGCGCTGAT CGCCGATTCG ACGAAACAGA ACTCCGGCCG CGCCGCCGCT
GCGAACGACG CGCCACAATT CAACAATCTC AGCCGGCTCG ACGACCGCCA GGATCAGTCG
CGTGTCGAGA TCGTCTCGGG TGGCCTCGTC ACCTTCAGGC GCAATTCGCT GACGTTGGCG
CAGGGCGGTC AGATCGCGGT GTCGTCGGTG AGCGACATCC GGGTCGAAAC GCTGGCGACC
CTCGATGTCT CCGGCACCAC GGGCACGGTA CTGCCGGCCT CGGCCAATAA TCTGAAGGTG
AACATCCAGC CCAACGAGAT GCGCGACAGC CCGGTCAACC GCGACAGTGG CGTGCTCGTC
AGCAAGGACG CCTGGATCGA TATCCGGGAC CTCGTGCTGG TGCCGGCCGG CACCGGCGGC
TACGCATCCG ACCGCTACTA CACCAAAGGT GGCTTGCTCG AAGTCGGCGG TTATCTGGGC
AACACCGGTC ACACCATCGG CGAGTGGACC GCGGTCGGCG GCACCGTGAC GCTGGCAGCC
GGCAGGAAAG TGATCGCGGA ACGCGGCGCG TTGATCGATC TGTCCGGCGG CATGGTGAAC
TATGCGGCGG GCGACCTCCT CACCACAATG GTGATTGGTG CCGACGGCCG TATCTACAGC
ATCGGCAACG CGCCGGCAGA CATGCCGATC ATCAGCTATG GCAGCGGTTT CACACGCGAC
CACGCCCGTT GGGGGCAGAA GGAAACCTGG GGCGCTCCCG CGCACGGCGC TGTGCGCAGC
TTCCACCAAG ACGCCTACAG CGTCGGCCGC GACGCCGGAT TGCTGCAGAT CCTCGCGGCG
ACGTCGGATT TCAAGGCGGA TATCGACGCC GCCGTCTATA CGGGCGAGCG TCAGACGACA
TCGCGGCCGT CCACCAAGAC CGACGGCTAC AAGCTGACTC AGACGCAACC CGGCTTGCCC
GGACAGCTTC TGGTGGGAAC TGTCGATATC GGACTTGGAA ACGGCGGCAC GCAGACGAGC
GGCATCGTTT TGTTCGCCGG ACCCGGGGCC ACACAAGTCA CCGCGCCGAG CCATGTCTTC
GATGCGGCGG AAATCAGCGC CATGGGGCTC GGCGGGCTGA GCGTTTTCGG CAGCTCTGAA
ATCGAGGGCG ATCTGACGCT CGCGCCCGGA GCGATCGTGT CGCTCAGCGT CAACCATCTG
GGCGGCAACA TCACCGCGCG TGGCGGCAGT GTCACCCTCT CGGTGTCTTC GTTTGATCCG
GGCGTTACTG TCGACACCCG CGGGCTTTGG ACCAATGCGC TGCTCGACCC GACGCGCGTG
TCGGGGCTTG CTTACCGCAA TGGCGGTGCC ATCACGGTCA GCGGCAGCCC GCTCAACGGC
TACGATCTGG TGATCCCTGC GGGCGTGCGG CTCGACGCCT CGGCCGGCGG CGCAATCCTG
TCGACCGGGA AATTCTCCGG CGGCAATGGG GGTGACATCA CGCTGGGAGG AAACTCGCTG
GCGCTGAACG GCGAGGTGGT CTCGAACGGC TTCGGCAAGG GTGGCAAGCT GGCCCTGTCG
ACGTCAGGCG CGATGGTGAT CGGCCCGTCG CCGCTTCCCG GCGGCACGCT GGCGGCCGGC
GAGGCGGCAC CGGTCTTGCT GGTGCTGGCG TCGCCTGTGA CACTGGCCAG GGGCACGGTT
GCACCGTTCT CGATCACCAT GACGATTACC TCGGTCGCGG GCGGACAGTT GGTGCCGGCA
GGCGCCCAGG CGCAGGTCTC GAGCACCGCC GTGGTGACCG TCGGCAGCGC CGGATGGACG
GTCCCGGCCG GCATTTTCTA TGCCATGGAT ACCACCTTCC ATTATTACTA CCCCGGCAGC
ACGGTGCCGG CCGGCACCAT TCTGCAGACG ATGGCCGGCA ATTTCAGCAG CGGCTACGTG
CTTCCGCCGA ACTCCTTCAC GACCCCGCTC AGGCTCTCTC CGATCAACGT GAACTTCGCA
GCCGGAAGTG TGCTGGCCTA TGACGCGGTT CTGTCGAAGG GCTACGTCCT GCCGGCCGGA
GCGGTTCTGC CGCAGTCGGT CGAGGTGGCG CCGGTGCTGG CGCTCGACCC ATCTCTGTTC
CAGAAGGGCT TCTCTTCCTA CGCAATCAGC AGCGGGCTCG GCATTACGGT CCAGCCCGGG
CTGCAAATCG AAGTCGTTGA GCCAACCTAT CAGTTCTCCC GAGCGCTGGT CGATCTGCCG
ACCGGCGGCA GGGTCGACAC GGCGGCATCG CTCGCGCTGG CGCCGCTGTT CGCCGAGGAC
GCGCTACGTC GAACCTTGAA CCAGCGGCCC GGCGCCGACC TGGTTCTCAC CGCGCGCGCA
GCGGGATTGT ACTCCAGCGT AGCGCTTGCC AACGAGCATG CCGCGGCTCC GATCACGATC
GGGAGCGGAG CGTCGATCAG CGTCGATCCA GGCCGTAGCG TCAGCCTCTT CGGCGACGAT
CAGATAACCA TCGAGGGCGA AATCACTGCG CGCGGCGGTT CGATCAGCGC GTTCAATCTC
ACGCCATCCG CTCCCCTCAC GGCCAACTCC AGCTTCGGAC GTTCGGTCTG GATCGGCAGC
AATGCGGTGC TCGACGTCTC GGGCATCGCC CATGCGGCGA CCGACAGTCG TGGCCGACGC
TACGGGACGG TCACCGATGG TGGTTCGATC GTGCTCGGCA CCGACGAGAC GCCGACCACC
GACGGGATGA ACTCGGGCAA CGCGTTCGTG GTCATCCGTC CCGGCGCGCT GATCGATGCG
TCGGGCGCGA GCGCAGTCCT CGACGTGCTC GGTGAGGGGC CGGCGCTGGT CGCGGGCAAC
GGCGGATCGA TTCGACTCTC CAGCCTGTGG GGGTTTGCTA TAGACGGGAC GATGCGGGCC
GCGGCCGGGG GCGTTGGCGC CAGCGGCGGA TCGCTCTCGC TGACTCTGGA ATCGCCGATC
GTCGCTATCG GCGCTGGCGC CTCATCGACT CTCGTCCAGG CTGCGACGGC CGGTCGCGTG
ATCGCCATCA CCCAGGACAC GACACCGACC GGATTGACCG ACGGTCTCGC GCCCGGCGTC
GCGGACGCCG GGCTGGTCCT CGGCAGCGGC CGCCTCAGTG CCGCGCAGGT CAAGGCCGGC
GGCTTCGACT CGCTTTCGCT GTGGGGACGC AGCGCGATCG CCTTTGACGG TGATGTCAAT
CTGTCGCTCG GCCGCAGCCT CGTTTTGCAG CAGGGTAAAA TCGTCAACAC CGCCGCGACA
GGCAAGGTCA CGCTCGCCGC GCCCTATGTA TTCATCGATG GTCACACGCT TCAGGCCAAC
ACGCTGCATC CGGAATGGAC GACGATGTTC GCGGATCGGC TGCCGGTCTA TGCGAAGGGA
GCTTCGTTCT CCGTGTCGGC CGACCTCGTC GATATCCGCA ACCTGGCGTC CCTCGCTTAC
GAAATGGCCA GCGTCGCCAG CTCCGGCGAC ATCCGTTTCC TGAAAGCCAC GGAACGCGCC
AACGCCCGCC TTGTCGATTA CGTCACGATA CTCGCGGCGG GCGGCGATCT CGATCTCACG
GCGGCACAGA TCTATCCGGC CTCCGGCGCG GTCGCGCTCG CTGCCGCCGG CGGCATCGAA
ACGGCGGGCG ACACGTTCGG ATTCACGAGG TTCAACGATG CCGGCTCGCG GCTGACGGTG
CATGGAACGG GCGCGGTGCC CGACATGCCA TATTCGGTGT TCGGCAATCT GACGCTGCTC
GCCGAGACGA TCGAGCAGGG CGGCGTCGTT CGCGCCCCGA TGGGGCAAAT TCAGTTCGGC
GATTTCGCCA CGCCCGCCGG CCGGGCAGTC AATACCACGC GTCTTATCGA GTTCATGCCC
GGCAGCATCA CCTCGGTCAG CGCCGACGGA CTGATCGTCC CTTACGGTGG CACGGCCGAC
GGCATCCGCT ACACGGTTGA TGGCGGAGCT CCGGTGACGC CGTCCCTGGT CAACGGCCTG
TACAGGACCG GCGCATCGGG ATCGACGTCC CTTTATGTCG GCGTCGGCGT CGCCGGCATA
TCCGTTCAGA CCGACGCCGG CTCGCTGATC GATTTGTCGG GCGGCGGCAC GCTCGCCGGC
GCGGCCTTCA TCTCCGGGCG CGGCGGGTCG GTCGATCCGC TGCTGACGCC GCTCGCCAAT
GCGGGAATCC ACGGCTTCAG CAGTGCCGGC AACAAGGTCT ATGCGATCGT GCCGGGCACC
TTCACCGCGC CGGCTGCCGG TGGATACAAC AATCTTTGGA CGGGCGGCGT ACCGACGATC
GGCCAGCAGA TCACCATCCC GGCCGGCGTA CCGGGCCTGC CGGCCGGCAC CTACACCTTG
TTGCCGGCCA ACTACGCGCT GCTGCCGGGC GCCTACCGGG TCGAACTCGG CGCCAGGACG
GATCAAGCCG TCAACCCCGT CGCGCTCGCC AACGGCTCCT ACGTCCTGAC AGGCCATCAG
GCCGTGGCCC ACACCAATAT ACAGGATGCG CTTGCAACCC GGCTGGTGAT CACGCCGGCC
GATGCCGTGC GCAACTATGC GCAGTTCAAC GAACAGAGCT ACGGCGACTT CGTGCTGAGC
ACGGCCGCGC AGTTCGGCAC GCGGCGCACG ATGGTCGAGG CCGATGCCAA GTTCCTCACG
ATCGACCTCG GGCTCGATGC GACGGTCGCA GCGAACAGCG CCCTCAAGGT CAACGGCATC
ATAGACTTTG CCGCCGCAGC AGGCGGCATC GACGGCTCGT TGTTCCTGAC CACCGGAGAT
GCGCTGCACA AGGGCGCGCT GGTGATCACG GCACCGGGCA GCGCGACCGC CAACACCGCG
AGCAGGACCA CGGTGAGCTC CAGCACGCTC GCCGCCGTCG GCGCACCGAA CCTCGTGCTT
GGCGGACGGC CATACGATAA CAACACCTAT GTGAGTGTCG ACAGCGGCAA TCTTGCCCTC
GTCGACTCGC TGACGATCGA AGCCGGGGTC CGGGTGTCCG CCTCGCAAGT TCTGGCCTTC
GCCGGGACCA CGATTACCCT GCAAGCGGGG GCCGAAATCA ACACGCTCGG CCGCGGCTTC
TCGGGTCTCG ATTCGCGCTC GGGACTTGCG TTCGGTACTG CGCCGCAATC TTATCCGAGC
CTGTACACCC TGCTGGTCGT CTCCAACGGC TCGATCATCC TGAACGCCAA TACTGGCCTC
GGCAGCGGCG GAATCGTCAT GAACGACGGC GCCGCGCTCT ACTCCGAAGG CACGATCGGA
TTCTACGCGC CCACCGGGCT GCAGACGAGC GGCAGCATCA AGCTCGGCAC CCGCAACCTG
GCGCTCAGCG TTGCCTCGTT AAACATCGGT TCGGCAGAGG CGCTTGCCGC TGTGCCCAAC
CTTCCGGTCG GGCTGCGGTT CGACCAGGGC TTCCTCGATA CGTTGTTGTC GGGCGGCGGC
GCGGCCGGGG CGCCGGCGGT CGAGCATCTG GTCATGACGG CCGGCCAGTC GATCAACTTC
TTCGGCTCTA GCGATCTGTC GACCTTCGAT CCGCTCACCG GCCGATCGCG GCTGGACGAG
CTGGTTCTGG CGAGCCCCGC TATCTACGGC TATGGCTCCG GTTCGGACAG CGTGCATCTC
GCCACCAACC GGCTGGTGTG GACCGGCGGC AGCGTGTTCG ATCCCGTTGC CACCGGACCT
AGCAGCACCC CGCAGTTCAA GAGCGCGACG CCTCCCGAGC GCCTCGCCGG TCTCGGCAAC
AGCGGAGCCT TGGTGGTCGA TGCCCGCGAA GTCGTGTTCG GCTATCCGGC CGAGGCGCAG
TCCGACTCCA ATCTCGTATT CAACCGCCTG ATGCTCGGCT TCGATACCGC GGTCTTCAAC
GCGGCCGAAC GGGTCACGGC GAACAACAAG GGCACAGTGT CGGTCTATGC GGGCGGCCCC
GCGATCGGCG GCAGCTTCGA TCCGGCCAGC TACCAGGGGG TCGGCGGCAC GCTCGTCTTC
AATACGCCTG CGATCACTGG CGATGCCGGC GCCAACCTCG CGATCTACGG CGGCGCGGTG
ACGCTCACGC AAACCGGAGC CGTGGCAGGT CCCGCCGGCG GCCTCGGCGC GACGCTGTCC
GTCACTGCCG ACACGCTGCG GGTCGATACC GCCGTGCTGC TGCCGGGCGG CGCGCTGACG
CTGACGGCGC GGCAGGATCT GCTGATCGAT TCTCGGGCGA GGCTCGATCT GTCCGGGCAC
GCCGTCGCCA TCGGCCAGGC GGTTAGCCAG GTTTGGGCCG GCGATGTGAC GCTGGTGAGC
CTGAACGGCA ACGTCACGTT GGCGGCCGGC TCGATCGTCG ACCTGTCGTC CGACGGTAGC
GACGCCGGCA CGTTCAAGCT GCGCGCCGCC GGCGCGGCGC GGTTCGCCGG CGCGCTGCTG
GCGAGCGGCG GCGGCAACGG CGCCCGTGAC GGCGCCTTCG ATCTCGCCGC GGATTCGCTC
GGCGGCGGCA ATCTGAGCGC CGACTTTGCC GCGCTGAACC AGAGGCTGAA TGCGGAGGGA
TTCGCCTACA GCCGCGCCTT TGTGTTCGGC AGCGGCAACC TGGCGGTCGG CAGCGACGTG
AAGGCTCGCA ACGTGACGAT CACGGCCGAC GGTGGCAGCC TGACGGTCGA TGGTCGAATC
GATGCCAGCG GTCTGCGCAG CGGCGCGATC CGGCTCGCCG CGCGCGACGA CGTCGTGCTG
ACCGGCAATG CGGTGCTCGA CGTGCACGGC ACTCAGCTCG TCGTCGACAG CTACGGCCAG
GCGATCGAGG CCTCCAACCG CAACTCGATC GAGCTCGGCA CCAGCCAGGG CTGGATCCGG
ATCGGCGGCG GCGCGACATT GGACACGTCG TCACCGGACG GCATTTCGCG CGGCCGGGTC
GAGATCAACG CTCGCCGCAG CTCCGAGACC GGCGGCGATG TCAACATCGA TGCGCCGTCG
CCGCTGAACC TGCGCGGCAT CGCCAGCCTC GCTGTCAACG GCTATTGGAC TTACAGCCCG
ACCGATGCCA ACGGAACCAT CGTGCAGGAC GATACAGCGG CCGGCGTGGT GTCCGGCGCG
GTTGGTTTGG CGCAGATCGA TGCGACCAAC CAGACCTTCT ACGCCAATGC GCTCGGCAAT
AACGCTTTGC AGGCGAGGCT TGCCGGCTTG AAGACCGTCG GCGCCGCCTA TCACTTCCGA
CCCGGCGTGG AGATCGTCTC CAGCGCGGCG TCGGGTGGCA AACTGACGGT CGTCGGCGAC
CTCGATCTCG CCGGCTTTCG TTACGGACCG AACGCCGACC GTAACACCGC CTCGGCGACC
TATGGCTTCG GCGAGCCGCT CGCGCTGATG ATCCGCGCCA GTGGCAACCT CACCATCAAG
GGCAGCATCA CCGACGGATT CGCCGAGCCT AAATCGAGCC CCGACGAGGT CAATTTCCGC
GACCGGCCGC AGAACGGCGT CGTCACGGTG ATTCCGGCCT ATGCCCCGCC TTACGATGAT
TACACGCCGA CAGAAGATAT GACTCTCGGC GAGGATTGGG TGCTCCCTTA TTATTGGTAC
GATGGCGAAT CCAGATTGTA TGTCAGTACC GACCGTGGTT ACTTCTATTC GGGTGACCTA
ATTCCCGCCG GAACGAAAAT TCTGTGGACC GGTGACACCT ATTTCTACTT CGGCGGGGAT
CAGCTTGTGC CGGCAGCCGC CGATATCGTC CGTCTGTCGC AGGGCAAGAT CTATGCGATC
GCGCCGATGC TGGCGCCGGG CGCCATGTCA GCGTCGATCC GCCTGGTCGG CGGCGCCGAT
CTCGCCTCCG GCGACAGCCG CGCGCTGCTG CCCGCCGATC GGCTTGGCAA CTCCGGCAAT
ATCGTGCTGG ACGATTACCA CACCCACTCG CCGACCAAGA TGGGAGCGGA CGTGCGACAA
ACCGAAATCG CCAGCGTGAT CCGCACCGGC ACCGGCGACC TCGATCTCCT GGCCGGTGGC
TCACTGACCG AGCGCTCGCT GTACGCGGTC TACACGGCGG GCACGCAGCA AGATCTCGCC
AGCGGCAACA GCGCCTACAT CATCAACCAG CCTTCGATCC AGTTGCCGAG CGGCACCGGG
CTCGAGTCCA CCTTCAACGA TCGCGCGAAT TACTTCCCGG ATCACGGCGG CGATCTGCGA
GTGACGGTCC AGGGCAATCT TGCCGGCTAC AACGCCCAGC TGAGCCTGAT GCGCGACGGC
GGTGAGTGGC TGATGCTGCA GGGTGCGCCG GATCTCGGAC AGGCCACCGG CTGGTCCGTC
AATTTCGGCA ACTACAAAGG CGCGGTTTAC GTCAGCGGGC CTCTCGGCTT CGGAACTTAT
GGCGGCGGAA ATGCCACCGT GGTCGTCGGT GGCGCTGCCG GTGGCATCAC CACGACAAAT
ATCTATGCGG CCAATCGGGT GACCACCGGC CTGTCGGTTG CGGTCGGTGC CACTGGCCGC
GTGCTGGCCG ACGGCACGCT GATCGAGACC GGCGGCGGCG ATCTGTCCAT CCAGGTCGGC
GGGCGGTTGA ACCCGCTCAG CGACGGCGGC CAGATCACCA ATCTGCGCGG CAGCATCACG
CTACAAGCGG CGGAGATTGG CAGTCGAGGC AAGGTCTATG CGACGTCGAT CGCCGGTGAT
CCGCGCCCGG CGGAGTTCAC GAGCGGCGCC AACATCGCTT ATTCGTTCGC CGGTCCGTCG
CTGCGGCTTG GCGACGCCGC AGCGAGCATC CGCACGCTTG GGACCGCCTG GCTCGGTTCG
GTAGGCGAAA TCGGCATTAA TCAGGCAATG AACCCGAATG CATATGGCGA TGTGGCGGGT
CCGGCCAACA CTTCGATCGT GACCTGGTTC TCGCTCTTCC AGCCCTCGAC GGCGGTCGAC
ATGGTGTCAC TCGGCGGAGA TCTGGCTCCG TTTTCGGCGA ATTCCGGGGA CGGCCTCTAT
CTGCCGCCGC AGTTCACGGC GGCGGCGCTC AGCGGCAACA TCTACGCCAG CGCCGCTAGC
AAGCAGGTGC CCGCGGCACG GGCCGGCTTC GAATTGCTGG CGCAGGGCTC GATCTACGGA
TACGGCAGCA TCGTGATGTC GGCGGCCGAT CCTGCGCTGC TGCCGACGCC GTTCCATCCC
GCCTTTGCGC AAGTGCGGAG CTACTCTGTC TTCTTGACCA ATGCGACCTC CCAAAAGAGG
GGAGATCTGT TCGCCTTCGC CCCCGACCAG CCGACCTCCG ACGTTCACGC CGGCGATCCC
GAGCCGGTGC GGATCTACGC CGCGAATGGC GACATCGTCG GACTTTGGCT GACGACCGCC
AAGAGGCTCG ACATGGCGGC CGGCGGCGAC ATCGTGATGA CCAAGGACTC GCAACTCTCC
AGCAGCTTGA CGATCACCCA TGCGAACGCC ACTGACGTCT CGACGGTGAC GGCCGGCGGA
AAGATCATCT ATCCGCGTAT CACGGCGAAG GGGCCCGGTG AGGTGTCGAT CACCGCCGAC
AGCGACATCT TCCTCGGAGA TCTCGGATCC ATCGTCAGCA CTGGACCGGC CTATGCGGCT
GACCAGAGGC CGGGCGCGTC GATCGGGCTG CTCGCCGGTG CCGGCGCGGC GGGGCCGAAC
TACGCGGCGA TCGCAACGCT CTATCTCGAC CCTGCCAATC GCGCGGTGAG CGGCACGCCG
CTCGCAGATC AGGCCGGAAG GGTCGCCAAG ACCTACGAGG CCGAGCTGCG GCAATGGCTG
GCCATACGCT ACGGCTACGT CGCCGCCAGC GACGCCGATG CCCGCGTCTA TTTCGGCAAC
CGCGCCCCCG AGGAGCAGGC GATCTTCCTG CGCCAGGTGT ATTTTGCGGA GTTGAAGGTG
GCGGGCCGAG AATACAACGA TCCGGCGAGC GTTCGCTACG GTTCTTACTT CCGCGGCCGG
CAGGTGATCG CGACGCTGTT TCCGGACCGC GACGCAAGCG GCGAGCCGAT CGCCTATGAC
GGTGATTTCG TGATGTTCGG CGACAGCGGT GTGCGCACCA TCGCCGGGGG GGACATCACA
GTCTTGACGC CGGGCGGCCG CCAGGTCCTC GGCGTCGAGG GTAAGGTGCC CGCGGGCACC
GCCGGTGTCA TCACCCAGGG ACAAGGCGAC ATCAGCCTCT ATTCCAAGGG CTCGATCCTG
CTCGGCCTCA GCCGCATCAT GACGACTTAC GGCGGCAGCA TCCTGGCCTG GTCGGCGGAA
GGTGACATCA ACGCCGGTCG CGGCTCCAAA ACCACGTTGG TCTATACGCC TCCCAAGCGT
ATCTACGATC GTTATGGCAA GGTCACGCTG GCACCACAGG TCCCGTCGAC CGGCGCCGGC
ATCGCCACCC TCAACCCGGT GCCCGGCACT ACGGCGGGTG ATATCGACCT GATCGCGCCG
CTCGGCACCA TCGATGCCGG CGAGGCCGGC ATCCGGGTCT CGGGCGATAT CAACTTGGTG
GCGCTGCAGA TCGTCAATGC CGCCAATATC CAGGTGCAGG GGACCTCGTC AGGGCTGCCG
ACCGTTCAGG CGCCCCCGGT CGCTGCGCTC ACTGCGGCTG GCAACGTCGC TGGTGCCGCC
ACCCCGGCGG CTGTTGCTCC AGCCCAGGCC AACGACCGGC CGTCGATCAT CCTGGTGGAG
TTTCTGGGCT TCGGCGGCGG AGACGGCGGC AGCCAGCCCG CGCCACCGCT GCCGCGAGAT
CGCCGCAACG ATCAGGATCG CCAGAGCTAC GACCCTGACA GCGCATTTCG CGCAGTCGGC
CATGGCCGCC TGACTGAACA GCAACGCAGT CAACTCACTG CGCGGGAACG CGCCAAGCTC
GACGAGTTAA TGACGCGGGC GGAGGTGCGC TGA
 
Protein sequence
MPAVSPIELR ETMPVANAAS RMLFGSPGAT RRLAAHAALL VTVSASALLL CAPPASARSL 
GGGATLSAPT LAVDAAAQAA AQAASAAAQG SASLTRATTA IQAMQAAQAA ARAAASSSGG
VPNGLTVGGL VPDSGLAAGG IARPVVSWTN ARTPMQSGPS DAPTVTVQQT GAQAILNWSS
FNIGANTSLV FDQQGNSSWV ALNRVGASSS PSRILGSIRA DGSIYIINQN GIIFGGASQI
TVGSLIASAA SITDNQFLTY GIYSTAINKV YNPSFTAAGG TIVVEAGASI ATAAPKTVTS
GGGSVILLGT EVRNAGSIST PKGQTVLAAG DSFILRSGYA TDGNTTSTTR GVEVAPVRAS
NSQSGAVGNS GLVFAQQGDI TLAGHSVTQD GILVATTSVD QRGTVHLLNS ITDPTGSVTV
ADGALTAVLP ELDSKVTALD SQRDALIADS TKQNSGRAAA ANDAPQFNNL SRLDDRQDQS
RVEIVSGGLV TFRRNSLTLA QGGQIAVSSV SDIRVETLAT LDVSGTTGTV LPASANNLKV
NIQPNEMRDS PVNRDSGVLV SKDAWIDIRD LVLVPAGTGG YASDRYYTKG GLLEVGGYLG
NTGHTIGEWT AVGGTVTLAA GRKVIAERGA LIDLSGGMVN YAAGDLLTTM VIGADGRIYS
IGNAPADMPI ISYGSGFTRD HARWGQKETW GAPAHGAVRS FHQDAYSVGR DAGLLQILAA
TSDFKADIDA AVYTGERQTT SRPSTKTDGY KLTQTQPGLP GQLLVGTVDI GLGNGGTQTS
GIVLFAGPGA TQVTAPSHVF DAAEISAMGL GGLSVFGSSE IEGDLTLAPG AIVSLSVNHL
GGNITARGGS VTLSVSSFDP GVTVDTRGLW TNALLDPTRV SGLAYRNGGA ITVSGSPLNG
YDLVIPAGVR LDASAGGAIL STGKFSGGNG GDITLGGNSL ALNGEVVSNG FGKGGKLALS
TSGAMVIGPS PLPGGTLAAG EAAPVLLVLA SPVTLARGTV APFSITMTIT SVAGGQLVPA
GAQAQVSSTA VVTVGSAGWT VPAGIFYAMD TTFHYYYPGS TVPAGTILQT MAGNFSSGYV
LPPNSFTTPL RLSPINVNFA AGSVLAYDAV LSKGYVLPAG AVLPQSVEVA PVLALDPSLF
QKGFSSYAIS SGLGITVQPG LQIEVVEPTY QFSRALVDLP TGGRVDTAAS LALAPLFAED
ALRRTLNQRP GADLVLTARA AGLYSSVALA NEHAAAPITI GSGASISVDP GRSVSLFGDD
QITIEGEITA RGGSISAFNL TPSAPLTANS SFGRSVWIGS NAVLDVSGIA HAATDSRGRR
YGTVTDGGSI VLGTDETPTT DGMNSGNAFV VIRPGALIDA SGASAVLDVL GEGPALVAGN
GGSIRLSSLW GFAIDGTMRA AAGGVGASGG SLSLTLESPI VAIGAGASST LVQAATAGRV
IAITQDTTPT GLTDGLAPGV ADAGLVLGSG RLSAAQVKAG GFDSLSLWGR SAIAFDGDVN
LSLGRSLVLQ QGKIVNTAAT GKVTLAAPYV FIDGHTLQAN TLHPEWTTMF ADRLPVYAKG
ASFSVSADLV DIRNLASLAY EMASVASSGD IRFLKATERA NARLVDYVTI LAAGGDLDLT
AAQIYPASGA VALAAAGGIE TAGDTFGFTR FNDAGSRLTV HGTGAVPDMP YSVFGNLTLL
AETIEQGGVV RAPMGQIQFG DFATPAGRAV NTTRLIEFMP GSITSVSADG LIVPYGGTAD
GIRYTVDGGA PVTPSLVNGL YRTGASGSTS LYVGVGVAGI SVQTDAGSLI DLSGGGTLAG
AAFISGRGGS VDPLLTPLAN AGIHGFSSAG NKVYAIVPGT FTAPAAGGYN NLWTGGVPTI
GQQITIPAGV PGLPAGTYTL LPANYALLPG AYRVELGART DQAVNPVALA NGSYVLTGHQ
AVAHTNIQDA LATRLVITPA DAVRNYAQFN EQSYGDFVLS TAAQFGTRRT MVEADAKFLT
IDLGLDATVA ANSALKVNGI IDFAAAAGGI DGSLFLTTGD ALHKGALVIT APGSATANTA
SRTTVSSSTL AAVGAPNLVL GGRPYDNNTY VSVDSGNLAL VDSLTIEAGV RVSASQVLAF
AGTTITLQAG AEINTLGRGF SGLDSRSGLA FGTAPQSYPS LYTLLVVSNG SIILNANTGL
GSGGIVMNDG AALYSEGTIG FYAPTGLQTS GSIKLGTRNL ALSVASLNIG SAEALAAVPN
LPVGLRFDQG FLDTLLSGGG AAGAPAVEHL VMTAGQSINF FGSSDLSTFD PLTGRSRLDE
LVLASPAIYG YGSGSDSVHL ATNRLVWTGG SVFDPVATGP SSTPQFKSAT PPERLAGLGN
SGALVVDARE VVFGYPAEAQ SDSNLVFNRL MLGFDTAVFN AAERVTANNK GTVSVYAGGP
AIGGSFDPAS YQGVGGTLVF NTPAITGDAG ANLAIYGGAV TLTQTGAVAG PAGGLGATLS
VTADTLRVDT AVLLPGGALT LTARQDLLID SRARLDLSGH AVAIGQAVSQ VWAGDVTLVS
LNGNVTLAAG SIVDLSSDGS DAGTFKLRAA GAARFAGALL ASGGGNGARD GAFDLAADSL
GGGNLSADFA ALNQRLNAEG FAYSRAFVFG SGNLAVGSDV KARNVTITAD GGSLTVDGRI
DASGLRSGAI RLAARDDVVL TGNAVLDVHG TQLVVDSYGQ AIEASNRNSI ELGTSQGWIR
IGGGATLDTS SPDGISRGRV EINARRSSET GGDVNIDAPS PLNLRGIASL AVNGYWTYSP
TDANGTIVQD DTAAGVVSGA VGLAQIDATN QTFYANALGN NALQARLAGL KTVGAAYHFR
PGVEIVSSAA SGGKLTVVGD LDLAGFRYGP NADRNTASAT YGFGEPLALM IRASGNLTIK
GSITDGFAEP KSSPDEVNFR DRPQNGVVTV IPAYAPPYDD YTPTEDMTLG EDWVLPYYWY
DGESRLYVST DRGYFYSGDL IPAGTKILWT GDTYFYFGGD QLVPAAADIV RLSQGKIYAI
APMLAPGAMS ASIRLVGGAD LASGDSRALL PADRLGNSGN IVLDDYHTHS PTKMGADVRQ
TEIASVIRTG TGDLDLLAGG SLTERSLYAV YTAGTQQDLA SGNSAYIINQ PSIQLPSGTG
LESTFNDRAN YFPDHGGDLR VTVQGNLAGY NAQLSLMRDG GEWLMLQGAP DLGQATGWSV
NFGNYKGAVY VSGPLGFGTY GGGNATVVVG GAAGGITTTN IYAANRVTTG LSVAVGATGR
VLADGTLIET GGGDLSIQVG GRLNPLSDGG QITNLRGSIT LQAAEIGSRG KVYATSIAGD
PRPAEFTSGA NIAYSFAGPS LRLGDAAASI RTLGTAWLGS VGEIGINQAM NPNAYGDVAG
PANTSIVTWF SLFQPSTAVD MVSLGGDLAP FSANSGDGLY LPPQFTAAAL SGNIYASAAS
KQVPAARAGF ELLAQGSIYG YGSIVMSAAD PALLPTPFHP AFAQVRSYSV FLTNATSQKR
GDLFAFAPDQ PTSDVHAGDP EPVRIYAANG DIVGLWLTTA KRLDMAAGGD IVMTKDSQLS
SSLTITHANA TDVSTVTAGG KIIYPRITAK GPGEVSITAD SDIFLGDLGS IVSTGPAYAA
DQRPGASIGL LAGAGAAGPN YAAIATLYLD PANRAVSGTP LADQAGRVAK TYEAELRQWL
AIRYGYVAAS DADARVYFGN RAPEEQAIFL RQVYFAELKV AGREYNDPAS VRYGSYFRGR
QVIATLFPDR DASGEPIAYD GDFVMFGDSG VRTIAGGDIT VLTPGGRQVL GVEGKVPAGT
AGVITQGQGD ISLYSKGSIL LGLSRIMTTY GGSILAWSAE GDINAGRGSK TTLVYTPPKR
IYDRYGKVTL APQVPSTGAG IATLNPVPGT TAGDIDLIAP LGTIDAGEAG IRVSGDINLV
ALQIVNAANI QVQGTSSGLP TVQAPPVAAL TAAGNVAGAA TPAAVAPAQA NDRPSIILVE
FLGFGGGDGG SQPAPPLPRD RRNDQDRQSY DPDSAFRAVG HGRLTEQQRS QLTARERAKL
DELMTRAEVR