Gene Pfl01_3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPfl01_3171 
Symbol 
ID3712931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas fluorescens Pf0-1 
KingdomBacteria 
Replicon accessionNC_007492 
Strand
Start bp3638770 
End bp3651282 
Gene Length12513 bp 
Protein Length4170 aa 
Translation table11 
GC content64% 
IMG OID 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_348900 
Protein GI77459393 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.336923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCTC GCCCATCCCG TCGCAGTACC CGTGTTCAGC CTCAGCAGCG CATGGCCGCC 
GATCCGGCGC TGTGGTTGCT CAAGCCCCTG GCGCAGGCGA TTGCCTTGTG TCTGGTGGCA
GGCAGTGCCG AGGCGGCGAC TGCGTTCAGT TCGGGATGGT TTGCCGCCAA GGGCGCGGCG
CAACAGGCCG CCGCAGCGCG CCCGAGCGTG GGCGGATTGC CGGGGATGAC GCCACCGCTG
GCGCAGCAGC AAAAGGCCAA TCAGCAGCTG CAACGCTCGA TTCAGACCCT GAACAACACC
GTGGCTGCGA TTGCGGCGCA ACAGGCCGCG CAGGCGGCCG GTCGTGCGGC GGCGCTGGGC
ACGGTGCAGT TTGTGCCTGA CGGTCTGGGC GAGGGCGGCC TGAAGGTCGA TAACAGCCTC
ACGCAAGGTT GGCAGAACGC CAAGGGCCCG CAGCAGACTC AGGTTGATGG CAAGACCACG
GTGAAGATCG AGCAGACCGC CGACAAGGCG ATCCTCAATT GGGAGACCTT CAACGTCGGG
CGCAACACCA CGGTCGACTT CGCCCAGCAG TCGAACTGGG CGGTGCTCAA CCGGGTCAAC
GATCCGAATG CGCGGGCCAG CCAGATTCAG GGCCAGATCA AGGGTGACGG CACGGTGATG
CTGATCAACC GCAACGGCAT CGTGTTCAGC GGCACCAGTC AGGTCAACGT GCGCAACCTG
GTGGCGGCGG CGGCCAACAT CACTGACATT CAGTTCCGCG ACCGCGGCCT GTATTTCGAC
AGCACCGGCA GCCAACCGAC GTTCACCGAG GCGGCCGGCA AAGTGCTGGT GGAGCGCGGC
TCGTCCATCG AGACCGCGCG GCCGGCCAAA TCGACCGATG CTGGCGGCTA TGCCTTGCTG
CTCGGCAGCG AAGTGCAGAA CGACGGCAGC ATCAACACCG CCAAGGGCCA GACCGTGCTC
GGCGCGGGGG ATCGCTTTTA CATTCGCAAA GGCTCGGGCA CTGAAGGCAA TGCCTTCTCC
ACGACGTTCG GCAACGAAGT CACGCCGGGT TTCAAGGCCG GCAGCACTGC CGGCAAGGTC
AGCAACAACG GCCTGATTCA GGCCGCGACC GGCGACATCA CCCTGACCGG CCATGAGGTC
GTGCAAAACG GCGTGTTGCT GGCCAGCACC TCGGTCGCCA CTCGCGGCAC GATCCACTTG
CTCAACCCGG CGACCGACAC CCAAGGCAGC GTGACGCTGG GGCAGGGCAG CGCGACAGCG
ATCCTGCTCG ACAGCAGTGA CCTGACGGCC CTCGACAGTC AGCATCAGGC CGCCCTGACC
GGCGTGACGC CCAACAACAG GATTCGCGGC GATCAGTCGC GCATCGACAT CCAGAGCGGC
GGCAGCGTCG AGTTCCAGAA CGGCTCGATC ACCCTGGCCA CTGGCGGGCA AGTGGTGGTG
GCGGCGCAGC GCCGCAGCCT GGTGCGTGAC GGCGCGATGA TCGATGTCTC CGGTGCCACC
GGCGTGAAAG TCGCGATGGA ATCCAACAGC ATCAAGATCA ACGTCCAGGG CAACGAACAG
CGCGATGCGT CGGTCAATCG CGACAAGGGT GCTCTCAACA GCAATGACGT GTGGGTCGAT
GTACGCGAAC TGGTTTACGT CCCGGCCGGC ACCAACGGTT ACGTCACGGA TCGCTGGTAC
ACCGCCGGCG GTTTGCTGGA GGTCGGCGGC TATCTCGGCA CCCAGGGCCA CAGCATCGGC
GAGTGGATGG CTCAGGGCGG CACGGTGAGT TTCGCCGGCA ATGACGTGGT GACCGAGAAG
GGCTCGCTGA TCAACCTGTC CGGCGGCACG CTGGACGTAC AAAGCGGCGA GATTCGCCAG
AGCTGGCTGC GCGGGGCCGA CGGGCGCCTG TATGAGGTCT CAAAGGCGCC GGGGGATCTG
CTCTACACCG GGCTGTACAA AGGCTTTGAA GACAGCAGTG AGCGCTGGGG CCAGACGAGT
TATTACTACA ACCCGTTGAT TGCACCGTCC AGTCGTCGGG AGTCGGGCTA CACCGTGGGA
CGCGATGCCG GGCAACTGGT GATCGGCACC CGCAACGCTG TGCTCGAAGG GCAAATGCTT
GGCGAGGTTT ATCAGGGCGA GCGCCAGATC CAGGCACCGC GTGCCGTACT CGATGGCTAT
GCGCAGAGTC AGACGGCGCT GGCGCGGCGC GGGCAGTTGA TTGTCGGCAG TTATGACCCG
GCGTACGTGG CGGCTTCCGG TGGTTTGCTC TATGGGCTTA ACCCTTTGCT CGAACAGGTG
CAGATCGGCG GTGAGCGCCC GGTCGCCGGC AATGATCGGG ATCTGACCGG CGCCGTATCC
GATGCGCGCA ACGGCAAGCT GTTCCTCGAT GCCGGGCAAC TCAGTGACTG GCAACTGGGC
GCGCTGAAAG TGGCGGCCAA GGAACGGATC AACGTCCAAG GTGCCGTCAC CGTCGACAAT
GGCGGCGACA TCACCCTGTA CGGGCCGGAC GTGCAGGTCA ACGCAAACCT GACCGCGCGT
GGCGGCAGCC TGCATCTGGG CAATGTGCTC AATCAGCTCA ACACCGATCG CCTCACCGAC
ACCCGACTCG TCGCGCCGAC CGGCAAGGCC ACCCGCGTGA GCGTGGCTGA AGGGGCACAA
CTGGACACTC GCGGTCTGTG GAGCAACCTG CGCACCAACC CGGACGATAC AGCCAGCCTG
GCGTATCTGG ATGGTGGCGT GGTGTCGATC CGCAGCAGCA GCGACATCGA GGTCGGTGCC
GGCAGTCTGC TGGACGTGTC TTCCGGCGGG GCGATTCTGG CCAATGGCAA GACCCGCGGA
GGCAAGGGCG GTGACGTGAC GCTGGAGGCC GGCGCCTTGA CCGCGTCGGG TGACAGTCAT
CTGACTCTGG ACGGCCAGCT GCGGGGTTAC GGTGTGTCTG GCGGCGGCCG TTTGTTTCTG
CAGACCGGCG ATGTGCTGAT CGGCCAGACC AATAAGCGTC AGGCCGACTC TGTGTTGCAG
CTGACGCCGA CGCTGTTTGA AAAAGGCTTC TCGACCTACA ACCTGATTGG CCTGAACCGG
ATGGAAGTGA CGGATAACAC CGTCATTGAT GTGTCGATGC CGGTCTATCG CTTCAGCGAG
GCGGCTGCCA ATCAGGTCAG CGGTGCCGCG CCGCAGCAGG CACTGGAGCT GTGGACGCCG
CCGCTCTATC AGGAAAATCC GACCAAGGCC CAACTGACGC GGCGTGCCGG TGCCAGCCTG
ACGCTGCAGG CCGGTACCCG CGAAGTCAGG TTGCTCGATC CTGCGCACAC GACCCTGAGC
CTCGGCCAGG GCTCGCGCAT CAGCGTCGAC CCCGGGCAGA GCATCAACTT GCGCGGCGCG
GGACAGATCA CCCTCAATGG CGAACTGAAT GCCTGGGGTG GCACCCTCGA TATTCGCCAG
CAACAGTTCG GTTCGAACGA CGTCGCCGAC AACTCACAAC TGGCTGACAA CGCAGCGCAC
AACCGCTCGA TCTGGATCGG CGAACACGCG GTGCTGGATG TGGCTGGCCG CGCGGCCACG
GCGGTGGATG CGCTGGGTCG TACTTACGGG CTGGTGGGCA AGGGCGGCAC GATCATCGTC
GGCGGTGAAA TCGATGCGAA AAAAACCACG GCAACGTCGA CCGACGCCTA TGTGATCGTG
CGTGAGGGCG CACGACTGGA CGCGTCCGGT GCTCAAGCCA CGCTGGATAT CGCCGGGCAG
GGCCCAACGC CGATAGCCAC CGATGGTGGA CGAATCAGCC TCAGCTCCTA TAACGGACTG
TTCATCGATG GCAGCCTGCG TGCAGTCGCC GGTGGTGCGG GCGCAGCCGG CGGGCGTCTG
GACATTGCAC TGGAAACGCC GCTCTACGAC CTCAACCTCG CCGCCAATAA AGTGCGTGCC
GCCAGGGAAA TCATCCTTGG ACAGGACGCA GGCAAGTCAC TGCTGCCGCA GGATCTGCAA
CCCGGTGCCG CTGCTTCAGC CTTGCAATAC GGTCGTGCCC GCCTGGGCGC CAATCAGTTG
ATGGCCGGCG GTTTCGACAA CCTCAGCCTG CTGAGCAATG GTTTGCTGTC GTTCGACGGC
GATGTCGATC TGCACAGCCG CCAGAGCCTG AGCCTGTATG CCGGCGCGTT GGCGTTGGCC
GACGGCGCCA GAGAGACTGC GCGGATCAAT CTGTCAGCGC CGTATCTGCG GTTGTCGGGG
ACCGGCAAAT ACTTCGACGC GCCGGGTTAT CTGCGCCCGC GAGTCATGAA CACTCCGACG
ACCCATGTGG CACCCGCCAC GTTCAACGCC ACGGCGGACC TGCTGGATCT GGGCAACAGC
CTGTCGTTCG GCACGGCGGG GAAGATTGCG TCACTCAACG GTGCGGCAAT CGAGGTCAGT
CGGCGCGGCT TTGATCAGGT GACCTTGCAC AGCCGGGGGG ATCTGCGCTT TCTGGCTCCG
ACCGGGTACA CCACTAAAAC GGAGTTGTGG ACGCCTGGGG ACCTGAACCT GCAGGCCGCG
CAAATCTATC CGGCCAGTGA TGTCACTGCC GAGGTGCGGG TCGGTTATCA GAGCGTGCAC
CCTGGCGCCG ACCCGGCGCG CACCTTGCGC ATCACCCGCG CCGGGAGCGC CCCGGCCGTC
GTGCCTTACT CGGTGTTTGG CGACTTGACG CTGGCTGCCG GGCACATTGA GCAGGGCGGC
GTTATTCGTG CGCCGCTGGG CATGATCCGC CTGGGCGACG AGACCGTGGT CAACACCCAT
GACCTGCATT TGTTGCCGGG CAGTGTGACG TCGGTCAGCG CGGACGGACT GGTCATGCCC
TATGGCGGCA CCACCGACGG CATCGACTAT CGATACGCTG GCAAATCGGT TGTGCTCAAG
GGGATTACCG CAGAAACCGC CGGTGTAACG CTGACCTCGC GGTACGTCGA TGTGCAGCAG
GGAGCATTGA TCGATTTGTC TGGCGGCGGC GACCTGCGAG GGGCCGGGTT TGTGTCCGGG
CGGGGCGGCT CTACCGATGC ACGCTTCAAT CCGCTGGTGC GCAATGCCAC CGACGGTACG
TTCAGCTTGC CAGGCCTGGC CAGCAATCCG GTGTACGCCA TTGTGCCCGG CAATCAGAGC
ACCTACGCGC CGATGCTCGC CGAAGCAGGA GCGGTTGATC CGCGCATCGG CCAGCAAATC
ACCCTCGGCT CCGGCGTGCC GGGGCTGGCG GCAGGCACCT ATACGCTGCT GCCCTCGACG
TTCGCCTTGT TACCGGGGGC GTTCCGGGTC GAGGTCAACG GCCAGGCCGC AGCGGGTGTC
ACGTCTGGCG CGTTGCCTTT GCGCAATGGT TCATGGACCA GCTCGGGGTT GATGTCGATC
GCCAACACCG GGCTGCGTGA CAACCTGGCC AGCCAGGTCA TCCTGACCTC GGCCGATGTC
GTGCGCCGTT ATTCGCAGTA CAACGAAACC GGTTTTACTC AATTCATCCG CAGTGACGCT
GCACGGCGTG GCGTGCCACG GGCGTTGGCG CCGATGGATG GCAAGTCGCT GAATCTGTAC
TTGATGTCGG GCGCCGAGCA GGGCATTGCG CTGGATTTCG CCGGACAGGT GTTGTTCAAA
CCGGCCACTG GCGGCGTGGT CGGGACGGCG GTGGTGAGGG GCGCGCGGGA GGTGGAACTG
CTCGGCGACG GGCATCGGCG TACCGAGGGT TTCAGCGGCG TGTCGCTGTA TGCCGACAGC
CTCAACAGAT TGGGTGCAGG CCGTCTGACC ATCGGCGCGC AACCGGCGAT CGATTACAAC
ACGGCGGGCA ATATCGTTTC GTTCATCGGT GATACATCGA ATATCACCCT GCGTGAGGGC
GCCATGCTGT CTGCGCCTGA AGTGCTGCTG CGTACCACCA GTACCCGTGG CAGCATCACG
CTCGAAGCCG GTTCGGGGAT CAACACTCTG GGGCGCGGCG ACGCACCGTA CGATTCATCC
GCAGGCTTTA TCTATGCTCC GGGCACGGCA GGTCTGCTGG TGGTCTCGAA CGGTTGGACC
AACGTGCTCG CGCCGTCGTT GTCGACCCCT ACTTCCGGCG CGGGGGATAT TCGCATCGGC
AACTGCCCGA CGAACGCCTG CACCAACCCG ACGCTGCTTT ACTCCAATGG CAGCATCACG
GCCGCCACAG ACAAACAGTT CGAGCTCGAC GAAACGGTAC GCTTCGGTAC TCGCCATCTG
ACGCTGGCGG TGGGGACGAT CAATGCCGGC AGTGCCGAGG CCTTGAGTGC CGCCGGCAGT
CGTGTGCCGA CCGGCCTGAC CCTGAACCAG AACGTGCTCG ACCGGCTGTT GCGCGGTGAT
ACGCAGTTTG GCGCTCCGGC CCTGGAGACC TTGAGCCTGA CCAGTCGCGA TGCCTTCAAT
TTCTATGGCA GTGTCAGCCT CGACACGCTG GATCGGCAGA CCGGCGTCAG CAAACTGCAG
AATCTGATAC TCAGCACACC GGCCATTTAT GGCTTGGGCC AAGCCGCAGA CGTTGCGAGC
ATCCGCACTG CGAATCTGGT CTGGAACGGC GCCACGCAAG CGCCCGCTGC TGTCATCTCT
GGCGGCGCCG GCACTGGTCA GGGCACACTC GACATTCAGG CCCAGCGTAT CGAATTCGGC
TACGGCCCCA ACCCGCAGGC CAGTGGACTG GAGCAGAACG ATCGCCTGGT GCTGGGCTTT
GCCAACGTCA ATCTGACCGC CAGCGACCAC ATCACCGCCA ACCACAAAGG CAGCCTTGCG
GTGTATCAGG CGCAAGGCGC TTTCGACCCG GTCGGGGGCT ATGCCTACAG CGGCGGCAAC
CTCAACCTGC GCACCCCGTT GCTGACGGGG GAGGCGGCCT CGGTGAACCT GCTCAAGGCC
GGTAACAACC TGACTCTCAC GGGCGCCGGT ACCCCAGCGG CGGCCAATGT CCTGGGCGCG
GAACTGACGC TCGATGCGCG TAATGTCACG CTCGACAGTC GCATTGCTCT GGCCAGCGGC
AAACTGGTGA TCAAATCGGT AGAAGACCTG ACCCTGAACA GTGGCGCTTA CCTGGATATG
GCCGGGCGTA CCCTGGCGTT CAACGATGTG AACAAGTACA GCTGGGGCGG TGATGTGGCG
CTGTACAGCA GCCACGGCAA TATCCGTCAG CTCGCGGGCT CAAGCATCGA CCTGTCGGCG
CAGAACAATC AGGCCGGCAA TCTCAGCGCC ATCGCCTTGG CCAGCGATGC CGGGATGGTC
GATCTGCAGG GCCAGATCCT CGGCGCCAGC AGCGGCAGTT ATGACGCGGG TGGCACATGG
GTGCCGTACA AGGCCGGTGG CGTGGACATC CGCGCGCAAC ACTTGAATGG CGATCCGAGC
CAGCAATTCG CGGCCCTTAA CCAGCGCTTG AACGCAGGGC AGGTTTTTGG CAGCCGCAGC
CTGCAACTCA AGCAGGGCGA TCTGTTGATC GGCGACGGGC TCAAGGCCGG TGAGGTCAAT
GTCTCGGTGG ACAACGGTAG CCTGACCGTC GCGGGTCTTG TCGACGCGAG TGGTTCAAGG
GTCGGTTCGA TTCGTCTGTC GGCGAAGAAC GGCCTGACGG TGACTGGCAG CGCCGTGCTC
GACGCCCATG GCCGTGTGCT GCGGGTCGAC AGCTACGGAA AAATCATCGA TGCACCGAAC
CGGGCGATGG TCGAGCTCAA CTCCGGCGCC GGAACGCTGA CGCTGGGCTC CGGCGCACGG
ATCGATCTGC GCCACGGCAC CGACGCTGCA CCGGGGCCAT TGCCCGGGCA GTCCGATGGA
TTGCCGCGCG GTACGCTGGA GTTGAACGCA CCACGTCTGG CCGGCGGCGA CATCGCCATC
GATGCCAGCG GCGCGCTGAA CATTCAGGGC GCCCGTTCGA TCGGGCTCAA TGCCACGCGC
CGTTACAGCG ATGCCAGCGA CGGTGCAGAC CCGGCGGCCA GCGGACGTCC GTATCAGGTG
ATCGATCAGG CCTACCTTGA CCGCATCCAT GGCGACAACA CGGCGTTCAT CGATGCCGCG
TTGCTCAACA GCAACCTGCT GCAGAACAAA CTGGCCGGCC TGAACAATGC GTCCTATGCC
GATGCCTTCC ATCTGCGACC GGGTGTCGAG ATCGTCAGCA AAACCGCCGA CGGCGATCTG
GTGGTGCAGG GCGATGTGGA TCTGTCCGGT TACCGCTATG CCAGTCTCAA TCCGCATACG
CAGAAAACCG CAGCCTATGG CTCTGGCGAG TCTGGCAGCC TGGTGATTCG CGCCGGCGGC
AATCTCGATA TCCACGGCAG CCTCAACGAC GGTTTTGCAC CGCCACCCGA GACGGTCGAC
GATGCGGGCT GGAAACTGTT GCCGGGGATT CAGCCGTTCG GCGGTGATCT GGTGGTGCCG
GGTGCAGGCG TCACACTGGC GGAGGGCACG CTGTTCCCGG CGGGCGTCAC GCTGAACTAC
GACGTGCCGC TGCAAGCGAC GACCCTGGCC GCTGGCACGT TGTTGCCAAC CGAAGCGACC
TTGGCCGCGC CGTACACGCT GGGCGCCGGT ACGGTACTGG CAGGCGCGGT TCACGATGCC
TCGGGCCAGT TGATTTTTGC CGCTGGAACG CTGCTGACCG ACAACGTCAC ATTGCCGGCC
GGCAGCCGCC TTGGCGCGGG CATTCGCTTG AATGACGTCA CGTCTGTCAA AGCGATTCGC
TGGCCTAAAG GCGTGCCGTT GCCGGGGAGC GTCGAAGCCG GTACCAACGC GATCAAAGGC
GTGCGCTTGA GTGGATCGCT GGCATTGCTG CGCGGTTCTT TGATTCCGTC GATGACCGAC
GTGGTTCTGG CCGATGGGAC GGCCTTTATC GAGTTGCGGC CATTGAACGG CAACCAGCAG
GGGCGCAACT GGGCCGTTGC CAGCCTGTTA CCGGCCGGCA GCGCGTCCTG GTCGATGCGG
GTGGTGGCGG GGGCCGATTT GGGTGCCGCT GATACGCGTG CGATCAAACC GGTTTCCAGC
GACGGCAATC TGCGTCTGGC CGACACTCAT TACGGGCTCA AAGTCACTGA AAAGGCACCA
ACCCTGGTTT GGGGTGAAGG CAATCTGGGT GGCTTCACCC CGGGCGAACC CGTGCCTGAC
GAGCTCAAGG AATGGTGCGA CTGGGCACCG AACTCCTGCG TTTCTGCACC GCGATGGACG
TGGGCACCGG ACAACTGGAT GGGCATGCCC CCGGGGTCAC CGATAAGCGA TGAAGACGTG
CTTGCCTGGT GTGGGTCTTT CCCTGAGCTG TGCGTGGAAA ACAAACCGGG CATCACCGTC
AGGACCCGCA CACAGATGTT CAGCGTACTG CGTACCGGTA CCGGGGATCT GGACGTGCTC
GCCGCCGGTG ATCTGAGCAT GGACTCACCG TTCGGCGTCT ATACCGCCGG CACCCAATCC
GAGGAGATCG ATCCGCGCTT CAACCAGCCG CGCGGACGTC TGTCGGACAA CGGTTCGGTG
CTGGGCAGTG CCGGCAGCGA TTACGAGCGG TGGGTCACAG GCAACGACAG TCTGTATCAG
GCCTGGTACC CGCAAATGGG CGGCAACCTG ACCATCAACG CCGGTGGTTC GATTTCGGGG
GATGTGCTGG GTCGGCGTGG CCCTTCGGCA ACCCTGGAAA CACCTGAGCA AGTGCCCAGT
GTGGCGGTGG GCAACTGGCT ATGGCGTCAG GGGACCGGCA ACAGCGATGT GCCGACGGCG
TGGTGGATCA ACTTCGGCAG TTATGCGACT CAGCCGTTGC CGGATCAAGG CGTCGATACC
GGGCCGTTCC TGGTTGGCTT CACCGGTTTC GGCACGTTGG GTGGCGGCAA CATCAGCCTG
CGCGCAGGTG CCGATGGGGG CATGGTCAAG GCGTTGGGCG AAAACGCCAA CACCAATCTT
TACCCCCGTG GCCAAGGCTT GATCGTGGCG GTCGGGAGTA CCGGGCGCGT CGGCGACGAT
GGACGGCTGC AATTGACCGG TGGCGGTGAT ATGGACATCC GCATCGGCGG TTCGCTCAAT
CCGTCGTTGC AGGCACGGGC CGGCAACAGT GGCGTGGGAG TCCCCCGACA CGACTTGCAA
GGCGCTCTGA TCAACCTGCG CGGCGCGGCC CGACTGACTG GCGGGGCGCT GGGTGGAATC
AATCTGCAGT ACGGCGCAGC CTCCTATGTG CAGGACCCAC GAGAAGTCCG ACCGTTTGAC
CCGTTCACAT CCACCGCTGG CAGTGCCTCC GGCGGCCTGG TCTTGATCCC TGGCGACTCG
GGCATGAGCC TCAATACCCG CGGCGACCTG GTGCTGGGTG GTGCGGCGGA CCCGGGCCGG
GTTCGGTTGC AGAACTCGAC GCCGTTCACC ACCGCTGACG GTGTCGTGCA TGACGGGGGC
GGATTGAGTG GTTTCTCGCT ATGGACCGAT CACACGGCTA TCGATCTGTT TTCGGCTGGA
GGCAACCTGA CGCCAAGTAC ACAGCTGGGC GAGGTTGACG GGCAGGGCCC CATGATCGGG
CGAAACACAT CTCCCACCGA TGGGCGCTTC GTATATCCGT CGATCCTGCG TGCGGTGGCG
GCCCAGGGCT CGATTTATGC CGGGCCGTCG GCGACCTACA TGCAAGGCAG CGGATTTTTG
CCGGCCACAG CGTACTCCTT GTTGCTGGCG CCCTCGAAGG CCGGGCAACT GGAGCTCCTG
GCGCAGGATT CGATCTATGC CGGTGGCTAC GCAATCAATC AGTCGGGCGC CAGTCCGTCG
GCCATCGCTA CGCCATTCAC ACCGGCCTTC AATGGCTACG GTAACAACAC ATCGAAAACG
CCGATCATGA CCAACCGTGG CAGTGATGGG GTTTCACCCG ATAAAAACTT GCGATATCCG
CTGTTTGCTT TTGGTGCCAA CAGTTACTCG GGTTTGAGTG AGGTGCAGGC GCCCGCACGG
TTCTATGCTT TGACCGGTGA TCTGGTGGGG GTGCGCAGTG GGGAAACACT GACGTTCAGC
CTCTCCAAAC GTACCTGGTA CGAGGCGGCC AGCACTGTCT GGATGCTGGC AGGGCGAGAC
ATCGTGGCTT CCGGTACCTT TCTCGGGCAA CCGACCACGG CGCCCAACGG CGAGATGGGG
TTACCGAATA CAGAGGGTGT CGCCTCGACC GGCAATCTGT TCGTGCACAA CAATTCACGG
GATATTTCCC GGGTCTCGGC AGGGCGCGAC ATCCTCTACA GCAGCTTCGA CATCGCCGGC
CCGGGTGTGC TGGACATCAA GGCAGGGCGC AACATCCTGA TGGAGGACCG TGCCAGCATT
ACCAGCATCG GACCGATTCT GGCGGGGGAC AACCGCCCCG GCGCCAGCCT GGTGATGCAG
GCCGGTACGG GCGCACAGGG AGCGGACTAT TCGCGGTTCA TCGCCCGTTA CCTGAACCCG
CAGAACCTTG CCGACCCGAG CGTTTCCCTC AACGGGCAAC CGGGCAAAGT GGTCAAGACT
TACCTCGACG AATTGCAGAG CTGGCTGACC CTCGGCTACG GTTTCAGCGG CAACGCAGAA
CAGGCGCAAG CCTTCTTCGC TGCGCTGCCA GGCGCCGAGC AGGCGATCTT CGCCCGTCAG
GTGTACTTCG CCGAATTGCG TGCCGGTGGT CTTGAGTACA ACGACGTCGA CGGCCCGCGC
AAAGGCAGTT ACCTGCGCGG TCGCAACGCC ATTGCTTCGC TGTTCCCGAC CGTCGATGTG
GTCGGCAACC TGATCCGTTA TGACGGCGAC ATCACCCTCT ACGGCGGTGC CGGGGTCAAG
ACGCTGTTCG GTGGCGATAT CCAGATGCTC ACGCCGGGTG GCGGTCAGGT GTTCGGCATC
GAAGGCGCGG CGCCGCCATC GACGGCGGGG ATCATCACCC AGGGTTCGGG CGACATTCAG
CTCTACTCCG AGGGCAGCAT TCTGCTCGGG CAGAGCCGGA TCATGACCAC GTTCGGCGGC
TCGATCCTGG GCTGGTCCGC CGAGGGCGAC ATCAACGCCG GTCGCGGTTC GAAAACCACC
GTGGTCTACA CCCCGCCGAA ACGCGTGTAC GACACCTGGG GCAACGTGAC CCTGTCGCCA
TCGGTGCCGA GCACCGGCGC CGGTATCGCC ACGCTCAACC CGATTGCCGA GGTGGCACCG
GGGGACATCG ACCTGATCGC GCCGCTGGGC ACCATCGATG CGGGCGAGGC GGGGATTCGC
GTCTCGGGCA ACGTCAACAT CGCCGCGCTG ACGGTGGTCA ACGCCGCCAA CATCTCGGTG
CAGGGCAAGG CGACCGGCGT GCCGGTGGTC TCGGCGGTCA ACACCGGCGC GATCACTTCG
GCCAGCTCTG CTGCGTCGTC GGCCACCCAG GCGGCGGAAG ACGTCGCCCG TCAACAACAA
GCCGCCTCGC GCCAGAACCA GGCCTCGGTG TTCACCGTGC AGGTGCTCAG CTTCGGCAAC
GAACAACTGG CCCCGACCCG CGACGGCGCC AGCCGCGCAC CGACGCCGGG TTACAACCCG
AACAGCCCGG TGCAGGTGCT GGGCGCTGGG GCGCTGGATG AACAGGCGAA ACAGCAGCTG
ACCGAAGAGG AGCGGGGACA GCTGACGTTG TAA
 
Protein sequence
MLARPSRRST RVQPQQRMAA DPALWLLKPL AQAIALCLVA GSAEAATAFS SGWFAAKGAA 
QQAAAARPSV GGLPGMTPPL AQQQKANQQL QRSIQTLNNT VAAIAAQQAA QAAGRAAALG
TVQFVPDGLG EGGLKVDNSL TQGWQNAKGP QQTQVDGKTT VKIEQTADKA ILNWETFNVG
RNTTVDFAQQ SNWAVLNRVN DPNARASQIQ GQIKGDGTVM LINRNGIVFS GTSQVNVRNL
VAAAANITDI QFRDRGLYFD STGSQPTFTE AAGKVLVERG SSIETARPAK STDAGGYALL
LGSEVQNDGS INTAKGQTVL GAGDRFYIRK GSGTEGNAFS TTFGNEVTPG FKAGSTAGKV
SNNGLIQAAT GDITLTGHEV VQNGVLLAST SVATRGTIHL LNPATDTQGS VTLGQGSATA
ILLDSSDLTA LDSQHQAALT GVTPNNRIRG DQSRIDIQSG GSVEFQNGSI TLATGGQVVV
AAQRRSLVRD GAMIDVSGAT GVKVAMESNS IKINVQGNEQ RDASVNRDKG ALNSNDVWVD
VRELVYVPAG TNGYVTDRWY TAGGLLEVGG YLGTQGHSIG EWMAQGGTVS FAGNDVVTEK
GSLINLSGGT LDVQSGEIRQ SWLRGADGRL YEVSKAPGDL LYTGLYKGFE DSSERWGQTS
YYYNPLIAPS SRRESGYTVG RDAGQLVIGT RNAVLEGQML GEVYQGERQI QAPRAVLDGY
AQSQTALARR GQLIVGSYDP AYVAASGGLL YGLNPLLEQV QIGGERPVAG NDRDLTGAVS
DARNGKLFLD AGQLSDWQLG ALKVAAKERI NVQGAVTVDN GGDITLYGPD VQVNANLTAR
GGSLHLGNVL NQLNTDRLTD TRLVAPTGKA TRVSVAEGAQ LDTRGLWSNL RTNPDDTASL
AYLDGGVVSI RSSSDIEVGA GSLLDVSSGG AILANGKTRG GKGGDVTLEA GALTASGDSH
LTLDGQLRGY GVSGGGRLFL QTGDVLIGQT NKRQADSVLQ LTPTLFEKGF STYNLIGLNR
MEVTDNTVID VSMPVYRFSE AAANQVSGAA PQQALELWTP PLYQENPTKA QLTRRAGASL
TLQAGTREVR LLDPAHTTLS LGQGSRISVD PGQSINLRGA GQITLNGELN AWGGTLDIRQ
QQFGSNDVAD NSQLADNAAH NRSIWIGEHA VLDVAGRAAT AVDALGRTYG LVGKGGTIIV
GGEIDAKKTT ATSTDAYVIV REGARLDASG AQATLDIAGQ GPTPIATDGG RISLSSYNGL
FIDGSLRAVA GGAGAAGGRL DIALETPLYD LNLAANKVRA AREIILGQDA GKSLLPQDLQ
PGAAASALQY GRARLGANQL MAGGFDNLSL LSNGLLSFDG DVDLHSRQSL SLYAGALALA
DGARETARIN LSAPYLRLSG TGKYFDAPGY LRPRVMNTPT THVAPATFNA TADLLDLGNS
LSFGTAGKIA SLNGAAIEVS RRGFDQVTLH SRGDLRFLAP TGYTTKTELW TPGDLNLQAA
QIYPASDVTA EVRVGYQSVH PGADPARTLR ITRAGSAPAV VPYSVFGDLT LAAGHIEQGG
VIRAPLGMIR LGDETVVNTH DLHLLPGSVT SVSADGLVMP YGGTTDGIDY RYAGKSVVLK
GITAETAGVT LTSRYVDVQQ GALIDLSGGG DLRGAGFVSG RGGSTDARFN PLVRNATDGT
FSLPGLASNP VYAIVPGNQS TYAPMLAEAG AVDPRIGQQI TLGSGVPGLA AGTYTLLPST
FALLPGAFRV EVNGQAAAGV TSGALPLRNG SWTSSGLMSI ANTGLRDNLA SQVILTSADV
VRRYSQYNET GFTQFIRSDA ARRGVPRALA PMDGKSLNLY LMSGAEQGIA LDFAGQVLFK
PATGGVVGTA VVRGAREVEL LGDGHRRTEG FSGVSLYADS LNRLGAGRLT IGAQPAIDYN
TAGNIVSFIG DTSNITLREG AMLSAPEVLL RTTSTRGSIT LEAGSGINTL GRGDAPYDSS
AGFIYAPGTA GLLVVSNGWT NVLAPSLSTP TSGAGDIRIG NCPTNACTNP TLLYSNGSIT
AATDKQFELD ETVRFGTRHL TLAVGTINAG SAEALSAAGS RVPTGLTLNQ NVLDRLLRGD
TQFGAPALET LSLTSRDAFN FYGSVSLDTL DRQTGVSKLQ NLILSTPAIY GLGQAADVAS
IRTANLVWNG ATQAPAAVIS GGAGTGQGTL DIQAQRIEFG YGPNPQASGL EQNDRLVLGF
ANVNLTASDH ITANHKGSLA VYQAQGAFDP VGGYAYSGGN LNLRTPLLTG EAASVNLLKA
GNNLTLTGAG TPAAANVLGA ELTLDARNVT LDSRIALASG KLVIKSVEDL TLNSGAYLDM
AGRTLAFNDV NKYSWGGDVA LYSSHGNIRQ LAGSSIDLSA QNNQAGNLSA IALASDAGMV
DLQGQILGAS SGSYDAGGTW VPYKAGGVDI RAQHLNGDPS QQFAALNQRL NAGQVFGSRS
LQLKQGDLLI GDGLKAGEVN VSVDNGSLTV AGLVDASGSR VGSIRLSAKN GLTVTGSAVL
DAHGRVLRVD SYGKIIDAPN RAMVELNSGA GTLTLGSGAR IDLRHGTDAA PGPLPGQSDG
LPRGTLELNA PRLAGGDIAI DASGALNIQG ARSIGLNATR RYSDASDGAD PAASGRPYQV
IDQAYLDRIH GDNTAFIDAA LLNSNLLQNK LAGLNNASYA DAFHLRPGVE IVSKTADGDL
VVQGDVDLSG YRYASLNPHT QKTAAYGSGE SGSLVIRAGG NLDIHGSLND GFAPPPETVD
DAGWKLLPGI QPFGGDLVVP GAGVTLAEGT LFPAGVTLNY DVPLQATTLA AGTLLPTEAT
LAAPYTLGAG TVLAGAVHDA SGQLIFAAGT LLTDNVTLPA GSRLGAGIRL NDVTSVKAIR
WPKGVPLPGS VEAGTNAIKG VRLSGSLALL RGSLIPSMTD VVLADGTAFI ELRPLNGNQQ
GRNWAVASLL PAGSASWSMR VVAGADLGAA DTRAIKPVSS DGNLRLADTH YGLKVTEKAP
TLVWGEGNLG GFTPGEPVPD ELKEWCDWAP NSCVSAPRWT WAPDNWMGMP PGSPISDEDV
LAWCGSFPEL CVENKPGITV RTRTQMFSVL RTGTGDLDVL AAGDLSMDSP FGVYTAGTQS
EEIDPRFNQP RGRLSDNGSV LGSAGSDYER WVTGNDSLYQ AWYPQMGGNL TINAGGSISG
DVLGRRGPSA TLETPEQVPS VAVGNWLWRQ GTGNSDVPTA WWINFGSYAT QPLPDQGVDT
GPFLVGFTGF GTLGGGNISL RAGADGGMVK ALGENANTNL YPRGQGLIVA VGSTGRVGDD
GRLQLTGGGD MDIRIGGSLN PSLQARAGNS GVGVPRHDLQ GALINLRGAA RLTGGALGGI
NLQYGAASYV QDPREVRPFD PFTSTAGSAS GGLVLIPGDS GMSLNTRGDL VLGGAADPGR
VRLQNSTPFT TADGVVHDGG GLSGFSLWTD HTAIDLFSAG GNLTPSTQLG EVDGQGPMIG
RNTSPTDGRF VYPSILRAVA AQGSIYAGPS ATYMQGSGFL PATAYSLLLA PSKAGQLELL
AQDSIYAGGY AINQSGASPS AIATPFTPAF NGYGNNTSKT PIMTNRGSDG VSPDKNLRYP
LFAFGANSYS GLSEVQAPAR FYALTGDLVG VRSGETLTFS LSKRTWYEAA STVWMLAGRD
IVASGTFLGQ PTTAPNGEMG LPNTEGVAST GNLFVHNNSR DISRVSAGRD ILYSSFDIAG
PGVLDIKAGR NILMEDRASI TSIGPILAGD NRPGASLVMQ AGTGAQGADY SRFIARYLNP
QNLADPSVSL NGQPGKVVKT YLDELQSWLT LGYGFSGNAE QAQAFFAALP GAEQAIFARQ
VYFAELRAGG LEYNDVDGPR KGSYLRGRNA IASLFPTVDV VGNLIRYDGD ITLYGGAGVK
TLFGGDIQML TPGGGQVFGI EGAAPPSTAG IITQGSGDIQ LYSEGSILLG QSRIMTTFGG
SILGWSAEGD INAGRGSKTT VVYTPPKRVY DTWGNVTLSP SVPSTGAGIA TLNPIAEVAP
GDIDLIAPLG TIDAGEAGIR VSGNVNIAAL TVVNAANISV QGKATGVPVV SAVNTGAITS
ASSAASSATQ AAEDVARQQQ AASRQNQASV FTVQVLSFGN EQLAPTRDGA SRAPTPGYNP
NSPVQVLGAG ALDEQAKQQL TEEERGQLTL