Gene Vapar_4525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_4525 
Symbol 
ID7972814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4772994 
End bp4784648 
Gene Length11655 bp 
Protein Length3884 aa 
Translation table11 
GC content65% 
IMG OID644795111 
Productfilamentous hemagglutinin family outer membrane protein 
Protein accessionYP_002946398 
Protein GI239817488 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTT ATCTGCATCG CATCATCTTC AACGCCGCCA GAGGCATGCG GATGGTGGTG 
CAAGAAACAG CCAGCAGCAC GGGGAAGGGC AAATCCAAAC CCACGGGCGG GCCCGGCGGC
GCGGGCGCCG CGGTGAAGGC TGTCGCGTTG CTCGGTGCAC TGGTCGCGCT TCCTGGAGAA
GCCCAGATCG TCGGCGCGCC CAACGTTCCC GCCAACCTGC GGCCCACGGT GCTGGTCGCG
CCCAACGGCG TGCCGCTCAT CAACATCCAG ACGCCCAGCG CCGCCGGTGT CTCGCGCAAT
GTGTACCAGC AGTTCAACGT GGCGCCCAAC GGCGCCATCC TGAACAACAG CCGCACCAAT
GTGCAGAGCC AGCTCGGCGG GTTCGTGCAA GGCAATCCGT ATCTCGCGAC CGGGCCTGCA
CGCATCATCC TCAATGAGGT GAACGGCGGC AGCCCGAGCC AGCTGCGCGG CTACATCGAA
GTGGCGGGGC AGCGGGCCGA GGTGGTCATC GCCAACCCGG CCGGCATCAG CGTCGACGGC
GGCGGCTTCA TCAATGCGAG CCGGGCCACG CTGACCACCG GCACGCCGCA GTTCAATGCC
GTGGGCGGCC TGGACAGCTT CCTCGTGCGC GGCGGCACCA TCACCATCGA CGGCGCGGGC
CTGGACGCCA GCAAGACCGA CTACGCCGCC ATCCTCGCGC GCGCCGTGCA GGCCAACGCC
GGCATCTGGG CCAGCGAACT GAAGGTGGTG ACCGGCGCCA ACACGGTCAG CGCCGACCAC
AGCCAGGTCA CGCCCACCGC GGGCACCGGC ACGGCGCCCA CCTTTGCCCT GGACGTCGCT
GCGCTCGGCG GCATGTACGC CGGCAAGATC ACCCTCATCG GCACCGAGGC CGGCCTGGGC
GTGCGCAACG CCGGCACCAT CCAGGCCGCG CCGGGCGCCG CCGCGCTGAT GGGCGCGGGC
CAACTGGTCG TCACCAGCGC CGGGCGCCTG GAGAACATCG GCACGCTCCA GGCCACCGCC
GATGCGAACC TCGCGGCCTC GGCCCTCGCC AACAGCGGCC GCGTCAGCAG CGGCGGCAAC
CTCAAGATCA CCACGCAGGG TGACCTGGCC AATGCACTGA ACGGCACCGG CGGCACGCTC
GAAGGCGCGC GGCTGGAGCT CGCCAGCACC GGAGGCGACA TCGACAACCG CGGCGGCACC
CTGCGCCAGA CCAGCAGCGC CGGTCTCGGG CTGAGCGCGC CGGCCCTGAG CAACACCAGC
GGCGGCGTCA TCGGGCTGGA GCCCGTGGCG GCTGCGCCGT CCACCTCTGG CACCGGAACG
GGCGGCGGCA CGGGCACAGG AACAGGCACG GGCGGCACCA CCGACCCCGC CGCGCCAACC
ACGGGCACCG GCACGGAGAC CGGCTCCGGC AGCACCGTCA CGCCCGCCCC GTACGTTCCA
CCGTCGCCAG GTGCGATCAC CGCCAGCGGC ACGATCCGCA ACGACAGCGG CAAGATCTAC
GCCGGCGGGC CCATCAGCCT GCAAAGCGCC AACATCAACA ACAACGGCGG CACGCTGAAT
GTCGCCAACA TGGCCGTGAG CCAGCCGACG TTCGACAACC ACGGCGGCAC GCTGAACGTC
AGCAACGGCT TCAGCGCCAA TGTCGATCGG TTCGACAACA CCGGCGGCAC GCTCAACGCG
GGCAGCCTGA ACATCACCAC GACCGGCGAC CTGGTCAATG TCGACGGCAA GCTGACCAGC
GCGGCCGACG CCACGCTGAC CGTGGGCGGA CAGGCCGACA ACACGCGCGG CGTGATTTCG
GCAACGGGTG CGTTGACGGC CAACGTGGCG GGCGCGGTGA ACAACACCGG CGGCACACTG
GTGGCGAACC AAGGCGTCGC GCTCGGCGCG GGCAGCCTGG ACAACACGCA GGGCAGCATC
CAGTCGGCGC AGGCTGCAGT GCAGCTGGGG GTGACTAACC AACTGACCAA TGGCAGCAGC
GGCACGATCG GGGCGGCCAC CGACCTGAAG GTGCAGGCGG GTTCACTCGT CAACTCCAAT
GGCGCCAGCC TGCGCGGCGC GAACGATGTG AGTGTCGCGG TCGGCGGGGC GATGACCAAC
GACGGCAGCA TCACGGCCGG GCGGCACACG GCGGTCGCCG CAGGCAGCCT GCAAAGCGGC
AGCACCGGCG TGCTGGGTGC CGGCATCCAG AGCGACGGCA AGCTTGGCGC AGCGGGTGAC
CTGGTCGTGA CCACGAGCGG CGCCTTGGTC GCCAATGGCA CCAACCTGGC GGCTGGCAAT
GCCACGTTGC AGGGTGCAAG CGTCGATCTC TCGGCCAGCC AGACCAGTGC GGCGAATATT
GCGGTTACGG CGACGCAGGG CAACGTGACC ACCGACAAGG CCACGATCAC AACGCCGGGC
ACGTTGAGCG TGACTGCCAA CGCACAGCCT AGCCAGACGC TGGTCAACGA GGGCGGCAAG
CTCAGCGCCA ATCAGCTGGA CCTGAATGTC TCCAACCTCG CCAACACGAA CGGCGGCGAG
ATCGTGCAGA CCGGCACGAG GGCGACCACC ATCGCGACCT CGGGCGCCAT CGACAACAGC
GGCGCCACGC TGGCCAGCAA CGGCAGCCTC GCCCTCACCG CAGCCAGCCT GAACAACCGG
GGCGGCACAC TGCAGGCCGC CCAGGTCTCC GACCTGAGCG TCAATGTGAC GGGTCTGCTG
GACAACGGCC AGGGTGAGAT CAGTGCGGGC GGCAACACGA CCCTGCAAGC CGGCAGCCTT
GCCAACGATG CCGGGCGCGT CACGGCTGCC ACGGGGGATG TCGCAAGCAC CACGAGCGGC
GCCACCAGCA ACCGTGGCGG CACCATCGCC GCCAATGGCA GTACCGCATT GAATGCCGGC
AGCCTGGACA ACAGCGGCGG CACCGTGTCC GCGCTGAACC GCCTGGCGGT CAATGTCCAG
GGCGCAGTCG ACAACACGGC CGGCACGCTG GCGGCCACCC AGGAGCTGGC CCTCGACGCC
GGCTCGCTCG CCAACGACAA GGGCTCGATC CAGTCGGCCC AGGCCGCGAC CCAGGTGAAC
GTCACCGGCG CTTTGACCAA TGGCCAGGGC TACATCGGCG CCGCCACCGA CCTCAGCGTG
CAGGCGGGCA GCCTGAGCAA CGCCGCGGGC GGCAGCCTGC GCGGGGCCAA CGACACCACC
GTGGCAGTGG CTGGCCTGCT CGCCAACGAC GGCAGCATCA CGGCCGGGCG AAATGCCACC
ATCACCACCG GCAGCCTGCA AGGCGGCAGC ACCGGCGTGC TCGGTGCAGG CGTCCAGAGC
GATGGCAAGC TCGGTACGGT GGGCGAGTTG AGTGTGACGG TCTCCGGGGC GCTCGCCACA
CACGGCACCA ACCTGGCAGC CGGCAATGCC GCGCTGCAAG GCGCCAGTGT CGATGTCTCG
GCCGGTCAGA CCAGCGCCGC GAACATCGCG ATCACGGCCA CGCAGGGCGA CGTGATCACC
AGCAAAGCTA CGGTCGTCAC GCCGGGAACC CTGAGCGTGA CCGCCAACAG CAAGGCGCCG
CAGACCCTTG TCAACGACGC GGGCCGGCTC AACGCGAAAC AGCTCGACCT GAAGCTTTCG
AACCTCGCCA ACACGAACGG CGGCGAGATC GTGCAGACGG GCACGGGCGC GACCACCATT
GCGACCACCG GCACCCTGAA CAACAACGGC GGACGCATCG CGAGCAATGG GCAGGACCTG
AGCCTGGGTG GCGCCAACAT CACCAACGCC GGCGGCAAGA TCGAGCATGC CGGTGTCGGC
ACGCTGAGCA TTGCCGGCGG CAGCTACAAC GGCACCAACG GCCAGGTCAC GGCCAATGGT
GCGCTGACCG TCGCGATGTC GGGCGCGTTC AACCAGGACG GCGCGACGGC CGCAGTCAGC
GCCAAGCACA TCACCATCGA CGCCGGATCG CTCGGCAACC GCGCCGGCGC CCAGATCGTG
CAAACCGGCG CCGATGCCAC GCGCATCACG GTGGTGGGCG CGCTGGACAA CGGCGGCGGC
ACGCTGGCCA GCAATGGCAA CACGACCGTC GCGGCCGGCA GCCTGTCCAA CCAGGGCGGC
ATCCTCCGCG CCGCCGAGGC CTCCGATCTC GGCCTGACCG TGGGCGGCCT GCTCGACAAC
AGCAGCAAGG GCGTGATCGG CGCGGGCGGC ACCACCACCA TCGCTGCCGG CAGCTTGAGC
AACAACGCGG GCAGCGTGAC CGCTGCCGAA GACCTGAGCG TTACCGTGGG CGGCGCGGCC
TCGAACGTGG GCGGCACGTT GGCGGCCAAC GGCAACACCA CGCTCGTGGC CGGAACGCTG
GACAACAGCA GCGGAACCGC CGCGGCGGTC AATGGCAACC TGAGCGTCAC GACGTCAGGC
GCGACGACGA ACAACGGCGG CACGTTGCAG GCCGGCGCCG CGACCACGCT GCTCAATAGC
GGGCTGAACA ATGTCGCGGG CAAGATCTTT GGCAACAGCC TGGCGGTGAA CACGCGCGCG
AACGGGCAGA ACAATGCGCT GGACAACACG CAAGGCACGC TGGCGGCCAC CACCACGGTG
GCCGTGAACT CGGGCGCGCT CATCAACGCT GCCGGCCTGA TCCAGTCGGG CGGCGCGATG
ACCATCGACA CGAATGGTCA GTTGCTCACG AACACGAATG CGGCCGGGTA CATCAATGGC
CAAGGCGGCA TCAGCAGCGG CGGCACGCTG AACCTGACCA CCGGGTCGGT CAATAACAAC
GCGGGCTTCA TCGGCGCGAA GAACGCCGTG ACGGCCAGCA CCCAGGGCTT CTCCAACACG
GGCGCAGGCT TGGTGCTCGG GCAGTCCACC GTCGCCATCA ACACCAACGC GGCCGCCTAC
GACAACACCG GCGGCCGCAC CCTGGCCGCG GGCGATCTGG CCGTGAACGC CGGGACGCTG
ACCAATACCA GCGGCCTGAT CCGCTCGGCG GCCACAACGA CGCTGAACGC CGGCCGCATC
GTCAACACCA GCACTCTTGG CACCGAGCAG GGCATCGAAG GCCTGAATGT GGCCGTTGGT
GCCGGCAATC TCGACAACAG CTCGGGTGCC ATCCGCGCCG ACGTCAACGC CACCATCACC
AGCGGCGGCA CCGTCAACAA CACGAATGGC TTGATCTCGG CCGGCAACAC GCTCGCCATC
GCCGATCCCA ATGCCGCCAA CCCCGGCGCC AAGACGCTCA ACCTGGTCAA TACCGGCGGC
ACGCTGGTGG CCGACAAGAG CCTGAAGATC GATGTTGCGA ACTTCAGCGG CGATGGCCGG
GCCGTCTCGG GCCAGAACCT GAGCATTGCG CTCCAGCAGG ACATCGTGAA CAACGGCGAG
GTGGCTGCCA ATGGCAACCT GAGCTACACG ACGACGGGCA ACTTCACGAA CAACGGCAAG
CTGCTGGCGG GCCAGACGCT CACCGTGGGC GGCAACAACG TGGACAACAC GGCGAATGCG
GAGATGTCGG GGACGAACAC CATCGTCAAT GCCGGCGGTA CGCTGACCAA CCGCGGCCTG
ATTGACAGCC GCGGCGAAAC CGAGATCAAC GCAGGCGCGC TGAACAACAT CGGCACCGGC
CGGATCTATG GCGATGCGAT TTCCATCTCG GCCGGCACGC TGCTCAACGA CAGCGAAACC
GCCAATGGCG TGACCAAGGC GGGAACGATC GCATCGCGCG GCGACCTCGA CATCGGTGCG
GGCACCATCA CCAACCGTGA GCACGCCCTG ATCTACAGCC AGGGCAACAT GTATATCGGT
GGCGCCCTCG ATGCCAACCG TCAGGCGATC GGCCAAGGTG GGACGCTCGA CAACCTGAGC
GCCGACATCG AGTCCATTGG CGACATGTCC ATCTCGATGG CGCAGGTCAA CAACCGCGAT
GTCCACATCC AGAAGGGCGC GCCCACGGTA ACGCCCAGCA CCTTGACCGG CATCGCACCC
AACACGCTCG TGGGCAGCGG TGTCGGCAGG ACCACGACCT ATCCATTGGA CGAAGTCAAT
GTCGACCCCG TCAACGGTTT CGTCTATCTC AAGGCTACCG GCGAGCTCGT CGGCATCGGA
GGCTACGCCG TCTGGCACAA CGCCATCACG ACCACGGAAG ACACGGCGAC CAACATCGAC
CCCGCCCACT TGGTGGCGGG CGGCAGCATG ACCGTCAACG GGCGCCTGTA CAACGAGAAC
AGCCAGGTGC TGGCCGGCGG GACGATCACG GCCACTGACT ACCAGTCTTA CCAGCTGACG
GGTACGCGCA CGATCACGGG CTCTGCCACC GTGATCGACA ACAAGGGCCA GGTGCAGGCG
ATGAATGTGC CTCTGATCCT TCCGCCGCAG ACCATTTCGC TGGGCGCCTA CAAGTACCAG
GAAAACATCA ATGCGGCCGC CGGCTACAAC GCGGGCGTCG CCCCGGTCGG CAGCGGGACG
GGCGGGGCGA CCGGCAAGGG CGCGGTGGGC GGCGGCCAAG GGCCGGCCAC CATTGTCGAA
GTGCCGGCGA ATGTGGGCGA CACAGTCAAG GTCGACGGCC AGAGTGCTGG CAGCGCGACA
GGGTCGACCG GACCCGATGG CACGACGGCA ACGCAATTGG GTACGGGCGC CACGGACGTA
GGTGCCAACG GAGCTGCGAC CGGCACCGCT TCGTCGGCCC AAGCCGGCGC AAGCCGGACC
GTCCCGATGG TCGTGCGCAC CAGCATGCCC AACGTGCGTA TCCCGAATGC CAGCCTGTTC
AACCTTCGTG CCGGTCCGGG CAGCTACCTG ATCGAGACCG ACCCGCGCTT CGCCAACTAC
CGCAACTGGC TCAGCAGCGA TTACCTGCTG AACAGCCTCG GCCAGGACCC GAGCAACATC
CTCAAGCGCC TGGGCGATGG TTTCTACGAG CAGAAGCTGA TCCGCGAACA GGTGGCGCAA
CTCACAGGCT ATCGCTACCT GGATGGCTAC AACAGCGACG AAGACCAGTA CATGGCGCTG
ATGGACGCCG GCGTGATTTT CGCGAAGCAA TACGGATTGC GCCCGGGCGT GGCGCTGAGC
GCGGCGCAGA TGGCGCAGCT GACCAGCGAC ATCGTGTGGC TCGTCGAACA GACGGTGACC
TTGCCGGACG GAACGATGCA ACGCGTGCTG GTGCCTCAGG TGTATGTGCG TGTGCGGCCA
GGCGATATCG ACGGTTCAGG CGCATTGTTG AGCGCGGATG CCGCCATCAT CAAGAGTTCG
GGCGATGTCA CCAACACGGG CACCATCGCC GGACGGCGTC TGGTGTCGAT CACGGCCGAG
AACGTCAACA ATCTTGGTGG CCGCATCTCG GGTGGCAGCG TTGCCCTCGA CGCCAGGTCC
GATCTGAACA ATATCGGCGG CACCATCGAT GCGCGGGACG CGGCCATGCT GACTGCCGGC
CGGGACATCA ACATCCGAAC GACCACCCAG AGCACGGGCG GGCTGTTGAA CAACACGGCC
GTGGACCGTG TGGCCGGGGT CTATGTCAGC GACCCTGGCG GAGTCCTGCT GGCCTCCGCG
GGGCGCGATG TCAACCTCGT CGGCGCGGTC CTCGCCAATG TCGGCAAGGA CAGCCGAACC
TTGGTCAATG CGGAGCGGGA CGTCAACCTC GGCACCGTCT CTGAATCGAG CACCATCTTC
GCCAGCGGCG GCAACAAGGG CCGTAGCTTC AGCGCCGTGT CCTCGCAGAG CCGCGAAATC
GGCTCCACCA TCGTGGGCAG CGGCAGTGTC GCCATCACCG CGGGCAACGA TGTCCGAGCA
CGCGCCGCTG ACGTGTCGGC CGGAGGCACC CTGGCGGTCA CCGCTGAGCA TGACATCCGC
ATCGATGCCG GACAGTCAAG CCAAGCAATC CTGACAACCA GCAACTCCTC ACGCAAAACG
CTCACTTCAA AGAGCAGCAG CTCCGAAATC AAGGCGCAGA GCGACACCAC CGTGCTTGGC
AGCAACTTCT CCGGGCAGAA CGTGGTGATG TCCGCGGGCA ATGACCTGGG CGTCCGCGGC
AGTCGAGTGT CTGCAGAGAG CCAACTCGTG CTGAGCGCGG GTCGCGATGT GCGCATCGAG
AGCGCGCAGG AGCAACATGC GACCGGCAAC GTTTCGAGAA GCAGCCAAAG CGGCTTCAAC
GGAGTGAAAG ACAGCCTGCT GTACGGAAAG GGCTACAGCA GCAGTTCCAG CAATCGCAAC
GAAATGAGTG CTGGCACGAC ACAGGTCGGC AGCACCATCA GTGGCGGCAG CGTCAGCATC
GATGCTGGGC GCGATGCCCA GATCGTGGCG AGCAATGTAC TGGCCGACAC GAACATCGCC
ATTACTGCCG GCCGTAACAT CGACGTGCTG GCGGCTCAGG ATACCTCTGT GTCGGCAACT
GCCAGCAGCG GCAAGAGCCG CAGCTTCAGC CCGTCGCCAG GCCTTGCGCC ACGCCATACG
GCCTACAGCA ATGTGAAGGG TTCGGAGGAC GGTACGGGCG AGTCCAGCAC CGCCGTCACA
AGCCTGATCA GTGCCAACGG CGGCAATCTC ACCATGGTGG CTGGCCTGGA CTCCAAATAT
GCGGGCACTG GCCAAGGCAA CATCACGGCC GAAGGCGCGG ATCTGCTGGC CAAGAACAAA
GTCGCCTTGT CTGGCAATGC AGTGAACCTC AACGCCGCCA CGTCCAGCGG CAGCAGCAAG
CACCACGCAG AATCCAAGAG CCACACCATC GGTGCGCAGC TGAGCGGCAT CGTGGGCAGC
GCGATCACCA GCGCCTACGA TGCGGCGCAG GAGTCGCGCA AGACCGATGA CAGCCGGCTG
AAGGGCGCCT TGGAACTCAA GGCTGGGTAC GACGCCTACA AGCTGGCCAC CGATGGGGCG
TTAGGGAATG GCATCCAGGG ACTGACGGCT GCGGGCACCG GCGGCGACCC CAGCGGTGCG
GCCTTTGGTG TGAGCGTGAG CGAAAGCCGC ACACGTTCGC GTAGCGACAC GGCCGAGGTC
TACAGCAACC AGCGCGGCAC CAACATCCAG GCCGGCAGCA TCGACATCAC CGCGCGCGAG
ACCGACATCA ACATGCAGGG TGCCAAGCTG CAGGCTCGCG ATATCGCGCT GGACGCGAAG
CGCGACATCA ACATGCTCGC CGCGGAAAAC AAGGCGGCCA CCCTCAGTAC CAATTCGGGC
AGTTCGCTCG GCGGGGGCGT GACCTTCGGC TTCGGTTCGC AGAACGGCTT CAGCATCCAG
GTCAATGCCG GCAGCAACCA GGGCAAGGCC ACGGGCATCG AGACGCGCCA CGACAACACC
CTCATCACCG CCACCGACTC CGTGAAGATC AAGAGCGGCG GCGACGTGAA CATGAAGGGG
GCGCAGATCA CGGCCGATTC GGTCAAGGCC GACATCGCAG GCAACCTGAA CATCGAGAGC
CTGCAGGACA GCACCACCTA CAACAGCAAC CAGAGCAGCA GCGGAGTCGC GCTCAGCCTG
TGTATTCCGC CGATCTGCTA CGGTCAGTTC GTCTCCGCCA CGGTCGATGC CTCCAAACAG
AGAGTCGACC ATAACTACCG CAGCGCCACC GGGCAAAGTG GCATCGCTGC GGGCAATGGC
GGCTACGACG TGTCGGTCAA GGGCAACACC GACCTGAAGG GCGGCGGGAT CACCAGCACA
GCGCCCGAAA GCAAGAATAG CTTGGTCACT GGGAGCCTCA CGACCAGCGA TCTGCAGAAC
CGGCAGAACA CCAATTCAAG CAGCAGCGCC GTCAGCCTGA GCTTCAGCTA TGGCACGGGT
GCAGCGAACA ACGTGCTGAG CAACGTAGCT CGGAGCGCGA CCAACACCGT GCTGGCGAAT
CTGAACGGAG GCAAGGGACT GCCTGCGGAT AACAGCCAGT CCAGCCAGAC CTTGAGCGTG
ATCAGCCCGG GCAACATCAA GATCGTTGGC ACAGGCGTCA AGGAGATCGA CGACAAGAGC
AACGCCAACG TGGCCACTCT GACAACTAGG GATCCGGTTA CGGCCAACGG CGCGCTCGTC
AATACGCTGA CCTTGCAACA GGCCAAGGAG ATTCCGAGAC TGCAGCAGCA AGCGCAAGAT
CATCAGCGCG CAGCTCAGCT CGTGGGCAGC GTTTTGAGCG GCGTGATTGG AGACGTGAGC
CAGTCGTTAA ATCAACAAGC GCAGGCCCAA GAGAATGCAA GAGCGCTCGC TGCTGGAGAA
GCTCCACGGA CTGTGACCAA CTTTGCAGAT GGTTCGTTGG AAAAAACTGT ACTGCACGGG
ATTGCCGGGT TCATCCAGGC CAAAGTCGGC GACGGCAGCG GTTTGGCGGG GGCCGCTGCT
GGTGTTGTCA ACGAGCAGTT GCTGCCTGCG ATGGAGAAAT ACTTAAAGGA GAACGGCTAT
GACTACAACG ACCCTTCGTT AACGGTTGAG CAAGCTGCAC AGAAGAAGAG TGACTACAGC
GCGCTATTGA CGGCCGCTTC GACTTTGGTG GGTGCGGCTG TTGGCACGGT GGCTGGCGGA
AGCTCGAGTG CCGGCGTCGG CGCGACAGTT GCCAACAATG CAACGGTCAA CAATTTTCTA
AAGCACGACC AAGTGGCCGC GATGAAGAAG GAGTTCGCGG CGTGCAGGGC CAAGGGGGAG
TGCAATGATG AGGAAGTCCG AGCGATCGCG GGCAAATATG CTGCCCTCTC GCAGAAGAAC
ATCGATCTGA TCAAGTCCTA CATCAAGGCC GGCGACGTCG CCAGTGTCAG TGCCTTGGAA
AGCCAAGCAG CCAGTGCTGC CGACGTGGAT TCGGCCATCC CGTTTGGCTA TGCCCAGATG
TCAACCGTAT TCCAAGGCTG GCAGAACAAC GTGAACGTCT TGGGCACAGT TGGCGGGGTG
GGGGCACTGG GTGGCACTGA CGTCCAACAG GCACTGGAAG TAGCGAAGTT CCGACAGACC
TACTGCGGAG GGCTCAGTGC TGGAGCTTGC GACGCAAGGG TGGACGACGC TATTGCCGAC
AGAGCGACGC GTGCCTTGAT ACTCGGAGCC ACCACCGTCG CGATTCCGAC GGCCATACAG
GCCCTAGGAG GATTACGCCC TGTCAGCCCC TCCAGGAATT CTGTTCGGCC AACGGAGATC
ACCAGCGACG GGGATCTTTC AAATGTCTAT TCGACGCAGA ATCAAACCGT CGTCCATATG
CCCATCAGGG GCACTTCCAT TGAGGACATT TCTGGGGCAA CAGTATTCAC GATGCCCCCA
CAAGGACAAA GAATCTCGGG GCAAAACGCC GGGGTGCGGC TTGGGGCTTG GGGTGAAGGT
CCGAGCGGCC TCGGTACGGA GATCATTGAG CAACTGTCGC CAGGGACCAG GCCCCTGCAA
ACCGGCGGAG GCTATGGTGT GGATAGCATT GGCGGGAAGA TTAATACGGA GAACAAGACG
ATTCCCGCAT TCGAGATCAA AACGACTGAT ACCGGGAATC GGCAGCCTGT CGACAAACCA
AAGCCGCTTC CTGAACGAGT GAATGATTGG GTCAATGAAG CCGCGAATAC AGGCATGATC
AGTGGACAGC GCGTTAGCGC AGCTGATAGG GCGTATGCAA GGGGGCTACA AAATTTGCTT
CGCGAAGGCT ATACGATTCA ACCTTACGTA GTTGAAGTCG CTGTCCCACC GCAAGGGCAG
AGTGCGCGTC CGACAGTGAC CGTAGTTCCG TGGCCTGTTC CCAAGGGGTT ACCACGTCCT
GGCACCGCAC CTTGA
 
Protein sequence
MNRYLHRIIF NAARGMRMVV QETASSTGKG KSKPTGGPGG AGAAVKAVAL LGALVALPGE 
AQIVGAPNVP ANLRPTVLVA PNGVPLINIQ TPSAAGVSRN VYQQFNVAPN GAILNNSRTN
VQSQLGGFVQ GNPYLATGPA RIILNEVNGG SPSQLRGYIE VAGQRAEVVI ANPAGISVDG
GGFINASRAT LTTGTPQFNA VGGLDSFLVR GGTITIDGAG LDASKTDYAA ILARAVQANA
GIWASELKVV TGANTVSADH SQVTPTAGTG TAPTFALDVA ALGGMYAGKI TLIGTEAGLG
VRNAGTIQAA PGAAALMGAG QLVVTSAGRL ENIGTLQATA DANLAASALA NSGRVSSGGN
LKITTQGDLA NALNGTGGTL EGARLELAST GGDIDNRGGT LRQTSSAGLG LSAPALSNTS
GGVIGLEPVA AAPSTSGTGT GGGTGTGTGT GGTTDPAAPT TGTGTETGSG STVTPAPYVP
PSPGAITASG TIRNDSGKIY AGGPISLQSA NINNNGGTLN VANMAVSQPT FDNHGGTLNV
SNGFSANVDR FDNTGGTLNA GSLNITTTGD LVNVDGKLTS AADATLTVGG QADNTRGVIS
ATGALTANVA GAVNNTGGTL VANQGVALGA GSLDNTQGSI QSAQAAVQLG VTNQLTNGSS
GTIGAATDLK VQAGSLVNSN GASLRGANDV SVAVGGAMTN DGSITAGRHT AVAAGSLQSG
STGVLGAGIQ SDGKLGAAGD LVVTTSGALV ANGTNLAAGN ATLQGASVDL SASQTSAANI
AVTATQGNVT TDKATITTPG TLSVTANAQP SQTLVNEGGK LSANQLDLNV SNLANTNGGE
IVQTGTRATT IATSGAIDNS GATLASNGSL ALTAASLNNR GGTLQAAQVS DLSVNVTGLL
DNGQGEISAG GNTTLQAGSL ANDAGRVTAA TGDVASTTSG ATSNRGGTIA ANGSTALNAG
SLDNSGGTVS ALNRLAVNVQ GAVDNTAGTL AATQELALDA GSLANDKGSI QSAQAATQVN
VTGALTNGQG YIGAATDLSV QAGSLSNAAG GSLRGANDTT VAVAGLLAND GSITAGRNAT
ITTGSLQGGS TGVLGAGVQS DGKLGTVGEL SVTVSGALAT HGTNLAAGNA ALQGASVDVS
AGQTSAANIA ITATQGDVIT SKATVVTPGT LSVTANSKAP QTLVNDAGRL NAKQLDLKLS
NLANTNGGEI VQTGTGATTI ATTGTLNNNG GRIASNGQDL SLGGANITNA GGKIEHAGVG
TLSIAGGSYN GTNGQVTANG ALTVAMSGAF NQDGATAAVS AKHITIDAGS LGNRAGAQIV
QTGADATRIT VVGALDNGGG TLASNGNTTV AAGSLSNQGG ILRAAEASDL GLTVGGLLDN
SSKGVIGAGG TTTIAAGSLS NNAGSVTAAE DLSVTVGGAA SNVGGTLAAN GNTTLVAGTL
DNSSGTAAAV NGNLSVTTSG ATTNNGGTLQ AGAATTLLNS GLNNVAGKIF GNSLAVNTRA
NGQNNALDNT QGTLAATTTV AVNSGALINA AGLIQSGGAM TIDTNGQLLT NTNAAGYING
QGGISSGGTL NLTTGSVNNN AGFIGAKNAV TASTQGFSNT GAGLVLGQST VAINTNAAAY
DNTGGRTLAA GDLAVNAGTL TNTSGLIRSA ATTTLNAGRI VNTSTLGTEQ GIEGLNVAVG
AGNLDNSSGA IRADVNATIT SGGTVNNTNG LISAGNTLAI ADPNAANPGA KTLNLVNTGG
TLVADKSLKI DVANFSGDGR AVSGQNLSIA LQQDIVNNGE VAANGNLSYT TTGNFTNNGK
LLAGQTLTVG GNNVDNTANA EMSGTNTIVN AGGTLTNRGL IDSRGETEIN AGALNNIGTG
RIYGDAISIS AGTLLNDSET ANGVTKAGTI ASRGDLDIGA GTITNREHAL IYSQGNMYIG
GALDANRQAI GQGGTLDNLS ADIESIGDMS ISMAQVNNRD VHIQKGAPTV TPSTLTGIAP
NTLVGSGVGR TTTYPLDEVN VDPVNGFVYL KATGELVGIG GYAVWHNAIT TTEDTATNID
PAHLVAGGSM TVNGRLYNEN SQVLAGGTIT ATDYQSYQLT GTRTITGSAT VIDNKGQVQA
MNVPLILPPQ TISLGAYKYQ ENINAAAGYN AGVAPVGSGT GGATGKGAVG GGQGPATIVE
VPANVGDTVK VDGQSAGSAT GSTGPDGTTA TQLGTGATDV GANGAATGTA SSAQAGASRT
VPMVVRTSMP NVRIPNASLF NLRAGPGSYL IETDPRFANY RNWLSSDYLL NSLGQDPSNI
LKRLGDGFYE QKLIREQVAQ LTGYRYLDGY NSDEDQYMAL MDAGVIFAKQ YGLRPGVALS
AAQMAQLTSD IVWLVEQTVT LPDGTMQRVL VPQVYVRVRP GDIDGSGALL SADAAIIKSS
GDVTNTGTIA GRRLVSITAE NVNNLGGRIS GGSVALDARS DLNNIGGTID ARDAAMLTAG
RDINIRTTTQ STGGLLNNTA VDRVAGVYVS DPGGVLLASA GRDVNLVGAV LANVGKDSRT
LVNAERDVNL GTVSESSTIF ASGGNKGRSF SAVSSQSREI GSTIVGSGSV AITAGNDVRA
RAADVSAGGT LAVTAEHDIR IDAGQSSQAI LTTSNSSRKT LTSKSSSSEI KAQSDTTVLG
SNFSGQNVVM SAGNDLGVRG SRVSAESQLV LSAGRDVRIE SAQEQHATGN VSRSSQSGFN
GVKDSLLYGK GYSSSSSNRN EMSAGTTQVG STISGGSVSI DAGRDAQIVA SNVLADTNIA
ITAGRNIDVL AAQDTSVSAT ASSGKSRSFS PSPGLAPRHT AYSNVKGSED GTGESSTAVT
SLISANGGNL TMVAGLDSKY AGTGQGNITA EGADLLAKNK VALSGNAVNL NAATSSGSSK
HHAESKSHTI GAQLSGIVGS AITSAYDAAQ ESRKTDDSRL KGALELKAGY DAYKLATDGA
LGNGIQGLTA AGTGGDPSGA AFGVSVSESR TRSRSDTAEV YSNQRGTNIQ AGSIDITARE
TDINMQGAKL QARDIALDAK RDINMLAAEN KAATLSTNSG SSLGGGVTFG FGSQNGFSIQ
VNAGSNQGKA TGIETRHDNT LITATDSVKI KSGGDVNMKG AQITADSVKA DIAGNLNIES
LQDSTTYNSN QSSSGVALSL CIPPICYGQF VSATVDASKQ RVDHNYRSAT GQSGIAAGNG
GYDVSVKGNT DLKGGGITST APESKNSLVT GSLTTSDLQN RQNTNSSSSA VSLSFSYGTG
AANNVLSNVA RSATNTVLAN LNGGKGLPAD NSQSSQTLSV ISPGNIKIVG TGVKEIDDKS
NANVATLTTR DPVTANGALV NTLTLQQAKE IPRLQQQAQD HQRAAQLVGS VLSGVIGDVS
QSLNQQAQAQ ENARALAAGE APRTVTNFAD GSLEKTVLHG IAGFIQAKVG DGSGLAGAAA
GVVNEQLLPA MEKYLKENGY DYNDPSLTVE QAAQKKSDYS ALLTAASTLV GAAVGTVAGG
SSSAGVGATV ANNATVNNFL KHDQVAAMKK EFAACRAKGE CNDEEVRAIA GKYAALSQKN
IDLIKSYIKA GDVASVSALE SQAASAADVD SAIPFGYAQM STVFQGWQNN VNVLGTVGGV
GALGGTDVQQ ALEVAKFRQT YCGGLSAGAC DARVDDAIAD RATRALILGA TTVAIPTAIQ
ALGGLRPVSP SRNSVRPTEI TSDGDLSNVY STQNQTVVHM PIRGTSIEDI SGATVFTMPP
QGQRISGQNA GVRLGAWGEG PSGLGTEIIE QLSPGTRPLQ TGGGYGVDSI GGKINTENKT
IPAFEIKTTD TGNRQPVDKP KPLPERVNDW VNEAANTGMI SGQRVSAADR AYARGLQNLL
REGYTIQPYV VEVAVPPQGQ SARPTVTVVP WPVPKGLPRP GTAP