Gene Paes_0783 details


Gene Information

Locus tag: Paes_0783
Symbol:
ID: 6459897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 844035
End bp: 858281
Gene Length: 14247 bp
Protein Length: 4748 aa
Translation table: 11
GC content: 54%
IMG OID: 642724779
Product: outer membrane adhesin like protein
Protein accession: YP_002015476
Protein GI: 194333616
COG category:
COG ID:
TIGRFAM ID: [TIGR01965] VCBS repeat
            [TIGR03660] T1SS-143 repeat domain
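
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 844035
end_bp = 858281

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 14247
print(protein_length)  # 4748
```

Both results agree with the listed Gene Length (14247 bp) and Protein Length (4748 aa).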


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTTC CTGTTGTACC TCTTTTTCCT GTCGTTGTTT TTCTTTCCGG TAAAGCCTGG 
GCAAGATCAG TCACTGGTTC GATGCGCCCG ATCCATGAAG GAGATGTTCT TCACGAGGGC
GAAATGGTCA TTACTGATAA TGGTTCAAGG GCTGACGTAA AAATGCCCGA TGGTTTAATT
GTGCCTGTTG AAGGAGAACT GCTTCTGTCA TTGCAGGAGG CTTCAGGTTC TGAACAGGAT
GCATCCGATA AAGAGGGGTT TTCAACGGCT GAGGAGAACT TGCCTTCCCC CAGCGATATC
GTATCAGAGA ATGTTCATGG TTCACACCAG AATGGTGCAC CGGACGCTCC TTTTGCCGTT
ACGCCAGACA GCGTTGAACT GAACAATGAG CCGAATAATT TCTACAGGAT CCTGCGCTCT
CAGGATATCA TGGAAGTTCA GCTGGTTGAA GGGGGAAACG CATTTAATAT GGCTCCTATT
CTCTCGGTCT CTATCGTTGG AGGATACAGC GATTCTCTTG CCGGTCGTAC GGGAGGCTAC
AATGCGTTTA TGGATGGCCA TGCCACCTAT AACCCGCGTT TAATTGAAGG AACTGGCATT
GTTCAGCGCG ATCTCAATGA GTTTGAGCCT GAGTTGAGGC CGTTTGCCGG GGGTGCTGAT
GATGACGACT ATGTTAATCG ACAGCCTAAA CCTCTTCCTG ATGTCGCTGA GGTTGTCGAA
GGCGGCAATG TTGTTTCAGG CAATGTTCTT GATAACGATA ACAGCGGAAA CGGCTCATCG
AGAGTTACAT CGATAACCTA CATCCCTGAA GGAGGCGGCG ATCCGCAGTC TGCAGATGTT
CCAAAAGGAG GATCTGTGAC GGTGGATACT CTTTATGGAG AATTGACCAT CAACAGCGAC
GGTAGCTGGG AGTATACCAG TGATCCTTTT GAACCCCACC CGTTACCGCC TTCTGGAGAT
CCTTTACAGG ATGTCATTAC CTATACCATA GCTGACGCTG ACGGGGATAC GGCTTCTTCG
ACATTGACAA TAGATGTGCT GGACACGGTC CCTGCAATAG GCAACCCTGA GCCATCAGTG
GTTGACGAAG ATGATCTTCC TTCGGGCAGT GATGATGAGA AAGAGTCGAT TATTGTCGGT
GGTTCGCTTG GTGTAACGCC GGGTGAAGAT CCTATAGATA CAACATTTTC TGATCAGGAT
GCTCCTGACT CTCTGACATC AGGTGGAGAA AAGGTTCTTT ACGAAATACT CGATGACGGT
CATACGCTTC GAGCCTATAC AGAAGAAGGT AATGAGACGG TTTTTACCGT TGATATTCTC
AATCCTGACA GCACGACTGG CGCACAGGAA TACGAGTTCA CGCTGGTTCG ACCTCTTGAC
CATGATCCTG TTCAGGGGGA AAACCCGTAT CATCTGGTTT TCGATTTTGA AGTGCAGGAT
ACCAATGATC TCGATCCTGA TAAGGGTTCG TTCACCGTGA CGGTGGTTGA TGATATACCC
GTGCAGACGG AGGAAGTCGA GGAGCAGTCG GTGCAGGAAG ACGCGCTGAG TGGCGGCAAC
ATAGATGATG CGGTAAACGA TACGGTAACG GCGACAGGGA GCGTAGCCAG TCTGGTTGAC
GCAGGCGCCG ACGAGCCGGT AACGTTCAGC CTGAACCCGG ACACGACAGG TCTGCCGGAA
GTGACGTCGA AAGGCGAGCT GGTGACCTAC AGTGTCGAAG GTGACGTTCT GACGGGCACC
GGCCCGGGCG GACCGGTGTT CACGCTGACG CTCGATCCTG TTACAGGCGG GTACACGTTC
GTACTGCTCG ACCAGCTTGA TCACAGCGGA GCGGAAAACG ATAACGAAGA ACTGCCGCTG
GATCTGTCGA GCGGCATCAT CGCGACGGAC AAGGATAATG ACCAGCTGAC GCTGGATGAA
GGATCGCTGG TGATCAACGT GGAGAACGAC GTACCGGCGC AGACGGAGAA AGTCGAAGAG
CAGTCGGTGC AGGAAGACGC GCTGAGTGGC GGCAACATAG ATGATGCGGT AAACGATACG
GTAACGGCGA CAGGCAGCGT AGCCAGTCTG GTTGACGCAG GCGCCGACGA GCCGGTAACG
TTCAGCCTGA ACCCGGACAC GACAGGTCTG CCGGAAGTGA CGTCGAAAGG CGAGCTGGTG
ACCTACAGTG TCGAAGGTGA CGTTCTGACG GGCACCGGCC CGGACGGACC GGTGTTCACG
CTGACGCTCG ATCCTGTTAC AGGCGGGTAC ACGTTCGTAC TGCTCGACCA GCTTGATCAC
AGCGGAGCGG AAAACGATAA CGAAGAACTG CCGCTGGATC TGTCGAGCGG CATCATCGCG
ACGGACAAGG ATAATGACCA GCTGACGCTG GATGAAGGAT CGCTGGTGAT CAACGTGGAG
AACGACGTAC CGGCGCAGAC GGAGGAAGTC GAGGAGCAGT CGGTGCAGGA AGACGCGCTG
AGTGGCGGCA ACATAGATGA TGCGGTAAAC GATACGGTAA CGGCGACAGG CAGCGTAGCC
AGTCTGGTTG ACGCAGGCGC CGACGAGCCG GTAACGTTCA GCCTGAACCC GGACACGACA
GGTCTGCCGG AAGTGACGTC GAAAGGCGAA CTGGTGACCT ACAGTGTCGA CGGCAACGTG
CTGACAGCCA CAGGCCCTGA CGGACCGGTG TTCACGCTGA CACTCGATCC TGTGACAGGC
GAGTACACGT TCGTATTGCT CGATCAGCTT GATCACAGCG GAGGGGAAAA CGATAACGAA
GAACTGCCGC TGGATCTGTC GAGCGGTATC ATCGCGACGG ACAAGGATGA TGACCAGCTG
ACGCTGGATG AAGGATCGCT GGTGATCAAG GTGGAGAACG ACGTACCGGT CGCCGAGGAC
GATATTGATC ATACAGCAGT AGGCGAGCCG ATCATTATCG ACGTGTTCAG TAATGATAAT
TCCGGCGCTG ATGAGGTCAC GACTGTTATC GGCGTCGATG ATCCGGATAA CGGAACAGCC
GTTGTAAATC CTGACGGAAC AATTACCTAT ACGCCTGATC CCGGGTTTAC AGGTGTGGAG
ACGTTTGATT ACACGATTGA GGATAAAGAC GGGGATATTT CTGTAGCGAC GGTGACGGTT
ACGGTCATTT CGGTCGATAT CGATGATGCG ATTTTGCCGA CAACCGATCC GAAATACGAT
GACGGCGTGA TCAGCTCGGT TGATGATCAG GAAAATACAC CGATTACCGG TACCATCGGA
GATATCATCG ATCAGGGCGG CGAAATCGTC AGCCTTGTCG TGACCGACGA GGACGGAACC
AGCATTTCAA TACCTCTTGA CGAGATTACC GTCAATCCGG ACGGTACCTT CGAAACTACG
GTCGATGTTA CAGGACTGGT TGACGGAGAG CTGACGGTGA CTCTGACAAC CTCGCTTGAA
GAGAGGACAG TCGATACGGA GGATACCATC CTTAAGGATA CGGTTACCGA GGTAACCATT
GATCCTTTGG AGGTCGTGAA TGGAGAGATC CCGACGATCA CGGGAACCGG TGAGCCGGGT
TCCACGGTTG TGCTCAGCGA TGACGACGGC CCCATTGATG GACCTGAAAT TATTGTCGAG
CCTGACGGCA CCTGGAGTTT CACTCCCGAC GAACCGCTGC CGGAAGAAGT CGATACCATC
ACGGCGGATA GTACCGATCC CTATGGTAAT ATCAATGATG ACGTGAGAGA GATACCGCAG
CTCTTTACTC CTGATGAAAA CGGCCTGGAG CCCGGCGATG TCACGGTCTA CGAGGCAGGT
CTTCCAGACG GATCAAACGA AGGAACCGAT CCGGCAGAGC CTTCCTCCTA CGATGGGCAT
TTCTATATCG ACGCTGATCC GGAACAGATC GACACGGTTA CCATCACGAC GGATGTCGAG
ACAGTGGTGA TTCCGAAGAA TGATCTTGTT GCGTCCGATA CCACGCCGAT CGAGATAGAG
ACAACCTACG GCACCTTTAC CATCACGGAA TACAATTTCA TTACCAACGA GGTGACCTAT
GTCTATGAGA TAGATGACAA CAGCGAAGCG CATACCGATC CGGGGAATGA CATTCTTCAG
GAGCCGATCG GCGTTACCGT TGAAGATGAA AACGGTGACA CCCGGTCCGC GACATTGACC
ATGACCGTCG TTGACGATCT GCCGGAAGCA GTCGATGATA CCGGGACCGT CGAAGAAGGA
GGGAATACCG TTACCGGCAA TCTCTTCGAT AACGATACCT TCGGAGCGGA CGGTTATGAC
AGCGGCAACT TTACTTATAG TGATGAGAAC GGCGCACCGG TTCCCGCAGA CATTCCTGAA
ACAGGAGAAA CGACAGTAAC GACACAGTAC GGCAGCTTCA CTATCGACAG CGCTGGCAAT
TGGAGCTATA CCAGTAACGA TACGGTCGAT CACACCAGTG CCGACAGTCT GCCCGAGACG
ATAGTTTACA CGTTTACCGA CGGCGACGGC GATACAGCCG GAGCGACGCT GACCATCAAC
GTCACCGATA CCGAACCGGA GATCGACACT CCTGAAAACC GGATTGTCAG CGACGCAAAT
CTGCCTGAGG GAACCTCTTC TGACAACGGA TTGCTCACTG TTTCCGGTCC GCTTGGCGTG
GTGAAGGGAG AAGACACTAT CGATACCACG TTTGATACCG ATCAGACAGT ACTTGAAGCA
TTTGGCCTGT ACTCCGAAGA CGAGGAAGTC AAGTATGAGA TTATTGAAAA TGGCCATACG
CTTCGGGCCT ATACGGGTGA CGGAACGGGA CCGGATGACA CGGTTTTCAC GGTGGTGATC
AATGATCCGG AGAGTGACGC GGCCACCTAT ACCTACACGC ATGTCAGGCC GCTGGACCAT
GTCGACCCGA ATACGGAAGC GTTTGTGCTG CCGTTCAATT TCTCGGTCAA GGACAGCGAT
AATGACAGCG ATTCAGATAC CTTCACCGTA ACGGTGCTCG ATGATGTGCC GACAGCGGTC
GACGACCCGC CGCAGACGGT CGAGGAGGGA TCGAACACGA TCACCGGCTC GACAAGCGTG
CTCGGTAACG ACACCTACGG CGCGGACGGA CTTGATGGCG CGACCGTGAC CTACAATCCT
GAAGGACCTG AACCCGAAGC GACAGTCGAT GTCCCTGAAA CCGGGAGTAC CGTTGTCGAT
ACCGAGTACG GCGCTCTTAC CATCAACAGT GACGGAACCT GGAGCTACAC CAGCGACCCT
TCAGCGGACC ATTCGGTTTC GGACTCACTC GATGATTCTT TCACCTATAC CATCACCGAC
GGTGACGGCG ATACCTCGAG CGCGGTACAG CCGATCACGG TTACCGATAC GGTTCCTGAG
GCAGTCGATG ACAACGGCGG TACCGTTGAA GAGGGCGGTC TGGAGATAAT CGGCAATCTC
CTTGATAACG ACACGCTCCG TGAAGACGGC GGCACCGTTA CCAGCTTCAC CTACACTCGT
GAAGACGGAA CGGAAGGTTC GGTTGCTGTT CCGGAAAACG GTACGGAAGC ATCCGCAGAT
ACGCAATACG GCACCATCTA TGTCAGAAAT ACCGGCGAAT GGCGTTATGT GAGTGACGCA
TCGGAAGATC ACACGACGGA CGATCCTCTG ACGGAAGGAA TTACCTATAC ACTGACAGAC
GGCGATACCG ATACTTCCGA AGCCGTGCTG ACGGTCGGCG TCACCGATAC CGTGCCGCTG
ATAGGCGAAC CTGACGACAG TAGTGTCCTT GAAGCAAATC TTCCCGGCGG CAGCGACCCG
CAGCCTGCAG CGCTGGTGAA AACCGGGGAT CTCAATGTTG CACCGGGAGC CGACCCGCTC
GATACACAGT TTGCGCCGAT CGGAAGCCAG ACGGCACTCA ACAGCCTCGG GCTTACATCG
GACGGCGAGA CGGTCAATTA TACGCTCAGT CCGGACGGTT ACACGCTGAC CGCCTATACT
GACGATCCGA ATGATCCGGT CTTTACCGTT GTTATCAATG ATCCTACAGA TCAGACGCAG
CCTGATGGAA ATCAAAACTA TACCTTCACC CTCCTGAAAC CGCTCGATCA GGATCCGGGA
GAGGACGATG TAGACCTGAC CTTCGCTTTC GGCGTGGAGG ACGGTGATAC CGATACCGAT
ACGGACACCT TCACGGTGAC CGTGGTTGAC GAACCGCTCA AGGCCAATGA CGACCTTGAT
TCAACGTCGG TCAATACGCC GGTGACGATA GATGTTCTTA CCAATGACAA CGATCCCGAT
CCGGCTTCTC CGCTTGAGAT TGTTTCCGGT TCGGTTACCG ATCCTCCGAA CGGTTCGGTG
ACTGTCAATC CCGACGGCAC CATCACCTAT ACTCCCGATC CGGACTTTAC CGGTACCGAT
ACGTTCGAGT ATCAGATTTT TGATCCGGGG ACGGGCAAGA ACGATACGGC AATCGTGACG
GTAAACGTGA TCGGGGTGGA TATCGATGAC AATCCTGACA ATCCGGATCC TATTCCCGAC
GGCATTGACG ATGATGTGAT CAGTTCAGTG GACAATGTGG CCGAGACGCC GCTGACGGGT
GCTATAGGAG AAATCGTCAA CCAGGGCGGT GAAATCACCA GTCTTATTGT GACCGACGAA
GACGGCGTTG AGGTTGTGGT CGATCCAGCA GCGATTACGG TCAATCCCGA CGGGACATTC
GAGGTTCCTG CCGACGTCAC CGGGCTCAAT GACGGTATCC TGACGGTGAC GCTGACAGCG
ACCGACGTGA ACGGTACGGT CGTCACGACT ACCGATACCA TTCCGAAAGA TACGGTTACT
CCGGTCACCA TCGATCCGAT CGAAGTGACG AATAATGAAG ACCTGCCGAT CATAACCGGT
ACAGGAGAGC CCGGGGCGAC CATTGTTCTT ACCGAGGAGG ACGGCTCTCC GATCAGTTCG
CAGATCACGG TTCAGCCAGA CGGAACATGG AGCTTCACTC CTGACGAACC ATTGGATAAC
GACGAAGTCA CGATTATAGC AAACGCTACC GATCCATACG GCAATACCAA CACCGCTTCG
CGTGATATAC CGGTCATCGT AATTCCCGAT ATCAACGGGA TAATAGATGG TGACGAGGCC
ACGGTATATG AGGAAGGTCT ACCGGATGGA TCGAACACCG GCAACGAGCC GACAAGCGTT
GCTGGCTACT TTATCATCGA TCTCGATCCG GATTCGCTCG ACGAGATAAC GCTGACGACA
GCGAATTCGA ATACGTTGAC GCTCAACGCG GTTCAGCTGA CAAATATCCA GAACGGTTCA
CTTGATCCGC AGGAAGTCGA GACGGAATAT GGCACCTTGA CCATCACCGG TTATGATCCC
GATCCTGATT CGGATCCGTC AACCACCGAA GGCAGTGTTT TCTATACCTA CGCCATCGAC
GAAAATACAC CGGCGCATAC GGCTGTCGGG CACGACAGGC TCGAGGATGA TGTAACTATC
AGTGTGAAAG ACTCCAATGG AGACATCCGC AATGCCACGC TCGACCTGAG CGTGATCGAC
GATGTGCCTC AAGCAGTCGA TGACGGCGCC TATGTTGTGG AGGGCGGTGA GACGGTGACG
GGTAACGTTA TCACCGATAG CGAAGAAAAC GGCGACAACG GCGCCGATAC GCCCGGGGCG
GACGGTGTGA CCCTGACCGG GTTTACCTAT ATCCCTGAAG GGGGAACTAT TCCTGTTGCC
GGTACGGTCG GTTCTCCGGT GGATACCGCA TACGGAACGC TGACGCTCAA CGCCAACGGT
TCCTGGACCT ATGAGAGCGA TCCGACCGAA GCGCATACGT TAACTGATCC CCAGGAGTTG
CTGCCGGAAG AGATCACCTA TACCATCACT GACGCAGACG GCGATACCTC GTCGGCAACG
CTCACCATTG AGGTTGACGA TACGGTGCCT GCTATCAGTG ATCCTGAAGA CGAGATTGTC
TACGAAAAAT ATCTGCGGTT CGGGTCCGAC CCGACTCCTC TGGAACTGGT GAAAACCGGT
CTACTCGATC CGGTACCGGG AGCGGATACG TTCAACGTGA CCTTTAACAC TCCGGTACCG
CCTGAAACCA CTTCAGGTGA TCCGATGACT TCCGGCGGAG CGGTCGTTCA ATACCTGATT
TCTCCAGACG GTCATACGCT GACAGCCCAT ACCGGTGACC CAAATGATCC GGTCTTTACG
GTCGAGATCA AGAATTACGA CCAGCCCGGA GCATCATACG AGTTCACTCT TTCACGGCCG
CTCGATCACG ATCCTCTCCT TGCAGATCCT GATGTGACTG ATCCTGATTT CATCCATCTG
GACTTTCCGG TCATCATCAC CGACAGCGAC GGCGATACAG ACACCGATGC GTTCACGGTG
ACGGTAGTGG ATGATCCGCC CGGAGAGGCT CCTGATGCGC TGACGGTGCC TGAAGACGGT
TCACAGACCA TCAATACCAA CGCCGACGCT ACACCGTCCA ATACGACGGT GCCGGACAAG
GGAGATCCTG ACGGCCCTTC GCACGGAACC GCGGTGATCA ATTCGGATGG AACGCTTACC
TATACGCCTG ATCCCGATTA CAGCGGACCG GACGAGCTGA CCTATACCTA TATCGATGAA
GATAACGTTG AACACGACAT CACGGTGACG ATTACCGTTG AGCCGATATC CGATCCGCCA
CTCCTCAGCA GGGATGCTGC AGGTGTTGAA ACACTTGAAG ACGTCTCCGT CGCTCTTGGT
CTCAAAGCTC CTGTTGTGAA GGATGCTACA GACGATAACG GGCCTACTGT TTCCGGAGAT
CCGACAACAG AAGGCGATAA TCCGGAACTG CTCGGTCCGA TAACGATGAG CGGCATACCC
GAGGGAGCGA AGCTCCTTTA TGCGGATGGT ACAACCGCCG CGGTGTCGAA TGGCGGCGAT
ATCACTATCG TGCTGAGCGA CGGCGACCAC ATAGAGAGTG CCACCGGCGA TGTCATCATG
ACCAGTGCCA AGTTCGAAGC GCTGCGCATC CTGCCGCCGG CTGATGAACA TAACGATTTC
ACCGTCACCA TGACGGTGAC AGAATACGAG GTTGACGACA TCGGCAACCC GCTTGCCGGG
GTGGACGGAG CGAACTCGTC AACCACGGTG GTGGTCGAAG TGCTGGCGAT TACCGATCTG
GTGGATATCT CCTGGAAGGA AGCCGGTGAT TTCCCGGACA CCGATGAAAT ACCTGACCAT
CCGGACACCA TCAACAAGGT CATCGATGAG GATACGGCGC TCGATCTTAC CGCGATGCTG
AATTACTCTT TCGCGGCAGA GCCGGAAGAA ACAACGACAC AAACGCCCCC GGTTGCTGAC
GGCAACAATA CGACACCTGA TTTTGACGGC AGCGAAGACC GTGTGCTGAC TATCGGAGAA
ACTGGCGGTG CCATTCTGCC CACAGGCTCG ATCGTCATGG TGAACGGAAC TGAAATTGCT
CCTGAAACCG ACGGAACCTA CATTATTCCG CTGACCAACG ACCAGACCAT TCCGCCGATC
ACCGTGCAGC CTCCGGCTGA TTTCAGCGGC GATATCCTCG AGGATATTCC TGTCACGGTG
TCAGCGCTGG ACAGAGACAG CGACTCGCCC GCGGCGTCCC CGACGGAGGA ATCCGATACC
GTTTATTTCA ACCTCTTTGT CAACCCGGTT GCGGGCGACG TCGAGGCGCC GCACGTTTCC
ACTGTAGAGG ATACCGCGGT CAAGTTCATG GAGGAGCTCG TTGTCACCGA CGATAACGAT
GGCAGCGAGA TCATAGACAG CATTGTTATC AAGGCTATTC CGGATGGCTG GAAGCTCTAT
GATGAGACGG GCGCCCCGCT GATGACGGGC AACGGCGCCG AAGACTGGAC GATAGATACA
GCCGATATCA CAAGCGGAGC CTATCGGAAC TATACCATTC TTCCACCGGG CCACAGCAGT
GTGGACGAGA CGGTCGATAT CGACGTGAGG ACGACCGATA CACAGACAGT CAACGGTATG
CCCGTAAGCG ATACGCAAAC GGTCACCCTG CCGGTCCTTA TCAAGGTGAC GCCCAAAGGC
GAACGGATCG GGACGGACAG CGATCAGAAC GGCGAAACCG ACGATATCGT CGGGCAGCAG
GAAATAGCCG ATACGGATGA TAATGCTACG GCCGATCTCC AGATGAACCC GTCGCATGAG
TATACTCTCG GCGGAACAGA GGATACATGG TTTGATTTAA GCTCCGACCC TGGTGATGGA
GCCTTCGATC TCAAGTCGCC ATGGTTCAAC GAGGATACGG AAATAACCGC ATATCAGGAT
AATTCCGAAA CCACCTACGC CCTGTTCACT CCTCAGGACA GTGATGGAGC GCCGTTGATC
GGGTCGCAGT TCCACTACGA CAACGGTACC CTTAACGGCC GGACCCTGAC ATATACAGGG
GTTCCTGTCG AGATCCCCGT GCAATACCTC GATAGTCTTG AATTCAAGCC GCCGGCAAAC
TATGCGGATG ATGATGGTAT AGAGATCATC GTCAACGCCA AGACGGTTGA TGTCGATCCG
GATGACGGTG CAGTGGATGT GCAGATTACA GGGAAAGTGG TCCTGACCAT CCCGAGCGTG
ACTGCAGTGC CCGACATCGT CAGCCTCGCG GTTTCAAGCC CCGGCGGCGA CGAGGATACC
GAGATTCCCA TCAGTATACG ACCGCAGAGC GACGACAAGG ACGGTTCTGA AACCTTTACG
ATTACTCTTG ACAATGTGCC GGACGACGCC ATTCTTCGCT ATAACGACGT TGTGCTGACA
GGTTCACCGG GATCTGAAAC CGGTACCACG AAATATGAGA TAGAGGAATT TGATGAAACT
GTATCGCTTA CCATCCAGCC GGGTCTGAAT AGCGATGAGG ATATTCTCAC TGATCTCAAT
GTTGCTGCGG TGAGTGAGGA AGGCGGAAGT GTTTCGCCTC CGACATCCCT GCCGCTTCAG
GTCCAGGTAT GGGGTGTGGC CGATGAAGTT ACAGTTACGG CTGTAACGCC GAAATATGCC
GAGGTGGACG TTGATATCAC CAATGAAGTA GCGTTCAGCC AGGTGGTGAC AGGCTATACC
ATGGAGGATG ATGATGGTTC CGAGAGCCTC ACGTTCAAGC TGACTGGTCT CGATCCCCAG
TTCGATATCA GCGGAGGAAC CTTCCTGGGC GGTTCAGGGA GCGACAGGGT ATGGGTCCTG
ATGCCCGATG ATCTTGCGAC GGCAAAGATT ATCGTGCCGG AGAACTACAG CGGCACGGTG
AACCTTGGCC TTGTACCGGT GACCACCGAG CGCGAAGGCG ATAGCCAGAC GCAGGCAGTC
ATCCCTCTAT CAGTTGAGAT TACTCCGTCA CCGGAAGCGA CCATTACCAC CTCGACGACA
ATTCCCGAGG ATGCACTGCA GCAGGTTGAT TTTTCGATCC AGTACCAGAA CGGTGATACC
GATGAAACGG TGACCTCGGT CTGGATCGAA GCGGCCGATG TCGACAGTGC CGACTTTACC
CTCTATCTTG GCAGCGACGG TGTCACCACA CTTGCCGACG CGGCAGCCAG TCCAGCGGTG
ACCGATGTGG TGCTTGAAGG CGGCTACTAC AAGCTGACCG GATCGGCCAT CGGCAACGTC
TATGCGCAGA ACGAGGCTGA CCTGCATGAC AGCTACACCT TCGATATCAA GTATGAGATT
ACCGACAGCA CAACCGACGG GACACTGAAC CCCCCGGATA CACCCGAGAC GACGCAGTCT
GACGCGGTGT ATGAACTGAC CTTCACGGCA GTGACCGACC CGATAACCGA AACGCTGGGG
ACTATCACAC TCGAAGACTC AGGTGACGGC ATAGTTTCTC CTACATCGGT CTATGTAAAC
CAGAACACGG TGATTACCGT GCCGGTGACC GTGACCCAGG ACGACGACTC GGCCGAAGGA
CCGAACGGGC AGGATGACGA CGGCAGTGAA AAGCTCGAGC AGTTCATTAT CGACGGTGTG
CCTCCGGGGG TCACTGTGGT GGGCGGAACC TACATCGGGG ATGTCTATGA TCCTGATTCT
GGTGACGAAG TCAATTCCGG CCGCTGGGTG ATTGAAGTGA ATCCAGACCA GGCATTTACC
ACCATAGACG ATGGAGCATT GACGTTCGAT CTGGAGTTCG CGGTTGACGG CATTGCGGAT
CAGCTTGCAG ACATCAACCA GACCATTACC ATTACTGCAG TGAGCCGGGA TGAAAATTCC
AGTATCGCAC AGTCGGCACA GGCCTTCACG CTCAATGTCG CACCGGTAGG CATTTTTGAT
GATAGTGAAG GACAGGGGAT TGCTGATGCA AATGAACCGG ACAATATCGA TATATGGAGC
CAGAATACAG CCTTCGGCGC TATAGAGGAT ACATCACCGA CACTTGGCCA GATTGTCGAT
CTGCAGATCA GTGGTGGGAA CAACAGCTCA TTCAGCTTTA TTCTTGACGA CCTTCCTCCG
GGTACCGTTG TCACCGGTAT GACTGAAATG CTCCTTCCAA GCGGCCAGAA TGTCTATACT
GCTTCGGGAG TTGGGGGCAA TGCAGGACTG CAAAACCTGC TCAGCAACAT TACCATTACT
CCTCCTCCGG ACTGGAACGA CAACAATCAC GAGGACCAGT TTGATTTCAC CCTGACCCTT
ACCACCTATT CGCCGGGCGG AGAAAGCAAA CAGGAGACTA TCCTCGTTGA AGACGTACCG
GTCGATCCGG TCACTGACCC GACAACCATC GTGATCGACG CGGCGGATGT TGATGAAGAC
AACAACGTGA TCTTCACGGT TTCCTTCAGC AACAGCGCTG ATGGAGATGC CGCCATTGAC
GAGTATACCA CCATGCAGAG CCCGCTCTAT CTGAAAGTTG ACGATTCGGA CATGAATCCC
GCATCGGGAG GAACGCTATA CTACGAGGGC TCGCCGATTT CGACACAGAC GATATCCGGG
GTTTCCGGTG TGCCGGATGG TGACTACTAC GTTATTGAAA ATGTAGAGGT CACCGACGAG
CTTGCCTTTA CCTATGAGCC AGAAGGCAAT GCGTCCGGTC CGGTGGATGT GGCCGCTTAC
ATCAGAACGC AGGAAGTCGA TGCGCCGAAT ACAGTGACCA ACGAGACGTC AACAAGTATT
GCCGTCAATC CGGTCAGTGA CGGGTACGAT CTTGTTGTTG ACAGCGCAAC AGGACCGGAG
GATACGCCGA TCGAGATTGT TCTGGGAGGG ACTGGCCTCC GAGATACCGA CGGATCGGAA
GAGGCAGTCA GCGCTTTACT GGGCAGCGTC CCTGATGGAT ATCTGGTCAT CACGGGAACA
TCGGCGACAA ACGCGGAAGA GGCGGTCAAC GTTGGAGACG ACGGAACCGG TAATTCTACA
AATTCATGGA GCATTCCTTT GAATCCCGAT GGAACGCTGC CGGCCTATAT CGCTGTTGTG
CCCGGGGCAA ACTTCAGCGG AAGCGTGGAG GATATCACAC TTTCGGTCTA CTCTCGTGAA
GAAGGGCTTG AGCCTGTCCT CGATACGGTC TCGTTTGATC TCGAGGTGAC TCCTGTGGCC
GATCCGATTG ATTCTGATCT GTTTATACCG ACAACATCAT TTGGTGATTT CGGGGAGAAG
ATCCCCATCA ACCTCAATAT GGTGCTTGAA GATCAGGACG GTTCTGAAAC CGTGACCCTG
ACCTTTACAG GTATTGGACC TGGCGCAGAA TTTTTTGACA GCGACGAAAA TCCTGTGGCT
TCAACGTATG ATGAGGGTAC AGATACCTAT ACGCTGACAG GGGTACCGGT ATATGATTCT
TCTGGTATCT ATGATGTGAA TCACCTTACC CTCGCTCAGT CTGCAGAAGT GGTGCAGGTA
GAGGTTACCG CATTCACGGT TGACGGCAGT GACGATTCCT CAGCTGATGC TGTTACCAAA
ACATTCACGC TCAATATTTC GACAACCAGC GGTAATGATG ATCTGCTTTA TGATGGAGCC
GCCGGTCTGG ACTTTGACGG TAAAGGGGGA GAGGATACGG TGTATCTTCA TCTTGATCAG
GATATTGATT TCGATAATGA CACCTCTTCG CTTGACAATG TCGAAATCCT CGATCTTGGT
CCGAACGGTG ATCACCAGGT GCTGAGTCTT TCAGTCTCGG ATGTTCTGCA AAGTACCGAC
GCCGATAATG AACTCACGAT TCTTGGCGAC AGCGGTGATA GTGTGGAGTT GGTCGGGATT
TGGGCTTCCG AGGAGATTGC TGGTTTTACG GTTTATACAA GCGGGGGAGC AACGCTGAAT
ATTGAGGATA CTCTTTCAGT TCTATGA
 
Protein sequence
MSFPVVPLFP VVVFLSGKAW ARSVTGSMRP IHEGDVLHEG EMVITDNGSR ADVKMPDGLI 
VPVEGELLLS LQEASGSEQD ASDKEGFSTA EENLPSPSDI VSENVHGSHQ NGAPDAPFAV
TPDSVELNNE PNNFYRILRS QDIMEVQLVE GGNAFNMAPI LSVSIVGGYS DSLAGRTGGY
NAFMDGHATY NPRLIEGTGI VQRDLNEFEP ELRPFAGGAD DDDYVNRQPK PLPDVAEVVE
GGNVVSGNVL DNDNSGNGSS RVTSITYIPE GGGDPQSADV PKGGSVTVDT LYGELTINSD
GSWEYTSDPF EPHPLPPSGD PLQDVITYTI ADADGDTASS TLTIDVLDTV PAIGNPEPSV
VDEDDLPSGS DDEKESIIVG GSLGVTPGED PIDTTFSDQD APDSLTSGGE KVLYEILDDG
HTLRAYTEEG NETVFTVDIL NPDSTTGAQE YEFTLVRPLD HDPVQGENPY HLVFDFEVQD
TNDLDPDKGS FTVTVVDDIP VQTEEVEEQS VQEDALSGGN IDDAVNDTVT ATGSVASLVD
AGADEPVTFS LNPDTTGLPE VTSKGELVTY SVEGDVLTGT GPGGPVFTLT LDPVTGGYTF
VLLDQLDHSG AENDNEELPL DLSSGIIATD KDNDQLTLDE GSLVINVEND VPAQTEKVEE
QSVQEDALSG GNIDDAVNDT VTATGSVASL VDAGADEPVT FSLNPDTTGL PEVTSKGELV
TYSVEGDVLT GTGPDGPVFT LTLDPVTGGY TFVLLDQLDH SGAENDNEEL PLDLSSGIIA
TDKDNDQLTL DEGSLVINVE NDVPAQTEEV EEQSVQEDAL SGGNIDDAVN DTVTATGSVA
SLVDAGADEP VTFSLNPDTT GLPEVTSKGE LVTYSVDGNV LTATGPDGPV FTLTLDPVTG
EYTFVLLDQL DHSGGENDNE ELPLDLSSGI IATDKDDDQL TLDEGSLVIK VENDVPVAED
DIDHTAVGEP IIIDVFSNDN SGADEVTTVI GVDDPDNGTA VVNPDGTITY TPDPGFTGVE
TFDYTIEDKD GDISVATVTV TVISVDIDDA ILPTTDPKYD DGVISSVDDQ ENTPITGTIG
DIIDQGGEIV SLVVTDEDGT SISIPLDEIT VNPDGTFETT VDVTGLVDGE LTVTLTTSLE
ERTVDTEDTI LKDTVTEVTI DPLEVVNGEI PTITGTGEPG STVVLSDDDG PIDGPEIIVE
PDGTWSFTPD EPLPEEVDTI TADSTDPYGN INDDVREIPQ LFTPDENGLE PGDVTVYEAG
LPDGSNEGTD PAEPSSYDGH FYIDADPEQI DTVTITTDVE TVVIPKNDLV ASDTTPIEIE
TTYGTFTITE YNFITNEVTY VYEIDDNSEA HTDPGNDILQ EPIGVTVEDE NGDTRSATLT
MTVVDDLPEA VDDTGTVEEG GNTVTGNLFD NDTFGADGYD SGNFTYSDEN GAPVPADIPE
TGETTVTTQY GSFTIDSAGN WSYTSNDTVD HTSADSLPET IVYTFTDGDG DTAGATLTIN
VTDTEPEIDT PENRIVSDAN LPEGTSSDNG LLTVSGPLGV VKGEDTIDTT FDTDQTVLEA
FGLYSEDEEV KYEIIENGHT LRAYTGDGTG PDDTVFTVVI NDPESDAATY TYTHVRPLDH
VDPNTEAFVL PFNFSVKDSD NDSDSDTFTV TVLDDVPTAV DDPPQTVEEG SNTITGSTSV
LGNDTYGADG LDGATVTYNP EGPEPEATVD VPETGSTVVD TEYGALTINS DGTWSYTSDP
SADHSVSDSL DDSFTYTITD GDGDTSSAVQ PITVTDTVPE AVDDNGGTVE EGGLEIIGNL
LDNDTLREDG GTVTSFTYTR EDGTEGSVAV PENGTEASAD TQYGTIYVRN TGEWRYVSDA
SEDHTTDDPL TEGITYTLTD GDTDTSEAVL TVGVTDTVPL IGEPDDSSVL EANLPGGSDP
QPAALVKTGD LNVAPGADPL DTQFAPIGSQ TALNSLGLTS DGETVNYTLS PDGYTLTAYT
DDPNDPVFTV VINDPTDQTQ PDGNQNYTFT LLKPLDQDPG EDDVDLTFAF GVEDGDTDTD
TDTFTVTVVD EPLKANDDLD STSVNTPVTI DVLTNDNDPD PASPLEIVSG SVTDPPNGSV
TVNPDGTITY TPDPDFTGTD TFEYQIFDPG TGKNDTAIVT VNVIGVDIDD NPDNPDPIPD
GIDDDVISSV DNVAETPLTG AIGEIVNQGG EITSLIVTDE DGVEVVVDPA AITVNPDGTF
EVPADVTGLN DGILTVTLTA TDVNGTVVTT TDTIPKDTVT PVTIDPIEVT NNEDLPIITG
TGEPGATIVL TEEDGSPISS QITVQPDGTW SFTPDEPLDN DEVTIIANAT DPYGNTNTAS
RDIPVIVIPD INGIIDGDEA TVYEEGLPDG SNTGNEPTSV AGYFIIDLDP DSLDEITLTT
ANSNTLTLNA VQLTNIQNGS LDPQEVETEY GTLTITGYDP DPDSDPSTTE GSVFYTYAID
ENTPAHTAVG HDRLEDDVTI SVKDSNGDIR NATLDLSVID DVPQAVDDGA YVVEGGETVT
GNVITDSEEN GDNGADTPGA DGVTLTGFTY IPEGGTIPVA GTVGSPVDTA YGTLTLNANG
SWTYESDPTE AHTLTDPQEL LPEEITYTIT DADGDTSSAT LTIEVDDTVP AISDPEDEIV
YEKYLRFGSD PTPLELVKTG LLDPVPGADT FNVTFNTPVP PETTSGDPMT SGGAVVQYLI
SPDGHTLTAH TGDPNDPVFT VEIKNYDQPG ASYEFTLSRP LDHDPLLADP DVTDPDFIHL
DFPVIITDSD GDTDTDAFTV TVVDDPPGEA PDALTVPEDG SQTINTNADA TPSNTTVPDK
GDPDGPSHGT AVINSDGTLT YTPDPDYSGP DELTYTYIDE DNVEHDITVT ITVEPISDPP
LLSRDAAGVE TLEDVSVALG LKAPVVKDAT DDNGPTVSGD PTTEGDNPEL LGPITMSGIP
EGAKLLYADG TTAAVSNGGD ITIVLSDGDH IESATGDVIM TSAKFEALRI LPPADEHNDF
TVTMTVTEYE VDDIGNPLAG VDGANSSTTV VVEVLAITDL VDISWKEAGD FPDTDEIPDH
PDTINKVIDE DTALDLTAML NYSFAAEPEE TTTQTPPVAD GNNTTPDFDG SEDRVLTIGE
TGGAILPTGS IVMVNGTEIA PETDGTYIIP LTNDQTIPPI TVQPPADFSG DILEDIPVTV
SALDRDSDSP AASPTEESDT VYFNLFVNPV AGDVEAPHVS TVEDTAVKFM EELVVTDDND
GSEIIDSIVI KAIPDGWKLY DETGAPLMTG NGAEDWTIDT ADITSGAYRN YTILPPGHSS
VDETVDIDVR TTDTQTVNGM PVSDTQTVTL PVLIKVTPKG ERIGTDSDQN GETDDIVGQQ
EIADTDDNAT ADLQMNPSHE YTLGGTEDTW FDLSSDPGDG AFDLKSPWFN EDTEITAYQD
NSETTYALFT PQDSDGAPLI GSQFHYDNGT LNGRTLTYTG VPVEIPVQYL DSLEFKPPAN
YADDDGIEII VNAKTVDVDP DDGAVDVQIT GKVVLTIPSV TAVPDIVSLA VSSPGGDEDT
EIPISIRPQS DDKDGSETFT ITLDNVPDDA ILRYNDVVLT GSPGSETGTT KYEIEEFDET
VSLTIQPGLN SDEDILTDLN VAAVSEEGGS VSPPTSLPLQ VQVWGVADEV TVTAVTPKYA
EVDVDITNEV AFSQVVTGYT MEDDDGSESL TFKLTGLDPQ FDISGGTFLG GSGSDRVWVL
MPDDLATAKI IVPENYSGTV NLGLVPVTTE REGDSQTQAV IPLSVEITPS PEATITTSTT
IPEDALQQVD FSIQYQNGDT DETVTSVWIE AADVDSADFT LYLGSDGVTT LADAAASPAV
TDVVLEGGYY KLTGSAIGNV YAQNEADLHD SYTFDIKYEI TDSTTDGTLN PPDTPETTQS
DAVYELTFTA VTDPITETLG TITLEDSGDG IVSPTSVYVN QNTVITVPVT VTQDDDSAEG
PNGQDDDGSE KLEQFIIDGV PPGVTVVGGT YIGDVYDPDS GDEVNSGRWV IEVNPDQAFT
TIDDGALTFD LEFAVDGIAD QLADINQTIT ITAVSRDENS SIAQSAQAFT LNVAPVGIFD
DSEGQGIADA NEPDNIDIWS QNTAFGAIED TSPTLGQIVD LQISGGNNSS FSFILDDLPP
GTVVTGMTEM LLPSGQNVYT ASGVGGNAGL QNLLSNITIT PPPDWNDNNH EDQFDFTLTL
TTYSPGGESK QETILVEDVP VDPVTDPTTI VIDAADVDED NNVIFTVSFS NSADGDAAID
EYTTMQSPLY LKVDDSDMNP ASGGTLYYEG SPISTQTISG VSGVPDGDYY VIENVEVTDE
LAFTYEPEGN ASGPVDVAAY IRTQEVDAPN TVTNETSTSI AVNPVSDGYD LVVDSATGPE
DTPIEIVLGG TGLRDTDGSE EAVSALLGSV PDGYLVITGT SATNAEEAVN VGDDGTGNST
NSWSIPLNPD GTLPAYIAVV PGANFSGSVE DITLSVYSRE EGLEPVLDTV SFDLEVTPVA
DPIDSDLFIP TTSFGDFGEK IPINLNMVLE DQDGSETVTL TFTGIGPGAE FFDSDENPVA
STYDEGTDTY TLTGVPVYDS SGIYDVNHLT LAQSAEVVQV EVTAFTVDGS DDSSADAVTK
TFTLNISTTS GNDDLLYDGA AGLDFDGKGG EDTVYLHLDQ DIDFDNDTSS LDNVEILDLG
PNGDHQVLSL SVSDVLQSTD ADNELTILGD SGDSVELVGI WASEEIAGFT VYTSGGATLN
IEDTLSVL
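
The protein sequence above is the translation of the gene sequence under translation table 11. The following is a minimal sketch of how that relationship, and a GC percentage, could be reproduced from the space-separated blocks shown on this page. The helper names `clean`, `translate` and `gc_percent` are hypothetical. The codon-to-amino-acid assignments of table 11 are the same as those of the standard code; table 11 differs in which codons may act as start codons, which this sketch does not handle (the gene here starts with ATG, so it makes no difference).

```python
from itertools import product

# Codon table in the conventional T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the last line of the gene sequence, as displayed above.
print(translate("ATGAGTTTTC CTGTTGTACC TCTTTTTCCT"))  # MSFPVVPLFP
print(translate("ATTGAGGATA CTCTTTCAGT TCTATGA"))     # IEDTLSVL
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and GC content.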