Gene Paes_0782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0782 
Symbol 
ID6460448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp828668 
End bp843697 
Gene Length15030 bp 
Protein Length5009 aa 
Translation table11 
GC content57% 
IMG OID642724778 
Productvon Willebrand factor type A 
Protein accessionYP_002015475 
Protein GI194333615 
COG category 
COG ID 
TIGRFAM ID[TIGR01643] YD repeat (two copies)
[TIGR03660] T1SS-143 repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACGA ATTCTACCCC GATAGCAACC GTTGCCGTTA TTAAAGGTCA GGCATGGGCA 
CGTTCCCAGG ATGGGAGCAT GCGCCCATTG ACGGAAGGCG ATGTTCTCTA CGAAAACGAA
GTGATTATTA CCGCCGACGG CAGTCGTGTC GATCTCGAAC TGCCCGATGG TTCCATGTAT
CCCATACAGG GGCCCCTTTT GGCAGAGGTT GTCGCTGATG CGGAATATTC GCGTGATGAT
GCTTCGGACG AAGATGCCGA TCCTGACGAA GAGGGCCAGG AGAATACACC TGAGGGCGTT
GGCGCCGGTG ACGATATCGT CGTTGACTAT TCTGCGCCTG TCATTGAGCG GACCGGATCT
CTCAGCGATG AACCGAGCGG CTATATTCGG GTTTCCAAAG ATCAGAATCT CACGGAATCC
CAGTTTGTCG TAGGCGACAA CAGTTTTGAA ATATCCCCTG TTCTTTCGGT GTCTATTGTT
GGTGGATACA ATGAAGGCAT CGGCGGCCGC GCCGGCGGGT ACGATGCATT TATCGATGGT
CGTGCTACCT ACAATCCGCG CATCATCGAA CCCACCCGGG AGTTCGAAGG TGATGAGTTT
GTCGATGCAT TGAGGATCGA GCCTTTTTTT GGAGGGGATG AGAGAGCACC AGTCATTAAT
ACCGTACCTG AAATAGGTGT TCCGGAAGAT GCAGAGGTTT ATGAAGAGGA TCTTGATGAA
GGTAATGATG ATAATCCGCC CAAAGACTCT TGGATTGACG CGGGACGATC ATTGGCTGTC
ATTCCTGCAG GTGAGGGCCT CGATACATTT TTTGAGAATA ATGTGCCACC ATCTGGCCTG
ACTTCAGCGG GCCAGCCAGT CAACTATTAT GTTTCTCCTG ATGCCCATAC TCTTGTCGGG
TATGTTGGTG ATCTTCCGGT AAGCGGTGTT CCTGCAGAAG GTCAGCAGGT ATTCAAGATT
ACAATCAATA ATCCTGACTC CATATCCGGA GAACAATCAT ACACCTTTGA ACTGAAAGAT
CAGGTCGATC ATCCATCGCT TGATGGACTG CCTGGTGATG ATACTGAAAA TGAACTGATA
CTTTCGTTCA ATTTCACTGT TGAGGATGAA AGTGGGGACC GTGTATCATC GAGTTTCAAT
GTGACGGTTC ATGATGATAT TCCGATTGCA CAGGATGATT CGATAAGTGA TAAAGTATAT
GAAGATCATC TCGATAATTA TGATCCGGAC GATTCGGATG CTGATGGTAT TGAAGGCTCA
CGTGGAAACC CGGGAACCGA TCCTTTGCAA ACTGTCGCTC AAGGCTCACT TGCATCCCTA
TTCAAATCCG GTGCTGATGA AGATGAATCC CAGTCAGAAG ATGGTGCTCT GATATATTCT
GTCAAACACC AGGACGATCT CCCCGACGGT TATATCAGCT ATTTCATAGA AGATGCTAAC
AATCCCGGAG AATTTATCAA AGCCGTTGAT GGTACAGGAA CAGATCTGTC CCTGACATCC
AAGGGATCTG CGATTTCTTA CGATTATGAT GACATCAATG ACCTGATTTC AGGGGTTACG
GTTGATGGTC GTATCGTATT TACGCTGGAT GTCACTGAGG ATGGAGCCTA CACGTTTACG
CTCCTCGACC AGATTGACCA CCCTGCAGCT TCAGGAGATG ATGCTGTGAT TCGCATAGAC
ATCGCTAACC TGATCGAAGC AAGAGACTTT GATTACGATC ATATTTCGCG TGAGACAGGT
TTTACCATTG ATATAGAAAA CGACGTTCCA GAAAACAATA CAGCGACGGA GAGCGGCGCA
GTGCAGGAAG ATGCGCTGCC GACGGGTAAC GAGGACACTC CTGCCGATAC GACGGTAGCC
ACCGGAACGG TGAGCGGCCT GGTGAATGTC GGAGCGGACG AGCCACTGAG TTACAGTGTG
CAGACGACGG GTCTGACGGC TCTTGGTCTG AAGAGCAAGG AAGTCGCGCT GACCTATGCG
GCAAGCGACA CCAACAGCGA TGGAATCGAT GACACGATCA CGGCGACCGG CCCGGATGGA
ACGGTGTTCA CGTTGAAGGT CGAGACGGAT GGGTCGTATA CGTTCACGCT GGCGGATCAG
CTTGATCATG AGGCAGGGTC AGGAGATGAT GCGGAACTGA GCATCGACCT GTCGAGCGCG
GTCGTCGCGA CGGATAAAGA CGGAGACAGC GTGACGGTGG ACAGCGGGTT CACGGTGACG
GTAGAAAACG ACGTTCCAGA AAACAATACA GCGACGGAGA GCGGCGCAGT GCAGGAAGAT
GCGCTGCCGA CGGGTAACGA GGACACTCCT GCCGATACGA CGGTAGCCAC CGGAACGGTG
AGCGGCCTGG TGAATGTCGG AGCGGACGAG CCACTGAGTT ACAGTGTGCA GACGACGGGT
CTGACGGCTC TTGGTCTGAA GAGCAAGGAA GTCGCGCTGA CCTATGCGGC AAGCGACACC
AACAGCGATG GAATCGATGA CACGATCACG GCGACCGGCC CGGATGGAAC GGTGTTCACG
TTGAAGGTCG AGACGGATGG GTCGTATACG TTCACGCTGG CGGATCAGCT TGATCATGAG
GCAGGGTCAG GAGATGATGC GGAACTGAGC ATCGACCTGT CGAGCGCGGT CGTCGCGACG
GATAAAGACG GAGACAGCGT GACGGTGGAC AGCGGGTTCA CGGTGACGGT AGAAAACGAC
GTTCCAGAAA ACAATACAGC GACGGAGAGC GGCGCAGTGC AGGAAGATGC GCTGCCGACG
GGTAACGAGG ACACTCCTGC CGATACGACG GTAGCCACCG GAACGGTGAG CGGCCTGGTG
AATGTCGGAG CGGACGAGCC ACTGAGTTAC AGTGTGCAGA CGACGGGTCT GACGGCTCTT
GGTCTGAAGA GCAAGGAAGT CGCGCTGACC TATGCGGCAA GCGACACCAA CAGCGATGGA
ATCGATGACA CGATCACGGC GACCGGCCCG GATGGAACGG TGTTCACGTT GAAGGTCGAG
ACGGATGGGT CGTATACGTT CACGCTGGCG GATCAGCTTG ATCATGAGGC AGGGTCAGGA
GATGATGCGG AACTGAGCAT CGACCTGTCG AGCGCGGTCG TCGCGACGGA TAAAGACGGA
GACAGCGTGA CGGTGGACAG CGGGTTCACG GTGACGGTAG AAAACGACGT TCCAGAAAAC
AATACAGCGA CGGAGAGCGG CGCAGTGCAG GAAGATGCGC TGCCGACGGG TAACGAGGAC
ACTCCTGCCG ATACGACGGT AGCCACCGGA ACGGTGAGCG GCCTGGTGAA TGTCGGAGCG
GACGAGCCAC TGAGTTACAG TGTGCAGACG ACGGGTCTGA CGGCTCTTGG TCTGAAGAGC
AAGGAAGTCG CGCTGACCTA TGCGGCAAGC GACACCAACA GCGATGGAAT CGATGACACG
ATCACGGCGA CCGGCCCGGA TGGAACGGTG TTCACGTTGA AGGTCGAGAC GGATGGGTCG
TATACGTTCA CGCTGGCGGA TCAGCTTGAT CATGAGGCAG GGTCAGGAGA TGATGCGGAA
CTGAGCATCG ACCTGTCGAG CGCGGTCGTC GCGACGGATA AAGACGGAGA CAGCGTGACG
GTGGACAGCG GGTTCACGGT GACGGTAGAA AACGACGTTC CAGAAAACAA TACAGCGACG
GAGAGCGGCG CAGTGCAGGA AGATGCGCTG CCGACGGGTA ACGAGGACAC TCCTGCCGAT
ACGACGGTAG CCACCGGAAC GGTGAGCGGC CTGGTGAATG TCGGAGCGGA CGAGCCACTG
AGTTACAGTG TGCAGACGAC GGGTCTGACG GCTCTTGGTC TGAAGAGCAA GGAAGTCGCG
CTGACCTATG CGGCAAGCGA CACCAACAGC GATGGAATCG ATGACACGAT CACGGCGACC
GGCCCGGATG GAACGGTGTT CACGTTGAAG GTCGAGACGG ATGGGTCGTA TACGTTCACG
CTGGCGGATC AGCTTGATCA TGAGGCAGGG TCAGGAGATG ATGCGGAACT GAGCATCGAC
CTGTCGAGCG CGGTCGTCGC GACGGATAAA GACGGAGACA GCGTGACGGT GGACAGCGGG
TTCACGGTGA CGGTAGAAAA CGACGTTCCA GAAAACAATA CAGCGACGGA GAGCGGCGCA
GTGCAGGAAG ATGCGCTGCC GACGGGTAAC GAGGACACTC CTGCCGATAC GACGGTAGCC
ACCGGAACGG TGAGCGGCCT GGTGAATGTC GGAGCGGACG AGCCACTGAG TTACAGTGTG
CAGACGACGG GTCTGACGGC TCTTGGTCTG AAGAGCAAGG AAGTCGCGCT GACCTATGCG
GCAAGCGACA CCAACAGCGA TGGAATCGAT GACACGATCA CGGCGACCGG CCCGGATGGA
ACGGTGTTCA CGTTGAAGGT CGAGACGGAT GGGTCGTATA CGTTCACGCT GGCGGATCAG
CTTGATCATG AGGCAGGGTC AGGAGATGAT GCGGAACTGA GCATCGACCT GTCGAGCGCG
GTCGTCGCGA CGGATAAAGA CGGAGACAGC GTGACGGTGG ACAGCGGGTT CACGGTGACG
GTAGAAAACG ACGTTCCAGA AAACAATACA GCGACGGAGA GCGGCGCAGT GCAGGAAGAT
GCGCTGCCGA CGGGTAACGA GGACACTCCT GCCGATACGA CGGTAGCCAC CGGAACGGTG
AGCGGCCTGG TGAATGTCGG AGCGGACGAG CCACTGAGTT ACAGTGTGCA GACGACGGGT
CTGACGGCTC TTGGTCTGAA GAGCAAGGAA GTCGCGCTGA CCTATGCGGC AAGCGACACC
AACAGCGATG GAATCGATGA CACGATCACG GCGACCGGCC CGGATGGAAC GGTGTTCACG
TTGAAGGTCG AGACGGATGG GTCGTATACG TTCACGCTGG CGGATCAGCT TGATCATGAG
GCAGGGTCAG GAGATGATGC GGAACTGAGC ATCGACCTGT CGAGCGCGGT CGTCGCGACG
GATAAAGACG GAGACAGCGT GACGGTGGAC AGCGGGTTCA CGGTGACGGT AGAAAACGAC
GTTCCAGAAA ACAATACAGC GACGGAGAGC GGCGCAGTGC AGGAAGATGC GCTGCCGACG
GGTAACGAGG ACACTCCTGC CGATACGACG GTAGCCACCG GAACGGTGAG CGGCCTGGTG
AATGTCGGAG CGGACGAGCC ACTGAGTTAC AGTGTGCAGA CGACGGGTCT GACGGCTCTT
GGTCTGAAGA GCAAGGAAGT CGCGCTGACC TATGCGGCAA GCGACACCAA CAGCGATGGA
ATCGATGACA CGATCACGGC GACCGGCCCG GATGGAACGG TGTTCACGTT GAAGGTCGAG
ACGGATGGGT CGTATACGTT CACGCTGGCG GATCAGCTTG ATCATGAGGC AGGGTCAGGA
GATGATGCGG AACTGAGCAT CGACCTGTCG AGCGCGGTCG TCGCGACGGA TAAAGACGGA
GACAGCGTGA CGGTGGACAG CGGGTTCACG GTGACGGTAG AAAACGACGT TCCAGAAAAC
AATACAGCGA CGGAGAGCGG CGCAGTGCAG GAAGATGCGC TGCCGACGGG TAACGAGGAC
ACTCCTGCCG ATACGACGGT AGCCACCGGA ACGGTGAGCG GCCTGGTGAA TGTCGGAGCG
GACGAGCCAC TGAGTTACAG TGTGCAGACG ACGGGTCTGA CGGCTCTTGG TCTGAAGAGC
AAGGAAGTCG CGCTGACCTA TGCGGCAAGC GACACCAACA GCGATGGAAT CGATGACACG
ATCACGGCGA CCGGCCCGGA TGGAACGGTG TTCACGTTGA AGGTCGAGAC GGATGGGTCG
TATACGTTCA CGCTGGCGGA TCAGCTTGAT CATGAGGCAG GGTCAGGAGA TGATGCGGAA
CTGAGCATCG ACCTGTCGAG CGCGGTCGTC GCGACGGATA AAGACGGAGA CAGCGTGACG
GTGGACAGCG GGTTCACGGT GACGGTAGAA AACGACGTTC CAGAAAACAA TACAGCGACG
GAGAGCGGCG CAGTGCAGGA AGATGCGCTG CCGACGGGTA ACGAGGACAC TCCTGCCGAT
ACGACGGTAG CCACCGGAAC GGTGAGCGGC CTGGTGAATG TCGGAGCGGA CGAGCCACTG
AGTTACAGTG TGCAGACGAC GGGTCTGACG GCTCTTGGTC TGAAGAGCAA GGAAGTCGCG
CTGACCTATG CGGCAAGCGA CACCAACAGC GATGGAATCG ATGACACGAT CACGGCGACC
GGCCCGGATG GAACGGTGTT CACGTTGAAG GTCGAGACGG ATGGGTCGTA TACGTTCACG
CTGGCGGATC AGCTTGATCA TGAGGCAGGG TCAGGAGATG ATGCGGAACT GAGCATCGAC
CTGTCGAGCG CGGTCGTCGC GACGGATAAA GACGGAGACA GCGTGACGGT GGACAGCGGG
TTCACGGTGA CGGTAGAAAA CGACGTTCCA GAAAACAATA CAGCGACGGA GAGCGGCGCA
GTGCAGGAAG ATGCGCTGCC GACGGGTAAC GAGGACACTC CTGCCGATAC GACGGTAGCC
ACCGGAACGG TGAGCGGCCT GGTGAATGTC GGAGCGGACG AGCCACTGAG TTACAGTGTG
CAGACGACGG GTCTGACGGC TCTTGGTCTG AAGAGCAAGG AAGTCGCGCT GACCTATGCG
GCAAGCGACA CCAACAGCGA TGGAATCGAT GACACGATCA CGGCGACCGG CCCGGATGGA
ACGGTGTTCA CGTTGAAGGT CGAGACGGAT GGGTCGTATA CGTTCACGCT GGCGGATCAG
CTTGATCATG AGGCAGGGTC AGGAGATGAT GCGGAACTGA GCATCGACCT GTCGAGCGCG
GTCGTCGCGA CGGATAAAGA CGGAGACAGC GTGACGGTGG ACAGCGGGTT CACGGTGACG
GTAGAAAACG ACGTTCCAGA AAACAATACA GCGACGGAGA GCGGCGCAGT GCAGGAAGAT
GCGCTGCCGA CGGGTAACGA GGACACTCCT GCCGATACGA CGGTAGCCAC CGGAACGGTG
AGCGGCCTGG TGAATGTCGG AGCGGACGAG CCACTGAGTT ACAGTGTGCA GACGACGGGT
CTGACGGCTC TTGGTCTGAA GAGCAAGGAA GTCGCGCTGA CCTATGCGGC AAGCGACACC
AACAGCGATG GAATCGATGA CACGATCACG GCGACCGGCC CGGATGGAAC GGTGTTCACG
TTGAAGGTCG AGACGGATGG GTCGTATACG TTCACGCTGG CGGATCAGCT TGATCATGAG
GCAGGGTCAG GAGATGATGC GGAACTGAGC ATCGACCTGT CGAGCGCGGT CGTCGCGACG
GATAAAGACG GAGACAGCGT GACGGTGGAC AGCGGGTTCA CGGTGACGGT AGAAAACGAC
GTTCCAGAAA ACAATACAGC GACGGAGAGC GGCGCAGTGC AGGAAGATGC GCTGCCGACG
GGTAACGAGG ACACTCCTGC CGATACGACG GTAGCCACCG GAACGGTGAG CGGCCTGGTG
AATGTCGGAG CGGACGAGCC ACTGAGTTAC AGTGTGCAGA CGACGGGTCT GACGGCTCTT
GGTCTGAAGA GCAAGGAAGT CGCGCTGACC TATGCGGCAA GCGACACCAA CAGCGATGGA
ATCGATGACA CGATCACGGC GACCGGCCCG GATGGAACGG TGTTCACGTT GAAGGTCGAG
ACGGATGGGT CGTATACGTT CACGCTGGCG GATCAGCTTG ATCATGAGGC AGGGTCAGGA
GATGATGCGG AACTGAGCAT CGACCTGTCG AGCGCGGTCG TCGCGACGGA TAAAGACGGA
GACAGCGTGA CGGTGGACAG CGGGTTCACG GTGACGGTAG AAAACGACGT TCCAGAAAAC
AATACAGCGA CGGAGAGCGG CGCAGTGCAG GAAGATGCGC TGCCGACGGG TAACGAGGAC
ACTCCTGCCG ATACGACGGT AGCCACCGGA ACGGTGAGCG GCCTGGTGAA TGTCGGAGCG
GACGAGCCAC TGAGTTACAG TGTGCAGACG ACGGGTCTGA CGGCTCTTGG TCTGAAGAGC
AAGGAAGTCG CGCTGACCTA TGCGGCAAGC GACACCAACA GCGATGGAAT CGATGACACG
ATCACGGCGA CCGGCCCGGA TGGAACGGTG TTCACGTTGA AGGTCGAGAC GGATGGGTCG
TATACGTTCA CGCTGGCGGA TCAGCTTGAT CATGAGGCAG GGTCAGGAGA TGATGCGGAA
CTGAGCATCG ACCTGTCGAG CGCGGTCGTC GCGACGGATA AAGACGGAGA CAGCGTGACG
GTGGACAGCG GGTTCACGGT GACGGTAGAA AACGACGTTC CAGAAAACAA TACAGCGACG
GAGAGCGGCG CAGTGCAGGA AGATGCGCTG CCGACGGGTA ACGAGGACAC TCCTGCCGAT
ACGACGGTAG CCACCGGAAC GGTGAGCGGC CTGGTGAATG TCGGAGCGGA CGAGCCACTG
AGTTACAGTG TGCAGACGAC GGGTCTGACG GCTCTTGGTC TGAAGAGCAA GGAAGTCGCG
CTGACCTATG CGGCAAGCGA CACCAACAGC GATGGAATCG ATGACACGAT CACGGCGACC
GGCCCGGATG GAACGGTGTT CACGTTGAAG GTCGAGACGG ATGGGTCGTA TACGTTCACG
CTGGCGGATC AGCTTGATCA TGAGGCAGGG TCAGGAGATG ATGCGGAACT GAGCATCGAC
CTGTCGAGCG CGGTCGTCGC GACGGATAAA GACGGAGACA GCGTGACGGT GGACAGCGGG
TTCACGGTGA CGGTAGAAAA CGACGTTCCA GAAAACAATA CAGCGACGGA GAGCGGCGCA
GTGCAGGAAG ATGCGCTGCC GACGGGTAAC GAGGACACTC CTGCCGATAC GACGGTAGCC
ACCGGAACGG TGAGCGGCCT GGTGAATGTC GGAGCGGACG AGCCACTGAG TTACAGTGTG
CAGACGACGG GTCTGACGGC TCTTGGTCTG AAGAGCAAGG AAGTCGCGCT GACCTATGCG
GCAAGCGACA CCAACAGCGA TGGAATCGAT GACACGATCA CGGCGACCGG CCCGGATGGA
ACGGTGTTCA CGTTGAAGGT CGAGACGGAT GGGTCGTATA CGTTCACGCT GGCGGATCAG
CTTGATCATG AGGCAGGGTC AGGAGATGAT GCGGAACTGA GCATCGACCT GTCGAGCGCG
GTCGTCGCGA CGGATAAAGA CGGAGACAGC GTGACGGTGG ACAGCGGGTT CACGGTGACG
GTAGAAAACG ACGTTCCAGA AAACAATACA GCGACGGAGA GCGGCGCAGT GCAGGAAGAT
GCGCTGCCGA CGGGTAACGA GGACACTCCT GCCGATACGA CGGTAGCCAC CGGAACGGTG
AGCGGCCTGG TGAATGTCGG AGCGGACGAG CCACTGAGTT ACAGTGTGCA GACGACGGGT
CTGACGGCTC TTGGTCTGAA GAGCAAGGAA GTCGCGCTGA CCTATGCGGC AAGCGACACC
AACAGCGATG GAATCGATGA CACGATCACG GCGACCGGCC CGGATGGAAC GGTGTTCACG
TTGAAGGTCG AGACGGATGG GTCGTATACG TTCACGCTGG CGGATCAGCT TGATCATGAG
GCAGGGTCAG GAGATGATGC GGAACTGAGC ATCGACCTGT CGAGCGCGGT CGTCGCGACG
GATAAAGACG GAGACAGCGT GACGGTGGAC AGCGGGTTCA CGGTGACGGT AGAAAACGAC
GTTCCAGAAA ACAATACAGC GACGGAGAGC GGCGCAGTGC AGGAAGATGC GCTGCCGACG
GGTAACGAGG ACACTCCTGC CGATACGACG GTAGCCACCG GAACGGTGAG CGGCCTGGTG
AATGTCGGAG CGGACGAGCC ACTGAGTTAC AGTGTGCAGA CGACGGGTCT GACGGCTCTT
GGTCTGAAGA GCAAGGAAGT CGCGCTGACC TATGCGGCAA GCGACACCAA CAGCGATGGA
ATCGATGACA CGATCACGGC GACCGGCCCG GATGGAACGG TGTTCACGTT GAAGGTCGAG
ACGGATGGGT CGTATACGTT CACGCTGGCG GATCAGCTTG ATCATGAGGC AGGGTCAGGA
GATGATGCGG AACTGAGCAT CGACCTGTCG AGCGCGGTCG TCGCGACGGA TAAAGACGGA
GACAGCGTGA CGGTGGACAG CGGGTTCACG GTGACGGTAG AAAACGACGT TCCAGAAAAC
AATACAGCGA CGGAGAGCGG CGCAGTGCAG GAAGATGCGC TGCCGACGGG TAACGAGGAC
ACTCCTGCCG ATACGACGGT AGCCACCGGA ACGGTGAGCG GCCTGGTGAA TGTCGGAGCG
GACGAGCCAC TGAGTTACAG TGTGCAGACG ACGGGTCTGA CGGCTCTTGG TCTGAAGAGC
AAGGAAGTCG CGCTGACCTA TGCGGCAAGC GACACCAACA GCGATGGAAT CGATGACACG
ATCACGGCGA CCGGCCCGGA TGGAACGGTG TTCACGTTGA AGGTCGAGAC GGATGGGTCG
TATACGTTCA CGCTGGCGGA TCAGCTTGAT CATGAGGCAG GGTCAGGAGA TGATGCGGAA
CTGAGCATCG ACCTGTCGAG CGCGGTCGTC GCGACGGATA AAGACGGAGA CAGCGTGACG
GTGGACAGCG GGTTCACGGT GACGGTAGAA AACGACGTTC CAGAAAACAA TACAGCGACG
GAGAGCGGCG CAGTGCAGGA AGATGCGCTG CCGACGGGTA ACGAGGACAC TCCTGCCGAT
ACGACGGTAG CCACCGGAAC GGTGAGCGGC CTGGTGAATG TCGGAGCGGA CGAGCCACTG
AGTTACAGTG TGCAGACGAC GGGTCTGACG GCTCTTGGTC TGAAGAGCAA GGAAGTCGCG
CTGACCTATG CGGCAAGCGA CACCAACAGC GATGGAATCG ATGACACGAT CACGGCGACC
GGCCCGGATG GAACGGTGTT CACGTTGAAG GTCGAGACGG ATGGGTCGTA TACGTTCACG
CTGGCGGATC AGCTTGATCA TGAGGCAGGG TCAGGAGATG ATGCGGAACT GAGCATCGAC
CTGTCGAGCG CGGTCGTCGC GACGGATAAA GACGGAGACA GCGTGACGGT GGACAGCGGG
TTCACGGTGA CGGTAGAAAA CGACGTTCCA GAAAACAATA CAGCGACGGA GAGCGGCGCA
GTGCAGGAAG ATGCGCTGCC GACGGGTAAC GAGGACACTC CTGCCGATAC GACGGTAGCC
ACCGGAACGG TGAGCGGCCT GGTGAATGTC GGAGCGGACG AGCCACTGAG TTACAGTGTG
CAGACGACGG GTCTGACGGC TCTTGGTCTG AAGAGCAAGG AAGTCGCGCT GACCTATGCG
GCAAGCGACA CCAACAGCGA TGGAATCGAT GACACGATCA CGGCGACCGG CCCGGATGGA
ACGGTGTTCA CGTTGAAGGT CGAGACGGAT GGGTCGTATA CGTTCACGCT GGCGGATCAG
CTTGATCATG AGGCAGGGTC AGGAGATGAT GCGGAACTGA GCATCGACCT GTCGAGCGCG
GTCGTCGCGA CGGATAAAGA CGGAGACAGC GTGACGGTGG ACAGCGGGTT CACGGTGACG
GTAGAAAACG ACGTTCCAGA AAACAATACA GCGACGGAGA GCGGCGCAGT GCAGGAAGAT
GCGCTGCCGA CGGGTAACGA GGACACTCCT GCCGATACGA CGGTAGCCAC CGGAACGGTG
AGCGGCCTGG TGAATGTCGG AGCGGACGAG CCACTGAGTT ACAGTGTGCA GACGACGGGT
CTGACGGCTC TTGGTCTGAA GAGCAAGGAA GTCGCGCTGA CCTATGCGGC AAGCGACACC
AACAGCGATG GAATCGATGA CACGATCACG GCGACCGGCC CGGATGGAAC GGTGTTCACG
TTGAAGGTCG AGACGGATGG GTCGTATACG TTCACGCTGG CGGATCAGCT TGATCATGAG
GCAGGGTCAG GAGATGATGC GGAACTGAGC ATCGACCTGT CGAGCGCGGT CGTCGCGACG
GATAAAGACG GAGACAGCGT GACGGTGGAC AGCGGGTTCA CGGTGACGGT AGAAAACGAC
GTTCCAGAAA ACAATACAGC GACGGAGAGC GGCGCAGTGC AGGAAGATGC GCTGCCGACG
GGTAACGAGG ACACTCCTGC CGATACGACG GTAGCCACCG GAACGGTGAG CGGCCTGGTG
AATGTCGGAG CGGACGAGCC ACTGAGTTAC AGTGTGCAGA CGACGGGTCT GACGGCTCTT
GGTCTGAAGA GCAAGGAAGT CGCGCTGACC TATGCGGCAA GCGACACCAA CAGCGATGGA
ATCGATGACA CGATCACGGC GACCGGCCCG GATGGAACGG TGTTCACGTT GAAGGTCGAG
ACGGATGGGT CGTATACGTT CACGCTGGCG GATCAGCTTG ATCATGAGGC AGGGTCAGGA
GATGATGCGG AACTGAGCAT CGACCTGTCG AGCGCGGTCG TCGCGACGGA TAAAGACGGA
GACAGCGTGA CGGTGGACAG CGGGTTCACG GTGACGGTAG AAAACGACGT TCCAGAAAAC
AATACAGCGA CGGAGAGCGG CGCAGTGCAG GAAGATGCGC TGCCGACGGG TAACGAGGAC
ACTCCTGCCG ATACGACGGT AGCCACCGGA ACGGTGAGCG GCCTGGTGAA TGTCGGAGCG
GACGAGCCAC TGAGTTACAG TGTGCAGACG ACGGGTCTGA CGGCTCTTGG TCTGAAGAGC
AAGGAAGTCG CGCTGACCTA TGCGGCAAGC GACACCAACA GCGATGGAAT CGATGACACG
ATCACGGCGA CCGGCCCGGA TGGAACGGTG TTCACGTTGA AGGTCGAGAC GGATGGGTCG
TATACGTTCA CGCTGGCGGA TCAGCTTGAT CATGAGGCAG GGTCAGGAGA TGATGCGGAA
CTGAGCATCG ACCTGTCGAG CGCGGTCGTC GCGACGGATA AAGACGGAGA CAGCGTGACG
GTGGACAGCG GGTTCACGGT GACGGTAGAA AACGACGTTC CAGAAAACAA TACAGCGACG
GAGAGCGGCG CAGTGCAGGA AGATGCGCTG CCGACGGGTA ACGAGGACAC TCCTGCCGAT
ACGACGGTAG CCACCGGAAC GGTGAGCGGC CTGGTGAATG TCGGAGCGGA CGAGCCACTG
AGTTACAGTG TGCAGACGAC GGGTCTGACG GCTCTTGGTC TGAAGAGCAA GGAAGTCGCG
CTGACCTATG CGGCAAGCGA CACCAACAGC GATGGAATCG ATGACACGAT CACGGCGACC
GGCCCGGATG GAACGGTGTT CACGTTGAAG GTCGAGACGG ATGGGTCGTA TACGTTCACG
CTGGCGGATC AGCTTGATCA TACAGGTGGA GGACTGTCAG GCAACGGCGA CGACCAGATC
AAGACGCTCG ATTTTTCAAG CGTACTGGTG GCTACAGACA GTGATGGAGA TAGTGTAACG
GTCGATAATG GGTTTACGAT TACGGTCCAG GACGATGTGC CGTCGGCTAT TCCTGTAACT
GAATCAGCTA CTGCTACTCC GATTGATACG AATATCATGC TCATCATGGA TACTTCTGGT
AGTATGGATT GGCCTTCCGG TATACCGGGT TATACACGCT TGCAGGCAAC TGTGGCTGCA
GCGCGACAGC TTGTCGATAA ATATGAGGCA TTGGGTGATG TGAGGGTAAA CATCGTAGAG
TTTGACACAG ACGGAAATAA ATGGACTACA GGATGGGTTG ACGGAGCTAC GGCGGACAGC
AGACTGACAG CGTTGCTGAC CCAGGGCGGT GGCTCTACAA ACTTTGATGA CGCGTTGCTT
ACAGCAATGG ATGCCTGGGA CGATACGACC GGCCTGACGC AAATACCTGA TGCCCAGAAT
GTTTCCTATT TTCTTTCTGA TGGAGATCCT ACCGCAAGAA CTCGATGGTC CTCGGGTTGG
CCAAACCAGA ATGGTATTCA AACGCAAGAA CAGAACTATT GGGAGAACTG GTTAGAAAGC
AACGAAATTA CCTCATACGC TTTCGGTCTC GGGACTCAGG TGACCGAAGA TAATCTTGAC
CCAATCGCAT ACGATGGTGA TCCGCCACAG GATCCTGCGC TGGATGAAGG CCTACAGATT
TCTTTTTCAG ACCTTGATTC AGTCCTGTCA GCAACGATAC CTCAGGATGA ACTAACAGGG
TCACTCAGTC TTAACCTGGG TGCCGATGAC GGTGGCTATG TCTCTTCAGT TACTATCGAG
GGAACAGTGT ATACCTATGA CTCGGCAAAC TCGATAATTA CCCATGTTAC AACCGAGGGA
GGCAGTATTA CTATCGATAT GGATACCGGG GATTACGAAT ACACGGTGCC TGCTATTTTC
AGTTCTCCTT TTGATGAAAT TATTGATTTC ACGCTGGTCG ATGCTGACGG AGATACCAAC
AGTTCATCGT TGACGATCAC CAACTATCCG TTGCCTTCCC CTATCAGTGA AACCTGGACA
GGAGATGACA ATGATAATAC TCAGAGTTAC GAGTCATTAC CAGCTGGTGA AAATGCTTAC
CTGGATGGTC AGGGTGGTGA TGATACCTTG ACTGGCTCAT CTGGAACTGA CATCCTCAAA
GGCGGTTATG GTGACGATCT GCTTGATGGA GGGGACGGAG CTGATGTCGT ACTCGGGTAT
CAGGGGGCTG ATACTCTGGT GTTTGATCCT GATGATACTG TCATTGACGG AGGAAATGGT
GGCGGTAATG ATACCTTGAT CCTTTCGGAT GATGATGATA TCGACTTCGG GGAGAGCGGG
TTTATTAACC CCGCCATCAA TATTGAGGTT ATTGATCTTA CTGATGGCGA TCACTCCCTT
ACCAACCTCT CTGCGCAGGA TGTTCTCGAT ATGACTGATG GTGATAATGA GCTGTATATC
CTTGGTGATA GCGGAGATAG TGTTTCAGGG ACAGGTTGGC AGGCTGACGG CAGTGATGGT
GACTACTATG TCTATGTCAA TACAATCCTT GGCGTTTCGT TGTATGTTCA GGATGATATT
GGAACCAGCA ATATCAATCT TACGACATGA
 
Protein sequence
MNTNSTPIAT VAVIKGQAWA RSQDGSMRPL TEGDVLYENE VIITADGSRV DLELPDGSMY 
PIQGPLLAEV VADAEYSRDD ASDEDADPDE EGQENTPEGV GAGDDIVVDY SAPVIERTGS
LSDEPSGYIR VSKDQNLTES QFVVGDNSFE ISPVLSVSIV GGYNEGIGGR AGGYDAFIDG
RATYNPRIIE PTREFEGDEF VDALRIEPFF GGDERAPVIN TVPEIGVPED AEVYEEDLDE
GNDDNPPKDS WIDAGRSLAV IPAGEGLDTF FENNVPPSGL TSAGQPVNYY VSPDAHTLVG
YVGDLPVSGV PAEGQQVFKI TINNPDSISG EQSYTFELKD QVDHPSLDGL PGDDTENELI
LSFNFTVEDE SGDRVSSSFN VTVHDDIPIA QDDSISDKVY EDHLDNYDPD DSDADGIEGS
RGNPGTDPLQ TVAQGSLASL FKSGADEDES QSEDGALIYS VKHQDDLPDG YISYFIEDAN
NPGEFIKAVD GTGTDLSLTS KGSAISYDYD DINDLISGVT VDGRIVFTLD VTEDGAYTFT
LLDQIDHPAA SGDDAVIRID IANLIEARDF DYDHISRETG FTIDIENDVP ENNTATESGA
VQEDALPTGN EDTPADTTVA TGTVSGLVNV GADEPLSYSV QTTGLTALGL KSKEVALTYA
ASDTNSDGID DTITATGPDG TVFTLKVETD GSYTFTLADQ LDHEAGSGDD AELSIDLSSA
VVATDKDGDS VTVDSGFTVT VENDVPENNT ATESGAVQED ALPTGNEDTP ADTTVATGTV
SGLVNVGADE PLSYSVQTTG LTALGLKSKE VALTYAASDT NSDGIDDTIT ATGPDGTVFT
LKVETDGSYT FTLADQLDHE AGSGDDAELS IDLSSAVVAT DKDGDSVTVD SGFTVTVEND
VPENNTATES GAVQEDALPT GNEDTPADTT VATGTVSGLV NVGADEPLSY SVQTTGLTAL
GLKSKEVALT YAASDTNSDG IDDTITATGP DGTVFTLKVE TDGSYTFTLA DQLDHEAGSG
DDAELSIDLS SAVVATDKDG DSVTVDSGFT VTVENDVPEN NTATESGAVQ EDALPTGNED
TPADTTVATG TVSGLVNVGA DEPLSYSVQT TGLTALGLKS KEVALTYAAS DTNSDGIDDT
ITATGPDGTV FTLKVETDGS YTFTLADQLD HEAGSGDDAE LSIDLSSAVV ATDKDGDSVT
VDSGFTVTVE NDVPENNTAT ESGAVQEDAL PTGNEDTPAD TTVATGTVSG LVNVGADEPL
SYSVQTTGLT ALGLKSKEVA LTYAASDTNS DGIDDTITAT GPDGTVFTLK VETDGSYTFT
LADQLDHEAG SGDDAELSID LSSAVVATDK DGDSVTVDSG FTVTVENDVP ENNTATESGA
VQEDALPTGN EDTPADTTVA TGTVSGLVNV GADEPLSYSV QTTGLTALGL KSKEVALTYA
ASDTNSDGID DTITATGPDG TVFTLKVETD GSYTFTLADQ LDHEAGSGDD AELSIDLSSA
VVATDKDGDS VTVDSGFTVT VENDVPENNT ATESGAVQED ALPTGNEDTP ADTTVATGTV
SGLVNVGADE PLSYSVQTTG LTALGLKSKE VALTYAASDT NSDGIDDTIT ATGPDGTVFT
LKVETDGSYT FTLADQLDHE AGSGDDAELS IDLSSAVVAT DKDGDSVTVD SGFTVTVEND
VPENNTATES GAVQEDALPT GNEDTPADTT VATGTVSGLV NVGADEPLSY SVQTTGLTAL
GLKSKEVALT YAASDTNSDG IDDTITATGP DGTVFTLKVE TDGSYTFTLA DQLDHEAGSG
DDAELSIDLS SAVVATDKDG DSVTVDSGFT VTVENDVPEN NTATESGAVQ EDALPTGNED
TPADTTVATG TVSGLVNVGA DEPLSYSVQT TGLTALGLKS KEVALTYAAS DTNSDGIDDT
ITATGPDGTV FTLKVETDGS YTFTLADQLD HEAGSGDDAE LSIDLSSAVV ATDKDGDSVT
VDSGFTVTVE NDVPENNTAT ESGAVQEDAL PTGNEDTPAD TTVATGTVSG LVNVGADEPL
SYSVQTTGLT ALGLKSKEVA LTYAASDTNS DGIDDTITAT GPDGTVFTLK VETDGSYTFT
LADQLDHEAG SGDDAELSID LSSAVVATDK DGDSVTVDSG FTVTVENDVP ENNTATESGA
VQEDALPTGN EDTPADTTVA TGTVSGLVNV GADEPLSYSV QTTGLTALGL KSKEVALTYA
ASDTNSDGID DTITATGPDG TVFTLKVETD GSYTFTLADQ LDHEAGSGDD AELSIDLSSA
VVATDKDGDS VTVDSGFTVT VENDVPENNT ATESGAVQED ALPTGNEDTP ADTTVATGTV
SGLVNVGADE PLSYSVQTTG LTALGLKSKE VALTYAASDT NSDGIDDTIT ATGPDGTVFT
LKVETDGSYT FTLADQLDHE AGSGDDAELS IDLSSAVVAT DKDGDSVTVD SGFTVTVEND
VPENNTATES GAVQEDALPT GNEDTPADTT VATGTVSGLV NVGADEPLSY SVQTTGLTAL
GLKSKEVALT YAASDTNSDG IDDTITATGP DGTVFTLKVE TDGSYTFTLA DQLDHEAGSG
DDAELSIDLS SAVVATDKDG DSVTVDSGFT VTVENDVPEN NTATESGAVQ EDALPTGNED
TPADTTVATG TVSGLVNVGA DEPLSYSVQT TGLTALGLKS KEVALTYAAS DTNSDGIDDT
ITATGPDGTV FTLKVETDGS YTFTLADQLD HEAGSGDDAE LSIDLSSAVV ATDKDGDSVT
VDSGFTVTVE NDVPENNTAT ESGAVQEDAL PTGNEDTPAD TTVATGTVSG LVNVGADEPL
SYSVQTTGLT ALGLKSKEVA LTYAASDTNS DGIDDTITAT GPDGTVFTLK VETDGSYTFT
LADQLDHEAG SGDDAELSID LSSAVVATDK DGDSVTVDSG FTVTVENDVP ENNTATESGA
VQEDALPTGN EDTPADTTVA TGTVSGLVNV GADEPLSYSV QTTGLTALGL KSKEVALTYA
ASDTNSDGID DTITATGPDG TVFTLKVETD GSYTFTLADQ LDHEAGSGDD AELSIDLSSA
VVATDKDGDS VTVDSGFTVT VENDVPENNT ATESGAVQED ALPTGNEDTP ADTTVATGTV
SGLVNVGADE PLSYSVQTTG LTALGLKSKE VALTYAASDT NSDGIDDTIT ATGPDGTVFT
LKVETDGSYT FTLADQLDHE AGSGDDAELS IDLSSAVVAT DKDGDSVTVD SGFTVTVEND
VPENNTATES GAVQEDALPT GNEDTPADTT VATGTVSGLV NVGADEPLSY SVQTTGLTAL
GLKSKEVALT YAASDTNSDG IDDTITATGP DGTVFTLKVE TDGSYTFTLA DQLDHEAGSG
DDAELSIDLS SAVVATDKDG DSVTVDSGFT VTVENDVPEN NTATESGAVQ EDALPTGNED
TPADTTVATG TVSGLVNVGA DEPLSYSVQT TGLTALGLKS KEVALTYAAS DTNSDGIDDT
ITATGPDGTV FTLKVETDGS YTFTLADQLD HEAGSGDDAE LSIDLSSAVV ATDKDGDSVT
VDSGFTVTVE NDVPENNTAT ESGAVQEDAL PTGNEDTPAD TTVATGTVSG LVNVGADEPL
SYSVQTTGLT ALGLKSKEVA LTYAASDTNS DGIDDTITAT GPDGTVFTLK VETDGSYTFT
LADQLDHEAG SGDDAELSID LSSAVVATDK DGDSVTVDSG FTVTVENDVP ENNTATESGA
VQEDALPTGN EDTPADTTVA TGTVSGLVNV GADEPLSYSV QTTGLTALGL KSKEVALTYA
ASDTNSDGID DTITATGPDG TVFTLKVETD GSYTFTLADQ LDHEAGSGDD AELSIDLSSA
VVATDKDGDS VTVDSGFTVT VENDVPENNT ATESGAVQED ALPTGNEDTP ADTTVATGTV
SGLVNVGADE PLSYSVQTTG LTALGLKSKE VALTYAASDT NSDGIDDTIT ATGPDGTVFT
LKVETDGSYT FTLADQLDHE AGSGDDAELS IDLSSAVVAT DKDGDSVTVD SGFTVTVEND
VPENNTATES GAVQEDALPT GNEDTPADTT VATGTVSGLV NVGADEPLSY SVQTTGLTAL
GLKSKEVALT YAASDTNSDG IDDTITATGP DGTVFTLKVE TDGSYTFTLA DQLDHEAGSG
DDAELSIDLS SAVVATDKDG DSVTVDSGFT VTVENDVPEN NTATESGAVQ EDALPTGNED
TPADTTVATG TVSGLVNVGA DEPLSYSVQT TGLTALGLKS KEVALTYAAS DTNSDGIDDT
ITATGPDGTV FTLKVETDGS YTFTLADQLD HEAGSGDDAE LSIDLSSAVV ATDKDGDSVT
VDSGFTVTVE NDVPENNTAT ESGAVQEDAL PTGNEDTPAD TTVATGTVSG LVNVGADEPL
SYSVQTTGLT ALGLKSKEVA LTYAASDTNS DGIDDTITAT GPDGTVFTLK VETDGSYTFT
LADQLDHTGG GLSGNGDDQI KTLDFSSVLV ATDSDGDSVT VDNGFTITVQ DDVPSAIPVT
ESATATPIDT NIMLIMDTSG SMDWPSGIPG YTRLQATVAA ARQLVDKYEA LGDVRVNIVE
FDTDGNKWTT GWVDGATADS RLTALLTQGG GSTNFDDALL TAMDAWDDTT GLTQIPDAQN
VSYFLSDGDP TARTRWSSGW PNQNGIQTQE QNYWENWLES NEITSYAFGL GTQVTEDNLD
PIAYDGDPPQ DPALDEGLQI SFSDLDSVLS ATIPQDELTG SLSLNLGADD GGYVSSVTIE
GTVYTYDSAN SIITHVTTEG GSITIDMDTG DYEYTVPAIF SSPFDEIIDF TLVDADGDTN
SSSLTITNYP LPSPISETWT GDDNDNTQSY ESLPAGENAY LDGQGGDDTL TGSSGTDILK
GGYGDDLLDG GDGADVVLGY QGADTLVFDP DDTVIDGGNG GGNDTLILSD DDDIDFGESG
FINPAINIEV IDLTDGDHSL TNLSAQDVLD MTDGDNELYI LGDSGDSVSG TGWQADGSDG
DYYVYVNTIL GVSLYVQDDI GTSNINLTT