Gene Sare_4829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4829 
Symbol 
ID5707734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5460973 
End bp5476530 
Gene Length15558 bp 
Protein Length5185 aa 
Translation table11 
GC content64% 
IMG OID641274225 
Producthypothetical protein 
Protein accessionYP_001539570 
Protein GI159040317 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.27186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGACT TTCATATTCC GAATGATCCT GAGTGGCAGT GGGCCTATGA CGCCATCCTC 
TACACCACCG GCGGTGAGTT CCCCGAGGCC GATCCCACGG CACTGCGGGC CATGGGGGAT
GAGCTCTACG CCTTCACCAA CACTCTGCTG AACGGGGTCG CGTCCACGTC CAACCTGGGC
AACGGGCTCT CCGCGCATCT GGACGGGCCC GCCGCGGACG CGTTCAACCA GTTCCGGGGC
GGCATCGCGA ACAACGTCCC GACCGCCGGG AACATCTCGT GGGCCCTCGG GAATGCGGCG
TACGAGTTTG CCTTGGACTC GGAATCCACT CAGTACAACA TCGTCATTGC CGCGTTCACC
CAGGTGGTGG AGATCGCCTT CGCCATCGCC TCCGGTTTCG GCGCGGCGGC TGTCCCCGCA
CTGATCAAGA TCGGGCAGGA GATTGTCAAG ACCCTGATCG AATTCCTCCG CATGCGACTA
CAAAACCAGC TGCTGAGGCT GGTCTGGGAA GGTATCGAGG AAGGGCTCGA GGAACTCTGG
CAGGGCATGG CCGCCCAGCT CACGCAAATG GCCGAGGGTA ACCGTAAAGA TTTCGACTAC
CAGGACCTTG CCCTTGCCTT CGCGGGCGGA TTCTTCATCG GCATGGGCGT CTCCGGCATG
CACGCGATCG GCGGAAAGCT CTACCCGAAG ATCAACAAGA ACACCTACAC CCGCGAGACG
TTGTCCGCAC TCGCGGAAAC TCTCTTCGAG GGCCTGTTCA GCATGATGGT CGGGGGCGGC
TTCAACCCAT TTGCCACGAT GACCTCGTCG ATCATCGGCG GCCTCGCACA CCAATACGCA
CACGACTTCG GCAACCAGTT CGGCGTCGAC CCCAACAACC TGCCTAACCG CCCACCGCCC
GACCCTAAAA AGGGTTCAAG CAGGGGTAAT CCGCCCCCGA CCTCGGTTGA GGGGCCTCCA
CCACCCGCAT ACCACGACAT CGGCGACAGC CCTCCGTCGT ACGACCAGGC TGCCGGAAGT
CCACCACCGC ACAGCGAGAC GCAGGGAGAC CCCCCTCCAT ACAACGAAGC CACCGGCCCT
GCGAACGCCG CGACGGGCAC ACCGTCCTCC GTCGGCAGTT CCCCGGTTCC CGGCAACCCC
GTGACGAACG GTAGTCACAA CTCCCAACCT TTGGACACCG GACGCGATGG CGCCCCGACC
CCAGCCACCG CTCCGGCATT CGCCAATCGA CCCGACCTCA GCACCGTCAA CGCGCCAGCT
CCCAACGGTT CAGTCGCGCC TGCTCCTGGC ACTGCCCCGG CATCGGCGGG CGCACTCCGA
CCCGGATTCG CAGGCGTAAG CCACTTCGGG CCCTCAAGCG TTCAGGTGCC CACGCCAACC
ACAGGCTCAG CCGGCTCGAT CGCCGCGAAT CCTGCCGCCA CACCCGCCGC GCCCCCTCCA
GCCACATCGC CGGCGGAGGA AATCGTACCT ATCGCGACGC CTGCTGGAAC CGACGCGACC
TTGCCCGCGA CTGCCGGTGG CCTACCCGGC TTCGAGTCCA CGAGCCCCGC GGCATCTCCC
CTTACCGACC CAACAGCCCC GGCCCCGGTC GTCGCCGAGA CGACGGCACC TGTGTTAACT
CAGGCAGCGG ATCCTGCTAC TTCGCCTGTG GGAGACCCGG CTGGGCAGGC GCCCACGGTT
GCGATGCCAG TGACAGCGGT TCCGGCCGTC GAGGTGCCTC GCACCGGAAG CCCCAACGCC
GAGTCCCCGG TGGTTCAGGC GCCAACGGGC CAACCGTCCG CCTTTCAACC GCCCACCACA
CAGTTGCCCA GTCCACAGAC ATCTGGTGGG TCAACACCGG GTGCCCAGGG AAGCAACACG
CCGTTCTCAT CCGCGTCCCC GCCCCCACAA GTCACCAATC CGGTCGCTCC CGCACCGGCG
ATGGTCGAGC AGGCTTCGGC TGTGCCTGAC CCTGCTGTCG AGCCGGCGGT CGCGTCGGAG
AGTGGGTTTA CGGGCCAGGC CCTCGACGCG CAGCAGACCC TGGTGACGAG CAACCCGCAC
CACGTATCCA CGACGTCCAC CACTACCCAG CCCTCCCTCA CCAGCGGGAT CGTGCAGTCA
CCACCGTCGA GTGATACCGC GCTTCGCAAC GACTCCACGC GCGACACCGC GACACGCACC
GAGAGCGCAG AGGACCGCGA TTCACGTTTC CAGGCCGATG GTCCACGGAG GGCTAGTGGT
CAGCACGATC CGGCAGTGAC TCCGGCACCG ACCGCTGAGG TCGCGCCAGC CACCAGCCGC
GGACAAACCG ATATCGTCAT CGACTTACCA TCGGGTTCGT CAACCGATCC TGGCGCCGAA
AACGGCCCGA GGACCACAGC CGCTTCGATC GGGCCGACAG CCGATGGCAC GGAGCCCCCG
CCGGTCACCC CGCAGGCCGG GCCCACGATC CCCACCCCGC CGGGTGCTGC CGATGCTGTG
CTGCGAACTG AACCGGCTGC CGCTTCGACC TCCAACCTGG AGGCTACGGC ACCGACATCG
CCATCCGCAC CGGTGTCAGC CAATGAGTCG GATCAGGGTC AGTCTACGAC CGAAATTTTC
GATGGTCTTG CGCTGGGCGG CACGGGCAGT CAGGTGGTTC CCGGGCCAGC GCCTGCCCTA
CTCGCCGATG TTGTACCTCG TGATCAGTGG TGGCGGCTCT ACCTCGATCC CAGGCACCAT
GATGCAGCAC AGGCCGCGTA CCCCAACGAC CCCGGTTCCT ACTACGATAA CGATCAATCG
CCGGGCTTTC AGAGAGGCAT GACCGAAGCC TACGAACAGT TCCTCAATAA TTCTGGCATC
AATAGCAACA CCCTTAACGC TGATACCTAC AAACAAATAC ACGAAACAGT CACTCGATAT
CTCAATAAGC AGTTCGACTG GACCGGCGGC ACCACACGAC AAGGAGTCAT TAAACCCACA
TCGTTCCCTG TTCGTAACCC CGATATCAGC TCCGACGTCC TCGCCGAACG TATCGGCGAA
CGCCCTCTCA TCAAAGACTT CCACGAAGCT CTCCGGCAAC GTGGCGGCCC CAAACCAGTC
ACCTTCACCA GCACCATCAT GGGACGCGGT GGCATCAGCG CCACCACTAT GCCCCGTACT
ATCACCACCA ACTACTCCCG CACCGACGCA CCCGCCCTAA TCAACACCAT TTTCGACCAA
TACCGTCACG AACGACAGAA CGCCACGACC CCCGACGCGA CCCTCGCCGC CATTGCCCGC
ACCATCCGTA AACTGCATAT CACCCACCCC TTCGAAGACG GCAATCGCCG CCTCAACGTC
CACGTCATCC TCCCCAAACT ACTGCTCGAC AACGGCTTCC AACCCGTCAT CGCACCCGAC
ATGGACCAAC TGTTCCAAGG CGGCCGTTCC ATTGACGACA TCGTCCTTGT CCTGCGAAAC
ACCCAACCCG CCACCGACGG CATACGCCTC GGCACTGAAC CAAACCCGCA AGCGAATGTC
GTCCCTGTCG CGCAGATTCC GACCGTTGTC GATGTTCCCG CCATCACGAG TGTCGCGGCT
CAGGCAGACA CCACTGCGCT GGCGGCCTCG TCGGATGGTC ACCGGCACGA CAGTTCGTCG
GTCGTGCCCC TCGCCACGGT TGTCAGGGGT GCGGACGGGG TTCTGCGTGC GTCTGGCAAG
CCGCTGGTCA TCCGGCCGTT GACAGGTGAC CGGCCTGGTA TCGCGCTGCT TGACGACGGT
GACTGGAACA GGGTCAGCAC CCGAACCCTC CTGCCGCCGG ACACACATGC CGCCGCACAG
GACTTTAAGG GAGACCCGGC GACCGCGGTC ATCGTGCACG GCGTCGGTGA CGTCGTGCAC
GCCCCGGTCC GCAACCCGGA TGGCTCCACC ACCACCGTCG CGCTGGGTCC GGACGGCCTC
GCCCAAGTCA TTGGCCGGCC CGGCGCACCC ATCGTCCTGG TCTCTTGCGA GACCGGTGCT
CTGGCCGGTG GTTTCGCCCA TCAGCTCGCC CACGCCCGTG CCGGTGACGT CCTGGCCCCC
AAAACCGTCA TCAATATCGG ATCGACTGGA GTCGAAGGAG GACTGCGCAC CACCGATAGC
GAATCTTGGC AGCTCTACAC TCCTCAATCC CTGGACGACG AGGGTTGGGA CGCCGTCAAC
GACGTCGCCA CCGGCACACC ACCCCTGTTC GGCATCAGCG CCAACCCAGA ACCCATTGCC
GGTCCCGGTG CAGGCACCGG GACCGGTTCG ACCACTGGCC AATCCAACCG ATCCAACGAC
GGCATCGTTA CTGTCCGGCG TAGCACCCAA CCCACCACCG ACGGCATACG CCTCGGCCCC
AAGCCAGCCC CCACATCGAA CATCGCCCGC GCCGCGCAGC CTCCGAGCCT TCCAACAATT
CCCGAAGAAA CCGAGCAGGA TCTCGACCCT CCCGGTCCGT CCGCCCCGAG CACGGTCTCC
CGTGAACGGG AGAAGCTACC ATCGTCCCAG CCTGACTCGA TGGCATCGAG CGAGGAGGCG
GGCTACACGA GTGCTGGTGA TCTTGCTGAC CAGGTAGCCG CTGCGGCGGA TGGCCGAGAC
GTTGCGGGGT CCCGAGCCCC GGTGGTGGCT ACGGAGGGTC TGGCCCACGC CGTGGCCGAT
ACTCTGCCGT CACCGGTGGA GACTTTCGCG TCCGAGTCCG GTCTCCTGAT GCAGGCTGGC
CCTCAGCCGG TTGGTGAGTC GGCGCGGGCT GGGGCAACGG ACATCGCGGG CGCGGACCAA
GCCAGTGGCC AAATCAACCC CGACGGAAGT CCCCAAAACC CCGATACCCC GAGCGCAGAC
ACCTCCCCCC AGCCCACACC CATCCTATCC AAAGCCACAT CACAACACTT AGCTCGGCCA
GCCGAAGCGG CCACAACACA AGCTTTCCCA CCCGTAGACC CCGACTCCAA GGCTGAGCCC
AGGTTTGAGT TTAAGGTAGA GGGTGATGCC AAAGCCGAGC CCAAGTCCGG GTCTGAGTCG
AAGGCGGAGG CTCAGCCCGA GCTGGAGCCT GAGCCCAGGT CCGAGGCAAA GGCTGATGCT
AAGACGGAGG CTAAGGCCGA GCTTAAGTCC GAATCTGAGT CAAAGGCGGA GGCTCAGCCC
GAGCCAGAGC CCAAGTCCGA GCCCGAGCCC AAGGCGGAGG CTCAGCCCGA GCTAAAGCCC
AAGTCCGAGT CGGAGCCTGA GCCCAGGTCC GAGGCGAAGG CTGATGCTAA GACGGAGCCC
AAGTCCGAGT CCGAGCCCAA GATTGAGTCG AAGGCAGAGG CTCAGCCCGA GCCCGAGCCC
GAGCCCAAGC CCAAGCCCAA GCCCAAGCCC AAGCCCAAGT CTGAGTCCGA GCCCGGGCCC
AAGTCCGGGT CTGAGTCGAA GGCGGAGGCT CAGCCCGAGC CCGAGCCCGA GCCCGGGTCC
GAGGCGAAGG CTGATGCTAA GACGGAGGTT AAGGCCGAGT CCGAGCCCAA GCCCAAGTCT
GAGTCCGAGC CCGGGCCCAA GTCCGGGTCT GAGTCGAAGG CGGAGGCTCA GCCCGAGCCC
GAGCCCGAGC CCGGGTCCGA GGCGAAGGCT GATGCTAAGA CGGAGGTTAA GGCCGAGTCC
GAGCCCAAGC CCGGGCCCGG GCCCAAGTCC GAGCCCGGGC CCAAGTCCGG GTCTGAGTCG
AAGGCGGAGG CGGAGGCGGA TTTCAAGGCC GAGGCCGAAC AGAGATCTGA CAGCGGGGAC
GGCAATTCTT CAACGGTTCT GTCGCCTGAT AAGTTGCGTA GTGCTCTGCC TGACTACTTG
CGTCGAAGCG AGTCCTTGGG TCCGGCTGCG CAGTTCCGAG CCGTCACTGT CGACGATGAC
ACAAACCTTA TTCGTGAGCT CGAGTTGTTG GCGCCAGGCA TATCGCAGTC GGTATTAGAG
GAACTGGACC GCGACGTACG GGATGACTTC GGCAGATTTC TTGGTGAGGG GTATTCCTAC
CCTGTCCGCA TTGACGGGGA AGCTGCCGAG TTGACCGTCA CGGCTCGGTT TGATTGGGAT
GGCCTCAAGT CCGAGAGGAA AGCTAAGGAA GGTCCGAAGG CGGAAAGTAA GCCCGACGGA
TCCCACTCGC GTTCAGTCAA GTTAGATCAT TATTCGCATG TAGATCCAAA GTTCTTCGTT
TCATTTGTTC CCAGTGCACT GGTTGCTGGG GGGATGTCCA TCCCGACCGC GCCAAGATCG
ACGATCTCCA GGACCAGTAA AACGAGCTTC AAAACTATTA CCACCGTTGA GGTGGAGGAT
CAAATAGAGG TCACTGTTCC GGCCGTGATC CACGCTTCTC TGCGAGGGCC AAACGGGCAG
CCAATCGATG CCCGGCCACC CGAGAAGCGC GCGTTCTCAC ATGGGCCGGG AAAGTTCACC
GCGACGGGCA TCGTTCCGTT GCGGGTGCCG ACCAAGCTGG CTGTTCCAGA ACCTGGCCCG
GACCCCCTTC CCGCGAGGGT GCCCACCGCC CTGCCCAAGA GGTTCGGTGT GGAGAGCGTT
GTCGTTCACT CCGACCCGGA TCGCGGTACG CATTTCGAGC AGGTCGCAAA AATCCTGCAG
GAGGAAGGGC TTGGGGACAT TGTTCGCGTA GGGGCGCCAG GCCGGAGTGT ACTAAAGCAG
TTTCTGTCCA GAGACAAGCT TCCTGTCGCT GACGCGACAA CAGCTGGCGC CGACAGGGCC
GAGGAAGGTT GGGCCACCTC GGAGCCGCTG CTGCGAACGC CTGCCGAGCA TAATCTCTGG
CGCCGTGTTG TCCCGGGACG TGGCAGCGCC CTTCAGGCGC GGCTCGTGGC CAGGCAGGTG
CGGTCTCTCG AGACTGCGGA CGGCGTCAAG TATGAAGATG GCTTCAAGCT GGCCAGCGAG
AACACCGACA GCAAAATCGC CGACCGGCCG TGGGAGGCAA TGTTGACCGG CGGTGGCGGT
GGTGTGGCGG GGCCACTGAT GTACGTCGTC GGGCCGAGAG TGACGGTGTC GCGCACCGGG
CCGAGCGAGC AGAAGGTGGC GGCTACGAGA GTCGGGCGCC AAAAGGTAAA GGTCAAAGGT
TCGGCCGTTC GTTACCGAAC TGTCTATGAC CTGGAAGTGC GCCTGATTGG AAGGCCAGCT
CAGACGCTGG GCGGGGCCGT CGCGACGGTG CAGTGGACGA CCAAGGATCG GGCACGGGGC
ACTTCACTGG ATAGCGACCC GACCGAGTGG CGGGGGCGCA CGGGTGACCG CCGTACGCAC
TTTGCGCCAG CGCCTATTGA GCAGGGACAC TCCTTCGGAG GGGCATACAT TAACGATCTT
GATGGTGGTG ATCAGCTGAG GAGAAGTGTG ATCAACGCAT TAAGGGACGT GCCTGGAGCC
CGTGGAAATC TGGAGCTATT CGGCAATATC GGGGACAAGA GTCTGCCGAT GAGGCAGCGT
CTGATCCCAG CCTTAAAGAA GGAAGCATTT GTCCGGCAGT TCGGTGATCC GGAGTTGGCC
CAGGGTCTGA CCGGAACGGT TCAGGCGATC TTGTCGCGTC GGGAGGATAT TGAAGTTCAA
CTTTCGGATC GTCAACTGCG CCTTTTGCTG GACCGGATCG TTGGGCCGGG GCTGGAGATC
CCGCTAACCA TGAATGGCAA GCTGCACGAC TACACCACGG TCGTGACGAT CAAGGGTCGG
CTCTCCAGCC TATCCGATGG CGAAATTTTG AACCAAAAGG AGTCGGGAGC TACCGAGAAT
CGTGCAGCGA AGGCCGAAGC CAGTGCCAGT AAGGATAAAA GTTGGACTTC CTCCATAGGA
GTTGACGGCC GGCTCATCGC GGTACTCGGC CCGGCCTTCC CGGTCGGGAT TGTTGGCCCT
CGTGTCGAAT ATACCACGAG CAGTGGCCAT GATGTCGGCG TCAATCGTCG TCACGAGTCT
GAGATCGCGC GCGCACCGGA CCTCAAAGAA GACGGCGGGC TCGCAGATGT ACCGATGCGC
GAATTCGCGG CTACCCTGAC CGTGACGACG ACCTTCGAAT CCAGCGTGCG GCCCAACCCC
AGCAGACTAC TTCTGCTGCC CGGCAGGCCC GGTCTCCACG TCCCGACCGT CGTTTCCGGT
GCCCTGCCGC CCACGGAACA CCAGTTGGAG CTGCGCCTGC TGGTGCCCGA GCACCGGGTC
GCACTTAGCC CGCCGCCGTC CACGCCAGCA CCGGAACCGG TCGACAAGGA GTGGATGACG
AATCCACCGG TGTTGAGCGA AATGTCCGGT GGGTATCCGT ACAAGCTGGA CGGCAGCCTC
GTCGAGGCGT TCCTCGGCAC GGCTCATCTA CTCGATGTGG TCAAGGAGAC CCTGACGCGG
GCCACGAATG ACCCCATCTG GGGCTTCCCA GACGGGTCGA TTTCCACCCT GATTTCCCAG
TCGCTGGCGC CAGACCAGTT GACGGGCAGT CACGAGCTGT TCAGCACACC GCTGTCGCTA
TCCCAGCTCA CCTATGGCCG CCGCCGGGCG GATGCTTACG CCGAGGTCAA GATCCGACTG
AGGCCCCGCA ACCCCCGGGT GCAGGCGCCA ACCGAGTTCC ACACCGTCAA GGATATTCTC
GCGGGTGGCT CCAGCAGCGG CGCCAAGCAG GCTCGCTCGT GGGGCGGAAG CTTGTCGGTC
ACACCTGTGG GCGTGGTGCG CGGACCAGCC GGCGATCCGA ACGAAGATCG CGTCACCTCA
AGCGGTGTCT TTGTCATGTC GGGAACCTTC CTGCAATTCG CCCGCGGCCA GAAGTACGAG
CACAACATCA CGGGTACGTC GAAACTCGAG GTAGGCGGCA GGCCAGAACG TCGGGTGTTG
GTGCAACTTG ACGTAGACGT GGAGGTAGTC GCGGAAACCC GGCGTCGAGG CAACCTGGAT
GTGCTGGAAG TGCTGCCCGA CAGTCCGATC CAGCGGGCAG GGCAGAGCTT CACCTTGCGC
GACTCACTCC TGATGTGGAT GACCGAGAGG CAGGCGCGGG AACTCGGGGA CGACGACAAA
GCGTATTCTG GCCAGCTGGC CGAGGTCGAG CGACAAACGC GATATCTGTC GGACAACGCC
GTGCAGGCCG ATCCACACGA GCGCGACCGT CGAGACAACG CCACCGACCG TGCCCAGCCC
TCGGTTCCGA CTAACCGCAC CATCACGTTG CCCGCGATGA TTGGTATTCC CGGCAGACCG
TCATTCGGAA TCGGCGGCCT AGATCGCACG CTGGACCTGA GCAGTCATAT CGGTGCCGTG
CGACGTGCGA TTACGAACGC GATCGGCGGA ACCCACGGTA CACGGGTGGC ACAGGCGTTG
CTGCCGGAGT CCTCGCTGGA CGCACCGCAC GACAACGTAC GGATGTTGCA GACTTTCCTG
GGCCACGCCG ACAGCCATAT CGCCAGCGCC CTCAACGGTG GCCGGAGCCT GCTGCTTCGC
CTCGAGGGGC GGGTCCGGGG CCACACCTAT CAAATGACGG TAACCGGATC GATGGTGAGT
GAGCCTGAAT TCAAGGGAAT CGTGCACGTC GACAAGCTGA CTGTCAGTGA CAAGACAAAG
ATAACTCTGA CCGACGCCAC CACTCGCAGC CGAACGAAGG CATCGGCATC CCTCATGCTG
ACGGGTCGGG GGATGCACGT TGAAGGCCAA AGCCCGGAAG CGCAGCAGCA CTCTGGCCCA
GTCAGAGTGC TGGGTGGGGC GGGAGCGGCC ACCTCCGCGA ATTGGGGTGC CCGCGAAAAA
GAGTACTCCG ACGCTGACAC TCTGAAATAT TCCCAGTCTC TGGCCGTGGA GGGGCCAGTC
GCCACCTTCG GCACCGACAT CGTCCTCAGC ATCACTGTGA CCGGCCGCGA GCTACCCAAA
GCGGGCGCGG TCCTCAGGGA GGTCCGTCCA CTGACACTTC GCGTGTCGCC GTACAGTTCG
CGGATAAACG GCCAGGTCGG AGTCGCCGGC GACAAGGGCA CACTGCCGAT CAACGAGCTC
ACCGACTCCG CTCGAAGCGA ATGGCGTAAC ACTGCGGGAG TCAAGAGCCT GCCAGAGCCC
TCGCAGTACG CGGTCGAGCA CGTGTTCCTG GATGTGGCGA GACTGCACGA CGCGGCCGAG
CTGGCCTTGA CCAACTCCGG CGTGACCGTC GACGCCACCA CGCAGGCCGC GCTACGCGCC
GCGATCAACA CCACCAACCT GAAGGCTGGG CTGCCAGCCA TGCTGGCTGG CCGATTCCCG
ATCCCGCTGC CCCGCCAGAC AAAACGGGAA CTCTTCCTGG ATGCCCGGGT GGTAGGGCGA
CCGAAGTTTG CTGGCGCCAA CACGGATGTC ACTATCAAGA ATTCGTTGAA GGGTGAGCGC
TCCCAGAAGG TGACGCACAA GTCCGGGAGC ACCTATCTGG TAAAAACGCG CGGCCTACTC
GCCGTTCGCG AGGACGGGAG GGGGGGTATG CCAGTGGAGA GGCGTCACTA CGCGACGGCT
GACAGCCAAC GATACGACGC CAGTGCGACG CCAGAAAAAT ACAAGACGAC CGCCCAGCCA
TCGTCCACCC CAGAGGATGA CACGCTCGTC TGGGGTCTTT CCTATTCGTT GGAATTCTCC
CTGGTCGCTC TCCCTGTGCA TGTGCACCGC CCACGAAAAC GGCTCCAACC TTCCGCCCGT
CGTACACACC AGCACCGCCT CAACCCACCC ACGTCACGAA AGCGTGCGGG CGTCATGCTT
CGAATCGACG ACGCCGTGAT CCTGCGGATG AACGACGAGG CAGCACGTGA ACACACTGAC
ATCGTGCTAC CGCCGGAACT CAAAGAATCA GCCAAAAGTT TGGCCGAGAG TAGTAAGAAG
TGGACCAAGG CAGTCAATCG CAGGGACGGG TTGAAAGCTC AAAAGGCCAC AGACGCTGAC
CTCCTGAAAG CTGCCGAACA AGAGGCCACC GATGCTGAAT CCAAGTGGTG GGATGCCTAT
CAGGCCCATG AACGCCAACT CGACACGCTG CGTCCCGCCG CACCCCACAC CGACTCTGTG
CACACGGCTG AGGACGCCAC TGGGACCGAG GAGGCCCAGC CCACCGCCGC GACGCAGGAG
ACCACTGATC TCACCAACGC TGAGCCATCA GCTCCCAAAA GGGGGGCTCA GACCTCGACC
GATGCTTTGT CAGGTCCACC GCCCACCGAG GGGGCGCAGT TGTCCCCTGC GGCGGGCGGC
ACCCCGACGA TCGAGCCCGG CGCCTCCGCA CCCGCGCCGG CCGTCAGCGA CGTAGCGGAG
GACACCGTCT CCGCAGCCGC CGACGGTGGG CCGGATGGTG CGGAGACTGC CGCTGATCCG
GTGCCTGCTA CTCCATCGGT AGTGCGGCCG TCGTTGTTGG CTACTCCTGG TGATGGTCGA
TGTCAGTTGT ATGCGGTTAT CGGTAGTGAC CCCGGTCTGG TGGGGGAGAG GTTGGCTTGG
GCAGATCTCG ACACCCCAGC CCTACGTGCC TGGCTGGCCG ACCCGGAGTT GGTCCGTACC
CAGCTCACCG AGCAGAGCAG CCCTGAGAGG CGGGATGCAC GGGAACTCGT ACCGCAGTCC
ACCGAGTTGG GGCAGGCGGC CGAGCGTCTA CGTCAACTCG TCGAACGACA CCTGATCACG
CTGGGTCCGG CGAACATGCC GACCGCGGCC CTGCAGGCAT ATCGGGGAAA CCGCAACCAG
ACGCTCCACG CCACCGTCGA CTCCCTGGAT CACGCCACGG TGCTCAACCG GCTCCACGCG
GCGGGAATTA CCAGCCTGCG TGACCCCAGC CTGCTCCCCG TGGCCCACCT ACGCGACCTG
TACGTGCATC ACCGGACCGC CGACCTGGCC GCCAACGGTG CCGACCATGC CAGCGCCCGA
ACCATCGCGG AACAGGAGGT GCCCCTCAAA CCCGCCGCTG ACGGCACCCC AGAACGCGAC
CTTGCGGACG GATCCCTGTC TGCGCAGCAA ATGTTCGATT TCCTCAGCAC TCACCACGAC
ATCCCCCTGC ACGACCTGCC TGAGCAGGTA CAGCGCGACC AGCTGACGGT GCACCTGCTC
GACCCGGCCA GACCAGCTGA CCCCGTCGAG TTCGCCGAGC TCACCAAGGC CGTACAACAC
TGGGAGCAAC ATTGGCACCG CGATGCCGGT GAGGCATACA TCGGCCTACT CGCCTCCGCA
CTGGGCTCCC GCATCCGCAT TCACACACCG GATCACGTCC AGACCATCGG CCCCGACAAC
GCCCCCCTGA TTACCATCCA CCGGGAGCAC GACCACTACC AAGCCACCAT CACCCCACCC
CCGGCCAGCA CCACCACCAC TGACACGCGG GTTTCGCCGG CAAACAGCCA CAGCGAACCC
AACGGCCGCA CCGCCCCGAC GGCTACACCA AGCACAACCG ATGGCACCCC GATCGCGTCG
CCAAGGACTG GCGATGCCAG TACCGCCCCG GATCTGACAT CCAGCACGGA TCCGGACCGG
TCCGAGTCTG TCACGGCGGT ATCCCCCCAA CCCGCCGGTG ACGGAGTCGA GGCAACCGAT
GACTCCCGCC CCAGCATCAT CGGACCGTCC GCTGCCCCCG ACGCTGGGGC CGCCGTGCAG
CAGCCCGTCG TATCGCCGGG TGGCCCACGG CAGGCCGCCC CGCACCCCCC GGCCGACGTG
GACCTCTCGT CGAGCCACGC CGATGATGAC CGACCAGATG CCGCGGTAGC CCGACCGGAT
ACCGAGGTGG ACCGGCCGGA CACGGTGGTG GACCGGCCGC TGGTCCCATC GGAACCGGGC
CAGACCGAAC GTGACCTGGC ACCACCAAGG GTGGCAGGCC GAGATCGACA CCACGTCCCT
GCGGACGAAG CCGAGGCCCT GGACGCGGCG TCGCGTGGCC GGCCCAGTCC CAAGGAGCCC
GCCGGGAAGC GGCCGACGGA CCCCGAGACC CCGGACGTCG GCGGGCAGGC CGCCACCACC
GTTTCCGGCA TCCACACCAC ACCACCGCCG GTCACCTACG TTCCCGCCGG ACTTGTCCTG
CTCGATCCCG AACATACCGA CAAACTGCGT GTCGCCCTCA ATTATCCACC CCCATCCGAC
GAATATCTGG TTTTTGCCGA GGGCACCCCC GACGGCATTG TCGTCGGCAA CAAAATCGTC
ACGCCCGAAC ACCTCGCGGC CCTCATCACC GCTGACACCC GGTCTCACGG TAAAGATGTC
GTCCTCGTCG CCTGCGAGAC CGGGCAGCGC GATTACGACC TCCGCCTGTC CGGTCAAGCC
GCCATCACCG CTGTCAGCGC ATCCCCGGAC ACCGCCTGGG TCACCGCCAC CGGCCGAATC
TTTGCCGCTG CCACCAGCTA CACCCCAACT GGTCACCCAC AACTCGATGT CACCCGCCCT
GGCACCTGGC GTCGCACCGC CGGCCAACCC CCCACTACCA GAATCCACCA CCACTACGAA
TACCCCACCC GCCACACCAG CGCCGAACCT CCGAACATTG CCGATGCCAT TGCCTTTGGA
TCGTCGAGCA AGAAGCCGGC CACATCCTCG GACAAGAAGC GGCCCAAAAA GAGCCTGATC
GACGATATCG ATCGGGCCGC CAAAAAGTTC ACCCGGGCCT ACGGGGGTGG CCAGTACGGG
CGGGCAGGGG ACCAATTCCG GCTGCGATTC GCCCGGGACA AGAGTGCCGA GAGGGCGGCT
CAGACGCTCT CCCATCTCAG CTTCAAGCTG CACGACGTGT TCCGCCAGCT GCGTGAGCAG
AGCGAGCACC GAGAGAGTCT GCCCAAGGAT CTCGAGGTGC AAGGGATGTT GATCAATGGT
CGGTTGGTCT TTGCCACCAA CTTCAACGCC ACCGCCGGCC TGATCAGCAA GGTGAAATTT
TCGGTAAAAA ACGAACAGGT CACACCTTTG GAGCAGCTGC TGCGCATTTC GCAGAGCGAC
AGCAGCCGGC GAAAGCATTT GGCGCCGCAG GATGCCGACG AGTATGTCGG GCGGCTGAGA
CGCAGTGAGG AGAAAATCGA TGAGGCGTTC AGGGGAGTCC GCGACAACGA CACGGCCAGG
GCGATGCGCG CCAACCGAGA CGTCCAGCTC GCGGACGCCG CTCCGACTGA CGAGAGTCGG
CTGTGGTTGA ACCACCTCCT CACCAGTGAC GAAAAGATGG GTGCCGTCAT CGTGCTACAG
CATTCGGACA GCGATGCCAA CTCGATGCAC GCCGAGCAGA AACTACTGCT GGCCGTGCAC
AGCGCCGATC TGCAGCCATC GGACATCGAG GGTACCTACC TGATCATGGG TAAGTATCGG
CCCTGTCTGG GATGTTGGGC GGCCCTCCAC CATCATCGGG AGGAAAATTT CCCCGTCGAC
TTCGACGACA ACTACGGCAA CTACTATCTC GAGTCGATTC GTTCGCTGGT GTCCTACCTT
CCGCACACCA TCCGCGACCC GCAGGGCAGG GTGGACAACT ACCTGGAGGG TGTCATCAGT
GGCGTGGCCA GCCAAATGAT GAGCGTGTCC GCGCTGTCCC GCCAAGCGCC ACCCCAAGAC
GCCGTCGACA GAAACGGCCT CGAGAATGTC ATCCCAGCTG AGAACGCCCC CAACCGTGGC
TACGTCACCG CCTCGGACTC AGAGGTCGCG TTTGACGAGG AATCTAACCG GACTGTGAGC
GTGAAGCGGA CCCTCGACTT CACCACCGGC ACGCGGAACC GTACTCTCGG CACCGGAAAA
GAAAAGTCGC ACGGCTCACG TGCAGCCCAG CGTGTGCTCA ACGAGGCTGA GCGCAGGCAA
CTGGCAGCAG CGTGGAAGGG GCGTGATCCC GAACAACGCG CCGCGCTGTT CAAACACCAT
GCGGCCCGCG GTGTGTCGCA GGCGGAAATC GCCGAGGCAT CCGGGGCTGA CCCGAGCAGC
GTCTACAAGA TCGTTCGTGA CAAGGATGGC CATGAGCACC GCGACGGCCG CAATAGTGTC
AAGCGACGGG TTGCGTCCCG CAGTAGCCGC CACGATTCAA CAGCGTCCTC CACGACCAGC
AGGGGCGCCG GCAAGTTCAA AAAGCGCCCC GACATCGACG ACGTCGGCCG CGCCTCCCTG
CGCGACGTCA TGCGCGGCAC CGACTTCTAC CAGGGCTGGA AGCAGATCGA GAAGTCGCAG
AGTAAATCGA CCCTGAAGGC CCGTGACATC CCATCCGCAC TCGACAAGGC CCTCGCCGCT
GCCCGACAGC AGTACAGCAT GCAGAGTATC GCTGACTACC TGCACATGCC ACGCAAGAGT
CTGCAGCAGC ACTTGGACAA GCGCTACGGC CCGGTCAACG ACGTCAGCAA TACCCGTCAG
CCCAGCCCCA TCGATGAGGA GATGCCGGAT TCATATCCCA CGCCTGGCCC CGACGTCATG
GACTATGAGT CGACGGGAGC CAGCCGCGCG GGGCCCAGCA GCGCGTCCGG GTCACGACTG
CCGCAGGTTG CCTACAGCAC CGCACCTATC ACCTACAACG TCGCGCCGCA GGCCGCACCC
GCGGCGGACC CCGCGCCACC ACCCGCGAAC ACCGTTCCGA TCCACGACCC CGACCACGGC
CCCGCTGTTT ACGCAGACAA CAGCGGCCGC CAGTTCTACT GGAACTCTGA AACCAACCAG
TGGGAGTACT ACGACGACAA CGAGTGGCAC GACTACCGGG CGTCCGACCC GCATGGCAAG
CGCAAGTACA CCCGCTAG
 
Protein sequence
MPDFHIPNDP EWQWAYDAIL YTTGGEFPEA DPTALRAMGD ELYAFTNTLL NGVASTSNLG 
NGLSAHLDGP AADAFNQFRG GIANNVPTAG NISWALGNAA YEFALDSEST QYNIVIAAFT
QVVEIAFAIA SGFGAAAVPA LIKIGQEIVK TLIEFLRMRL QNQLLRLVWE GIEEGLEELW
QGMAAQLTQM AEGNRKDFDY QDLALAFAGG FFIGMGVSGM HAIGGKLYPK INKNTYTRET
LSALAETLFE GLFSMMVGGG FNPFATMTSS IIGGLAHQYA HDFGNQFGVD PNNLPNRPPP
DPKKGSSRGN PPPTSVEGPP PPAYHDIGDS PPSYDQAAGS PPPHSETQGD PPPYNEATGP
ANAATGTPSS VGSSPVPGNP VTNGSHNSQP LDTGRDGAPT PATAPAFANR PDLSTVNAPA
PNGSVAPAPG TAPASAGALR PGFAGVSHFG PSSVQVPTPT TGSAGSIAAN PAATPAAPPP
ATSPAEEIVP IATPAGTDAT LPATAGGLPG FESTSPAASP LTDPTAPAPV VAETTAPVLT
QAADPATSPV GDPAGQAPTV AMPVTAVPAV EVPRTGSPNA ESPVVQAPTG QPSAFQPPTT
QLPSPQTSGG STPGAQGSNT PFSSASPPPQ VTNPVAPAPA MVEQASAVPD PAVEPAVASE
SGFTGQALDA QQTLVTSNPH HVSTTSTTTQ PSLTSGIVQS PPSSDTALRN DSTRDTATRT
ESAEDRDSRF QADGPRRASG QHDPAVTPAP TAEVAPATSR GQTDIVIDLP SGSSTDPGAE
NGPRTTAASI GPTADGTEPP PVTPQAGPTI PTPPGAADAV LRTEPAAAST SNLEATAPTS
PSAPVSANES DQGQSTTEIF DGLALGGTGS QVVPGPAPAL LADVVPRDQW WRLYLDPRHH
DAAQAAYPND PGSYYDNDQS PGFQRGMTEA YEQFLNNSGI NSNTLNADTY KQIHETVTRY
LNKQFDWTGG TTRQGVIKPT SFPVRNPDIS SDVLAERIGE RPLIKDFHEA LRQRGGPKPV
TFTSTIMGRG GISATTMPRT ITTNYSRTDA PALINTIFDQ YRHERQNATT PDATLAAIAR
TIRKLHITHP FEDGNRRLNV HVILPKLLLD NGFQPVIAPD MDQLFQGGRS IDDIVLVLRN
TQPATDGIRL GTEPNPQANV VPVAQIPTVV DVPAITSVAA QADTTALAAS SDGHRHDSSS
VVPLATVVRG ADGVLRASGK PLVIRPLTGD RPGIALLDDG DWNRVSTRTL LPPDTHAAAQ
DFKGDPATAV IVHGVGDVVH APVRNPDGST TTVALGPDGL AQVIGRPGAP IVLVSCETGA
LAGGFAHQLA HARAGDVLAP KTVINIGSTG VEGGLRTTDS ESWQLYTPQS LDDEGWDAVN
DVATGTPPLF GISANPEPIA GPGAGTGTGS TTGQSNRSND GIVTVRRSTQ PTTDGIRLGP
KPAPTSNIAR AAQPPSLPTI PEETEQDLDP PGPSAPSTVS REREKLPSSQ PDSMASSEEA
GYTSAGDLAD QVAAAADGRD VAGSRAPVVA TEGLAHAVAD TLPSPVETFA SESGLLMQAG
PQPVGESARA GATDIAGADQ ASGQINPDGS PQNPDTPSAD TSPQPTPILS KATSQHLARP
AEAATTQAFP PVDPDSKAEP RFEFKVEGDA KAEPKSGSES KAEAQPELEP EPRSEAKADA
KTEAKAELKS ESESKAEAQP EPEPKSEPEP KAEAQPELKP KSESEPEPRS EAKADAKTEP
KSESEPKIES KAEAQPEPEP EPKPKPKPKP KPKSESEPGP KSGSESKAEA QPEPEPEPGS
EAKADAKTEV KAESEPKPKS ESEPGPKSGS ESKAEAQPEP EPEPGSEAKA DAKTEVKAES
EPKPGPGPKS EPGPKSGSES KAEAEADFKA EAEQRSDSGD GNSSTVLSPD KLRSALPDYL
RRSESLGPAA QFRAVTVDDD TNLIRELELL APGISQSVLE ELDRDVRDDF GRFLGEGYSY
PVRIDGEAAE LTVTARFDWD GLKSERKAKE GPKAESKPDG SHSRSVKLDH YSHVDPKFFV
SFVPSALVAG GMSIPTAPRS TISRTSKTSF KTITTVEVED QIEVTVPAVI HASLRGPNGQ
PIDARPPEKR AFSHGPGKFT ATGIVPLRVP TKLAVPEPGP DPLPARVPTA LPKRFGVESV
VVHSDPDRGT HFEQVAKILQ EEGLGDIVRV GAPGRSVLKQ FLSRDKLPVA DATTAGADRA
EEGWATSEPL LRTPAEHNLW RRVVPGRGSA LQARLVARQV RSLETADGVK YEDGFKLASE
NTDSKIADRP WEAMLTGGGG GVAGPLMYVV GPRVTVSRTG PSEQKVAATR VGRQKVKVKG
SAVRYRTVYD LEVRLIGRPA QTLGGAVATV QWTTKDRARG TSLDSDPTEW RGRTGDRRTH
FAPAPIEQGH SFGGAYINDL DGGDQLRRSV INALRDVPGA RGNLELFGNI GDKSLPMRQR
LIPALKKEAF VRQFGDPELA QGLTGTVQAI LSRREDIEVQ LSDRQLRLLL DRIVGPGLEI
PLTMNGKLHD YTTVVTIKGR LSSLSDGEIL NQKESGATEN RAAKAEASAS KDKSWTSSIG
VDGRLIAVLG PAFPVGIVGP RVEYTTSSGH DVGVNRRHES EIARAPDLKE DGGLADVPMR
EFAATLTVTT TFESSVRPNP SRLLLLPGRP GLHVPTVVSG ALPPTEHQLE LRLLVPEHRV
ALSPPPSTPA PEPVDKEWMT NPPVLSEMSG GYPYKLDGSL VEAFLGTAHL LDVVKETLTR
ATNDPIWGFP DGSISTLISQ SLAPDQLTGS HELFSTPLSL SQLTYGRRRA DAYAEVKIRL
RPRNPRVQAP TEFHTVKDIL AGGSSSGAKQ ARSWGGSLSV TPVGVVRGPA GDPNEDRVTS
SGVFVMSGTF LQFARGQKYE HNITGTSKLE VGGRPERRVL VQLDVDVEVV AETRRRGNLD
VLEVLPDSPI QRAGQSFTLR DSLLMWMTER QARELGDDDK AYSGQLAEVE RQTRYLSDNA
VQADPHERDR RDNATDRAQP SVPTNRTITL PAMIGIPGRP SFGIGGLDRT LDLSSHIGAV
RRAITNAIGG THGTRVAQAL LPESSLDAPH DNVRMLQTFL GHADSHIASA LNGGRSLLLR
LEGRVRGHTY QMTVTGSMVS EPEFKGIVHV DKLTVSDKTK ITLTDATTRS RTKASASLML
TGRGMHVEGQ SPEAQQHSGP VRVLGGAGAA TSANWGAREK EYSDADTLKY SQSLAVEGPV
ATFGTDIVLS ITVTGRELPK AGAVLREVRP LTLRVSPYSS RINGQVGVAG DKGTLPINEL
TDSARSEWRN TAGVKSLPEP SQYAVEHVFL DVARLHDAAE LALTNSGVTV DATTQAALRA
AINTTNLKAG LPAMLAGRFP IPLPRQTKRE LFLDARVVGR PKFAGANTDV TIKNSLKGER
SQKVTHKSGS TYLVKTRGLL AVREDGRGGM PVERRHYATA DSQRYDASAT PEKYKTTAQP
SSTPEDDTLV WGLSYSLEFS LVALPVHVHR PRKRLQPSAR RTHQHRLNPP TSRKRAGVML
RIDDAVILRM NDEAAREHTD IVLPPELKES AKSLAESSKK WTKAVNRRDG LKAQKATDAD
LLKAAEQEAT DAESKWWDAY QAHERQLDTL RPAAPHTDSV HTAEDATGTE EAQPTAATQE
TTDLTNAEPS APKRGAQTST DALSGPPPTE GAQLSPAAGG TPTIEPGASA PAPAVSDVAE
DTVSAAADGG PDGAETAADP VPATPSVVRP SLLATPGDGR CQLYAVIGSD PGLVGERLAW
ADLDTPALRA WLADPELVRT QLTEQSSPER RDARELVPQS TELGQAAERL RQLVERHLIT
LGPANMPTAA LQAYRGNRNQ TLHATVDSLD HATVLNRLHA AGITSLRDPS LLPVAHLRDL
YVHHRTADLA ANGADHASAR TIAEQEVPLK PAADGTPERD LADGSLSAQQ MFDFLSTHHD
IPLHDLPEQV QRDQLTVHLL DPARPADPVE FAELTKAVQH WEQHWHRDAG EAYIGLLASA
LGSRIRIHTP DHVQTIGPDN APLITIHREH DHYQATITPP PASTTTTDTR VSPANSHSEP
NGRTAPTATP STTDGTPIAS PRTGDASTAP DLTSSTDPDR SESVTAVSPQ PAGDGVEATD
DSRPSIIGPS AAPDAGAAVQ QPVVSPGGPR QAAPHPPADV DLSSSHADDD RPDAAVARPD
TEVDRPDTVV DRPLVPSEPG QTERDLAPPR VAGRDRHHVP ADEAEALDAA SRGRPSPKEP
AGKRPTDPET PDVGGQAATT VSGIHTTPPP VTYVPAGLVL LDPEHTDKLR VALNYPPPSD
EYLVFAEGTP DGIVVGNKIV TPEHLAALIT ADTRSHGKDV VLVACETGQR DYDLRLSGQA
AITAVSASPD TAWVTATGRI FAAATSYTPT GHPQLDVTRP GTWRRTAGQP PTTRIHHHYE
YPTRHTSAEP PNIADAIAFG SSSKKPATSS DKKRPKKSLI DDIDRAAKKF TRAYGGGQYG
RAGDQFRLRF ARDKSAERAA QTLSHLSFKL HDVFRQLREQ SEHRESLPKD LEVQGMLING
RLVFATNFNA TAGLISKVKF SVKNEQVTPL EQLLRISQSD SSRRKHLAPQ DADEYVGRLR
RSEEKIDEAF RGVRDNDTAR AMRANRDVQL ADAAPTDESR LWLNHLLTSD EKMGAVIVLQ
HSDSDANSMH AEQKLLLAVH SADLQPSDIE GTYLIMGKYR PCLGCWAALH HHREENFPVD
FDDNYGNYYL ESIRSLVSYL PHTIRDPQGR VDNYLEGVIS GVASQMMSVS ALSRQAPPQD
AVDRNGLENV IPAENAPNRG YVTASDSEVA FDEESNRTVS VKRTLDFTTG TRNRTLGTGK
EKSHGSRAAQ RVLNEAERRQ LAAAWKGRDP EQRAALFKHH AARGVSQAEI AEASGADPSS
VYKIVRDKDG HEHRDGRNSV KRRVASRSSR HDSTASSTTS RGAGKFKKRP DIDDVGRASL
RDVMRGTDFY QGWKQIEKSQ SKSTLKARDI PSALDKALAA ARQQYSMQSI ADYLHMPRKS
LQQHLDKRYG PVNDVSNTRQ PSPIDEEMPD SYPTPGPDVM DYESTGASRA GPSSASGSRL
PQVAYSTAPI TYNVAPQAAP AADPAPPPAN TVPIHDPDHG PAVYADNSGR QFYWNSETNQ
WEYYDDNEWH DYRASDPHGK RKYTR