Gene Sare_3153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3153 
Symbol 
ID5706211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3600600 
End bp3611126 
Gene Length10527 bp 
Protein Length3508 aa 
Translation table11 
GC content72% 
IMG OID641272585 
ProductAcyl transferase 
Protein accessionYP_001537952 
Protein GI159038699 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.215777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACG ACAAGACGTT GCTCGACTAT CTGAAGTGGG TCAGCGCGGA TCTACACGAC 
ACCCAGGAGC GACTGCGCGA GCTCGAACAG GCGCAGCGGG AACCGATCGC TGTTGTCGGG
ATGTCCTGCC GGTTCCCGGG CGGCGTCGAC GGTCCCGACG ACCTGTGGCG GCTGGTCGAG
ACCGGCACCG ACGCCATCAC CGGGTTCCCC ACGGACCGAG GCTGGGACCT CGACGCGATC
GCCGACTCGG TGGCCGTGCA CGAGGGCGGC TTCCTCGCCG GTGCCGACCG GTTCGACGCT
GGCTTTTTCG GTATCAGCCC GCGCGAGGCG ACCGCAATGG ACCCGCAACA GCGGTTGCTG
CTGGAAGGCG CGTGGGAGGC GCTGGAGTCT GCCGGCATCG ACCCCCGCTC GCTGCGCGGC
ACGCGTACCG GGGTGTTCGT CGGCAGCAAC AACCAGGACC ACGTCATCGT GCTGTCCAGC
GCCACTGCCG ACAGAACCGG TCAGGGACTG ACCGGCGCCA CCGCCAGCGT GCTCTCCGGA
CGGGTGTCGT ACGTCCTCGG GCTTGAGGGT CCCGCGCTGA CAGTCGACAC CGCCTGCTCG
TCCTCCCTGG TCTCGTTGCA TCTGGCTGTG CAGTCGCTGC GGCGTGGCGA GTGCTCGATG
GCGCTGGCCG GCGGTGTCGC GCTGATGGCC ACCCCGGCGA TGTTCCTCGA GTCGAGCGGT
CAGGGCGTGC TGGCATTGGA TGGCCGGTGC AAGGCGTACG CCGAGGGCGC TGACGGCACC
GGCTGGGGTG AGGGCGTCGG TTGGGTGCTC GTCCAGCGGC TGTCCGACGC CGTCCGCGAG
GGTCGGCGGG TCCTCGCCGT CGTGACCGGG ACGGCGGTCA ACCAGGACGG CGCATCCAAC
GGCCTCACCG CCCCGAGCGG CATCGCCCAG CGTCGCGTCA TCCGGGCCGC TCTCGACGAC
GCCGGGCTCA GCGCCGCCGA CGTCGACGCT GTCGAGGGAC ACGGCACCGG CACCGCGCTG
GGCGACCCGA TCGAGGTGGG CGCGCTGCTC GAGACCTACG GGCGGGAACA CGCCGCGGAG
CGGCCGCTCT GGCTGGGCTC GCTGAAGTCG AACATCGGCC ACACCCAGGC CGCCGCCGGG
GTCGGCGGCA TCATCAAGAT GATCCAGGCG ATCCGGCACG GGATACTGCC GGGCACCCTG
CACGCCAGCC AGCGCACCAG CCGGGTCGAG TGGGACCGCG GCGGGCTCGC CCTGCTGACC
GAGGCGACTC CGTGGCCGGA GACCGGTGCC CCACGCCGAG CTGGTGTCTC CTCGTTCGGG
ATCAGCGGTA CCAACGCACA TGTGATCCTC GAGCAGGCGC CGCCGAAAGA AGCGGCCGAG
CCGGAGGTCA CCCGCACCCC TGGGCTGCTG CTGTGGGTGC TGTCGGGCCA CACCGAGGAG
GCCCTGCGTG GGCAGGCGCG TCGCCTGCTC ACCTATGTCG AGGACAACCC ACAACTGTCC
GCGACGGACA TCGCCTGGTC CCTGGCCACG ACGCGGAGTC GGCTCGCGCA CCGGGCCGTC
GTGCTCGGCG CCGACCGGCC CGGCCTGCTG CGCGGGTTGG CAGACCTGGC CCGTGGCGAG
CCGTCCGCCG CGGTCGTCCA GGGAACTGCC CACTTCGGCG ACAAGGTCGT CTTCGTCTTT
CCCGGCTATG GCTCCCAGTG GCGTGGCATG GCACTGGAGC TGATCGAGCA GGAGCCGGCC
TTCCGCGATG AGTTCCTGGC GTGCGAAACG GCGTTTCTCC CGTGGCTGGG CTGGTCGTCG
GTGGCCGCGT TGCGTGGCGA TCCCGAGGCG CCGGACCTGG AGGAGTTCGC GACCACCCAG
GTTCTGCTGT TCATCGTGAT GGTGGCGCTG GCCGCGCTCT GGCGCTCGTA CGGGCTCGAG
CCGTCTGGTG TCATCGGCCA CAGCCAGGGC GAGATTGCCG CCGCATACGT GGCCGGCGCC
GTGACACGCG AGGACGCCGC GCGCTACGTC GTTGCCCGCG CCCGGCTGAT GCAGCAGCAC
CTGGACGGGC GGGGCGGCAT GGTCGCCGTA CAGCTCAGCG CGGACGAACT ACGGGAGCGC
CTGACACCGT ACGGCGACCG CCTCGCCCTC GCGGCGATCA ACGGACCCTG TGCCGTGGTC
ATGTCCGGGG AGGACGCTGC CCTTGACGAG CTGGCCGGGC AGCTGGTCGC CGAAGGCGTA
CGCGTGCACA AGCTCGCGCT GAACGTCGCC GCCCACTCCG CCCAGCTCGA TGCGTACCGC
GCCGAGATCG AGGAGATGCT CGCCACCCTG CGGCCCGTCT CCAGCGACGT CCCGTTCTTT
TCGACGGTCA CCGGTGGCCA GCTGGATACC GTCGGTCTCG ACGCGGGGCA CTGGTTCAGC
AACCTGCGGC AGACCGTCCT GTTCGAGCAG GCCACCCGGA CGGCGTTGAC CCAGGGCTAC
CGGCTGATGC TGGAGGTCAG CGCGCATCCG GTCGTGGCGA TGGCGGTGCA GGACATCATC
GACGACACCG GCATACCCGC CATCCCCCTG ACCACGTTGC GTCGCGGCGA GGGCGGCCGC
GACAGGTTTC TGCGGGCCCT GGCTGAAGCG CACGTGCAGG GCGCCCCGGT CGACTTCGGC
GTGCTGTTCG GGGGCACGGG CGCGCGACGG GTGGCCCTGC CGACGTACGC CTTCCAGCAC
GAGCGGTTCC GCCTCGACCC TGGAACGACG ACCGGCGACG CGTCCTCGCT CGGGCTCACG
CCACTGGACC ACCCGCTGGT CACCGCGGCG ATCGCGCTCC CGGAGCCGAG CGGGCAACTG
CTGACCGGGC GGCTCACCCG CCGTACGCAT CCGTGGCTGG CCGACCACAT CGTGCTGGGC
GCCGCACTAT TGCCGGGAAC CGCTCACCTC GAGCTGGCCT TCCGCGCCGG TGACCAGGTC
GGCTGTGACA CCGTGGAGGA GCTGACCCTG CAGACGCCGC TGACGCTGCC CGAGCACGAC
GGTGTGGCGC TGCAGGTCTT CGTCGGCGGC CCGCAGGACG ACGGCCGGCG GTCCGTCGCG
GTCTGGTCCC GCCGGGACAC TCCCGACCCG GCGGACGACG GCGACGAGTG GACCTGCCAC
GCCACCGGGC TGCTCGGCCA CGCCGCGGCC ACCCCCGTCG ACGGCCTGAC CTCGTGGCCC
CCGGCCGCCG CGGCAGTGGA CATCGACGGT CTCTACGAGC GGTTGTCGGC CGCCGGATAC
GGTTACGGTC CGGCGTTCCG TGGGCTACGT TCCGTGTGGT CGGACGGCGG GGACATCTAC
GCCGAGGTCA CGCTGCCCGA GGAGACCGTT GCCGACGCTG GCCGCTTCGG GCTGCATCCC
GCCCTGCTCG ACGCCGCCTT CCACGCGCTG ATCGCCGCCC GTCCGCCGCA GCGCAGTGCT
ACGCGGCTGC TGTTCTCGTG GGCGGGTGTC CGGCTGGTCC GCGCTGGTGC GTCCGCCCTG
CGGATCCGGC TGCGCCCGCG CGCCGACGGG ACGGTCACGC TGCTGGCCGC CGACGTGGCC
GGTACGCCCG TGGCCACCGC TGACGCGCTG GTCATGCGTG AGGTGACCGA GACGAGCCTC
ATTCCGGATC CCGCACGCGA ACTGTTCCGG GAGCAGCTGC AGCCGTTGGT GGTACCCAGG
CAAGCCGCGG ACGAGCCGGT GATCTGGACC GATCCGGGGG GCTTGCTGGA GTCCCTGACC
GAGCCTGGCA CCCGGTACGC GGTCTTCGCC GCGCCGACAC CGCCCGGTGT TGGCGTCGTG
GCCGCCGCCC GGCAGGTCAC CGAGCGGCTA CGGACACTGC TCAGCTCCTG GCTCACCAGT
CCCGGCGCCG AGTCCACGAC GCTGGTGATC CGTACTCGTA ACGCCCGTTC CGGGTCGCCC
GACCCGGTCC AGGCAGCCGG TCGCGGCGTG GTTCGTTCGT TCCAAGCCTC CCATCCGGGC
CGGATCCTGG TGCTGGACAC CGCGTCGGCC GACGAGCCGG TCGCCGCAGT GATCTCCGCC
GCCCTGGCGG CGGAGGAGGA CGATGTCGTC CTTCATGACG AGACCGCCAT GGCCGCCCGA
CTCGTGCGCG TCCCGGCAGG TGACCTGGAC CTGCCGTGGT CCGGCACCGT ACTGGTCACC
GGTAACGCCG GCACCCGTGC CGCCGAGGTC GCAAGGCACG TGGCCGCCCG AGGCGCCGCA
CGCGTGGTGC TGGCTGGTGC GGGCGAGCCG GACCACGCCG GCGTCGAGGC GGTGGCCTGC
GACCTCACCG ACGCGGACGC GCTCACAGCC CTGGTCTCCG ACGTGGCACC CGACGTCGTC
CTGCACGCCG CCGAGACCGG TGATCCCGTC GCCACGGCGT GGGCGCTGGA CCGGGCAGCC
ACCAGCGTGG ACCTTCGAGC CTTCGTCCTG TTCTCGACCG CGGACGGTGT GCTTGGCGGC
GCCGGTCGCG CCGAGCGCAG CGCCGCGAGC GCCTTCACTG ATGCCCTGGT GCGGCGGCGA
CGGGCGACCG GACGGGTGGG ACAGGCTCTC GCCTGGGGCT CTTGGGCCTC AACCGCTGGG
GGCCTCGCCC TGCTCGATGC GGCCGTCGTC GTGGACGAGG CGGTCGTCGT GCCGTTCCGT
CCCCGGGCCC GTGGGAACGT GCCGCCGCTG CTGCGCAGTC TCGTGCGGGC GCCGCTGCGG
CCTACGGTCG ACGCCGTCGC CCAGTCCTCG TCCGCGCTGG CCCGGCGGAT CGGCGGGCTC
GACCCGGCCG AACGGGTCAA GGCGCTGGTC GAGCTGGTCC AGTCCGAGGC GGCGGTGGTG
CTCGGCTTCG CCGACGTCCG CGCGGTTCCG GCCACCCGGG CGTTCCGCGA CATCGGCTTC
GTCTCGATGA ACGCGGTTGA GCTCCGTAAC CGACTCACAG CGGTGACCGG CCTACCGCTG
GCCGCCACGG TCGTATTCGA CCACCCGTCC GCCGTCGCGC TCGGCACCCA CCTGGGAACG
CTGCTGCTCG GCGGCGCGCC GGCGGAGACC GTAGAGCCGG TCACCGACCG CGTCGTGGAC
GAGCCCATCG CCGTGATCGG CATGTCCTGC CGCTTCCCCG GTGACGTGCG GTCGCCGGAG
GACCTGTGGC GACTGGTCGC CGACGGAGTC GACGCCACCG GCGATCTTCC GGGGAACCGG
GGCTGGAACG TGGACTCGTT CTACGATCCG GCGCCCGGGC AACCCGGCCG CTCGTACGTG
CGCCGGGGCG GCTTCGTCCG CGACGCCGAT CGGTTTGACG CGGGCTTCTT CGGCATCAAC
CCGCGGGAAG CCCTCGCGAT GGATCCGCAG CAACGGCTCC TGCTGGAGAC AGCGTGGGAA
GGCCTGGAAC ACGCCGGCCT CGACCCGGCC ACGCTACGCG GCAGCGACAC CGGGGTGTTC
GTCGGCAGTA ACGGTCAGGA CCACGCCCTC GTGCTGTCCG GCGCGGTGGG TGAGCTGTCC
GGGTACCGGC TCACCGGCAC CAGCGCGAGC ATCATGTCCG GTCGGATTTC GTACGAGTTC
GGCTTTGAGG GGCCGGCGCT GACCGTCGAC ACGGCCTGCT CCTCGTCGCT GGTCTCGCTG
CACCTGGCCG CCGAGGCGCT GCGCGGCGGG GAGTGCTCGC TCGCGCTGGC CGGCGGCTGC
TCGATCATCT GTACGCCCAC CCAGTTCGTC GAGTTCAGCA TGCAGAATGC TCTCTCTCCG
GATGGCCGGT CGAGGGCGTT CTCGGCCGGG AGCAACGGCT TCGGTATGGC CGAGGGCGTG
GGCTGGCTGG TGTTGGAACG GCTGTCCGAG GCCCGACGTC ACGGCCACCC GGTGCTGGCC
GTGGTGCGCG GCAGCGCAGT CAACCAGGAT GGCGCCTCCA ACGGGTTGAC CGCCCCGAAC
GGCCCCTCCC AGGAACGGGT CATCCGACAG GCGCTCGCCT CCGCGGGGCT CACCCCGGCC
GACGTGGACG CCGTCGAGGC GCATGGCACA GGTACGCCGC TCGGCGACCC GATCGAGGCC
ACCGCGTTGC TCAACACGTA CGGCCGGGAC CGCACCGGGC CCCCGCTCCT GCTCGGGTCG
CTCAAGAGCA ACTTCGGGCA CGCCCAGGCC GCGGCCGGCG TCGGCGGTGT CATCAAGTCC
GTCATGGCGC TGCGGGCCGG CGTGCTACCG CCGACCTTGC ACGCGGACGA GCCGACGCCC
CAGGTGGACT GGTCGTCCGG TGGCGTCCAA CTGCTAACCG AGCGCAGGGA CTGGCCCCGG
ACGGACCGAC CGCGGCGGGT CGGTGTGTCC GGCTTCGGTA TCAGCGGCAC CAACGCCCAC
GTGATTCTGG AGCAGGCACC TGACCCGCCC GCCGACGGGT CGGCCGACCC GGAGGGCGAC
CTGCCAACGC TGTGGACACT CTCCGCCCGT ACTTCGGAAG CGTTGCGTGC TCAGGCGTCC
CGGCTCCGCG TCTTCGTCGA CGAAAACCCG GAGTTGTCCC CGCATGACAT CGGACGTTCG
CTGGTGGAGA GCCGGTCGGC GTTCGACCAC CGGGCGGCGT TGGTCGGCCG GACCCGAGCG
GAGCTGCTGC AGGGGTTGAC CGCCCTGGCC GACGGCGAGT CCGCTGCCAC CCTGGTCGAG
GGGCGCGCGG TCACCGGTGG TGGCGTGGTA TTCGTCTTCC CTGGTCAGGG TTCGCAGTGG
GCCGGCATGG CCGCCGAGCT GTTCGACAGC AGCGAGATCT TCGCCGAGGA GTTCAGGGCG
TGTTCGCTGG CCCTGTCGGA ATGGATCGAC TGGTCACCGG TCGACGTGCT CCGTAAGGCC
GACGACGCGC TGCTCGGCCG GGTCGACGTC GTGCAGCCGA TGCTGTGGGC GGTGATGGTG
TCGCTGGCGG CGCTGTGGCG CTCGTATGGT GTCGAACCGG CCGCGGTGGT CGGGCACAGC
CAGGGCGAGA TCGCTGCCGC CTGCGCGATC GGTGCGCTCT CCCGCGATGA CGCCGCCAAG
GTCGTGGCGG TGCGCAGCCA AGCCCTGACC CGGATCTCGG GCAGCGGCGG CATGATGTCG
GTGCAGCTGG GCCGGGCCGT GCTCGAGCCC CGGATGCTGC CGTGGGGCGG ACGGATCTCG
GTCGCGGCCG TCAACAGCCC CCGGAGCGTC GTCGTTTCCG GCGAGGTCGA GGCGCTGCAG
GAGCTGCACG CTGCACTGGT GGCGGACGGG GTCAGCGCCC GGCTGATTCC CGTCGACTAT
GCCGCCCACT CCGCGCAGGT CGACCAGATC CAGGACGAAC TCGCCGACCT GCTCGCGACA
GTGAGGTCCG CCCCCGCACA CGTCCCCTTC TACTCCTGCG TCGAGGGCGA CGAGCGGCCG
ACCAACGACC TCGACGCCGG CTACTGGCAC CGTAACCTGC GCGAGACCGT GCTGTTCGAG
GACTCGATCA AGGCGGCCCT GGCGCGCGGC CACCGGCTGG TGCTTGAGGC CAGCCCGCAC
CCGGTGCTGG TGACGGCGGT ACAGGACGTG CTGGACGACA GCGGCGTCGA CGCCCACAGC
TGGGGCACGC TGCGCCGCTC CGCGGGCGGA CTGGACCGCT TCCTGCTCTC CCTGGCGCAG
GCCCATGTCA ACGGCGCGGA GATCGACTTC GGACCGCTGT TCGCCGGTGC CCGGCTGATA
TCCCTGCCCA CGTACGCGTT CCAGGGCGAG CGCTTCTGGA TCGACGCTGC TGCGCCGGCC
GGTGACGCGT CCTCGCTTGG CCTGGTCGCC GCCGGCCATC CGATGCTGGC CGCGGAGGTC
CCGCTGTCCG ACAGCGGCGC GTTGCTGCTG ACCGGCCGGC TCAGCGTCCA CGATCAGACC
TGGCTCGGCG ACCACGTGGT GTCCGGCATG ACCATCGTCC CCGGTACGGC GTTCATGGAG
ATCGCCTTCA AGGCCGCCGA ACGGGTCGGC TGCCCGCGCG TGGAGGAACT GGCCGTCCAG
GCGCCGCTGC TGCTTGACCC GGACGTCCCG ACCGCGTTGC AGGTGGTGAT CGCCGCACCG
CTGGACACCG GGCAGCGTCA GCTCACCGTC TACGCCCGTC CCGAGCCGGC CGGTGATGGC
CTCGACGAGG ACTGGACCCT GCACGCCACC TGCTGGCTCA CCGCCGCGAC TCCGGCCGCA
CCGGACACGT CGGATCTGCT GGTATGGCCG CCGCGTGACA CGGTGCCGTT CGCCGCGCAG
GAGTGCTACC GGCAGCTGGT CGCGGCCGGC TACGCCTTCG GCCCGACATT CCAGGGGCTG
CGCGAGGTGC ATGTCCGCGG CAAGGAAGTC TTCGCCCTGG TTAGCATCCC CGCCGAGGGC
CGCGCCGACG CCGCCGTCTT CGGTCTGCAC CCGGCCCTGT TCGACGCGAT GTTTCACTCA
CTGATCTCCG CCCGGCCGCA GGGGAGCGAC GAAGTGCTGC GGCTACCGTT CACCTGGACC
GGCGTCCAGT TGCACGCCGA GGGCGCCGTC ACGATGCGGG TCAAGATGAC GCTCACCAGC
GCCGACTCGC TGACGCTCGT CGCCGTCGAC GAATCGGGCG GCCTGGTCGT CTCGGCTGAC
GCCCTGCTGC TGCGCGAGAT CTCCGGGGCG GGCGGCCCGA CCGCTGCCAG CCGCCGCTAC
CGGTCCTTGT ACCGGCTCGA GTGGGCACCC TTGGTTTTGC GCGAGTCCGT CCCGGGGGAG
AGCCGCTGGG CTGTGCTCGG TGACTCCCCG GTCGCCGTCT CGTTGACCCG CCACGACAGC
TCGGTGCGGC GGCACGACGA CTGGTCCTCG CTGGCCGTCG CCGGGCGGGT GCCCGACCTG
GTGGTGCTGC CGGTGGACAC GCTCGGAGGT TCCTTCGAGG GCACCGATGT GCCCGGGGCT
GCCCGTCGGC TGCTGATCCG GGTGACCGGG CTGGTGCAGG ACTGGCTGAC CGACGAGCGG
TTCGCCGACT CGCGGCTGCT GGTGGTGACC CGGCAGGCGG TTCCGGCCGG GGCTGGCGAG
CCGCTCGACA TCGTCCAGTC GCCGCTGTGG GGACTGTTGC GGTCAGCCCA GTCGGAGAAC
CCGGACCGGC TGCTGTTGGT CGACCTTGAC GACGACGACA CGTCACTGGA GGTGTTGCCG
CGGGCGGTGG CGGCCGCCGT GGCCGCCGAC GAGTTCCAGG TCGTCGTGCG GGCTGGGAAG
GTCTTCGTGG CGCGGCTCGG ACTCGTGTCA GCGAGCGAGG CCGAGCAGGC CCCGGCCCTG
CACCCGGACG GCACCGTCCT GATCACCGGT GCGTCCGGTC TGCTCGCCGG TCACCTGGCC
CGCCATCTCG CGGGACGCGG CGTGCGCCAG CTGTTGTTGC TGAGCCGGCG GGGCGAGCGG
GCCCGTACCA CCGCGCCCCT GGTGGCCGAG CTCGCTGCCC TGGGGGCTCG GGCCACGGTC
GTCGCCTGTG ACACCGCGGA TCGCGCGGCG CTGGCCGATG TTCTGGCCGC GATTCCGACC
GAGCATCCGC TGACCGGGGT GGTGCACGCG GCCGGGGTGC TGCGGGACGG CACCGTCGCC
ACTCTGAACC CGGACCAGCT CGACGTCGTG CTGCGACCGA AGATCGACTC CGTTTGGAAC
CTGCACGAGC TCACCGCAGA CGCTGATCTG GCGCTGTTTG CGATGTTCTC GTCGGCCGCG
GGCACCCTTG GCGGTAGCGG GCAGGGCGCT TACGCCGCGG CGAACGTCTT CCTCGACAGC
CTGGCCGCGC GCCGCCGTAG CCTCGGGCTG CCCGCCACGT CCCTGGCCTG GGGCGCCTGG
GAGGCCAGCG CTGACCCGGG CAACCGTGGC ATGGCCGGTA ACCTGGCGAG CGTCGACGCC
GGCCGAGCCC GCCGGGGCGG ACTGTTGCCC TTCACCTTCG CGCAGGGCAT GGCGTTGTTC
GACACGGCGG TCGGGCTCGA CGAGGCGTTC GTGTTGCCGA TGCGGATGGA CCTGGCCGGC
GTGCGGGCCT CCGGCGGCCC GGTGCCATCG TTGCTGCGAG CCCTGGTCAA AGCTCCGACG
CGGCGGACTG CGGAGGCGGT AGCGGCCTCG CCGTCGTCGC TGCGCGACCG GTTTGGTGAG
CTGTCCGGTG AGGACCGGGA ACGATACGCG ATCGATCTCG TCCGCGGGCA GGCCGCCGCC
GTGCTCGGCC ACGTCTCCGC TCAACTTGTG CCCGCCGACG AGGCGTTCCG GGACCTGGGC
TTCGACTCAC TGACCGCGGT CGAGTTGCGC AACCGGCTCA AGACCACCAC CGGGCTCTCG
CTACCGGCCA CCCTCGTTTT CGACCATCCC AACCCGCGGG CGCTGGCCCG GTTTTTGCTG
ACCGAGCTGA TGCCGGGCGG GTCTGGCCCG GTGATGTCCG CCCAGGCCGA ACTCGACCGG
CTCGAGGCAG CACTGGCAGC TGCCCCGGTC CGGCCAGGCG ACGACGCGGT GGCCGAACGG
CTGAGCCGGA TCCTGTCCGG CTGGCTGCGC AACCGGCCGG CGGCCGAGCC CGTGGCCGAG
CCGGAGGCGG GCACCGACCT CGGGACCGCG ACCACCGACG AGATCTTCGC CTACATCGAC
AACGTGCTCG GACGCAACAG GGCGTGA
 
Protein sequence
MPDDKTLLDY LKWVSADLHD TQERLRELEQ AQREPIAVVG MSCRFPGGVD GPDDLWRLVE 
TGTDAITGFP TDRGWDLDAI ADSVAVHEGG FLAGADRFDA GFFGISPREA TAMDPQQRLL
LEGAWEALES AGIDPRSLRG TRTGVFVGSN NQDHVIVLSS ATADRTGQGL TGATASVLSG
RVSYVLGLEG PALTVDTACS SSLVSLHLAV QSLRRGECSM ALAGGVALMA TPAMFLESSG
QGVLALDGRC KAYAEGADGT GWGEGVGWVL VQRLSDAVRE GRRVLAVVTG TAVNQDGASN
GLTAPSGIAQ RRVIRAALDD AGLSAADVDA VEGHGTGTAL GDPIEVGALL ETYGREHAAE
RPLWLGSLKS NIGHTQAAAG VGGIIKMIQA IRHGILPGTL HASQRTSRVE WDRGGLALLT
EATPWPETGA PRRAGVSSFG ISGTNAHVIL EQAPPKEAAE PEVTRTPGLL LWVLSGHTEE
ALRGQARRLL TYVEDNPQLS ATDIAWSLAT TRSRLAHRAV VLGADRPGLL RGLADLARGE
PSAAVVQGTA HFGDKVVFVF PGYGSQWRGM ALELIEQEPA FRDEFLACET AFLPWLGWSS
VAALRGDPEA PDLEEFATTQ VLLFIVMVAL AALWRSYGLE PSGVIGHSQG EIAAAYVAGA
VTREDAARYV VARARLMQQH LDGRGGMVAV QLSADELRER LTPYGDRLAL AAINGPCAVV
MSGEDAALDE LAGQLVAEGV RVHKLALNVA AHSAQLDAYR AEIEEMLATL RPVSSDVPFF
STVTGGQLDT VGLDAGHWFS NLRQTVLFEQ ATRTALTQGY RLMLEVSAHP VVAMAVQDII
DDTGIPAIPL TTLRRGEGGR DRFLRALAEA HVQGAPVDFG VLFGGTGARR VALPTYAFQH
ERFRLDPGTT TGDASSLGLT PLDHPLVTAA IALPEPSGQL LTGRLTRRTH PWLADHIVLG
AALLPGTAHL ELAFRAGDQV GCDTVEELTL QTPLTLPEHD GVALQVFVGG PQDDGRRSVA
VWSRRDTPDP ADDGDEWTCH ATGLLGHAAA TPVDGLTSWP PAAAAVDIDG LYERLSAAGY
GYGPAFRGLR SVWSDGGDIY AEVTLPEETV ADAGRFGLHP ALLDAAFHAL IAARPPQRSA
TRLLFSWAGV RLVRAGASAL RIRLRPRADG TVTLLAADVA GTPVATADAL VMREVTETSL
IPDPARELFR EQLQPLVVPR QAADEPVIWT DPGGLLESLT EPGTRYAVFA APTPPGVGVV
AAARQVTERL RTLLSSWLTS PGAESTTLVI RTRNARSGSP DPVQAAGRGV VRSFQASHPG
RILVLDTASA DEPVAAVISA ALAAEEDDVV LHDETAMAAR LVRVPAGDLD LPWSGTVLVT
GNAGTRAAEV ARHVAARGAA RVVLAGAGEP DHAGVEAVAC DLTDADALTA LVSDVAPDVV
LHAAETGDPV ATAWALDRAA TSVDLRAFVL FSTADGVLGG AGRAERSAAS AFTDALVRRR
RATGRVGQAL AWGSWASTAG GLALLDAAVV VDEAVVVPFR PRARGNVPPL LRSLVRAPLR
PTVDAVAQSS SALARRIGGL DPAERVKALV ELVQSEAAVV LGFADVRAVP ATRAFRDIGF
VSMNAVELRN RLTAVTGLPL AATVVFDHPS AVALGTHLGT LLLGGAPAET VEPVTDRVVD
EPIAVIGMSC RFPGDVRSPE DLWRLVADGV DATGDLPGNR GWNVDSFYDP APGQPGRSYV
RRGGFVRDAD RFDAGFFGIN PREALAMDPQ QRLLLETAWE GLEHAGLDPA TLRGSDTGVF
VGSNGQDHAL VLSGAVGELS GYRLTGTSAS IMSGRISYEF GFEGPALTVD TACSSSLVSL
HLAAEALRGG ECSLALAGGC SIICTPTQFV EFSMQNALSP DGRSRAFSAG SNGFGMAEGV
GWLVLERLSE ARRHGHPVLA VVRGSAVNQD GASNGLTAPN GPSQERVIRQ ALASAGLTPA
DVDAVEAHGT GTPLGDPIEA TALLNTYGRD RTGPPLLLGS LKSNFGHAQA AAGVGGVIKS
VMALRAGVLP PTLHADEPTP QVDWSSGGVQ LLTERRDWPR TDRPRRVGVS GFGISGTNAH
VILEQAPDPP ADGSADPEGD LPTLWTLSAR TSEALRAQAS RLRVFVDENP ELSPHDIGRS
LVESRSAFDH RAALVGRTRA ELLQGLTALA DGESAATLVE GRAVTGGGVV FVFPGQGSQW
AGMAAELFDS SEIFAEEFRA CSLALSEWID WSPVDVLRKA DDALLGRVDV VQPMLWAVMV
SLAALWRSYG VEPAAVVGHS QGEIAAACAI GALSRDDAAK VVAVRSQALT RISGSGGMMS
VQLGRAVLEP RMLPWGGRIS VAAVNSPRSV VVSGEVEALQ ELHAALVADG VSARLIPVDY
AAHSAQVDQI QDELADLLAT VRSAPAHVPF YSCVEGDERP TNDLDAGYWH RNLRETVLFE
DSIKAALARG HRLVLEASPH PVLVTAVQDV LDDSGVDAHS WGTLRRSAGG LDRFLLSLAQ
AHVNGAEIDF GPLFAGARLI SLPTYAFQGE RFWIDAAAPA GDASSLGLVA AGHPMLAAEV
PLSDSGALLL TGRLSVHDQT WLGDHVVSGM TIVPGTAFME IAFKAAERVG CPRVEELAVQ
APLLLDPDVP TALQVVIAAP LDTGQRQLTV YARPEPAGDG LDEDWTLHAT CWLTAATPAA
PDTSDLLVWP PRDTVPFAAQ ECYRQLVAAG YAFGPTFQGL REVHVRGKEV FALVSIPAEG
RADAAVFGLH PALFDAMFHS LISARPQGSD EVLRLPFTWT GVQLHAEGAV TMRVKMTLTS
ADSLTLVAVD ESGGLVVSAD ALLLREISGA GGPTAASRRY RSLYRLEWAP LVLRESVPGE
SRWAVLGDSP VAVSLTRHDS SVRRHDDWSS LAVAGRVPDL VVLPVDTLGG SFEGTDVPGA
ARRLLIRVTG LVQDWLTDER FADSRLLVVT RQAVPAGAGE PLDIVQSPLW GLLRSAQSEN
PDRLLLVDLD DDDTSLEVLP RAVAAAVAAD EFQVVVRAGK VFVARLGLVS ASEAEQAPAL
HPDGTVLITG ASGLLAGHLA RHLAGRGVRQ LLLLSRRGER ARTTAPLVAE LAALGARATV
VACDTADRAA LADVLAAIPT EHPLTGVVHA AGVLRDGTVA TLNPDQLDVV LRPKIDSVWN
LHELTADADL ALFAMFSSAA GTLGGSGQGA YAAANVFLDS LAARRRSLGL PATSLAWGAW
EASADPGNRG MAGNLASVDA GRARRGGLLP FTFAQGMALF DTAVGLDEAF VLPMRMDLAG
VRASGGPVPS LLRALVKAPT RRTAEAVAAS PSSLRDRFGE LSGEDRERYA IDLVRGQAAA
VLGHVSAQLV PADEAFRDLG FDSLTAVELR NRLKTTTGLS LPATLVFDHP NPRALARFLL
TELMPGGSGP VMSAQAELDR LEAALAAAPV RPGDDAVAER LSRILSGWLR NRPAAEPVAE
PEAGTDLGTA TTDEIFAYID NVLGRNRA