Gene Amir_5064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5064 
Symbol 
ID8329262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6025894 
End bp6039243 
Gene Length13350 bp 
Protein Length4449 aa 
Translation table11 
GC content75% 
IMG OID644945500 
Productamino acid adenylation domain protein 
Protein accessionYP_003102732 
Protein GI256379072 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTCGC TGCCGCTCAC CGGCGCCCAG GCGGGCGTGT GGCTCGCCCA GCGGATCGAG 
CCGGACAACC CCGTCTACAG CATCGGCTGG TACCTGGAAC TGCGCGGCGA GCTGGACCTC
GCCCGCCTGG AAGCGGCCGT GCGGCAGGCG ATCGCGGAGG CGGCCGCGCT GCACGTGCGG
TTCGACGAGC GGGGGCAGAC CCCGGTCGAG CCCAGCGCCG CGGTCACGCA CCTGGAGTTC
GACTCGCTCG ACGCGGCCCG CGCGTGGATG GACGAGGACC TCGGGCAGCT CACCGACCTC
GCCGCCGGAC CCCTGTTCCA CCACGCCTTC CTGCGCCTGG GCGAGAACCA CCTGCTGTGG
TACCAGCGGT ACCACCACCT CGCCGTGGAT GCCTACGGCT GCGTGCTGAT CATGGCGCGC
GCCCGCGAGC TGTACGCCGA GCCCGGCCCG CTGCGCCCCC GCCCGCTGGA GGCGTTCGTC
GACGCCCAGC GGGCCTACCT GGACTCCCCG GCGCACGCCG CCGACCGCGC CTACTGGCAC
GCCAAGACCG CGGACCTGCC CGAGCCCGTG CGGCTCGTCC CCCGCGCCGA CGGGCAGGTG
CGCGTGGTCA ACCGGCTGGT CGCGGAACTG CCCGAGGTCA AGGGCTCCCG CCAGGTCATC
GCCGCCGTCG CCGCCTACGC GCACCGGGTC AGCGGCGCGC CCGAGGTCGT CCTCGGCCTT
CCGGTGACCG GCCGCCGCGA CCCGGTCTCC CGCACCACGC CCGCCATGGC CTCCACCGTG
CTGCCGCTGC GGCTGCGCGT CCGCCCGGAC AGCACGCCCG CCGACCTCGT CGCCCAGGCC
GACGCGCAGG TGCGCGAACT GCTGCCGCAC AGCGGGTACC GGGGCGAGGA CCTGGCCCGC
GAGCTGGGCC TGCGCGACGG CATCGCCGAG CTGGTCGGCC CCACCGTCAA CTACCTCGGG
TTCGGCGCGG AGACCGGGTT CCCCGGCGTG GACACCGCGT TCCACATGCT CTCCCTCGGG
CCGGTCAACG ACCTCACCGC CTCGTTCATG CCGTCTACCA GCGGCACCCT GGTGATCTCC
TTCGAGGCCA ACGCCGCCGT GTGCGACGAG GAGGAGCTGG CCGAGCACCT GCGCCGGTTC
CTGCGCGTGC TCGACGCCGA CCCGGACCTG CCGATCGCCC GCGTCGACCT CACCGACCCC
GACGAGCGCG CCGAGCTGCT CGACCTCGGC ACCAGCCCGC GCGAGACCGA GGACACCACC
TGGCCCGGCG CGGTCGAGCG GCAGGCCGCG CGCGTCCCGG ACGCGATCGC CGTGGTGTGC
GAGGACGTCC GGCTCACCTA CGCCGAGCTG AACGCCGCCG CCAACCGCAA GGCCCGCGCC
CTGCGCGCGC TCGGCGTCGG CCACGAGGAC GTCGTCGGCG TGATGCTGCC GCGCTCCGCC
GACCTGGTCG TCACCCTGCT CGCGATCATG AAGGCGGGCG CCGCGTACCT GCCGCTCGAC
CCCGACCACC CGCGCGAGCG CGTCACCGGC ATGGTCGCCG ACGCGGGCGC GAAGCTGGTG
ATCACCGAGG ACCTCGCCTG GGAGCTGCCG GACGACGGCG GCGACCTCGG GCTGGACATC
GCGCTGCACC AGGCCGCCTA CGTCATCTAC ACCTCCGGCT CCACCGGCCG CCCCAAGGGC
GTCGTCGTCA CGCACGAGGG CGTCGGCAGC CTCATCGCCA CCGCCGTCGA CCGCATCGGC
GTCACCGAGT CCAGCCGCGT CGCCCAGTTC GCGTCGGTCG GCTTCGACGT CGCGGTGTGG
GACCTGTGCA TGTCCCTCGG CGTCGGCGGG CGCGCGATCG TCGTGCCCTC GCACCGCCGC
GTCGCCGGCC CCGAGCTGAC CGACTACCTG GCCGAGCACG GCGCCACGCA CATGATCCTG
CCGCCGTCGC TGGTGGCCGC GCTGCCCGCC GAGTGCGAGC TGCCCGAGGG CGCGGTCCTG
GTGGTGGGCA CCGAGACCGT GCCGTCCGAG CTGATCGCCC GCTGGGCGCG CCGGATGGGC
GTGGTCGCCG CCTACGGCCT CACCGAGGCC ACCGTGAACT CCACGCTGTG GCCCGCGCAG
CCGGACTGGA CCGGGCCGGT GCCGATCGGC GTGCCCGACC CCAACACCCG CTGCTACGTC
CTGGACTCCG CGCTGCGGCC GGTCCCGGTC GGCGTGGAGG GCGAGCTGTA CGTGGGCGGT
CGCGGCCTGG CGCGCGGCTA CCACGGCCAG CACGCGCTCA CCGCCGCCCG CTTCGTCGCC
GATCCCTTCG ACGGGCCCGG TTCCCGCATG TACCGCACCG GCGACCGGGT CCGCTGGCGC
GCCGACGGCA GCCTGGACTT CCTGGGCCGC GCGGACGGGC AGGTGAAGAT CCGGGGCTTC
CGGGTCGAGC CCGGCGAGGT CGAGAGCGTG CTCATGGCCC ACCCCGGCGT CGCGCAGGCG
GCGGTGCGGG TGCGCCGCGA CCACCGGGGC GTCAAGCGCC TGGTCGCCTA CACCGCAGGC
GCCGCCGAGC CCGCGGAGCT GCGCGAGCGG GCCGCCGCGC TGCTGCCCGA GTACATGGTG
CCCTCGGTCG TGGTGCCGAT GACCGGGCCG CTTCCCCTGA CCCCCAACGG GAAGCTCGAC
GACAAGGCGC TGCCGGACCC GGACTGGACC GCGCTGGGCG GCGACGCCCC GGCCACCACC
CCGGCCGAGC ACGTGCTGGT CGACGCCTTC ACCGCCGTGC TGGGCCTCAA GCCCGGCGTG
CACGACAACT TCTTCGAGCT GGGCGGCGAC AGCATCGTCG CGATGCAGGT CGTCTCCCGC
GCCCGCGCCG CCGGACTGGT GATCACCCCG GCGCAGGTGT TCGAGCACCG CACCCCGGCG
AACCTGGCCG CCGCCTCCCG TCCCGTCGGC GCGGTGGCCG ACGCGGACGT GGAGCTGCTG
CCCGCCGACC TGGTCGCGCA GTTCCCCGGC GACCACGAGG TGCTGCCCGC CTCCCCGCTC
CAGCAGGGCT TCTACTTCCA CTCCCGCTTC GACGCGGACG ACTCCTACCT GGTGCAGGAG
GTCCTGTCGC TGGGCCGCGC GGTGCGGCGC GCCGACGTGC AGGGCCTGCT GGACCGGCAC
CCCGCGCTGC GCGCCGGGTT CGGCCAGGGC GCGCGCGGCG AGGTCGTGCA GCTCGTGCCG
CCGCACGTCG AGCTGCCCTG GCACGAGCAC CCGGCCGACA CCGACCTGGA GCGGGTGCTG
GCCGCCGAGC GCGCGGGTGG CTTCGACCTG GAGCACCCGC CGCTGCTGCG CGCCGCCGCC
GTCGGCGACA AGCTCGTCCT GACGCTGCAC CACATCCTGG TCGACGGCTG GTCGGTCGGC
CTGCTGCGCC GCGAGCTGGA CGGCGTGCGG GTCGGCGCGC CCGACCACCT CGCCTACCAC
CGCTGGCTCG CCACCCGCGA CGCCGAGGCC GCCCGCGCCG CCTGGCGCGC GGCGCTGTCC
GACGTGGACG AACCGACCCG CGTCGTCCCC GACCCCTCGC CCCGGCACGG CCGGCACAGC
GCGTTCCTGA CCGCCGAGGC CACCGCCGCG CTCACCACCG CCGCCCGCAG GCTCGGCCTG
ACCACCGGCT CCGTGCTGCA CGGCGCCTGG GGCCTGCTGA TCGGCGGTCG CACCGGCCGC
CGGGACGTGG TGTTCGGCAG CACCGTCGCC GGTCGTCCCG CCGAGGTCGA GGGCATCGAG
TCCGCCGTGG GCCTGTTCAT CAACACCGTG CCGGTGCGGC AGCGCTGGTC GCCCGCCGAC
ACCCTCGCCC AGGTCCTCAC CCGCTTCCAG GACGAGCAGG CCGCGCTGCT GGACCACCAG
CACCTCGGCC TCGCCGAGAT CCAGCGCGCG GCCGGTCAGG GCGAGCTGTT CGACACGCTC
GTCGTGGTCG AGAACTTCCC CGACGTGGTC ACCGACGTCG AGGTCCGCGA CCGGGTGCAC
TACCCGCTCG CCCTGATCGC GATCCCCGGC GAGCGCCTGG AGCTGCGGAT CAAGCACCGG
GTGCCCGACG CCGAGGCGGT CGCGCTCGCC GACCGCCTGG TGCGGCTGCT GGAGGTCTTC
GCCGCCGACC CGCACCGGCC GGTGGCCCGC GTCCCGCTCG TCACGCCCGG CGAGGCCGAG
TGGCTGGCCG CGCAGACCGG GCAGGCCGTG GTCGTCCCCG AGGGGACCCT GGTCGACCGC
ATCGCCGCGC GCGTCGCCCA GACCCCGGAC GCGGTGGCCG TCGTCTTCGA GGGCTCCACC
CTCACCTACG CCGAGCTCGA CCGCCGCTCG GCCGCCGTCG CGCGCGGGCT GCGCGAGCGC
GGCGCGGGGA AGGTCGTGGG CGTCGCGATC CCGCGCTCGG CGGAACTGGT CGTGGCGCTG
CTGGGCGTCC TGCGCTCCGG GGCCGCGTAC CTGCCGCTGG ACCTGGACTA CCCGGCCGAC
CGGGTCGAGT TCATGCTCGC CGACTCGGGC GCGGACGTGG TGCTGCGCCA GGAGGACTGC
CTCGCCGAGG ACCCCGCGCC CGGAGCAGGG CTCCCGGACG CCGACCTGCC CGACGCCGAC
CTGCCCGACG CCGCCCCCGC CCCCGACGAC CCGGCCTACC TGATCTACAC CTCCGGTTCC
ACCGGGCGGC CCAAGGGCGT CGTCGTCGCG CACCGCGCCG TCGTGAACCG GTTGGCCTGG
ATGCAGGACC ACTACGGGCT GGCCGCCGAC GACCGGGTGC TGCAGAAGAC CCCGTCCAGC
TTCGACGTGT CGGTCTGGGA GTTCTTCTGG CCGCTGTGCG AGGGCGCGGC GATCGTGCTC
GCCCGCCCGG ACGGCCACCG CGACCCCGCC TACCTCGCGG ACCTGGTGCG GGAGCAGGGG
ATCACCACGC TGCACTTCGT GCCGTCGATG CTCGCCGCGT TCCTCGGCGT GCCCGAGGTC
GCCGACGACC CGGCGTGGGC GCGCAGCCTG CGCCGGGCGT TCAGCAGCGG CGAGGCGCTG
ACCGGGGACG TCGCGGGCCG CTGGCGCGCG CTCACCGGCG TGCCGCTGCA CAACCTGTAC
GGGCCGACCG AGGCCGCCGT CGACGTCACC TGGTACCCGT TCCGGGGCGA GGCGGACCTG
GCCGTGCCGA TCGGCTTCCC GGTGTGGAAC ACCGGTCTGC ACGTGCTGGA CCCGCTGCTG
CGCCAGGTCC CGGCCGGGGT GCCGGGTGAG CTGTACCTCT CGGGCGTGCA GCTCGCCCTC
GGCTACCACG GCCGGTTCGC GCTGACCGCG GAGCGGTTCG TGGCCTCCCC GTTCGGCGGG
CCGGGGGAGC GGATGTACCG CACCGGCGAC CTCGTCGTGC GCCGCGCGGA CGGTGCGCTG
GTCTACCTGG GCCGCACCGA CCGGCAGGTC AAGGTGCGCG GCAACCGCAT CGAGCTGGGC
GAGGTGGAGG CCGCGCTCGC CGCCCTGCCC GGCGTCACCC GCGCCGCCGT CATCGCGCGG
GACAACCAGC TCCTGGGCTA CGTCACGCCC GCGACCGCCG ACGTGACCGC CCTGCGCGCC
GAGCTGGCGC GGGTCCTGCC CGCAGCCATG ACCCCGCACG CCCTGATCGC GCTGGCCGGC
TTCCCGCTGT CCCCCAGCGG GAAGCTCGAC CAGAACGCGC TGCCCGACCC CGCGCGGACC
ACCGCCGCCG AGCCCGGCAG GGCCGCCGCG ACGGAGCGGG AGGAGCTGCT CTGCGGGATC
GTCGCCGACG TCCTCGGCCT GCCCGCCGTC GGCCCGGACG ACGACTTCTT CGCGCTGGGC
GGGGACAGCA TCACGTCCAT CGCCGTCTCC ACCCGCGCCC GCCGCGCCGG GGTGGGGGTC
AGCCCGCGCG ACGTCTTCGC GCACCGCACG GTCGCCGCGC TGGCCGCGCT GGAGGACGTC
GTCGACGCGC GCGCCGACGA CCGCCCGCTG CTGGAGCTCA CCGACGCCCA GCGCGCCGCG
CTGCCCCCGC ACGAGGACGC CTGGCCGCTG TCCCCGCTGC AGGAGGGCCT GTTCTTCCAC
TCCAGCTACG ACACGGGCGC GCTCGACGTC TACCTGTCCC AGGAGGTCAT GGACTTCACC
GGCCGGGTCG ACGCCGACCG GCTGCGCGCC GCGGGCGGGC TCCTGCTGCG CCGCAACCCC
AGCCTGCGCG CGGGCTTCAC CAGCGACGGC GTGCCGCAGC CGGTGCAGTT CATCGCGCCG
GACCTGGAGA TCCCGCTGCG CGTGGTGGAG ATCGGCGGGA CCGGGGTGGC CGGTGACGCC
GAGGTGCGGG AGCTGCTGGC CGCCGACCGG CGCGAGCGGT TCGACCTCGC CGCGCCGCCG
CTGGCCCGGT TCCTGCTGCT GCGCCTGCCG GACGGCCGCG ACCGGCTCGT CATCACCCAC
CACCTGGTGC TGTGGGACGG CTGGTCGGCG TGGCTGTTCC TGGAGCAGCT GCTGGAGCTG
CTCCGCGACC CGGACGCGGA CCTGGTCGAA CCGGGCTCGT ACCGGGACTA CCTGGTGTGG
CTGGCCCGCC AGGACCACGA CGCCGCGCTC GGCGCGTGGC GCACCGCCCT GTCGGGCCTC
GCCGAGCCGA CCCTGGTCGC GCCGGTGGAC CGCACCCGCG AACCGGTCAT CCCCGCCGAG
CTGCACGCCG ACCTGCCGGA GGAGCTGACC GACCGGCTGC GGGCCCTGTC CCGCGCCCAC
GGCGTCACCG CGAACGCCGT GCACAACACC GCGTGGGCGC TCGTGCTGGC CGCCCAGGTC
GGCCGGGACG ACGTGGTGTT CGGCACCGCC GTCGCCGGGC GCCCCGTCGA GGTCGACGGG
GTCGAGAACA CCATCGGCAT GTTCCTCAAC ACCATCCCCA CCCGCGTCAC CGTCGACCCG
CGCGAGAGCG TGCTGGACCT GCTGGTCCGG GTGCAGGCCG AGCGCACCGC GCTGATGTCC
TACGAGCACG TGGGCCTCGG CGAGATCCAG CGGCAGGCCG GGCACCGGCA GCTGTTCGAC
ACCCTGTTCG TGCTGCGCAC CGGCGACGGC GACGACCGCT CCGCCGCGCT GGCCGCCCGG
CACGGCGTCA CCGGTGTGTC CAACGTGGAC GGAACGCACT TCCCGCTCAC CCTGATCATC
ACGCCCGGCC GCACCACGCG GGTCACCCTG GCCTCGCGCC CCGACCTGGT CGACGCCGAC
CAGGCGCGGG CCGTGCTCGA CCGCTACACC ACCGTGCTGG AGCGCCTGGT CACCCGGCCG
GACGCGCTGC TCGGCGCGAT CGACGTGCTG CCCGACGCCG AGCGCGCCGC GCTCGCCGCC
GAGTGGGGCG CCAGCGCCAA CCCGGTGGTC GACCGCACCA TCGCCGAGCT CCTCGCCGAG
CAGGCCGGAC GCACGCCCGA CGAGACCGCG CTGGTGTTCG GCGACCTGCG CCTGACCTAC
GCCGACCTGC AAGCCCGCGT CGACCGCTTG GCCTCCGTGC TCGTCGCGCA GGGCGCCGGA
CCGGAGCGGG TGATCGCGCT TGCCCTGCCG CGCGGCGTGG ACATGGTCGT CGCGCTGTTC
GCCGTGCTGC GCACCGGGGC CGCGTACCTG CCGCTCGACC TGGACCACCC GGCCGACCGG
CTGCGCCTGA TGATCGCCGA CACCGACCCG CTGCGCGTGC TGTCCACCGC GTCGGTCGGC
CTCGTCGACG ACGCGCTGCT CGTCGAGGAC CTGCCGGACG CGCCGGAAGT CCTGGACTGG
CCGAGGTTCT CGCTCGACCA CCCGGCGTAC GTCATCTACA CCTCCGGCTC CACCGGCAGG
CCCAAGGGCG TCGTCACCCC GTACCGGGGC CTGACGAACA TGCAGCTCAA CCACCAGCGC
GCGATCTTCG CCCCGGCCAT CGCCTCGGCG GGCGGTCGGC GGCTGCGGGT GGCCCACACC
GTGTCGTTCG CGTTCGACAT GTCGTGGGAG GAGCTGCTGT GGCTGGTGGA GGGCCACGAG
GTGCACGTGT GCGACGAGGA GTTGCGCCGC GACGCCGCCG CGCTGGTCGC GTACTGCGAC
GAGCACCGGA TCGACGTGGT CAACGTGACC CCGACCTACG CGCAGCTGCT GATCGAGGAG
GGCCTGCTGG GCGGGCACGT CCCGCCGCTG GTGCTGCTCG GCGGCGAGGC CGTGCCGGAC
TCGGTGTGGC AGCGCCTGCG CGACACCCCC GGCACCTTCG GCTACAACCT GTACGGCCCC
ACCGAGTACA CCATCAACAC CCTCGGCGCG TCCACTGAGG ACAGCGCGAC CTCCACCGTG
GGCCGGGCGA TCTGGAACAC CCGCGCCTAC GTGCTCGACG CGCGCCTGCG GCCCGTGCCG
CCCGGCGCGC CCGGCGAGCT GTACATCGCG GGCATCGGCC TGGCGCGCGG CTACCACGAC
CGGTTCGGCC TCACCGCGTC CCGCTTCGTC GCCGACCCGT TCTGCGCCGT CCCCCCTGGG
GGCTCCGGCG AGCGCATGTA CCGCACCGGC GACCTGGTCC GCGCCCGCCC CGACGGGAAC
CTGGACTTCC TCGGCCGCAC CGACGACCAG GTCAAGATCC GGGGCTACCG GGTCGAGCTG
GGCGAGGTCA CCTCGGCGCT GGCCGCGCTC CCCGGCGTCA CCCACGCCGC CGTGGTCGTG
GACGCCGGAC CGGGCGGGGT GAAGCGGCTC ATCGGCTACG CGGTGTCCGA AGTGGACACC
GCGGAGCTGC GCGAATCGCT CAAGTACGGC CTGCCCGACT ACATGGTGCC CGCCGCCGTC
ATGGCCGTGG ACGCGCTGCC GCTGACCGTC AACGGCAAGC TCGACACCCG CGCCCTGCCC
AAGCCCGTCG TCACCGGCGG GGCCGCCTCC CGCGAACCGG CGAACGAGCG CGAGCGCGTG
CTGCGCGAGC TGTTCGCCGA CCTGCTCGGC GTGCCCGAGG CCGGTGTGGA GGACAGCTTC
TTCGACCTCG GCGGGCACTC GCTGCTGGCC ACCAGGCTGG TCAGCCGGGC CCGCACCGCG
CTCGGCGTCG AGCTGTCGAT CCGCGACCTG TTCGACGCCC CCACCCCCGC CCTGCTCGCC
GCCCGCGCGA GCGGCGGCGC CCCGGCCAGG CCACCGCTGC GCCCCGCCGA CTGGCCGGGC
GGGCGGCCCG AGGAGCTGCC GCTCTCCTAC GCGCAGCAGC GGCTCTGGCT CATCCAGCAG
ATGGAGGGCA CCTCCGCCGC GTACAACTTC CCGCTGGTCT TCCGGCTGCG CGGCGGGCTG
GACGTGGCCG CGCTGCGCGA CGCGCTCCGG CTGCTCACCG ACCGGCACGA GGCGCTGCGC
ACGGTCTTCG GCGAGCGCGA CGGCGTGCCG TTCCAGCGGG TGCTGACCGA CGTCGACCCG
GTGTTCCGGA GCGTGGAGGC CACCGAGGCG CAGGCCGCCG ACCTGGTCCG CGCCGAGGTG
GGCAGGCCGT TCGACCTGGA GCGCGAGCTG CCGGTGCGGG TCACGGTCGC GCGCCTGGCC
GACGACGAGC ACCTCGTCGT CGTGCTGCTG CACCACATCA CGACCGACGA GTGGTCCGAC
CGGCCGTTCC TGCGCGACCT CGCCGAGTCC TACCGGGCGC TGCGCGCGGG CGACGAGCCC
GACCTGCCCG CGTTCCCCGT GCAGTACGCG GACTACGCGC TGTGGCAGCG GGAACTCCTG
GACGTCGTCG GCGACGAGCA GCTCGCGCAC TGGGAGCGCG CGCTCGCCGG GCTGCCCGAG
GAGCTGGAGC TCCCGGTCGA CCGGCCGAGG CCCGCGCGCC CGACGTTCCG GGGCGCGGAC
CTGGAGGTGG AGTTCCCGGC CGAGGTGGTC GCCGGGGTGC GGAAGATCGC CCAGGAGACC
GGGGCCAGCG CGTTCATGGT CCTGCAGGCC GCCGTCGCCG CGCTGCTCGG CGCGCTCGGC
GCGGGCGACG ACATCCCGCT CGGCGCGCCG ATCGCCGGGC GCACCGACGA GGCGCTGGAC
GACCTGGTCG GGTTCTTCGT CAACACCCTG GTGCTGCGCG CGGACCTGAC CGGCGACCCG
AGCTTCACCG AGCTGGTCGG CCGGGTGCGG GAGACCTCGC TGGCCGCGTT CTCGCACCAG
GACGTGCCGT TCGAGGCCGT GGTCGACCGG GTCAACCCGG TCCGCTCGGC CGCCCGCAAC
CCGCTGTTCC AGGTCATGGT GGGCTACCAC AGCCGGGGCG CCGAGCAGTT CGCGCTCGAC
GGGCTCGCCG TCGAGTGGCA GCCGCACGAG ACCGGCACCG CCAAGTTCGA CCTGGTGTTC
AGCTTCACCG ACCGGGCGGG CGGCCCGATC GGCTGCCGCG TCGAGTACGC CACCGACCTG
TTCGACGCGA CCACCGCGCA CCGGCTCGCG CGGCGGCTGA CCCGGCTGCT CGCCACCGTC
GTCGCCGATC CCGGCGCGCC GGTCGGCGCG GTCGACCTGC TCGCCGACGA CGAGCGGCAG
CTGGTGCTGC GCGAGTTCAA CGACACCGCG CGGGACGTGC CGGAGCTGAC GTTCACCGAG
CTGTTCGCGC GGGTCGTGGC GCGCAAGCCG GACGAGGTCG CGGTGGTCGA CCGGGCCCGC
TCGGAGAGCT ACGCCCGGCT CGACGCCCGC TCCAACCGGA TCGCCCGGCT GCTGGCGGCG
CGGGGCGTGC GGCGCGAGTC GGTGGTGGGC GTGGCCGTGC CGAAGTCGGT GGAGATGGTC
GCGACCGTGC TCGCCGTGCT CAAGCTCGGC GCCGCCTACC TGCCGCTGGA GCTGTCGAAC
CCGGCGGACC GGATCGCCTA CGTGATCGAG GACTCCGGGG CCGAGCTGGT CGTGGCGACC
AGCGAGGTGG TGGTGCCCGG CGACGTGCCC AGGCTGGACC TGGACGCGGT CGCCGACGAG
CTGGCGGCGG CCGACGAGTC CACTGTGGAC GGTGGTCCGG CCTCGCTCGA CTCGGCCATG
TACGTCATCT ACACCTCCGG CTCCACCGGG CGGCCGAAGG GCGTGGTGGT CCCGCACGAG
GGCATCGCCA GCCTCGCGGC CACCGCGATC GACCGGATGG GCCTGACCGA GGACAGCCGG
GTGCTCCAGT TCGCCTCGGT CGGCTTCGAC GTCGCGGTGT TCGAGCTGAC CATGGCGCTG
TGCGTCGGCG GCAGGCTCGT GCTCGCGCCG GACGAGGTGC GCGTCGCCGG TCCCGCGCTC
ACGGACTTCC TGCGGGACAG GCGGATCACG CACATGGTGC TGCCGCCGTC CCTGGTGTCC
GCGCTGCCCG CCGACTGCGA GCTGCCCGAC GGCTCGACGA TCCTGGTCGG CACCGAGACC
GTGCCGCCGG ACCTGATCGG CCGCTGGGCC GGTCGGCTCA ACCTGCTCGC CGCGTACGGG
CTCACCGAGG CCACGGTGAA CTCCACGCTG TGGCGCGCCG TTCCCGGCTG GGGCGGCGCG
GTGCCGATCG GCGTGCCCGA CCCGAACACC ACCGTGCACG TGCTGGACTC CCGGCTGCGG
CCGGTGCCGC CCGGCGTGGT GGGGGAGCTG TACGTGTCCG GGCGCGGGTT GGCGCGCGGC
TACCTCGGCC GCCCCGACCT GACCGCGTCC CGGTTCGTGG CCTCGCCGTT CGGCCCGCCC
GGCGCGCGGA TGTACCGGAC CGGCGACCGG GCGCGGTGGC GCGCGGACGG GAACGTGGAC
TTCCTCGGCC GGGTCGACAC CCAGGTCAAG ATCCGCGGGT TCCGGGTCGA GCTGGGCGAG
GTGGAGGCGG TGCTCGCGGG GGCAGGCGGC GTCGCCCAGT CGGCCGTGGT CGCCGACCGC
GAGGGCGAGA TCACCAGGCT GGTCGGGTAC GTCGTGCCGG TCTCCGGCGA GGTCGAGCTC
GACCCGTCCG GGCTGCGGTC GCGCGTCGCG GAACGCCTGC CGGAGCACAT GGTCCCGGCC
GCCGTGGTCG TCCTGCCCGG CCCGCTCCCG CTGACCCCCA ACGGGAAGCT GGACCGGCGC
GCGCTGCCGC GCCCGGACTG GTCGGCCCTC GCGGGCGACG ACCGGCCGAG CACGCCGGAG
CAGAAGGCGG TGGCCGAGCT GTTCGCCGAG GTGCTGGGCG TCTCGCCGGG CGCGCACGAC
AGCTTCTTCG AGCTGGGCGG GCACTCGATG GCGTCGATGC GGCTGGTCGG GCGCATCCGG
GCGGTGTTCG GGGTCGACCT GGCGCTGCGG GACGTGTTCG ACTCGCCGAC CGTCGCCGGG
CTCGCGGGCA GGCTCAGCGG CGCGGTGTCC GAGCGGCCGA GGCTGCGGCG GATCGAGCGG
CCGGTGGAGC TGCCGATGTC GGCGGTGCAG CGGGCGAGGT GGGCCACCCG GGACGGGTGG
GACCACGCGC TGGCGCTGCA CGGGACCTTC GACGCGGGTG CTCTCGCTGC GGCATATGGG
GACGTGGTCG CCAGGCACGA GCCGCTGCGG GTCGTGCTGG ACGGCCGGGT GCAGCGCCCG
GTTCCCGCGC CGGAGCTGGA AACCGTTGTG GTGGGGGAAC TCGACGCCGC GCTGGCCGCG
TTCGCCCGCG AGGAGGTCGA CCTGCTGGCC GGGCCGCCCG CGCGCGCCCG GCTGGTGGTG
GCGGGGGAGC GGCAGGCGCT GCTGCTGACC ATGCACTACC TGGCCGTGGA CGAGTGGTCG
GTGGTCCCGC TGGTCCGGGA CCTGGTTCAG GCTTACCGGG CAAGGGTTTC CGGTGACGTG
CCGCAGTGGT CGGAGCTGCC GGTCGGGTAC GCGGACTACG CGCTGTGGTC GGCCGAGGTC
GCGGAGGTGG TGGGGGAGCG GCAGCGGGCG TTCTGGCGCG ACGCGCTCCG CGACCTGCCG
AGGCTCGAAC TCCCTTACGC CGGAACGGCT CCCGGTCCCC GCTACGCGGC CGACGTGGTG
CCGTTCGAGC TGCCCGCCTC GCTGCACGAG GCGATCGGCG CGGTGGCGTC GGCGACCGGG
ACGAGCCTGT TCATGGTGGT GCAGGCGGCG TTGGCGGTGG TGCTGTCCGA GCACGGCGCG
GACGTGCCGA TCGGCGCGCT GGTGGCCGGG CGGACCGAGG AGGCGCTGGC CGACGTGGTC
GGGTCGTTCT TCACGCCCGT GGTGCTGCGC ACCGACGTGG CCGGGTCGCC GACCAGGGCG
GAGCTGCTGG CGCGGGTGCG GGAGGCGGAC CTGGCGGCGT TCGACCACGC CGACGTGCTG
CCCGGCGCGG ACCCGCAGGT CGTGGTCGTG CACCACGAGC GGGCGGCGCT CGGCGGCGAG
CTGGGCGAGC TGGTCGCGGT GCCGACCGGG AGCACGACGG CGGAGCTGAC GCTCAGCTTC
TACGAGCCCA GGGGCGGCGG ACCCGTGCCC TGCTATCTGG TGTACGCGGT GGACCTGTTC
GAGCGGGCCA CGGTGGAGCG GTGGGCCGAG CGGTTGCGCG CCGAGCTGGA CGCGCTGGTC
GGCGACCTTG ACGGGAGGAT CGACTCATGA
 
Protein sequence
MTSLPLTGAQ AGVWLAQRIE PDNPVYSIGW YLELRGELDL ARLEAAVRQA IAEAAALHVR 
FDERGQTPVE PSAAVTHLEF DSLDAARAWM DEDLGQLTDL AAGPLFHHAF LRLGENHLLW
YQRYHHLAVD AYGCVLIMAR ARELYAEPGP LRPRPLEAFV DAQRAYLDSP AHAADRAYWH
AKTADLPEPV RLVPRADGQV RVVNRLVAEL PEVKGSRQVI AAVAAYAHRV SGAPEVVLGL
PVTGRRDPVS RTTPAMASTV LPLRLRVRPD STPADLVAQA DAQVRELLPH SGYRGEDLAR
ELGLRDGIAE LVGPTVNYLG FGAETGFPGV DTAFHMLSLG PVNDLTASFM PSTSGTLVIS
FEANAAVCDE EELAEHLRRF LRVLDADPDL PIARVDLTDP DERAELLDLG TSPRETEDTT
WPGAVERQAA RVPDAIAVVC EDVRLTYAEL NAAANRKARA LRALGVGHED VVGVMLPRSA
DLVVTLLAIM KAGAAYLPLD PDHPRERVTG MVADAGAKLV ITEDLAWELP DDGGDLGLDI
ALHQAAYVIY TSGSTGRPKG VVVTHEGVGS LIATAVDRIG VTESSRVAQF ASVGFDVAVW
DLCMSLGVGG RAIVVPSHRR VAGPELTDYL AEHGATHMIL PPSLVAALPA ECELPEGAVL
VVGTETVPSE LIARWARRMG VVAAYGLTEA TVNSTLWPAQ PDWTGPVPIG VPDPNTRCYV
LDSALRPVPV GVEGELYVGG RGLARGYHGQ HALTAARFVA DPFDGPGSRM YRTGDRVRWR
ADGSLDFLGR ADGQVKIRGF RVEPGEVESV LMAHPGVAQA AVRVRRDHRG VKRLVAYTAG
AAEPAELRER AAALLPEYMV PSVVVPMTGP LPLTPNGKLD DKALPDPDWT ALGGDAPATT
PAEHVLVDAF TAVLGLKPGV HDNFFELGGD SIVAMQVVSR ARAAGLVITP AQVFEHRTPA
NLAAASRPVG AVADADVELL PADLVAQFPG DHEVLPASPL QQGFYFHSRF DADDSYLVQE
VLSLGRAVRR ADVQGLLDRH PALRAGFGQG ARGEVVQLVP PHVELPWHEH PADTDLERVL
AAERAGGFDL EHPPLLRAAA VGDKLVLTLH HILVDGWSVG LLRRELDGVR VGAPDHLAYH
RWLATRDAEA ARAAWRAALS DVDEPTRVVP DPSPRHGRHS AFLTAEATAA LTTAARRLGL
TTGSVLHGAW GLLIGGRTGR RDVVFGSTVA GRPAEVEGIE SAVGLFINTV PVRQRWSPAD
TLAQVLTRFQ DEQAALLDHQ HLGLAEIQRA AGQGELFDTL VVVENFPDVV TDVEVRDRVH
YPLALIAIPG ERLELRIKHR VPDAEAVALA DRLVRLLEVF AADPHRPVAR VPLVTPGEAE
WLAAQTGQAV VVPEGTLVDR IAARVAQTPD AVAVVFEGST LTYAELDRRS AAVARGLRER
GAGKVVGVAI PRSAELVVAL LGVLRSGAAY LPLDLDYPAD RVEFMLADSG ADVVLRQEDC
LAEDPAPGAG LPDADLPDAD LPDAAPAPDD PAYLIYTSGS TGRPKGVVVA HRAVVNRLAW
MQDHYGLAAD DRVLQKTPSS FDVSVWEFFW PLCEGAAIVL ARPDGHRDPA YLADLVREQG
ITTLHFVPSM LAAFLGVPEV ADDPAWARSL RRAFSSGEAL TGDVAGRWRA LTGVPLHNLY
GPTEAAVDVT WYPFRGEADL AVPIGFPVWN TGLHVLDPLL RQVPAGVPGE LYLSGVQLAL
GYHGRFALTA ERFVASPFGG PGERMYRTGD LVVRRADGAL VYLGRTDRQV KVRGNRIELG
EVEAALAALP GVTRAAVIAR DNQLLGYVTP ATADVTALRA ELARVLPAAM TPHALIALAG
FPLSPSGKLD QNALPDPART TAAEPGRAAA TEREELLCGI VADVLGLPAV GPDDDFFALG
GDSITSIAVS TRARRAGVGV SPRDVFAHRT VAALAALEDV VDARADDRPL LELTDAQRAA
LPPHEDAWPL SPLQEGLFFH SSYDTGALDV YLSQEVMDFT GRVDADRLRA AGGLLLRRNP
SLRAGFTSDG VPQPVQFIAP DLEIPLRVVE IGGTGVAGDA EVRELLAADR RERFDLAAPP
LARFLLLRLP DGRDRLVITH HLVLWDGWSA WLFLEQLLEL LRDPDADLVE PGSYRDYLVW
LARQDHDAAL GAWRTALSGL AEPTLVAPVD RTREPVIPAE LHADLPEELT DRLRALSRAH
GVTANAVHNT AWALVLAAQV GRDDVVFGTA VAGRPVEVDG VENTIGMFLN TIPTRVTVDP
RESVLDLLVR VQAERTALMS YEHVGLGEIQ RQAGHRQLFD TLFVLRTGDG DDRSAALAAR
HGVTGVSNVD GTHFPLTLII TPGRTTRVTL ASRPDLVDAD QARAVLDRYT TVLERLVTRP
DALLGAIDVL PDAERAALAA EWGASANPVV DRTIAELLAE QAGRTPDETA LVFGDLRLTY
ADLQARVDRL ASVLVAQGAG PERVIALALP RGVDMVVALF AVLRTGAAYL PLDLDHPADR
LRLMIADTDP LRVLSTASVG LVDDALLVED LPDAPEVLDW PRFSLDHPAY VIYTSGSTGR
PKGVVTPYRG LTNMQLNHQR AIFAPAIASA GGRRLRVAHT VSFAFDMSWE ELLWLVEGHE
VHVCDEELRR DAAALVAYCD EHRIDVVNVT PTYAQLLIEE GLLGGHVPPL VLLGGEAVPD
SVWQRLRDTP GTFGYNLYGP TEYTINTLGA STEDSATSTV GRAIWNTRAY VLDARLRPVP
PGAPGELYIA GIGLARGYHD RFGLTASRFV ADPFCAVPPG GSGERMYRTG DLVRARPDGN
LDFLGRTDDQ VKIRGYRVEL GEVTSALAAL PGVTHAAVVV DAGPGGVKRL IGYAVSEVDT
AELRESLKYG LPDYMVPAAV MAVDALPLTV NGKLDTRALP KPVVTGGAAS REPANERERV
LRELFADLLG VPEAGVEDSF FDLGGHSLLA TRLVSRARTA LGVELSIRDL FDAPTPALLA
ARASGGAPAR PPLRPADWPG GRPEELPLSY AQQRLWLIQQ MEGTSAAYNF PLVFRLRGGL
DVAALRDALR LLTDRHEALR TVFGERDGVP FQRVLTDVDP VFRSVEATEA QAADLVRAEV
GRPFDLEREL PVRVTVARLA DDEHLVVVLL HHITTDEWSD RPFLRDLAES YRALRAGDEP
DLPAFPVQYA DYALWQRELL DVVGDEQLAH WERALAGLPE ELELPVDRPR PARPTFRGAD
LEVEFPAEVV AGVRKIAQET GASAFMVLQA AVAALLGALG AGDDIPLGAP IAGRTDEALD
DLVGFFVNTL VLRADLTGDP SFTELVGRVR ETSLAAFSHQ DVPFEAVVDR VNPVRSAARN
PLFQVMVGYH SRGAEQFALD GLAVEWQPHE TGTAKFDLVF SFTDRAGGPI GCRVEYATDL
FDATTAHRLA RRLTRLLATV VADPGAPVGA VDLLADDERQ LVLREFNDTA RDVPELTFTE
LFARVVARKP DEVAVVDRAR SESYARLDAR SNRIARLLAA RGVRRESVVG VAVPKSVEMV
ATVLAVLKLG AAYLPLELSN PADRIAYVIE DSGAELVVAT SEVVVPGDVP RLDLDAVADE
LAAADESTVD GGPASLDSAM YVIYTSGSTG RPKGVVVPHE GIASLAATAI DRMGLTEDSR
VLQFASVGFD VAVFELTMAL CVGGRLVLAP DEVRVAGPAL TDFLRDRRIT HMVLPPSLVS
ALPADCELPD GSTILVGTET VPPDLIGRWA GRLNLLAAYG LTEATVNSTL WRAVPGWGGA
VPIGVPDPNT TVHVLDSRLR PVPPGVVGEL YVSGRGLARG YLGRPDLTAS RFVASPFGPP
GARMYRTGDR ARWRADGNVD FLGRVDTQVK IRGFRVELGE VEAVLAGAGG VAQSAVVADR
EGEITRLVGY VVPVSGEVEL DPSGLRSRVA ERLPEHMVPA AVVVLPGPLP LTPNGKLDRR
ALPRPDWSAL AGDDRPSTPE QKAVAELFAE VLGVSPGAHD SFFELGGHSM ASMRLVGRIR
AVFGVDLALR DVFDSPTVAG LAGRLSGAVS ERPRLRRIER PVELPMSAVQ RARWATRDGW
DHALALHGTF DAGALAAAYG DVVARHEPLR VVLDGRVQRP VPAPELETVV VGELDAALAA
FAREEVDLLA GPPARARLVV AGERQALLLT MHYLAVDEWS VVPLVRDLVQ AYRARVSGDV
PQWSELPVGY ADYALWSAEV AEVVGERQRA FWRDALRDLP RLELPYAGTA PGPRYAADVV
PFELPASLHE AIGAVASATG TSLFMVVQAA LAVVLSEHGA DVPIGALVAG RTEEALADVV
GSFFTPVVLR TDVAGSPTRA ELLARVREAD LAAFDHADVL PGADPQVVVV HHERAALGGE
LGELVAVPTG STTAELTLSF YEPRGGGPVP CYLVYAVDLF ERATVERWAE RLRAELDALV
GDLDGRIDS