Gene Amir_4604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4604 
Symbol 
ID8328802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5470016 
End bp5483839 
Gene Length13824 bp 
Protein Length4607 aa 
Translation table11 
GC content78% 
IMG OID644945051 
Productamino acid adenylation domain protein 
Protein accessionYP_003102283 
Protein GI256378623 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCTGC ACGCAGGGCT GCGCGGAGGC TCGGCCGAGG GGAACGCCCG GAGCGGCGGG 
GAGGCGTCCG CCGATCCCGT CGCCGTGGTC GGGATGGGGT GCCGCTACCC GAGGGCGCTG
TCCTCGCCGC AGCAGTTGTG GGACTTCGCG CTGCGCGGCG GGAACGCCGC CCGCCCGGAC
TTCCCCGGCG ACCGGGGGTG GGACCTGCGC GCGCTGACCG ACACCGGCGC GCCCGGCTCG
ACCTACGCGC GCGGCGGTTC CTTCCTGGAC GGACCGGGGG AGTTCGACGC CGCCTTCTTC
GGCGTCAGCC CGCGCGAAGC GCTCACCATG GATCCGCAGC AGCGGTTGCT GCTGGAGGTG
TCCTGGGAAG CGCTGGAGCG CGCGGGCATC GCCCCCGACG CGCTGGCGGG CAGCGACGCC
GGCGTCTACT TCGGAGTGGT GGCGCAGGAG TACGGCCCGC GCGTGTTCGC CGGGGCGCTG
GAGCACGCGG GCCACCTCAC CACCGGCACC ACCCCGAGCG TGGCCTCGGG CCGCGTCGCC
TACGCCCTGG GGCTCGAAGG CCCTGCCGTC ACGGTGGACA CCGCCTGCTC CTCGTCGCTC
ACCTCGGTGC ACCTCGCGGT GCGGGCGCTG CGCGCGGGCG AGTGCTCGCT CGCGCTGGCG
GGCGGCGCGA ACGTGGTGTG CGCGCCCAGC ATCCTCGTCG GCTTCGGCCA CCTCGGCGCG
CTGGCCCCGG ACGGGGTGAG CAAGCCGTTC TCCGACGACG CCGACGGGTT CGGCGTCGCC
GAGGGCGCGG GAGTGCTGGT GCTGGAACGG CTCTCCGACG CGCGTCGCCT CGGCCACCCC
GTGCTCGCCG TGCTGCGCGG CACCGCCATC GGGCAGGACG GCGCGTCCGA GGGGCTGTCC
GCGCCCAGCG AGCACGGCCA GCGGCGGGTG ATCCGCGCCG CCCTCGCCGA CGCCGGGCTC
ACCCCCGCCG ACGTGGACGT GATCGAGGCG CACGGCACCG GCACCCGCGT CGGCGACCCC
GTCGAGGCCC GCGCCCTGCT GGCCACCTAC GGCGCGGCCC GCCGCGCCGA CGACCCGCTG
CTGATCGGGT CGATCAAGTC CAACATCGGC CACACCCAGG CCGCGTCCGG GGTCGCGGGC
GTGATCAAGG CCGTGGAAGC CCTCCGGCAC GGCCTGGTGC CCGGCACCCT CCACCTGACC
CGCCCCACCG GCGCGGTCGA CTGGTCCGGG GGCGCCCTGC GCGTCCCGAC CGGAACCACC
CCCTGGCCTG CCTCGGCCGC GGGCAGGCCC CGGCGCGCGG CCGTGTCCTC GTTCGGGATC
AGCGGGACGA ACGCCCACGT CGTCCTGGAG CAGGCCCCGC CCCCCGACCG CGCGGCGACC
CCCGCCCCGT CGGGCGAGCC GCCTGCGCCG CTCGTGCTCT CCGCCCGCAC CGCCCACGCC
CTGCGCGAGC AGGCCGCCGC GCTGCGCGCG CACCTGGACC GCCGCCCCGA CCTCGACCTG
GCCGCCACCG CCCACACCCT CGCCGTCCGC CGCAGCCGGT TCGACCACCG CGCCGTGCTG
GTGGCGGGCG ACCGCGCCGA AGCGCTCGCC GGGCTGACCG CCCTCGCCGG GGACGCCGAG
ACCGTCCGGG CGCGGGCGGA GGCGGGTGCG GTGCTCGTCT TCCCCGGCCA GGGCTCGCAG
TGGGTGGGCA TGGCGTCGGG GCTGCTCGGG GTGGACCCGG TGTTCACCGC CTCGATCGAG
GAGTGCGGCG CTGCGCTCGC CGAGTTCGTC GACTGGTCCC TGTCCGACGT GCTGCGCGGC
GCCCCCGGCG CGCCTGGACC CGACCGGGTG GACGTGGTGC AGCCCGCCCT GTTCGCCGTC
ATGGTCTCGC TGGCCCGCTG CTGGCAGTCC TTCGGCGTGC GCCCCGCCGC GGTGATCGGG
CACTCCCAGG GCGAGATCGC CGCCGCCTGC GTCGCGGGAG CGCTCTCACT GCGCGACGCC
GCCGCGGTCG TCGCGTTGCG CAGCAAGGCC ATCACCGCCA TCGCGGGCAC CGGCGGCATG
GCCTCGGCGC CGCTGCCCGC CGCCGAGGTC GTCGCCCGCC TCGCGCCCTG GGCGGGCAGG
CTGGAGGTGG CCGCGGTCAA CGGGCCGCGC TCGACCGTGG TCTCCGGCGA CGCCGAGGCC
ATCGGGGAGT TCGTCGCCGC CGCCGAGGCC GACGACGTCC GGGTCCGCCG GGTGCCGGTG
GACTACGCGT CCCACTCGGC GCACGTCGAA GCGCTCGACG GCGCGGTCCA GGGGGCGCTC
GCCGACATCG CGCCCACCGC CTGCGAGGTG GAGTTCCTCT CCTCGGTGAC CGGCGGACCG
GTCGAGGGCG GGCGGTTGGA CGCCGCGTAC TGGTACCGCA ACCTGCGGCG CACGGTGCGC
TTCGAGGAGG CCGTGCGCGC CGCCCACGCG CGCGGGCACC GCTTCTTCAT CGAGGTCAGC
CCGCACCCCG TGCTCACCGT GGGGGTGGAG GAGTCGCTCC AGGACGCCCC GGTCGACGAC
CCGGTCGTGG TCGTGGGCAC CCTGCGCCGC GACGACGGCG GCCCGCGCCG CCTGCTGGCC
TCGGTCGCCG AGGCGCACGC CGCGGGCCTG CCGGTGGACC TGACCGCCCG CCTGGGCGAC
CGGCCCGCGG TCGAGCTGCC CACGTACCCC TTCCGGCGCC AGCGGTTCTG GCTGCCGCCC
ACCACCGGGA CGACCCCGGC CGGACCGGCC CCGGAGCACC CGCTGCTGGA CGCCGCGCAC
GACCACCCCG AGCACGACGG CCTCCAGTTC ACCGGCCGCC TCTCCCTGTC CACCCACCCC
TGGCTGGCCG ACCACGCCGT GGACGGGGTG GTGCTGCTGC CCGGCGCGGC CGTGGTCGAG
CTGGCCGCCT TCGCCGGGCG GCGCGCGGGC CACCCCGAGG TGACCGAGCT GGTGCTGCGC
GAGCCGCTGG TGGTCCCGGC GGAGGGCGCG GTGCTGCTGC GCGTCGTCGT CACCGGGGGC
ACCCCCGACG GCCCCGGCGT GCGGCTCTAC TCCCGCCGCG ACACCTCGCT GGGCGCCGGG
GCAGGCTGGA CCAAGCACGC CGAGGGCTCC CTGGGCGCCG CACCGCCCGC GCCCGCGCCG
GACCGGGCCG CCTGGCCCCC CGAGGGCGCC GAACCGGTGG ACGTGTCCGA CGCCTACGAC
CGGCTGGCCG CGCGCGGCTA CCACTACGGC CCGGCCTTCC GGGGACTGCG CGCGCTCTGG
CGCCGTGGCG CGGAGATCCA CGCCGAGGTC GCCCTGCCCG GCGGGGGCGG CCCAGGGGAG
TCCGGCACAG CGGAGTCCGG CACAGGGGGG TTCGGCACCG GGGAGTTCGG CCCAGGGGAG
TCCGGCACAG CGGAGTTCGG CACAGGGAAG TCCGGCACCG GGGAGTCCGG CACCGGCGAG
TTCGACCTGC ACCCGGCCCT GTTCGACGCC GCCCTGCACG CCCTCGTGCA CGTCGCCCCC
GACGCCGGGG CAGGCGGGAT CAGGCTGCCC TTCTCCTGGT CGGGCGTCCG GCTGCACGCC
GTGGGCGCCA CCGCCCTGCG GGTGAGCATC ACCGAGACCG GCCCCGACAC CTACGCGCTG
GCGCTGGCCG ACCCCACCGG GCAGCCGGTG TGCGAGGTGG GGGAGCTGCT CCTGCGGCCG
ATCACCCCCG ACCGGCTCCA CGCGGTGCGC GCCGGGGTCC GCGACGCGCT GCACAGCGTG
GAGTGGACCC CCGTGCCGGT GGGCGCGGCG GGCGTCCCCG AGTGGGTGGA GTGGGCCGAG
TGGGTCGGCG GGACCCCGGC GCGGGCCGTC GTCCTGCGCC TCGACGCCGC CGCCGACGGC
GCGGACCTCC CGCACCGGGC GCGGCGGGCG CTGGACGAGG TCCTGCTCCT CGTGCAGCGG
TGGCTGGCGG ACCCGCGCGC CGAGGGCGCC ACCCTGGTCG TGCTCACCCG GCGCGCGGTG
TCGACCGGTC CCGGCGAGGA CGTCGAGGAC CTGGCCGCCG CCCCGGTGTG GGGGCTGGTG
CGCAGCGCCC GGACCGAGCA CCCCGACCGC GTCGCGCTGG TGGACCTGGA CGACTGGGCG
GGCTGGCGCG ACCGCGTCCC GCGGGCCCTC GCCGCCCTCG CGGCGGGAGA GGGCGAGGTG
GCGCTCCGGC GGGACGCGCT CGTGGCGCCG CGCCTGGCGC CCGCGTCCGG CGCGGTCACC
GAGGGGCTCG ACCTGCTGGG CGGCCCCGGC TGGCGGCTGC GCACCCTGGG CGCGGGCACC
CTGGAGGCGG CCAACACCGC GTTCGCGGAC TGGCCCGAGT CGCGGCGACC GCTCGCCCCC
GGACAGGTGC GGGTCGCGGT GCGCGCGGTC GGGCTGAACT TCCGCGACGT CATGATCGCC
CACGGGCTCT ACCCCGATCC GCTCGTGGAC CTGGGGAACG AGGGCGCGGG CGTGGTGCTG
GAGGTCGCGG ACGACGTCCA CGACCTGGCG CCCGGCGACC GGGTCATGGG CATGTTCTAC
GGCGTCGGCC CGGTGGTGGC GCGCGAGCGC GGCTACTTCG CCCGCTTCCC GGACTCCTGG
ACCTGGGCCG AGGCCGCCGC CACCCCGGCG GTGTTCCTCA CCGCGCACTT CGCGCTGCGC
GGGCTCACCC CCGGCGCGCG GGTGCTGGTG CACGCCGGGA CCGGCGGGGT CGGCATGGCG
GCCACCCGGC TCGCCCACCA CCTCGGCGCC GAGGTCTTCG CCACCGCCTC CCCCGCGAAG
CAGCACCTGC TGCGCGCCGC GGGCCTGGCC GAGGACCACG TGGCGGGCTC GCGCACCCTC
GACTTCCGCG ACCGGTTCCT GGCAGCCACC GGCGGCGCGG GCGTGGACGT GGTGCTCAAC
TCCCTCGCGG GCGGGTTCGT CGACGCCTCG CTCGATCTGC TGCCGCGCGG CGGCGACTTC
GTCGAGCTGG GCAAGACCGA CCCGCGCGAC GCCGACGAGA TCGCCGGGAC CCGTCCCGGC
GTGCGCTACC GGCACTTCGA CCTGACCCAG GTCGGCCCGG AGGAGACCGG GGCGGCCCTG
GCCGAGCTGG GCGCGCTGTT CGAGCGGGGC GCGCTGCGCC CGCTGCCGGT GTCGGTGCGC
GACGTGCGGC AGGCACCCGC CGCGCTCAGG ACCCTCGCCC AGGCCCACCA CACCGGCAAG
CTCGCCCTCA CCCTCCCGCG CCCCCTCGAC CCCGACGGCA CCGTGCTGAT CACCGGCGGC
ACCGGCGTCC TGGGCGGCCT GCTCGCCCGC CACCTGGTCA CCACCAGGGG CGTGCGGCGC
CTGCTGCTGG CCTCGCGCGG CGGACCCGAC GCCCCGGCCG CCCGCGCGCT GGTGGCGGAC
CTGCGCGCGC TGGGCGCCGA ACCCCTGGTC GTGGCCTGCG ACGTGGCCGA ACCCGGGGCG
TGGGCCGCGC TGTCGGCCGC CGTGCCCGCG CGGCACCCGC TCACCGCCGT CGTCCACGCC
GCGGGCGTGC TGGACGACGC CGTGCTCACC GCGCTGACCC CGCAGCGGCT CGACCGCGTG
CTGCGGCCCA AGCTCGACGC GGCCTGGCGC CTGCACGAGG CCACCGCCGA CCTGGACCTG
GCCGCCTTCG TGCTGTTCTC CTCGGCGGCG GGCCTGCTGG GGACGCCGGG GCAGGCCAAC
TACGCCGCGG CCAACACCTT CCTCGACGCG CTGGCCGAGC ACCGCCGCCA GCGCGGCCTG
CCCGCGACCG CGCTGGCCTG GGGCCTGTGG GCGGACGGCT CCGGCATGAC AGGCCACCTC
GACGACCGCG ACCTCGCCCG GCTGCGCCGC GCGGGCTTCA CCGCCATGTC CGCCGCCGAG
GGCCTCGCCC TGTTCGACGC CGCCCTGGAC GACGGCCGGG CGCTGCTGGT CCCGGCCCGC
CTCGACCCGA CGTCCGGGCG CGCGGCGGAC CGGCCCGCCC TGCTCGGCGG CCCGGTCAGG
TCCGCGCCCC GCGCGGCGGG CGGCCCCGCC ACCGCGCGGC GGCCCCGCGC GGGCGACCTG
GCGGGCCTGC CCGAGGCCGA GCGCCGAGCG GTGCTGACCG ACCTGGTCCG CGCCGTCGCC
GCCACCGTGC TGGGCCACGA CTCGGCCGAG GACATCGACC CGAACGCGCG GTTCCAGCAC
CTGGGGTTCG ACTCCCTGGG CGGGGTGGAC TTCCGCAACA GGCTCCGCAG GGCCACCGGC
CTGCCGGTGC CGACCAACGC CGTGATCGAC CACAAGACCC CGGCGGCGCT CACGACGCAC
CTGCTGACCC TCGTCACCCC GACCGCCGAA GCACCCGCCC CCGCGCCACC CGCCCCACCC
GCGCACGGCG CTGTTGCGAA TGGCGCTGTC CCGAATGGCG CTGTCCCGCG CAGCACCGCC
CCGAACGGCA CCGCCCCGAA CGGGGCCGCC CCGCAGAGCC CTGTTCCCCA CCGCGCCGCC
CCGAACGGCG CCACTCCGAA CGGCGCCACT CCACACGCCA CCGCCCCGCC GGTCACCACC
CCGCCGGAGG CGACCGGGGA AACGCACCCG CTGAGCCACT ACCAGCGCGA CGTCGTGGGC
GTCGGCCTCG CCCACCCCGA GCTGCGCCAG GCCCAGCCCA GCGGGTACCT GCGGTTGCGC
GGCGCCACCG ACGTCGAGCG GGTGCGCGCC GCCATCCGCC GGACCGCCCT GCGCCACGAC
GCGATGCGGC TGCGGCTGGA GGTCTGCGGC GACGAGTGGC GGCAGCGCGT GCTGCCCGAC
TACCCCGACG TCGAGGTGGT CGACTTCCGC GCCGGGCCCG ACCCGGCGGG GGAGTGCCTG
CGGTGGATCG AGCGGGCCGT CGACACCCCC GTCCCCCTGG CCGGACCGCT GGTGCGGACG
GTGGTCCTGG TCGACGACGA CGCCTCCCTG GTCGTCTTCA CCCGCTTCCA CCACGTCGTG
GCCGACGCCT GGGGCATCAA CCTCGCGCTC GGCGACATCG CCGCCGCCTG CCGCGCCGAA
GCGCCCCCGG CCCCGCTCTC CCCGGCGCCG AGCTGCCTGG AGGCGGTGGC GGCCGACGCG
CGCTACCGGT GCTCACCCGA GCGCGAGGCC GATCGTGACG CCCTGGTCGA CGTGCTGGCG
CCGTTGCGGC CCGCCCTGTT CCCCCGCGCC GCGTCCGCGC GCGCCCACCG CCGCCTGCGC
CACTCCACCA GGGTGGACGC CGACGTGGTC GACCGCATCC GCGCCACCGG GCGTTCGGTG
TTCTCGGTGA CCGCCGCCGC CCTGGCCGCC TACCTGCGCC GCCTGCACCG CGAGGGCGAC
GTCGTGCTGG GCGTCCCGCT GCTCAACCGC CGCACGCCCG CCGAGCGGGC CACCATGTCG
CACGTCACCA ACACCCTTCC GCTGCATGTG CCCGTCGACG AGCGGGACAC GCTGCCGGAG
CTGGCCGACC GGGTGGCCTC CGGGGTCGGC GAGCTGCTGA CCCACCAGCG GTTCGCGCTC
GGCGACCTGC GCGCCGCGCT GCGCTCGGCG GGCAACCCGG CCAGGGACCT GTTCGACGTC
ACCTACTCCT ACATCACCGT CCCCGAGGAC GGCCTCGGGG AGGACGTCGA GCTGACCGTG
CTCGCCTCCG GCTACGCCCT CGACGCGGTC AACGTCGTCG CCCGCGAGCA CCGCTCCGAC
GGCTCGTTGC ACCTGGACGT CTTCTACGCC GACGACGTGC TCGACGGCGA CCTGACCGTC
GACGCGGTGA CCGGGCACGT CGCCCGCCTG CTCCGGGCGG GCCTGGACGA CCCGACCGCC
CCGGTGGGCT CGCTGCCGGT GCTCGGCCGG GCCGAGTCCG CCCGCCTCGA CGCCTTCGAG
CGCCCCGAGG CGGTGGGCTT CGACGAGACC ACCACCCTCG ACCGGCTGTT CGCCGAGCAG
GCGGAGCGCA CCCCCGGCCG CCGCGCGCTG ACCTGGTCCG ACCCGGACGG GACCACCCGC
GAGCTGACCT ACCGGCAGTT CCACCGGCAC GTCGCCCGAG TGGCCGGGCT GCTGCGCGCG
CGGGGCCTGC GACCCGAGGA GCCCGTGCCG GTGGTGCTGC CGCGCTCCCC GAGCTTCCTG
GTCGCCGTCC ACGCGGTGCA CGCGGCGGGC GGGGCCTACG TGCCGGTCGA CCCCGCCCAC
CCGGCCGAGC GCGTGCGGAC CATCCTGGGC GACTGCGGCG CCCGCCTCGC CATCGCCGAC
GTCGACCTGG GCGACCTCGG CGTGCCGTCC CTCGCGGTGG ACCTCGACGC GGACCTCGCT
GCGGACCTCG ACGCGGACCT CGCTGCGGAC CTCGCTGCTG ACCTCGCTGC TGACCTCACT
GCCGACCCGG CGGCGGCCCC CTGGACGGAC CAGGCGGCAG ACCCCGCGGC CGACAGCGCC
GTCCCCGCCC CGGTCACCAC CACCCGGCCC GGCGACCTCG CCTACGTCAT CTACACCTCC
GGCTCCACCG GCGTGCCCAA GGGCGTGATG GTCGAGCACC GCTCGGTGGT CAACCGCCTG
GCCTGGATGC AGCGCCGCCA CCCGCTGGGG CCGGACGACG TCGTCCTGCA CAAGACCCCC
ACCACGTTCG ACGTCTCCGT GTGGGAGCTG CTGTGGTGGG CGCACGCGGG CGCGACGGTG
GCGGTGCTCC CGGCGGGCGC CGAGCGCGAC CCGCGCGAGC TGGCCGCCGC GATCGAGCGC
CACGGCGCCA CGGTCGTCCA CTTCGTGCCC TCCATGCTCG GCCCCTTCCT CGACCACGTG
GAGGCCGACC CGCGCGCCGC CGAGCGGGTG CGCACCCTGC GCCGGGTCTT CGCCAGCGGC
GAGGCGCTGA CCCCGGCGCT GGCGGAGCGG TTCCGCGCGG TGCTCGCCGC CGCCGGCAAC
CCGGAGGCGC GGCTGACCAA CCTGTACGGG CCCACCGAGG CCACCGTGGA CGTGTCCCAC
CACGACGTCC CGGCCGACGG CCCGCTGACG CGCGTCCCCA TCGGCAGGCC CGTCGACAAC
ACCGCGCTGC TGGTGCTCGA CGCCGCCGGT CGCCGCTGCC CGGTCGGCGT GCCCGGCGAG
CTGAACATCG CCGGTGTCGC CCTGGCCCGC GGCTACCTGG GCAGGCCCGC GCTCACCGCC
GAGGCGTTCG TGGTGGACGA GTCGATCCCC GAGGGCCGCC GCTACCGCAC CGGCGACCTG
GCCCGCTGGC TGGCGGGCGG CGACCTGGAG TACCTGGGCA GGCTCGACGA CCAGGTCAAG
ATCCGGGGCA ACCGGGTCGC CCCCGGCGAG GTCGAGGCCG CGCTGACCCG CTGCCCTGGG
GTGTCGGCCG CCGCGGTCGT CCCGGAGGAC GGCGGCTCGG GCGCCCGGCT GGTCGCCTAC
CTCGTCGCAC CGCACGCCGA CGGGACCGAC CTGGTGCGCG CGGTGGTGGA CTCGCTGAAC
CTGCGGCTGC CGGGGTACAT GGTGCCCTCC GAGTACGTCC GGGTGGACGG GCTGCCGCTG
ACCCCCGGCG GCAAGGTCGA CCGCCGGGCG CTGGCCGCGC TGGGCCGCCG CGAGCGCGGC
TCGGGCCGCC CCGGCAACCC GCTCGAGGAG GCCGTCGCCG AGGTCTGGCG CGAGGTCCTG
GGCACCGACG CCTTCGGCGT GCACGACGAC TTCTTCACCG TTGGCGGCGA CTCCATCCTG
GTGCTGCGGA TGCGCACCGA GGCGGAGAGG CGCGGCCTGC CCTTCGACCT CGACCGGTTC
CACGCCCACC CCACCGTCGC CGCGCTCGCC GCGGGCCTGG ACGCGGCGGC GCCCGCCCTC
CCCGCCGTCG CCCCGGTCAC CGAGCCGTTC GAGCTGGTGC CGCTGATCGA CCGCGCGGGC
CTGCTCGGCG TCGAGGACGC CTTCCCCGCC ACCCGGCTCC AGCTGGGGAT GCTCTTCCAC
AGCCGCCAGC GCGCCGGTTC CCCGCTGTAC AAGGACGTCT TCCGCTACCG GCTGCGCCTG
CCCTGGGACG AGGCCGCCTT CCGCCGCGCC GCGGCGGACC TGGCGCGCCG CCACCCCGCG
CTGCGCTCGA CGTTCGACCT CACCGGCCGC TCCACCCCGC TCCAGCTCGT CCTCGCCGAG
GCGCCCGACG CGGTGGAGGT CGTCCACGGC CCCGCCGACC CCGAGGCGCT GGAGCGGGAG
GTCGAGGCCC GCCTGGACGC GCTGCACCAC GCCGACTACC CGCTGCTGGA CGACGACCCC
GACCACCCCG CCCCACTGCA CCGCGTCTGC GCGGTCGTCC ACGGGGACGA CTCCGGCCAG
GTGGACCTGG TGCTCGCCTT CCACCACGCG ATCCTCGACG GCTGGAGCGC GGCGGCCCTG
GTCGGCGAGC TGCTGGAGGA CCACCTGGCG CTGGTGGCGG GCCGCGCGCC CGCCGACCGC
GCCCCCAGCC CGGCGCTGCT GCTGGCCGAG CAGGTGCGCG CCGAGCACGA CGCCTGCCAC
GACCAGGCCC ACCACCGCCA CTGGGCGCGG GCGCTGGAGC GCTCCCGGCC CACCACCGTC
GAGTCGGTGC GGGCGCACGT GAGCGCGCCC GGCGGGGGCG AGCGCACCCT GGTGCTGCCC
GCCTGGCTGG ACGCCTGCGC CAAGGACTTC GCCCGCCGCC GGGGCGTGCC GCTGAAGTCG
CTGCTGCTGG CCGCCCACTG CCTGGCGCTG CGGGCCGTGA CCGGCGAGGA GGACGTCACC
ACCGGCTGCG TCACCCACAC CCGCCCCGAG CGCGCGGGCG CGGAGCTGTG CGCCGGGCTG
TTCCTCAACA CCGTCCCGGT CCGCCTGGCC GACGGCCCGC GCACCTGGGG CGAGGCCGTG
GACCACGTGC TGCGCTGGGA GCGCGAGTCC TACCCGCACC GCCGCCTGCC GCTGAGCGTC
CTGCAGGACG AGCGGGGCGG GCCGGTGCTC GAGACCGCGT TCAACTTCGT CAACCACCAC
GTCCTGGCGC AGACCCTGCG CGGCGACGAC CCTGGCTCGC CCGCCGCGTC GCTGGTGGAC
GTGGCGGTGC GCGAGGAGAC CAACTTCGCC CTGCTCACCA CCGCGGTGGT GGACCCGCGC
GACGGCAGGC TGGCCCTGCG GCTGAGCTCG GGCGGTGACG CGCTCACCGA CGAGCAGTGC
GCCGAGTTCG GCAGGCTCCT GATCGGCGTC CTTGCCGACA CCGTCCGGCG GCCCGACGCC
GCGATCGACC TCGACGCCCT CCGGTGGCGC GACGTGACCG AGGCGGTCGC CCACGCCGCC
GCGCTGCACC CGTCCGCCAC CGCCGTCGTG GGTGACACCG CGAGCCTGGA CTACCGCGCG
CTCGACGACA CCGCCGACGC GATCGCCCGA CACCTGCTGG ACCTGGGGAT GCCGCCCGGC
GCCCGCGTGG CGGTGCGGAT GCGCCGGGGC GTCGCCCTGG TGGCGGTGGT GCTGGGCGTC
ATGCGCGCGG GCGCGGCCGT GGTGCCGCTG GACCCGGACT ACCCGCAGGC CCGCGTCCGC
GCCATGGCGG ACATCGCCGC GCCGTGGCGG GTGGTCGCCG ACCCGGACCT GCGCGCCGCG
CTCGGCGGCG TGCCCACGAT CGACCCCGCA GAGCTGCTGG CCCCGCCAGG GCGCGACCCC
GCGCCGCTGC CGCGACCGGG ACCGGAGGAC ATCGCCTACG TGCTGTTCAC CTCCGGCTCC
ACCGGCGAGC CGAGGGGCGT GGCCATGCCC CACCGGGCGC TGGCGAACCT CGTCGAGTGG
CAGAACCGGC GCGCCACCGG CCGTGCGGCG GCGGGCGGGC GCGCGCCGAC CACGCTGCAG
TTCGCGCCGC TGAGCTTCGA CGTCTCCTTC CAGGAGATCT TCTCCACCCT GTGCGGGGGC
GGCGCCCTGC GCGTGCCCGA GGACGGCCTG CGGCAGGACG TGCCCGCGCT CCTGCGGGTG
ATCCGCGCGG CGGGCGTCGA GCGGGTCTAC CTGCCCTACG TGGCGCTCCA GGCGCTGGCC
GAGGCCGCCG TCGACGCCTG CCCGGACTCG CTGCGCTCCA TCGTCTCCTC CGGCGAGCAG
TTGCGCGTCA CCCCCGAGAT CCGCGCGCTG TGCGCGGCCA ACCCGGCCCT GGTGCTGGAG
AACCAGTACG GCCCGACGGA GACCCACGTG GTGCTGGCCC ACCCGCTGCC GGGGGGCGGC
GGGCACCCGC CGCTGCCGCC GGTCGGCGAC CCGGTCTCGG GGGTCGGGGT CAGCCTGCTC
GACGAGCGGC TGCGACCGGT GCCGCCGGGG ACGCGGGGGG AGATCTACGT CGAGGGCCCG
TGCCTCGGCC AGGGCTACGA GAACCGGCCC GGCCTGACCG CCGAGCGGTT CGTCGCGGCG
CCCTCGGGGC GGCTGCGCTA CCGCACTGGC GACATCGGGC TCATGCTCCC GGACGGGGGG
ATCGTGTTCC TGGGCCGTGC CGACAACCAG GTCAAGGTGC GCGGCTACCG CGTCGAGTGC
GCCGAGGTGG AGCTGGCGCT GCTGCGGCTC GCCGGGGAGC GGGCCGGGCT GGAGCAGGTG
GCCGTGGTGG CCCGCGACCT GGGCGGCGGG GACGCGGCGC TGGAGGCGTT CCTGGTCGGC
GACCCGGCGC GGGTCGATCC CGCGGAGCTG CGGGTGCGGC TGGCCGAGCA CCTGCCGCCG
CACCTGACGC CCTCGCGCTA CCACTGGGTC GCGGCGATCC CGCTGACCCC CAGCGGCAAG
CGCGACGACG CCGCGCTGCG CGCGCTGGCC ACCCGCCCCC CGGCCCCCGC CGCGCCGGGG
GACGAGCTGG AGGAGGCCGT CGCCGGGCTG CTGGCGGAGT TCGCCCACGT GGACCGACTC
GGCGTGGACA CCGGCTTCTT CGACGCGGGC GGCACCTCCA TCGGCGCCAC CCGCGCCGCG
ATGACCATCG CCCGGCGGTG GGCGGTGGAC CTGCCGCTCC AGGCGTTCCT CGCCGCGCCC
ACCGCGCGGG AGCTGGCCGG GGTCATCCGG GCGCACGGCG CCCGCCGACC GGCCTTCGAC
CCGGTGGTGA CCCTGCGGGA GGGCGGGCGC GGGACGCCGC TGTTCCTGGT GCACCCCATC
GGCGGCAACG TGCTGTGCTA CCGCGAGCTG GCCGCGCTCC TGCCCGGCGA CCGACCGGTG
CACGGCCTCC AGGCCGCGGG CGCCGACCCC GGCACCGAAC CGCTGACCTC GGTGCCCGCG
CTGGCCGACG CCTACACCCG CGCGATCCGC CGCGTCCACC CGCACGGCCC CTTCCACGTC
GCGGGCTGGT CCTTCGGCGG CTACGTCGCG CTGGAGATCG CCGAGGCGCT CGGCGCGGCG
CAGGTGCCCA CCGCGACCCT CCTGGACACC GTCGCCCTGG ACACCGCGGC CCTGGACACC
GCAGCCCCCG GCGAGCGCGC CCGCCCACCC GTGCCGGAGC GGCAGCTCAT CGGGTTCTTC
TTCCGCGAGC TGCTCTGGTA CTCCTCCGGC GGGGCCGACC TGGCCGACGA GCCGGACCCG
ACCGGGGACG CCGAGCCGCT CCTCGACGCC GAGCGGCTCT TCGACGAGGG CCTTGCCCGG
TGCGTGGCGC TGGGCATCCT GCCCGAGGAC GGCTCGCCGC AGCTGCTGCG CAGGCTCTAC
GAGGTCTTCC GGGCCAACTA CCGGGCGGTG CTCGACCACC GGCCGCGACG GGTGCGCCGC
CCGGTGCGCC TGCTGCGCGC CGCCGAGGAG CTGCCCGCCA ACCTCGCCAT CGCCCACCGC
GCGGTGGGCG GTCTGCTCGC CGACCGGGGC AACGGCTGGC GGGCGGCGGA CGGGCATCCC
GTCGAGGTGG TCGAGGTCCC CGGCAACCAC CTGTCGATGA TGACCGCGCC GCACGTCCGC
ACCGTCGCCA GGGTCCTCGG CGACGGCTTG GACCGCGCCG ATGACCCGCG CCGCGCCGAC
TCGGGCGTCG AGGTGGCCCG GTGA
 
Protein sequence
MLLHAGLRGG SAEGNARSGG EASADPVAVV GMGCRYPRAL SSPQQLWDFA LRGGNAARPD 
FPGDRGWDLR ALTDTGAPGS TYARGGSFLD GPGEFDAAFF GVSPREALTM DPQQRLLLEV
SWEALERAGI APDALAGSDA GVYFGVVAQE YGPRVFAGAL EHAGHLTTGT TPSVASGRVA
YALGLEGPAV TVDTACSSSL TSVHLAVRAL RAGECSLALA GGANVVCAPS ILVGFGHLGA
LAPDGVSKPF SDDADGFGVA EGAGVLVLER LSDARRLGHP VLAVLRGTAI GQDGASEGLS
APSEHGQRRV IRAALADAGL TPADVDVIEA HGTGTRVGDP VEARALLATY GAARRADDPL
LIGSIKSNIG HTQAASGVAG VIKAVEALRH GLVPGTLHLT RPTGAVDWSG GALRVPTGTT
PWPASAAGRP RRAAVSSFGI SGTNAHVVLE QAPPPDRAAT PAPSGEPPAP LVLSARTAHA
LREQAAALRA HLDRRPDLDL AATAHTLAVR RSRFDHRAVL VAGDRAEALA GLTALAGDAE
TVRARAEAGA VLVFPGQGSQ WVGMASGLLG VDPVFTASIE ECGAALAEFV DWSLSDVLRG
APGAPGPDRV DVVQPALFAV MVSLARCWQS FGVRPAAVIG HSQGEIAAAC VAGALSLRDA
AAVVALRSKA ITAIAGTGGM ASAPLPAAEV VARLAPWAGR LEVAAVNGPR STVVSGDAEA
IGEFVAAAEA DDVRVRRVPV DYASHSAHVE ALDGAVQGAL ADIAPTACEV EFLSSVTGGP
VEGGRLDAAY WYRNLRRTVR FEEAVRAAHA RGHRFFIEVS PHPVLTVGVE ESLQDAPVDD
PVVVVGTLRR DDGGPRRLLA SVAEAHAAGL PVDLTARLGD RPAVELPTYP FRRQRFWLPP
TTGTTPAGPA PEHPLLDAAH DHPEHDGLQF TGRLSLSTHP WLADHAVDGV VLLPGAAVVE
LAAFAGRRAG HPEVTELVLR EPLVVPAEGA VLLRVVVTGG TPDGPGVRLY SRRDTSLGAG
AGWTKHAEGS LGAAPPAPAP DRAAWPPEGA EPVDVSDAYD RLAARGYHYG PAFRGLRALW
RRGAEIHAEV ALPGGGGPGE SGTAESGTGG FGTGEFGPGE SGTAEFGTGK SGTGESGTGE
FDLHPALFDA ALHALVHVAP DAGAGGIRLP FSWSGVRLHA VGATALRVSI TETGPDTYAL
ALADPTGQPV CEVGELLLRP ITPDRLHAVR AGVRDALHSV EWTPVPVGAA GVPEWVEWAE
WVGGTPARAV VLRLDAAADG ADLPHRARRA LDEVLLLVQR WLADPRAEGA TLVVLTRRAV
STGPGEDVED LAAAPVWGLV RSARTEHPDR VALVDLDDWA GWRDRVPRAL AALAAGEGEV
ALRRDALVAP RLAPASGAVT EGLDLLGGPG WRLRTLGAGT LEAANTAFAD WPESRRPLAP
GQVRVAVRAV GLNFRDVMIA HGLYPDPLVD LGNEGAGVVL EVADDVHDLA PGDRVMGMFY
GVGPVVARER GYFARFPDSW TWAEAAATPA VFLTAHFALR GLTPGARVLV HAGTGGVGMA
ATRLAHHLGA EVFATASPAK QHLLRAAGLA EDHVAGSRTL DFRDRFLAAT GGAGVDVVLN
SLAGGFVDAS LDLLPRGGDF VELGKTDPRD ADEIAGTRPG VRYRHFDLTQ VGPEETGAAL
AELGALFERG ALRPLPVSVR DVRQAPAALR TLAQAHHTGK LALTLPRPLD PDGTVLITGG
TGVLGGLLAR HLVTTRGVRR LLLASRGGPD APAARALVAD LRALGAEPLV VACDVAEPGA
WAALSAAVPA RHPLTAVVHA AGVLDDAVLT ALTPQRLDRV LRPKLDAAWR LHEATADLDL
AAFVLFSSAA GLLGTPGQAN YAAANTFLDA LAEHRRQRGL PATALAWGLW ADGSGMTGHL
DDRDLARLRR AGFTAMSAAE GLALFDAALD DGRALLVPAR LDPTSGRAAD RPALLGGPVR
SAPRAAGGPA TARRPRAGDL AGLPEAERRA VLTDLVRAVA ATVLGHDSAE DIDPNARFQH
LGFDSLGGVD FRNRLRRATG LPVPTNAVID HKTPAALTTH LLTLVTPTAE APAPAPPAPP
AHGAVANGAV PNGAVPRSTA PNGTAPNGAA PQSPVPHRAA PNGATPNGAT PHATAPPVTT
PPEATGETHP LSHYQRDVVG VGLAHPELRQ AQPSGYLRLR GATDVERVRA AIRRTALRHD
AMRLRLEVCG DEWRQRVLPD YPDVEVVDFR AGPDPAGECL RWIERAVDTP VPLAGPLVRT
VVLVDDDASL VVFTRFHHVV ADAWGINLAL GDIAAACRAE APPAPLSPAP SCLEAVAADA
RYRCSPEREA DRDALVDVLA PLRPALFPRA ASARAHRRLR HSTRVDADVV DRIRATGRSV
FSVTAAALAA YLRRLHREGD VVLGVPLLNR RTPAERATMS HVTNTLPLHV PVDERDTLPE
LADRVASGVG ELLTHQRFAL GDLRAALRSA GNPARDLFDV TYSYITVPED GLGEDVELTV
LASGYALDAV NVVAREHRSD GSLHLDVFYA DDVLDGDLTV DAVTGHVARL LRAGLDDPTA
PVGSLPVLGR AESARLDAFE RPEAVGFDET TTLDRLFAEQ AERTPGRRAL TWSDPDGTTR
ELTYRQFHRH VARVAGLLRA RGLRPEEPVP VVLPRSPSFL VAVHAVHAAG GAYVPVDPAH
PAERVRTILG DCGARLAIAD VDLGDLGVPS LAVDLDADLA ADLDADLAAD LAADLAADLT
ADPAAAPWTD QAADPAADSA VPAPVTTTRP GDLAYVIYTS GSTGVPKGVM VEHRSVVNRL
AWMQRRHPLG PDDVVLHKTP TTFDVSVWEL LWWAHAGATV AVLPAGAERD PRELAAAIER
HGATVVHFVP SMLGPFLDHV EADPRAAERV RTLRRVFASG EALTPALAER FRAVLAAAGN
PEARLTNLYG PTEATVDVSH HDVPADGPLT RVPIGRPVDN TALLVLDAAG RRCPVGVPGE
LNIAGVALAR GYLGRPALTA EAFVVDESIP EGRRYRTGDL ARWLAGGDLE YLGRLDDQVK
IRGNRVAPGE VEAALTRCPG VSAAAVVPED GGSGARLVAY LVAPHADGTD LVRAVVDSLN
LRLPGYMVPS EYVRVDGLPL TPGGKVDRRA LAALGRRERG SGRPGNPLEE AVAEVWREVL
GTDAFGVHDD FFTVGGDSIL VLRMRTEAER RGLPFDLDRF HAHPTVAALA AGLDAAAPAL
PAVAPVTEPF ELVPLIDRAG LLGVEDAFPA TRLQLGMLFH SRQRAGSPLY KDVFRYRLRL
PWDEAAFRRA AADLARRHPA LRSTFDLTGR STPLQLVLAE APDAVEVVHG PADPEALERE
VEARLDALHH ADYPLLDDDP DHPAPLHRVC AVVHGDDSGQ VDLVLAFHHA ILDGWSAAAL
VGELLEDHLA LVAGRAPADR APSPALLLAE QVRAEHDACH DQAHHRHWAR ALERSRPTTV
ESVRAHVSAP GGGERTLVLP AWLDACAKDF ARRRGVPLKS LLLAAHCLAL RAVTGEEDVT
TGCVTHTRPE RAGAELCAGL FLNTVPVRLA DGPRTWGEAV DHVLRWERES YPHRRLPLSV
LQDERGGPVL ETAFNFVNHH VLAQTLRGDD PGSPAASLVD VAVREETNFA LLTTAVVDPR
DGRLALRLSS GGDALTDEQC AEFGRLLIGV LADTVRRPDA AIDLDALRWR DVTEAVAHAA
ALHPSATAVV GDTASLDYRA LDDTADAIAR HLLDLGMPPG ARVAVRMRRG VALVAVVLGV
MRAGAAVVPL DPDYPQARVR AMADIAAPWR VVADPDLRAA LGGVPTIDPA ELLAPPGRDP
APLPRPGPED IAYVLFTSGS TGEPRGVAMP HRALANLVEW QNRRATGRAA AGGRAPTTLQ
FAPLSFDVSF QEIFSTLCGG GALRVPEDGL RQDVPALLRV IRAAGVERVY LPYVALQALA
EAAVDACPDS LRSIVSSGEQ LRVTPEIRAL CAANPALVLE NQYGPTETHV VLAHPLPGGG
GHPPLPPVGD PVSGVGVSLL DERLRPVPPG TRGEIYVEGP CLGQGYENRP GLTAERFVAA
PSGRLRYRTG DIGLMLPDGG IVFLGRADNQ VKVRGYRVEC AEVELALLRL AGERAGLEQV
AVVARDLGGG DAALEAFLVG DPARVDPAEL RVRLAEHLPP HLTPSRYHWV AAIPLTPSGK
RDDAALRALA TRPPAPAAPG DELEEAVAGL LAEFAHVDRL GVDTGFFDAG GTSIGATRAA
MTIARRWAVD LPLQAFLAAP TARELAGVIR AHGARRPAFD PVVTLREGGR GTPLFLVHPI
GGNVLCYREL AALLPGDRPV HGLQAAGADP GTEPLTSVPA LADAYTRAIR RVHPHGPFHV
AGWSFGGYVA LEIAEALGAA QVPTATLLDT VALDTAALDT AAPGERARPP VPERQLIGFF
FRELLWYSSG GADLADEPDP TGDAEPLLDA ERLFDEGLAR CVALGILPED GSPQLLRRLY
EVFRANYRAV LDHRPRRVRR PVRLLRAAEE LPANLAIAHR AVGGLLADRG NGWRAADGHP
VEVVEVPGNH LSMMTAPHVR TVARVLGDGL DRADDPRRAD SGVEVAR