Gene Svir_20780 details


Gene Information

Locus tag: Svir_20780
Symbol:
ID: 8387402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharomonospora viridis DSM 43017
Kingdom: Bacteria
Replicon accession: NC_013159
Strand:
Start bp: 2211916
End bp: 2222535
Gene Length: 10620 bp
Protein Length: 3539 aa
Translation table: 11
GC content: 72%
IMG OID: 644976136
Product: non-ribosomal peptide synthase/amino acid adenylation enzyme
Protein accession: YP_003133918
Protein GI: 257056086
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
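
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The record does not state either point, although both are consistent with its numbers. The helper names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2211916
END_BP = 2222535
GENE_LENGTH_BP = 10620
PROTEIN_LENGTH_AA = 3539


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```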


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.040794
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCATTG CCCGCAACCC TGTCCACACG ACGTCGACCT CAACCGACCC GGTCCTGGAA 
CTGACCAGCG CACAGCTGGG GATCTGGAAC GCCCAACGGT TGGAACCGGA CTCCGGCTAC
TACCTGGTCG GCGACGTGAT CGAGATCTCC GGGGACCGTC CCGTGGACAC CCGGCTGCTC
GCCGAGGCCA TGCGGGCCAC CGTCGACGAA GCCGAGACGA TGCGGTTGCG CGTGTTCGAC
ACGCCTTCCG GACCGCGCCA GGTGATCAGC GACGAGCCGA CGCCGTTGCC GGAGGTGATC
GACGTCAGTG CGGAACCGGA TCCCACGGCG GCGGCGAACG AACTCGTGGC ACGGGAACGT
GCCCGCGCCG CCGAGGCCTG CCGAGGCATG GTCGACCGCA GGCTGTACTC CCAGACCATC
ATCCGCCTGT CCGACACCGA GGTCTGGTAC ACCCAGCTCG GCCACCACCT GATCTTCGAC
GGCTACAGCG CGGCGATGCT GGCACGGCGG ACCGCCACCC ATTACACGGC CCTGTGCCGG
GGCACCGAGG TGGCACCGTC GCCGTTCGGT CGATTCGTCG ACGTGATCCG GTCGGACCAG
GACTACCTCG CCAGTGAACA GTTCGAACGC GACCGTGCGT ACTGGCTCGA CCGACTCACC
CCCCTGCCCG AACTCAGCGG TTGGACCGAT CCGGTGACCG GCCCGCCGAA GAAAACGCTC
ACCGCGCGCA TCGTCGTCTC CCCCGAGGAC ACCGCCCGAC TCCGGGCCGC CGCCGAACGG
GAGAGACTCA GCTGGGGCGA GGCGCTCATC GCCTGCTACG CCGCCTTCCT GCACCGGCTG
CAGGGACGGA CGGACGTGGT CTTCGCGCTT CCGCTGATGT GCCGGACCAG TTCGGCGGAG
CTGCGCACCC CCTCGATGGC GGTCAACGTG CTCCCGCTGC GCGTGACCGT GCGGGGCGGT
GATCGACTCG GGGAGTTGGG CAGGCGCATC GCCCAGGCCA TGCGAACGAT GCGCACCCAT
CAGCGCTACC GCGGTGAGAA CCTGCCCCGG GACCTGGCCG TCCCCGGCGC GGGGGCGTTG
CTGCACGGCT GCGGGGTCAA CCTCAAGGCG TTCGACCTGT CGCTGGACTT CGCCGGTTCG
GTGGGTGTGA TGCGCAACGT CGCGGGCGGA CCCCCGGAGG ACATGGGCCT GAGCGTGCTG
CCGACCCGGG ACGGCGGACT GCTGCTCGGC TTCGAGGTCG ACGCGCGCAC CCACGACCAA
CAGGCCGTCG ACCACAAACT GGCGGTGCTG CACACCCTGC TTGAGCGACT GACCGGGCCC
GACGATCCCG CGGTGGGCCA GGTGGAGCTG GTCGGCCCCG ACCACCGACG CCGATTGCTC
GACGACTGGA CGGTGCCCGC GATACCCGGA CGGCCGCGGA CCGTTCCGGA AGCGTTGGCC
GAACTCGCGG CGAACCACCC GGACCGGACG GCTCTGGTCC ACAACGGACA CGAACTCACG
TTCGGGCAAC TGGCGGACCG GGTGTACCGG CTCGCCCGAG CCCTGCGGGC ACGGGGGATC
GGCCCGGAGG ACGTCGTCGC GTTGGCGCTA CCCCGTTCCG CCGACATCGT CATCGCTTTG
CTCGCCACGC TCGAAGCCGG AGCGGCGTTC GTGCCGTTGG ATATCGGACA CCCCCCGGAA
CGACTGCGTG CCCTCATCGA GGACGCCGGA GCCGACCTGG TGTTGACCCA CCGGACGGGC
CCCCGGCCGG ATGGTGACGT GCCCCGACTC GCGCTGGATG CCCCCGACAC ACTCGCCGAG
CTGGGATCAC TTCCGTCCGG GCCGCTGACG GTCGAGGAAC TGGCCGGGCC ACGGCTGCCC
GACCAGCTCG CCTACGTGCT CTACACCTCA GGGTCGACCG GCCGACCGAA GGGCGTGCTG
GTGCGCTCCG GCGCACTGGC CGCGCTGCTG CACCACCACC GGGCGACGAT CACGGCGGAG
GCCGAACGCG CCGTCGGGCG GCCGCTGCGG GTGGCGCACA CGTACTCGTT CGCCTTCGAC
TCCGCGATCG ACCAGGTGAT CTGGCTGCTG TGCGGACACG AACTCCACAT CTACGACGAC
GAGGTGTTCC GGGACGCCCA GACGCTTCGG TCCTGCTGCG CCGCCGACGC CATCGACGTC
GTGGACACCA CCCCGTCGAT GGCGGCTCCA CTGCTGGACG CGGGGCTGCT TGACGGCGAA
CACCGCCCCA CCCTGCTCAT CCTCGGGGGT GAGGCACTGC CGCGGACACT GTGGCGGCGG
ATCGCCGACA CCGGCGTGCT CGCCCGCAAC ATGTACGGGC CCACGGAAGC CACAGTGGAC
AGCGCCACCG CCCGGATCGA CGACGGGGAA CCCACGATCG GTCTGCCGCT GGCCGGTACC
CGGATCTACC TGTTGGACAA CGCGTTGCAG CCGGTCCCGC ACGGCACCGT CGGGGAGTTG
TACCTGGCCG GACCACAGCT GGCGCGGGGA TATCTCGGTC GTCCGGGGTT GACGGCGCAG
GCGTTCGTCG CCGACCCGTT CGGCGCGCCG GGGGAGCGGA TGTACCGCAC CGGCGACCTG
GCGCGCTGGG TGCCGGGCCG GGGACTGGAA TACCTCGGCC GCGGCGACGG ACAGGTCAAG
ATCCGCGGAC ACCGGGTGGA GCTCGGCGAG GTCGAGGCGG CGCTGGGCGC GGTGGAAGGA
GTGAGCGCGG CCGCCGCCGT CGTCCGTACC GACGGAGAGC TGACCAGGCT GGTGGGCTAC
GTCGTCCCCG AGGCCGGAGC CCAGGACCTG ACCCCGGACT CGGTCCGCAA CGCGCTGGCC
GACCGGGTCC CCGACCACCT GGTGCCCGCC GCCGTGGTGG TCCTGGACGA ACTACCGCTG
ACGAGCAACG GGAAACTCGA CCGTGCCGCG CTTCCGGCAC CCCGACTGGC CAGCGGGGGC
AGGCCCGCGA CCACCGAACG GGAACGCCTG CTCTGCGACG CCGTCGCCGA GGTGTTCGGA
TTGGACGAGG TCGGCGTCGA CGACGACTTC TTCTCCCTCG GCGGGGACAG CATCACCGCC
ATCAGTGTGA GCAGTCGCCT GCGGAACAAG GGGCTCGTAC TCAGACCGCG CGATCTGCTC
GCACAACGGA GCTTCGCCTC ACTGGCCACC ACCCTCACCG CGATACCGGG TGAGACGCCC
GCACCGCAGG ACGAACCGAC CGGGCCGGTC CCCGCGACAC CGATCGTGCG CGCCCTGCTG
GATTCGAATC CGCAGGTGCG GAACGTGGCC GGCTACGTGC AGTGGACCGC GGTGGCCGTG
GACACCGACC TGACCCTCAC CGACCTGAGG ACCGGCGTAC ACAGGGTGCT GGAACGACAC
GACGCGCTGC GTCTGCTCCT CACCCTGGCC CCGGACGGCT CGATCGAACT GGTGGTGCGA
CCGGCCGACG CGGTGTCGGC GACGGCGGTG GTCAGCGAGG TGAACGCCAC CGACGGGGAC
GTGGCCGACC AAATCGCCGC GCTGGCACAG CGGCTGGCCG CGTGCCTGGA CCCGCTCACC
GGGGACCTCC TGCGGATCGC ACTGCTGCGG ACCGGCCCCG AGACACCGGA CCGATTGGTC
GTGGTCGCGC ACCACCTCGT CGTCGACGGG GTGTCCTGGC GCGTGCTGCT GCCGGACCTG
CAGGCGGCGT GCGAGGCCGC CCGAGCCGGC CGTCCCGTCG ACCCGGCCCC CGTCGGGACG
TCGTGGCGCC GGTACGCCAT GGTGCTGGCG GAACAAGGCG TGACCGGGAG GCGGCGGTCC
GAACTGGACT TCTGGCGCCG GGCGGTCGAC CCCGACTGTC GTCCGCTGGG CTGTCGGGCG
CTGGACCCCG CCCGGGACAC CGCCGCGACG GCCGCCCGGT CCACCACGAT CGCCACGGCC
CCGGTCAGCG AGGCGGTGTT GACCACGCTG CCCGCGGCGT ATCGCATGGG CGTCGACGAG
GTGCTGTTGG CCGCGTTGGT GTTGACGGTG CGCTCGTGGC GGCGAGAACG GCACCGGCCC
ACCGACGAGG CCGTGACGAT CACCATGGAA GGTCACGGCC GGGAACAACT CGAACCCGGC
ATCGACCTGT CCCGCACGGT CGGCTGGTTC ACCAGCGAGT ACCCGGTGCG CGTCCCCCTC
GACCCGGTGC ACGCGGAGGC CGACCTGCTC GACGCGCTGG CCGGCGGACC GGCGGCGGGA
CGGTTGCTGC GCGCGGCCAA GGAGGCCAAA CGCGCCGTCC CCGACGGGGG CATCGGCTAC
GGCGTGCTCC GCTACCTGGA CCCTGAGACC CGCGATGAAC TGGCGGACAC CCCCGCACCC
GAGGTGCTGC TGAACTACCT GGGACGGTTC GCCCCCCTGC CTGGGAGCGG CTGGCGATTG
CCCGAACGCG ACGCCTTCGC CGTACTCGAC CCCGACGCCA AGGCGTTGGA ACAGGTGTTG
GCCCTCAACT GCTTCGTGCA CGAAGAGGAC GCCCCGCGTA TCGCGGTGGA ATGGACCGCA
GCCCGGGGCA TCCTGGCCGC CGGGACGGTC GAGCAGCTGC AGCGGGCGTG GACGGCGGCG
TTGGACGCCC TGGCGGCACA CGCGAGGCGG ATCGGACCTA ACGGTGGTGG GCTGACCCCG
TCCGACCTGC CGTTGGTGTC CTTGGACCAG GACACCATCG ACGAACTGGA GGCCCACCGG
CGCCTCGCCG ACGTGCTCCC CGCGACCGCC CTGCAGACGG GACTGTCCTT CCACACGCTC
GTGCGCGGCG ACGACGACAC CGACGTCTAC GTCGTGCAGG CGATCATGTC CTTGGCGGGG
GAGCTGGACC CGCGCAGACT GGCCGCGGCC GCCGAAAAAC TGCTGCGACG CCATCCCACC
CTGCGGGTGT ATCTGGCCAC GACGACGGCG GGGGACGTCG TCCAAGCGGT GCCCGCCGAC
GTCGAACTGG ACTGGCGCGA GATCGACCTG TCGGCGTTGC CGCCCGGGGA ACGGGACGCC
GCGTTCGACA CGCATGCGCG CACGGAGCAG GAACGACCGT TCGACCCCGG CGAGCCACCG
CTGATCCGTT TCCTGCTGTG CGTGCTCGGC GAGACCGAAC ACCGTCTGGT CATCACCAAC
CACCACGCGC TGCTGGACGG CTGGTCGATG CCCCTGGTCG GCCGTACCCT GTTGGCGATC
TACGCCGAGC TGGGCGGAGG ACCCGCGGCG CCCGCGGCAC CCCCGCCCAT CGAGTACTAC
CGATGGCTGT CCGAACGCGA CCGGCAGGCC GGGCTCGCCG CGTGGCGGGA AGCGCTGGCG
GGGGTCGACG AGGGGACCCG ACTCGCGCCG GCCACCGCGC ACGCCCGCAT CGAACGACCC
GGCCGGGTGT CGATCCCGCT CGGCGCGGAG TTCAGCCGAC GAATCCGAAA GTTCGCCCGC
GACCACGGGA TCACCGTGAA CACGGTGCTG CAGACCGCCT GGGGTCTGCT GTTGAGCCGG
CTCACCCGCC GACGGGACGT GCTGTTCGGC TCCCCCGTCT CGGGCCGACC CGCCGAGGTC
GAGGGGATCG AGTCGATGAT CGGCCAGTTC GGCAACACGA TCCCCGTCCG CATGCGCATC
ACCCCGGCCG AACCCGCCCA CGACCTGCTC GCCCGGGTGC ACGCCGAGTC GGTGGCGCTG
AGCGAGCACC ACCACGTCGG ACTGCCCGAC ATCCAGCGGG CCGTGGGGGT CGGTGAACTG
TTCGACACCC TGTTCGTGAT GGAGAACTTC CCCCTGGCCA GCCGGGGGCG CACCCCGCTA
GCACCGGGGC TCGAACTGAC GGGTGTGGAC ATCGTCGACG CCACCCACTA CCCGCTCACC
GTCGTGGTGA TCCCCGAGGA TGAGATCGTC ATCGGCCTCG GCTACCAACC GCAGGTGTTC
GACGAGGCGA CGGTGCGCGA CTACGGACGC TGGTTGCACA ACCTGCTGCG AGAACTCGTC
ACCGACCCGC GGCGCCCCGT GGGGCGGCTG CCGATGCTCG ACGCCGAGGA ACGGCGGTGG
CTGCTGCGGG TCGGCACCGA GGTCGTGCCC GCGCGACCAC GCCGAGGGGT GCTGGAGGAG
TTCGCCGCGT GGGTGAACCG ACAGCCTGAG GCCGAGGCGG TGGTGTGCCG CGACCGGAGC
CTGACGTATC ACGAACTCGA CCGACGGGCG AACCGGCTCG CGCACGCGTT ACTCGCCAGC
GGGGTCCGAC CGCAGGACCC GGTCGCCGTC CTGCTCGGCC GGGACGTCGA GATGGTGGTG
GCCCTGTTCG CCGTCCTCAA GGCCGGCGCG GTGTACGTGC CGCTGGACGC GAACTACCCC
CGGGAACGCC TGGCGTACAT GGTGGACGAC GCGGCCCCGA CCGCGATCGT GACGACCGAC
CGGCTGTGGG CCGAACTCGG CGGGCAGCTG CCGACGGTGC AGGCGATCCC GGTCGTGCGC
TGCGACGAGA CGCCCACCGA CGGGGTGAAC GGCCCGGCGG CGTGGGACCA CGATCCCGCC
CACGCGCGGG CGAGGATCAC CGCGGACTCC CTGGCCTACG TCATCTACAC CTCCGGCACC
ACGGGACGTC CCAAGGGTGT CGCGGTGACC CATCGAGGCC TGCCCGATCT GGTCGCCCTG
CAGGAAGAGG TCGTCGGCGT CACCGAGAAG GACCGCTACC TGCACTTCGC CTCGACCAGC
TTCGACGTCG CCTTCTGGCA GACGATGGTG CCGTTGCTGT CCGGGGGGAC GCTCGTCATC
GCTCCAGAAG AGGTGCGGGT GCCCGGCGAC GAGTTGTTCG ACTACGTCGC CAGGCACCGG
GTGACGGGGG TGAACCTGCT GCCGTCGTTC CTGGCCGCCG TGCCCGACGA CTGCACGGTG
GGACCGGACG TGTTCTTCGT CGTCGGCGCC GAACGACTCG ATCCCGAACT GGCCCGCCGC
TGGGGCGATC GACGGGCCCT GTTCAACGCC TACGGCCCCA CCGAGGTCAC GATCAACTCG
GTGACCTGGC GGTACGACCC CGACGACCCC GGCCCGCTGC CGATCGGGCG ACCCGACCCC
AACGTGCGGG CCTACGTGTT GGACGAGGGA CTGTGCCCGG TGGGTGTCGG CGTGCCCGGG
GAGTTGTACC TCGCGGGGCC GAAACTGGCG CGGGGGTATC TCGGTCGTCC GGGGTTGACG
GCGCAGGCGT TCGTCGCCGA CCCGTTCGGC GCGCCGGGGG AGCGGATGTA CCGCACCGGC
GACCTGGTCC AGTGGCGACC CGACGGGCAA CTGGTGTTCC TCGGCCGGGT GGACCGCCAG
GTGAAGATCC GCGGTTTCCG GATCGAGCCG GGGGAGATCG AATCGGCTCT GACCCGCCGT
CCGGACATCC GCGCGGCCGC CGTGGTCGTA CGCGAGGACC GGCCCGGCGA ACGCCGACTG
GTGGGGTATG TCATCCCGAG GGTGGGAGCC GGACTCGACA CCGACCGCAT CCGTGAGGAC
CTCGCCCGGG AGCTGCCCGA CCACCTGGTG CCGGCGGCGT TGGTGGTGCT CGACCGGCTG
CCCCTCAGTC CCAGCGGCAA ACTCGACCAG GCCGCGCTGC CCGCGCCCAG TGCGGGCAGC
GGTGCCCCGG CGCGGGAACC GGCCACACCG GCGGAGGAGG CACTGCTGGG CCTGGTCCGC
GAGCTGCTCG GCACGGACGA GATCAACCTC GACGACGGGT TCCTCGACGT CGGCGGGGAC
AGCATCGTGT CCCTGCAGCT GGTGTCCCGG GCCCGTCGAC TCGGCTGGCG CCTGGCCCCC
CGGGACGTGT TCGACGGGGG CACCGTCGCC GGTATCGCGG CGCGCTGTGT CCGACTCGGC
GACGGCGACG CCGAGAGCGC ACCGGCCGTC GGCGACGCCC CGCTGACACC GGTCATGCGG
GACCTGCTGC GACGGTGCGA GGCCGCGGGC GTGCCCGCCG ACGACTTCTG CCAGTGGGTG
CAGATCTGCG TCCCCCCGGG CGGCGACGTG GCGACCTGGC ACGCCGTGTT CGACGCGGTG
CTGGCCCGGC ACGACGTGCT GCGGGGCCAC CTGGCCGTCC CCGCCGACGG CGGCGAGCCG
GTGCTGCGTA TCCCGCCGGA GGGCACGGTC ACCGCCGCGC AGGTGGTGAC GCACGTGCGG
GTGTCGGAGA CCGACGACGT GCGCGCGCTT TCCGACTCCT GGCTGCGCAC CGCGCGAAGC
GGTCTGAACC TGTGGGCGGG ACCGCTGGTG CGGGCGGTGT GGTTGGACGC CGGGCCGACC
GCACCCGGCA GGCTGCTCCT CGTCGCGCAT CACCTGCTCG TCGACGGGGT GTCCCTGCGG
TTGTTGCTGG ACGACATACA CCGCGCGTAC GAGAGCGCGA CCGTCGGGGA CGGCCGGATC
TGCGTGCCCG CTCTGCCCCG GCACGGACAG TCCTTCCTCG GCTGGGCCCG TTCGCTTCGC
GAGGCCGCCC GACACCGTCG CGCGGAACTT CCACAGTGGA ACAGGACTGT CGCGGATCCC
GGGGAACCAC TCAGCGCGGT TCCGTTCGAT CCGGCCCGCG ACACCGCCGC CACCGCCGTG
CACCACGAAC GGTGGCTGGA CCCGGCGGCG ACCCGTGCGC TGCTGACCAC GTTGCCGTCG
GCCTACCGCA CCACCCCGGA CACCGTGCTG CTGACCGCGC TGGTCACGGC GGTGAGCGTC
TGGCGGGGCC GGAGGCCGGA CCTGCTCGTC GCGGTGGAGA GCCACGGCCG CCCACCACAC
AGTCCTGATC GGACCCGGCC GGTCGACCTG TCCCAGACGG TCGGCTGGTT CACCGCCGTG
TACCCGGCGC GACTGACGGT GCCGGACGGT TCCACCGCGG ACGGCATCAA GGCGGTCAAG
GAACAGCTGC GGGCCTACGG TGACGGGCTG GGCTACGGCA TCCTCGCCGC CTCGGGGGAG
CCGGCGCTCA CCACGGCCCC GGAACCGGAG ATCAGCTGGA ACTACCTGGG CCGGTTCCCC
GCCGCCCCGA CGGAGCCGAC ACCGTGGCAG CCCGCTCCGG AGGGGGAACC ACTCGGCTCG
GGCGGTGAGA CCGTGCCGCT GCCGCACAGC CTCATGGTGT CCGCGCTGGC CCGCGACGAC
GGGGAGGGTA TGGCGCTCGG GGTACGGTTC ACCTGGCCGG CGGCGGTGTT CGCCGAATCG
GAGATCCGCG AGCTCGCCGA CCACTTCCAC CGGGCCCTGG TCACCGTGGC CCACGACCCG
GACGTGCGGT CCGGGGCCGG GCTCACCCCC TCTGATCTGC CCCTGGTCGA GCTCGACCAG
GCAGCGATCG AGGCGTTGGA GAAGGAATAC CCGGTCGTCG ACGTCTGGCC GTTGACCGCG
TTGCAGGAGC TGATGCTGCG GCAGTCGCGG GCCAGGGCGG AGAACACGCC CGACCCGTAC
ACGGTGCAGT CGACGTTCTC CCTCGAAGGG CCGCTGGACG TCGACGCGTT GTTCGCCGCC
GGTGCCGACC TGTTGGAGCG CCACCCGAGT CTGGGCGCGG CGTTCCCGGA AGGACGCCAG
GACATCCAGG TCATCCGCAC CGGGGTGCGA CCGGACTGTC GGCTGCGGGA CGTCTCCGAC
CACGGACCCG AGACCCAGCA ACGGGCGGTC GAGGAGATCC TGACCGCGGA CCTGGCCGAA
CCGTTCACCC TGTCGGAGGG GCCCGCGGTG CGGATGACCG TGATCCGCCG GGGGCCCGAG
CGGGCGGAGT TCGTGCTCAC CAGCCACCAC GTGCTGTCGG ACGGCTGGTC GGCGCCGCGC
ATGCTCGCGG AGCTTTTCGC CCGGTACGCC GCGCGGGTGC GGGGAGGGGT GGACGATCGC
CTGCCCGAAC CGGTGCCGCT GAGTCGGTAC CTGCAGCGAG TGGCCGATCG GGACAGTGAC
GCCGACCATC GCGCCTGGCA GGCGGAGTTG GCGGATCTGC CCGAAGGGGA CTACGTGATC
GGTGACCGCA CGGAGATCCC GGTGGCACAG GACCCGGAAC CCGTTCTGTT CACCGTGGAC
GAGGACACGG TCGCCGATCT GACCCGGGTT GCGGCGCGGC GGGGACTGAC CCCGGGCACC
CTGGTGCAGG GCGCGTGGGC CACGGTTCTG GCCCTGCGCT CCGGTCGTCG GGACGTGTGC
TTCGGCGCCA TGGTCTCCGC ACGGACCCTC GAGGTCGACG GCATCGAGGA GATCGTCGGC
CTGCTGGCCA ACACCGTGCC GATGCGTGTG CGCTTCACCG GGACCCTGGC CGACGTGCTG
GCCGGGTTGC AGACACGTCA GCAGGCGACC GCGGACCGCC ACCACGTTCG GCTGTCCGAA
TTGGAACGGC TGACCGGGCG TGCTCGGCTG TTCGACAGCC TCGTGGTGTT CGAGAACTAC
CCGGTCGACC CCGACACCCT GCGCGAACCG GCGCCGGGGT TGACCATCGT CGGAACCCGC
TTCCGGGAAC GGACCCACCA CCCGCTGACG GTGACGATCA TGCCAGACGG CGGTGGTTGG
CGCGGTGTGC TTGGCTACCG GTCCGGATTC CTCGACGCCG ACGACGCCGC GGCACTCGCC
GACGACCTAC TCGCGGTGCT GCGGCACCTG CGCGTCGAGG ACAGGCTCGA CGTCGACGCG
CAGACCGTCC TGACGTCCGG CCTGCCCGGT GTGTCGACCC TACGAGACAA ACAGTGGTGA
 
Protein sequence
MSIARNPVHT TSTSTDPVLE LTSAQLGIWN AQRLEPDSGY YLVGDVIEIS GDRPVDTRLL 
AEAMRATVDE AETMRLRVFD TPSGPRQVIS DEPTPLPEVI DVSAEPDPTA AANELVARER
ARAAEACRGM VDRRLYSQTI IRLSDTEVWY TQLGHHLIFD GYSAAMLARR TATHYTALCR
GTEVAPSPFG RFVDVIRSDQ DYLASEQFER DRAYWLDRLT PLPELSGWTD PVTGPPKKTL
TARIVVSPED TARLRAAAER ERLSWGEALI ACYAAFLHRL QGRTDVVFAL PLMCRTSSAE
LRTPSMAVNV LPLRVTVRGG DRLGELGRRI AQAMRTMRTH QRYRGENLPR DLAVPGAGAL
LHGCGVNLKA FDLSLDFAGS VGVMRNVAGG PPEDMGLSVL PTRDGGLLLG FEVDARTHDQ
QAVDHKLAVL HTLLERLTGP DDPAVGQVEL VGPDHRRRLL DDWTVPAIPG RPRTVPEALA
ELAANHPDRT ALVHNGHELT FGQLADRVYR LARALRARGI GPEDVVALAL PRSADIVIAL
LATLEAGAAF VPLDIGHPPE RLRALIEDAG ADLVLTHRTG PRPDGDVPRL ALDAPDTLAE
LGSLPSGPLT VEELAGPRLP DQLAYVLYTS GSTGRPKGVL VRSGALAALL HHHRATITAE
AERAVGRPLR VAHTYSFAFD SAIDQVIWLL CGHELHIYDD EVFRDAQTLR SCCAADAIDV
VDTTPSMAAP LLDAGLLDGE HRPTLLILGG EALPRTLWRR IADTGVLARN MYGPTEATVD
SATARIDDGE PTIGLPLAGT RIYLLDNALQ PVPHGTVGEL YLAGPQLARG YLGRPGLTAQ
AFVADPFGAP GERMYRTGDL ARWVPGRGLE YLGRGDGQVK IRGHRVELGE VEAALGAVEG
VSAAAAVVRT DGELTRLVGY VVPEAGAQDL TPDSVRNALA DRVPDHLVPA AVVVLDELPL
TSNGKLDRAA LPAPRLASGG RPATTERERL LCDAVAEVFG LDEVGVDDDF FSLGGDSITA
ISVSSRLRNK GLVLRPRDLL AQRSFASLAT TLTAIPGETP APQDEPTGPV PATPIVRALL
DSNPQVRNVA GYVQWTAVAV DTDLTLTDLR TGVHRVLERH DALRLLLTLA PDGSIELVVR
PADAVSATAV VSEVNATDGD VADQIAALAQ RLAACLDPLT GDLLRIALLR TGPETPDRLV
VVAHHLVVDG VSWRVLLPDL QAACEAARAG RPVDPAPVGT SWRRYAMVLA EQGVTGRRRS
ELDFWRRAVD PDCRPLGCRA LDPARDTAAT AARSTTIATA PVSEAVLTTL PAAYRMGVDE
VLLAALVLTV RSWRRERHRP TDEAVTITME GHGREQLEPG IDLSRTVGWF TSEYPVRVPL
DPVHAEADLL DALAGGPAAG RLLRAAKEAK RAVPDGGIGY GVLRYLDPET RDELADTPAP
EVLLNYLGRF APLPGSGWRL PERDAFAVLD PDAKALEQVL ALNCFVHEED APRIAVEWTA
ARGILAAGTV EQLQRAWTAA LDALAAHARR IGPNGGGLTP SDLPLVSLDQ DTIDELEAHR
RLADVLPATA LQTGLSFHTL VRGDDDTDVY VVQAIMSLAG ELDPRRLAAA AEKLLRRHPT
LRVYLATTTA GDVVQAVPAD VELDWREIDL SALPPGERDA AFDTHARTEQ ERPFDPGEPP
LIRFLLCVLG ETEHRLVITN HHALLDGWSM PLVGRTLLAI YAELGGGPAA PAAPPPIEYY
RWLSERDRQA GLAAWREALA GVDEGTRLAP ATAHARIERP GRVSIPLGAE FSRRIRKFAR
DHGITVNTVL QTAWGLLLSR LTRRRDVLFG SPVSGRPAEV EGIESMIGQF GNTIPVRMRI
TPAEPAHDLL ARVHAESVAL SEHHHVGLPD IQRAVGVGEL FDTLFVMENF PLASRGRTPL
APGLELTGVD IVDATHYPLT VVVIPEDEIV IGLGYQPQVF DEATVRDYGR WLHNLLRELV
TDPRRPVGRL PMLDAEERRW LLRVGTEVVP ARPRRGVLEE FAAWVNRQPE AEAVVCRDRS
LTYHELDRRA NRLAHALLAS GVRPQDPVAV LLGRDVEMVV ALFAVLKAGA VYVPLDANYP
RERLAYMVDD AAPTAIVTTD RLWAELGGQL PTVQAIPVVR CDETPTDGVN GPAAWDHDPA
HARARITADS LAYVIYTSGT TGRPKGVAVT HRGLPDLVAL QEEVVGVTEK DRYLHFASTS
FDVAFWQTMV PLLSGGTLVI APEEVRVPGD ELFDYVARHR VTGVNLLPSF LAAVPDDCTV
GPDVFFVVGA ERLDPELARR WGDRRALFNA YGPTEVTINS VTWRYDPDDP GPLPIGRPDP
NVRAYVLDEG LCPVGVGVPG ELYLAGPKLA RGYLGRPGLT AQAFVADPFG APGERMYRTG
DLVQWRPDGQ LVFLGRVDRQ VKIRGFRIEP GEIESALTRR PDIRAAAVVV REDRPGERRL
VGYVIPRVGA GLDTDRIRED LARELPDHLV PAALVVLDRL PLSPSGKLDQ AALPAPSAGS
GAPAREPATP AEEALLGLVR ELLGTDEINL DDGFLDVGGD SIVSLQLVSR ARRLGWRLAP
RDVFDGGTVA GIAARCVRLG DGDAESAPAV GDAPLTPVMR DLLRRCEAAG VPADDFCQWV
QICVPPGGDV ATWHAVFDAV LARHDVLRGH LAVPADGGEP VLRIPPEGTV TAAQVVTHVR
VSETDDVRAL SDSWLRTARS GLNLWAGPLV RAVWLDAGPT APGRLLLVAH HLLVDGVSLR
LLLDDIHRAY ESATVGDGRI CVPALPRHGQ SFLGWARSLR EAARHRRAEL PQWNRTVADP
GEPLSAVPFD PARDTAATAV HHERWLDPAA TRALLTTLPS AYRTTPDTVL LTALVTAVSV
WRGRRPDLLV AVESHGRPPH SPDRTRPVDL SQTVGWFTAV YPARLTVPDG STADGIKAVK
EQLRAYGDGL GYGILAASGE PALTTAPEPE ISWNYLGRFP AAPTEPTPWQ PAPEGEPLGS
GGETVPLPHS LMVSALARDD GEGMALGVRF TWPAAVFAES EIRELADHFH RALVTVAHDP
DVRSGAGLTP SDLPLVELDQ AAIEALEKEY PVVDVWPLTA LQELMLRQSR ARAENTPDPY
TVQSTFSLEG PLDVDALFAA GADLLERHPS LGAAFPEGRQ DIQVIRTGVR PDCRLRDVSD
HGPETQQRAV EEILTADLAE PFTLSEGPAV RMTVIRRGPE RAEFVLTSHH VLSDGWSAPR
MLAELFARYA ARVRGGVDDR LPEPVPLSRY LQRVADRDSD ADHRAWQAEL ADLPEGDYVI
GDRTEIPVAQ DPEPVLFTVD EDTVADLTRV AARRGLTPGT LVQGAWATVL ALRSGRRDVC
FGAMVSARTL EVDGIEEIVG LLANTVPMRV RFTGTLADVL AGLQTRQQAT ADRHHVRLSE
LERLTGRARL FDSLVVFENY PVDPDTLREP APGLTIVGTR FRERTHHPLT VTIMPDGGGW
RGVLGYRSGF LDADDAAALA DDLLAVLRHL RVEDRLDVDA QTVLTSGLPG VSTLRDKQW
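
The gene sequence begins with TTG while the protein begins with M. This matches translation table 11, where TTG is one of several alternative initiation codons and an initiator codon is read as methionine. The sketch below is a minimal illustration, not the tool used to produce this record. It strips the 10-base grouping spaces, translates with the standard codon assignments (which table 11 shares with table 1), and treats a first codon from the table 11 start set as M. The function names and the start-codon set written out here are assumptions of the example.

```python
from itertools import product

# Standard codon assignments in TCAG order (the same for NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons for table 11, written out here as an assumption.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at '*'."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence block above.
first_line = "TTGAGCATTG CCCGCAACCC TGTCCACACG ACGTCGACCT CAACCGACCC GGTCCTGGAA"
print(translate(first_line))  # MSIARNPVHTTSTSTDPVLE
```

The output matches the first 20 residues of the protein sequence block. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.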