Gene Avin_25580 details


Gene Information

Locus tag: Avin_25580
Symbol:
ID: 7761470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2588080
End bp: 2602323
Gene Length: 14244 bp
Protein Length: 4747 aa
Translation table: 11
GC content: 66%
IMG OID: 643805440
Product: peptide synthase
Protein accession: YP_002799713
Protein GI: 226944640
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720; [TIGR01733] amino acid adenylation domain
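
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated on this page, but both are consistent with the listed values. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2588080
end_bp = 2602323
gene_length_bp = 14244
protein_length_aa = 4747

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final codon is the
# stop codon and is not counted in the protein length.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

All three checks hold for this record: 14244 bp gives 4748 codons, one of which is the stop codon.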


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCG AAGACGCACA GAAACTTGCC CGCCGCTTCA TCGAATTGCC ACAGGAAAAA 
CGCCGTCTGT TTCTGCAAGG CTTATGGGAG GAGGGAGTCG ACTTCTCACT GTTTCCCATC
CCGGCAGACG TGGGTATCCC CGAACGCCAA GGTCTGTCCT ATGCCCAACA ACGCATGTGG
TTTCTCTGGC AACTGGATCC GCAAAGTCCC GCCTACAATT TGCCAATGGC TGTTCGTCTG
GAGGGTGAAC TGGAGCGAGT TGCGCTGCAG GATGCTTTCG ACGCTTTAGT GACACGCCAT
GAAACCTTGC GAACCCGCTT TCGCCAGCAG GATGGCGGAA TTCGCCAAGA GGTGCTGGAA
CCATTGTCGG TCACCATCGG CTTCGAAGAT CTGACGGCGC TGCTACCTGC CAAGCAGGAT
GAACAGGTTA GTAAACTGGC CCAAGTAGAG GCTATGGCAC CTTTCGACTT GGCCGAGGCG
CCCTTGTTGC GGGTACGCCT ACTGAAACTG TCGGACTCGG AACATGTGTT GTTGCTCACC
CTGCATCATA TCGTTGCCGA TGGTTGGTCA CTGAATCTGC TGATCGACGA GTTCATCCAT
CTCTACGACA CGGCCTGTTC CGGCAGGCAG GCTGAACTGC CGGCGTTACC GATCCAGTAT
CGCGACTATG CTCTATGGCA GCGCAGTTGG CTGGAAGCCG GCGAGCGAGA GCGGCAACTG
ATCTACTGGC GCGACAAACT GGGAGGCGAG CACACATCGC TGGAGTTGCC CACCGATCGC
CCTCGCCCAG CAGTTCCCAG TTACCGGGGG ACCCGTCATG AGTTCCGGAT CGAGCCATTG
CTCTCCGACC AACTGCGGGC ATTGGCCAAG CGCCATAATG TTACTTTATT CATAGTCTTG
CTGGCTGCCT TCAAGCTGCT GATGCAACGC TACAGCGGAC AATCCGTTAT CCGCATCGGC
TCCCCCATTG CCAACCGTCA TCGCTCGGAA GTAGAGGGAT TGATCGGTTG TTTTATCAAC
ACCCAGGTTC TGCATACGGA GATCGATCCG CTGATCGATG TCGGCGAACT GTTACGGCGG
GTCAAGGAAA CTGTACTAGG TGCACAGGTA CATCAGGATC TGCCGTTCGA ACAACTGGTC
GAAGTACTGA ATCTGGAGCG CGACACGGGA CAGTCGCCCT TGTTCCAAGT GCTGTTCAAC
CATCAGCCGA ACGTTACCGA CGTTCGGGAA CTGAAGACCC GTTCCGGCTT GACGCTGGAG
CGTATCGAGC CGGCAAGGCA CACGGCCCGT TTCGATCTGG CGCTGGATAC CTACGAGAGT
GCGGGACAGT TGTACGCAGC TTTCACCTAT GCCCTGGAGG TGTTCGACGG CACGACCGTC
GCCAAGTTGG AGGAGCACTG GCTGCGGTTG CTGGAAGGAA TCGCGGAAGA AGAGCCGACT
GTCGTCGGCG AACTGTCCTT GTCGCAGGTA ACAGATGCGG CGGACGAGCG TGTCGACCAC
AGCCTGGAGG AGAGCGACTG CGTTCATCAA CTGATCGAAC GAGCTGCGAG CCAACATCCG
GAACGACTGG CGGCAGTCAG CGGCAATGAT GTCATCAATT ACGCTCGCCT GAACGAGCGA
GCCGACGAAC TCGCCCGGGT GTTGTTCGAT GCCGGTGTTC TGCCCGACCA GCGGGTTGGG
GTGGTGGGTG ATCGTTCCAT CGACATGCTG GTCGGCATTC TGGGCATCCT GAAGGCAGGG
GCGGCCTATC TGCCTCTGGA ACCCGACCAG CCACAAGAAC GCCTGGCGTT CATGCTCGCC
GATAGCGATG TGCGACTGGT TCTGGGGCGT TCTTCCTGGG AGGGGCTCCT GCCGGATGGC
GTCCGGATGA TATGCCTGGA CGAGCCCCTG CCGCCTGTTT CCGGGAGCGC CGGGCTACAC
GTCCGTGTTT CCCCCGGCAA TCTTGCCTAT GTGATCTATA CCTCGGGGAC CACGGGCATG
CCAAAGGGCG TGGCGGTTCC CCATGGAGCG CTGGCGAACT ACGTCGAGGG CATTTCCCGG
CGGCTTCCCC TGGAAGCGAT CAGCAGCATG GCCATGGTGT CCACGCCCGC CGCGGACCTG
GGGCACACGG TCCTTTTCGG CGCCCTGTGC GCGGGCAAGA CCCTGCACTT GCTGGACAAA
GAGACGGTCC TCGACGCCGA GGCGTTCGCT GCCCACATGG ATGCACATGG GGTGGATGCA
CTGAAGATCG TACCCTCGCA CCTGGATGCC ATGCTGTCGG CCGGCCGCTC GGCACTGCCC
AGGCGTTGCC TGGTCCTCGG CGGGGAAGCC TGTCCGCCCG CGCTACTGGC CAGGATCGTC
GCCTTGGCTC CCGAACTGAA GGTACTCAAC CACTATGGCC CCACGGAGAC CACCGTGGGG
GTCCTGATCG GAGAATTGAA GGGACTGCCG GTGCTGGGAA GCCCGCTGGA GAATGTCGGG
GTCCGGAGAC TGGATGCCTG CCTGCAGCCC GCTCCCGGTC CGGCCAAGGG AGAGCTGCAC
ATCTCGGGGG CGGGGCTTGC CAGAGGGTAT CTGGGACGTC CTGCGCTGAC AGCCGAACGC
TTCGTCCCCG ATCCCTCCGG CACTCCTGGG GGACGAATGT ACCGGACTGG CGATTGGGTG
CGGCGCAACG CCGATGGCGG ACTGCTGTTC GCCGGGCGCA TGGACGGCCA GGTGAAGATT
CGCGGCTATC GTGTCGAACT GGCCGAGATC GAAAGCCGCC TGCGGGCCTT GCCAGGTGTC
GGGAACGCAT TGCTGCGGGT GATCGGCGAG GAACATGCCC GGCAACTGGT GGCCTATCTG
GTGCCGACAG CGGTACCGGA TGGCCAGGCG GGACAGGCGT TCCTGGACGA GATACGTACG
GTCCTGAAAC GGGTGCTGCC GGAACACATG GTGCCGACGC ATCTGCTGGT CCTGGAGCAC
CTGCCGGTAA CCGCCAACGG CAAGGTCGAC CTCAAGGCCC TGCCCGAACC CGTCGCCACC
TCGGCGACTT ACGTCGCACC CGGAACGCCT CTGCAAACGC GGTTGGCCGA GATCTGGGCC
GAGGTGCTCA AGGCCGAGCA GGTGGGCCTG ACCGACAACT TCTTCGAGCT GGGCGGACAT
TCCCTGCTGG CCACCCAAGT GATTTCCCGA GTACGCCAGA GCCTGGGCAT CGAGTTGCCC
CTGCGCGCCC TGTTCGAAGC GCAGGACCTG GCCGGTTTCG TCGGGCGGGT CGGCCTAGGC
CAGGTCAGCC AGGCGCCCGC CCTCGAAAAG GCCGACCGCG ACCAGCCCCT GGTCGCCTCC
TACGCCCAGC AGCGCCAGTG GTTCCTCTGG CAACTGGAGC CCGGGAGCGC CGCCTACCAC
ATCCCGGCGG CGCTACGCCT GAAAGGCGCC CTGGACATCG AGGCCCTGCG GCGCAGCTTC
GACGCGCTGA TCCGGCGCCA CGAGTCCCTG CGCACCACCT TCCGCCAGGA CGGCGAGCGG
ACGCTCCAGG TCATCCACCC CAGCGGCAGC CTCTGGTTCG AGCAGGAGCC GCTGCCGGCG
GATGCCGCCA TCGGCCTGGA CGAGCGGATC CGGGTTCAGG TCGAAGCAGA GGTCCAACGC
CTGTTCGACC TGGAGCAGGG CCCGCTGCTG CGGGTGAAGC TGCTGCGCCT GGACGAGGAC
GATCATGTCC TGATCCTGAG CCTGCACCAC ATCGTCTCGG ACGGCTGGTC CACCCCGATC
ATGGTGGACG AGCTGGTCCG TTTGTACGAA GGCTATAGCC AAGGCCACGA GGTGACGCTG
CCGGAGTTGC CGGTCCAGTA CGCCGACTAT GCCCTCTGGC AGCGCCAGTG GATGGAGGCC
GGCGAACGGG ACCGGCAACT GGCCTACTGG ACCGCGCAAC TGGGCGGCGA ACAGCCGGTG
CTGGAGTTGC CCGGCGACCG GCCGCGGCCG GCCGTGCAGA GCCATGCCGG AGCCCGGCTG
GCCGTCGAAC TCGACGGCGA ACTGGCCCGC TCCCTGCGGG AACTGGCCCG GCGGGAGGGT
GTGACCCTGT TCATGCTGCT GCTGGCCAGC TTCCAGAGCC TGCTGCACCG CTACAGCGGG
CAGGACGACA TCCGGGTGGG CGTGCCGATC GCCAACCGCA CCCGGGCGGA AACGGAAGGG
CTGATCGGCT TCTTCGTCAA CACCCAGGTG CTGAAGGCCG AATTCGAGCT GCAGACCACC
TTCGTCGAGC TGCTGCGCCA GGTGAAACGG ACCGCCCTGG AGGCCCAGGC ACACCAGGAC
CTGCCGTTCG AGCAACTGGT GGAGGCCCTG CAGCCGGAGC GCAGCCTGAG CCACAGCCCG
TTGTTCCAGG TGATGTACAA CCACCAGACG GCGGTGAAGG GCGCGGCGCG AAGCCTGCCC
GGGCTCAGCG TGGAAGGGCT GGCATGGGAG AACCGTACCA CCCAGTTCGA CCTGACGCTG
GACAGCTACG AAGGCCCGGA CAGCCTGGGC GCTTCGCTGA CCTATGCCAC CGACCTGTTC
GATGCCTCGA CCGTCAAGCG CCTGGCCGGG CACTGGCGCA ACCTGTTGGC AAGTATCGTC
CGCCAGCCGG AACGGCGCAT AGGCGAACTG CCGCTGCTGA CCCCGGAGGA ATACCGCCAG
ATCGTCCACG CGTGGAACCG GACCGAGGCC CGCTACCCGA GCGAGCGCGG CGTGCACCAG
TTGATCGAGG AGCAGGTGGC GAGGACCCCG GAGGTGGTGG CCCTGGTGTT CGGCGAGCAG
GAAATGAGCT ACAGGGAACT GAACCGCAGG GCCAACCGCC TGGCGCACCG GCTGATCGAA
CTGGGCGTGG GCCCGGATGT GCTGGTGGGC GTCGCGGTAG AGCGGGGCTT CGAGATGGTG
GTCGGGCTGC TGGCGATCCT CAAGGCCGGC GGGGCCTATG TGCCGCTGGA CCCGGAATAC
CCGCGGGAGC GGCTGGCGTA CATGATCGAG GACAGCGGTA TCGGTCTGCT GCTGACCCAG
CGGCACCTGC AGGACCGGCT GCCGTCGGCC GACGGGGTGC AGAGCCTGTT CCTGGAGCCG
GGCGACGACT GGCTGGAGAA CTATCCGCCG GAAAACCCGG CGAACCGGAC GGCGCCGCAA
AACCTGGCCT ATGTGATCTA CACCTCCGGA TCGACCGGCA GGCCGAAAGG GGCGGGCAAT
ACGCATACGG CCTTGATTAA CCGCCTGCAC TGGATGCAGA AAGCCTACCA GCTCGATACA
ACGGATACGG TCCTGCAAAA GACGCCGTTC AGCTTCGATG TTTCGGTATG GGAGTTGTTC
TGGCCGCTAT TGAATGGAGC ACGGCTGGCG ATCGCACGGC CCGGCGAGCA CCGCGATCCG
GAGCGCCTCA TCGACACGAT TGAGCGTCAT GGAGTAACGA CCCTGCATTT CGTGCCCTCG
ATGCTCCAGG CTTTCATCTC CGTCGAACAC ATCGAAGGCT GCCGGAGCAT CCGCCGGCTC
GTGTGCAGCG GCGAGGCCCT CCCGGCGGAG CTTGCCCGCA AGACCCTGGA AAGAATGCCC
GCGGTTGGCC TGTTCAATCT GTATGGTCCG ACCGAAGCGG CTATCGATGT GACCCATTGG
ACTTGTGATC ATGTCGATCC CGAGGGCGTG CCGATCGGCC AACCGATCGA TAACCTGAAG
ACCCACATAC TGGAGGAGAG CCTTCATCCA GTAGCACCGC GTTGTTGCGG CGAGCTGTAT
CTGGGGGGAG TGGGGCTTGC CCGTGGTTAC CACAACCGCC CGGGCTTGAC CGCCGAGCGC
TTCATCCCCG ATCCATTCGA TACCAGCGAA CAGGGCGGCG GACGCCTGTA CCGTACCGGC
GACCTGGCCC GCTACCGGGC CGGCGGGGTG ATCGAGTACC TGGGCCGTAT CGATCATCAG
GTGAAGATCC GCGGCTTCCG TATCGAGCTA GGCGAGATCG AGGCCCGGCT GCGGCAGCAC
GGGGCGGTAC GCGAGGCGGT GGTGATCGAC GTCGAGGGAG CGGGCGGCAG GCAACTGGCC
GCCTACCTGG TGCCCGACGA CCCGGCGATG CTGGACGGCG ACGAACGCCA GGGCGCTCTG
CGCGGCGAGC TGAAGGACCA TCTCCGGGCA GCGCTGCCGG ACTACATGGT GCCGGCGCAC
CTGGTGTTCC TCGCGCGGCT GCCGCTGACG CCCAACGGCA AGCTGGACCG CAAGGCGCTG
CCGCGGCCGA ACGCCAGCCT GCTGCAGCGG GCCTACGTGG CCCCGGCGAG CGAGCTGGAG
CAGCGGATCG CGGCGATCTG GGCCGAGGTG CTCAAGGTCG AGCGGGTGGG CCTGACCGAC
AACTTCTTCG AGCTGGGCGG CGACTCCATC GTCTCGATTC AGGTGGTCAG TCGGGCGCGC
CAGGCCGGCA TCGCCTTCAC GCCCAAGGAG CTGTTCCAGC ACCAGACCGT TCAGGGGTTG
ACCACGGTAG CCCAGCGCAC GGGAGGTCTG TGCATCGACC AGGGAGCGGT TACCGGTACG
ATGCCCCTGA CCTCGATCCA GCGCATGTTC TTCGAGGAGG AAATCCCCGA GCGTCACCAC
TGGAACCAGT CGGTATTGCT CGAGCCGAAC GAGCGGCTCG TCGTCGAGCC TTTGGAGGCG
GCCCTCCAGA GGGTGGTCGA GCACCACGAT GCACTGCGCT TGCGTTTCAT CCAGCAAGAG
GGGCGATGGA GCGCCGGTTT CCGGGATAGG GAGGAAGCGG AACTGCTGTG GCAGAGCCAG
GTATCCGATA TCGGTGAACT GGAGGCTGTC TGCAACGAGG CCCAGGCCAG CCTGGACCTG
GAACATGGGC CGTTGTTGCG AGCGGTACTG GCCAGCCTGC CGGATGGCCG GCAGCGTTTG
TTGCTGGTCA TCCACCACTT GGTGGTGGAC GGGATCTCAT GGCGGGTGCT GCTGGAAGAT
CTGCAGACCG CCTACCGGCA AGCCGTCCAG AGCCAGCCAC TGCACCTGCC GGCCAAGAGC
AGTTCGTTCA AGGCCTGGTC CGAGCGCCTG GAGCAGTACG CGGCGGAAGC GACCCTCGCC
GAGGAACTGG GCTACTGGCA ACGGCAGTTG CAAGAGGCTA CCGATGAACT GCCTTGCGAC
CATCCGCAAG GCGGTCATCA ACGGAAACAG GCAGCCTTCG CGACCACCCA TCTGGACCGT
GACTGGACAC GCCGCCTGCT GCAAGAGGCA CCGGCCGCCT ACCGCACCCA GGTCAACGAC
CTGCTGCTGG CCGCCCTGGC CCGGGTGATC TGCCGCTGGA GCGGTCAGGC CTCCACCCTG
GTCAGGCTGG AAGGCCACGG CCGGGAGGAC CTGTTCGACG ACCTCGACAT CACCCGGACG
GTGGGCTGGT TCACCAGCCT GTTCCCGGTG AAGCTGACGC CCCGGAGCGA AATCGCCGAC
TCGATCAAGA CCGTCAAGGA ACAGTTGCGC GCCGTGCCGA ACAAGGGCAT CGGCTACGGC
CTGCTGCGCT ATCTAGGCTC GGAAACCGCC AGGCAAACAC TGCAACGCCT GCCCCGCGGC
GAAATCGTCT TCAACTACCT GGGCCAGTTC GACCAGAGCT TCGAGGCCGA GGCGGCCTTG
TTCGCCCCTG CCGGGGAAAG CAGCGGCCAG GGACGAAGCG AAGCAGCACC ACTGGACGCT
CTCCTCAGCC TCGACGGCCA GGTCTACGGC GGTGAACTGA GGCTGACCTG GACCTTCAGC
CGGGAACGCT TCGCCGAGGC GACGATCCAG CGCCTGGCCG ACGCCTATGC CCGGGAACTC
GAGGCCCTGG TCGGGCACTG TGCGGATGAG AACAATCGCG GCATGACCCC CTCGGACGTC
CCGCTGGCCG GATTGAGCCA GGAGCAACTG GACGCACTGC CGGTCCCCGC AGGGGAAATC
GAGGATATCT ATCCGCTGTC GCCGATGCAG CAGGGCATGC TGTTCCACAG CCTACTGGAG
GGGGAGGCGG GCCACTACAT CAACCAGATG CGGGTGGACG TGGAAGGGCT GGACGTCGCA
CGCTTCAGGG CCGCCTGGCA GGCGGTGGTC GACCGGCACG AGGTGCTCAG GGCCAGCTTC
GTCGAGGTGG ACGGACGTCC GCTGCAGGTG ATCCGCAGGC AGGTTTCCAT GCCTTGTGTC
GAACTGGACT GGCGCAGCCA GCCGCAACTG CAGGACAGCC TGGACACCTG GGCGCAGACG
GACCGGCAAC GGGGCTTCGA CCTGGAGCGC GAGCCGCTGC TGCGGCTGGC GGTGATCCGC
ACCGGGGAGA ACCGCCACCA CCTGATCTAC ACCAACCACC ACATCCTGCT GGACGGCTGG
AGCGGCTCGC AACTGCAGGG CGAGGTGCTG CAGGCCTATA CGGGCAAACC CATCGGGCAC
CCGGGCCTCC GCTACCGCGA CTACATCGCC TGGCTGGGCC GGCAGGACCG GGCCGCCAGC
GAAGCCTTCT GGCGGGAGCA ACTCGGCGCC CTGGAGGAGC CGACCCGGCT GGCCCAGGCC
ATCAAGATCG ACGAAGCGGA GAAACGAGCC GGTCATGGCG ACCATCAGCA GGTGTTCGAC
CCGCAACAGA CCCGTTGCCT GAGCGAGTTC GCCCGGGCCC AGCGGGTGAC GGTCAACACC
CTGGTGCAGG CGGCCTGGCT GCTGTTGCTA CAGCGCTACA GTGGGCAGGC GACGGTGTGC
TTCGGCGCCA CCGTGGCGGG ACGTCCGGCC GAGCTGCAGG GCGTGGAAGG GCAACTGGGC
CTGTTCATCA ACACCCTGCC GGTGATCGCC AGCCCCCGGG CGGAACAGCG CGTAGGCGAG
TGGCTCGGCC AGGTCCAGGC GCAGAACCTG GGCCTGCGCG AACACGAACA CACGCCGCTG
TACGAGATCC AGCGCTGGGC CGGGCTGGGC GGCGAGGCGC TGTTCGACAG CCTGCTGGTG
TTCGAGAACT ACCCGATCGC CGAGGCATTG CAGCAGGGCG CGCCCGAGGG CCTGGTGTTC
GAGCGGCTAG CCGTCCGGGA ACAGACCAAC TACCCGCTGA CCCTGGCCAT CGGCCTGGGC
GAAACGCTGA CGGTGCGCTA CGGCTACGAC CGCGGGCATT TCGACGCAGC AGGCATCGAG
CGGATCGCGG GGCATTTCGC CCGGCTGCTG CAAGGCCTGA CCAGCGACGC CCGGGCCGCT
ATCGGCGAAC TGCCGCTGCT GGCCCCGGAG GAATACCGCC AGATCGTTCA CGACTGGAAC
CGGACCGAGG CCAGCTACCC GAGCGAGCGC AGCGTGCACC AGTTGATCGA GGATCAGGTG
GCGAGGACCC CGGAGGCGGT GGCCCTGGTG TTCGGCGAAC AGGAAATGAG CTACGGAGAG
CTGAACCGCA GGGCCAACCG GTTGGCGCAC CGGCTGATCG AGCTGGGCGT GGGTCCGGAT
GTGCTGGTGG GTATCGCGGT GGAGCGGGGC GTCGAGATGG TGGTGGGGCT GCTGGCGATC
CTCAAGGCCG GCGGGGCCTA TGTACCGCTG GACCCGGAGT ACCCGCGGGA GCGGCTGGCC
TACATGATCG GGGACAGCGG CGTCGGCTTG CTGCTGACGC ATGCGTCGCT GCTCGAGCAC
CTGCCCCGGG AGCACCACGA CAAGGCCCTG CTGCTTGATC GGTTGTCCTT GGAAGGCTAT
CCGCCGGAAA ACCCAGTCAA CCGGACGATG CCGCAGAACC TGGCCTACGT GATCTATACC
TCGGGCTCCA CCGGGCAGCC CAAGGGTGCG GCGGTGCGAA TCGGCAGCTT CGTCAACCTG
CTGCACTGGT ACCGGGCCGC CTGCGAACTG ACCGCGGACG ACCGGGTGCT GCTGCTCAGC
TCCTACAGCT TCGACCTGAC CCAGAAGAAC CTGTACGGCG TGCTGTGCGC GGGAGGGCAG
TTGCATATCG CCCCGGCGGG CTACGACCCG GACAGCCACC GCCGGCAGAT CGGAAAACAC
CGGCTGAGCG TGCTCAACTG CGCCCCGAGC GCCTTCTACC CCCTGCTGCA GGGCGATCGC
GCCGAACTGG CCAGCCTGAA ACACGTCCTC TTGGGGGGCG AGGCCATCCA GCCGGGAGAA
CTGGCGGAGT GGCTGGAGTC GCCCCAGGCG GCGAACGTCT CGATCCACAA CACCTACGGT
CCGACCGAAT GCACGGACGT GGTGATCGCC CGGGCGACGC CGGGAAGCGC GGTGCCGGGG
CTGAGCGCCC TGCCGATCGG GCGGCCGCTG CCCGGGGTCA GCGCCTATGT CCTCGACGGC
TCGGCCGGGC CGGTCGCCCT CGGGCAGGCG GGAGAACTGC ATATCGGCGG GGACTGCGTG
GGCGAGGGCT ACTGGCATCG CCCCAGCCTG ACCGCCGAAC GCTTCGTCCC CGACCCGTTC
GACGACAGCG CGCAGGGCGG CGGGCGCCTG TACCGCACCG GCGACCTGGC CCGCTATCGG
GCCGACGGGG TGATCGAGTA TCTGGGCCGT ATCGACCACC AGGTGAAGAT CCGCGGCTTC
CGCATCGAGC TGGGCGAGAT CGAGGCCCGG CTGCAGCAGC ACGGAGCCGT GCGCGAAGCG
GTGGTGATCG ATATCGACGG GGCGGGCGGC AAGCAACTGG CCGCCTACCT GGTGCCCAAC
GACCTGGCGA TACTGGACGG CGACGAACGC CAGGGCGCTC TGCGTATCGA GCTGAAGGAC
CATCTCCGGA CAACGCTACC GGACTACATG GTGCCGGCGC ACCTGGTGTT CCTCGCGCGG
CTGCCGCTGA CCCCCAACGG CAAGCTGGAC CGCAAGGCGC TGCCCCGACC GGACGCCAGT
CTGCTGCAGC GGGCCTACGT GGCCCCGGCG AGCGAACTGG AGCAACGGAT CGCGGCGATC
TGGGCCGAGG TGCTCAAGGT CGAGCGAGTG GGCCTGACCG ACAACTTCTT CGAGCTGGGC
GGACATTCCC TGCTGGCCAC CCAAGTGATT TCCCGAGTAC GCCAGAGCCT GGGCATCGAG
TTGCCCCTGC GCGCCCTGTT CGAAGCGCAG GACCTGGCCG GTTTCGCCGG GCGGGTCGGC
CTAGGCCAGG TCAGCCAGGC GCCCGCCCTC GAAAAGGCCG ACCGCGACCA GCCCCTGGTC
GCCTCCTACG CCCAGCAGCG CCAGTGGTTC CTCTGGCAAC TGGAGCCCGG GAGCGCCGCC
TACCACATCC CGGCGGCGCT ACGCCTGAAA GGCGCCCTGG ACATCGAGGC CCTGCGGCGC
AGCTTCGAGG CCCTGATTCG GCGTCATGAG TCCCTGCGCA CCATCTTCCG CCAGGACGGT
GAGCGGACGA TCCAGGTCAT CCCTCCCAGC GGTAGCCTCT GGTTCGAGCA GGAGCCGCTG
CCGGCGGATG CCGCCATCGG CCTGGACGAG CGGATCCGGG TTCAGGTCGA AGCCGAGGTC
CAACGCCTGT TCGACCTGGA GCAGGGCCCG CTGCTGCGGG TGAAGCTGCT GCGCCTGGAC
GAGGACGACC ACGTGCTCGT GCTGACCCTG CACCACATCG TCTCGGACGG CTGGTCCACC
CCGCTCATGG TGGACGAACT GGTCCGCCTG TACGAGGGCT ACAGCCAGGG GCACGAGGTG
AGGCTGCCGG AGTTGCCGGT CCAGTACGCC GACTACGCCC TCTGGCAGCG CCAGTGGATG
GAGGCCGGCG AACGGGACCG GCAACTGGCC TACTGGACCG CGCAACTGGG CGGCGAACAG
CCGGTGCTGG AGTTGCCCGG CGACCGGCCG CGGCCGGCCG TGCAGACCTA TGCCGGAGCC
CGGCTGGCCG TCGAACTCGA CGGCGAACTG GCCCGCTCCC TGCGGGAACT GGCCCGGCGG
GAGGGGGTGA CCCTGTTCAT GCTGCTGCTG GCCAGCTTCC AGAGCCTGCT GCACCGCTAC
AGCGGACAGG ACGACATCCG GGTGGGTGTG CCGATCGCCA ACCGCACCCG GGCGGAAACG
GAAGGGCTGA TCGGCTTCTT CGTCAACACC CAGGTGCTGA AAGCCGAATT CGGGCTGCAG
ACCACCTTCG TCGAGCTGCT GCGCCAGGTG AAACGGACCG CCCTGGAGGC CCAGGCGCAC
CAGGATCTGC CGTTCGAGCA ACTGGTGGAG GCCCTGCAGC CGGAGCGCAG CCTGAGCCAC
AGCCCGTTGT TCCAGGTGAT GTACAACCAC CAGACGGCGG TGAAAGGCGA GGCGCGGACC
CTGCCCGGAC TCCGCGTGGA AGGGCTGGCA TGGGAGAACC GTACCACCCA GTTCGACCTG
ACGCTGGACA GCTACGAAAG CGCGGACAGC CTGGGCGCCT CGCTGACCTA TGCCACCGAC
CTGTTCGATG AACGGACTAT CGAACGGCTT GCCCGGCACT GGTTGAACCT GCTGGCCGGC
ATCGTCCGCC AGCCGGAACG GCGCATCAGC GAACTGGCGC TGCTGGACCC GGAGGAATAC
CGGCGGATCG TCCACGCGTG GAACCAGACC GAGGCCCGCT ACCCGAGCGA GCGCGGCGTG
CACCAGTTGA TCGAGGAGCA GGTGGCGAGG ACCCCGGAGG CGGTGGCCCT GGTGTTCGGC
GACGAAGCAC TGACCTACGG GGAACTGAAC CGCAGGGCCA ACCGGTTGGC GCACCGGCTG
ATCGAGTTGG GCGTGGGCCC GGACGTGCTG GTGGGCATCG CGGTGGAGCG AGGCTTCGAG
ATGGTGGTGG GCCTGCTGGC GATCCTCAAG GCCGGCGGAG CCTATGTGCC GCTGGACCCG
GAGTACCCGC GGGAGCGGCT GGCGTACATG ATCGAGGACA GCGGTATCGA TCTGCTGCTG
ACCCAGGAAC ACCTGGCAGA CCAGCTACCA GCGGCAAGTG TGAATATCTG GCGCCTGGAC
AGCGATTGGA GTGAGCTGAA CGGATTTCCC GCATCCAACC CGGATCTTCC GCTCCATCCA
GAGCATCTGG CGTATTGCAT CTACACCTCG GGCTCGACCG GCAGACCCAA GGGGGTTGCC
GTTCGTCACC AGGCACTGAC CAACTTTATG GCCAGCATGG CCTCGCAGCC GGGGCTGGAT
GCCAACGACA GGATGTTGGT CCTGACCTCG CTGTCCTTCG ACATTGCCGC TTTGGAGCTC
TATCTGCCCC TGCTGGTCGG AGGAACCGTG GTGTTGCTGT TCAACCATCA GAACAGGGAT
GCCCAGGCCT TGCTGGAAGT CATCGACCGG CAGTCGGTCA GTGTCGTCCA GGCCACGCCT
TCGACCTGGC GAATGCTGCT GGATACGGCA TCGCCCGGAG CTTTGCGGGA CTGTAAGTTG
CTGAGCGGTG GTGAGGCGTT GTCGCCGGAC TTGACGGAGC GGCTGCTCCG TCAGGCCGGC
CATGTCTGGA ATCTGTATGG TCCGACCGAG ACGACCATCT GGTCGGGGCT GTACCACATC
GATGCGGAAC ATCCGTCTCC ATGGCTGGGC AGGCCGATTG CCAACACCAC CTTGCACATT
CTGGAAAAAA GCTTTGCTCC AGTGCCAGAA AGGGTTGCGG GTGAGCTATT GATAGGTGGC
GATGGACTCG CCAGGGGGTA TCTGCATCGC CCCGACCTGA CTGCCGAACG CTTCATCCCC
GACCCGTTCG ACGACAGCGA GCAGGGCGGC GGGCGCCTGT ACCGCACCGG CGACCTGGCC
CGCTACCGGG CCGACGGGGT GATCGAGTAC CTGGGCCGTA TCGACCATCA GGTGAAGATC
CGCGGCTTCC GTATCGAGCT GGGCGAGATC GAGGCCCGGC TGCAGCAGCA CGGGGCCGTG
CACGAGGCGG TGGTGATCGA TATCGACGGG GCGGGCGGCA GGCAACTGGC CGCCTACCTG
GTGCCCGACG ACCTGGCGAT GCTGGACGGC GACGAACGCC AAACCGGACT GCGCCGCGAG
CTGAAGGCGC ACCTCGGGGC GGCGCTGCCG GACTACATGG TGCCGGCGCA CCTGGTGTTC
CTCGAACGGC TGCCGCTGAC GCCCAACGGC AAGCTGGACC GCAAGGCGCT GCCGCGGCCG
GACGCCAGTC TGCTACAGCA GCAATACGTC GAACCGCAGA CCGAACTGGA GCAACGGATC
GCAGCGATCT GGGCCGAGGT GCTCAAGGTC GAGCGGGTAG GGCTCACCGA CAACTTCTTC
GAACTGGGCG GCCACTCCCT GCTGGCCACT CAGGCGATTT CCAGCATCAA CGTGCAACTC
GGCATCGACT TGCCGCTTCG CCTCATCTTC GAAAAACCCA TATTGAACGA ATTCTCCATG
ACATTGGAAA ACCATGGCCT GTCTCTGAGC GAGGCCGACT TGAGCGATAT CGAAAAATTG
ATGAATGAAA TGGCAGAGGT TTGA
 
Protein sequence
MNAEDAQKLA RRFIELPQEK RRLFLQGLWE EGVDFSLFPI PADVGIPERQ GLSYAQQRMW 
FLWQLDPQSP AYNLPMAVRL EGELERVALQ DAFDALVTRH ETLRTRFRQQ DGGIRQEVLE
PLSVTIGFED LTALLPAKQD EQVSKLAQVE AMAPFDLAEA PLLRVRLLKL SDSEHVLLLT
LHHIVADGWS LNLLIDEFIH LYDTACSGRQ AELPALPIQY RDYALWQRSW LEAGERERQL
IYWRDKLGGE HTSLELPTDR PRPAVPSYRG TRHEFRIEPL LSDQLRALAK RHNVTLFIVL
LAAFKLLMQR YSGQSVIRIG SPIANRHRSE VEGLIGCFIN TQVLHTEIDP LIDVGELLRR
VKETVLGAQV HQDLPFEQLV EVLNLERDTG QSPLFQVLFN HQPNVTDVRE LKTRSGLTLE
RIEPARHTAR FDLALDTYES AGQLYAAFTY ALEVFDGTTV AKLEEHWLRL LEGIAEEEPT
VVGELSLSQV TDAADERVDH SLEESDCVHQ LIERAASQHP ERLAAVSGND VINYARLNER
ADELARVLFD AGVLPDQRVG VVGDRSIDML VGILGILKAG AAYLPLEPDQ PQERLAFMLA
DSDVRLVLGR SSWEGLLPDG VRMICLDEPL PPVSGSAGLH VRVSPGNLAY VIYTSGTTGM
PKGVAVPHGA LANYVEGISR RLPLEAISSM AMVSTPAADL GHTVLFGALC AGKTLHLLDK
ETVLDAEAFA AHMDAHGVDA LKIVPSHLDA MLSAGRSALP RRCLVLGGEA CPPALLARIV
ALAPELKVLN HYGPTETTVG VLIGELKGLP VLGSPLENVG VRRLDACLQP APGPAKGELH
ISGAGLARGY LGRPALTAER FVPDPSGTPG GRMYRTGDWV RRNADGGLLF AGRMDGQVKI
RGYRVELAEI ESRLRALPGV GNALLRVIGE EHARQLVAYL VPTAVPDGQA GQAFLDEIRT
VLKRVLPEHM VPTHLLVLEH LPVTANGKVD LKALPEPVAT SATYVAPGTP LQTRLAEIWA
EVLKAEQVGL TDNFFELGGH SLLATQVISR VRQSLGIELP LRALFEAQDL AGFVGRVGLG
QVSQAPALEK ADRDQPLVAS YAQQRQWFLW QLEPGSAAYH IPAALRLKGA LDIEALRRSF
DALIRRHESL RTTFRQDGER TLQVIHPSGS LWFEQEPLPA DAAIGLDERI RVQVEAEVQR
LFDLEQGPLL RVKLLRLDED DHVLILSLHH IVSDGWSTPI MVDELVRLYE GYSQGHEVTL
PELPVQYADY ALWQRQWMEA GERDRQLAYW TAQLGGEQPV LELPGDRPRP AVQSHAGARL
AVELDGELAR SLRELARREG VTLFMLLLAS FQSLLHRYSG QDDIRVGVPI ANRTRAETEG
LIGFFVNTQV LKAEFELQTT FVELLRQVKR TALEAQAHQD LPFEQLVEAL QPERSLSHSP
LFQVMYNHQT AVKGAARSLP GLSVEGLAWE NRTTQFDLTL DSYEGPDSLG ASLTYATDLF
DASTVKRLAG HWRNLLASIV RQPERRIGEL PLLTPEEYRQ IVHAWNRTEA RYPSERGVHQ
LIEEQVARTP EVVALVFGEQ EMSYRELNRR ANRLAHRLIE LGVGPDVLVG VAVERGFEMV
VGLLAILKAG GAYVPLDPEY PRERLAYMIE DSGIGLLLTQ RHLQDRLPSA DGVQSLFLEP
GDDWLENYPP ENPANRTAPQ NLAYVIYTSG STGRPKGAGN THTALINRLH WMQKAYQLDT
TDTVLQKTPF SFDVSVWELF WPLLNGARLA IARPGEHRDP ERLIDTIERH GVTTLHFVPS
MLQAFISVEH IEGCRSIRRL VCSGEALPAE LARKTLERMP AVGLFNLYGP TEAAIDVTHW
TCDHVDPEGV PIGQPIDNLK THILEESLHP VAPRCCGELY LGGVGLARGY HNRPGLTAER
FIPDPFDTSE QGGGRLYRTG DLARYRAGGV IEYLGRIDHQ VKIRGFRIEL GEIEARLRQH
GAVREAVVID VEGAGGRQLA AYLVPDDPAM LDGDERQGAL RGELKDHLRA ALPDYMVPAH
LVFLARLPLT PNGKLDRKAL PRPNASLLQR AYVAPASELE QRIAAIWAEV LKVERVGLTD
NFFELGGDSI VSIQVVSRAR QAGIAFTPKE LFQHQTVQGL TTVAQRTGGL CIDQGAVTGT
MPLTSIQRMF FEEEIPERHH WNQSVLLEPN ERLVVEPLEA ALQRVVEHHD ALRLRFIQQE
GRWSAGFRDR EEAELLWQSQ VSDIGELEAV CNEAQASLDL EHGPLLRAVL ASLPDGRQRL
LLVIHHLVVD GISWRVLLED LQTAYRQAVQ SQPLHLPAKS SSFKAWSERL EQYAAEATLA
EELGYWQRQL QEATDELPCD HPQGGHQRKQ AAFATTHLDR DWTRRLLQEA PAAYRTQVND
LLLAALARVI CRWSGQASTL VRLEGHGRED LFDDLDITRT VGWFTSLFPV KLTPRSEIAD
SIKTVKEQLR AVPNKGIGYG LLRYLGSETA RQTLQRLPRG EIVFNYLGQF DQSFEAEAAL
FAPAGESSGQ GRSEAAPLDA LLSLDGQVYG GELRLTWTFS RERFAEATIQ RLADAYAREL
EALVGHCADE NNRGMTPSDV PLAGLSQEQL DALPVPAGEI EDIYPLSPMQ QGMLFHSLLE
GEAGHYINQM RVDVEGLDVA RFRAAWQAVV DRHEVLRASF VEVDGRPLQV IRRQVSMPCV
ELDWRSQPQL QDSLDTWAQT DRQRGFDLER EPLLRLAVIR TGENRHHLIY TNHHILLDGW
SGSQLQGEVL QAYTGKPIGH PGLRYRDYIA WLGRQDRAAS EAFWREQLGA LEEPTRLAQA
IKIDEAEKRA GHGDHQQVFD PQQTRCLSEF ARAQRVTVNT LVQAAWLLLL QRYSGQATVC
FGATVAGRPA ELQGVEGQLG LFINTLPVIA SPRAEQRVGE WLGQVQAQNL GLREHEHTPL
YEIQRWAGLG GEALFDSLLV FENYPIAEAL QQGAPEGLVF ERLAVREQTN YPLTLAIGLG
ETLTVRYGYD RGHFDAAGIE RIAGHFARLL QGLTSDARAA IGELPLLAPE EYRQIVHDWN
RTEASYPSER SVHQLIEDQV ARTPEAVALV FGEQEMSYGE LNRRANRLAH RLIELGVGPD
VLVGIAVERG VEMVVGLLAI LKAGGAYVPL DPEYPRERLA YMIGDSGVGL LLTHASLLEH
LPREHHDKAL LLDRLSLEGY PPENPVNRTM PQNLAYVIYT SGSTGQPKGA AVRIGSFVNL
LHWYRAACEL TADDRVLLLS SYSFDLTQKN LYGVLCAGGQ LHIAPAGYDP DSHRRQIGKH
RLSVLNCAPS AFYPLLQGDR AELASLKHVL LGGEAIQPGE LAEWLESPQA ANVSIHNTYG
PTECTDVVIA RATPGSAVPG LSALPIGRPL PGVSAYVLDG SAGPVALGQA GELHIGGDCV
GEGYWHRPSL TAERFVPDPF DDSAQGGGRL YRTGDLARYR ADGVIEYLGR IDHQVKIRGF
RIELGEIEAR LQQHGAVREA VVIDIDGAGG KQLAAYLVPN DLAILDGDER QGALRIELKD
HLRTTLPDYM VPAHLVFLAR LPLTPNGKLD RKALPRPDAS LLQRAYVAPA SELEQRIAAI
WAEVLKVERV GLTDNFFELG GHSLLATQVI SRVRQSLGIE LPLRALFEAQ DLAGFAGRVG
LGQVSQAPAL EKADRDQPLV ASYAQQRQWF LWQLEPGSAA YHIPAALRLK GALDIEALRR
SFEALIRRHE SLRTIFRQDG ERTIQVIPPS GSLWFEQEPL PADAAIGLDE RIRVQVEAEV
QRLFDLEQGP LLRVKLLRLD EDDHVLVLTL HHIVSDGWST PLMVDELVRL YEGYSQGHEV
RLPELPVQYA DYALWQRQWM EAGERDRQLA YWTAQLGGEQ PVLELPGDRP RPAVQTYAGA
RLAVELDGEL ARSLRELARR EGVTLFMLLL ASFQSLLHRY SGQDDIRVGV PIANRTRAET
EGLIGFFVNT QVLKAEFGLQ TTFVELLRQV KRTALEAQAH QDLPFEQLVE ALQPERSLSH
SPLFQVMYNH QTAVKGEART LPGLRVEGLA WENRTTQFDL TLDSYESADS LGASLTYATD
LFDERTIERL ARHWLNLLAG IVRQPERRIS ELALLDPEEY RRIVHAWNQT EARYPSERGV
HQLIEEQVAR TPEAVALVFG DEALTYGELN RRANRLAHRL IELGVGPDVL VGIAVERGFE
MVVGLLAILK AGGAYVPLDP EYPRERLAYM IEDSGIDLLL TQEHLADQLP AASVNIWRLD
SDWSELNGFP ASNPDLPLHP EHLAYCIYTS GSTGRPKGVA VRHQALTNFM ASMASQPGLD
ANDRMLVLTS LSFDIAALEL YLPLLVGGTV VLLFNHQNRD AQALLEVIDR QSVSVVQATP
STWRMLLDTA SPGALRDCKL LSGGEALSPD LTERLLRQAG HVWNLYGPTE TTIWSGLYHI
DAEHPSPWLG RPIANTTLHI LEKSFAPVPE RVAGELLIGG DGLARGYLHR PDLTAERFIP
DPFDDSEQGG GRLYRTGDLA RYRADGVIEY LGRIDHQVKI RGFRIELGEI EARLQQHGAV
HEAVVIDIDG AGGRQLAAYL VPDDLAMLDG DERQTGLRRE LKAHLGAALP DYMVPAHLVF
LERLPLTPNG KLDRKALPRP DASLLQQQYV EPQTELEQRI AAIWAEVLKV ERVGLTDNFF
ELGGHSLLAT QAISSINVQL GIDLPLRLIF EKPILNEFSM TLENHGLSLS EADLSDIEKL
MNEMAEV
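
The sequences above are printed in space-separated blocks of ten residues, so they need light cleanup before use in most tools. The following is a minimal sketch of that cleanup, with a simple GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the source database. The demo uses only the first and last lines of the gene sequence listing.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listing above, as a small demo.
first_line = "ATGAACGCCG AAGACGCACA GAAACTTGCC CGCCGCTTCA TCGAATTGCC ACAGGAAAAA"
last_line = "ATGAATGAAA TGGCAGAGGT TTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print("bases in first line:", len(head))
print("starts with ATG start codon:", head.startswith("ATG"))
print("ends with TGA stop codon:", tail.endswith("TGA"))
print("GC fraction of first line:", round(gc_fraction(head), 2))
```

Applying `clean_sequence` to the full gene listing and then `gc_fraction` to the result should give a value comparable to the GC content field, assuming that field is computed over the same sequence.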