Gene Avin_25570 details


Gene Information

Locus tag: Avin_25570
Symbol:
ID: 7761469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2572437
End bp: 2588078
Gene Length: 15642 bp
Protein Length: 5213 aa
Translation table: 11
GC content: 66%
IMG OID: 643805439
Product: peptide synthase
Protein accession: YP_002799712
Protein GI: 226944639
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
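
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions (the record does not state its coordinate convention) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2572437
END_BP = 2588078
GENE_LENGTH_BP = 15642
PROTEIN_LENGTH_AA = 5213


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the span length matches the listed Gene Length, and the listed Protein Length equals the codon count minus the stop codon.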


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAGGA CAACAGCGGA ACGTATCGCC AAGCGCTTCA TCAGCCTGCC GCTGGAGCAG 
CGTCGCCAGA TTCTCGACAA GATGATCGAA ACCGGGCAGA GCTTCAGGCT CCTGCCCATC
GTGCCCATCC GGCATGAGAT CGGGCGCCTT CCCCTTTCCT ATGCCCAGCA GCGCCTGCTT
TTCCTCTGGC AACTGGAGCC GGACAATTCC TTCTACAACG TGCCCATGGC TGTCCGTCTG
CACGGCCGGT TGGATGAGCA GGCCTTGCAT CGTGCCTTGA CGCTTCTGGT TCAGCGACAC
GAGAGCCTGC GTACTCGCTT CGTCTCGGCG GATGGCGAGT TCCATCAGGA GATCCTTGAG
GATTCGCTGG TCGCGCTGGA GATCGTTTCC GTGGCAGGGC AGGACGAGAC CTCGTTGAAG
GCCCGCATCC GGGACGAATT CGCCCAGCCC TTCGACCTGC TCAATGGTCC CATGTTACGG
GTCAAGTTAC TGCGCCTGAG CGATACCGAC CAAGTACTGA CCTTGTGTCT GCACCATATC
GTTTCCGATG GCTGGTCGGG CGAATTGATG GTGAAAGAGT TCGTGCAGCT TTACGAGGGG
TTGCTGGAAG AGCGCACTGT CGAGTTACCG GAACTGCCGA TCCAGTATGC GGATTATGCG
ATCTGGCAAC GGGCCTGGCT GGAGGCCGGG GAGGGCGAGC GGCAACTGGC CTACTGGAAG
GCGCAACTGG GTGAGGAGCA ACCGGTACTG GAGCTACCCT TGGATCGTGA GCGGCCGGCC
AGCCCCAGTT ATCGTGGCGC GCTCGTCCAG GTCGAGATCC CCAAAGGGCT GGCCCTGGAG
TTGCGAATAC TGGCCCGCAA GGGGGGCCAC ACGCTGTTCA TGCTGGTTCT GGCGGCAGTA
GCCGTGGTGC TGTCACGCTA CAGCGGTCAG TCGGACATCC GTATCGGTGC GCCCAACGCC
GGTCGTAACC GCAAGGAGTT GGAAGGGCTG ATCGGCTTTT TCATCAACAC CCAGGTATTG
CGCGTCCAGG TAGATGAGCG CGCCACCTTC GCCGAAATGA TCGATCAGGT GAAAGACGTG
GTCGCCGGTG CCCAGTCGCA TCAGGACCTG CCGTTCGAGC AACTGGTGGA TGCCCTGGTG
CCGGAGCGGA ACCTGAGCCA CAACCCCCTG TTCCAGTTCA AGCTCAATCA GAATGTCGCC
GGCGAAGCCA GGAAGCGCAA GCAACTGGGC GGATTGGAGG TGGAGGACTT TCCATTCGAC
GATGGAAGCG CACATTTCGA CCTCGCTCTC GACTTCACCG ACACACCGAC AGGAATCGAT
GCGTCTTTCA CCTATGCGAC GGACCTGTTC GACTCCGCCA CTATCGAACG CATAGCAGCC
TCCTTGCGCA GGACTCTGGA AGCCCTGGTC CGGATACCGC ATGCCCGACT ACTGGAATGC
CCCGAAAGTA CCGCTCTATC TCAGGAGCAA GACAACCGGG TTTTCCCGCA CGCCGATATG
CTCTCGCTCT GGCAGCACGG CCTGCAACTC GGCGGGGAAG GGGCCGCATT GCGCTGTGGC
GAGCAGACCC TCGGCTATAT GGATCTGGAG CGTGCCTCCA ATCGCCTGGC GCGCCACCTC
CAGGCTCTGG GAGTAGGTAG CGGAACGACC GTGGCGCTCT GCCAGGAGCG CTCCGCCGGC
TGGGTGACCG CCGTGCTGGC AGTGCTCAAG GCCGGCGGCC TGTATCTGCC GCTGGACAGC
CAGCAGCCGG CCGACCGCCT GCAACAATTA CTGGACGACA GCAGGGCCGC CCTGCTGATC
CATGATCGCC ATGATGGACG TTTCACCGAT CTCCCGGGCC TGGATGTTCT TGCCTACGAC
CCCACCCTCT GGAGCGGATA CAGCGACGAG CCGCTGTCGA CGTGCATCGT CCCCGAACAA
CCGGCCTATG TCATCTATAC CTCTGGCTCC ACTGGTCAGC CCAAGGGGGT GGTAATCAGC
CACCGGGCCT TGGCCAACTA CGTGCAGGCT GCGCTCGACC GGTTGCAACT CCCCCTTGGG
GCCAGCATGG CCATGGTTTC CACCGTTGCC GCCGACCTGG GCCATACGAT GTTGTTCGGC
GCCCTGGCAT CAGGCAGGCC ACTTCATCTG CTGCCCCAGG AGCTCGCCTT CGATCCCGAC
GGTTTCGCCG CTTATATGGC CACGCATCGG GTTGGCGTAC TCAAGCTGGT ACCGAGCCAT
CTTCAAGGCC TGTTGCAGGC CGCCAGGCCT GCGGATGTAC TGCCCGAGCA GGCCCTGATC
CTGGGCGGCG AGGCCTGCCC ATGGGCTCTG GTCGAGCGGG TCGAGCAGCT CAAGCCCACC
TGTCGGATCA TCAACCACTA CGGCCCGACC GAAACCACGG TGGGCATCCT CACCCATGAG
GCAAGGCAGC GGTCAGAGGG TGGCCGCAGC GTTCCGGTCG GTCGGCCACT GGCCAACGGC
CGAGCCGAGA TTCTGGATGC CTATCTCAAC CCCGTTCCCT TGCAGATTTC CGGCGAACTC
TACCTGGGTG GGCAAGGCTT GGCGCAGGGC TACCTGGGCC GACCGGCCCT GACGGCGGAA
CGTTTCGTGC CTGCGGAGCA GGGGGAACGA CGTTATCGAA CCGGCGATCG GGCACGCCAG
GGAAGTGATG GCCTGGTTGA ATTCGTCGGC CGTGCCGACG ATCAGGTGAA GATTCGTGGC
TACCGGGTCG AGCCGGGCGA GATAACACAA ATCCTGCAGA ACCTTGAGGG TGTGAAGGAG
GCCGTGGTTC AGGCCCTGCC ACTGGAAAGC GACGCCTCCC GGCTGCAACT GGTGGCCTAT
TGCGTGGCGG AGGCGGGCGT AACGGTGTCC GTGCTGCAAC AGGGGCTGCA GGCGCGTCTG
CCGGATTACA TGGTCCCCGC CCATATCCTC TTGCTGGAGC GCCTACCGCT GACGGCCAAC
GGCAAGCTGG ACAGGCGCGC CCTGCCCAAG CCCGGTATTG TGGCCCAGGG GTACGTAGCG
CCGGTGGGCG AGATCGAGGA AAAGCTGGCA GCGATCTGGA CCGAGGTGCT CAAGCTGGAG
CGGGTTGGCA GCCATGACAA TTTCTTCGAG CTGGGCGGCG ACTCGATTCT CAGCCTGCAG
ATCATCGCCC GGGCCAAACG CCAGGGCATC AAGCTGACGC CCAAGCAACT GTTCGAGAAG
CAGACCATCG GTCAACTGGC ACAGGTGGCC AAGCGGGTCG AAGACAAGAA GCAGGCCGCT
ACCGAGTCGG CAGAAAGAGT CGCCGGGCAG ATGCCGTTGT TGCCGATCCA GTCGCGTTTC
TTCGAACTGG ATATTCCGCA ACGCCATCAT TGGAACCAGT CGGTGCTGCT CAGGCCGAAC
GAGCCGCTCG ATCCGGAGTG CCTGAAAGCA GCCTTGAAGG TTCTGGTCGA ACACCACGAT
GCGCTACGCC TGCGTTTCAC CGGACATGAT GGGAAATGGA GTGCCCGATT CCAGGATAGG
GAAGAAGCGG AACTGCTGTG GCAGAAGGAC CTGCTGGATA TCAGCGAACT GGAAGCATCG
GGCAATGAAG CGCAAGCCAG TCTGAGCCTG GATCAAGGCC CCCTGCTGCG AGCGGTGCTG
GTGAACCTGC CCGATAACCG ACAGCGACTG TTGTTGGTCA TCCATCACCT GGTGGTGGAT
GGCGTGTCCT GGCGAGTTCT GCTGGAAGAC CTGCAAACCA CTTATCGGCA AGCCGTTCAG
GGTCAGCCGT TGCGACTGCC GGCCAGGACC AGCTCGCTGA AACACTGGGC CGAACGGTTA
CATGCCTATG CGTCGAGCGA GGCTCTGCAG GCCGAGCAGG AATATTGGCT GCGAAGCCTG
GGCGAGGCGG CGCAGGAATT GCCACGGGAC AATCCCGATG GCGATGAAAG CGGACGCCAA
TCCCGCTCGG CCAGTACCCG TCTCGATGCG GAACTGACGC AGAAACTACT GAGACTGGCC
CCGGCCGCCT ACCGTACCCA GGTCAACGAC CTACTGCTGG CGGCCCTGGC ACGAGTGCTC
TGCCTGTGGA GCGGACAGGA CTCCGTACTG ATCCAGTTGG AGGGTCACGG CCGTGAAGAC
CTGTTCGAAG ACATCGACCT TACCCGGACG GTGGGCTGGT TCACCAGCTT GTTCCCGGTC
CGACTGACGC CCCGTGACGA TTGGGGCACG TCGATCAAGG GCATCAAGGA ACAACTGCGA
TCGGTGCCGA ACAAAGGCAT CGGCTATGGA ATCCTGCGCT ATCTGGGGAG CGAAGATATC
CGGCAAAGGT TGAGCCTGCT ACCCGAAGCA CGGGTGACTT TCAACTACCT GGGCCAGTTC
GATGGCAGCT TCGCCCAGGA TGAGGGCGCC CTGTTCGAAC CGGCCAGCGA AAGCGCCGGA
CAAGCCCGCA GCGAAGAAGC GCCGTTGGGT AACTGGTTGT CCATCAATGG TCAGGTCTAC
GATGGAGAAC TGCATCTGGA ATGGACCTTC AGCCAGGATG TCTATCGGCC GGATGGCATC
GAACGATTGG CTCGCGCCTA CGAACAGGCC CTGGCAGACA TCGTCGCGTA CTGTGCGGAT
GAGAACAATC GCAGCGTGAC CCCCTCGGAT GTCCCGCTGG CCGGATTGAG CCAGGAGCAA
CTGGATGCGC TGCCATTACC CGCAGGGGAA ATCGAGGATA TCTACCCGCT GTCGCCGATG
CAGCAGGGCA TGCTGTTCCA CAGCCTGCTG GAGCACGAGG CGGGCCACTA CATCAACCAG
ATGCGGGTGG ACGTGCAAGG GCTGGACGTC GAACGCTTCA GGGCCGCCTG GCAGGCGGTG
GTCGACCGGC ACGAGGTGCT CAGGGCTGGC TTCGTCGAGG TGGACGGACG TCCGCTGCAA
GTGATCCGCA GACGGATGTC GATGCCCTGC GTCGAACTGG ACTGGCGTGG CCAGCCGCAA
CTGCAGGACA GCCTGGATAC CTGGGCGCAG GCGGACCGGC AACGGGGCTT CGACCTGGAG
CGCGAGCCGC TGCTGCGGCT GGCGGTGATC CGCACCGGGG AGAACCGCCA CCACCTGATC
TACACCAACC ACCACATCCT GCTGGACGGC TGGAGCGGCT CGCAACTGCA GGGCGAGGTG
CTGCAGGCCT ATACGGGCAA ACCCATCGGG CACCCGGGCC TCCGCTACCG CGACTACATC
GCCTGGCTGG GCCGGCAGGA CCGGGCCGCC AGCGAAGCCT TCTGGCGGGA GCAACTCGGC
GCCCTGGAGG AGCCGACCCG GCTGGCGGAT TCGATCAGGC GAGCCGATGG CCGAACGGGG
GAAGGCTATG GGGATCACGT CCAGCTCCTC GACAGCCGGC AGACCGCGGC GCTGAGCGAG
TTCGCCCGGG CCCAGCGGGT GACGGTCAAT ACCCTGGTGC AGGCGGCCTG GCTGCTGTTG
CTGCAGCGCT ACAGCGGGCA GGCGACGGTG TGCTTCGGTG CCACCGTGGC GGGACGTCCG
GCCGAGCTGC AGGGCGTGGA AGGGCAACTG GGCCTGTTCA TCAACACCCT GCCGGTGATC
GCCAGCCCTA GGTCGGAACA GCACGTGGGC GAATGGCTCG ACCAGGTCCA GGCGCAGAAC
CTGGGCCTGC GCGAACACGA ACACACGCCG CTGTACGAGA TCCAGCGCTG GGCCGGGCTG
GGCGGCGAGG CGCTGTTCGA CAGCCTGCTG GTGTTCGAGA ACTACCCGAT CGCCGAGGCG
CTGCAGCAGG GCGCGCCGAA GGGGCTGGTG TTCGAACGGA TAGCCGTCCA GGAACAGACC
AACTACCCGC TGACCCTGGC CATCGGCCTG GGCGAGACGC TGACGGTGCG CTACGGCTAC
GACCGCGGGC ATTTCGACGC GGCGGGCATC GAGCGGATCG CCGGGCATTT CGCCCGGCTG
CTGCAAGGCC TGGCCAGCGA TGCCCGGGCC GCCATCGGCG AACTGCCGCT GCTGGACCCG
GAGGAATACC AGCGGATCGT CCGCGACTGG AACCGGACCG AGGCCCGCTA CCCGAGCGAG
CACGGCGTGC ACCAGTTGAT CGAGGAGCAG GTGGCGAGGA CCCCGGAGGC GGTGGCCCTG
GTGTTCGGCG AGCGGGAGAT GTCCTACGGG GAGCTGAACC GCAAGGCCAA CCGGCTGGCA
CACCGGCTGC GCGAACTGGG CGTGGGTCCG GACGTGCTGG TGGGCATCGC GGTGGAGCGG
GGCTTCGAGA TGGTGGTGGG GCTGCTGGCG ATCCTCAAGG CCGGCGGGGC CTATGTGCCG
CTGGACCCGG AGTACCCAGG GGAGCGGCTG GCGTACATGA TCGGGGACAG CGGCATCGGC
CTGCTGCTGA CCCAGCGGCA CCTGCAGGAC CGACTGCCGC CAACCGGCGG GGTGCGAAAC
CTGCTCCTAG AGCCGGACGA CGATTGGCTG GAGGACTATC CGCAAGAAAA CCCGGCCAAC
CGGACGGCGC CGCAAAACCT GGCCTACGTA ATCTACACCT CCGGATCGAC CGGCAGGCCG
AAAGGGGCGG GCAATACGCA TACGGCCTTG ATCAACCGCC TGCACTGGAT GCAGAAAGCC
TACCGGCTCG ATACAACGGA TACGGTCCTG CAAAAGACGC CGTTCAGCTT TGATGTTTCG
GTATGGGAGT TGTTCTGGCC GCTATTGAAT GGGGCGCGGT TGGCGATCGC ACGGCCCGGC
GAGCACCGCG ATCCAGAGCG CCTGATCGAC ACGATCGAGC GTCATGGGGT AACGACCCTG
CATTTCGTGC CCTCGATGCT CCAGGCTTTC ATCTCCGTCG AACACATCGA AGGCTGCCGG
AGCATCCGCC GGCTCGTGTG CAGTGGCGAG GCACTCCCAG CGGAGCTTGC TCGCAAGACC
CTGGAAAGAA TGCCCACGGT TGGCCTGTTC AATCTGTATG GTCCGACCGA AGCGGCTATC
GATGTGACCC ATTGGACTTG TGATCATGTC GATCCCGAGG GCGTGCCGAT CGGCCAACCG
ATCGATAACC TGAAGACCCA CATACTGGAG GAGAGCCTTC ATCCAGTAGC ACCGCGTTGT
TGCGGCGAGC TGTATCTGGG GGGAGTGGGG CTTGCCCGTG GTTACCACAA CCGCCCGGGC
TTGACCGCCG AACGCTTCGT CCCCGACCCG TTCGATAGCA GCGAACAGGG CGGCGGTCGC
CTGTACCGTA CCGGCGACCT GGCCCGCTAC CGGGCCGACG GGGTGATCGA GTACCAGGGC
CGTATCGACC ACCAGGTGAA GATCCGCGGC TTCCGCATCG AGCTGGGCGA GATCGAGGCC
CGCCTGCAGC AACACGAGGC GGTACGCGAG GCGGTGGTGA TCGACATCGA CGGGCCGGGC
GGCAGGCAAC TGGCCGCCTA CCTGGTGCCC GACGACCTGG CGATGCTGGA CGGCGACGAA
CGCCAAACCG GACTGCGCGG CGAATTGAAG GCGTACCTCG GGGCAGCACT GCCGGACTAC
ATGGTGCCGG CGCACCTGGT GTTCCTCGCG CGGCTGCCGC TGACCCCCAA CGGCAAGCTG
GACCGCAAGG CGCTGCCCCG GCCGGACGTC AGCCTGCTGC AGCGGGCCTA CGTGGCCCCG
GCGAGCGAAC TGGAGCAGCG GATGGCGGCG ATCTGGGCCG AGGTGCTCAA GGTCGAGCGG
GTAGGGCTGA CGGACAACTT CTTCGAACTG GGCGGCCACT CCCTGCTGGC CACCCAGGTG
ATTTCCCGTG TACGCCAGAG CCTGGGCATC GAGCTGCCCC TGCGCGCCCT GTTCGAAGCG
CAGGATCTGG CCAGTTTCGC CGGGCGGGTC GGTCAGGGCC AGACCAGCCG GGCGCCCGCC
CTTGAAAAGG CCGACCGCGG CCAGCCCCTG GTCGCCTCCT ACGCTCAGCA ACGCCAGTGG
TTCCTCTGGC AACTGGAGCC CGGGAGCGCC GCCTACCACA TCCCGGCGGC GCTACGCCTG
AAAGGCGCCC TGGACATCGA GGCCCTGCGG CGCAGCTTCG AGGCCCTGAT TCGGCGTCAT
GAGTCCCTGC GCACCATCTT CCGCCAGGAC GGTGAGCGGA CGATCCAGGT CATCCCTCCC
AGCGGTAGCC TCTGGTTCGA GCAGGAGCCG CTGCCGGCGG ATGCGGCCAT CGGCCTGGAC
GAGCGGATCC GGGCCCGGGT GGAAGCCGAG GTCCAACGCC TGTTCGACCT GGAGCGGGGC
CCGCTGCTGC GGGTGAAGCT GCTGCGCCTG GACGAGGACG ACCACGTGCT CGTGCTGACC
CTGCACCACA TCGTCTCGGA CGGCTGGTCC AGCCCACTCA TGGTGGACGA GCTGGTCCAC
CTGTACGAGG GCTACAGCCA GGGCCGCGAG GTGACGTTGC CGGAGTTGCC GGTCCAGTAC
GCCGACTACG CCCTCTGGCA GCGCCAGTGG ATGGAGACCG GCGAACGGGA TCGGCAACTG
GACTACTGGA CCGCACAACT GGGCGGCGAA CAGCCGGTGC TGGAGCTGCC CGGCGACCGG
CCGCGGCCGG CCGTGCAGAC CCATGCCGGA GCCCGGCTGG CCGTCGAACT CGACGGCGAA
CTGGCTCGCT CCCTGCGGGA ACTGGCCCGG CGGGAGGGCG TGACCCTGTT CATGCTGCTG
CTGGCCAGCT TCCAGAGCCT GCTGCACCGC TACAGCGGGC AGGACGACAT CCGGGTGGGC
GTGCCGATCG CCAACCGCAC CCGGGCGGAA ACGGAAGGGC TGATCGGCTT CTTCGTCAAC
ACCCAGGTGC TGAAGGCCGA ATTCGAGCTG CAGACCACCT TCGTCGAGCT GCTGCGCCAG
GTGAAACGGA CCGCCCTGGA GGCCCAGGCG CACCAGGACC TGCCGTTCGA GCAACTGGTG
GAGGCCCTGC AGCCGGAGCG CAGCCTGAGC CACAGCCCGT TGTTCCAGGT GATGTACAAC
CACCAGACGG CGGCGAAGGG CGCGGCGCGG ACCCTGCCCG GACTCCGCGT GGAAGGGCTG
GCATGGGAGA ATCGTACCAC CCATTTCGAC CTGACGCTGG ACACCTATGA AAGTGCGAAC
AGCCTGAGCG CCTCGCTGAC CTATGCCACC GACCTGTTCG ATGAACGGAC TATCGAACGG
CTTGCCCGGC ACTGGTTGAA CCTGCTGGCC GGCATCGTCC GCCAGCCGGA ACGGCGCATC
GGCGAACTGG CGCTGCTGGA CCCGGAGGAA TACCGGCGGA TCGTCCAGGC GTGGAACCGG
ACCGAGGCCC GCTACCCGAG CGAGCGCGGC GTGCACCAGT TGATCGAGGA TCAGGTGGCG
AGGACACCGG AGGCGGTGGC CCTGGTGTTC GGCGAACAGG AGATGTCCTA CGGGGAGCTG
AACGGGAGAG CCAACCGCCT GGCGCACCGG CTGATCGAAC TGGGCGTGGG CCCGGACGTG
CTGGTGGGCA TCGCGGTGGA GCGGGGCTTC GAGATGGTGG TGGGGCTGCT GGCAATCCTC
AAGGCCGGTG GGGCCTATGT GCCGCTGGAC CCGGAGTACC CGCGGGAGCG GCTGGCGTAC
ATGATCGGGG ACAGCGGTAT CGATCTGCTG CTGACCCAGG AACACCTGCA AGACCGGCTA
CCGTCAACCG ACGGGGTGCA AAACCTGCTC CTGGAGCCGG GCGACGACTG GCTGGAGGGC
TATGGCGAAG AGAACCCGGC CAGCCGGACG ATGCCGCAGA ACCTGGCCTA CGTGATCTAC
ACCTCAGGCT CCACCGGGCA GCCCAAGGGG GTCACCATTA GCCATGGAGC CTTCTCCATG
CATAGCCAAG CTGTAGGCCA ATGTTATGGG CTGACGGTAA ATGATCGTTT GCTGCAATTT
GCTTCGATCA GCTTCGATGC GGCTGCCGAG CAGTTGTTCA CACCGCTTGC CAACGGGGCC
GCAGTCGTAC TGGGAGATGT CAGGCAATGG TCGGCTGTGC GTTTAGCCGA GGAGGTGGAG
CGTAGTGGCA TCACTGCTCT CAACGTGCCA CCGGCTTACA TAGATCAGAT CTCGGATGCT
TTAGAGGAGG CGCATCGTCA TATCGATGTG CGAATCTGCA TCCTTGGAGG AGAGGCCTGG
AAGGCAGGAT TGCTGGGGAA GGCCGTACGT GCCGGACAGG TTTTCAATGC CTATGGGCCG
ACCGAGACCG TTATCACGCC TCTGGTCTGG CAGGTTGAAT CCGATGAGTT CGTCGGCTAT
GCGCCCATTG GCAAGCCGGT AGGGCAGCGC CAAGCCTATC TGCTCGATGA CAGTCTGAAT
CCTGAACCTC AGGGCAATAT CGCTGAGCTT TATCTGGGAG GAGAAGGGCT CGCGCGCGGG
TATCTGAACC GTCCCAGCTT GACCGCCGAA CGCTTCGTCC CCGACCCGTT CGACGACAGC
GAGCAGGGTG GCGGGCGCCT GTACCGCACC GGCGACCTGG CCCGCTACCG GGCCGACGGG
GTGATCGAGT ACCTGGGCCG CCTCGACCAC CAGGTGAAGA TCCGCGGTTT CCGCATCGAG
CTGGGCGAGA TCGAGGCCCG GCTGCAGCAG CACGAGGCGG TGCGCGAGGC GGTGGTGATC
GACGTCGAGG GGCCGGGCGG CAGGCAACTG GCCGCCTACC TGGTACCCGA CGACTCGGCG
ATGCTGGAGG GCGACGAACG CCAAACCAAA CTGCGCCGCG AATTGAAGGC ACACCTCGGG
GCGGCGCTGC CGGACTACAT GGTGCCGGCG CACCTGGTGT TCCTCGAACG GCTGCCGCTG
ACGCCCAACG GCAAGCTGGA CCGCAAGGCG CTGCCGCGGC CGGATGCCAG TCTGCTGCAG
CAGGCCTACG TGGCCCCGGC AAGCGAACTG GAACAGCGGA TCGCGGCGAT CTGGGCCGAG
GTGCTCAAGG TCGAGCGGGT GGGCCTGACC GACAACTTCT TCGAGCTCGG CGGCGACTCG
ATCATTTCCC TGCAGGTGGT CAGCCGGGCC CGGCAAGCCG GTATCCGCTT CACGCCCAAG
GACCTGTTCC AGCACCAGAC GGTGCAGGGA CTGGCTACGG TGACCCGTCT AGGCGACGAA
GGCGGTGTGC ATGTTGACCA GGGTCCGCTC AGCGGAGAAA CCGCGCTGCT GCCGATCCAG
CAGTACTTCT TCGAAGAGGC CATCCCCGAG CGGCACCACT GGAACCAGTC GCTGCTACTC
AAGCCAGGCA GGGTCCTGGA CGGTGCCCGG CTCGAACAGG CCGTGCAGGC CCTGATCGAC
CATCACGATG CCTTGCGTCT GGCGTTCGTC GAAGACCCGG CGGGCCATTG GAGCGCCCGC
TACCGTCCGG CCAGCGAACG GCAGTCCGTG CTCTGGCAGG CCGAACTGCG CTCCGCGGAG
GAACTGGAAC GGCTGGGCAA CGAAGCCCAG CGCAGTCTGA ACCTGCAGGA AGGTCCCCTG
TTGCGGGGGG TACTGGCCGA ACTCGCCGAC GGCAGCCAAC GCCTGTTGCT GGCGATCCAT
CACCTGGTGG TGGACGGCGT GTCCTGGCGG ATCCTGCTGG AAGATCTGCA GACGGCCTAC
GGGCAATTGG AGCAGGGGCG GCCGGTCAGC CTGCCAGGCA AGACCAGCTC GACCAGGGCC
TGGGTGGAGC ACCTGCAAGG CTATGCGAAC AGCGAAGCCG GGCAACGGGA ACTGGCCCAC
TGGCGGGAGC AGCTCGAAGA TGTCGCCGGC GAACTGCCCT GCGACAACCC CGCTGGCGGC
CAGCAGAACC GACATGCGGC GCATGCGACC ACCCATCTGG CGCGCGACTG GACACGCCGC
CTGCTGCAGG AGGCGCCGGC CGCCTACCGC ACCCAGGTCA ACGACCTGCT GCTGACCGCC
CTGGCCCGGG TACTCACCCG CTGGACGGGC CAGGCCGCCG CCCTGGTGCA ACTGGAAGGC
CACGGACGCG AGGATCTGTT CGACGATGTC GACCTGACCC GTACGGTGGG CTGGTTCACC
AGTGTATATC CGGCGAAGCT GTGTCCGTCC GAAACGCTGG ACGGCTCGAT CAAGGCGATC
AAGGAGCAGT TGCGGGCCAT CCCGAACAAG GGCATCGGCT TCGGTATCCT GCGTTACCTG
GGCGACGAGC CGACCCGGCA ATACCTGGCC GGATTGCCGG TGCCGCGCAT CACCTTCAAC
TACCTGGGCC AGTTCGACGG CAGCTTCGAC CGAAGTGAGG AACATGCCCT GCTGGTCCCC
GCGGGCGAAA GCGCCGGCGC CGAGCAAAGC CCGGATGCCC CGCTGGGCAA CTGGTTGTCG
ATCAACGGGC AGGTCTATGG CGGCGAACTG AGGCTGACCT GGACCTTCAG CCGGGAACGC
TTCGCCGAGG CGACGATCCA GCGCCTGGCC GACGCCTACG CCCGGGAACT CGAGGCCCTG
GTCGAGCACT GCGGCCAAGC GGAACACCGC GGCGTGACGC CCTCGGACGT CCCGCTGGCC
GGCTTGAGCC AGGAGCAACT GGATACTACG CTGCCGGTCC CCGCTGGGGA GATCGAGGAT
ATCTACCCGC TGTCGCCGAT GCAGCAGGGC ATGCTGTTCC ACAGCCTGCT GGAGCACGAG
GCGGGCCACT ACATCAACCA GATGCGGGTG GACGTGCAAG GGCTGGACGT CGAACGCTTC
AGGGCCGCCT GGCAGGCGGT GGTCGACCGG CACGAGGTGC TCAGGGCTGG CTTCGTCGAG
GTGGACGGAC GTCCGCTGCA AGTGATCCGC AGACGGATGT CGATGCCCTG CGTCGAACTG
GACTGGCGTG GCCAGCCGCA ACTGCAGGAC AGCCTGGATA CCTGGGCGCA GGCGGACCGG
CAGCGGGGCT TCGACCTGGA GCGCGAGCCG CTGCTGCGGC TGGCGGTGAT CCGCACCGGG
GAGAACCGCC ACCACCTGAT CTACACCAAC CACCACATCC TGCTGGACGG CTGGAGCGGC
TCGCAACTGC AGGGCGAGGT GCTGCAGGCC TATACGGGCA AACCCATCGG GCACCCGGGC
CTCCGCTACC GCGACTACAT CGCCTGGCTG GGCCGGCAGG ACCGGGCCGC CAGCGAAGCC
TTCTGGCGGG AGCAACTCGG CGCCCTGGAG GAGCCGACCC GGCTGGCGGA CTCGATCAGA
CGAGCTGATG GTCGGACGGG GGAAGGATAC GGGGATCACG TCCAGCTCCT TGACAGCCGG
CAGACCGCGG CGCTGAGCGA GTTCGCCCGG ACCCAGCGGG TGACGGTCAA CACCCTGGTG
CAGGCGGCCT GGCTGCTGCT GCTGCAGCGT TACACTGGGC AGACGACGGT GAGCTTCGGC
GCCACCGTGG CGGGACGTCC GGCCGAGCTG CAGGGCGTGG AAGGGCAACT GGGCCTGTTC
ATCAACACCC TGCCGGTGAT CGCCAGCCCC CGGGCGGAAC AGCGCGTGGT CGAGTGGCTC
GGCCAGGTCC AGGCGCAGAA CCTGGGCCTG CGCGAACACG AGCACACGCC GCTGTACGAA
ATCCAGCGCT GGGCCGGGCT GGGCGGCGAG GCGCTGTTCG ACAGCCTGCT GGTGTTCGAG
AACTACCCGA TCGCCGAGGC GCTGCAGCAA GGCGCTCCCG AGGGCCTGGT GTTCGACCGG
GTGACCACTC AGGAACAGAC CAACTACCCG CTGACCCTGG CCATCGGCCT GGGCGAGACG
CTGACGGTGC GCTACGGCTA CGACCGCGGG CATTTCGATG CGGCGGACAT CGAACGGATC
GCGGGGCATT TCGCCCGGCT GCTGCAAGGG CTGGCTAGCG ATGCCCAGGC AGCCATCGGC
GAACTACCGC TGCTGGACCC GGAGGAATAC CAGCGGATCG TCCACGACTG GAACCGGACC
GAGGCCCGCT ACCCGAGCGA GTGCTGCGTG CACCAGTTGA TCGAAGAGCA GGTGGCGAGG
ACGCCGGAGG CGGTGGCCCT GGTGTTCGGC GAGCGGGAGA TGTCCTACGC AGAGCTGAAT
CGAAGAGCCA ACCGCCTGGC ACACCGACTG ATCGAACTGG GCGTGGGTCC GGATGTGCCG
GTGGGCATCG CGGTGGAGCG GGGTGTCGAG ATGGTGGTGG GCCTGCTGGC GATCCTCAAG
GCCGGCGGGG CCTATGTGCC GTTGGACCCG GAGTACCCGG GGGAGCGGCT GGCCTACATG
ATCGGGGACA GCGGCATCGG CTTGCTGCTA ACCCAGCGGC ACCTGCAGGA CCGGCTGCCC
TCAGCCGACG GGGCGCAGAG CCTGTTCCTG GAGCCAGACG ACGACTGGCT GGAGGGCTAT
GGCGAAGAGA ACCCAGCCAA CCGGACGATG CCGCAGAACC TGGCCTACGT GATCTACACC
TCGGGCTCCA CCGGGCGCCC CAAGGGCGCG GCGGTGCGAA TCGGCAGCTT CGTCAACCTG
CTGCACTGGT ACCGGGCCGC CTGCGAACTG ACCGCGGACG ACCGGGTGCT GCTGCTCAGC
TCCTACAGCT TCGACCTGAC CCAGAAGAAC CTGTACGGCG TGCTGTGCGC GGGAGGGCAG
TTGCACATCG CCCCGGCGGG CTACGACCCG GACAGCCACC GCCGGCAGAT CGGAAAACAC
CGGCTGAGCG TGCTCAACTG CGCTCCGAGT GCCTTCTACC CCCTGCTGCA GGGGGATCGC
GCCGAACTGG CCAGCCTGAA ACACGTCCTC TTGGGGGGCG AGGCTATCCA GCCGGGAGAA
CTGGCGGAGT GGCTGGGCTC GCCCCAGGCG GCGAACGTCT CGATCCACAA CACCTACGGT
CCGACCGAAT GCACGGACGT GGTGATCGCC CGGGCGACGC CGGGAAGCAC GGTGCCGGGG
CTGAGCGCCC TGCCGATCGG GCGGCCGCTG CCCGGGGTCA GCGCCTATGT CCTCGACGGC
TCGGCCGGGC CGGTCGCCCT CGGGCAGGCG GGGGAACTGC ATATCGGCGG GGACTGCGTG
GGCGAGGGCT ACTGGCATCG TCCGGGCCTG ACCGCCGAAC GCTTCGTCCC CGACCCGTTC
GACGACAGCG CGCAGGGCGG CGGGCGCCTG TACCGTACCG GCGACCTGGC CCGCTACCGG
GCCGACGGGG TGATCGAGTA CCTCGGCCGT ATCGACCACC AGGTGAAGAT CCGCGGCTTC
CGCATCGAGC TGGGCGAGAT CGAGGCCCGG CTGCGGCAGC ACGGGGCGGT ACGCGAAGCG
GTGGTGATCG ACGTCGAGGG AGCGGGCGGC AAACAACTGG CCGCCTACCT GGTGCCCGAC
GACCCGGCGA TGCTGGAGGA CGACGAACGC CAAACCGGAC TGCGCGGCGA ACTGAAGGCG
CACCTCGGGG CGGTGCTGCC GGACTACATG GTGCCAGTGC ACCTGGTGTT CCTCGCGCGG
CTGCCGCTGA CGCCCAACGG CAAGCTGGAC CGCAAGGCGC TGCCCCAGCT CGACGCCAGC
CTGCTGCAGC GGACCTACGT GGCCCCGGTG AGCGAACTGG AGCAGCGGAT CGCGGCGATC
TGGGCCGAGG TGCTCAAGGT CGAGCGGGTG GGCCTGAGCG ACAACTTCTT CGAGCTGGGC
GGCCACTCCC TGCTGGCCAC CCAGGTGATT GCCCGTATCC GCGAGCAGCT TGGTATCGAT
ATGGCTCTAA GGGAGTTGTT CGAATTGCCC GTTTTAGCTG AGTTCAGCCA AGGTGCTCAG
GAAAAATTCG GTCAGATCGA ACCACTTCAG GGAGAGTTGG CTAAATCTCT GGAGACCCTC
AAACGTCTTA CAGCGGAGGA AATTGATGAG CTGATTTCTT AG
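
The gene sequence above is laid out in blocks of ten bases separated by spaces. A minimal sketch of how such a listing might be joined into one string and its GC fraction computed, for comparison with the GC content field of this record, is shown here. The helper names `clean_sequence` and `gc_content` are hypothetical and are not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Join a space/newline-blocked sequence listing into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


if __name__ == "__main__":
    # First line of the gene sequence listing, copied from this record.
    first_line = (
        "ATGGACAGGA CAACAGCGGA ACGTATCGCC "
        "AAGCGCTTCA TCAGCCTGCC GCTGGAGCAG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_content(seq), 3))
```

Passing the full listing through `clean_sequence` should also allow its length to be compared with the Gene Length field.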
 
Protein sequence
MDRTTAERIA KRFISLPLEQ RRQILDKMIE TGQSFRLLPI VPIRHEIGRL PLSYAQQRLL 
FLWQLEPDNS FYNVPMAVRL HGRLDEQALH RALTLLVQRH ESLRTRFVSA DGEFHQEILE
DSLVALEIVS VAGQDETSLK ARIRDEFAQP FDLLNGPMLR VKLLRLSDTD QVLTLCLHHI
VSDGWSGELM VKEFVQLYEG LLEERTVELP ELPIQYADYA IWQRAWLEAG EGERQLAYWK
AQLGEEQPVL ELPLDRERPA SPSYRGALVQ VEIPKGLALE LRILARKGGH TLFMLVLAAV
AVVLSRYSGQ SDIRIGAPNA GRNRKELEGL IGFFINTQVL RVQVDERATF AEMIDQVKDV
VAGAQSHQDL PFEQLVDALV PERNLSHNPL FQFKLNQNVA GEARKRKQLG GLEVEDFPFD
DGSAHFDLAL DFTDTPTGID ASFTYATDLF DSATIERIAA SLRRTLEALV RIPHARLLEC
PESTALSQEQ DNRVFPHADM LSLWQHGLQL GGEGAALRCG EQTLGYMDLE RASNRLARHL
QALGVGSGTT VALCQERSAG WVTAVLAVLK AGGLYLPLDS QQPADRLQQL LDDSRAALLI
HDRHDGRFTD LPGLDVLAYD PTLWSGYSDE PLSTCIVPEQ PAYVIYTSGS TGQPKGVVIS
HRALANYVQA ALDRLQLPLG ASMAMVSTVA ADLGHTMLFG ALASGRPLHL LPQELAFDPD
GFAAYMATHR VGVLKLVPSH LQGLLQAARP ADVLPEQALI LGGEACPWAL VERVEQLKPT
CRIINHYGPT ETTVGILTHE ARQRSEGGRS VPVGRPLANG RAEILDAYLN PVPLQISGEL
YLGGQGLAQG YLGRPALTAE RFVPAEQGER RYRTGDRARQ GSDGLVEFVG RADDQVKIRG
YRVEPGEITQ ILQNLEGVKE AVVQALPLES DASRLQLVAY CVAEAGVTVS VLQQGLQARL
PDYMVPAHIL LLERLPLTAN GKLDRRALPK PGIVAQGYVA PVGEIEEKLA AIWTEVLKLE
RVGSHDNFFE LGGDSILSLQ IIARAKRQGI KLTPKQLFEK QTIGQLAQVA KRVEDKKQAA
TESAERVAGQ MPLLPIQSRF FELDIPQRHH WNQSVLLRPN EPLDPECLKA ALKVLVEHHD
ALRLRFTGHD GKWSARFQDR EEAELLWQKD LLDISELEAS GNEAQASLSL DQGPLLRAVL
VNLPDNRQRL LLVIHHLVVD GVSWRVLLED LQTTYRQAVQ GQPLRLPART SSLKHWAERL
HAYASSEALQ AEQEYWLRSL GEAAQELPRD NPDGDESGRQ SRSASTRLDA ELTQKLLRLA
PAAYRTQVND LLLAALARVL CLWSGQDSVL IQLEGHGRED LFEDIDLTRT VGWFTSLFPV
RLTPRDDWGT SIKGIKEQLR SVPNKGIGYG ILRYLGSEDI RQRLSLLPEA RVTFNYLGQF
DGSFAQDEGA LFEPASESAG QARSEEAPLG NWLSINGQVY DGELHLEWTF SQDVYRPDGI
ERLARAYEQA LADIVAYCAD ENNRSVTPSD VPLAGLSQEQ LDALPLPAGE IEDIYPLSPM
QQGMLFHSLL EHEAGHYINQ MRVDVQGLDV ERFRAAWQAV VDRHEVLRAG FVEVDGRPLQ
VIRRRMSMPC VELDWRGQPQ LQDSLDTWAQ ADRQRGFDLE REPLLRLAVI RTGENRHHLI
YTNHHILLDG WSGSQLQGEV LQAYTGKPIG HPGLRYRDYI AWLGRQDRAA SEAFWREQLG
ALEEPTRLAD SIRRADGRTG EGYGDHVQLL DSRQTAALSE FARAQRVTVN TLVQAAWLLL
LQRYSGQATV CFGATVAGRP AELQGVEGQL GLFINTLPVI ASPRSEQHVG EWLDQVQAQN
LGLREHEHTP LYEIQRWAGL GGEALFDSLL VFENYPIAEA LQQGAPKGLV FERIAVQEQT
NYPLTLAIGL GETLTVRYGY DRGHFDAAGI ERIAGHFARL LQGLASDARA AIGELPLLDP
EEYQRIVRDW NRTEARYPSE HGVHQLIEEQ VARTPEAVAL VFGEREMSYG ELNRKANRLA
HRLRELGVGP DVLVGIAVER GFEMVVGLLA ILKAGGAYVP LDPEYPGERL AYMIGDSGIG
LLLTQRHLQD RLPPTGGVRN LLLEPDDDWL EDYPQENPAN RTAPQNLAYV IYTSGSTGRP
KGAGNTHTAL INRLHWMQKA YRLDTTDTVL QKTPFSFDVS VWELFWPLLN GARLAIARPG
EHRDPERLID TIERHGVTTL HFVPSMLQAF ISVEHIEGCR SIRRLVCSGE ALPAELARKT
LERMPTVGLF NLYGPTEAAI DVTHWTCDHV DPEGVPIGQP IDNLKTHILE ESLHPVAPRC
CGELYLGGVG LARGYHNRPG LTAERFVPDP FDSSEQGGGR LYRTGDLARY RADGVIEYQG
RIDHQVKIRG FRIELGEIEA RLQQHEAVRE AVVIDIDGPG GRQLAAYLVP DDLAMLDGDE
RQTGLRGELK AYLGAALPDY MVPAHLVFLA RLPLTPNGKL DRKALPRPDV SLLQRAYVAP
ASELEQRMAA IWAEVLKVER VGLTDNFFEL GGHSLLATQV ISRVRQSLGI ELPLRALFEA
QDLASFAGRV GQGQTSRAPA LEKADRGQPL VASYAQQRQW FLWQLEPGSA AYHIPAALRL
KGALDIEALR RSFEALIRRH ESLRTIFRQD GERTIQVIPP SGSLWFEQEP LPADAAIGLD
ERIRARVEAE VQRLFDLERG PLLRVKLLRL DEDDHVLVLT LHHIVSDGWS SPLMVDELVH
LYEGYSQGRE VTLPELPVQY ADYALWQRQW METGERDRQL DYWTAQLGGE QPVLELPGDR
PRPAVQTHAG ARLAVELDGE LARSLRELAR REGVTLFMLL LASFQSLLHR YSGQDDIRVG
VPIANRTRAE TEGLIGFFVN TQVLKAEFEL QTTFVELLRQ VKRTALEAQA HQDLPFEQLV
EALQPERSLS HSPLFQVMYN HQTAAKGAAR TLPGLRVEGL AWENRTTHFD LTLDTYESAN
SLSASLTYAT DLFDERTIER LARHWLNLLA GIVRQPERRI GELALLDPEE YRRIVQAWNR
TEARYPSERG VHQLIEDQVA RTPEAVALVF GEQEMSYGEL NGRANRLAHR LIELGVGPDV
LVGIAVERGF EMVVGLLAIL KAGGAYVPLD PEYPRERLAY MIGDSGIDLL LTQEHLQDRL
PSTDGVQNLL LEPGDDWLEG YGEENPASRT MPQNLAYVIY TSGSTGQPKG VTISHGAFSM
HSQAVGQCYG LTVNDRLLQF ASISFDAAAE QLFTPLANGA AVVLGDVRQW SAVRLAEEVE
RSGITALNVP PAYIDQISDA LEEAHRHIDV RICILGGEAW KAGLLGKAVR AGQVFNAYGP
TETVITPLVW QVESDEFVGY APIGKPVGQR QAYLLDDSLN PEPQGNIAEL YLGGEGLARG
YLNRPSLTAE RFVPDPFDDS EQGGGRLYRT GDLARYRADG VIEYLGRLDH QVKIRGFRIE
LGEIEARLQQ HEAVREAVVI DVEGPGGRQL AAYLVPDDSA MLEGDERQTK LRRELKAHLG
AALPDYMVPA HLVFLERLPL TPNGKLDRKA LPRPDASLLQ QAYVAPASEL EQRIAAIWAE
VLKVERVGLT DNFFELGGDS IISLQVVSRA RQAGIRFTPK DLFQHQTVQG LATVTRLGDE
GGVHVDQGPL SGETALLPIQ QYFFEEAIPE RHHWNQSLLL KPGRVLDGAR LEQAVQALID
HHDALRLAFV EDPAGHWSAR YRPASERQSV LWQAELRSAE ELERLGNEAQ RSLNLQEGPL
LRGVLAELAD GSQRLLLAIH HLVVDGVSWR ILLEDLQTAY GQLEQGRPVS LPGKTSSTRA
WVEHLQGYAN SEAGQRELAH WREQLEDVAG ELPCDNPAGG QQNRHAAHAT THLARDWTRR
LLQEAPAAYR TQVNDLLLTA LARVLTRWTG QAAALVQLEG HGREDLFDDV DLTRTVGWFT
SVYPAKLCPS ETLDGSIKAI KEQLRAIPNK GIGFGILRYL GDEPTRQYLA GLPVPRITFN
YLGQFDGSFD RSEEHALLVP AGESAGAEQS PDAPLGNWLS INGQVYGGEL RLTWTFSRER
FAEATIQRLA DAYARELEAL VEHCGQAEHR GVTPSDVPLA GLSQEQLDTT LPVPAGEIED
IYPLSPMQQG MLFHSLLEHE AGHYINQMRV DVQGLDVERF RAAWQAVVDR HEVLRAGFVE
VDGRPLQVIR RRMSMPCVEL DWRGQPQLQD SLDTWAQADR QRGFDLEREP LLRLAVIRTG
ENRHHLIYTN HHILLDGWSG SQLQGEVLQA YTGKPIGHPG LRYRDYIAWL GRQDRAASEA
FWREQLGALE EPTRLADSIR RADGRTGEGY GDHVQLLDSR QTAALSEFAR TQRVTVNTLV
QAAWLLLLQR YTGQTTVSFG ATVAGRPAEL QGVEGQLGLF INTLPVIASP RAEQRVVEWL
GQVQAQNLGL REHEHTPLYE IQRWAGLGGE ALFDSLLVFE NYPIAEALQQ GAPEGLVFDR
VTTQEQTNYP LTLAIGLGET LTVRYGYDRG HFDAADIERI AGHFARLLQG LASDAQAAIG
ELPLLDPEEY QRIVHDWNRT EARYPSECCV HQLIEEQVAR TPEAVALVFG EREMSYAELN
RRANRLAHRL IELGVGPDVP VGIAVERGVE MVVGLLAILK AGGAYVPLDP EYPGERLAYM
IGDSGIGLLL TQRHLQDRLP SADGAQSLFL EPDDDWLEGY GEENPANRTM PQNLAYVIYT
SGSTGRPKGA AVRIGSFVNL LHWYRAACEL TADDRVLLLS SYSFDLTQKN LYGVLCAGGQ
LHIAPAGYDP DSHRRQIGKH RLSVLNCAPS AFYPLLQGDR AELASLKHVL LGGEAIQPGE
LAEWLGSPQA ANVSIHNTYG PTECTDVVIA RATPGSTVPG LSALPIGRPL PGVSAYVLDG
SAGPVALGQA GELHIGGDCV GEGYWHRPGL TAERFVPDPF DDSAQGGGRL YRTGDLARYR
ADGVIEYLGR IDHQVKIRGF RIELGEIEAR LRQHGAVREA VVIDVEGAGG KQLAAYLVPD
DPAMLEDDER QTGLRGELKA HLGAVLPDYM VPVHLVFLAR LPLTPNGKLD RKALPQLDAS
LLQRTYVAPV SELEQRIAAI WAEVLKVERV GLSDNFFELG GHSLLATQVI ARIREQLGID
MALRELFELP VLAEFSQGAQ EKFGQIEPLQ GELAKSLETL KRLTAEEIDE LIS
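
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. A rough sketch of that translation follows. It assumes that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, alternative start codons are not handled. The `translate` function is illustrative, not the database's own method.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGACAGGA CAACAGCGGA ACGTATCGCC"))
```

For the first three blocks of the gene sequence this yields MDRTTAERIA, which matches the first block of the protein sequence listed above.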