Gene Avin_25650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_25650 
Symbol 
ID7761477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2608590 
End bp2621546 
Gene Length12957 bp 
Protein Length4318 aa 
Translation table11 
GC content72% 
IMG OID643805447 
Productpeptide synthase 
Protein accessionYP_002799720 
Protein GI226944647 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGATA CCTTCGAACG CCCCGTTTCC ATGGCCGATG CCCTGATGCG GCGCGCCGCC 
GCCCAGCCGG AGCGGCTGGC GTTGCGCTTT CTCGGCGGCG ACGGGGAGGA GGTGTTGAGC
TACCGGCAAC TGGACCGCCA GGCGCGCATC ATCGCGGCGG CGCTGGCGGA GCGCGGCGAG
CCGGGCGAGC GCGCGGTCCT GCTGTTTCCC AGCGGGCCGG ACTACGTCGC GGCATTCTTC
GCCTGCCTGT ACGCCGGGGT GATCGCGGTG CCGGCCTATC CGCCGGAGAG CAGCCAGGAA
CAGCACCTGC GCCGACTGAT CTCGATCATC GCCGACGCCC AGCCACGGCT GATCCTCACC
ACCTCCGGGG TGGCCGGTTC GCTGGCCGCC CTGGGGGAAG GCCGCGGCGA CGCGCTGCCC
GAACTGCTGG CGGTGGACGC GCTGGACCCG GCCCTGGCCG ACGGCTGGCG GGCGCCGGCG
GTGCCGGCCG AAGCCATCGC CTTCCTTCAG TACACCTCCG GCTCCACCGC GACCCCCAAG
GGCGTGCAGG TCAGCCACGC CAACCTGGAG GCCAACGAAT GGCTGATCCG CCAGGGCTAC
CGGATCGGCG ACGACGACAC TATCGTCAGT TGGCTGCCGC TGTACCACGA CATGGGGCTG
ATCGGCGGAC TGCTGCAGGG CATCTATAGC GGCGTGCCGG TGGTGCTGAT GTCGCCGCAG
CATTTCCTCG AACGGCCGGT GCGCTGGCTG GAGGCGATCG GCCGCTACCG GGGCACCATC
AGCGGCGGTC CGGACTTCGC CTATCGGTTG TGCCACGAGC GTATCGCCGA GGGCAACCTG
GCCGGCCTGG ACCTGTCCGG CTGGCGGGTG GCCTTCTCCG GTTCCGAGCC GATCCGCCAG
GACAGCCTGG CGGCGTTCGC CGAACGCTTC GCGCCCTGCG GCTTCCGTCC CGATGCCTAT
CTGGCCAGCT ACGGCCTGGC CGAGGCGACC CTGTTCGTCA GCGGCGGCCG GCCGGGGCAG
GGCATCTCCG CCCTGCGGCT CGACGCCGCC GCGCTGGCGG CCGACCGCGC CGAGCCCGGC
GAAGGTCCGG TACTGATGAG CTGCGGCTGG GAACAACCCG GCCATCCGCT GCTGATCGTC
GACCCGCGCA GCGGCGAGGC GCTGGGCGAC GGGCTGGTCG GCGAGATCTG GTCGAGCGGA
CCGAGCGTCG CCCAGGGCTA CTGGCGCAAT CCGCAGGCGA CCGCACAGGT TTTCGTCGAG
CGTGACGGTC GCACCTGGCT GCGCACCGGC GACCTGGGAT TCCGCCAGCG GGGCGAACTG
TTCGTCACCG GGCGGCTCAA GGACATGCTG ATCGTGCGAG GCCAGAACCT CTATCCCCAG
GACATCGAGC GCACCGTGGA GGAGTGCGTC GCCGAGGTGC GGCGCGGGCG CGTGGCGGCC
TTCGCGGTGG AGCACGAGGG CCGCGAGGCG CTCGGCGTGG CGGCGGAGAT CGGCCGGCGC
ACCGCCCAGC AGGTGGCCGC CGACGAACTG CTCGGGCGCA TCCGCCAGGC GGTGGCCGAA
GCGCACCACG AGGCGCCCGT ACTGGTGCTG CTGCTGGAGC CCGGCGCGCT GCCGAAGACC
TCCAGCGGCA AATTGCAGCG TTCCGCCTGC CGGCAACGCC ACGAGGCGGG GGAACTGGTG
CCGATCGCTT CCTTCCGCGA GGGACGGGGC GCCGAGCCTG CCGCCGACGC GACGCCGCCC
GGCGAACGCC TCGCCGCGCT CTGGCGGACG CTGCTCAAGG TCGAACGGCT CGACCCCGAC
GCGCACTTCT TCGCCCTGGG CGGCAATTCC ATCCAGGCCA TCCAGATGAT CGCCGAGCTG
CGCGACGAAC TGGGCGTCGA CCTGGAGCTG CGCAGCCTCT ACGAGGCCCC GACCCTGCGG
GCGTTCGGCG CCGAGCTGGA GCGCCTGCTG GCCGAAGGCG GCGCGTCCGC CACCGGTATC
CCGCGCCTGC CGCGCGAGGG CGCCCTGCGC CAGTCGGCGG CGCAGAACCG CCTGTGGTTC
CTCTGGCAAC TGGAGCCGCA CAGCGTCGCC TACAACATTC CCGGCGGCCT GCGCCTGCGC
GGCGAACTGG ACGAGCAGGC GCTGCGACAC AGCTTCGCGA CGCTGGTCGA GCGCCACGAA
TCGCTGCGCA CCACCTTCTA TGAGGAAGAC GGCATCGGCT TCCAGTGCGC CGGAACGGCG
TCCGACTGGA CGCTGCTGCG CGACGATCTT TCCGCGCTGC CCGAAGCGCA GCGCGAGGAG
CGGGCGTGGA CGATCCGCGA GGAAGAGGCG CGTTTCCATT TCGACCTGGA GCACGGCCCG
CTGCTGCGCG TGCGGCTGGT CAGCCTGGAC GAACAGGAGC ACCTGTTGCT GGTGACCCTG
CACCACATCG TCGCCGACGG CTGGTCGCTG GGCCTGCTGC TCGACGAATT CGCCCGGCTC
TACGCCGCGC GGATCCAGGG GCAGACGGCA TCCCTGGCGC CCCTGCCGAT CCGCTACGCC
GACTATGCGG CCTGGCAAGC GGACGATGGC GAACGCCTCG CCGCGCAACT GGACTACTGG
CGCGCCGAAC TGGCCGACGA GCACCAGCCC CTGGCGCTGC CGCTGGACCG GCCGCGCGGC
GGCCGGGAGT CGTCCGCCGA GCGTTTCGGT CTGCGCCTGG ACAAGGCCCT CGGCGAACGC
CTGCAGGCCA TCGCCAGGGC GCGGGGCACC ACGCTGTTCG CGGTCCTCCT GGCGGCCTTC
CAGACCCTGC TGCACCGCTA TACCGGGCAA TCCGACATCC GCCTCGGCGT GCCCAACGCC
AACCGCCCGC GCCGCGAGTT GCAGGGACTG GTCGGCTTCT TCATCAATAC CCAGGTGCTG
CGCGGGCGGC TGGATTCGCG GCAGACCTTC GCCGCGCTGC TGGAGGCCGC GCGGCGCACG
GTCGAGGGCG CCCAGGCCAA CCAGGACCTG CCGTACGAGC GGCTGGTCGA GGCGCTGGGC
GGCGAGCCGC CGTTCGAGGT GATGTTCAAC CACCAGCAGC GCGACCTGGA AGCGCTGCGC
CGCCTGCCGG GGCTGCTCGC CGAGGAACTG CCCTGGCACA GCCGCGAGGC CAAGTTCGAC
CTGCAACTGC ACAGCGAGGA GGACCGCCAG GGGCGTATCC GCCTGTCTTT CGACTACGCC
TGCGAACTGT TCGAGCGCGC CAGCGTCGAG CGTCTCGCCG GGCATTTCGT GCGCCTGCTG
GAACAGATCG GCGAGAACCC CGAGCGCGCC ATCGGCGACT TCGAGCTGCT CGGTGTCGAC
GAGCGCCAGC GACTGCACGA CTGGGCGCGC GGGCCGGCCG AACCGGCCGG CGTCCTGCTG
CCCGAATTGC TGACCGCCCA GGCACGGGCG ACGCCGCAGG CCATCGCCCT GGTCAGTGGC
GAGGCGACCC TCGACTACAC CGACCTGGAG CGCCGCGCCA ACCGCCTGGC GCACCGCCTG
CGCGAGCTGG GCGTCGGCCC GGAGGCGAAG GTCGGCCTGC TGGCCGGACG CGGCGTCGAA
CTGATGGTGG CTCTGCTGGC GGTGGTCAAG GCCGGCGGCG CCTACGTGCC GATGGACGCC
GACTACCCGC GCGAACGCCT GGCCTGGATG ATCGGCGACA GCGGCCTGAG CCTGCTGCTG
GGCCACCGGC GGGTGCTGGA CGCGCTGGAA GCGCCGGCCG GACTGGCGAC GCTGCCTCTG
GAGGAGCTGG ACGCCGAGGG CTACCCGGAC ACGCCGCCCG CCCTGGCGCT GGACGCCGGC
AACCTGGCCT ACGTGATCTA CACCTCCGGC TCCACCGGCC AGCCCAAGGG CGTCGGCGTC
AGCCATGGCG CGCTGAGCGA ACGCCTGCAC TGGATGCGCC GCGAGTACGC CCTGGATGCT
TCCGACGTGC TGCTGCAGAA GGCGCCGCTC GGCTTCGACG TGTCGGTCTG GGAATGCTTC
CTGCCGCTGA TCGCCGGCAG CCGCCTGGTG CTGGCGGCGG ATGGCGAGCA CCGCGACCCG
CGCCGGCTGG TGGAACTGGC ACGGGCGCAC GGGGCGACCT GCCTGCACTT CGTGCCGCCG
CTGCTGCAAC TGTTCGTCGA GGAACCGGCG CTCGGCGACT GCCGCCGCCT CAGGCTGCTG
TTCTCCGGCG GCGAGGCGCT CTCCGCCGAG CTGTGCCGGC GGGTGCGCGA ACGGCTGCCG
CAGGTCGCGC TGCACAACCG CTACGGGCCG ACCGAGACGG CGATCAACGT CACCCACTGG
CGCTGCGCGG AAGAGGGCGC GCGGGTGCCG ATCGGCCGGC CGTTGGCCAA CGTGGTCTGC
GAGCTGCGCG ACGCCGAACT GGAACTGGCG CCCGGCGGCG CGGTGGCCGA ACTGCTGCTC
GGCGGCAGCG GCCTGGCGCG CGGCTATCTG GGGCGTCCGG CGCTGACCGC CGAGCGCTTC
GTGCCGGGCG AGGACGGCGC GCGGCTGTAC CGCAGCGGCG ACCTGGCGCG CTGGCGGGAC
GACGGGGCGC TGGAATTCCT CGGCCGCGCC GACGAGCAGG TGAAAGTGCG CGGTTTCCGC
ATCGAACCGG AGGAAATCCG CGCGCACCTG CTGTCGCAGC CGGCGGTGCG CCAGGCCGTG
GTGCTGGTGC GCGAGGGCGC CGCGGGTGCG CGACTGGTCG CCTACCTGAC CAGCGACGGC
GCACAAGACG ATCCGGCCCT GGCCGAACGC CTCAAGCGCG CCCTGGCCGC CAGCCTGCCG
GAGTACATGG TGCCGGCGCA GTTCGTCCGC CTCGACGCGC TGCCGCTGAC GCCGACCGGC
AAGCTGGATC GCAAGGCGCT GCCGGAGCCG GACTGGCGGG CGGGCGAATA CGTCGCTCCG
CGCGACGAGC GCGAACGGCG CCTGGCGGCG ATCTGGCAGG AGGTGCTGGG CTTGCCGCGC
GTCGGCCTGG ACGACGACTT CTTCGCCCTG GGCGGCCATT CGCTGCTGGC CACCCGCATC
GTCTCGCGGG TGCGCCAGGC GTTCGACCTC GACCTGCCGC TGCGCAGCCT GTTCGAAGCC
AGCCGCCTCG GCGAGTTCGC CGCCGAAGTG GCCCGCCTGC AGGCCGAGGG CGCGCGCGAC
GGCTGGGGCG CGATCGAGCG GGCGGATCGT GGCCAACGCA TCCCGCTGTC GCATTCCCAG
CAGCGCATGT GGTTCCTCTG GCAGCTCGAC CCGCAGAGCC CGGCCTACAA CGTCGGCGGC
ATGGCGCGGC TGAGCGGGCC GCTCGATGCG CAACGCTTCG AGATGGCCCT GCAGGCGCTG
ATCCGGCGCC ACGAGACCCT GCGCACCACC TTCCCCAGCG ACGACGGCAA GCCCTGGCAG
CAGGTGGCCG AGCGATCCGA CCTGTGCATG CAGCGGCTGG ACCTGTCCTC GCTGCCGGCC
GACCAGCGCC GGGCGCGCCT GCAGGAACTG GCCGACGAGC AGGCGCACCA GCCCTTCGAC
CTGGAGCGCG GGCCGCTGCT GCGCGTGTGC CTGGTAAAGG CCGGCGAACA CAAGCATTAC
CTGGTGGTGA CCCTGCACCA CATCGTCACC GAAGGCTGGG CGATGGACGT GTTCGCCCGC
GAACTGGGGG CGCTCTACGA GGCGTTCCTC GACGGGCGCG AATCGCCTCT GGAAGCCTTG
CCGGTGCAGT ATCTCGACTA CAGCCAGTGG CAGCGCCGCT GGCTGGAGGG CGGCGAGCGC
CAGCGCCAGC TCGACTACTG GAAACGGCAG TTGGGCGACG AGCATCCGCT GCTCGAGCTG
CCCGCCGACC GTCCGCGTCC GCCGGTGCAG AGCCATCGCG GCGAGCTGTT CCGCTTCGAC
CTCGAGCCGG CGCTGGCCGG GCGGGTGCGC GCCTGGAACG CGGCCCACGG CCTGACCATG
TTCATGACCG CCACCGCCGC CCTGGCGCTG CTGCTGTACC GCTACAGCGG ACAGGGCGAC
CTGCGCATCG GCGCGCCGGT GGCCAATCGC ATCCGCCCGG AAAGCGAGGG ACTGATCGGT
GCCTTCCTCA ACACCCAGGT GCTGCGCTGC CGGCTGGACG GGCGGATGAG CGTCGACGAA
CTGCTCGAAC AGGTACGCGA CACGGTGATC GAGGGCCAGG CCCACCAGGA CCTGCCGTTC
GACGAACTGG TGGAAGCCCT GCAACCGCCG CGCAGCGCCG CCCATCACCC GCTGTTCCAG
GTGATGTGCA ACGTGCAGCG CTGGGAGTTC CAGCAGACCC GGCAACTGGC CGGGATGACC
GTCGAATACC TGGTCAACGA CGCGCGGGCG ACCAAGTTCG ACCTCTACCT GGAGGTGACC
GATCTCGACC AGCGCCTGGG CTGCTGCCTG ACCTACAGCA GCGACCTGTT CGACGAACCG
CGCATCGTGC GCATGGCCGG GCACTGGACG CGCCTGCTGG AGGCGATGGT CGAGCTGCCG
TCGCGGCGCC TCGCCGAACT GCCGATGCTG GCCGAGGCCG AAGCCACCCG GCTGGCCGAA
GCGCCCGGCG ACTACCCGCT GGACGCCTGC CTGCACGAAC TGTTCGAAAC CCAGGCGGCG
CGGACGCCCG AAGCGCCGGC GCTGACCTGC GCCGGGCGGA CCCTGAGCTA CGCCGAGCTG
GATGTCCGGG CCAACCGCCT GGCCCGCGTG CTGCGCGAGC GCGGCGGCGG CCCGGAAGTA
CCGGTCGGCC TGGCCCTGGA ACGCTCGGCG GAGATGGTCG TCGGCATCCT GGCGATCCTC
AAGGCCGGCG GCGCCTACGT GCCGCTCGAC CCGGAATACC CGCTGGAGCG CCTGCGCTAC
CTGATCGAGG ACAGCGGCAT CGCCCTGCTG CTCGGTCACG CCGTGTTGTT CGAGGCGCTC
GGCGAGCTGC CGGCCGGCGT CGCGCGCTGG TGCCTGGAGG ACGACCTCGC CGCCCTGGAC
GGCCAGTCCG GCGCACCGCT GCCGAGGCTC GCCGGCCCGG ACAACCTCGC CTACCTGATC
TACACCTCCG GCTCCACCGG CCAGCCCAAG GGCGTGGCGG TGTGCCACGG CGAGATCGCC
ATGCACTGCC GGGCGGTGAT CGAGCGTTTC GGCATGGCGG CCTCCGACTG CGAGCTGCAC
TTCTATTCGA TCAACTTCGA CGCCGCCACC GAACGGCTGT TCGCGCCGCT GCTGTGCGGC
GCGCGGCTGG TACTGCGTGG GCAGGGACAA TGGGATGCGG AATCGATCTG CCAGTTGATC
CGCGAGCAGA GCGTGAGCAT CCTCGGTTTC ACCCCCAGCT ACGGCAGCCA GTTGGCGCAA
TGGCTGATCA GTCGCGACGA GCGCCTGCCG GTGCGGCTGT GCATCACCGG CGGCGAAGCG
CTGAGCGGCG AGCACCTGCA GCGGATTCGC GCGGCCTTCG CGCCGCAAGC CTTCTTCAAT
GCCTACGGGC CGACCGAGAC GGTGGTCATG CCGCTGGCCT GCCGGGCGCC GGAAACCCTG
GAGGAGGGCG CCGCCAGCGT GCCGATCGGC AGCCTGGTGG GCGCCCGCCG CGGCGCCATC
CTCGATGCCG ACCTGGCGCC GCTGCCGCAG GGCGCCGCCG GCGAGCTGTA CCTCGGCGGC
AAGGGCCTGG CGCGCGGCTA CCACCGGCGC CCGGCGCTGA CCGCCGAACG CTTCGTGCCG
GATGCGGACG GCGCGCGCCT GTACCGCAGC GGTGACCGGG TACGCCTGCG CGACGACGGC
CAGGTGGAGT ATCTCGGGCG CATCGACCAG CAGGTCAAGG TGCGCGGCTT CCGTATCGAA
CTGGGCGAGA TCGAGGCGCG CCTGCGCGAG CATCCGGCCG TGACCGACAC GGCGGTGCTG
GCCCTGGACA CATCGTCCGG CAAGCAACTG GCCGGCTACG TGGCGACCGC CACGGCAGGG
CTCGACGAAG CGGCGCGCGC GGCACTGCGC GAAGCGCTCA AGGCCCACCT GCGCGCGCAA
CTGCCCGACT ACATGGTGCC GGCGCACCTG TCGCTGCTCG AAAAACTGCC GCTGACCCCC
AACGGCAAGC TCGACCGCCG CGCCCTGCCG GCGCCCGACC CTGAGCAGGA CCGCCAGGCC
TACCAGGCGC CGCGCAGCGA GCTGGAACGG CAACTGGCGA CGATCTGGAC CGAAGCGCTG
AACGTCGGCC GCATCGGTCT CGGCGACAAC TTCTTCGAAC TGGGCGGCGA CTCGATCCTG
TCGATCCAGG TGGTCAGCCG CGCCCGCCAG GCGGGCATCC ACTTCACCCC GCGCGATCTG
TTCCAGCACC AGACGGTGCA GGCCCTCGCC ACGGTGGCCC GGAGCGTCGC CACGGTAGCC
AGCGAACAAG GGCCGCTGAG CGGCGCCACG CCGCTGACGC CGATCCAGCA CTGGTTCTTC
GAACAGCATC TCGCCAAGCC ACAACACTGG AACCAGAGCC TGCTGCTGGA GCCGAGCGGC
CGGCTGGATG CCGACTGCCT GGAAAAGGCC CTGCATTACC TGCGCAACCA CCACGACGCC
CTGCGCCTGG CCTTCCGCCG GATCGACGGC CACTGGCGGG CCGAATACCG CGCGGTGGGC
GAGGGCGGCG GGGAACTGCT GTGGCGGGTC CGGCCGGCCG GCCCGGAAGA ACGCCGGGCG
CTGTTCGCCG AAGCCCAGCG CAGCCTCGAC CTGGCGCACG GCCCGCTGCT GCGCGCGGTG
CTGGCGGAGG ACGCCAGCGG CCGGCAGACG CTGCTGCTGG CGATCCATCA CCTGGCGGTG
GACGGGGTGT CCTGGCGGGT GTTGCTGGAA GACCTGCAGA GCCTCTACCG GCAGTTCGAA
GCCGGCCGGT CTCCGGCATT GCCGGCGCGG ACCAGCTCGT TGCGCGACTG GGCCACACGA
CTGGAAGCCT ACGCCGCCAG CGAATCGCTG CGCGAGGAAC TGGTCTGGTG GCGGACGCAC
CTGGCCGGGA CGGACGGCGA GCTGCCCTGC GACCGGGCCG GCACCGACGA CCGCTACCGG
GACGCCCGCA GCCTGAGCCT GCGCCTCGAC ACCACGCGCA CCCGGCAGTT GCTGCAGCAG
GCGCCGGCGG CCTACCGCAG CCGGGTGGAC GAGCTGCTGC TCACCGCCCT GGCCCGCGTG
CTGTGCCGCT GGAGCGGGCG CGACGCGGCG CTGGTGCAAC TGGAGGGGCA CGGCCGCGAG
GCGCTGTTCG ACGACATCGA CCTGACCCGC ACGCTGGGCT GGTTCACCAG CGCCTACCCG
GTGCGCCTGC AACCCGCCGG GGAATTGGCG GATGCCATCA AGGCGGTCAA GGAACAACTG
CGCGGCGTGC CGCACAAGGG CCTCGGCCAC GGCGTGCTGC GCCATCTCGC CGACGCCGAC
ACGCGCGCGG CGATGGCCGC GCTGCCGCAG GCGCGGATCA CCTTCAACTA CCTCGGCCAG
TTCGACCAGA GCTTCGCGCA TGACGCGCTG TTCCGGCCGC TTGACGAACC GGCCGGCCCG
GCCCACGCCG AGGACGCGCC GCTGCCCAAC CGGCTGTCGA TCGACGCCCA GGTCTACGGC
GGCGAGCTGC GCCTGCGCTG GACCTACGGC GCCGGCCGCT ACGACGAAGG CAGCGTGCGG
GCGCTGGCCG AGGAGCTGCT GGCGGAACTG CAGCGGCTGA TCGAGCACTG CCTGGAAGAG
GAGAGCGGCG GCCCGACGCC CTCGGATTTC CCCTTGGCAC AGCTCACCCA GGCCCAGCTC
GACGCGCTGC CGGTGCCGGC GGCGGCGATC GAGGACATCT ACCCGCTGAC CCCGATGCAG
GAGGGCCTGC TGCTGCACAC CCTGCTGGAG CCGGGCACCG GCATCTACTA CATGCAGGAC
CGCTACCGGA TCGACAGCGA ACTCGACCTG CAGCGCTTCG AGCAGGCCTG GCAGGCGGTG
GTGGCGCGCC ATGAGGCGCT GCGCGCCTCC TTCACCTGGA ACAGCGGCGA GACCATGCTG
CAGATCGTCC ACAAGCCGGG CGCCGCACGC ATCGACTACC AGGACTGGAG CGCGCTGGAC
GCGGACGGCC ACGAGGAACG CCTGCAGGCG CTGCACAAGC GCGAGCGCGA AACGGGCTTC
GACCTGCTGC GCGAGCCGCC CTTCCATCTG CGGCTGATCC GCCTGGGCGA AGCGCGCTAC
TGGTTCATGA TGAGCAACCA TCACATTCTC ATCGACGCCT GGTGCCGCGG CCTGCTGATG
GGCGACTTCA TGGAGATCTA CGCCGCCCTC GGCGAGGGCC GCGAGCCGCG GCTGCCGTCC
GCGCCGCGCT ACCGCGACTA CATCGCCTGG CTGCAACGCC AGGACTTCGA GGCGGCGCGC
AACTGGTGGC GCGACAACCT GCGCGGCTTC GAGCGGCCGA CCGCCATCCC CGGCGACCGC
CCGCTGCTGC GCGAGCACAC CGGCGGCATG CAGGTCGGCG ACTGCCTGAC GCGGCTGGAG
GCCGCCGACG GCGCGCGCCT GCGCGAACTC GCCCAGCGCC ACCAACTGAC CGTCAACACC
TTCGCCCAGG CGGCCTGGGC GCTGGTGCTG CACCGCTACA GCGGCGAACG CGACCTGACC
TTCGGCGTCA CCGTGGCTGG ACGGCCGGTG AGCATGCCGG CGCTGCAAGG CACCGTCGGC
CTGTTCATCA ACAGCATCCC GCTGCGCCTG CGCCTGCCGG AAGCGGGCGA GCGCCGCGCG
GTGCGCGACT GGCTGCGCGA ACTGCTGGAA CGCAACCTGG AACTGCGCGA GTACGAATAC
CTGTCGCTGG TGGACATCCA GGAATGCAGC GAGCTGCCCA AGGGCCAGCC GCTGTTCGAC
AGCCTGTTCG TGTTCGAAAA CGCGCCGGTG GACAGCGCGG TGCTGGACCG CGCCCAGGGC
CTCAAGGCCC GCTCCGAGTC CGGACGCACC CACACCAACT TCCCGCTCAC CGTGGTCTGC
TACCCGGGCG ACGACCTGGG CCTGCACCTG TCCTACGACC GGCGCTATTT CGAGGCCGCG
ACCATCGAGC GTTTGCTCGG CGACTTCAAG CGCCTGCTGC TGGCCCTGGC CGACGGCATC
CTCGGCGACC TGTCCGACTT GCCGCTGCTG GACCACGGCG AGCGCGAATT CCTGACCGAG
GGCTGCAACC GCAGCGAGCG CGACTACCCG CTGGAGCAGG GCTACGCACG GCTGTTCGAG
GCACGGGTGG CGGCGCATCC CGAGCGGATC GTCGCCCGTT GTCAGGACGC ACAGTGGAAC
TACGCAGGGC TGAACGCGCG CGCCAACCGC CTGGGCCACG CCCTGCGCGC GGCCGGCGTG
GGAGTCGACC AGCCGGTGGC GCTGCTCGCC GAGCGCGGCC TCGACCTGCT CGGCATGATG
ATCGGTGCCT TCAAGGCCGG CGCCGGCTAC CTGCCGCTCG ACCCGGGCCA TCCGGCCCAG
CGCCTGACCC GCATCCTCGA ACTCGGCCGC GTGCCGCTGC TGGTCTGCTC GGCGGCGTGC
CGCGCGCAGG CCGTCGAGCT GCTGGAGGCG CTGGCCGGCC AGGGACGTCC GCGCCTGCTG
GTGTGGGACG AGGTGCAGGC GGGCGACTGG CCGACGGCGA ACCCCGGCGT CTACAGCGGC
CCGGACAGCC TCGCCTACGT GATCTACACC TCCGGCTCCA CCGGACTGCC CAAGGGGGTG
ATGGTCGAGC AGGCCGGGAT GCTCAACAAC CAGTTGTCCA AGGTGCCCTA CCTGGGCCTG
GACGAAGCCG ATGCGATCGC CCAGACCGCC TCGCAGAGCT TCGATATCTC GGTCTGGCAG
TTCCTCGCCG CGCCGCTGTT CGGCGGCCGG GTGGAGATAG TGCCGAACGC CATCGCCCAC
GATCCGGGCG CCCTGCTGGC CCTGGCGCGC GAGCGCGGCG TCACGGTGCT GGAAAGCGTA
CCCTCGCTGA TCCAGGGCAT GCTCGCCGAG GAGCGGGACG GGCTCGGCGC GCTGCGCTGG
CTGCTGCCCA CCGGCGAGGC GATGCCGCCG GAACTGGCCC GCCAGTGGCT GCAGCGTTAC
CCGCAGGTGG GGCTGGTCAA CGCCTACGGA CCGGCGGAAT GCTCGGACGA CGTGGCGCTG
TTCCGCGTCG ACATGGAGGC CACCGCCGGC AGCTACCTGC CCATCGGCAG TCCCACCGAC
AACAACCGCC TGTACCTGCT CGACGAGGCG CTGGAACTGG TGCCGGCCGG CGCGACGGGC
GAACTGTGCA TCGCCGGCAC CGGCGTCGGC CGTGGTTACG TGGGCGATCC GCTGCGCACC
GCGCTGGCCT TCCTGCCCAA CCCCTTCGCC CGCGAGCCCG GCGAGCGTCT GTACCGCAGC
GGCGACCTGG CGCGGCGGCG CGTCGACGGC CTGCTGGAGT ACGTCGGGCG CATCGACCAT
CAGGTGAAGA TCCGCGGCTT CCGCATCGAA CTGGGCGAGA TCGAGGCGCA CCTGCACGAA
CAGGCGGAAG TCCGCGAGGC CGCCGTGGCG GTGCAGGAGG GGCCGAACGG CAAGTACCTG
GTCGGCTACC TGGTGCCCGC CGACATGGAA CTGGCGGAGG TTTCCCCGGC CACCGCGCCG
CTGCTGCACG GCGAGCTGTT CGAGCGCGTC AAGCAGCGTC TGCGCGCCGA ACTGCCCGAC
TACATGGTAC CGGCGCACTG GCTGCTGCTG GAGGGCCTGC CGCGCAACAC CAACGGCAAG
CTGGACCGCA AGGCGCTGCC GGAACTGGAG ATCGGCCAGT CGCGCGGGCA GGCCTACCTG
GCGCCGCGCA ACGATCTGGA ACGGACCCTG GCGACGATCT GGGCGGAGCT GCTCAAGGTG
GAGCGGGTCG GCGTGCACGA CAACTTCTTC GAACTGGGCG GGCACTCCCT GCTGGCCACC
CAGATCGCCT CGCGGGTGCA GAAGGCACTG CAGCGCAACG TACCGCTGCG GGCGATGTTC
GAGTGCAGCA CGGTGGGCGA ACTGGCCGCC TACATCGAAT CGCTGGAGGG CTCGGCCCTC
ACCGAACAGA AGGCCAGCCG CCTCGACGAC CTGATGTCGC GGCTGGAGGC GCTGTGA
 
Protein sequence
MTDTFERPVS MADALMRRAA AQPERLALRF LGGDGEEVLS YRQLDRQARI IAAALAERGE 
PGERAVLLFP SGPDYVAAFF ACLYAGVIAV PAYPPESSQE QHLRRLISII ADAQPRLILT
TSGVAGSLAA LGEGRGDALP ELLAVDALDP ALADGWRAPA VPAEAIAFLQ YTSGSTATPK
GVQVSHANLE ANEWLIRQGY RIGDDDTIVS WLPLYHDMGL IGGLLQGIYS GVPVVLMSPQ
HFLERPVRWL EAIGRYRGTI SGGPDFAYRL CHERIAEGNL AGLDLSGWRV AFSGSEPIRQ
DSLAAFAERF APCGFRPDAY LASYGLAEAT LFVSGGRPGQ GISALRLDAA ALAADRAEPG
EGPVLMSCGW EQPGHPLLIV DPRSGEALGD GLVGEIWSSG PSVAQGYWRN PQATAQVFVE
RDGRTWLRTG DLGFRQRGEL FVTGRLKDML IVRGQNLYPQ DIERTVEECV AEVRRGRVAA
FAVEHEGREA LGVAAEIGRR TAQQVAADEL LGRIRQAVAE AHHEAPVLVL LLEPGALPKT
SSGKLQRSAC RQRHEAGELV PIASFREGRG AEPAADATPP GERLAALWRT LLKVERLDPD
AHFFALGGNS IQAIQMIAEL RDELGVDLEL RSLYEAPTLR AFGAELERLL AEGGASATGI
PRLPREGALR QSAAQNRLWF LWQLEPHSVA YNIPGGLRLR GELDEQALRH SFATLVERHE
SLRTTFYEED GIGFQCAGTA SDWTLLRDDL SALPEAQREE RAWTIREEEA RFHFDLEHGP
LLRVRLVSLD EQEHLLLVTL HHIVADGWSL GLLLDEFARL YAARIQGQTA SLAPLPIRYA
DYAAWQADDG ERLAAQLDYW RAELADEHQP LALPLDRPRG GRESSAERFG LRLDKALGER
LQAIARARGT TLFAVLLAAF QTLLHRYTGQ SDIRLGVPNA NRPRRELQGL VGFFINTQVL
RGRLDSRQTF AALLEAARRT VEGAQANQDL PYERLVEALG GEPPFEVMFN HQQRDLEALR
RLPGLLAEEL PWHSREAKFD LQLHSEEDRQ GRIRLSFDYA CELFERASVE RLAGHFVRLL
EQIGENPERA IGDFELLGVD ERQRLHDWAR GPAEPAGVLL PELLTAQARA TPQAIALVSG
EATLDYTDLE RRANRLAHRL RELGVGPEAK VGLLAGRGVE LMVALLAVVK AGGAYVPMDA
DYPRERLAWM IGDSGLSLLL GHRRVLDALE APAGLATLPL EELDAEGYPD TPPALALDAG
NLAYVIYTSG STGQPKGVGV SHGALSERLH WMRREYALDA SDVLLQKAPL GFDVSVWECF
LPLIAGSRLV LAADGEHRDP RRLVELARAH GATCLHFVPP LLQLFVEEPA LGDCRRLRLL
FSGGEALSAE LCRRVRERLP QVALHNRYGP TETAINVTHW RCAEEGARVP IGRPLANVVC
ELRDAELELA PGGAVAELLL GGSGLARGYL GRPALTAERF VPGEDGARLY RSGDLARWRD
DGALEFLGRA DEQVKVRGFR IEPEEIRAHL LSQPAVRQAV VLVREGAAGA RLVAYLTSDG
AQDDPALAER LKRALAASLP EYMVPAQFVR LDALPLTPTG KLDRKALPEP DWRAGEYVAP
RDERERRLAA IWQEVLGLPR VGLDDDFFAL GGHSLLATRI VSRVRQAFDL DLPLRSLFEA
SRLGEFAAEV ARLQAEGARD GWGAIERADR GQRIPLSHSQ QRMWFLWQLD PQSPAYNVGG
MARLSGPLDA QRFEMALQAL IRRHETLRTT FPSDDGKPWQ QVAERSDLCM QRLDLSSLPA
DQRRARLQEL ADEQAHQPFD LERGPLLRVC LVKAGEHKHY LVVTLHHIVT EGWAMDVFAR
ELGALYEAFL DGRESPLEAL PVQYLDYSQW QRRWLEGGER QRQLDYWKRQ LGDEHPLLEL
PADRPRPPVQ SHRGELFRFD LEPALAGRVR AWNAAHGLTM FMTATAALAL LLYRYSGQGD
LRIGAPVANR IRPESEGLIG AFLNTQVLRC RLDGRMSVDE LLEQVRDTVI EGQAHQDLPF
DELVEALQPP RSAAHHPLFQ VMCNVQRWEF QQTRQLAGMT VEYLVNDARA TKFDLYLEVT
DLDQRLGCCL TYSSDLFDEP RIVRMAGHWT RLLEAMVELP SRRLAELPML AEAEATRLAE
APGDYPLDAC LHELFETQAA RTPEAPALTC AGRTLSYAEL DVRANRLARV LRERGGGPEV
PVGLALERSA EMVVGILAIL KAGGAYVPLD PEYPLERLRY LIEDSGIALL LGHAVLFEAL
GELPAGVARW CLEDDLAALD GQSGAPLPRL AGPDNLAYLI YTSGSTGQPK GVAVCHGEIA
MHCRAVIERF GMAASDCELH FYSINFDAAT ERLFAPLLCG ARLVLRGQGQ WDAESICQLI
REQSVSILGF TPSYGSQLAQ WLISRDERLP VRLCITGGEA LSGEHLQRIR AAFAPQAFFN
AYGPTETVVM PLACRAPETL EEGAASVPIG SLVGARRGAI LDADLAPLPQ GAAGELYLGG
KGLARGYHRR PALTAERFVP DADGARLYRS GDRVRLRDDG QVEYLGRIDQ QVKVRGFRIE
LGEIEARLRE HPAVTDTAVL ALDTSSGKQL AGYVATATAG LDEAARAALR EALKAHLRAQ
LPDYMVPAHL SLLEKLPLTP NGKLDRRALP APDPEQDRQA YQAPRSELER QLATIWTEAL
NVGRIGLGDN FFELGGDSIL SIQVVSRARQ AGIHFTPRDL FQHQTVQALA TVARSVATVA
SEQGPLSGAT PLTPIQHWFF EQHLAKPQHW NQSLLLEPSG RLDADCLEKA LHYLRNHHDA
LRLAFRRIDG HWRAEYRAVG EGGGELLWRV RPAGPEERRA LFAEAQRSLD LAHGPLLRAV
LAEDASGRQT LLLAIHHLAV DGVSWRVLLE DLQSLYRQFE AGRSPALPAR TSSLRDWATR
LEAYAASESL REELVWWRTH LAGTDGELPC DRAGTDDRYR DARSLSLRLD TTRTRQLLQQ
APAAYRSRVD ELLLTALARV LCRWSGRDAA LVQLEGHGRE ALFDDIDLTR TLGWFTSAYP
VRLQPAGELA DAIKAVKEQL RGVPHKGLGH GVLRHLADAD TRAAMAALPQ ARITFNYLGQ
FDQSFAHDAL FRPLDEPAGP AHAEDAPLPN RLSIDAQVYG GELRLRWTYG AGRYDEGSVR
ALAEELLAEL QRLIEHCLEE ESGGPTPSDF PLAQLTQAQL DALPVPAAAI EDIYPLTPMQ
EGLLLHTLLE PGTGIYYMQD RYRIDSELDL QRFEQAWQAV VARHEALRAS FTWNSGETML
QIVHKPGAAR IDYQDWSALD ADGHEERLQA LHKRERETGF DLLREPPFHL RLIRLGEARY
WFMMSNHHIL IDAWCRGLLM GDFMEIYAAL GEGREPRLPS APRYRDYIAW LQRQDFEAAR
NWWRDNLRGF ERPTAIPGDR PLLREHTGGM QVGDCLTRLE AADGARLREL AQRHQLTVNT
FAQAAWALVL HRYSGERDLT FGVTVAGRPV SMPALQGTVG LFINSIPLRL RLPEAGERRA
VRDWLRELLE RNLELREYEY LSLVDIQECS ELPKGQPLFD SLFVFENAPV DSAVLDRAQG
LKARSESGRT HTNFPLTVVC YPGDDLGLHL SYDRRYFEAA TIERLLGDFK RLLLALADGI
LGDLSDLPLL DHGEREFLTE GCNRSERDYP LEQGYARLFE ARVAAHPERI VARCQDAQWN
YAGLNARANR LGHALRAAGV GVDQPVALLA ERGLDLLGMM IGAFKAGAGY LPLDPGHPAQ
RLTRILELGR VPLLVCSAAC RAQAVELLEA LAGQGRPRLL VWDEVQAGDW PTANPGVYSG
PDSLAYVIYT SGSTGLPKGV MVEQAGMLNN QLSKVPYLGL DEADAIAQTA SQSFDISVWQ
FLAAPLFGGR VEIVPNAIAH DPGALLALAR ERGVTVLESV PSLIQGMLAE ERDGLGALRW
LLPTGEAMPP ELARQWLQRY PQVGLVNAYG PAECSDDVAL FRVDMEATAG SYLPIGSPTD
NNRLYLLDEA LELVPAGATG ELCIAGTGVG RGYVGDPLRT ALAFLPNPFA REPGERLYRS
GDLARRRVDG LLEYVGRIDH QVKIRGFRIE LGEIEAHLHE QAEVREAAVA VQEGPNGKYL
VGYLVPADME LAEVSPATAP LLHGELFERV KQRLRAELPD YMVPAHWLLL EGLPRNTNGK
LDRKALPELE IGQSRGQAYL APRNDLERTL ATIWAELLKV ERVGVHDNFF ELGGHSLLAT
QIASRVQKAL QRNVPLRAMF ECSTVGELAA YIESLEGSAL TEQKASRLDD LMSRLEAL