Gene Nmul_A1829 details

Gene Information

Locus tag: Nmul_A1829
Symbol:
ID: 3784924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2089921
End bp: 2102034
Gene Length: 12114 bp
Protein Length: 4037 aa
Translation table: 11
GC content: 58%
IMG OID: 637811916
Product: amino acid adenylation
Protein accession: YP_412518
Protein GI: 82702952
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
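
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database itself. It assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2089921, 2102034)
print(length)                     # 12114
print(protein_length_aa(length))  # 4037
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed above.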


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATTGA ACAAGCAGGA TATTGCCGAG CGGTTCGCAG CCCTTGCTCC CGAGAAACAG 
AAAGAATTCC TGAATGCCCT GAAAAAACGG GGATTCGATT TTTCTCTTCT TCCCATTGTT
CCGCAAAAAA CGGGAAACCG TAGCGCACTC TCATACGCAC AGGAGCGTCA CTGGTTTTTA
TGGCAATTGG AGCCGTTGAG CACGGCTTAT CATTTGAGCG GGGGATTGCG GCTGACGGGC
AGGGTGGATA TTGAAGCGCT GCGTTGGAGC TTTGCGGCGC TGGGCAGGCG GCATGAGTCG
TTGCGTACGA TATTCAGGGT CAATTCGGAA GGGTTGCCGG AGCAGATCAT CGAAGACGAG
CCGCGGCTTG AAATTCCGCT GACCGACTTT TCCGGACTGC CGCTGGAACA AGCCAGAGCG
CAAGCCGGTG AAGAAGCGGG CCGGATAGCC GGCACGCCCT TTGATCTGAC GCAAGGCCCG
CTGCTTCGGG TTGCCCTCAT CCGCATTGCA GCGGAAGAAC ATCTTCTCGT GGTGGTGATG
CACCACATCA TCTCGGACGC CTGGTCCAAC CGCATTGTCA TTGACGAATT TGCCGCCCAC
TATCGGGCAC GGGTGCAGCA GGAGCAGGAG GGGGAGAAAC AGGGGCAGGA ACCCTCCCTG
CCGGCCCTGC CGATCCAGTA TGCCGATTAC GCGATATGGC AGCGCAACTG GCTGGAAGCG
GGAGAAAAAG AGCGCCAGCT GGCCTACTGG CGCAGCCAGT TGGGGGAAGA GCACCCGGTA
TTGCAATTGC CCACCGATCA CCCCCGATCT TCCAGGGCCA GTTACCGTGC GGCGCGCCAC
ACCTTCACAT TACCTGCGGG TCTGGTTACA CGCTTGCAGC GTCAGGCGCA AAGCCAGGGA
GCGACCCTGT TCATGGCGCT GCTCTCGGGC TTTCAAGGCC TGCTCTATCG CTATACCGGC
CAGCGGGATA TCCGCGTGGG CGTGCCGATT GCCAACCGGC ATCGGGCTGA AATAGAAAAC
ATCGTCGGCT TCTTCGTCAA TACCCAGGTA TTGCGCACCC TCATGGATGG GCGCATGTCC
CTGCATACGT TGCTCGATCA GACGCGGGAA GCAGCGCTGG GTGCCCAGAC CCACCAGGAT
TTGCCGTTCG AGCGACTGGT TGAAGCCCTG CAACCCGAAC GCAACCTGAA TCAGAATCCT
CTGTTTCAGG TCATGTACAA CCACCTGCGC GAAGACTACC GGGCACTCGA GCAATTGCCC
GGGCTCAAGG TGGAAAATCA CGAGCTGAGC GAGCAGGCGG CGCAGTTCGA ACTGACCCTG
GATACGGTCG AGCAGCCCGA TGGCAGGCTG GAAGCCACCT TCACCTATGC CGCCGAGCTG
TTTGAACCTG CCACCATTGG GCGGCTTGGC AACCATTATC TGCTTCTTCT GGAGCAACTG
GCCGAGCATC CGCAGCAGAA CCTTGGCGAC ATCGACATCC TCAGTGAAGC CGAGCGGGCG
CAGCTCAAGG CCTGGGGGAT CAACGAGCAG CGCTACGCCA ATACCGAGCC CGTGCACAGG
CTGATCGAGC GGCAGGTTGA AGTCCAGCCG GAAGCGATTG CCCTGATCTT TGGCGATGTC
GAATTGAGCT ACGGCGAGCT GAACCGAAGG GCGAACCGCC TGGCGCACCG TTTGATCAGG
CTTGGGGTTG GGCCGGAGGT CAAGGTGGGC ATTGCGGTGG AGCGCTCGAT CGACATGGTG
GTGGGGTTGC TTGCCACCCT GAAGGCGGGC GGAGCATATG TGCCGCTTGA TCCGGAATAT
CCGCAGGAGC GGCTGGCCTA CATGGTGGCA GACAGTGGCA TCGGGCTGTT GCTGACGCAA
AGCCGGGTTC GATCGCGCAT TCCCCACTCC GGCCAATATC CGGTGCTGGA GCTGGACAGG
CTCGACCTCG AGGATGAATC CGACAGCAAT CCGCAAGCCG TCCTGCATGG ACACAACCTT
GCCTATGTCA TCTATACCTC AGGCTCCACA GGTAAACCAA AGGGCGTGAG CGTCACCCAC
GAGCCTCTGT CGATGCATGT TCAGTCCATA GGCAAGGCTT ATGGCATGAC GACCATGGAT
AGGGAACTTC AGTTCGCCTC GATCAACTTC GACGGCGCGC ATGAACGCTG GCTTGTGCCC
CTTGCCTTCG GATCGGCGCT GATGCCGCGT GATAATGATT TCTGGTCCGT CGAACGAACT
GTGGCGGAGA TTGTAAAGCA TCGGATAACC ATAGCCTGTT TTACCCCGAA CTATCTGCAC
CAGATGGCAG AGCTGCTGGG TACTGCGGGA CGCGCTTTGC CGATCCGTTC GTATACGGTT
GGTGGCGAAG CCATGAGTCG CGCCAGCTTT GATTTTGTAC AGACAACGCT TCAGCCGCCT
CGCATCATCA ACGGCTATGG CCCGACCGAA ACAGTCATTA CACCCCTCAT TTCGAAAGCC
TATCCCGGAA CAGGATTCGA GTCCGCCTAC ATGCCCATCG GTTGTCCTGT CGGTGACCGG
ATCGCCTATA TTCTCGACTC CGATCTGAAC CCGGTTCCGG CGGGCGTAGC GGGAGAGCTG
TATCTGGGTG GCATCGGGTT GGCCCGCGGC TATCTCAACC GTGGAGGATT GACGGCGGAC
CGTTTCATAG CCGATCCCTT CGATGAGACA GGGGGGCGGC TCTACCGCAC GGGGGATCTG
GCAAGATGGC GCTCGGACGG GCAGATCGAA TACCTGGGGC GGCTGGATCA TCAGGTCAAG
ATACGGGGAT TCCGCATTGA ACTGGGTGAA ATCGAAACGC AATTGCTGGC GCAGCCGGAA
GTCAGGGAAG CGGTGGTGGT TGCGAGGGAA GGATCCAGTG CATCCAATCC AACAGGGGGA
GCGCGGCTGA TAGCCTATGT TTCTGCCCAT GCAACGACCG ATTTGGACGC AGCACGACTG
CGCGAAGCGC TTGCCAGAAC CTTACCCGAC TACATGCTGC CCTCGATGAT TGTGGTGCTG
GAGAGTCTGC CGCTCAATCC GAGCGGCAAG GTAGACCGCA AGGCCTTGCC TGAGCCGGAG
TTTACGCATA CGGAGCATTA TGAGGCGCCG CAGGGAGAGG CGGAAGAGGT GCTGGCGGGC
ATCTGGGGGC AGGTGCTGGG TGTGGCAGAG GTGGGACGGC ATGACAACTT CTTTGAGCTA
GGGGGTGATT CCATACTGAG TCTCCAGATC GTCACCCGGG CGCGCCGCGC CGGCTGGAAG
ATCACGCCAC GCCAATTGTT CGAGCGGCAG ACCATCGCTT CGTTGGCGGC GGTGGCAATG
GCACTCGAGG TTCAGGATGC AACCATCGCT ACCCTAGCCC CAATCGCAAG CGCGCAATCG
AAAGAAGGAT TGAGCGGCGG AGGCTCGGGA AGACTTCTAT CGCTGCTGCC TATTCAGGCT
GAATTTTTCG AACGGGAAAT TCCCGCGCGA CATCACTGGA ACCAGGCAGC GCTCCTGAGA
AGCCGCGAGT ATTTGAATCC TGCACATCTT GGAGAGGCAC TGGAAGCCGT GGTGCGCCAC
CACGATGCGC TTCGTTTCCG CTTCATCAGA AATGAAAGGA GCCAGTGGCA GCAGTCTTGT
GGGGAATCGA TCGCTGCTGA TTTATTATGG GTGCGCAAAG CTTCTGCCGA CGCCGATCTG
GAAGCACTGT GCAACGAGGC GCAGCGCAGC CTGAATCTCG TGGAAGGCCC GGTGCTGCGT
GCCGTGCTTT TCGATATGGA CGATCGCAGC CAGCGGTTGC TGCTGGTCAT TCATCATCTT
GTGATCGACG GTGTTTCCTG GCGTATCCTG CTCGAGGATT TGCAGAGCGC CTATAGTCAG
GCCCGGAATG CCCAAATCAT CGCATTACCC GAAAAATCGG GGAGCTATGA GGTATGGAGC
GCGCGGCTGC AGCGTTATGT ACATGAAAAC AGGGAAGAGC TGATTTACTG GCGAAGCTTG
AAGGGAGTGC CGGTCATACT GCCCTGCGAT AACCCCGAGG GCGCGAGCTT CGTCTGTCAC
CAGCGCGATG CTGTGCTCAA GCTGGGCAAG GCCCAGACCC GCGCCTTGCT CAAGGAGGCG
CCCGCAGCCT ATCGTACTCA CGTAAACGAT CTGTTGCTTA CCGCCCTGGG AAGCGCCCTC
TGCCGGTGGA GCGGCCATAA GAAAATCCTG ATCGATCTTG AAGGCCATGG CCGGGAAGAT
TTATACGATA TCGACCTTTC ACAAACGGTC GGCTGGTTTA CCACCCTCTA TCCCGTGTTG
CTCGACCCAG CCGGAGACCT GGCTCAGAGA ATCAAGCGCA TCAAGGAAGA TCTGCGGAGG
GTACCGAACA AGGGAGTGGG TTACGGCCTG TTCAAGTATC ACGGCACGTC CGAACAGCGT
GAAGTCCTGG CATCCCTGCC CAGGCCTGAA GTCGTATTCA ATTACCTGGG GCAACTGGAT
GCGAGTTTCG ACGAAAGCGC GCTCTGGACG TTGGCGACCG AATCGACAGG GGATCTGATC
GATGAAAATG CGCCTCTCAG TCATGATATT TCCATAAATG GTCATGTTTA CGAGGGCGAA
CTGTGTCTTA CGGTGAGCTA TAGCGACGCG CGTTACCACA GGACGACAAT TGAGGCATTC
ATGAACGTTT ACCAGGCTGA GCTTGAAACG CTGATTGCCC ATTGCACCCG TGGCATACAA
GGCCTTACCC CTTCTGATTT CCCGTTGGTC GAAATCACCC AGCGCGAGCT GGACAGTCTG
CCCGTAGCCG CCGCACAGCT TGAAGACCTT TATCCGCTTT CTCCATTGCA GGAGGGCATA
CTTTTCCACA GCGTATTCGA CAGGGACGAC CACAGTGTTT ACCTCAATCA GTTGAGGGCC
GATATCGAGG GACTGGAGGC AACTCGTTTC AAGGCTGCGT GGCAGGATGC GATGGCTTGC
CACCCAGTGT TACGGACAGG TTTCGTGACT CAGGACAGCA AACCCCTGCA ATGGGTAGCC
AAGTCTGTCG AGCTGCCATT TGTGGAACAC GACTGGCGGG CCAGGGAGGA TAAGGAGCAT
AGAGAGGATA GGGAGCGCGA TCTTGAGGCA TTGGCCCAGG CGGAATATGC CTCGGGTTTT
GATCTCGCCA AACCTCCACT CATGCGCTTT GCACTGGTGC GGCTGGCGGA CGACCGGTAT
CACTTCATCT GGACGATCCA TCACTTGCTG CTTGACGGAT GGAGCACTTC CCAACTGGCC
GGGGAAGTAT TGCGGCGATA TGCCGGAAGA TTTTCCCCGG CTCAGGAAGC GTTACGGGGC
AGAGGCGAAT ACCGCAGGTT TATCGAATGG CTGCAGCGCC GTGATGTGGA AGTTTCCGAA
GCATACTGGA GGGAGCGACT GAAGGGCATC CGGGAGCCAA CGCGGCTGGC GGTGACATTG
CCTGCACACC GGGGAAATTC CGATCATGAA GAGCATTTCG GGGAGCATAT CACTGAACTG
CCGCTCTCTC TTTCCGAGGG GTTGATTCAA TTCGCCCGGC GCGAACGTGT CACCCTCAAC
ACGCTGATAC AGGCCGCTTG GGCGATCTTG CTGAGCCGGT ACACCGGGAA GCAAACTGTT
CTTTTTGGCG CCACGGTGGC GGGAAGGCCG GCCGACCTGC CAGGAGCGGA GCACTTGCTG
GGATTGTTCA TCAATACGCT TCCCGTGTCC GTAACGCTCC AGCCCGAGCA CCAGGTCGGC
GCATGGCTGC GCGACCTGCA AGCGCAGAAC CTTGCCTCGC GCGAACACGA GCAAACACCT
TTGTATGAAA TCCAGCGCTG GCTGGGACAA AGCGGGCAGG GATTATTCGA CAGCATTCTC
GTGTTCGAGA ACTATCCCAT GGATGAGGCG TTGCGGGAAT CCACTCCGGG CGGCCCTGCC
TTTTCCAATA TCCGTAATCG CGAGAGCAGC AATTATCCGA TGATGGTATC GGTTATGCAG
AATACCATGC TGTCGCTGGG CTACAGTTAC GATTGCAGAT ATTTTTCGCG AACGACGGTG
GAATCCATTG CAGCGCAGCT CCACAGACTC CTCGACAGGA TTGCCGCTAC TCCCACCAAC
TCGCCCCAGA GCCTTGGCGA CATCGACATC CTTGGTGCAG CCGAGCGGGC GCAGCTCAAG
GCCTGGGGGA TCAACGAGCA GCGCTACGCC AATACCGAGC CCGTGCACAG GCTGATCGAA
CGGCAGGTTG AAGTCCAGCC GGGGGCGGTT GCCCTGATCT TTGGCGATGC CGAATTGAGC
TATGGCGAGC TGAACCGAAG GGCGAACCTT CTGGCGCACC GCTTGATCAG GCTCGGGGTT
GGGCCGGAGG TCAAGGTAGG CATTGCGGTG GAGCGCTCGA TCGACATGGT GGTGGGGTTG
CTTGCCACCC TGAAGGCGGG CGGGGCGTAT GTGCCGCTCG ACCCGGAATA TCCGCAGGAG
CGGCTGGCCT ACATGGTGGC AGACAGTGGC ATCGGGCTGC TGCTGACGCA AAGCGGGGTT
CGATCGCGCA TTCCCCACTC CGGCCAATAT CCGGTGCTGG AGCTGGACAG GCTCGACCTC
GAGGATGAAT CCGACAGCAA TCCGCAAGCC GTCCTGCATG GACACAACCT TGCCTATGTC
ATCTATACCT CAGGCTCCAC AGGTAAACCA AAGGGAGCTG GCAACCGTCA CCTTGCTCTG
TATAACCGCC TGGCCTGGAT GCAGGAAGCA TACGAACTGG GTAACGACGA TACCGTTCTT
CAAAAGACGC CTTTCAGCTT TGACGTTTCG GTCTGGGAAT TTTTCTGGCC ATTGATGTAC
GGAGCACGTC TTGCCATCGC AGCGCCTGGT GATCACCGTG ATCCAGCCCG TCTGCTCTCT
CTAATCCTGC GCCAGAACGT GACTACCCTG CATTTTGTTC CATCCATGCT ACAGGCATTT
CTTGCGCATG AGGGAATAGA GGCATGCGTT GCTACTCTAC GACGCATCAT TTGCAGTGGG
GAAGCACTGC AGGCGGAAGT GCAGAAACAG GTTTTCAGAA AACTCCCTGG CGTGGGGCTC
TTCAACCTTT ACGGGCCGAC CGAAGCGGCA ATCGATGTCA CGCAATGGGA ATGTGTTGAC
GATAGGGACA ATAGCGTGCC TATCGGAAAA CCGATCTCTG GTCTCCAAGC ATACATTCTC
GATGTTCATC TTAACCAAGT ACCGCAAGGA GTGGCAGGAG AGCTGTATCT GGGTGGCATT
GGCTTGGCCC GCGGCTATCT CAACCGTGGA GGATTGACGG CGGACCGTTT CATAGCCGAT
CCCTTCGATG AGACAGGGGG GCGGCTCTAC CGCACGGGGG ATCTGGCAAG ATGGCGCTCG
GACGGGCAGA TCGAATACCT GGGGCGGCTG GATCATCAGG TCAAGATACG GGGATTCCGC
ATTGAACTGG GTGAAATCGA AACGCAATTG CTGGCGCAGC CGGAAGTCAG GGAAGCGGTG
GTGGTTGCGA GGGAAGGATC CAATGCATCC AATCCAACAG GGGGAGCGCG GCTGATAGCC
TATGTTTCTG CCCATGCAAC GACCGATTTG GACGCAGCAC GACTGCGCGA AGCGCTTGCC
AGAACCTTAC CCGACTACAT GCTGCCCTCA ATGATTGTGG TGTTGGAGAG TCTGCCGCTC
AATCCGAGCG GCAAGGTAGA CCGCAAGGCC TTGCCTGAGC CGGAGTTTAC GCATACGGAG
CATTATGAGG CGCCGCAGGG AGAGGCGGAA GAGGTGCTGG CGGGTATCTG GGGGCAGGTG
CTGGGTGTGG CGCAGGTGGG GCGGCATGAC AACTTCTTCG AGCTGGGAGG CGATTCCATA
CTCAGTCTCC AGATTGTCAC CAGGGCACGC CGTGTCGGCT GGAAGATCAC GCCACGCCAA
TTGTTCGAGC GCCAGACCAT CGCGTCACTG GCAGAAGTGG CTGAGACGAT ACAGGAAACG
GTGGCAGCTC TGCTCAAGCC GCAGCGAGGT CATCTGCATG ACTATTTGAA TGCCGGCACG
ATTGCTGGTC TTGCGTTGGA TGAGGATGAA ATTGAAGACG TTTATCCTTT GTCACCCACC
CAGGAAGGGA TGCTGTTCCA TACCCTGGAA ACGGCAGGGG ACGGAATGGG GCTTTATGTA
AACCAGCTCA GCGTTGAAGT GCAAGGTCTG GACCCGGAGC GCTTCACGCG GGCTTGGCGG
GAAATGGTGG CTCGTCATCC GATATTGCGC ACCGGTTTCC TGTGGCAAGC GGGTCTCGGG
CGCCCGCTGC AAATTGTATT CAGAAAGGTC GAGGTACCGG TGATTCATCT CGACTGGCGT
GGTCTGGATA AGACGGCTTC CCGTGTTACC CCTTATGCGG AAGAAGAGTT GAAACGGGAA
TTCGATTTTC TTGCGCCACC CTTGGCGCGA TTTGCTCTCA TACGCCTGGC TGAAAACCGT
TATCAGCTGA TATGGACCCG CCATCACATA CTGCTCGATG GCTGGGCTGA TTCCCTGCTT
ATCAGCGAAT GGCTGCGCTG CTACGACGGA AAGGTGCTTC GCGACGTAGG GCCGGACTAC
GGTCATTACA TACGGTGGCT CGCGCAACAG GACGCCGAAG CGACTCGACA CTTCTGGCAG
GGTGAGCTTC AGGCGGTCGA TGGCCCCACT TTGCTGCGGA AAACAGTTGG CAAGACAGAG
GGACCGGAGG GGCAGGAAAG CTGTGCAGGA TTTGCCCAGA TTTATACCCT CCTGGGTCAG
AACGAAACGC GTCGATTGAA GGCTTTCGCG CAACGGCAGC AGATAACGCT CAACACCTTC
GTGCAGGCCG CGTGGGCCTT GCTTTTGCAG CGCCACACCG GCAAGGACAC CGTTGTGTTC
GGTGCGACGG TAGCAGGACG TCCCCATAGT CTCCTCAAAT CGGACGAAAT CATGGGAATG
TTCATCAATA CCATTCCGGT TCCGGTCGAG CACCGAAGTG AGCTGACAGT CGCAGAATAT
CTCGACCTGC TGCAGAAAAC GAATGCCAGG CTGCGTGAGC ATGAGCACGC ATCGCTCGCC
GAAATTCAGC GGTGGGCCGG TTCACCCGGA CAGCCTTTAT TCGACAGCAT TGTCGTATTC
GAGAATTACC CGATAGACGA GGCGCTGCGC GGCAACGAAC TTTATGGTCT TCGTTTCGGC
GAAATAGAGG GCAAAGGGTT GACCGGATAC GCGATGGACT TACAGGTCGT GGTGGGCGAC
AGGTTGGAAA TCGAGTACTG CTACGGGTGT GGAGACTTTA CGGAAGCATT CGTGCTCGGT
CTTCGCAGCC AAATGGAATT CCTGATGAGG GAAATGATGG CTCATCCGGA GTGGCGTGTG
GGAGAGCTGG GGTGGATGGA AAAAAGAGAG ATTGGGCACC TCCTTTCTCT CGGATCCAAT
GCGCATTTGG AAACCTTGCC CTCGCGGCTT TCGCGTCAGT TTGTGCACAA CCTGATAGAG
CAGAACGCGG AACACCACCC TGAAGCGATT GCGCTGCTCA TGGGTGAGCA GGAATTATCC
TACGCGGAGT TGAATGAGCG GGCAAACCGG CTGGCGCACC ATCTCGCCCG TATGGGTGTG
GGACCGGAAG TGAGAGTAGG AGTGGCGATG GAACGCTCGC TCGAGGTCAT TGTCACGCTG
CTGGCCGTGC TCAAGGCTGG CGGGGCCTAT GTACCACTCG ATCCCGAATA TCCGGTAGAG
CGCCTGTCCT TCATGGTGAA CGACAGTGGC ATGTCACTCC TTTTGACGGA AGAGAAGTTG
CTCGCCAAGC TCGGCAGCGG CTTTGGAGTG CAGGTGTGGT TGCTGGATTC GCTGGATCTG
ACGGCGGAAT CAGGCTCTAA CCCCGACATT CCCCTCCATG AGCATAATCT GGCTTACATT
ATCTACACAT CGGGCTCGAC CGGGCTTCCC AAGGGAGTGG CAGTCGCGCA CGGTCCCCTG
AGCATGCACT GCCAAGCGAC GGCGGGAATT TACGGTATGA CGCCGCACTC ATGCGAGCTG
CTCTTCATGT CATTCTCATT CGATGGCGCA CATGAGCGCT GGCTGACCGC ACTGACAGTC
GGTGCCGGCC TGGCAGTGCG CGACCAGGAA CTGTGGACCG CGGAGCAGAC ATACGACGCT
TTGCACAGCT ATGGCATTAC CAATGCAGCA TTTCCACCCG CTTACCTAGG CCAGGTTGCC
GAATGGGCTG CGCCCCGCAG CGATCCGCCC CCGGTAGAGC TTTATGTCTT CGGAGGCGAA
GCCATGCCCA AGGCTTCCTA CGACCTGGTG CGAAAGACTT TGCGGCCTCG CATTCTGATC
AATGGTTACG GACCCACCGA GACGGTAGTT ACCCCATTGA TCTGGAAGAC GGAGGCAAGC
AACAGTTTCG ATTGCGCCTA CGCACCCATC GGCAGACCCG TGGGAGAACG GACGGCGTAT
GTTCTCGACC TCGATATGCA GCCGGTACCC ATAGGCAGGG TTGGAGAATT GTACATCGGC
GGCTATGGCC TTGCTCGCGG GTACCTGGGA CGAGCGGGAC TGACAGCGGA GCGCTTTGTA
GCCGACCCAT TCGACGGGAA TGGCGGGCGG CTCTACCGCA CGGGTGATCT GGTGCGTTGG
CTGGATGATG GCAATATCGA ATATATCGGC CGTGCTGACC ATCAAGTGAA GATTCGGGGG
TTCCGGATCG AGCTGGGCGA AATCGAGGCA TGCATACGTG AGCTTACCGG ACTGACCGAT
GTTGCAGTCG TTGTTCGGGA AGGGGCGGGA GGACCGCAAC TCGCCGCTTA TGTCGCGCCG
AAGGAGACGA CGGGGATGGT TGCACCAGTC CGGAAGGGGA GTGCGGGGCT GGGAAGCACG
CTGAAGCAGC AACTTGTCAG GCGGCTGCCC GAATACATGG TGCCCGCGCA TATCGTCATC
CTGGACAAGC TGCCCCGACT GCCCAGTGGC AAGCTTGACC GCAGTGCATT ACCGGAGCCG
GATGCGGTTG CTGCCGACAC TTATCGAGCA CCCTCTACTT CGGAAGCGAG GCTTCTGGCG
CAGATCTGGC AGGAGGTGCT GGGCGTGGAA CGGGTAGGCG AGACGGACAA CTTCTTCGCC
CTCGGCGGAG ATTCGCTCTC CAGCCTGAAA GTCATGGCCC GAATGCGCAA CCTGCCCGGC
TTGAAATTCG ATTTCAAGTT GCGGGATCTG ATGCAGCGCC CGACCATTGC CGGTTTGCTC
GGCCTGGACG TTGAAGCCCC CGGCAAAACA CAGCCACTGC TGCTGTTAAA CCAGCGCGAC
GAAAAAGCGA AAGTCGAACC GTTATTCTGC ATCCATGCCG GTTTGGGCAC TGTGTTCGAC
TACCAGCCGC TTGCACGGCA TCTACAGGGT ACGCGCACGG TCTATGGAAT ACCTTGCCGC
ATGCTCTCTG ATCCTGGACA CCGCGATACT TCTCTCGCCC AGATGGCGGC AGATTATGTC
CAGATCATTC GCCGGGCGCA GCCGGAGGGG CCCTACCATC TGTCAGGGTG GTCCCTCGGC
GGCACTCTTG CAGCCATGAT GGCTGTGTTA CTGGAGGCGG AGGGGCAGGA GGTGGCGTTT
CTGGGTCTCA TCGATTCCTT CATTCCCGCC ATAGACGAAC CCGAGCCCGA TGACTGGCGA
CAGGATTTTT CCGATTTTGT TTCCGTGGTG CTGCCCGGAG CGAAAATCGA TGGCGCGGAT
GGAGTTTTTC CTGATGGAAA TCAAAAGAGT TGTCTGCAAT CCTTGAAGCA GCCATCGGAA
GAAACTTTGG CCGGTTTACT GGATGAATTG ATTTCCGCCA TGCAGGCTTC GGAAGATGCT
GTTTCGCCGG AAGGTGCACC TTCGTCTACG AAAAAGAAGC GTGGCGGGTA TGCGGATCTG
GGAGCAAGCG AACTGGCTCG TATCTTTGGC ATCGCGCGGC ACTTGAAAGC GCTTGCTGCC
CAGGCTTCGG AATTAGGCTG CTTGAACATC CAACCCACCT GCTGGTGGAT TGCGACCCGC
CCCTTGTCCG ATCGGCTGGC TCTTTCAAGG CAAACAGGCC AGCCTGAATT GTCCGAAAAC
GAAATCGATA CCGATCACTT TTCGATCATA CGGGCAGAAG CGTTGTTTAT CGGGATGGAA
AGTACGCTTC GATTTGAGAG TGCCAGGACA GCAATTCCCT CGAAGGTTCT ATAA
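
The gene sequence in this record is printed in blocks of ten bases, six blocks per line. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is shown below. The helper names are hypothetical, and the only input used is the first line of the gene sequence, so the printed GC value applies to that line alone and not to the 58% reported for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that separate the 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGAATTGA ACAAGCAGGA TATTGCCGAG CGGTTCGCAG CCCTTGCTCC CGAGAAACAG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Passing the full gene sequence text to `join_blocks` in the same way would give a string whose length can be compared with the Gene Length field.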
 
Protein sequence
MELNKQDIAE RFAALAPEKQ KEFLNALKKR GFDFSLLPIV PQKTGNRSAL SYAQERHWFL 
WQLEPLSTAY HLSGGLRLTG RVDIEALRWS FAALGRRHES LRTIFRVNSE GLPEQIIEDE
PRLEIPLTDF SGLPLEQARA QAGEEAGRIA GTPFDLTQGP LLRVALIRIA AEEHLLVVVM
HHIISDAWSN RIVIDEFAAH YRARVQQEQE GEKQGQEPSL PALPIQYADY AIWQRNWLEA
GEKERQLAYW RSQLGEEHPV LQLPTDHPRS SRASYRAARH TFTLPAGLVT RLQRQAQSQG
ATLFMALLSG FQGLLYRYTG QRDIRVGVPI ANRHRAEIEN IVGFFVNTQV LRTLMDGRMS
LHTLLDQTRE AALGAQTHQD LPFERLVEAL QPERNLNQNP LFQVMYNHLR EDYRALEQLP
GLKVENHELS EQAAQFELTL DTVEQPDGRL EATFTYAAEL FEPATIGRLG NHYLLLLEQL
AEHPQQNLGD IDILSEAERA QLKAWGINEQ RYANTEPVHR LIERQVEVQP EAIALIFGDV
ELSYGELNRR ANRLAHRLIR LGVGPEVKVG IAVERSIDMV VGLLATLKAG GAYVPLDPEY
PQERLAYMVA DSGIGLLLTQ SRVRSRIPHS GQYPVLELDR LDLEDESDSN PQAVLHGHNL
AYVIYTSGST GKPKGVSVTH EPLSMHVQSI GKAYGMTTMD RELQFASINF DGAHERWLVP
LAFGSALMPR DNDFWSVERT VAEIVKHRIT IACFTPNYLH QMAELLGTAG RALPIRSYTV
GGEAMSRASF DFVQTTLQPP RIINGYGPTE TVITPLISKA YPGTGFESAY MPIGCPVGDR
IAYILDSDLN PVPAGVAGEL YLGGIGLARG YLNRGGLTAD RFIADPFDET GGRLYRTGDL
ARWRSDGQIE YLGRLDHQVK IRGFRIELGE IETQLLAQPE VREAVVVARE GSSASNPTGG
ARLIAYVSAH ATTDLDAARL REALARTLPD YMLPSMIVVL ESLPLNPSGK VDRKALPEPE
FTHTEHYEAP QGEAEEVLAG IWGQVLGVAE VGRHDNFFEL GGDSILSLQI VTRARRAGWK
ITPRQLFERQ TIASLAAVAM ALEVQDATIA TLAPIASAQS KEGLSGGGSG RLLSLLPIQA
EFFEREIPAR HHWNQAALLR SREYLNPAHL GEALEAVVRH HDALRFRFIR NERSQWQQSC
GESIAADLLW VRKASADADL EALCNEAQRS LNLVEGPVLR AVLFDMDDRS QRLLLVIHHL
VIDGVSWRIL LEDLQSAYSQ ARNAQIIALP EKSGSYEVWS ARLQRYVHEN REELIYWRSL
KGVPVILPCD NPEGASFVCH QRDAVLKLGK AQTRALLKEA PAAYRTHVND LLLTALGSAL
CRWSGHKKIL IDLEGHGRED LYDIDLSQTV GWFTTLYPVL LDPAGDLAQR IKRIKEDLRR
VPNKGVGYGL FKYHGTSEQR EVLASLPRPE VVFNYLGQLD ASFDESALWT LATESTGDLI
DENAPLSHDI SINGHVYEGE LCLTVSYSDA RYHRTTIEAF MNVYQAELET LIAHCTRGIQ
GLTPSDFPLV EITQRELDSL PVAAAQLEDL YPLSPLQEGI LFHSVFDRDD HSVYLNQLRA
DIEGLEATRF KAAWQDAMAC HPVLRTGFVT QDSKPLQWVA KSVELPFVEH DWRAREDKEH
REDRERDLEA LAQAEYASGF DLAKPPLMRF ALVRLADDRY HFIWTIHHLL LDGWSTSQLA
GEVLRRYAGR FSPAQEALRG RGEYRRFIEW LQRRDVEVSE AYWRERLKGI REPTRLAVTL
PAHRGNSDHE EHFGEHITEL PLSLSEGLIQ FARRERVTLN TLIQAAWAIL LSRYTGKQTV
LFGATVAGRP ADLPGAEHLL GLFINTLPVS VTLQPEHQVG AWLRDLQAQN LASREHEQTP
LYEIQRWLGQ SGQGLFDSIL VFENYPMDEA LRESTPGGPA FSNIRNRESS NYPMMVSVMQ
NTMLSLGYSY DCRYFSRTTV ESIAAQLHRL LDRIAATPTN SPQSLGDIDI LGAAERAQLK
AWGINEQRYA NTEPVHRLIE RQVEVQPGAV ALIFGDAELS YGELNRRANL LAHRLIRLGV
GPEVKVGIAV ERSIDMVVGL LATLKAGGAY VPLDPEYPQE RLAYMVADSG IGLLLTQSGV
RSRIPHSGQY PVLELDRLDL EDESDSNPQA VLHGHNLAYV IYTSGSTGKP KGAGNRHLAL
YNRLAWMQEA YELGNDDTVL QKTPFSFDVS VWEFFWPLMY GARLAIAAPG DHRDPARLLS
LILRQNVTTL HFVPSMLQAF LAHEGIEACV ATLRRIICSG EALQAEVQKQ VFRKLPGVGL
FNLYGPTEAA IDVTQWECVD DRDNSVPIGK PISGLQAYIL DVHLNQVPQG VAGELYLGGI
GLARGYLNRG GLTADRFIAD PFDETGGRLY RTGDLARWRS DGQIEYLGRL DHQVKIRGFR
IELGEIETQL LAQPEVREAV VVAREGSNAS NPTGGARLIA YVSAHATTDL DAARLREALA
RTLPDYMLPS MIVVLESLPL NPSGKVDRKA LPEPEFTHTE HYEAPQGEAE EVLAGIWGQV
LGVAQVGRHD NFFELGGDSI LSLQIVTRAR RVGWKITPRQ LFERQTIASL AEVAETIQET
VAALLKPQRG HLHDYLNAGT IAGLALDEDE IEDVYPLSPT QEGMLFHTLE TAGDGMGLYV
NQLSVEVQGL DPERFTRAWR EMVARHPILR TGFLWQAGLG RPLQIVFRKV EVPVIHLDWR
GLDKTASRVT PYAEEELKRE FDFLAPPLAR FALIRLAENR YQLIWTRHHI LLDGWADSLL
ISEWLRCYDG KVLRDVGPDY GHYIRWLAQQ DAEATRHFWQ GELQAVDGPT LLRKTVGKTE
GPEGQESCAG FAQIYTLLGQ NETRRLKAFA QRQQITLNTF VQAAWALLLQ RHTGKDTVVF
GATVAGRPHS LLKSDEIMGM FINTIPVPVE HRSELTVAEY LDLLQKTNAR LREHEHASLA
EIQRWAGSPG QPLFDSIVVF ENYPIDEALR GNELYGLRFG EIEGKGLTGY AMDLQVVVGD
RLEIEYCYGC GDFTEAFVLG LRSQMEFLMR EMMAHPEWRV GELGWMEKRE IGHLLSLGSN
AHLETLPSRL SRQFVHNLIE QNAEHHPEAI ALLMGEQELS YAELNERANR LAHHLARMGV
GPEVRVGVAM ERSLEVIVTL LAVLKAGGAY VPLDPEYPVE RLSFMVNDSG MSLLLTEEKL
LAKLGSGFGV QVWLLDSLDL TAESGSNPDI PLHEHNLAYI IYTSGSTGLP KGVAVAHGPL
SMHCQATAGI YGMTPHSCEL LFMSFSFDGA HERWLTALTV GAGLAVRDQE LWTAEQTYDA
LHSYGITNAA FPPAYLGQVA EWAAPRSDPP PVELYVFGGE AMPKASYDLV RKTLRPRILI
NGYGPTETVV TPLIWKTEAS NSFDCAYAPI GRPVGERTAY VLDLDMQPVP IGRVGELYIG
GYGLARGYLG RAGLTAERFV ADPFDGNGGR LYRTGDLVRW LDDGNIEYIG RADHQVKIRG
FRIELGEIEA CIRELTGLTD VAVVVREGAG GPQLAAYVAP KETTGMVAPV RKGSAGLGST
LKQQLVRRLP EYMVPAHIVI LDKLPRLPSG KLDRSALPEP DAVAADTYRA PSTSEARLLA
QIWQEVLGVE RVGETDNFFA LGGDSLSSLK VMARMRNLPG LKFDFKLRDL MQRPTIAGLL
GLDVEAPGKT QPLLLLNQRD EKAKVEPLFC IHAGLGTVFD YQPLARHLQG TRTVYGIPCR
MLSDPGHRDT SLAQMAADYV QIIRRAQPEG PYHLSGWSLG GTLAAMMAVL LEAEGQEVAF
LGLIDSFIPA IDEPEPDDWR QDFSDFVSVV LPGAKIDGAD GVFPDGNQKS CLQSLKQPSE
ETLAGLLDEL ISAMQASEDA VSPEGAPSST KKKRGGYADL GASELARIFG IARHLKALAA
QASELGCLNI QPTCWWIATR PLSDRLALSR QTGQPELSEN EIDTDHFSII RAEALFIGME
STLRFESART AIPSKVL