Gene ANIA_02621 details


Gene Information

Locus tag: ANIA_02621
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Aspergillus nidulans FGSC A4
Kingdom: Eukaryota
Replicon accession: BN001306
Strand:
Start bp: 3309871
End bp: 3321183
Gene Length: 11313 bp
Protein Length: 3770 aa
Translation table:
GC content: 50%
IMG OID:
Product: N-(5-amino-5-carboxypentanoyl)-L-cysteinyl-D-valine synthase (EC 6.3.2.26) (Delta-(L-alpha-aminoadipyl)-L-cysteinyl-D-valine synthetase) (ACV synthetase) (ACVS) [Source:UniProtKB/Swiss-Prot;Acc:P27742]
Protein accession: CBF84349
Protein GI: 259486479
COG category:
COG ID:
TIGRFAM ID:
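
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3309871
end_bp = 3321183
reported_gene_length = 11313     # bp
reported_protein_length = 3770   # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# The gene is listed as unspliced, so the whole span is read as codons.
# Assumption: the final codon is a stop codon and is excluded from the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

If either comparison printed False, the record's coordinates and lengths would disagree under these assumptions.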


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.830211
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCCTC CCGGGCTATT GAGCGAAGAC GGCCCTGGCT ACAGTGGCGG CTATGCAGAC 
CCTACGGTGC CAAAGGTTAA TTGGAAGCAG TCCAATGGGA AAAGCGCCGG GGGCAATGGC
GACGTTGATG CAGGCAATGG CAACATTGAC CCTAGCAAAT CGGGTGTTGG TGTCCAAGTG
TGTTTTGCAG GAGGGCTTGA AGGTTGGAAA GCCGGCATCA GCAAAATAAC TGAACGTTGT
GATCTGAGCA GTATTGCAAC AAACTCGACG AAATACCAGC TTGCGGTAAC CGGGTTCAGT
GATGGACCGG ATGACTACAA TGAGTACTCG GTTCCTTTTC CCTCAGAAGT ACTTGTCGCG
ATGGAAGAAA TGTGTCTTGC ACGAGATATT AGTATGAGGT CTGTGATCCA GTTTGCAGTG
CATTATGTGT TGAAAGGGTT CGGTGGTGGC TCACATACTG TTGCTGCGTC GATCGATGTG
GGTGACGACC CCAATAACAT AGCGACATCA TACACTATTA CACCCTCAAT TGTCTGCCAT
GAGAGCAGAC AAGGACAGAC CGTGATGCAG GAGATTCAGA GTATGGAAAA GTTAAACCAA
TTGAGGAAGC AAGAAATGCA TCCGGGGGAG GCTGGATTAA GTCTCATCAG AATGGGGTTA
TTCGACATTC TGGTTATCTT CGCAGATGCA AACAAGTGTG AGGGTCTAAT TGCTGGCTTG
CCTCTAGCAG TAATGGTGTG CGAAGGAGGT GGAAGACTTC AGGTTAGAAT ACACTTCTCA
GGGTCCCTTT TTCGACAGAA GACGTTAGTG GATATCGCCG AAGCCCTGAA CGTCTTGTTC
GCTAAGGCTG CGTCGGGGGG AGCGACGCCG GTCCGAGATC TTGAACTTCT TTCTGCAGAG
CAAAAGCAGC AGTTAGAAGA GTGGAACAAG ACGGATGGAG AGTACCCTGA ATGCAAAAGA
CTCAATCACC TTATTGAGGA GGCGACACAG CTGCATGAAG ACAAAGTTGC CATCGTGTAC
AAACGTCGCC AGCTTACATA CGGCGAATTG AACGCGCAGG CCAACTGTTT CGCGCACTAT
CTGCGGTCCA TCGGGATCTT ACCTGAGCAG CTGGTGGCTT TATTTCTCGA GAAGAGCGAG
AACCTTATCG TGACTATATT GGGTATCTGG AAGTCCGGCG CCGCATATGT GCCCATTGAC
CCAACCTACC CTGATGAACG AGTCCGCTTT GTGCTTGAAG ACACTCAGGC AAAAGTCATC
ATTGCGAGCA ACCACCTTGC AGAGAGACTT CAAAGCGAGG TCATCAGCGA CAGGGAGCTC
TCCATTATTC GTCTAGAGCA TTGCTTGAGC GCCATTGATC AGCAGCCATC GACATTCCCG
AGAGCCAATT TGCGCGACCC ATCTCTGACC AGCAAGCAGC TTGCCTACGT TACCTATACA
TCGGGGACCA CGGGTTTTCC GAAGGGCATT CTCAAGCAAC ACACTAACGT GGTGAACAGC
ATCACTGACC TTTCAGCTCG GTATGGGGTG ACAGGGGACC ATCATGAAGC CATCCTGCTC
TTTTCAGCGT ATGTGTTTGA GCCCTTCGTG CGGCAGATGC TCATGGCACT AGTGAATGGC
CATTTGCTCG CTATGGTCGA TGATGCTGAG AAGTATGATG CCGAAAAGTT GATACCATTC
ATTCGTGAGC ACAAGATCAC GTACCTCAAC GGCACTGCCT CCGTCCTGCA GGAATACGAC
TTCTCCTCTT GCCCATCTCT AAAGCGTTTG ATCTTGGTCG GTGAGAACTT GACTGAATCT
CGGTATCTGG CACTACGTAG ACATTTCAAG AATTGCATAT TGAACGAGTA TGGCTTCACA
GAATCAGCCT TTGTGACGGC GCTCAATGTT TTCGAACCAG GCTCGGCGCG CAATAACACG
AGTCTTGGGA GGCCGGTGCG CAACGTCAAG TGTTATATCC TCAACAAGTC TCTCAAGCGA
GTGCCTATTG GTGCCACTGG TGAATTACAC ATTGGCGGGC TGGGTATATC CAAGGGCTAC
CTTAACCGTC CCGACCTTAC GCCGCAACGC TTCATTCCCA ACCCATTCCA AACGGACCAT
GAGAAGGAGC TCGGATTAAA CCAGCTGATG TACAAGACCG GGGATCTCGC CCGTTGGCTT
CCAAACGGTG AGATCGAGTA CCTCGGCCGC GCGGACTTCC AAATCAAGCT GCGAGGGATC
CGTATCGAGC CCGGCGAGAT AGAGTCCACT CTGGCGGGTT ACCCTGGGGT ACGAACCAGC
CTAGTCGTCT CTAAAAGGTT GCGGCATGGC GAAAAGGAGA CTACCAACGA GCATCTGGTA
GGCTATTATG TGGGCGATAA TACCTCTGTC TCTGAAACGG CTCTCTTGCA ATTTCTGGAG
CTGAAGCTGC CCCGATACAT GATTCCGACA CGACTTGTGC GCGTGTCTCA AATCCCAGTG
ACTGTTAATG GAAAGGCAGA CCTCCGTGCC CTACCTTCTG TCGACCTTAT TCAACCCAAA
GTGTCCTCTT GCGAGCTCAC GGATGAGGTG GAAATAGCTT TGGGGAAGAT ATGGGCAGAT
GTTCTCGGAG CCCATCACCT GTCGATATCC CGTAAAGACA ACTTCTTTCG TCTTGGAGGG
CACAGCATCA CATGCATCCA GCTCATCGCA CGTATTCGCC AGCAGCTTGG TGTAATTATT
TCCATTGAGG ACGTTTTCTC ATCCCGGACA CTGGAGCGTA TGGCTGAGCT TCTGCGAAGC
AAAGAGTCCA ACGGAACTCC GGATGAGAGG GCTAGGCCTC AACTAAAAAC CGTGGCGGGA
GAAGTTGCAA ATGCTAATGT CTATCTTGCT AACAGTCTCC AGCAAGGCTT CGTTTATCAG
TTCCTGAAAA ATATGGGCCG ATCAGAGGCT TATGTGATGC AATCCGTGCT GCGATACGAT
GTCAATATCA ATCCTGATCT ATTTAAAAAA GCCTGGAAGC AGGTACAACA CATGCTTCCA
ACACTGAGGC TCCGATTTCA ATGGGGACAG GATGTTTTGC AGGTGATTGA CGAGGACCAG
CCGCTGAACT GGTGGTTCTT ACACCTTGCC GACGATTCAG CCCTGCCCGA GGAGCAGAAA
CTACTAGAGT TACAGCGCAG GGACCTGGCT GAGCCATACG ACCTAGCAGC CGGAAGCCTG
TTCCGCATTT ATCTGATCGA GCATAGCTCA ACTCGGTTTT CGTGCTTGTT CAGCTGTCAT
CACGCAATCC TTGATGGATG GAGCCTGCCG CTTCTTTTCA GGAAGACTCA TGGAACTTAT
CTGCATCTCC TGCACGGACA TTCTCTCAGG ACTCTGGAAG ACCCTTACAG GCAGTCTCAG
CAGTATCTCC AAGATCATCG CGAAGATCAT CTCAGGTACT GGGCTGGTAT CGTGAATCAG
ATTGAAGAGC GTTGTGACAT GAACGCTTTG CTGAACGAAC GCAGTCGGTA CAAGATTCAA
CTGGCGGACT ATGACAAAGT GGAGGATCAA CAACAATTAA CTTTAACAGT CCCTGATGCT
TCCTGGCTAA GCAAATTGCG CCAAACATGC TCTGCGCAAG GCATTACATT GCACTCTATT
CTGCAGTTTG TTTGGCACGC GGTATTGCAT GCTTACGGTG GCGGTACTCA TACTGTCACT
GGCACTACTA TCTCAGGGAG GAACCTGCCT GTGAGTGGGA TCGAACGATC TGTGGGTCTC
TACATAAATA CGCTCCCACT GGTAATTAAT CAGTTGGCCT ATAAGAATAA AACCGTCTTG
GAGGCTATCC GTGATGTGCA GGCCATTGTA AATGGCATGA ACAGCCGGGG AAATGTGGAA
CTTGGCCGTC TACAGAAAAA CGAGCTGAAG CATGGGTTAT TTGACTCGCT ATTTGTGCTG
GAGAATTATC CAATACTGGA CAAGTCCGAG GAGATGCGGC AGAAGAGTGA ATTGAAGTAT
ACCATCGAAG GCAATATTGA AAAGCTCGAC TATCCCCTTG CTGTTATCGC GCGCGAGGTC
GACCTAACTG GGGGATTCAC CTTCACCATC TGCTACGCTC GAGAGCTTTT CGATGAGATT
GTTATATCTG AGTTGCTCCA AATGGTCCGG GACACGCTCC TGCAAGTCGC GAAGCATTTA
GATGACCCCG TCCGCAGCCT AGAGTATCTG TCATCAGCGC AAATGGCTCA ACTTGACGCA
TGGAATGCGA CAGACGCGGA ATTCCCCGAC ACCACCCTAC ACGCGATGTT CGAAAAAGAA
GCGGCCCAGA AACCAGACAA GGTCGCGGTG GTCTATGAGC AACGCAGCTT GACGTATCGT
CAGCTAAATG AGCGGGCGAA CCGTATGGCG CACCAGCTCA AATCTGATAT CAGCCCAAAG
CCGAACAGTA TCATTGCTCT GGTAGTGGAT AAGAGTGAGC ATATGATAGC TACCATTCTG
GCTGTGTGGA AGACTGGCGG TGCCTATGTA CCGATCGACC CTGAGTACCC CGACGACCGT
ATCCGCTATA TCCTAGAAGA CACCAGCGCC ATTGCCGTGA TTTCAGACGC GTGTTACCTC
TCACGAATCC AAGAATTAGC GGGAGAGAGT GTCCGTCTGT ATCGGTCTGA CATCTCTACT
CAGACTGACG GTAACTGGAG TGTGTCGAAT CCTGCACCGT CCAGTACGAG CACGGATCTT
GCATATATTA TCTACACTTC GGGAACAACT GGGAAGCCAA AGGGCGTCAT GGTGGAGCAC
CACGGAGTGG TAAATCTGCA GATATCGCTG TCTAAAACCT TCGGGCTGCG CGATACTGAT
GACGAGGTAA TCCTCTCATT CTCCAACTAC GTCTTTGACC ATTTCGTGGA ACAGATGACG
GATGCCATTC TCAACGGCCA AACATTAGTT ATGCTCAACG ATGCAATGCG CAGTGACAAA
GAGCGCCTCT ACCAATATAT CGAAACTAAT AGGGTAACAT ACCTGTCTGG AACCCCATCC
GTTATTTCCA TGTATGAGTT CAGTCGATTT AAAGACCACC TACGCCGTGT CGACTGCGTT
GGAGAAGCTT TTAGCCAGCC CGTCTTTGAT CAAATCCGTG ACACTTTCCA AGGGCTGATT
ATCAACGGCT ACGGTCCAAC AGAGATCTCC ATCACGACAC ACAAGCGGCT GTACCCTTTC
CCTGAGCGGC GCACAGATAA GAGCATCGGC CAGCAGATTG GCAACAGTAC GAGCTACGTG
CTGAATGCAG ACATGAAACG CGTTCCAATT GGGGCTGTAG GTGAGCTCTA TCTGGGTGGT
GAAGGCGTCG CGCGAGGATA TCATAACCGA CCGGAAGTGA CTGCTGAGCG ATTTTTACGC
AATCCGTTCC AAACAGACAG TGAACGGCAA AATGGGCGCA ACAGCCGCTT GTACAGGACC
GGTGACTTGG TACGCTGGAT CCCAGGCAGT AACGGTGAAA TTGAATATTT GGGACGCAAT
GACTTCCAGG TCAAGATTCG CGGGCTCCGT ATCGAATTGG GGGAGATTGA GGCTGTCATG
TCCTCACATC CTGACATTAA ACAGTCTGTT GTAATTGCAA AGAGTGGCAA GGAAGGAGAC
CAGAAGTTCC TTGTTGGTTA CTTCGTGGCT AGCTCGCCAT TGTCTCCGGG TGCAATCCGG
CGCTTTATGC AATCCCGGCT TCCTGGCTAT ATGATACCTT CAAGTTTCAT TCCTATCAGT
TCTCTCCCAG TGACTCCCAG TGGAAAGCTG GATACAAAGG CCTTACCTAC AGCAGAGGAG
AAAGGCGCAA TGAACGTGCT GGCTCCACGT AATGAAATCG AGAGCATCCT GTGCGGTATC
TGGGCAGGGT TGTTAGATAT ATCCGCCCAA ACAATTGGCA GCGACAGCGA TTTTTTCACC
CTCGGAGGCG ATAGTTTGAA GAGTACAAAG CTCTCATTCA AGATTCACGA GGTATTTGGC
CGCACAATCT CCGTCAGCGC TCTGTTCCGT CACCGAACCA TCGAGAGTCT GGCACACCTA
ATTATGAACA ATGTTGGAGA CATACAGGAG ATCACGCCTG TGGATTATGA TAACAGACGC
AAAATAGCCG TATCTCCCGC TCAAGAGCGC CTTCTATTCA TTCACGAGCT TGAAGGTGGA
GGCAATGCAT ATAATATCGA TGCTGCCTTT GAGCTACCTC CATACATTGA TCAATCTCGA
GTCGAAGAGG CATTATATAC CATTCTTTCA AGACACGAAG CCTTACGAAC ATTTCTGCTG
CGGGACCAGG CAACTGGCAC GTTCTACCAA AAGATATTGA CTACCGATGA GGCCAAGTGC
ATGTTGATCA TTGAGAAAAG TGCAGTGAGC ACCATTGATC AAATTGATTC CATAGTCGGA
CGCCTATCGC AGCACATTTT CCGTCTCGAT TCTGAGCTTC CCTGGTTGGC GCATATTGTC
ACGCACAAAA CGGGCAATCT TTATCTGACC CTGTCCTTCC ATCACACTTG CTTCGATGCA
TGGTCATTGA AGATCTTCGA GCGGGAGCTC CGCGTTTTTT GCGCGTCAAA CGAAAAAGGC
GGCAACATGC CAATCCTACC AATGCCTCAA GTCCAGTACA AGGAGTATGC CGAGCACCAT
CGTCGACGAC TAGGTAAGAA TCAGATTCAA AAATTATCCG ACTTTTGGCT GCAAAGACTA
GACGGCCTGG AGCCCCTACA GCTCCTACCG GATTATCCGC GGCCTGCCCA ATTCAACTAC
GATGGAGGTG ACCTCTCCGT CATTCTGGAC GGTGTGGTTC TGGAAACCCT CAGGGGCATT
GCAAAAGACC ACGGAGTAAC TCTGTACGCA GTGCTTCTCG CTGTTTACTG CCTGATGCTT
TCGACATATA CACACCAGGT AGATATCGCT GTGGGAGTCC CCATCAGTCA CCGAACCCAC
CCCCTGTTCC AGTCTATTGT CGGATTCTTC GTCAATATGG TAGTTGTGAG GGTCGACGTG
AAGGACTTTG CCGTTCACGA TCTCATTCGA AGGGTAATGA AAGCGCTTGT TGATGCCCAG
TTACATCAGG ACATGCCATT CCAAGACGTG ACTAAACTGC TGCGGGTGGA TAACGACGCC
AGCCGACATC CCCTAGTTCA GACTGTGTTC AACTTTGAAA GTGACATGGA CAAAGAATTC
GAGACGACAC CTTCAATCCA AGACACTGCC ACAATCGCAC CATACCAGTC CGTTCAGAGG
ATAAAGTCGG TTGCGAAATT TGATCTGAAC GCGACAGCTA CAGAGTCGGG CTCAGCCTTA
AAGATTAACT TTAACTATGC CACCAGCCTG TTCCGGAAAG AAACGATCCA GGGCTTCTTA
GAGACATACA GGCATCTCCT GTTACAGCTC TCTTATCTGG GGTCCCAGGG ACTTAAAGAA
GATACAAAGC TACTGTTGGT CCGCCCTGAG GAGATGAGTG GTCCGCATCT GCCATTAGCA
GGATTATCCA ATGGTGCGGA AACCCTAGAA GCTATATCAC TCAGTAGAGC ATTCGAGTTT
GAAGCTTTCA GGGTACCGGA TAGAGCTGCC GTCGTACAGG GAGATAAATC ACTCAGCTAT
ACCGAGCTCA ATAAACGGGC AAACCAGCTA GCCCGGTACA TACAATCCGT GGCACACCTT
AGGCCGGACG ACAAGGTGCT CCTCATTCTG GATAAGAGCA TCGACATGAT TATTTGCATC
CTCGCAATCT GGAAAACCGG TAGCGCATAT GTGCCTTTGG ATCCATCATA TCCCAAGGAG
CGTGTCCAGT GCATTTCGGA GGTAGTTCAA GCAAAGATTC TGATTACAGA GTCACGGTAC
GCCTCTGCAT GGGGAAGCCA GACGTCAACA ATACTTGCAA TTGACTCGCC CAAGGTCTCG
AATATGGTCA ATAATCAGGC AACTCATAAC TTGCCCAACA TTGCGGGAAT AAAAAATCTG
GCATATATAA TTTTCACATC TGGCACCTCC GGCAAGCCAA AGGGTGTTCT GGTCGAACAA
GGTGGAGTTC TTCACTTGCG TGATGCGCTT AGGAAGCGGT ACTTTGGCAT TGAATGCAAT
GAATACCATG CTGTGCTCTT CCTATCCAAT TACGTGTTTG ATTTCTCTAT CGAGCAGTTG
GTCTTATCAA TTATGAGCGG CCACAAGTTG ATCATCCCGG AAGGAGAATT CGTTGCGGAT
GATGAATTCT ACATAACAGC CAACGGTCAA CGCCTCTCAT ATTTGAGCGG TACACCATCC
CTGTTGCAGC AAATTGACCT AGCACGCCTC AATCATCTAC AGGTCGTAAC TGCAGCTGGT
GAGCAACTCC ATGCTGCGCA GTTTAATAAG TTGCGCTCCG GATTCCGCGG CCCGATCTAC
AACGCATATG GAATTACGGA GACCACGGTA TACAACATAG TCAGCGAGTT CAGTGCGCAA
TCCCAATTCG AAAATGCTCT GCGAGAGCTG CTACCAGGCA CTAGGGCATA TCTTCTTAAC
CACGCCACTC AGCCAGTTCC TATGAACGCA GTCGGAGAGC TGTATCTCGC TGGTGATTGT
GTGGCCCGTG GCTATCTCAA CCAGCCTGTT CTAACAGGTG ACCGTTTTAT CCAGAATCCA
TTCCAAACAG AGCAAGATAT TGCTTCCGGA AGCTATCCTC GGCTCTATAG AACTGGCGAC
CTGTTTCGAT GCCGGCTTGA CCGTCAGCAC CAGCCATATC TAGAATATCT TGGAAGAGCT
GATCTCCAGG TCAAGATAAG AGGATACCGT ATTGAGCCGT CAGAAGTTCA GAACGTGCTT
GCTTCCTGTC CTGGCGTTCG AGAATGTGCA GTAGTGGCCA AGTATGAGAA CACCGATGCT
TACTCCAGGA TAGCCAAATT CCTGGTCGGA TATTATACCC CTGACACCGA GACGGTCTCC
GATTCAAGTA TCCTCGCCCA CATGAAAAGC AAGCTTCCCG CATATATGGT CCCTAAATAT
CTATGCCGTC TAGAAGGTGG ACTTCCAGTG ACAATCAACG GGAAACTTGA CGTTCGAAAG
CTGCCTGATA TCGGCAACCC TCAACATCAA ATATCGTACA ACCCCCCAAG GGATGTCCTG
GAGGCCGACT TGTGTAGATT ATGGGCATCA GCACTAGGAA CAGAGCGATG CGGTATTGAT
GATGATCTGT TTAGGTTAGG CGGAGACAGT ATTACTGCTT TGCATCTCGC AGCCCAAATC
CACCACCAGA TCGGCCGAAA GGTCACTGTT CGAGATATTT TCGACCACCC TACCATTCGT
GGTATTCATG ACAACGTTAT GGTGAAACTC GTTCCACACA ATGTTCCTCA ATTCCAAGCA
GAGCAGCAAA CAGTACTCGG TGATGCGCCT CTGCTACCGA TCCAAACTTG GTTCTTATCA
AAATCGCTAC AGCACCCAAG CCATTGGAAT CACACCTTCT ACCTACGGAC CCCTGAGCTG
GACGTGACTA CTCTGAGCAC AGCAGTCGCC GAATTGCAGC TGTATCATGA CGCCTTCAGA
ATGCGGTTGA GGCAAATAGA TGGAAGGACG GTGCAATGCT TCGCAGATGA CATTTCTCCA
GTACAGCTCC GAGTGTTGAA CGTCAAGGAT GTCGACGGAA GCGCGGCTAT TGACCAGCAA
CTCCAGAAAT ATCAGTCTGA CTTCGACCTT GAGAAAGGCC CAATCTGTGC TGCTGCCTAC
CTCCATGGCT ACGAGGATCG ATCTGCACGA GTCTGGTTTT CTGTCCACCA CATCATCATT
GATATAGTTA GCTGGCAGAT TCTTGCGCGC GACCTACAAA TCCTGTACGA GGGTGGAACT
CTCGGTCGTA AGAGTAGCAG CGTCAGACAA TGGGCAGAGG CACTACAGAG CTACCAGGGG
TCGGCATCGG AGAGGGCCTA CTGGGAAGGA CTTCTTGCTC AAACGGCTGC CAACATATCC
GCTTTGCCCC CAGTGACCGG GACCCGTACC CGGTTGGCTC GAACTTGGAG TGACGACAGG
ACGGTCATTC TCCTGAATGA AGCTTCTAAT CAGAATGCAT CTATACAAGA CCTCTTACTC
GCCGCTGTTG GATTGGCACT TCAACAGGTC ACCCCGGGTA GCCCGAGTAT GATTACTCTC
GAGGGCCATG GGCGTGAGGA AATTGTTGAC CCGACATTAG ACCTCAGCCG TACCTTGGGT
TGGTTCACCA GCATGTATCC CTTCGAGATC CCTCCCCTGA ATGTTGAAAC CCTTAGCCAG
GGCATAGCCA GCTTGCGAGA ATGCCTTAGG CAGGTGCCTG CACGGGGCAT CGGGTTTGGA
TCACTCTACG GTTATTGCAA ACACCAAATG CCTCAGGTTA CGTTCAACTA CCTGGGCCAG
CTGACAAGCA AGCAATCGAT AACTGATCAG TGGGCCCTCG CTGTTGGTGA CGGAGAGATG
CAATATGGGC TTACAACAAG TCCTGCGGAC AGAGACCAAA GCTCGTTCGC GGTTGATATC
ACCGCCAGCT GTGTAAATGG TGCCCTGTCA GTCGAAATGA ATAGTGCCTG GAGCCTTGAA
AAAAGCATGC GATTCATATC CAGGATTGAG GAAGTATTGA ATATGATTCT TAGCGGGACC
CTAGCTCAGC AGGCGACTCC AGTGCTTACG CCACAGGTAT TCAACGAGGA GATGTACACA
CCATATTTTG AATTTTCCAA AACCCCACGA CGCGGACCGA TCTTGTTCCT ATTGCCGCCA
GGGGAGGGAG GGGCAGAAAG CTACTTTAAC AATATCGTCA AGCACTTGCC CACGACTAAT
ATGGTCGTCT TTAACAATTA CTACCTTCAC TCCAAGAGTC TGAACACGTT TGAAAAGCTA
GCTGAGATGT ATTTGGGGCA CATCCGTCAG ATCCAGCCAG ACGGGCCTTA CCATTTCATC
GGATGGAGTT TTGGAGGAAC AATCGCGATG GAAATATCGC GACAGCTCGT GGGGCTAGGT
TCAACGATTG GTCTTTTAGG TATCATTGAC ACGTATTTCA ACGTGCCTGG AGCAACGCGG
GCAATTGGCC TCGGTGATAC TGAGGTCTTG GATCCCATTC ATCATATATC CCAACCAGAA
CCAGCCGATT TCCAGTGCCT CCCAGCCAGC ACAGACTACA TCATTTTATT CAAAGCTACT
AGGGTGAACG ACAAGTTTCA GTCTGAAAAC CAGAGGCGTC TGTACGAGTA CTACGACAAA
ACATTGCTTA ATGATCTCGA CTGGTTACTC CCTGGTGCTT CAAACATTCA TCTAGTCCGC
CTTGAGGAGG ATACTCACTT CTCCTGGGCG ACCAATCCAC GCCAAATCGC CCACGTTTGT
TCAACAATCG AGAAATTTCT CGCCAGATAT TAG
 
Protein sequence
MSPPGLLSED GPGYSGGYAD PTVPKVNWKQ SNGKSAGGNG DVDAGNGNID PSKSGVGVQV 
CFAGGLEGWK AGISKITERC DLSSIATNST KYQLAVTGFS DGPDDYNEYS VPFPSEVLVA
MEEMCLARDI SMRSVIQFAV HYVLKGFGGG SHTVAASIDV GDDPNNIATS YTITPSIVCH
ESRQGQTVMQ EIQSMEKLNQ LRKQEMHPGE AGLSLIRMGL FDILVIFADA NKCEGLIAGL
PLAVMVCEGG GRLQVRIHFS GSLFRQKTLV DIAEALNVLF AKAASGGATP VRDLELLSAE
QKQQLEEWNK TDGEYPECKR LNHLIEEATQ LHEDKVAIVY KRRQLTYGEL NAQANCFAHY
LRSIGILPEQ LVALFLEKSE NLIVTILGIW KSGAAYVPID PTYPDERVRF VLEDTQAKVI
IASNHLAERL QSEVISDREL SIIRLEHCLS AIDQQPSTFP RANLRDPSLT SKQLAYVTYT
SGTTGFPKGI LKQHTNVVNS ITDLSARYGV TGDHHEAILL FSAYVFEPFV RQMLMALVNG
HLLAMVDDAE KYDAEKLIPF IREHKITYLN GTASVLQEYD FSSCPSLKRL ILVGENLTES
RYLALRRHFK NCILNEYGFT ESAFVTALNV FEPGSARNNT SLGRPVRNVK CYILNKSLKR
VPIGATGELH IGGLGISKGY LNRPDLTPQR FIPNPFQTDH EKELGLNQLM YKTGDLARWL
PNGEIEYLGR ADFQIKLRGI RIEPGEIEST LAGYPGVRTS LVVSKRLRHG EKETTNEHLV
GYYVGDNTSV SETALLQFLE LKLPRYMIPT RLVRVSQIPV TVNGKADLRA LPSVDLIQPK
VSSCELTDEV EIALGKIWAD VLGAHHLSIS RKDNFFRLGG HSITCIQLIA RIRQQLGVII
SIEDVFSSRT LERMAELLRS KESNGTPDER ARPQLKTVAG EVANANVYLA NSLQQGFVYQ
FLKNMGRSEA YVMQSVLRYD VNINPDLFKK AWKQVQHMLP TLRLRFQWGQ DVLQVIDEDQ
PLNWWFLHLA DDSALPEEQK LLELQRRDLA EPYDLAAGSL FRIYLIEHSS TRFSCLFSCH
HAILDGWSLP LLFRKTHGTY LHLLHGHSLR TLEDPYRQSQ QYLQDHREDH LRYWAGIVNQ
IEERCDMNAL LNERSRYKIQ LADYDKVEDQ QQLTLTVPDA SWLSKLRQTC SAQGITLHSI
LQFVWHAVLH AYGGGTHTVT GTTISGRNLP VSGIERSVGL YINTLPLVIN QLAYKNKTVL
EAIRDVQAIV NGMNSRGNVE LGRLQKNELK HGLFDSLFVL ENYPILDKSE EMRQKSELKY
TIEGNIEKLD YPLAVIAREV DLTGGFTFTI CYARELFDEI VISELLQMVR DTLLQVAKHL
DDPVRSLEYL SSAQMAQLDA WNATDAEFPD TTLHAMFEKE AAQKPDKVAV VYEQRSLTYR
QLNERANRMA HQLKSDISPK PNSIIALVVD KSEHMIATIL AVWKTGGAYV PIDPEYPDDR
IRYILEDTSA IAVISDACYL SRIQELAGES VRLYRSDIST QTDGNWSVSN PAPSSTSTDL
AYIIYTSGTT GKPKGVMVEH HGVVNLQISL SKTFGLRDTD DEVILSFSNY VFDHFVEQMT
DAILNGQTLV MLNDAMRSDK ERLYQYIETN RVTYLSGTPS VISMYEFSRF KDHLRRVDCV
GEAFSQPVFD QIRDTFQGLI INGYGPTEIS ITTHKRLYPF PERRTDKSIG QQIGNSTSYV
LNADMKRVPI GAVGELYLGG EGVARGYHNR PEVTAERFLR NPFQTDSERQ NGRNSRLYRT
GDLVRWIPGS NGEIEYLGRN DFQVKIRGLR IELGEIEAVM SSHPDIKQSV VIAKSGKEGD
QKFLVGYFVA SSPLSPGAIR RFMQSRLPGY MIPSSFIPIS SLPVTPSGKL DTKALPTAEE
KGAMNVLAPR NEIESILCGI WAGLLDISAQ TIGSDSDFFT LGGDSLKSTK LSFKIHEVFG
RTISVSALFR HRTIESLAHL IMNNVGDIQE ITPVDYDNRR KIAVSPAQER LLFIHELEGG
GNAYNIDAAF ELPPYIDQSR VEEALYTILS RHEALRTFLL RDQATGTFYQ KILTTDEAKC
MLIIEKSAVS TIDQIDSIVG RLSQHIFRLD SELPWLAHIV THKTGNLYLT LSFHHTCFDA
WSLKIFEREL RVFCASNEKG GNMPILPMPQ VQYKEYAEHH RRRLGKNQIQ KLSDFWLQRL
DGLEPLQLLP DYPRPAQFNY DGGDLSVILD GVVLETLRGI AKDHGVTLYA VLLAVYCLML
STYTHQVDIA VGVPISHRTH PLFQSIVGFF VNMVVVRVDV KDFAVHDLIR RVMKALVDAQ
LHQDMPFQDV TKLLRVDNDA SRHPLVQTVF NFESDMDKEF ETTPSIQDTA TIAPYQSVQR
IKSVAKFDLN ATATESGSAL KINFNYATSL FRKETIQGFL ETYRHLLLQL SYLGSQGLKE
DTKLLLVRPE EMSGPHLPLA GLSNGAETLE AISLSRAFEF EAFRVPDRAA VVQGDKSLSY
TELNKRANQL ARYIQSVAHL RPDDKVLLIL DKSIDMIICI LAIWKTGSAY VPLDPSYPKE
RVQCISEVVQ AKILITESRY ASAWGSQTST ILAIDSPKVS NMVNNQATHN LPNIAGIKNL
AYIIFTSGTS GKPKGVLVEQ GGVLHLRDAL RKRYFGIECN EYHAVLFLSN YVFDFSIEQL
VLSIMSGHKL IIPEGEFVAD DEFYITANGQ RLSYLSGTPS LLQQIDLARL NHLQVVTAAG
EQLHAAQFNK LRSGFRGPIY NAYGITETTV YNIVSEFSAQ SQFENALREL LPGTRAYLLN
HATQPVPMNA VGELYLAGDC VARGYLNQPV LTGDRFIQNP FQTEQDIASG SYPRLYRTGD
LFRCRLDRQH QPYLEYLGRA DLQVKIRGYR IEPSEVQNVL ASCPGVRECA VVAKYENTDA
YSRIAKFLVG YYTPDTETVS DSSILAHMKS KLPAYMVPKY LCRLEGGLPV TINGKLDVRK
LPDIGNPQHQ ISYNPPRDVL EADLCRLWAS ALGTERCGID DDLFRLGGDS ITALHLAAQI
HHQIGRKVTV RDIFDHPTIR GIHDNVMVKL VPHNVPQFQA EQQTVLGDAP LLPIQTWFLS
KSLQHPSHWN HTFYLRTPEL DVTTLSTAVA ELQLYHDAFR MRLRQIDGRT VQCFADDISP
VQLRVLNVKD VDGSAAIDQQ LQKYQSDFDL EKGPICAAAY LHGYEDRSAR VWFSVHHIII
DIVSWQILAR DLQILYEGGT LGRKSSSVRQ WAEALQSYQG SASERAYWEG LLAQTAANIS
ALPPVTGTRT RLARTWSDDR TVILLNEASN QNASIQDLLL AAVGLALQQV TPGSPSMITL
EGHGREEIVD PTLDLSRTLG WFTSMYPFEI PPLNVETLSQ GIASLRECLR QVPARGIGFG
SLYGYCKHQM PQVTFNYLGQ LTSKQSITDQ WALAVGDGEM QYGLTTSPAD RDQSSFAVDI
TASCVNGALS VEMNSAWSLE KSMRFISRIE EVLNMILSGT LAQQATPVLT PQVFNEEMYT
PYFEFSKTPR RGPILFLLPP GEGGAESYFN NIVKHLPTTN MVVFNNYYLH SKSLNTFEKL
AEMYLGHIRQ IQPDGPYHFI GWSFGGTIAM EISRQLVGLG STIGLLGIID TYFNVPGATR
AIGLGDTEVL DPIHHISQPE PADFQCLPAS TDYIILFKAT RVNDKFQSEN QRRLYEYYDK
TLLNDLDWLL PGASNIHLVR LEEDTHFSWA TNPRQIAHVC STIEKFLARY
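
Both sequences are printed in space-separated blocks of 10 characters, and the protein sequence is the translation of the gene sequence. The sketch below strips the block formatting and translates the first line of the gene sequence. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty; the helper names `clean_sequence` and `translate` are hypothetical.

```python
# Standard genetic code (NCBI table 1), built from the conventional TCAG ordering.
# Assumption: this record does not state its translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the last 12 bases, copied from this record.
first_line = "ATGAGCCCTC CCGGGCTATT GAGCGAAGAC GGCCCTGGCT ACAGTGGCGG CTATGCAGAC"
last_bases = "GCCAGATAT TAG"

head_peptide = translate(clean_sequence(first_line))
tail_peptide = translate(clean_sequence(last_bases))

print(head_peptide)  # MSPPGLLSEDGPGYSGGYAD
print(tail_peptide)  # ARY
```

The two results match the first 20 residues and the last 3 residues of the protein sequence listed above. Translating the complete gene sequence the same way would be expected to reproduce all 3770 residues, under the same assumption about the genetic code.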