Gene CNA06110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA06110 
Symbol 
ID3253675 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1640396 
End bp1652385 
Gene Length11990 bp 
Protein Length3671 aa 
Translation table 
GC content48% 
IMG OID638252932 
Producthistone acetyltransferase, putative 
Protein accessionXP_566965 
Protein GI58259105 
COG category[B] Chromatin structure and dynamics
[D] Cell cycle control, cell division, chromosome partitioning
[L] Replication, recombination and repair
[T] Signal transduction mechanisms
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5032] Phosphatidylinositol kinase and protein kinases of the PI-3 kinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.607769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACCCC CTAGTTACAC CAGTAACGGG CCAATGTCCG TGAGCGATTG CGAGTTCCTT 
GCATCTCAGT TGCTGGAACC TACGGTCACA GCACGAAAGA AGCTTGAAAT TGCTCTTGAG
CTCAGGGACT CCGCAGAGAA CAACAGGGAT TTCGGATTCT ATGACAAGTA CTTGTCAATA
TTTATTCCGG CACTCATCAG TATACTCGGG GATGAGAAAT CAATCACTTT TGTGAAGGAT
AATGTTGAGC AAGTGAGTGC AAAACAGTCA TAATATCCCC GTCTAGAGAC GTTCAATGAT
GGACTGACAT CGTATCAGCG TTTTCGCCAC ACCCTTCTTG CATTCTTGCA GCGCCTCCCC
CACACCGAAC CTTTCCGTCA TCACATGAGT TCTGTTATGG AGCTTTGCGT GAAGCTTCTC
AAGATTGAAA ATGAAGAAAA TGCGTTGCTC TGTATCAAGA TCATGATTGA TGGGCTTAGG
AGCAATAAGG ATCAAATGGA ACCTTTTACG GAACCTTTCC TTGATTTAGT CAAACAAATG
TACGCGAACA TAAAAGCGGT TGTGGAGAAA GAGTTTGGCC CGTCAGGGGG AGGATCAAAG
CCGGTATCTA CGCAAGGCGA AGGCTCGGCA AACGGTCAGC AATCTCAGCA ACAGCAAGCA
TCGTCCTCCA GTCATGCCAT TCTTCCTCAT GCGCTTCATT CGCCCAAAGT TCTCACTGAA
TGCCCTATCG CTGTTGTACT CATATTCCAA ACCTACAAGT CCATCATGCA AACTGCAATG
CTCGACTTCT ATCCATTGGT AATTGACAGC ATCAAGATAC AGCCTGAACC CCAAAGGTTG
GCGCATCTAG AAGCTAAAGA AAAGGGAGAA ACCTTTGTCG GGGTGGCAAG TGGGATAACA
AATCGGGAAA TGTTCGCAGA GCTTGTAAAG GCCCAGGTCA AGGCATGTGT ATTCTTCGTA
GTCCATGAGT GTAAGCTGAT GTTAGTCAGA CCATGGCATT TTTGGCTTAT GTCCTTCGAG
GAAACCAAGG AAACAATAGA GATTACGTCA ATGTTTTCCC AGAAGCATGT GCACGCCTCT
TGCGAGATTG CCCGCCAGAG GATGTCATCA CTCGAAAAGA ACTTTTAGTA GCAACTCGAC
ATATCCTTAC TGTTGACTCT CGATCCTCCT TCATCCCTTA TATCGATGTC CTTCTAGAGG
AGCGAGTCCT CGTCGGCACG GGTGTCTCAA GTAGGGAAAT GCTTCGACCG TTGGCTTATT
CCGTGGTGGC CGACCTCATA CATCATGTCC GAAACGAGCT TCCTCTACAG CAACTCATCC
GTGTTGTCTA CGTCTTCTCG TGTAATCTCA ATGACTCAAC TTTCTCGAGC TCAATTCAAA
CCATGTGCGC CAAACTCCTC AATACTATCA TTGATTCGAT CTATAACAAG GCGGACACGA
ACGAGATGTC AAAGATTCTC AAAGGGATGT TTTTCACGTT CTTGGAAAAG CTCTCTGCCA
TGTCGGATGC GCATGATAGG TTGAAGGCAT TGGCCGCTAG AGACAAAGGG AAGGGCCGGG
CCAAAGAGGA AGGTGATGAA GATGTAGAAG TGACTGATAC GTCAGACGAA GCGTCAGATA
AACTTATACA TGGGTGGAGA GATATTGAGC AAGCGATGCC CGTTCATTCA GTTGCTTATG
CCAACGAATC AGTGGATTCT TTCTGTCGGG GTCAGTGACT CAAAGAGGAA TATGAGTGTG
GCTGACATAT TTGTAGATTC TCGGTACCTT TTCAAGACAA TATTGCATAC CTTCCGTACC
CTTTTATCTT ACACACGCCA AGGCGAAAAC CCTCCGCCCC AACCAGATGG AGAGGTTCTC
AGCAGATTTT TCGAGTGCAG TATCAAATGT TTTGCCATTT TTGACGTTCT CAATAGAGAT
CCCAGGGAGG CGAAGGAGGC GTTAGAGCTT CTATCAGAAA TTGTTCTACT CTTTGAGCCG
CATGTGTTTG CCGAGGTCTG GACAAGCCAC ATGGAATTTT TCAGCGATAT TTCCATCACC
AACAATCAAG TCTTCTCTCT TCTCCAGATG GTCATCACCC ATGAGTCGGT CTCGCATCAA
CTGGTTTCCA TTCTACTCAA GTATTTGATG GAGAAATTGC CTGAAATTGG GAGGATGGAC
AAACAACGTG CCACTCTTAT GCTCAAAATG TTCAAGATAG CTTTCCTTGC TATCAATACA
TATATCACAA GCAACGAAGC AGTCTTGGTG CCTCACCTCC AGAAGCTTAT CATGAGTTCT
TTCGAATCGG CTGCCAAGGC CGAAGACTCG TCGTTTTATT ATCAGATCCT TCGCGCTTTG
TTCAGGTGAG TAATATTTCT TCATATCGGA CTGTCTGACA TGCAATAGGT CTATTGGTGG
AGGACGTTTT GAAGCGCTCT ACAAAGAAGT TCTTCCAATC TTGCAGGAAA TGCTTGATCA
TCTGGCATAT CTTTTGGATC ATTCGCCCGA CGAAATCTCA AAAGACATCT TTGTGGAGCT
CATGTTGACC GTTCCCGTTC GTCTCACAAA CCTTCTACCG CACCTGAGCT ACCTCATGAA
ACCATTAGTG CGAGCACTCA GCGCAGGTCC CGACCTTGTC AGTCAAGGTC TTCGTACCCT
AGAGCTTTGT ATCGATAATC TCACAGCCGA TTTCCTTGAT CCCACTCTGG CACCGGTGTT
GCGTGATCTC ATGGCTGCTT TACATCAGTT ACTCAAGCCC ATACCTGCCA ACCGTGAGCA
TGCAAGTGCG GCTCTGAAAA TTTTAGGGAA ACTTGGAGGG CGGAACAGGA GGTTCCAGGA
GGTCCACGAC AATCTTGAGT ACCGGCTACT CTCTGATCGC TTAGTTGTCC CCATCACCTT
TGAAGGAACT CGTCATCATT TAGACTTGAC ACCTTTAGTG AACTCCGCAG GCAAGGCTAT
TGACAATGAA GCGGATCTTC TTCGGGAAGA CGGGTTGCAA GTGTTGATGT ACTCGGCTCT
GACAATATTT GAGAAGGTAT GCGGACTGTA ATTACGCAAG ACTCTCTGCT AACCGAAGTA
TAGGGGGCTC CTGGCCCAGA AGGAAATGCC ACATTCAGAA CCACTATGAC TCGGCTTTTC
TTCGCCTGTG ACAGACCTGT CATTGGAGAA AGGTCGCTCA TCTTTGTGCG TGATCTTTGT
CGCAGGGCAT TCGCCCTGGA GCTGGGAAGG ACGGACGGCA TCGAGCATCC CATTAAACCT
GGTCCTGATC ATTCTCGTCG ACGATTCCTC CCCCTCACCA ATGCTTTATC CGATGCGTTC
CTGGAGACCC TTACCATCTC CAAAGCTGCA GAACAAAAGG GTCTCAGCGA CTTGTTGGCA
ACAATCGTTA TGGATTTTAA GGAGTTGGCT CTCTCCCCAA GATTCCAAGG TGTTGTAGAT
GGGCATCGAT CATTCGATCG TATGGTAACT TTCTTTGCCC TCCGCCTCGT GACGCTTTGT
CACGAGGAAG CTTGGTCAAA AAAGATGGCT GGCGTTTCTG CCATATCCAC TTTCGCTCAT
AAGATCGAAT TGAGCCGCAA AAACATAATT GATCTCCAGC TTGACTTCGT CCGAGCGCTT
TTGTACTGCT TACGTGACGC TCCCAAAGAC GTTCCCAGGA GTGCAGACGA CGTTATCGGG
CTCATTAAAC ACCTTATACG AACCTGTCAA TCTCAGGACG ATGGTAAACC TCGAATAGGA
CGCTTGATTG AGACGTTTGT TGGCGAGCTC AACAGTCAAA GCAAGTTAGC CCGTGACGCT
GCACAGCAAT GTATCGAAGT GCTCGCGGAG GTTACTGCAC AGACAGTGCC TGAACTCATC
ACCAATATCG CAAAGGTCAA GTTGCTGAGT GTTGATCACG GCCCAATTTA CTCGAAGCCT
TTACGAGCCC TTCCCTTTGC CATGCAGGTC GGTAACATCA GTGCAGTCAC ATATCTGATG
GATTTACGTC CTTCGGTGGT CGAGACGTCA GAGGAGTTCA TTCGACTGCT TCATGAGGTC
CTAGCATTAG CGGATGTCGA CGATGCGAAT TTGGTCAGCA AGCCTGCGAC TCATAAGCAA
GAGTCGTGGT TGAAAGCTCT TCGAATTTGT TGCCTTCGCC TGCTCAAGTC GTCCATGGCC
ACGCCAGACT TCATGAATAA ACCAACTCAA GGACAGCTCC GTGCTCGGTA AGTGGTGAAG
TCTAGTATCA CTTGCACATA TACTTACTGT TTACTAGTAT TATACAGGTA TATTTCAAGC
ACGTTTATTC ACAAAATCCA GAGATCGTGG CTGTCGCCCA CGAAGGTCTT AGAGACGTGC
TCCAACAGGA GAACAGACTC TCGAGGGATG TTTTACAGAA AGGACTTAGA CCTATCCTGG
TGAATCTTGC GGATGCCAAG AGGTTGAGCG TTTCCGGTTT GGATGGCTTA GCTCGTTTCC
TTGAACTTCT CACCAATTAT TTCAAGGTGG AAATCGGCGT GAAGCTATTA GATCATTTCA
AAACTCTCGG CGATCATCAA ATGTTAGTGA AGGCGGCTTA CGCTCCTCTC GATGACAATC
ACAATATTGC CAGGATGTCT CGTTTAGTAA ACATCTTTCG ACTTCTGCCC TCAAGCGCCA
TCCAATACTT GAACGACCTT GTGGCCAACG TCGTTGAAGT AGAGGCTCTT CTTCACCAAT
CGCAACCTGG TCCATTCACA GAGTATCTTG GTCGTTACCT TGACAGGTAC CATGCCAACG
CCGTCCAGAA TCTTTTCGAT AACATACGTA ACACGCGTTA TGTTTGGACT TACCGCAATA
TTATCACGTC CGGGAGTGCA CCTCATCTCG TCGAAGAGTT TGCCAGCCGT GGGGAAGCTC
TTTGTCAACT ATGTTTCAGT AATCCGGAAG TGACAGACCT TGTCTTACCT GGTTTGCTTC
TTGTCAGGGA TCTGTCCCGG GTTCAGTCTA GCTGGTTGTC TGACAGTGAA CCGGTTCTCG
AACCGATGGT TAATGTTTGG AGGATGATTG TGAATAAGTC GCGCGACCCC AAGGCAGACA
TTACCGGTTA TCAATTCCAG CAGATGCCTT CTCTATTGCT CGAAATGTTC ATGGCAAGCC
TCGAGCAGCA ACAACACATT CCTCTCCTAT TCCATGTTGT GGAAGCCTAC GAGGTGCGAG
CGGCATTTGA GCGATCTCAT GTCACCTTCT TCCTGTATCG ACAGGTTGCG TTGCAAGAAT
CTGTCGAGTA TCGTCGCGAG GTTATCGAAT ACTTCTTTAG CCTCTACGAG GCCGAGGATG
TACCGTGGAC CTACAAAACA AATGCTTTGC GCGTCATTGT GAACCCTACT CTTCGAGTAT
ATTTCGGTGA TCCCAATCAC GATGGCTCTT TAATTTCCGC TCAGCTCGTG CGGAAGATCG
CCAACCTAAT GTGGCGTCCG CTTTCTGCAA CGACGTCTTC AAAACAAAGA GAGGATACGC
ATCTCATCGA AGTCTTTGCA TTAACCACCA TGTTAGTTCA GCACTGCAGT GCGAAGGTTA
ATGAAGCGAG GAAAGAAATA TTCAAACTGG CGTGGATGGG AATCAATCTA CTGGAACCGA
CTGTCAAGCT CATGGCTTAT GTCCTTGCCG CCCGATTCAT GGCGACGTAC GACACTCCAG
TCAAATTTGT AAGGCTCACA TGGACCGGCG TTTTGCGGCT TAAAGACACG GATAACCGTG
TTCTTTATCG CCAAGCTATC GACACTCTGG CATCATCATT ATCTGTAAGA GATCCCCCAC
CAGCCAATGG TACGCCGGAA TGGGCGAAGC TCCTTCGAAC CGTCCTCATT GAAGAGGGCC
ATGCAACCAA CCAGCTTGTC ACCGTCTGTG AACTCTTGGT GCACCACCCC GACCTCTTTT
ATGACTACCG TGAGCTATAT GTCCCTCACA TTGCCAACTC TTTGGGCAAG CTGGCGTTCG
CTCAAGCGGC GACTCCTGAA CTTAAAAAAT TGACTGTCGA TATTGTGGAA CTCATCTTCA
ATTGGGAGAA GAGACGTATG GCAGCGAGAG ATGGAGAGAC CATGGATGTG GATGAGGGAC
CGAAAAGAGG CGCAGATCAG TCGGTAGAGC AAGGCCCGAC GAAGAAACAA CGAGTTGATA
GGGCAGGAAC TGCCGTATCT GGAAGCAGCG GGGGAGGTTG GGCAGCCCCC AGTCAAGTTA
GAGAGCTTAT GACGGCTCAT CTTCTGAGAT TAGTGTCAAC ATCCGCCGAT CCTGTGACCA
GGAATGGACT GACGAAGCGG GCGTTGATGC TTTTCAAAGA CATTCTAGGA CCGAAGGGTT
TGCCAAATGT ACATGTTAAA TTAGGATTTT TCCACAGAAC TATGACACAG GTTCGTTCGT
TTGGTGACGA TTGAAACTCT GCTGACAGGC GCCGTATCAG GACATCAACC CAAATACAAA
GCCTACTGTC GCTAACTCTA CTGAGGTAAT CGCTGCTGTG GCCGCTGCTG TGAAAGACAC
TCAATGGGTT AAAGCCAACC TCAGTTTGTT GTCCAAACTT CTAGAAAAAG TTTGGGTGTC
TCCCGAGACC GATTTACACG AAGTTGTAGC TCCTTTGACT GAAGACCTCT TTTCGGAAAT
GCCCGCCGAT GAGAGTGCCG AGGCTGAGCC CGATGCGAAA GCCTTGTTAG CATTCGTTCA
AACTGCAGTC AATGATGGCC TCTCCGCTAG CTTGCGATCG ACTTTGTCAC TACCTGGAAC
ATTATTTCTT CTGAAGACCT GGTTGAAGAC TCAGCCCGGG GTGTTGCAAT CAGAAGGCAT
CAGTTCGGCT TTGCTGAAAG TCCTTGCCAA TCTAATCAAG CTTCACACCA CATCCAATCA
GCCTGCTAAT GCGGCTAACG AGCCTGATAT CGTTAGACTC ATCACATCTG TCCTCGACAT
TCTTCGTGAC AGGGTCAATG ATCTTCGAGA ACAACGCAAG CATCTATATA GCAGTATCAC
AATTTTGGTC GACAGGTCGC TCAATCCAAT GCTCTGTCGT TACCTTTTAC AACTCATGAG
GCATTGGGTT ATTGGTAGCA ATGATGGAGC GGCGCACGGC AAGGAAAAGG CATTGATACT
ACTTCGTATG ATGTGTTTCG AATCAAGAAG TGACCAACTC TTCCAAGAAT TCTTGGAGGT
CGTTTATGAC GTCTACCAAC AAGAGAATCT TCGTGGCTCA GACATAACAC ACCGCTTAGA
GCCAGCCTTC TTATTGGGTA CCAGGTCAAA GAATGCCGAG CAGCGAGCAC GCTTCCTTGA
CAAATTGGAA CAGAACTTAC CTAGGTCCAT CGACAGTCGG CTGCAATACT TATGTTCTTT
TCAAAACTGG GACACATTAG CCGACAGCTA CTGGATTCCT CAAATTCTGA GCCAACTACT
TGGTGTCGCA GATCTCGAAC AAAGTCTTAC ACAGCAGCCC ATGCCTCGCA TACTTGATCT
TGACCCCATT GTGGACATGG CAGAAAGCGC TTGCATCAGG CACATCGTCC GTCCAGCACG
TAATCTCATT CATATCGATG TCACGCTGTC TCATGATCTA TGGGTTTCAG TGTTTTCAAT
GTGCTGGGGC TCACTCAGCA GATCTCACCA GCTGGCTTTC ACGCCTTATC TCATCAAGCT
GCTATCCAAA TCTCACCTCC AGAAACAAAC TGAGATGCGT CCCAACGTGG TTCAAGCGTT
CCTCGATGGC ATTGCGGCTT GTACAGTCCC TATCACACTT CCCCCGACGC TCGTCAGGTT
CCTTGCCAAA AATTTCAACG CGTGGTATGT CGGTTTCGAA ATCCTTACCC GGCTCACCGA
CGTCTATCGC GGGGATGATG GATTGCGGGA AACGTGCGCG AGTGCGTTGA GTGAACTATA
TGCGGAGCTC TGCGAAGAGG ATATGTACTA CGGCGTTGCT CGTAGTCGAT GTCAGTTTCC
TGAGACAACA GGTGCCCTTA CCTATGAGCA GAATGGGCTT TGGCCCAAAG CTATCGAGTT
ATATGAGCAA GCACAGATCA AAGCTCGCAA CAATATGCTC CCATTCAGCG AGGGCGAATA
TTGCTTGTGG GAAGATCACT GGATCCTATC GGCTCAGAAG TTACAGAACT GGGAGAACTT
GACAGAACTG GCGAGAATTG ACAGTGATGC GGATCTCCTG TTGGAATGCG CTTGGAGATT
GTCAGACTGG GCATCACCTG ATCGAGAGGC CATTGATCAG AATCTGGCAC GTGTTATTGA
TCACCCCACT CCTCGTCGAA AGACTTTTGA GGCCTTTGTG GCCCTTCTGC GTTCGCATAT
GGCTCGAGAA CCACCTAATG AGTTTCTGCG GGTACTAGAC GAGGCTCAGC AGGTCACTCT
ACGCAAATGG ATCAGTTTGC CAGCACACAT GACCAACGCA CATCTACCTC TCCTTCAAAT
GTGTCAACAA GTAGTCGAAC TGGGCGAGGC AGCTCGCGTT TTTGACAGTC TTCAAATGAC
CAATCAAGCG AACCTTGAAC TGCGATGTAA CAGTGACTTG AAGCCAATTT TTCAGACCTG
GCGGGATCGT CTACCCAACT TTTGGGATGA CATCAGCGTC TGGAGTGATC TTCTCGCCTG
GCGCCAGCAT GTTTTCCAGG CTGTCACCAA GGTCTACCAT CCACTGGTCG CTCAGCCCGA
TAACGCAACC TACGGTTACA GAGGATTCCA TGAAACGGCG TGGATGATTA ATAGGTTTGG
TGAAGTGGCA CGTCGTCACG GCCTTCTCGA TGTTTGCAGC GTCTCTCTCA ACAAAATATA
CATGTTACCC AATATTGAGA TCTCTGAAGC TTTCCTCAAG CTTCGCGAAC AGGCCCTCTG
TTTTTTCCAA AAACCCGAAA AATTCAACGA AGGTCTTGAA AACATTAGCA CCACCAATCT
CAAATTCTTT GGCCTATCTC AGCGCGCAGA ATTCCTGACC TTCAAAGGGA TGTTTATCTC
TCGCTTGGGT CAGAATGAAG AAGCGAACGC CGAGTTTGCC CACGCCATCC AAACTGATTG
GAATCTTCCT AAGGCGTGGG CTGAGTGGGG TCGCTTCAAT GATAATTTGT ACAAAGACCG
ACCTGAAAAT CCTGCTACTG GTCCTCCAGA ACCCGAACCT GGAAAGCCCA AGATGACCGA
TGCGCAATGG CAAGAGTCTT ATTCTCAAGA CCGTGCAATC CTTGCATCGA GTGCTGTGTC
ATGTTACCTT CAGGCTGCTG GCTTATACAA CAATCACAAG TCTCGAGGGT TACTTTTGAG
AGTTCTATGG TTGCTTGGTC TCGATGACAG TCACAACACT ATTTCTAAGG CATTTGAAAA
CTATAAGGGT GACTTGGTTA TCTGGTATTG GATTACTCTT ATTCCTCAAC TTCTCATGTC
TCTGTCCCAT CGTGAAGCCA GCCATGCAAG ATTGGTTTTG ATGCGCATAG CCAAGTCTTT
CCCACAGGTG AGTCTTAACG TTCGCACTAC GATCTCCAAC TAATACTCTA TTCAGGCGCT
CTTCTTCCCT CTCAGAGTTT CTCGAGAGGA TTTCGTAAAC GTTAAGAAGC AACAGCAGAT
GCAGCAACGG TTCGCCGCTG CTCGTCGCGC AGAGAATCAA GCCAAAATAG CCGCGGCTAA
TGCTATTGCC GATGCCTCTG GTCAGCCTTC CGAAACTAAA GAAGTCAAAG ATGAGCAGTC
TGCGGCCAAT GCTACTGGGA TTCAGCCTCC GGCGTCGAAT GGGCAAGCCA TGGGTCTTGC
CGCACAGAAT CAGAGTCCGT CTTCACAAAT TCCCCGTCAA CCTTGGGATC ATGTTGAAGA
GATTATGAAC ATGCTCAAGA CGGCATTCCC CCTGTTGGCA TTGACCATGG AGAAGATGGT
CGATCAGATA TCATTGAGAG CGAAGCCAGC GTCTGATGAG GACATATATC GGTTCTTCTC
TGCTTTATTG GCAGACGCGA TGCAGCAATG GGGTGGGAGG AGTGGACTGC CCAATGATGA
TGGGGAACTC AATGCGCAGA CTAAAGATAA TCTTGCAAAG TTTGCAACCA ACCTCAATGG
AGAGTTGAAG GTGAGTGCTT TGCTTTTACG ATAGAGGAGG ACTGATCTCA TATGTGCCTT
AGGTCATGAT CGAGAAAGAC TTCATGGTAG AGATGCCTAA ACTACGAGAG TACATCAGAC
GCCTTCAGAG ATGGCGAGAC CTGTATGAGA AGAACCTAGA TGACCGATCT AAGACTCTGC
CCCTTGATCA AGGAGGCTGC AATCTCACGG AATTCCACCA CACGAAGTTC GATGATGTTG
AGATTCCTGG TCAATACGTT CAGGTGAGTT AGTCCTCCAA CATTTGGAAA ATACTCATAC
TGACTATCAT GCGAAAAGCA CGTTGATCAA GGTGAAGAGT TCATCAAGAT TGCGCGTTTC
GCCCCTAGGG CCGAGCTTGG TCGAGGTCAT GGCTATTGCT TCCGGCGCAT CACAATGATC
GGCAATAACG GTGTAACGTA CACTTTCCAC GTACAAATGC CTGCCGCGAG GCACTGTAGA
CGAGAAGAGC GTTTGACACA GTTGTTTAGG ATCATGAACA GGTGAGCAGC CAACTCCGGT
AGATAAAGCT AATGTCTGTT AGTGTACTAT GGAAAAGAAA AGAGTCCCGT CGCCGAAGTT
TACAGATCCA TCTTCCTACC GCCACCCCTC TAGCTCCCCA ACTTCGTCTT GTTCAATCCG
ACTCGTCCTA TGTCAGTATG CAAGAGATCT TTGAGGACTT TGCGGCGTCC AAGAAAATGG
CTCGAGAGGA TACAGTCTTA TCCTACTTTG ACCGTATCAA GGAGCTTCAT GATCCGGCCA
TCCCCAGGGT AAGGAGCTGT GGGTCAAGCA AAGTATCTGG ATTGACAATT GGACAGAATG
ATCATAGATA TATCCAGCTC AGAGCAGAAT TAATGGAAGA AATTCGGGTG AAGATGGTGC
CAGAGACGAT CATTACAAAC GTAAGCAACA CTGAAACAGC TAGACTAGAA TGACACTGAC
TGCATTTCTA GTACATGATC AAATCAATGA ATGGTCCCGA GAACTTGTGG CTAATGCGCA
AGCAATTTGC AGCCCAGACA GCTACGACCA TGTTCCTTAC CTTTGTTTGC TGCCTTAGCA
ACCGCACACC TTCTCGCTTC TACATCAGCC GCAAGACTGG TTTGATGTAC ATGTCTGAAA
TCCTTCCTGG TACTATTTTC CGATCCCCTT TCAGTTATGT ACTGACATTA ATGCTTTTCA
CGTAGCTTTC GCTCCCGGAC AACCACTAAT TAACTCCTCC GAGGCTGTAC CCTTCCGTCT
CACACCCAAT ATGCAGCATT TTGCAACTCG TGCGGGTGTG GAAGGTGTCA TCACGGGAAC
ATGCACAGCA ATGGCCAGGT GTCTCACAGC CCCAGAGTTC GATCTGTCCG GCACTTTATC
GCTGTTCATT CGAGATGAGG TTCGTCTAAC AGTCGTCCGT CGGACTACTT CTTTTGCTGA
CCGTCGTCTT TTCCGTAGCT GCTGATATGG CACAATACTT ACATGAAAGA CTCTCGTTTG
GAAAGTCCTC TCCTTGGTCA CGTGTACAAG AATGTGGACA GCTTCATTCG CAGGGTATCC
ACCATGGGCT TCATTGGTGA AAATAGAGAC AGAGTGAGTA CAATGCACCC ACGGTACAAT
GCATACTGTT GTTTAACTAA TTCACTGACA TTGTCTTACT AGTCATCCAA CGCTCCGCCC
GTCGTTCATG CCATCATTTC CCTTATATCC CAAGCCACAT CAGCTGTCAA TCTCGCCCAA
ATGGGTGAAA CGTATATGCC GTGGTACTAA TTGTTTCTCG CCATGGCGAG GAAATGGGGA
AGTCAAAAGT AGACATGTTT CCAGTAGTTG GCACGTTATC AAGTTACAAT TCTAGATTTT
TTTCGCGATC AGAGCAATTC CGTATTTTTT TCTCTTTTGT TGTATTTGGG AGTTACAGTC
TGTTTGAACA AGCCATGCAT AATAGTCGGT GCTTAATGAG TACCTCCCAG
 
Protein sequence
MPPPSYTSNG PMSVSDCEFL ASQLLEPTVT ARKKLEIALE LRDSAENNRD FGFYDKYLSI 
FIPALISILG DEKSITFVKD NVEQRFRHTL LAFLQRLPHT EPFRHHMSSV MELCVKLLKI
ENEENALLCI KIMIDGLRSN KDQMEPFTEP FLDLVKQMYA NIKAVVEKEF GPSGGGSKPV
STQGEGSANG QQSQQQQASS SSHAILPHAL HSPKVLTECP IAVVLIFQTY KSIMQTAMLD
FYPLVIDSIK IQPEPQRLAH LEAKEKGETF VGVASGITNR EMFAELVKAQ VKACTMAFLA
YVLRGNQGNN RDYVNVFPEA CARLLRDCPP EDVITRKELL VATRHILTVD SRSSFIPYID
VLLEERVLVG TGVSSREMLR PLAYSVVADL IHHVRNELPL QQLIRVVYVF SCNLNDSTFS
SSIQTMCAKL LNTIIDSIYN KADTNEMSKI LKGMFFTFLE KLSAMSDAHD RLKALAARDK
GKGRAKEEGD EDVEVTDTSD EASDKLIHGW RDIEQAMPVH SVAYANESVD SFCRDSRYLF
KTILHTFRTL LSYTRQGENP PPQPDGEVLS RFFECSIKCF AIFDVLNRDP REAKEALELL
SEIVLLFEPH VFAEVWTSHM EFFSDISITN NQVFSLLQMV ITHESVSHQL VSILLKYLME
KLPEIGRMDK QRATLMLKMF KIAFLAINTY ITSNEAVLVP HLQKLIMSSF ESAAKAEDSS
FYYQILRALF RSIGGGRFEA LYKEVLPILQ EMLDHLAYLL DHSPDEISKD IFVELMLTVP
VRLTNLLPHL SYLMKPLVRA LSAGPDLVSQ GLRTLELCID NLTADFLDPT LAPVLRDLMA
ALHQLLKPIP ANREHASAAL KILGKLGGRN RRFQEVHDNL EYRLLSDRLV VPITFEGTRH
HLDLTPLVNS AGKAIDNEAD LLREDGLQVL MYSALTIFEK GAPGPEGNAT FRTTMTRLFF
ACDRPVIGER SLIFVRDLCR RAFALELGRT DGIEHPIKPG PDHSRRRFLP LTNALSDAFL
ETLTISKAAE QKGLSDLLAT IVMDFKELAL SPRFQGVVDG HRSFDRMVTF FALRLVTLCH
EEAWSKKMAG VSAISTFAHK IELSRKNIID LQLDFVRALL YCLRDAPKDV PRSADDVIGL
IKHLIRTCQS QDDGKPRIGR LIETFVGELN SQSKLARDAA QQCIEVLAEV TAQTVPELIT
NIAKVKLLSV DHGPIYSKPL RALPFAMQVG NISAVTYLMD LRPSVVETSE EFIRLLHEVL
ALADVDDANL VSKPATHKQE SWLKALRICC LRLLKSSMAT PDFMNKPTQG QLRARIIQVY
FKHVYSQNPE IVAVAHEGLR DVLQQENRLS RDVLQKGLRP ILVNLADAKR LSVSGLDGLA
RFLELLTNYF KVEIGVKLLD HFKTLGDHQM LVKAAYAPLD DNHNIARMSR LVNIFRLLPS
SAIQYLNDLV ANVVEVEALL HQSQPGPFTE YLGRYLDRYH ANAVQNLFDN IRNTRYVWTY
RNIITSGSAP HLVEEFASRG EALCQLCFSN PEVTDLVLPG LLLVRDLSRV QSSWLSDSEP
VLEPMVNVWR MIVNKSRDPK ADITGYQFQQ MPSLLLEMFM ASLEQQQHIP LLFHVVEAYE
VRAAFERSHV TFFLYRQVAL QESVEYRREV IEYFFSLYEA EDVPWTYKTN ALRVIVNPTL
RVYFGDPNHD GSLISAQLVR KIANLMWRPL SATTSSKQRE DTHLIEVFAL TTMLVQHCSA
KVNEARKEIF KLAWMGINLL EPTVKLMAYV LAARFMATYD TPVKFVRLTW TGVLRLKDTD
NRVLYRQAID TLASSLSVRD PPPANGTPEW AKLLRTVLIE EGHATNQLVT VCELLVHHPD
LFYDYRELYV PHIANSLGKL AFAQAATPEL KKLTVDIVEL IFNWEKRRMA ARDGETMDVD
EGPKRGADQS VEQGPTKKQR VDRAGTAVSG SSGGGWAAPS QVRELMTAHL LRLVSTSADP
VTRNGLTKRA LMLFKDILGP KGLPNVHVKL GFFHRTMTQD INPNTKPTVA NSTEVIAAVA
AAVKDTQWVK ANLSLLSKLL EKVWVSPETD LHEVVAPLTE DLFSEMPADE SAEAEPDAKA
LLAFVQTAVN DGLSASLRST LSLPGTLFLL KTWLKTQPGV LQSEGISSAL LKVLANLIKL
HTTSNQPANA ANEPDIVRLI TSVLDILRDR VNDLREQRKH LYSSITILVD RSLNPMLCRY
LLQLMRHWVI GSNDGAAHGK EKALILLRMM CFESRSDQLF QEFLEVVYDV YQQENLRGSD
ITHRLEPAFL LGTRSKNAEQ RARFLDKLEQ NLPRSIDSRL QYLCSFQNWD TLADSYWIPQ
ILSQLLGVAD LEQSLTQQPM PRILDLDPIV DMAESACIRH IVRPARNLIH IDVTLSHDLW
VSVFSMCWGS LSRSHQLAFT PYLIKLLSKS HLQKQTEMRP NVVQAFLDGI AACTVPITLP
PTLVRFLAKN FNAWYVGFEI LTRLTDVYRG DDGLRETCAS ALSELYAELC EEDMYYGVAR
SRCQFPETTG ALTYEQNGLW PKAIELYEQA QIKARNNMLP FSEGEYCLWE DHWILSAQKL
QNWENLTELA RIDSDADLLL ECAWRLSDWA SPDREAIDQN LARVIDHPTP RRKTFEAFVA
LLRSHMAREP PNEFLRVLDE AQQVTLRKWI SLPAHMTNAH LPLLQMCQQV VELGEAARVF
DSLQMTNQAN LELRCNSDLK PIFQTWRDRL PNFWDDISVW SDLLAWRQHV FQAVTKVYHP
LVAQPDNATY GYRGFHETAW MINRFGEVAR RHGLLDVCSV SLNKIYMLPN IEISEAFLKL
REQALCFFQK PEKFNEGLEN ISTTNLKFFG LSQRAEFLTF KGMFISRLGQ NEEANAEFAH
AIQTDWNLPK AWAEWGRFND NLYKDRPENP ATGPPEPEPG KPKMTDAQWQ ESYSQDRAIL
ASSAVSCYLQ AAGLYNNHKS RGLLLRVLWL LGLDDSHNTI SKAFENYKGD LVIWYWITLI
PQLLMSLSHR EASHARLVLM RIAKSFPQAL FFPLRVSRED FVNVKKQQQM QQRFAAARRA
ENQAKIAAAN AIADASGQPS ETKEVKDEQS AANATGIQPP ASNGQAMGLA AQNQSPSSQI
PRQPWDHVEE IMNMLKTAFP LLALTMEKMV DQISLRAKPA SDEDIYRFFS ALLADAMQQW
GGRSGLPNDD GELNAQTKDN LAKFATNLNG ELKVMIEKDF MVEMPKLREY IRRLQRWRDL
YEKNLDDRSK TLPLDQGGCN LTEFHHTKFD DVEIPGQYVQ HVDQGEEFIK IARFAPRAEL
GRGHGYCFRR ITMIGNNGVT YTFHVQMPAA RHCRREERLT QLFRIMNSVL WKRKESRRRS
LQIHLPTATP LAPQLRLVQS DSSYVSMQEI FEDFAASKKM AREDTVLSYF DRIKELHDPA
IPRVRSCGSS KVSGLTIGQN DHRYIQLRAE LMEEIRVKMV PETIITNYMI KSMNGPENLW
LMRKQFAAQT ATTMFLTFVC CLSNRTPSRF YISRKTGLMY MSEILPAFAP GQPLINSSEA
VPFRLTPNMQ HFATRAGVEG VITGTCTAMA RCLTAPEFDL SGTLSLFIRD ELLIWHNTYM
KDSRLESPLL GHVYKNVDSF IRRVSTMGFI GENRDRSSNA PPVVHAIISL ISQATSAVNL
AQMGETYMPW Y