Gene PC1_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPC1_0994 
Symbol 
ID8131923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePectobacterium carotovorum subsp. carotovorum PC1 
KingdomBacteria 
Replicon accessionNC_012917 
Strand
Start bp1145235 
End bp1159859 
Gene Length14625 bp 
Protein Length4874 aa 
Translation table11 
GC content57% 
IMG OID644864277 
Producthypothetical protein 
Protein accessionYP_003016579 
Protein GI253687389 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGATA TGAATGTTAT TGCTGGAAAT GTGAATATCC TGTCGCGCCA GAATGGCCAG 
TTCATAGAAA ACATTCCACA AGGTACAAGG AACGTTGAAC TACTGGAAAG CAGCACGGTT
CGAATTCATG GCACACCAGA TATGGTGTCT CGCTATGAGC GCGTCGGCAA CGATCTTATC
TTGCACATGA AAGACGGTAC AACCGTCCGC TATGAAAGCT TCTTCACGCT GGATGCCGCG
GGTTACCACA GCGAGCTGGT TTTTGACGAT GGTACGCGTC TCATCCATGC GCAATTCTCT
GGTGCAGCAG CGGCTGAAGG GGCAGCACTT GCGGCAGAAG CCGTTGCCCT GACGCCAGAA
TATTCTGCGC TGGGCGACAT GACGTCATTA CTGATTGGTA GCACCACCAC CAGCACGCTA
TCTGCAGCCT CATTGGGCAG CATTCTGGGG GCGGTCGCTC TCGGCGGAGC CGCTGTTGCA
GGGGTGGCGG TCGCCGTGGC CTCATCGAGT GATGACAACC ACACTACGCC ATCGGTACAG
CCAGAACCCT TAACCATTGA TCCCTTTGCC GGAAATAACG GACTGAACCG TACGGAAATC
GGCCAGCCTC ATATCCTCAG CGGGAAAACA ACGGGCGTCA GCGCGGGGCA AACCGTCACC
ATCACGTTGA ACGGCGTGGT TTACACTACC ACCGTTGCCG CCGATGGAAC CTGGCGTTTT
ACCCTGCCAG CGGATGCGTT TACCGGTCTG GAAGACGGCA TTTACGCATT GAAAGTCAGC
GTGCCGGGTG CGAACGGCGT GGTTCATGAG AAAACCCTCG ATCTCACCAT TGATACCCTG
CCTCCGCATT TAACCGTGGA TAAATTTACG GGTGACAATT ACCTCACCGT AGGAGAACTG
GCGAATGGCC AGGTTCTGAA CGGAACCGGT GAGGCGGGCC GGAACGTCAC CATCACGCTC
AATGGGAAAA CCTATACCAC GACCATTAAT GCTGCCGGTA ACTGGACTCT GACCGTACCT
GCCGCAGACC TTCGCGCACT GAGCGAAGGC GAACACGCCA TGTCGTTTAC GATCGGCGAT
AATGCCGGCA ATGTCACCGT GGTCAACCGC ACCATTATTG TTGATACCAC ACCGCCGGAA
CTGACCCTGT CGCCGTTTAC CGGAAACAAC CTGCTCACCG CCGACGAACT GCAGTCCTCA
CAGTCGGTCT CCGGCACGGC TTCCCTATCA GATGTCGGCC AGACCGTAAC CGTAACGTTT
AACGGCACGA CCTACACAAC GACGGTAGGA AGTGATGGAT CGTGGAGCGT TTTCATTCCC
TCCGGCGATA TGCAGGCGCT GACTAATGGC ACTTACAATC TGGTGGCGTC ATTAACGGAT
AAAGCAGGTA ACACCACGAC ACTGCCTCCG CAAACCATCA CGGTGGATAC TAACGCCGAA
GCGGTCAACA TCAGCATCGT CTCGACGGAT GACCGCCTGA ATGCCGTAGA AGCGGGACAG
CCGCTGACGG TCAGCGGAAC CACCGCCAAC GTTGCCGCAG GCCAAACGGT TACCGTCAGC
CTCACCGTCG CGGGGGCAGT AAAAACCTAC ACCACAACAA CGGGGGCGGA CGGAAAATGG
TCTGTGGACA TCCCCAGCGC CGATCTACTT CTGCTGCCAG ACGGCAGCCA CACTCTCACG
GCCAGCGTTC AGGGAATAAG CGGTAATACC GTCACCGTCG ACCACACGCT GGATGTACAC
ATCAATACTC TCCCCTCCAT AACGCTGACA CCGCCGTTCA CCGACGGCAT ATTAAATGCA
GCCGAAGCCG CGCAGGATCA GGTCATCAGA GGGGAAACCG GAATTAATGG CAGAGGGCAA
ACGGTTAGCC TGACGATCGG CGGGAATTAC GTTACCGGGA CGGTCGATGT CAATGGGAAC
TGGACGGTGA CGATCCCCAA AGACATCCTG CAAAGCCTGC CATCGGATAA CGTCAGCGTG
CTGGAGATTG TCGTCCGCGA TATCGCAGGC AACGAAACGA CCGTGACGCA GAATATCAGC
GTAGATACCA CGCCGCCGAC GCTCAACGTC TCCGCTATCG CTCAGGACGA TGTCCTCAAC
GGCGCAGAGC TGGCCGTCAA TCAGGTAGTC AGCGGAACGG CGTCACTCAG TGAAGCAGGC
CGCGTAGTCA CCGTTGCGCT CAATGACAAA ACCTATACCA CCACCGTCGG CAGCGACGGA
AACTGGAGCA TCACGCTGCC GACGGCTGAT CTGGTGGCTA TCGCGGACGG TAACCATAAT
TTGACCGTCA CGCTGACCGA TACCGCGGGC AACACCACAA CGGTTACCCG TCCGCTGACC
ATTGACAGCG GCGCAACCAC CGCCCCAACC ATTACCATCA ATAACGTCGC GGACGATAAC
GTCATTGATG GCGCAGAAGC CAAAGTCAGT CTGCAACTGA GCGGGACGAC CACTAACGTT
GAAGCGGGAC AAGTTGTCAC CATTAGCCTG AATGGGAAAA CCTATCTCGC GACCGTGCAG
TCCGGCGGCG TCTGGAGCGT CAACGTTTCA ACCGCAGATA TCGCCTTGCT CGCGGACGGT
GCGCACAGCA TTAGCGTCAA CGTGAGCAAC AAAGCGGGCA ACGCCGCGAG CGGAAGCCGC
GATATCAGCG TGGATAAATC TGGCGACAGC ATCGCCATCA ATATCATCGC TAACGACAAC
CTGCTAAATC AGGCAGAGTC ACTCCAGCCG CTGGCTATCA GCGGTAATAC CGCCAATGTT
CCCGCAGGAC AAACGGTTAC CGTGACGCTG AACGGGAAAA ACTACACCAC CACCGTTGCT
GCGGATGGCA GTTGGACGCT GCAAATCCCT AGCGCAGACC TCCAGCAGCT GTCAGACGGC
AATGCCACCA TCAGCGCTAG CGTGAATGTA GCGGGCGGAA CCGTAACGGA TGCGCAGACG
CTGGGTGTCC ACATCCATAC GCTGCCGCAG CCGACGATTG ATACCCCATT TGGTAACGGT
TCACTGAACG GCGCAGAGGC GCTGGTTAGC CAGACGATTA CCGGCCATAC CGGCATTAGC
GGTGCCGGGC AGACGGTCAT CTTGTCCCTC GGCGGGAAGT CTTACACAGG AACCGTTGAT
ACGGCAGGTA ACTGGAAAGT CACCGTCCCT GCGGCCGATC TCCAGCAACT GCCGGAAGGA
AACAATACAC TGTTGGTCAC GGCACAGGAT GCCGCAGGCA ATCAGGCAGG TAAGACGTTT
GTCAGCCATA CCGACTTTAC TGCCCCGACC CTGACTATCG GCACCATTGC TGGCGATGAC
ACCATCAATA TGGTCGAATC GCAAAGCAAC CAGACCGTCA ACGGCACCGC CTCAATCAGC
GAAGCGGGTC GTACTGTCGT TATTACCTTC GATGGACAGT TCTACACTGG CGTTGTTGGC
AATGATGGCA ACTGGAGCAT CAACTTACCC ACAGCGGCTC TGCGCGGCAT GGCTGATGGC
AGCTACACCC TGTCCGCGTC GCTAACCGAT GCTGTCGGGA ATACCGCCAT CGTCGAAAAA
TCCATTGAGC TCAGTGCAGA CCCTGCGTTC CAGCCCACGA TCTTCGTTAA CGCTTTTGTC
GATGAGAATA ACGTTATCAC CGCGGCCGAT CTCAAAGTCA GCCAGTGGCT GACAGGCACC
AGTTCGAATG TAGAAACGGG TCAGGTCGCC ACCATTCTCC TCAACAAAAA ATTCTACTTT
GCCACGATCC AGAGCGGCGG CAACTGGAGC GTAGAAATTC CTGCTGAGCA TATGGCTGAA
CTCAGCGAGG GAACGGTATC GATCTCTGCC AGCATCACCG ATATGGCCGG TAACGAGGGT
AGCCATGAAA TCTGGTCTTC GCTCGACACC AGCAATGACA GCATCTCCAT CAGTATTGTC
GCTCTGGATA ACCAGATTAA TCGACTTGAA GCCTCACAGC CGCTAACGAT TTCAGGGTCG
ACGGTCAACG TTACGCCAGG AGAAAGCGTT ACCGTTACGC TCAATGGCAA GACCTACACT
GGTACGATTG CCGCCAACGG CAGTTGGAGC GTGATCATCG ACAGCAGTGA CATGCTCGCC
TTACCGGATG GCACAACGAC CATCATCGCC AGCGTGGCCA ACCCCGGCGA TGTTCCTGTT
ACCGCCAGCC GCACGATTGA TATCCATATT AATAACCTGC CTCAGCCAAC GATTAACCAG
CCTTTTGGTG ACGGCATACT GAATATTACC GAGGCCGCCA GCGGGCAGAG CCTGACGGGC
AAAACGGGAA TCGCCGGTGG AGGTCAAAGC GTCCTCGTCA CGTTGAACGG GAAAACCTAT
ACGGCAATCG TCGATAATCA GGGCAACTGG ACTGTCGCGC TTCCCGCCGC CGACCTGCAA
TCCCTCCCGT CCGGCGTACA GACTATCCGT GTCGAAGCCA CTGACACGGC GGGCAACAGC
ATAGAGAGCA CGCGCGATGT TACCGTCGAC CTGACCAGCC CCATTCTGAC CTTGAAGCCG
CTGACTGGCG ACGGCATCAT CAACGCCGCC GAGAGCCTGA ACGATCAGGT CATCTCCGGT
AATGCGCTGC AATCCGATGC CGGACGCACC GTCACTGTCA CGATCAACAA CAAAAATTAT
CAGGCTCAGA TTCAGGCCGA CGGTAGCTGG AGCGCCACGA TCCCCGCTGC CGACCTTCAG
GCTCTGGCTG ACGGTAATTA CACCGTGACG GCAACCTTGA CCGATGCATC AGGCAACATC
GCCACCAGTA CCGGGTCATT AACGCTGGAT GCCAGCCCGG CTAACCAGCC CCTGCTCACC
ATCAATGCCA TCGCGCTCAA TAACATCATC GACGGGGGGG AAATCAACGT CGCGCAAATC
ATCAGCGGCG GCAGTCTGAA TGTTGAGGCA GGACAACGCG TTACCGTCAC GCTCGGCGAC
AATACTTACA CCACAACGGT TGACAGCAAC GGCCAATGGC GCGTCAGTGT ACCGTCTGTC
GATCTCCTTC ATCTGGCGCA AGGCGCACAT ACCGTCACGA TTGGCGTCAA TGACGTCAGC
GGTAATCCGG CAACGCTCAG CCAGACGATT ACGGTGAATA CCTCACTGAG CGGCATCGCC
ATCGATACCA TCGCTGGCGA CGACAAACTG AATCAAGCAG AGGTGGCGCA AGATCTGACC
GTCAACGGCA GCAGCCAGAA CGTGGCGGCA GGCACCACCG TCACCATCAT GCTGAACGGG
AAAAGCTACG ATGGCGTGGT ACAGCCGGAC GGTTCCTGGA GCATCATCGT ATCGGCGGCT
GACGTCAGCG CACTGGCAGA CGGCACGTCA ACCCTCACCG TGACAACGGT AGACAGCGCA
GGAAACGCGT TGAGCGGCAG CCGTACGATA GATGTGTTCA CCCACAGCAG CCCAACGCTG
ACGCTGAACA CACCATTCGG CGATGGCATA CTCAATGCCG CAGAGGCTGG CGTTACCCAG
ACGCTCAGCG GCACAACCGG TATCGCCTCA CCAGGGCAAA CCGTCACCGC CACACTTGGC
GGTGTGACGT ACACCGGGAT TGTTGATGCA GCAGGTAACT GGACGATTTC TCTGCCTGTG
AATGGCCTGC AAAATCTGCC GAACGGCACC ACGGCCTTGC AGGTTAGCGT CAGCGACGCA
GCCGGAAACA GCAGTACCCT GACCAGCAAT ATTACCGTGG CTCGCACGCC TCCTACGTTG
ACGACGGCAA GTTTTGCGAC CGACAACATT CTGAATAGCA CCGAGGTGCA AAGCAGTCAG
TTGCTAACCG GCACCGCTTC GCCATCCAGC GCGGGGCAAA CCGTTACTGC CACGCTGAAC
GGCAAAACCT ACAGCGGCAC CGTCGGCAGT GATGGCACCT GGAGCATCAC GATTCCATCA
GCCGATCTCA GCAATTTGTC CGATGGCAAC TACAGCATTG TGACGCGTCT GACGGATACC
GCTGGCAATA CCACTACCGC CACACAGGCT ATTGTCGTCG ATGCCAGCGC GCTAAACGCA
CCAGTCGTCA CGATTGGCAC CTTCGCTGGC AACAACATTA TTGATGGCGC GGAAGTCCGG
GTCAGTCAGG TGCTCAGCGG CACCAGCAAA AACGTTGAGC AAGGCCAAAC GGTGACGATT
AGCTTCAATG GCAAAGCCTA TACCGCACAG GTTCTGTCGA ACGGGAGTTG GAGCACCACG
ATCTCAGATG CGGATATGGC ACTGCTGACT AACGGCAGCC AGACCATTAC CGTCAGCGTG
AGCGATGTCT CCGGCAATAT CGCGACATCC AGCAGCACCG TCACGGTGAA CACCAATGCG
AGCGGGCTGT CTATTGCGCC AATCACCGGG GATAACCAGC TAAATGTGCT GGAAGCAACA
AACGGCATTA CGATTAACGG CAATACCGTT AACGTCGCGC CAGGCACGAA TATCAACGTC
ATACTCAACG GGAAAACCTA TACCGTGCAG GTGCAGTCAG ACGGTACCTG GAGCGCAAAC
ATTCAACCTG GCGATCTGCA AGCGCTGGGG GATGGCATCA TCGCGGTGCA TGTCACGGCG
GTCGATCAGG CGGGTAACGC TCTGTCGAGC ACACAACAAC TGGGCGTGAG CATCCATAAT
CCACCCGTTG CCTCGCTGAA TACCCCATTT GGTAACGGCT ACCTAAACGT GAGTGATGCG
CAAGCCGGGC AGACACTTTC CGGCACCACG GGGATTCACG CCGTGGGTCA AACCGTCAGC
GTTACCATCG GCGGTATCAG CTATACCGGC ACGGTTGACA GCAATGGCAA CTGGAGTCTT
CAACTCTCGC CAGCCATTCT GGGGACGCTA GCAGACGGCG TGCAGAATAT TTCCGTTACT
GTTACGGATA CGGCAGGCAA TACCTCTACC GTTCAGGGTA GCGTATTTGT TGACTTGACC
CCACCGGTGC TGACCATCAA CCCAATTGGC ATCGACGATA TCATCAACAT CGCGGAAAGC
CTGCAGCCAG TGGTCATCAG CGGTACGTCG CCTGTTAATG ACAGCGGTCG CCCTATTATT
GTCAACGTCA CCATCAACGG TCAGATCTAT CAGGGACTGG CACAGGCCGA TGGCACATGG
AGCGTCACCG TACCTGCTGG CGACTTTCAA AACATGCCGA ACGGCGTTAC GGCAATCACC
GCTACCTTGA CCGATGCCGC CGGCAATACC GGCACCGTCA GCCACTCGAT TGTTCTGGAT
ACCGACCCCG CGAAAGCACC AACCCTGACG ATCGCGACGC TGTCGACCGA CGATTATCTC
AATCTGGCCG AATCGAACCT GCCATTAACC ATCAACGGCA GCAGCCAGAA TGTGGAACAA
GGCCAGCAGG TCACCGTCAC GCTCAATAAC CAAACCTACT TTGCTACCGT GGGCGCGGAT
GGCAGTTGGA GCGTACAGGT TCCGGCAACG GATGTCGGCA ATGTGCCTGA TGGCAAACAA
ACCGTGAGCG CCAGCGTGAC GGACGTAAGC GGCAACCCTG GTTCAGCCAC ACACTCGATT
ACCGTCATTA CCGATGCCGC CAACCTGCCC GGCATCACCA TTACGACCCT ATCTGGCAAC
GACGTCATTA GTGCACAAGA TACGCAATCC GATTTGATCA TCTCGGGTTC TACAACGAAT
GTTCAGACAG GACAGCGAGT CACCGTCACG CTGAATAACA AAACCTATCT GGCGACCGTT
GGCGCTGACG GAAGCTGGAG CACAACCGTT CCCGCCAGCG ATGTGCAAAA TCTGCCGCAG
GGCAGTCAGA ATGTGACCGC AACGGTCAGC GACATCGCAC AGAACCCGGC CACGGCAACC
CATCCGGTTT CCGTCGATAC CGTTCCACCC TTGCTGTCTA TCGATATGTT GGTGGACACC
AGCGATATCG GTCTGGCGGA CGCGCTGGCC GGGCTACCGT TAAGCGGCAA GGCCGAAGCG
GGACTGTTGG TCACGATCAA AGTCGGTACG GCCGTTTATA GTGCGGTCGC CGATAGCAAC
GGCGTCTGGC AAATCGCCAT TGCGGCGAAC GACCTGCTGG CATTGGGCGA CGGCGTGAAA
ACACTGGGAG CCAGCGTGAC CGACGGCGCT GGGAATGCCA GTGCCGCCAG CATTGATATC
ACGCTAAAAA CACAGTCGCT TCCAACGCTG ACGCTAGATT CTCTCTATGG CAACAATGTT
CTCACTAGCG CAGAACTGGC GACGGAAACC ACCATCGGCG GCAGTTACAC CAATCTCCCT
GTAGGAACAG CGATTCAGGT CACGATCGGG GCGTACACCG TAACAGGCGT AACGCTCGCG
GGCGGCCTGT GGAGCGCCAC CATTCCCGCC AATGCGCTGA GCATTCTGGC AGATGGCAAC
GTGCAGGTCA GTGCGACAGT AACAGACAGC GCAGGCAATA CCGGCAGTGC AAGCGGCGCG
CTGGACGTTG TCATCCACAC CAATTTCGCC ATCACTATCG CCACACCGTT TGTCGACGGC
GTGCTAAATC AGGCGGAAAG CACGGTGGAC CAGTTGCTGA CGGGCACAAC GGGGCTGCTT
GACCCAGGCC AGAGCGTGTC GGTTTCGGTA ACCAACGGCA CGATCACTAC CACCTACAGC
GCTACCGTAG CAGCAAACGG CCAGTGGAGT GTGACGCTGC CCGCCGCCGA TCTCGTTGCA
TTTGGTGACG GCACACACAC CATCAACGTC ACCGTCACGG ATCATGCTGG CAATACAGGA
GCAGGAAGCG GAACGTTCTC CAGCGTCATT GTCGGGGTTC CCGTCGCCTC GCTGGATACC
CCTTTTGGCG ATGGCAAACT GAGTCTGGCT GATGCGCAGC CGGGTGCTAT GCTGTCCGGG
CAAACGGGGC TCACCAGCAA CGTGGGACAA ACCGTGTCGG TTAGTATCAA CGGCACGAAT
TTCCCAGCCA CGGTGAATGC CGACGGCAGT TGGACACTGT CGCTGGCTAG CCAGACGCTG
ATCGACCTAC CGGATGGCAC AGTGAATTTC ACCGTCATCG TGACCGATTC TGCGGGCAAC
ACCAGTACTG CAACGGCCAC AGCGAGTGTG CTGACCACCA CCCTGCCCGT GGCAACATTG
GATCTGCCTT TTGGCGACGG CATCCTTAAC GCCACTGAAA TTCAGGCCAT TCAGACATTA
ACCGGTAAAA CCGGTATTAC TAGCGCGGGT CAGGAAGTCA CCGTCACGGT GACCAATAAA
ACCACGCTGA TAGACACCAC CTTTACCGCC GTCGCTGACG GGTTGGGCGG CTGGTCAAGA
GAGCTGTCCC CTGCCGATTT GGCGATCTTT ACCGAAGGCA ATTACAGCAT TAGCGTCAAG
GTCACCGACT GGGTCGGCAA CGCCAATACC AGCACCCCGC GCGATGTTAG CGTAGCGCTG
ACGCTGCCTG CCCCGCTTAT CGACGTCGTC CCCTTTGGTC TCGATAATAT CCTGAGCAGC
GCCGAAGCCG CATCCGCACT CACCTTCTCT GGCCGCACGC AGATTGGTGG CAGCGGGCAA
AGCGTTAAAC TGGAGATCGA TCTCAACGGC ATCCGCTACG CCGCGACGGT AGACAGCGCG
GGCAACTGGT CAGTGACGCT GCCTCCAAAT GCGTTGAATT CACTCACCGA CGGCCAGCAT
ACCATCACGG TGACCGCCGT CGATGCAGCA GGCAACGTGG GTTCCGCACC GATTGCTTTT
ACCAGCGATT TCACCCCACC AGCCATTACG TTGAATACAC CGTTTGACGA TGGTTATCTG
AATATCGCTG AGGCCGCAAC GCTTGCAGGC AGGACGTTAA GCGGCAACGC GGGGGATGCG
GTGAGCGTGA ACGTCACGCT GGGAGGACAA ACGCTTGTTA CGCAAATTAG CGGAGGAGTC
TGGACCGCAA CGCTCACGCC GGCCCAACTT GCCCTGCTTG CTGATGGAAC CCAAAATATC
AGCATTACGG CAACGGACAG CCTGGGCAAC AGCGGAACAT TGAATTCTCA GGCGACCCTC
GCGGTGAAAG CCGCCCCAAC CGTCTCCATT ACCACCTTCG CGGGTCTGGA CGGCCTTGAC
TATGCCGAAA GCCGGACTAC GCAGTCAGTC AGCGGTACCT CGACCGGGCT GGAAGTCGGG
CAAAACGTGA CCGTCAGGCT GAATGGCCTA GATTATCAAA CCCAAATTCT CAACGGTGGC
TTATGGAGCG TTAACATCCC TTCCTCTGCG TTGCTGGTGC TGGCCAACAC GACCTATTCA
CTTTCCGTCA GTGCCGAGGA TAAAGCAGGG AACCCAACAG ACCCCTCAAA CGTTAACTTT
AACGTCAACC TGACGCCACC TCCGACGGTG ATGACCATTA ACCCTATCTC GACCGACAAC
ATCATCAACG CCGTCGAGAT CAACGGCAAT ATCACGATTT CCGGGCGTTC GATCGGGCCA
GCGTCAGCGA TGACCTCGGT ACAAGTTTCG GCCAACGGAG TGCTACTTCA GCCTAGTCCC
ATCACCGATG TCAACGGCAA CTGGAGCATC ACCATCCCGG CTCTGCCGAC ATTCTCCTCG
CAGGGTGAGG TGTTCATCAC CGCCACGTCG CTGGACGCGA CCCTGACAAC CATCGTCACC
GTGGACACCA TTGCCCCTAC GCTGGATATC GTTTCCTTTG CGTCGGACAA CGTGCTCAGC
GCCACGGAAA TGAGTACGGC GCAGGCGATT ACCGGTACGG CATCCATTAC AGAAGCAGGA
CAAATCGTTT CGATTAGCCT GAATGGAAAA ACCTACAGCG CGCAGGTTTC CGCAACAGGT
GCCTGGAGTG TCAACGTTCC CGCTGCCGAT CTGGCACAGC TAACCGATGG CAACTACACC
ATCACGGCGA CCCTCACCGA TAAAGCGGGC AATAGCACCA CCACGACACA GACGGTTGCC
GTCGATACCG CTATCCCACT GCTTAGCGTT ACGCTGTTTG ATGACAATAT TCTGACGTTG
GCAGAGGCGC TGGCGGGAGG GGCGATTACC GGGACAGGCG AAGTGGGCGC CACCGTTACG
CTAACCGCAG GCCCGCTGAC AGGCACAACC ACTGTCGGCC CGAACGGCAA CTGGAGCATT
CCGGTGCTGT CCGCCAATCT GCAAAACCTG ATCGATGGGC CACAGGTTAT TGGCGTAACG
CTGACCGATA CCTCTGGCAA CACCACACAC CTTGATGTCA CGTTGGATGT CGCGCTGAAT
AAGACGCTCG GCGCAGGCAT TACCGATATC TTCGGTAATG ACGGCATCCT CAATCTGGCC
GAATCCCTGG TGACGCAAGT CATCAGCGGT AACGCGACCG GTGACTATCT GGGTGCGAAA
GTCCAGGTCA CCGTGCTGGG AAATACGGTG GAAGGCACCG TGGGCGCCAA CGGCGCCTGG
AGCGTCGCCC TTGCTCCAAA TCTGTTCACA GGCCTGAGCA ATGGCTTACT GGCGGTTAAC
GTCGACATTA TCGATTCGCA CGGCAACGTG AAGAATCAAC TGGTCAACAT CGACGTGTTG
AAGTCCCTGC CAGTCATCAA TTCGGTTGTG GCCTTCACGG ATGGCGCGTT GAATGCGGCG
GACGTCGCGA CCAGCCAGAT TATCAGCGGC GTGGTCAGCA ATGTGGACAT CGCTGCGGGC
GCAACGGTCG CCGTAACGTT GGGCAACAAG ATTTACACTG GAATTAGCGT CGGTGCAGGT
GGAGCCTGGA GCCTGTCTGT TCCCGCCCTC GACTTGCAGG CGCTGCAGGA TGGAACGCTG
GCATTGGGCA TCGCGGTCAC CGATCACGCA GGCAACACGG CCAGCCAGAT CGTGAACGTC
CCAACAGTGA TCAGAAACCT GCCAAGTATT ACGTTGAATC CGGTCTTTGG CGATAGCCTG
CTGAATCTGG GCGACCTGCT GGTCAATCAA ACACTCAGCG GTACAGCCAC CGGCCTGGCA
GGCAGAACTA TTACGCTGAG CATTGCGGGT TCACAAATCG CCACAGCCGC CGTCGGCGCT
GATGGCAAAT GGAGCGTTGC CGTCACCCCA AGCGTACTGG GGATCCTCCA GGGACTGGGC
AGCGGCGATT TCACCGTAGC CGCTACCGCG ACAGACAGCG TAGGCAATAC CGCCAGCGGC
AACGCAGGCA TCAAGTTCGA TTTTGCCCAA CCGGTGATTA CGCTAAACCC GGTATTTGGC
GGCGATGGCT TCATCAACGC GGCAGAAGCG CTGGTAGCGC AAACGATCAG TGGCGTGGTG
ACCAATGCCA GCGCCGGATC GCAGGTTGCG GTTACCCTTG GCGGTAAAAC CTTCCTGTCT
ACGGTCGGGG CGGGAGGCGC GTTCAGCCTG ACGCTGCAAC CGTCCGATCT CAGCGCGTTG
GTCGATGGCA ACACCACGCT GAATGTGTCG ATCACGAACA CCTCCGGCAA TATCGGCACG
ATCAACAACG CCATCAATAT TATTGCCAAA AACCTGCCGA CGATTAGTCT GGGTTCACTG
TTTGGCGGCG ATGGCTTCCT CAATGCTGCG GAAGCGGCAT TGACACAAAC GATTAGCGGC
ACCACCACCA ACGCCATCGC GGGCTCATCC ATCGTCATAC GCATTGGCAC GTTGACACTA
AACGCGACGG TCGGCAGTGA CGGTACCTGG AGCGCCAGCG TCACGCCACT GCAACTGTCC
GGATTGGCCA ACGGTAACCT CACCGTCAGT GCGACGGTGA CCGACCCGGC AGGCAACAGT
AATAGCATCA GTGCAGGTCT CAACGTTTCC ATATTGCCGC CGACTATTAC GCTCAACCCT
CTGTTCAACA ATGGCATACT GGATCTCACC AGCTTGCTGA GTGCACAAAC CATCAGCGGG
ACGACGACCA ATGTGGCAGC CGGTACAGCG ATCAACGTCA CGCTGGGCAG CAAAACCTAC
ACTACAACGG TCGGCGCAAA CGGTAGCTGG AGCCTGCCAG TTCCTAACCT CGATCTGAAA
GCTCTCACAG ATGGCATCAC CAACATTGGC GTTCGTCTGG TTGATGCAGC GGGTAACGTT
GGCCAGCAAG CCGGTACCGT CAGTGTGGCG ATCAACGGCC AGCCGACGCT GACCCTCAAC
CCGCTCTTTG GCGGCGATGG CCTGCTCAAT GCGGTCGAGG CTGCTGCGGG TCAGATTATC
AGCGGAACCA GTACCAATGC GATAGGCTCC ACTATCCAGA TCTCGCTAGG CACCAAGACC
TACTCTGCCG TGGTGCAAAG CAATGGTTCC TGGTCCGTCA GTCTGCCTTC ACTCGATCTC
AACAATCTGA CTGACGGCAC ACTGTCCCTT AGCGCCTCTC TGACCAACGC AGCAGGAAAA
AGTGCCAGCG TAGGGGCTTC GATCGGCGTG GGCGTTCACA CATTGCCAAC GGTCAGTCTT
GGTTCTCTGT TTGGCGGGGA CGGCTACCTG AATCTGGCCG AAGCAGGCAT CAACCAGCTC
ATCAGCGGCA CCACGACCAA TGCGGCTGGC GGTAGCGTCA CGCTGACGGT GGGCGGACTG
GTGCTCACCG CTGCGGTCGC CAGTAACGGC ACCTGGAGCA TCAGCGTGCC GAGCGCCAAC
CTGCTCAACA TTGCCGATGG CAATTTGACC GTCGGCGTCA CCGTCGCCGA TCGCTATGGC
AACACTAACA ACACCAGCAG CAATGTGATC GTGAAAACGC ACCAGTTGCC GCAGTTGGGT
ATTGATGCCG TGGGGTCACT GATCGGGAAT ACCATTGGTC TGCTGACGAA CGGCGTCACC
ATCAGCGGAG CATCGCGCTA TGTTCAGCAA GGCGCTAAGG TCACCGTCAC GCTGCTCGGC
CAGACGCTAC AGGGTACCGT CGGCGCCGAT GGCAAGTGGA GCGCTCACTT CAGCAGTTCG
CTACTGGGCA TCAACGCTTT CAACGTGGGC GCGATTTTGA CAGCGCTATT GGGAACGGCG
GTTGAAGCCT CCGTCACCGA TCAGGCGGGT AACTTCACTG GCGTATCCGC TGGATTAACC
TCCGGTATCT CGCTCGGGTT ACCGCTGATG AGCATGATGG CGCTGGATGT TGACGACAGC
AGCCTTGCGC AGGTGTCTTC GCTGTCCAGT GAACAGCTCG ATGGCGAGGA AAGCGCCGTA
GTGCAACCTC TTCAAGTCAG CTCGAAAATG GCGGCTGCCA CGCTCAGCGA GACACACACC
ATCGACAATA GTGCCAGCAC CGAAGTGGAA GACAGCGTTT ATGCCATCGG TGGCGTCGTT
GTCAACCTGG CCGACGGCAC TGTGCAAACG GGCACGGATG TCGAAGGCGG AGAAGGCGAC
GACATTATTA CGCTTTCAAC GCTAGACTTT ACCCAGATCG ACGGTGGCGA CGGCATTGAT
ACGCTGGTTC TGGATGGCAC CGAGCTCAAT CTGGACCTCA CGGCGTTGGG GCTGAAAATC
GATAACGTAG AGATCTTCGA TTTAGGCCAA AACGGCACCA ACAGCATCAC GCTGGATCTC
GACCGTGCCC TGAATGTGAC CGACCGACCA GAAGATGATC TGCTGATCGT CGGGGGCGAA
GGCAATCAGG TGAACCTGAT TCCAGGCGAA GGAGCCTGGA GCACGGTAGG ACAGCGCGAT
ATTGATGGCC AACGCTTTGA TGTCTACCAC CACTCATCGC TGGATAGCGC CAACAACCTG
GGAGATGTAC TGGTGCAGCA AGGCCTGCTC GTCAATATGG TGTAA
 
Protein sequence
MSDMNVIAGN VNILSRQNGQ FIENIPQGTR NVELLESSTV RIHGTPDMVS RYERVGNDLI 
LHMKDGTTVR YESFFTLDAA GYHSELVFDD GTRLIHAQFS GAAAAEGAAL AAEAVALTPE
YSALGDMTSL LIGSTTTSTL SAASLGSILG AVALGGAAVA GVAVAVASSS DDNHTTPSVQ
PEPLTIDPFA GNNGLNRTEI GQPHILSGKT TGVSAGQTVT ITLNGVVYTT TVAADGTWRF
TLPADAFTGL EDGIYALKVS VPGANGVVHE KTLDLTIDTL PPHLTVDKFT GDNYLTVGEL
ANGQVLNGTG EAGRNVTITL NGKTYTTTIN AAGNWTLTVP AADLRALSEG EHAMSFTIGD
NAGNVTVVNR TIIVDTTPPE LTLSPFTGNN LLTADELQSS QSVSGTASLS DVGQTVTVTF
NGTTYTTTVG SDGSWSVFIP SGDMQALTNG TYNLVASLTD KAGNTTTLPP QTITVDTNAE
AVNISIVSTD DRLNAVEAGQ PLTVSGTTAN VAAGQTVTVS LTVAGAVKTY TTTTGADGKW
SVDIPSADLL LLPDGSHTLT ASVQGISGNT VTVDHTLDVH INTLPSITLT PPFTDGILNA
AEAAQDQVIR GETGINGRGQ TVSLTIGGNY VTGTVDVNGN WTVTIPKDIL QSLPSDNVSV
LEIVVRDIAG NETTVTQNIS VDTTPPTLNV SAIAQDDVLN GAELAVNQVV SGTASLSEAG
RVVTVALNDK TYTTTVGSDG NWSITLPTAD LVAIADGNHN LTVTLTDTAG NTTTVTRPLT
IDSGATTAPT ITINNVADDN VIDGAEAKVS LQLSGTTTNV EAGQVVTISL NGKTYLATVQ
SGGVWSVNVS TADIALLADG AHSISVNVSN KAGNAASGSR DISVDKSGDS IAINIIANDN
LLNQAESLQP LAISGNTANV PAGQTVTVTL NGKNYTTTVA ADGSWTLQIP SADLQQLSDG
NATISASVNV AGGTVTDAQT LGVHIHTLPQ PTIDTPFGNG SLNGAEALVS QTITGHTGIS
GAGQTVILSL GGKSYTGTVD TAGNWKVTVP AADLQQLPEG NNTLLVTAQD AAGNQAGKTF
VSHTDFTAPT LTIGTIAGDD TINMVESQSN QTVNGTASIS EAGRTVVITF DGQFYTGVVG
NDGNWSINLP TAALRGMADG SYTLSASLTD AVGNTAIVEK SIELSADPAF QPTIFVNAFV
DENNVITAAD LKVSQWLTGT SSNVETGQVA TILLNKKFYF ATIQSGGNWS VEIPAEHMAE
LSEGTVSISA SITDMAGNEG SHEIWSSLDT SNDSISISIV ALDNQINRLE ASQPLTISGS
TVNVTPGESV TVTLNGKTYT GTIAANGSWS VIIDSSDMLA LPDGTTTIIA SVANPGDVPV
TASRTIDIHI NNLPQPTINQ PFGDGILNIT EAASGQSLTG KTGIAGGGQS VLVTLNGKTY
TAIVDNQGNW TVALPAADLQ SLPSGVQTIR VEATDTAGNS IESTRDVTVD LTSPILTLKP
LTGDGIINAA ESLNDQVISG NALQSDAGRT VTVTINNKNY QAQIQADGSW SATIPAADLQ
ALADGNYTVT ATLTDASGNI ATSTGSLTLD ASPANQPLLT INAIALNNII DGGEINVAQI
ISGGSLNVEA GQRVTVTLGD NTYTTTVDSN GQWRVSVPSV DLLHLAQGAH TVTIGVNDVS
GNPATLSQTI TVNTSLSGIA IDTIAGDDKL NQAEVAQDLT VNGSSQNVAA GTTVTIMLNG
KSYDGVVQPD GSWSIIVSAA DVSALADGTS TLTVTTVDSA GNALSGSRTI DVFTHSSPTL
TLNTPFGDGI LNAAEAGVTQ TLSGTTGIAS PGQTVTATLG GVTYTGIVDA AGNWTISLPV
NGLQNLPNGT TALQVSVSDA AGNSSTLTSN ITVARTPPTL TTASFATDNI LNSTEVQSSQ
LLTGTASPSS AGQTVTATLN GKTYSGTVGS DGTWSITIPS ADLSNLSDGN YSIVTRLTDT
AGNTTTATQA IVVDASALNA PVVTIGTFAG NNIIDGAEVR VSQVLSGTSK NVEQGQTVTI
SFNGKAYTAQ VLSNGSWSTT ISDADMALLT NGSQTITVSV SDVSGNIATS SSTVTVNTNA
SGLSIAPITG DNQLNVLEAT NGITINGNTV NVAPGTNINV ILNGKTYTVQ VQSDGTWSAN
IQPGDLQALG DGIIAVHVTA VDQAGNALSS TQQLGVSIHN PPVASLNTPF GNGYLNVSDA
QAGQTLSGTT GIHAVGQTVS VTIGGISYTG TVDSNGNWSL QLSPAILGTL ADGVQNISVT
VTDTAGNTST VQGSVFVDLT PPVLTINPIG IDDIINIAES LQPVVISGTS PVNDSGRPII
VNVTINGQIY QGLAQADGTW SVTVPAGDFQ NMPNGVTAIT ATLTDAAGNT GTVSHSIVLD
TDPAKAPTLT IATLSTDDYL NLAESNLPLT INGSSQNVEQ GQQVTVTLNN QTYFATVGAD
GSWSVQVPAT DVGNVPDGKQ TVSASVTDVS GNPGSATHSI TVITDAANLP GITITTLSGN
DVISAQDTQS DLIISGSTTN VQTGQRVTVT LNNKTYLATV GADGSWSTTV PASDVQNLPQ
GSQNVTATVS DIAQNPATAT HPVSVDTVPP LLSIDMLVDT SDIGLADALA GLPLSGKAEA
GLLVTIKVGT AVYSAVADSN GVWQIAIAAN DLLALGDGVK TLGASVTDGA GNASAASIDI
TLKTQSLPTL TLDSLYGNNV LTSAELATET TIGGSYTNLP VGTAIQVTIG AYTVTGVTLA
GGLWSATIPA NALSILADGN VQVSATVTDS AGNTGSASGA LDVVIHTNFA ITIATPFVDG
VLNQAESTVD QLLTGTTGLL DPGQSVSVSV TNGTITTTYS ATVAANGQWS VTLPAADLVA
FGDGTHTINV TVTDHAGNTG AGSGTFSSVI VGVPVASLDT PFGDGKLSLA DAQPGAMLSG
QTGLTSNVGQ TVSVSINGTN FPATVNADGS WTLSLASQTL IDLPDGTVNF TVIVTDSAGN
TSTATATASV LTTTLPVATL DLPFGDGILN ATEIQAIQTL TGKTGITSAG QEVTVTVTNK
TTLIDTTFTA VADGLGGWSR ELSPADLAIF TEGNYSISVK VTDWVGNANT STPRDVSVAL
TLPAPLIDVV PFGLDNILSS AEAASALTFS GRTQIGGSGQ SVKLEIDLNG IRYAATVDSA
GNWSVTLPPN ALNSLTDGQH TITVTAVDAA GNVGSAPIAF TSDFTPPAIT LNTPFDDGYL
NIAEAATLAG RTLSGNAGDA VSVNVTLGGQ TLVTQISGGV WTATLTPAQL ALLADGTQNI
SITATDSLGN SGTLNSQATL AVKAAPTVSI TTFAGLDGLD YAESRTTQSV SGTSTGLEVG
QNVTVRLNGL DYQTQILNGG LWSVNIPSSA LLVLANTTYS LSVSAEDKAG NPTDPSNVNF
NVNLTPPPTV MTINPISTDN IINAVEINGN ITISGRSIGP ASAMTSVQVS ANGVLLQPSP
ITDVNGNWSI TIPALPTFSS QGEVFITATS LDATLTTIVT VDTIAPTLDI VSFASDNVLS
ATEMSTAQAI TGTASITEAG QIVSISLNGK TYSAQVSATG AWSVNVPAAD LAQLTDGNYT
ITATLTDKAG NSTTTTQTVA VDTAIPLLSV TLFDDNILTL AEALAGGAIT GTGEVGATVT
LTAGPLTGTT TVGPNGNWSI PVLSANLQNL IDGPQVIGVT LTDTSGNTTH LDVTLDVALN
KTLGAGITDI FGNDGILNLA ESLVTQVISG NATGDYLGAK VQVTVLGNTV EGTVGANGAW
SVALAPNLFT GLSNGLLAVN VDIIDSHGNV KNQLVNIDVL KSLPVINSVV AFTDGALNAA
DVATSQIISG VVSNVDIAAG ATVAVTLGNK IYTGISVGAG GAWSLSVPAL DLQALQDGTL
ALGIAVTDHA GNTASQIVNV PTVIRNLPSI TLNPVFGDSL LNLGDLLVNQ TLSGTATGLA
GRTITLSIAG SQIATAAVGA DGKWSVAVTP SVLGILQGLG SGDFTVAATA TDSVGNTASG
NAGIKFDFAQ PVITLNPVFG GDGFINAAEA LVAQTISGVV TNASAGSQVA VTLGGKTFLS
TVGAGGAFSL TLQPSDLSAL VDGNTTLNVS ITNTSGNIGT INNAINIIAK NLPTISLGSL
FGGDGFLNAA EAALTQTISG TTTNAIAGSS IVIRIGTLTL NATVGSDGTW SASVTPLQLS
GLANGNLTVS ATVTDPAGNS NSISAGLNVS ILPPTITLNP LFNNGILDLT SLLSAQTISG
TTTNVAAGTA INVTLGSKTY TTTVGANGSW SLPVPNLDLK ALTDGITNIG VRLVDAAGNV
GQQAGTVSVA INGQPTLTLN PLFGGDGLLN AVEAAAGQII SGTSTNAIGS TIQISLGTKT
YSAVVQSNGS WSVSLPSLDL NNLTDGTLSL SASLTNAAGK SASVGASIGV GVHTLPTVSL
GSLFGGDGYL NLAEAGINQL ISGTTTNAAG GSVTLTVGGL VLTAAVASNG TWSISVPSAN
LLNIADGNLT VGVTVADRYG NTNNTSSNVI VKTHQLPQLG IDAVGSLIGN TIGLLTNGVT
ISGASRYVQQ GAKVTVTLLG QTLQGTVGAD GKWSAHFSSS LLGINAFNVG AILTALLGTA
VEASVTDQAG NFTGVSAGLT SGISLGLPLM SMMALDVDDS SLAQVSSLSS EQLDGEESAV
VQPLQVSSKM AAATLSETHT IDNSASTEVE DSVYAIGGVV VNLADGTVQT GTDVEGGEGD
DIITLSTLDF TQIDGGDGID TLVLDGTELN LDLTALGLKI DNVEIFDLGQ NGTNSITLDL
DRALNVTDRP EDDLLIVGGE GNQVNLIPGE GAWSTVGQRD IDGQRFDVYH HSSLDSANNL
GDVLVQQGLL VNMV