Gene YpsIP31758_4008 details


Gene Information

Locus tag: YpsIP31758_4008
Symbol:
ID: 5388396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 4502514
End bp: 4517375
Gene Length: 14862 bp
Protein Length: 4953 aa
Translation table: 11
GC content: 53%
IMG OID: 640867038
Product: putative invasin
Protein accession: YP_001402955
Protein GI: 153949538
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
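
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the gene length includes one terminal stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4502514
end_bp = 4517375
gene_length_bp = 14862
protein_length_aa = 4953

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS consists of whole codons, the last of which is a stop codon
# that contributes no amino acid to the protein.
codon_count = gene_length_bp // 3
coding_codons = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions both comparisons hold for this record, so the listed lengths are internally consistent.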


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCTCCTG CTGCACGTGC GACTGAACCT TATACGCTGG GGCCGGGTGA CTCTATTCAA 
TCGATAGCAA AAAAATATAA TATTACGGTT GATGAACTGA AAAAACTGAA TGCTTATCGT
ACCTTTTCCA AACCTTTCGC ATCACTGACA ACAGGGGATG AGATTGAAGT TCCCCGCAAA
GAGTCATCTT TCTTTAGCAA TAATCCTAAT GAAAATAATA AAAAAGATGT TGATGATTTG
TTAGCCAGAA ATGCTATGGG AGCCGGTAAG TTACTTTCTA ATGACAATAC CTCTGATGCC
GCAAGTAATA TGGCGCGTTC AGCAGTGACA AATGAAATTA ACGCATCTTC TCAGCAGTGG
TTGAACCAAT TTGGTACTGC GCGGGTACAA TTGAATGTAG ATAGCGATTT TAAGCTAGAT
AACAGCGCGC TGGACCTATT GGTACCACTC AAAGACAGTG AAAGTTCACT CCTGTTTACT
CAACTTGGTG TTCGCAATAA AGACAGCCGC AACACAGTAA ATATTGGTGC CGGAATACGT
CAGTACCAAG GTGACTGGAT GTACGGTGCT AATACCTTCT TTGACAATGA TCTTACTGGA
AAAAACCGGC GTGTGGGGGT GGGTGCTGAA GTTGCGACTG ACTACCTTAA ATTCTCCGCT
AACACCTATT TTGGTTTGAC CGGCTGGCAT CAGTCTCGTG ACTTCAGCAG TTACGATGAG
CGTCCGGCTG ATGGCTTCGA TATCCGTACC GAGGCTTATT TACCCGCCTA TCCACAATTA
GGCGGTAAAT TAATGTATGA AAAATACCGT GGTGATGAAG TCGCCTTATT TGGTAAGGAT
GATCGCCAGA AAGACCCACA TGCTGTGACG TTGGGGGTTA ATTACACCCC GGTTCCTTTG
GTCACTATTG GTGCAGAACA CCGTGAGGGG AAAGGTAACA ACAACAATAC CAGCGTTAAT
GTGCAACTGA ATTATCGCAT GGGGCAACCG TGGAATGATC AGATTGATCA GTCAGCGGTG
GCGGCTAACC GGACACTGGC GGGCAGCCGT TATGACCTGG TTGAACGTAA TAATAATATT
GTTCTGGATT ATAAAAAGCA GGAGCTAATA CATCTGGTTC TGCCTGACAG AATCAGTGGT
TCGGGTGGTG GTGCCATTAC ATTGACCGCA CAGGTACGTG CAAAATATGG CTTTAGCCGC
ATTGAATGGG ATGCGACACC GTTAGAAAAT GCGGGCGGCA GTACCTCTCC ACTCACTCAG
AGTTCATTGT CGGTTACCTT ACCTTTCTAT CAACATATTC TCAGAACAAG CAATACACAT
ACAATAAGTG CGGTTGCCTA TGATGCCCAA GGGAACGCCT CAAATCGTGC GGTGACATCT
ATTGAAGTCA CGCGTCCAGA GACCATGGTG ATCAGTCATC TGGCGACAAC AGTTGATAAT
GCGACGGCTA ACGGTATTGC GGCTAACACG GTACAAGCCA CAGTAACCGA TGGCGACGGC
CAACCGATTA TCGGGCAGAT CATCAACTTT GCGGTTAATA CTCAGGCAAC ATTAAGTACG
ACAGAGGCAA GAACGGGAGC TAATGGTATT GCCAGTACCA CACTGACGCA TACCGTCGCG
GGTGTGAGTG CGGTCAGCGC GACGTTGGGT TCTAGTAGCC GAAGTGTGAA TACGACGTTT
GTGGCTGATG AAAGCACGGC GGAGATCACC GCCGCAAATC TGACAGTGAC AACAAATGAT
TCAGTGGCTA ATGGCAGTGA CACTAACGCT GTTCGGGCGA AGGTTACTGA TGCCTATACT
AATGCTGTTG CTAATCAATC CGTGATATTC AGTGCCAGTA ATGGTGCAAC CGTCATCGAT
CAAACAGTGA TAACCAATGC CGAAGGGATT GCTGACTCTA CGCTGACCAA TACCACCGCA
GGGGTTTCGG CAGTCACTGC AACGTTGGGA AGCCAATCTC AACAAGTTGA TACGACATTT
AAACCTGGGT CGACAGCGGC GATCAGTTTG ATGAAATTGG CTGACCGGGC GGTTGCCGAT
GGCATCGACC AGAATGAAAT CCAAGTCGTG TTACGGGATG GGACAGGCAA TGCCGTGCCA
AATGTGCCGA TGAGTATTCA GGCAGATAAT GGCGCGATAG TGGTTGCTTC AACACCGAAT
ACCGGTGTAG ATGGCACGAT TAATGCCACA TTCACGAACC TTCGGGCAGG AGAATCCGTT
GTTAGCGTGA CGTCTCCTGC ATTGGTGGGT ATGACGATGA CAATGACGTT CTCTGCTGAT
CAGAGGACGG CGGTTGTTTC TACGTTGGCC GCAATTGATA ACAATGCCAA AGCGGATGGA
ACTGACACCA ATGTGGTGCG TGCGTGGGTC GTTGATGCAA ATGGTAATTC AGTACCGGGT
GTTTCTGTAA CATTTGATGC TGGGAATGGT GCTGTTTTGG CACAGAATCC AGTGGTGACA
GACCGTAATG GCTATGCAGA AAATACACTC ACCAACCTGG CTATAGGTAC CACTACAGTC
AAAGCCACGA CGGTAACCGA CCCTGTTGGT CAGACCGTCA ATACCCACTT TGTGGCCGGT
GCAGTAGATA CCATCACCCT GACGACACCG GTTAACGGTG CGGTGGCGGA TGGGGCAAAC
AGCAACAGCG TGCAGGCGGT GGTCAGCGAC AGCGGCGGCA ACCCGGTTAC CGGTGCGACG
GTAGTCTTCA GCTCCACCAA TGCCACAGCG CAAGTCACTA CGGTGATCGG TACCACCGGT
GTGGACGGGA TCGCCACGGC GACCCTGACC AATACTGTGG CGGGGACCAG CAATGTGGTC
GCCACCATCG GTAGCATTAC CAACAATATC GACACGACCT TTGTGGCCGG TGCGGTTGCG
ACCATCACAC TGACGACGCT GGTTAACGGC GCGGTGGCGG ATGGGGCAAA CAGCAACAGC
GTGCAGGCGG TGGTCAGCGA CAGCGAGGGC AACCCGGTCG CCGGTGCGGC CGTGGTCTTC
AGTTCTGCCA ACGCCACAGC CCAAATTACC ACGGTGATCG GCACCACCGA TGCGGACGGG
ATCGCCACGG CGACCCTGAC CAATACCGTG GCGGGGACCA GCAATGTGGT CGCCACCATA
GGTAGCATTA CCAACAATAT CGACACGACC TTTGTGGCCG GTGCGGTTGC GACCATCACA
CTGACGACGC CGGTTAACGG CGCGGTGGCG GATGGGGCAA ACAGCAACAG CGTGCAGGCG
GTGGTCAGCG ACAGCGGCGG CAACCCGGTT ACCGGTGCGA CGGTAGTCTT CAGCTCCACC
AATGCCACAG CGCAAGTCAC TACGGTGATC GGTACCACCG GTGTGGACGG GATCGCCACG
GCGACCCTGA CCAATACCGT GGCGGGGACC AGCAATGTGG TCGCCACCAT AGGTAGCATT
ACCAACAATA TCGACACGAC CTTTGTGGCC GGTGCGGTTG CGACCATCAC ACTGACGACG
CTGGTTAACG GCGCGGTGGC GGATGGGGCA AACAGCAACA GCGTGCAGGC GGTGGTCAGC
GACAGCGGCG GCAACCCGGT TACCGGTGCG ACGGTAGTCT TCAGCTCCAC CAATGCCACA
GCGCAAGTCA CTACGGTGAT CGGTACCACC GGTGTGGACG GGATCGCCAC GGCGACCCTG
ACCAATACTG TGGCGGGGAC CAGCAATGTG GTCGCCACCA TAGGTAGCAT TACCAACAAT
ATCGACACGA CCTTTGTGGC CGGTGCGGTT GCGACCATCA CACTGACGAC ACCGGTTAAC
GGTGCGGTGG CGGATGGGGC AAACAGCAAC AGCGTGCAGG CGGTGGTCAG CGACAGCGGC
GGCAACTCGG TTACCGGTGC GACGGTAGTC TTCAGCTCCA CCAATGCCAC AGCGCAAGTC
ACTACGGTGA TCGGCACCAC CGGTGCGGAC GGGATCGCCA CGGCGACCCT GACCAATACC
GTGGCGGGGA CCAGCAATGT GGTCGCCACC ATTGATACGG TTAACGCCAA TATTGACACC
ACCTTTGTAG CCGGTGCGGT CGCGACCATT ACGCTGAGTG TGCTGGTTAA CGATGCAACT
GCGGACGGTG CAGACACCAA TCAGGTGGAC GCATTGGTAC AGGATGCTAA CGGCAATGCG
ATCACCGGTG CGGCTGTAGT CTTTAGTTCA GCCAATGGGG CAGATATTAT TGCCCCGACC
ATGAACACCG GTGTAAATGG CGTGGCATCA ACACTCCTGA CCCATACCGT GGCCGGGACC
AGTAATGTGA TCGCCACCAT TGATACGGTT AACGCCAATA TCGACACCAC CTTTGTAGCC
GGTGCGGTCG CCACCATTAC ACTGAGCGTG CCAGTTAACG ATGCAACAGC GGACGGTGCA
GACACCAATC AGGTGGACGC ATTGGTACAG GATGCTAGCG GCAATGCGAT CACCGGTGCC
GCCGTGGTCT TTAGTTCAGC CAATGGGGCA ACTATTCTCT CCTCGACTGT GAATACCGGT
GCCGATGGGA TCGCCAGTAC CACGCTGACC CATACCCAGT CCGGTGTGAG CAATGTGGTC
GCCACCATTG ATACGGTTAA TGCCAATATC GACACCGCCT TTGTGGCCGG TGCGGTCGCG
ACCATTACGC TGAGTGTGCT GGTTAACGAT GCAACTGCGG ACGGTGCAGA CACCAATCAG
GTGGACGCAT TGGTACAGGA TGCTAATGGC AATGCGATCA CCGGTGCCGC CGTGGTCTTT
AGTTCAGCCA ATGGGGCAAC TATTCTCTCC TCGACTGTGA ATACCGGTGC CGATGGGATC
GCCAGTACCA CGCTGACCCA TACCCAGTCC GGTGTGAGCA ATGTGGTCGC TACCATTGAT
ACGGTTAACG CCAATATTGA CACCACCTTT GTGGCCGGTG CGGTTGCGAC CATTACGCTG
AGTGTGCCAG TTAATGATGC AACTGCGGAC GGTGCAGACA CCAATCAGGT GGACGCATTG
GTACAGGATG CTAATGGCAA TGCGATCACC GGTGCGGCCG TGGTCTTTAG TTCAGCCAAT
GGGGCAACTA TTCTTTCCTC GACCATGAAC ACCGGTGTAA ATGGAGTGGC ATCAACGCTC
CTGACCCATA CCCAGTCCGG TGTGAGCAAT GTGGTTGCCA CCATTGATAC GGTTAACGCC
AATATCGACA CCACCTTTGT GGCCGGTGCG GTCGCGGCCA TTACGCTGAC GACGCCGGTT
GACGGCGCGG TGGCGGATGG CACGGACAGC AACAGCGTGC AGGCGGTGGT CAGCGACAGC
GACGGCAACC CGGTCACCGG AGCAACGGTG GTCTTTAGCT CCACCAATGC CACAGCCCAA
ATTACCACAG TGATCGGCAC CACCGGTGCG GACGGGATCG CCACGGCGAC CCTGACCAAT
ACCGTGGCGG GGACCAGCAA TGTGGTCGCC ACCATTGATA CGGTTAACGC CAATATCGAC
ACCACCTTTG TGGCCGGTGC AGTCGCGACC ATTACGCTGA GCGTGCCAGT TAATGATGCA
ACCGCGGACG GTGCAGATAC CAATCAGGTG GACGCATTGG TGCAGGATGC TAACGGCAAT
GCGATCACCG GTGCCGCCGT GGTCTTTAGT TCGACCAATG GGGCAGATAT TATCGTCCCA
ACCATGAACA CCGGTGTAAA TGGAGTGGCA TCAACACTCC TGACCCATAC CATGGCGGGG
ACCAGTAATG TGATCGCCAC CATTGATACG GTTAACGCCA ATATCGACAC CACCTTTGTA
GCCGGTGCGG TCGCCACCAT TACACTGAGC GTGCCAGTTA ATGATGCAAC TGCGGACGGT
GCAGACACCA ATCAGGTGGA CGCATTGGTA CAGGATGCTA ATGGCAATGC GATCACCGGT
GCGGCCGTGG TCTTTAGTTC AGCCAATGGG GCAACTATTC TTTCCTCGAC CATGAACACC
GGTGTAAATG GAGTGGCATC AACGCTCCTG ACCCATACCC AGTCCGGTGT GAGCAATGTG
GTTGCCACCA TTGATACGGT TAACGCCAAT ATCGACACCA CCTTTGTGGC CGGTGCGGTT
GCGGCCATTA CGCTGACGAC GCCGGTTAAC GGCGCAGTGG CGGATGGGGC AAACAGCAAC
AGCGTGCAGG CGGTGGTCAG CGACAGCGAG GGCAATGCGG TCGCCGGTGC GGCTGTAGTC
TTCAGTTCTG CCAACGCCAC AGCCCAACTT ACCACAGTGA TCGGCACCAC CGGTGCGGAC
GGGATCGCCA CGGCGACCCT GACTAATACC GTGGCCGGGA CCAGTAATGT GATCGCCACC
ATTGATACGG TTAACGCCAA TATCGACACC ACCTTTGTGG CCGGTGCGGT CGCCACCATT
ACACTGAGCG TGCCAGTTAA TGATGCAACC GCGGACGGTG CAGACACCAA TCAGGTGGAC
GCATTGGTAC AGGATGCTAA TGGCAATGCG ATCACCGGTG CGGCTGTAGT CTTTAGTTCA
GCCAATGGGG CAGATATTAT TGCCCCGACC ATGAACACCG GTGTAAATGG AGTGGCATCA
ACGCTCCTGA CCCATACCCA GTCCGGTGTG AGCAATGTGG TCGCCACCAT TGATACGGTT
AACGCCAATA TCGACACCGC CTTTGTGGCC GGTGCGGTCG CGACCATTAC GCTGAGTGTG
CTGGTTAACG ATGCAACTGC GGACGGTGCA GACACCAATC AGGTGGACGC ATTGGTACAG
GATGCTAATG GCAATGCGAT CACCGGTGCC GCCGTGGTCT TTAGTTCAGC CAATGGGGCA
ACTATTCTTT CCTCGACTGT GAATACCGGT GCCGATGGGA TCGCCAGTAC CACGCTGACC
CATACCCAGT CCGGTGTGAG CAATGTGGTT GCCACCGTTG ATACGGTTAA CGCCAATATC
GACACCGCCT TTGTGGCCGG TGCGGTCGCG ACCATTACAC TGAGCGTGCC AGTTAATGAT
GCAACTGCGG ACGGTGCAGA CACCAATCAG GTGGACGCAT TGGTACAGGA TGCTAACGGT
AATGCGATCA CCGGTGCGGC CGTGGTCTTT AGTTCGACCA ATGGGGCAAC TATTCTCTCC
TCGACTGTGA ATACCGGTGC CGATGGGATC GCCAGTACCA CGCTGACCCA TACCCAGTCC
GGTGTGAGCA ATGTGGTCGC TACCATTGAT ACGGTTAACG CCAATATCGA CACCACCTTT
GTGCCCGGTG CGGTTGCGAC CATTACGCTG AGTGTGCCAG TTAACGATGC AACCGCGGAC
GGTGCAGACA CCAATCAGGT GGACGCATTG GTACAGGATG CTAACGGCAA TGCGATCACC
GGTGCGGCTG TAGTCTTTAG TTCAGCCAAT GGGGCAGATA TTATTGCCCC GACCATGAAC
ACCGGTGTAA ATGGAGTGGC ATCAACACTC CTGACCCATA CCCAGTCCGG TGTGAGCAAT
GTGGTCGCTA CCATTGATAC GGTTAACGCC AATATCGACA CCACCTTTGT GGCCGGTGCA
GTCGCGACCA TTACGCTGAG CGTGCCAGTT AATGATGCAA CCGCGGACGG TGCAGACACC
AATCAGGTGG ACGCATTGGT ACAGGATGCT AACGGCAATG CGATTACCGG TGCGGCTGTA
GTCTTTAGTT CAGCCAATGG GGCAGATATT ATTGCCCCGA CCATGAACAC CGGTGTAAAT
GGAGTGGCAT CAACACTCCT GACCCATACC CAGTCCGGTG TGAGCAATGT GGTCGCCACC
ATTGATACGG TTAACGCCAA TATCGACACC ACCTTTGTGC CCGGTGCGGT TGCGACCATT
ACGCTGAGTG TGCCAGTTAA CGATGCAACC GCGGACGGTG CAGACACCAA TCAGGTGGAC
GCATTGGTAC AGGATGCTAA CGGCAATGCG ATTACCGGTG CGGCTGTAGT CTTTAGTTCA
GCCAATGGGG CAGATATTAT TGCCCCGACC ATGAACACCG GTGTAAATGG AGTGGCATCA
ACACTCCTGA CCCATACCCA GTCCGGTGTG AGCAATGTGG TCGCCACCAT TGATACGGTT
AACGCCAATA TCGACACCAC CTTTGTGCCC GGTGCGGTTG CGACCATTAC GCTGAGTGTG
CCAGTTAACG ATGCAACCGC GGACGGTGCA GACACCAATC AGGTGGACGC ATTGGTACAG
GATGCTAACG GCAATGCGAT CACCGGTGCC GCCGTGGTCT TTAGTTCGAC CAATGGGGCA
ACTATTCTCT CCTCGACTGT GAATACCGGT GCCGATGGGA TCGCCAGTAC CACGCTGACC
CATACCCAGT CCGGTGTGAG CAATGTGGTC GCTACCATTG ATACGGTTAA CGCCAATATC
GACACCACCT TTGTGCCCGG TGCGGTTGCG ACCATTACGC TGAGTGTGCC AGTTAACGAT
GCAACCGCGG ACGGTGCAGA CACCAATCAG GTGGACGCAT TGGTACAGGA TGCTAACGGC
AATGCGATCA CCGGTGCGGC TGTAGTCTTT AGTTCAGCCA ATGGGGCAGA TATTATTGCC
CCGACCATGA ACACCGGTGT AAATGGCGTG GCATCAACAC TCCTGACCCA TACCGTGGCC
GGGACCAGTA ATGTGGTCGC CACCATTGAT ACGGTTAATG CCAATATCGA CACGACCTTT
GTGGCCGGTG CGGTCGCCAC CATTACACTG AGCGTGCCAG TTAATGATGC AACTGCGGAC
GGTGCAGACA CCAATCAGGT GGACGCATTG GTACAGGATG CTAATGGCAA TGCGATCACC
GGTGCGGCCG TGGTCTTTAG TTCAGCCAAT GGGGCAACTA TTCTTTCCTC GACCATGAAC
ACCGGTGTAA ATGGCGTGGC ATCAACGCTC CTGACCCATA CCGTGGCGGG GACCAGCAAT
GTGGTCGCCA CCATAGGCAG CATTACCGAC AATATTGATA CCGTTTTTGT GGCCGGTGCG
GTCGCGACCA TTACGCTGAG TGTGCCAGTT AATGATGCCA CTGCGGACGG TGCAGATACC
AATCAGGTGG ATGCATTGGT AGAGGATGCT AACGGCAATG CGATCACCGG TGCTGCGGTG
GTCTTTAGTT CAGCTAACGG GGCAACTATT CTTGCGTCAA CGGTGAACAC CGGTGTAAAT
GGCGTGGCAT CAACGCTCCT GACCCATACC GTGGCGGGGA CCAGCAATGT GGTCGCCACC
ATAGGCAGCA TTACCGACAA TATTGATACC GTTTTTGTGG CCGGTGCGGT CGCGACCATT
ACGCTGAGTG TGCCAGTTAA TGATGCCACT GCGGACGGTG CAGATACCAA TCAGGTGGAT
GCATTGGTAG AGGATGCTAA CGGCAATGCG ATCACCGGTG CTGCGGTGGT CTTTAGTTCA
GCCAATGGGG CAACTATTCT CTCCTCGACT GTGAATACCG GTGCCGATGG GATCGCCAGT
ACCACGCTGA CCCATACCCA GTCCGGTGTG AGCAATGTGG TCGCCACCAT TGATACGGTT
AATGCCAATA TCGACACCAC CTTTGTAGCC GGTGCGGTCG CCACCATTAC ACTGAGCGTG
CCAGTTAATG ATGCAACCGC GGACGGTGCA GACACCAATC AGGTGGACGC ATTGGTACAG
GATGCTAACG GCAATGCGAT CACCGGTGCG GCTGTAGTCT TTAGTTCAGC CAATGGGGCA
GATATTATTG CCCCGACTAT GAACACCGGT GTAAATGGCG TGGCATCAAC ACTCCTGACC
CATACCGTGG CCGGGACCAG TAATGTGGTC GCCACCATAG GCAGCATTAC CAACAATATC
GACACCGCCT TTGTGGCCGG TGCGGTTGCG ACCATCACAC TGACGACACC GGTTAACGGT
GCGGTGGCGG ACGGTGCAGA CACCAATCAG GTGGACGCAT TGGTACAGGA TGCTAACGGC
AATGCGATCA CCGGTGCGGC TGTAGTCTTT AGTTCAGCCA ATGGGGCAGA TATTATTGCC
CCGACCATGA ACACCGGTGT AAATGGCGTG GCATCAACAC TCCTGACCCA TACCGTGGCC
GGGACCAGTA ATGTGGTCGC CACCATTGAT ACGGTTAATG CCAATATCGA CACCACCTTT
GTGCCCGGTG CGGTTGCGAC CATTACGCTG AGTGTGCCAG TTAATGATGC AACTGCGGAC
GGTGCAGACA CCAATCAGGT GGACGCATTG GTACAGGATG CTAATGGCAA TGCGATCACC
GGTGCGGCAG TGGTCTTTAG CTCGGCCAAC GGGGCAACTA TTCTTGCGTC AACGGTGAAC
ACCGGTGTAA ATGGCGTGGC ATCAATGCTC CTGACCCATA CCGTGGCGGG GGCCAGCAAT
GTGGTCGCCA CCATAGGCAG CATTACTGAC AATATCGATA CGACCTTTGT TGCGGGTGCA
ATGGCGAATA TCGTCGTCAG TATTATTGAC GATAATGCAC TGGCAAATGG TGCAGATACC
AATATTGTCG AAGCCTTTGT GACTGACCGT TTCGGTAATG GCGTGGCGAA TCAAAGCCTA
ATATTTGGTA CCAATGGGGC GTCCATTGTG GGTCCATCAA CAGTGACGAC CAATCTTGAT
GGCCGTGTTA GAGCGAGTGC TACGCATACT GTAGCAGGGA GCAGTAATAC GGTGGTTGCA
ATGAGTGGCA CTCATCAAGG ATATACCAGA GTAACCTTTG TTGCCGATGC TTCGACGGCC
CAGCTTATGC TAATACCGAT CTTGGATAAC CAAATTGCAG ATGGCACAGC GGTTAACCGT
GTGGAAGGGC GAGTTACCGA CGCTTATGGC AACCCATTAG CTAATCAATC TGTTAGCTTT
ATTCTCGATA ATGGGGCGGT GATTGAATAT TTAAGCAACG TCAGTAACGC CGCCGGGGTT
GTCTTGATCA GATTCAACAA TACGCTTGCA GGTATGACAA CGGTGACGGC AACGCTCGAC
TCTACCGGAC AAACTGAAAC CCTCGAGACG CATTTTGTGG CTGGAAAAGC GGCATCGATT
GAACTGACGA TGACGAAAGA TAATGCCGTG GCTAACAATA TCGATACCAA CGAAATCCAG
GTGTTAGTGA CGGATACAGG CGGTAACGCG ATCAACGGCG CGGTGGTCAA CCTTACTTCT
AACAGTGGTA TGAACATTAC ACCAAACTCG GTAACGACAG GCAGCGATGG TACGGCGACG
GCGACCTTGA CGCATACCCT GGCAGGGAAC CTCCCGATCA ATGCGCGGAT CGATCAGGTG
AGTAAAACGA TTAATGCCAC CTTTATCGCC GATGCTTCGA CTGCGCAGAT TATTGCGAGT
GACATGTTCA TCATCGCTAA CAATCAAGTT GCTAATGGGG AGGCTGTTAA CGCGATTCAG
GCGAGAGTCA CTGATAGCTA TGGCAACCCT ATTAAGGATC AAACGGTTGA ATTCGTGCTG
AGTAATAATG GCACTATTAA ATATAATCTG GATGTGACTT CAGCTGAAGG TGGCGTTATG
GTGACATTTA CTAATACCTT GGCGGGTATT ACCAATGTGA CCGCGACCGT GGTATCCACT
GGCGGTAGCC GAAATATTGA TACCACCTTT ATTGCCGATG TGACGACGGC ACACATTGCT
GCAAGTGATT TGATGGTCAT TGTTGATAAT GCGGTCGCCA ATAACTCGGA TGAAAATGAG
GTCCATGCGC GGGTCACCGA TGCGAAGGGC AACGTGTTAT CGGGCCAGAC GGTGGTCTTC
ACCTCTGGCA ACGGTGCCGC TATCACGACA GTAAATGGTA TCAGTGATAG CGATGGTCTG
ACCAAAGCGA CCTTAACCCA TACCTTGGCG GGTACCAGTG TGGTGACTGC AAGGGTTAGT
AACCAGGTGC AGAGCAAAGA TACGATCTTT ATTGCGGATA GAACCACCGC GACCATTAGG
GCCTCAGACC TGACCATTAC CCGGAACAAT GCGCTAGCTG ATGGGGTTGC TACTAATGCC
GCTCGTGTGA TTGTTACTGA TGCCAATGGG AACCCGGTGC CGAGTATGTT TGTGGGTTAT
ACCTCGGATA ATGGCGCACT ACTGACACCA GCATCAGGGA TGACGGATAG TAGTGGGACG
TTTAGCACAA CCTTTACACA TACGACAGCG GGTATCAGTA AGGTGACTGC GGCGATCATA
ACGATGGGGA TAAGCCAAGC TAAAGACGCC GTCTTTATTG CAGACAGCTC CACTGCCCGT
GTGTCGGAAT TGATCATCGT GAAAAATGAT TCGCTTGCCA ACAATAGCGA TAGAAATATC
GTGCAGGCGC ACATTAAGGA TGCTCATGGC AACGTGATTA CGGGAATGAA TGTGAACTTT
AGTGCCACTG AGAATGTGAC ATTGACCGCA AACACGGTCA CCACGAATGA TCAGGGGTAT
GCAGAAAATA CCTTAAGGCA TAACGTTCCT GTTACCAGTG CGGTGACTGC AACGGTCGCT
ACTGATCTGG TGGGTCTCAC AGAGGATGTC CGATTTGTTG CCGGCGACGG TGCTCGAATC
GAGCTATTTA GGCTGAATGA TGGGGCGGTG GCCGATGGCA TCCAAACTAA CAGGGTTGAA
GCCAGGGTCT ATGATGTCTC TGATCACCTG GTGCCGAATA GCAATGTGGT GTTCAGCGCA
AGTAATGGTG GGCAATTAGT GCAGGAAGAT GTGCAGACCG ATGCTTCGGG TAGTGCCTAT
GTTACGGTCA GCAATACTAC GTCAGGCGTA ACCAGAGTAT CAGTAACCGC AGATGGTGTA
TCAGCCTCAA CCACGACGAC CTTTATCGCC GATAAGGATA CGGCCACATT GGACGCGAAT
CTCTTTTTGA TCACTAACGA TAATGCAATA GCGAATGGGG TTATAGAAAA TAGAGTGTTA
TTGCAACTCG TGGATGCCAA TGGCAACAAG GTGTCTGGGG TCGAAGTTAA CTTTAGTGCC
ACTAATGGTG CGTCAATCAA TGCATCAGCC ATCACTGAGG CCAATGGGTT TGCTTTCGGT
ACCCTGACAA ACACTCTTTC AGGGCCAAGT GACGTTACGG TAACATTGGT GACGGCAGGG
GGGACTGAGA GCTTGACAGT CACACCTCAG TTTATTGCCG ATAAAAATAC CGCCCATATT
GCTACAGGTG ATTTTGTCAT TATCGATGAT GGCGCTGTGG CCAATAGCGT AGCCTTCAAT
GAGGTCCGTG CCAAAGTGAC TGACGATCTG GGGAACGCTA TTGCTGGCTA CAGTGTTATT
TTCGCATCAC AAAATGGCGC GACCATCACC ACCAGTGGTA TTACGGGTGT CGATGGGTGG
GCTAGCGCGA GGCTGACCCA TACCCAGGCT GGGGAGAGTG GGATCTCAGC GCGAGTCGCG
CGGCCTGCGA CTACGACGCA CTCGCTGATG CCGTACTTTA TCGCTGATGT GAGTACGGCA
ACGTTAAAAC TTTTTAATTT CAACACTATG CCGGTAATTG CCGATGGGGT GACGCAATTC
TTCGTGCTAG GAACGGTTTT TGATGCCAAC CAGAACCCAG TAGGGGGGCA GCAGGTGGCC
TTTAGTGCAA CAAATGAGGT GACCCTAATT GAGAGTAATG GGTCGATCAG TGCTCCAGAG
GGTGGTGTGC TCTTATCTGT CACGAGTACT CAGGCTGGGA TTCACCCTAT TACAGGGACC
TTGGTATCGA ATAACTATAC GGACACGCTT GGCGCCGAAT TTATCGCGGA CAAAAATACC
GCTCAATTGT CCACCTTAAT CGTCGTCGAT AACAATGCAC TGGCAGATGG TGTTGCACGT
AACCAAGTCC GGGCGCATGT TGTCGATAGT ACGGGCAATT CGGTGGCCGA TATGGCGGTG
ACATTTACCG CCAACCGTGG TGCGCAGCTG AGTAAGGTAA CGGTACTGAC CGACAATAAC
GGGGATGCCG TCAATACGCT GACCAACAGT TTAGCTGGTG TTACGGTTGT GACGGCCAAA
CTTGGGACGG CGGGAACGCC TCTCACTGTT GACACGGTCT TTACTGCCGG GCCGCTGGCG
ACACTGACAC TGGTGACAAC GGTCGATAAT GCTTTTGCGG ATAACAGTGC TACCAATACG
GTACGGGCGA CACTTAAAGA TACTACCGGG AACCCAGTCG TTGGGGAAGT GGTCGCCTTT
GCGGCAAGTA ATGGGGCGAC GATTACGGCC ACCGATGGTG GGGTAAGCAA TGCCAACGGT
ATTGTCTTGG CAACCTTAAC CAATGGTTCT GCTGGGGTTA GCACTGTCAC GGCGACGATA
GAGACATTAA CGGAGACAAC AGACACTACC TTTATTGTCA TGAAGAATCT GGATGTAACC
GTGAATGGTA CAACGTTTAA CGGAGATGCC GGGTTCCCAA CTACCGGGTT TGTGGGGGCA
ACCTTTAAGG TCAATTCGGG TGGAGACAAT AGCCTCTATG ACTGGAGCAG TAGTGCCCCA
GCACTGGTAT CGGTCAGCGG TGATGGTGTA GTGACATTTA ATGCGGTATT CCCGACGGGT
ACACCGGCAA TTACCATCTC TGCCACCCCG AAAGGTGGTG GTAGCCCACT CTCGTATAGC
TTTAGAGTTA ACCAGTGGTT TATCAATAAT AGTGGCGCTA CGTTAGATAG AGTCAGCGCA
ATAGCACATT GTGAGAATGT GGGCTACGTG ATGCCAATCT CTACTCAGGT CACCAATGCG
GCGACCTGGA TGTCAGGACG GCGAGCAGTG GGTAACTTGT GGTCAGAATG GGGCGATTTC
AGTGCCTATA CTGTACCGGG CTGGGTGCCT GCTGAGTTCT TCTGGCTCAG TAATAATCAT
GACGCTACTA CGGCTCTGGC TATTGGCCTG TCAACGGGTA CGCTGACGAC GATGGGTGAT
ATGACGGTCG CAACCCATGT GATGTGTACC CGATCACTCT AG
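
The gene sequence is printed in space-separated groups of ten bases, which must be stripped before the sequence is used elsewhere. The following sketch shows one way this might be done, using only the first and last lines of the listing. The helper names and the partial codon table are hypothetical and cover only the codons needed here. It also illustrates that the gene opens with TTG, which translation table 11 permits as an initiation codon read as methionine.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()

# First and last lines of the gene sequence, copied from the listing.
first_line = "TTGCCTCCTG CTGCACGTGC GACTGAACCT TATACGCTGG GGCCGGGTGA CTCTATTCAA"
last_line = "ATGACGGTCG CAACCCATGT GATGTGTACC CGATCACTCT AG"

# Partial codon table (illustrative only): just the codons that follow
# the start codon in the first ten codons of this gene.
PARTIAL_CODONS = {
    "CCT": "P", "GCT": "A", "GCA": "A", "CGT": "R",
    "GCG": "A", "ACT": "T", "GAA": "E",
}

def translate_prefix(dna: str, n_codons: int) -> str:
    """Translate the first n_codons, reading the initiation codon as M."""
    codons = [dna[i:i + 3] for i in range(0, 3 * n_codons, 3)]
    # Assumption: the first codon is the initiator and is read as methionine.
    return "M" + "".join(PARTIAL_CODONS[c] for c in codons[1:])

head = clean_sequence(first_line)
tail = clean_sequence(last_line)
prefix = translate_prefix(head, 10)

print(head[:3], tail[-3:], prefix)
```

The translated prefix can be compared with the first ten residues of the protein sequence listed in this record. A full translation would need the complete table 11 codon set, which this sketch deliberately omits.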
 
Protein sequence
MPPAARATEP YTLGPGDSIQ SIAKKYNITV DELKKLNAYR TFSKPFASLT TGDEIEVPRK 
ESSFFSNNPN ENNKKDVDDL LARNAMGAGK LLSNDNTSDA ASNMARSAVT NEINASSQQW
LNQFGTARVQ LNVDSDFKLD NSALDLLVPL KDSESSLLFT QLGVRNKDSR NTVNIGAGIR
QYQGDWMYGA NTFFDNDLTG KNRRVGVGAE VATDYLKFSA NTYFGLTGWH QSRDFSSYDE
RPADGFDIRT EAYLPAYPQL GGKLMYEKYR GDEVALFGKD DRQKDPHAVT LGVNYTPVPL
VTIGAEHREG KGNNNNTSVN VQLNYRMGQP WNDQIDQSAV AANRTLAGSR YDLVERNNNI
VLDYKKQELI HLVLPDRISG SGGGAITLTA QVRAKYGFSR IEWDATPLEN AGGSTSPLTQ
SSLSVTLPFY QHILRTSNTH TISAVAYDAQ GNASNRAVTS IEVTRPETMV ISHLATTVDN
ATANGIAANT VQATVTDGDG QPIIGQIINF AVNTQATLST TEARTGANGI ASTTLTHTVA
GVSAVSATLG SSSRSVNTTF VADESTAEIT AANLTVTTND SVANGSDTNA VRAKVTDAYT
NAVANQSVIF SASNGATVID QTVITNAEGI ADSTLTNTTA GVSAVTATLG SQSQQVDTTF
KPGSTAAISL MKLADRAVAD GIDQNEIQVV LRDGTGNAVP NVPMSIQADN GAIVVASTPN
TGVDGTINAT FTNLRAGESV VSVTSPALVG MTMTMTFSAD QRTAVVSTLA AIDNNAKADG
TDTNVVRAWV VDANGNSVPG VSVTFDAGNG AVLAQNPVVT DRNGYAENTL TNLAIGTTTV
KATTVTDPVG QTVNTHFVAG AVDTITLTTP VNGAVADGAN SNSVQAVVSD SGGNPVTGAT
VVFSSTNATA QVTTVIGTTG VDGIATATLT NTVAGTSNVV ATIGSITNNI DTTFVAGAVA
TITLTTLVNG AVADGANSNS VQAVVSDSEG NPVAGAAVVF SSANATAQIT TVIGTTDADG
IATATLTNTV AGTSNVVATI GSITNNIDTT FVAGAVATIT LTTPVNGAVA DGANSNSVQA
VVSDSGGNPV TGATVVFSST NATAQVTTVI GTTGVDGIAT ATLTNTVAGT SNVVATIGSI
TNNIDTTFVA GAVATITLTT LVNGAVADGA NSNSVQAVVS DSGGNPVTGA TVVFSSTNAT
AQVTTVIGTT GVDGIATATL TNTVAGTSNV VATIGSITNN IDTTFVAGAV ATITLTTPVN
GAVADGANSN SVQAVVSDSG GNSVTGATVV FSSTNATAQV TTVIGTTGAD GIATATLTNT
VAGTSNVVAT IDTVNANIDT TFVAGAVATI TLSVLVNDAT ADGADTNQVD ALVQDANGNA
ITGAAVVFSS ANGADIIAPT MNTGVNGVAS TLLTHTVAGT SNVIATIDTV NANIDTTFVA
GAVATITLSV PVNDATADGA DTNQVDALVQ DASGNAITGA AVVFSSANGA TILSSTVNTG
ADGIASTTLT HTQSGVSNVV ATIDTVNANI DTAFVAGAVA TITLSVLVND ATADGADTNQ
VDALVQDANG NAITGAAVVF SSANGATILS STVNTGADGI ASTTLTHTQS GVSNVVATID
TVNANIDTTF VAGAVATITL SVPVNDATAD GADTNQVDAL VQDANGNAIT GAAVVFSSAN
GATILSSTMN TGVNGVASTL LTHTQSGVSN VVATIDTVNA NIDTTFVAGA VAAITLTTPV
DGAVADGTDS NSVQAVVSDS DGNPVTGATV VFSSTNATAQ ITTVIGTTGA DGIATATLTN
TVAGTSNVVA TIDTVNANID TTFVAGAVAT ITLSVPVNDA TADGADTNQV DALVQDANGN
AITGAAVVFS STNGADIIVP TMNTGVNGVA STLLTHTMAG TSNVIATIDT VNANIDTTFV
AGAVATITLS VPVNDATADG ADTNQVDALV QDANGNAITG AAVVFSSANG ATILSSTMNT
GVNGVASTLL THTQSGVSNV VATIDTVNAN IDTTFVAGAV AAITLTTPVN GAVADGANSN
SVQAVVSDSE GNAVAGAAVV FSSANATAQL TTVIGTTGAD GIATATLTNT VAGTSNVIAT
IDTVNANIDT TFVAGAVATI TLSVPVNDAT ADGADTNQVD ALVQDANGNA ITGAAVVFSS
ANGADIIAPT MNTGVNGVAS TLLTHTQSGV SNVVATIDTV NANIDTAFVA GAVATITLSV
LVNDATADGA DTNQVDALVQ DANGNAITGA AVVFSSANGA TILSSTVNTG ADGIASTTLT
HTQSGVSNVV ATVDTVNANI DTAFVAGAVA TITLSVPVND ATADGADTNQ VDALVQDANG
NAITGAAVVF SSTNGATILS STVNTGADGI ASTTLTHTQS GVSNVVATID TVNANIDTTF
VPGAVATITL SVPVNDATAD GADTNQVDAL VQDANGNAIT GAAVVFSSAN GADIIAPTMN
TGVNGVASTL LTHTQSGVSN VVATIDTVNA NIDTTFVAGA VATITLSVPV NDATADGADT
NQVDALVQDA NGNAITGAAV VFSSANGADI IAPTMNTGVN GVASTLLTHT QSGVSNVVAT
IDTVNANIDT TFVPGAVATI TLSVPVNDAT ADGADTNQVD ALVQDANGNA ITGAAVVFSS
ANGADIIAPT MNTGVNGVAS TLLTHTQSGV SNVVATIDTV NANIDTTFVP GAVATITLSV
PVNDATADGA DTNQVDALVQ DANGNAITGA AVVFSSTNGA TILSSTVNTG ADGIASTTLT
HTQSGVSNVV ATIDTVNANI DTTFVPGAVA TITLSVPVND ATADGADTNQ VDALVQDANG
NAITGAAVVF SSANGADIIA PTMNTGVNGV ASTLLTHTVA GTSNVVATID TVNANIDTTF
VAGAVATITL SVPVNDATAD GADTNQVDAL VQDANGNAIT GAAVVFSSAN GATILSSTMN
TGVNGVASTL LTHTVAGTSN VVATIGSITD NIDTVFVAGA VATITLSVPV NDATADGADT
NQVDALVEDA NGNAITGAAV VFSSANGATI LASTVNTGVN GVASTLLTHT VAGTSNVVAT
IGSITDNIDT VFVAGAVATI TLSVPVNDAT ADGADTNQVD ALVEDANGNA ITGAAVVFSS
ANGATILSST VNTGADGIAS TTLTHTQSGV SNVVATIDTV NANIDTTFVA GAVATITLSV
PVNDATADGA DTNQVDALVQ DANGNAITGA AVVFSSANGA DIIAPTMNTG VNGVASTLLT
HTVAGTSNVV ATIGSITNNI DTAFVAGAVA TITLTTPVNG AVADGADTNQ VDALVQDANG
NAITGAAVVF SSANGADIIA PTMNTGVNGV ASTLLTHTVA GTSNVVATID TVNANIDTTF
VPGAVATITL SVPVNDATAD GADTNQVDAL VQDANGNAIT GAAVVFSSAN GATILASTVN
TGVNGVASML LTHTVAGASN VVATIGSITD NIDTTFVAGA MANIVVSIID DNALANGADT
NIVEAFVTDR FGNGVANQSL IFGTNGASIV GPSTVTTNLD GRVRASATHT VAGSSNTVVA
MSGTHQGYTR VTFVADASTA QLMLIPILDN QIADGTAVNR VEGRVTDAYG NPLANQSVSF
ILDNGAVIEY LSNVSNAAGV VLIRFNNTLA GMTTVTATLD STGQTETLET HFVAGKAASI
ELTMTKDNAV ANNIDTNEIQ VLVTDTGGNA INGAVVNLTS NSGMNITPNS VTTGSDGTAT
ATLTHTLAGN LPINARIDQV SKTINATFIA DASTAQIIAS DMFIIANNQV ANGEAVNAIQ
ARVTDSYGNP IKDQTVEFVL SNNGTIKYNL DVTSAEGGVM VTFTNTLAGI TNVTATVVST
GGSRNIDTTF IADVTTAHIA ASDLMVIVDN AVANNSDENE VHARVTDAKG NVLSGQTVVF
TSGNGAAITT VNGISDSDGL TKATLTHTLA GTSVVTARVS NQVQSKDTIF IADRTTATIR
ASDLTITRNN ALADGVATNA ARVIVTDANG NPVPSMFVGY TSDNGALLTP ASGMTDSSGT
FSTTFTHTTA GISKVTAAII TMGISQAKDA VFIADSSTAR VSELIIVKND SLANNSDRNI
VQAHIKDAHG NVITGMNVNF SATENVTLTA NTVTTNDQGY AENTLRHNVP VTSAVTATVA
TDLVGLTEDV RFVAGDGARI ELFRLNDGAV ADGIQTNRVE ARVYDVSDHL VPNSNVVFSA
SNGGQLVQED VQTDASGSAY VTVSNTTSGV TRVSVTADGV SASTTTTFIA DKDTATLDAN
LFLITNDNAI ANGVIENRVL LQLVDANGNK VSGVEVNFSA TNGASINASA ITEANGFAFG
TLTNTLSGPS DVTVTLVTAG GTESLTVTPQ FIADKNTAHI ATGDFVIIDD GAVANSVAFN
EVRAKVTDDL GNAIAGYSVI FASQNGATIT TSGITGVDGW ASARLTHTQA GESGISARVA
RPATTTHSLM PYFIADVSTA TLKLFNFNTM PVIADGVTQF FVLGTVFDAN QNPVGGQQVA
FSATNEVTLI ESNGSISAPE GGVLLSVTST QAGIHPITGT LVSNNYTDTL GAEFIADKNT
AQLSTLIVVD NNALADGVAR NQVRAHVVDS TGNSVADMAV TFTANRGAQL SKVTVLTDNN
GDAVNTLTNS LAGVTVVTAK LGTAGTPLTV DTVFTAGPLA TLTLVTTVDN AFADNSATNT
VRATLKDTTG NPVVGEVVAF AASNGATITA TDGGVSNANG IVLATLTNGS AGVSTVTATI
ETLTETTDTT FIVMKNLDVT VNGTTFNGDA GFPTTGFVGA TFKVNSGGDN SLYDWSSSAP
ALVSVSGDGV VTFNAVFPTG TPAITISATP KGGGSPLSYS FRVNQWFINN SGATLDRVSA
IAHCENVGYV MPISTQVTNA ATWMSGRRAV GNLWSEWGDF SAYTVPGWVP AEFFWLSNNH
DATTALAIGL STGTLTTMGD MTVATHVMCT RSL