Gene YPK_0743 details

Gene Information

Locus tag: YPK_0743
Symbol:
ID: 6090251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 804249
End bp: 817187
Gene Length: 12939 bp
Protein Length: 4312 aa
Translation table: 11
GC content: 53%
IMG OID: 641595804
Product: outer membrane autotransporter
Protein accession: YP_001719497
Protein GI: 170022992
COG category: [M] Cell wall/membrane/envelope biogenesis
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
    [TIGR02601] autotransporter-associated beta strand repeat
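
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 804249
END_BP = 817187
GENE_LENGTH_BP = 12939
PROTEIN_LENGTH_AA = 4312


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 817187 - 804249 + 1 = 12939 bp, and 12939 / 3 - 1 = 4312 aa.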


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACAA TATTTAAAGT GATTTGGAAT GCATCTTTGA ATGTCTGGGT TGTGGTGAGT 
GAGCTGGCAA AAGGCCGGAT AAAAACCAAG AGCAGTCGTA ACTTGATATC AGAGGGTGTA
TTACCAAAAT TTGAACAAAG TATGGTATCG AAACTGTTCA GAAAAAACCT TTTGGCCCTA
TCTCTGGGCA GTATTGTTTT TCTAAGCACC GGCCCTGTAT TTGCTGCTGA TATTACTGTC
AGCACGCAGG CAGAGCTTTC TGCCGCGTTA TCCAATGGTA CTTATGACAA AATTATTTTA
GGTGCTGACA TTACGCTCAT TGGGAGCCTG ACCGTCAATA TGACGAGTAA TCAGGTTGTG
ATTGATGGCC AAGGAAAGTT TGGTCTAACC GTTAACAATA CGACAAATTA TGGTCTGGTG
GTGTCATCGG GTTCAGGGAC CTTAACGCTA CAAAATATGT CCAAAATTGA CTCGGCCAAC
TACTACAGCA TGGTCGTGTT GAACGGTGCC AATACTGCCG TCAATGTGAT TTATAACAAT
ATTGATTTCC TGGGTTCCTC TCAACTTATT TATATGGGGG CCTATGGGGC GGCCACCAAT
AGTATAATGA CCTTTGGCGA TATCTTGAAC GATGTGGTGG TCAATGACCG TGCTCAGGAA
ATTGGTGAAG TCAATAAACT GGCGTTTACC GGTAGATTTC ATGTTACGCA TACGGGGTCC
TCTGTGACAT CTTTTGTCAG CACTGGCGGG GCAAATAATA CATCAACCAT GGATTTTGCC
AGTGGTGCTG ATGTCAAGAT AGACCGGACG GGGTCGACTG GAGACCTGAC CAGTACCGGT
GTTAATGCCT TTGCCTATAC TTTTGCTGAT GGTGCTAGCT TTGAGTTAAT TGCTAATCAG
AATGTTTTCA GCGGAACCAC TACCAATAGA GGCTTGGAAA TTGGTAGTTA TAATAGTATC
GATGGCTTTG GTTCCGGGGT GAAAATAGTT CTGCAATCTC GCTCCGATGG TTCAATTATT
TCAGGTAATG GGATTGATAA CGCCACCACT AATGCCGCTG GTATCAATAA TAATGCCAGT
GGTGATGCTA ATGTTATTTA CAATCTGGGT ACTGGCTCTA TTTTAAAGGC CACGAATACT
GGGATTTTAG CGACAAAAAA TGCGAATAAT GCCAGTGATA TTTATATTCG CTCTGCGGGT
GGCATCACTG CTGCAACGGG TATTTCTGCA ACACATAACG GAACCGGCAC CGTTAAAATC
AAGAATGATG GCACCATCAC CAGTACCACC GCCGGGATCG CCATTTCTTC CGCATCAATA
AAAGAGATCA GCGTTGATAA TACTGACGGT ACCATCACGG CGACTGCCGG TACAGGCGTT
AACGTACTGG CCAGTGCCAT ATTAAATCTG TTCGGTGGCA CCATTAATAC CTCGGCGACG
GCCAACGGTA TTACGTTTGC GGGCACGGAG GGTGGACATA CCCTGACTGA TTTGACGATT
AATCTGTTGG GTACGGGTAT TGCTTTGTCA AACGTGGCTG GCGTCAATTT GACGTTAAGT
AACGTAACTT TAAACACACT TAATGGGACG GCGCTAAATA GCCTGACAGG GTTAACGTTA
GTCGATAGCC TTAATGGGCG TAACACCATT AATATTGAAG GTGCCGGCAT CGGAATTGCG
GCGACAAATA CCGAATTGAA TACCTTCGAT GCAGAGGCAT TAGATATTAA TGTTAATGGT
GCCGGGATCG GCATTCAAGC CACTGGCGGT GGTGTCAATC TCAGCGCTTC TAACTTGATC
ATTAATGTTG CCAATACATT GGGTACTGCG CTGCAAATCA CGGATGGGAT AGATAATACC
ACCACCATTG GCAACGAAAT CCAACTTAAT GCCGAGAATG CGACGGCAAT TAACTTCCTC
GGCTCCTCCA GTAAAACGTT AAACAACAAT GGCACCATTA AAGGCAGTGT GATATTCGCG
GGTGTTGCTG ACCATATCAT TAATAACAAT GGAACACTGG ATGGCACATT AACCACCGGT
GCCGGTAATG ACACGTTAGT GCTAGACAGT AGCTCGCAAA GTAACGATGT GATTAACCTG
GGGGATGGCA ATAATAGCGT GACCATTCAG AATGGGGCGA CAGTGTCCTC CATTATCACT
GGGAATGGTA ATGATACTTT CACGATCAAT GGGATGAGCG TGGGGAGTAC CTACCTCGGT
TCGCTGGATG CGGGCACGGG GCTCAATACC CTAAATTTCA ATGCCTCAAC CGACGAACTA
GCGGCAGCCA CGTCACTTCA GGGCTTCACC AATATTAATC TTGTCGACAG CCATATCACT
CTGGTGTCTG ACGATAATAT CGGCAGCGGT ATGGTCAATA TCGATAGCAG TAGTGAGCTG
CTATTTGGTA GCACGTTTGA TGGGATTTTG CATGCGACAT TAGGCGCTGG CACGGGCTCT
GCCATTGTTA ATAACAGCGC CAACGTGTCG TTAGAGCAGG CCAGTATGTT TGCTGGCACG
TGGCAAGTTA ATCAGGGCGG GGCGCTAACC GCCAGTAACA GTAACCAATT AGGTTCGGCC
AAGATTGGGT TGGACGGTAC GTTGAACCTG GACAACATTG CACTATTCAA TCATGTGCTG
ACCGGGAACG GCACACTGAA TGTGGCGAAG AACCTTGCCA CCACCGCGTT TGACTTTGGT
TCGACGGTGG GCGGGGCGTT TAGTGGGATC GTCAATCTGA CAAAGACTAC TTTTGCTTTA
AGTGCGGATA ACGCGGCGGC ACTGGCCAGT GCCACCTTAA AGCTGTCGGA TGACAGTGTG
ACCACCGTGG GCACCACTGA CCGCACCCTG CACGGGCTGG ATTTGAGTGG CGGGACGCTG
ATTTTTGATG GTGCGGTGCC GCAGTCTCAG ACCAGCGGGG TTGTCACGGT CACTGATCTG
GCACTGAACA GCGGGACGGT CAATATCACC GGCTCTGGTA GCTGGGATAA CACCGATCCG
CTGGCAACAA ATGTGTCGAT CCTTGAACAG GATCGTGCTG GCTCGACGCT GGAACTGATT
AATGCCACTA ATGTGACCGG AGATGTTGAT GCCTTGGATT TACTGGTCAA TGGCACGGCC
ATTACTTCTG GTACGCAAGG GGTGCAGTCT GCCATTCAGC AGGGCGGTAG TACGGTAGCC
AATGCCATCC ACAATTATGG CCTGACCAGC AGTAACAGCA ATGGTGACAG CGGCCTGTAT
GTGAATTACA CCCTGAGTGC GCTGGAGTTG TTAGCCGATG GCGCTGATGC GTTGTTGTTG
GCAACCGAAA GCGGTTTGAC GGCCAACAGA GTACTGAATG CCGAGCTATT CGGTGTTGGC
GGTTTGGTGG TGGATGCCCA AAATGGTGCC TTAACCTTGG CTAACGGCAA TAACAGTTAC
GAGGGAACGA CCACCGTCAA TTTCGGTGAG TTGATCCTCG GGGCGAATGG GGCTTTCGGC
CAGACGTCAT TACTGGATAT CGCCAGTGGG GCCAGTGCCA ATATTAATGG CTACCGCCAG
ACAGTGGGGG CAGTGACCAA TACCGGTACG GTAACGTTGG GCAGCGGCGG GGTGTTAACC
AGTGGCCTGC TGACCAATGG CGGAGTCCTT GATTTAACGG GGGGCGCACT CAATCTGACT
GCGGGCGGGA CGTCTACCGT GGCAGGCGGC TTGACCGGTG CCGGAACCCT GAATATTAAT
GGTGGTAATT TGGCGGTCAG TGCCGCCAAC AGTGGCTTGA GCGGGCAGAC CCATATTGCC
GATGTGGCCT CGGTGACCAT GACCGGGACG GGTACGTTGG GTACCAGCGC CGTCGAGGTG
CTGGGTACGC TGAACCTGAA CGGTGCCAAT GCAGCCATGA CCAACGTACT CAGTGGTGAC
GGGACGATTA ATACCAACGC GGCAGTCACG CTGAGCGGGA ATAACAGCTT TAGTGGTGCA
CATCAGATCG GTGCCAGTGG CGCACTGACC GTCGGTCAGG CCAGTAATCT GGGGGCCAGC
AGCGCCACGG TTAATCTGGG CACCCTCACT TCTCATCTGA TCTTGAATGG CGTTAGCGAG
AGCATTGCCA ACGTTCTGAG CGGTGTAGCG GGTTCAACGG TAGATATTAT CGGCGGGGCA
GATACCGCAC TGACGGCCAA TAACAGCGGT TTCCTCGGCC AATATGCCTT GGCGGGTAAC
AGCAAACTGA CGGTTGGGTC AACGAACAAT CTGGGGGCGT CATCCAGCGT GACGCTGGCC
GGGGCGGACG ATACTCTGTC GCTGAGCGGT TTTAACGGCA CCTTTGGTAA CAGCGTCACC
GGTAGCGGTG TACTGCAAGT GACCGATGAT GCCGAGGTTA CCCTGACCAG CAGCAACGGG
GTAGGCAATA CGGTGAAGGT CGATATCGCT GATGCGACGT TGAATCTGGA CGATATTGCT
CTGTTCGACC ATGTACTGAC CGGGAACGGC ACACTGAATG TAGCGAAGAG CCTTGCCACC
ACCGCGTTTG ACTTTGGTTC GACGGTAGGC GGGGCCTTTA GTGGGATCGT CAATCTGACC
AATACCACCT TTGCCTTAAG TGCGGATAAC GCGGCGGCAC TGGCCAGTGC CACCTTAAAG
CTGTCGGATG ACAGTGTGAC CACCGTCGGC ACCACTGACC GCACCCTGCA CGGGCTGGAT
TTAACTGGCG GGACGCTGAT CTTTGACGGT TCGCCGCCAC AGTCTCAGGC CAATGGGGTC
GTCACGGTCA CTGATCTGGC ACTGAACAGC GGAACGGTCA GCATTACCGG GGTGGGCAAC
TGGGAGAATG AATCTCCAGT GACGTCACCG AATGTATCGA TCCTTGAACA GGATCGTGCT
GGCACGACGC TGGAGCTAAT TAATGCGACT AATGTGACCG GAGATGTTGA TGCTCTGGGT
CTGATGATTA ATGGCACCGC CATTACTGCC GATTCACAAG GGGTGGAGTC TGCCATCCAG
CAGGGCGGTA GCACGGTGGC CAATGCCATC CACAATTATG GCCTGACCAG CAGTAACAGC
AATGGTGACA GCGGCCTGTA TGTGAATTAC ACCCTGAGTG CGCTGGAGTT GTTAGCCGAT
GGCGCTGATG CGTTGTTGTT GGCAACCGAA AGCGGTTTGA CGGCCAACAG AGTACTGAAT
GCCGAGCTAT TCGGTGTTGG CGGTTTGGTG GTGGATGCCC AAAATGGTGC CTTAACCTTG
GCTAACGGCA ATAACAGTTA CGAGGGAACG ACCACCGTCA ATTTCGGTGA GCTGATCCTC
GGGGCGAATG GGGCCTTCGG CCAGACGTCA TTACTGGATA TCGCCAGTGG GGCCAGTGCG
AATATTAATG GCTACCGCCA GACAGTGGGG GCAGTGTCCA ATACCGGTAC GGTAACGTTG
GGCAGCGGCG GGGTGTTAAC CAGTGGCCTG CTGACCAATG GCGGAGTCCT TGATTTAACG
GGGGGCGCAC TCAATCTGAC TGCGGGCGGG ACGTCTACCG TGGCAGGCGG CTTGACCGGT
GCCGGAACCC TGAATATTAA TGGTGGTAAT TTGGCGGTCA GTGCCGCCAA CAGTGGCTTG
AGTGGCCAGA CCCATATTGC CGATGTGGCC TCGGTGACCA TGACCGGGAC GGGTACGTTG
GGTACCAGCG CCGTCGAGGT GCTGGGTACG CTGAACCTGA ACGGTGCCAA TGCGGCCATG
ACCAACGTAC TCAGTGGTGA CGGGACGATT AATACCAACG CGGCGGTCAC GCTGAGCGGG
AATAACAGCT TTAGTGGTGC ACATCAGATC GGTGCCAGTG GCGCACTGAC CGTGGGTCAG
GCCAGTAATC TGGGGGCCAG CAGCGCCACG GTTAATCTGG GCACCCTCAC TTCTCATCTG
ATCTTGAATG GCGTTAGCGA GAGCATTGCC AACGTTCTGA GCGGTGTAGC GGGTTCAACG
GTAGATATTA TCGGCGGGGC AGATACCGCA CTGACGGCCA ATAACAGCAA CTTCCTTGGC
CAATATGCCT TGGCGGGTAA CAGCAAACTG ACGGTTGGGT CAACGAACAA TCTGGGGGCG
TCATCCAGCG TGACGCTGGC CGGGGCGGGC GATACTCTGT CGCTGAGCGG TTTTAACGGC
ACCTTTGGTA ACAGCGTCAC CGGTAGCGGT GTACTGCAAG TGACCGATGA TGCCGAGGTT
ACCCTGACCA GCAGCAACGG GGTAGGCAAT ACGGTGAAGG TCGATATCGC TGATGCGACG
TTGAATCTGG ACGATATTGC TCTGTTCGAC CATGTACTGA CCGGGAACGG CACACTGAAT
GTAGCGAAGA GCCTTGCCAG CACCGCGTTT GACTTTGGTT CGACGGTGGG CGGGGCCTTT
AGTGGGATCG TCAATCTGAC CAATACCACC TTTGCCTTAA GTGCGGATAA CGCAGCGGCA
CTGGCCAGTG CCACCTTAAA ATTGTCGGAT GACAGTGTGA CCACGGTAGG CACCACTGAC
CGCACCCTGC ACGGGCTGGA TTTAAATGGC GGGACGCTGA TCTTTGATGG TTCGCCGCCA
CAATCTCAGG CTAACGGGGT CGTCACGGTT ACTGATCTGG CACTGAACAG CGGGACGGTC
AGCATTACCG GGGCGGGCAA CTGGGAGAAT GAACATCCGG TGACGCCACC GAATGTGTCG
CTCCTTGAGC AGGATCGGGG TGACATTTTA CTGGAGCTGA TTAATGCCGC GAATGTCACC
GGAGATGCCA ATGATTTGGA TCTGATGGTT GATGGCACCG CCATTACTGC CGATTCAAGC
GGGGTGCAGT CTGCTGTCCA GCAGGGCGGT AGCACGGTGG CCAATGCCAT CCATAATTAT
GGCCTGACCA GCAGTAACGG CAATGGCGGC AGTGGCCTGT ATGTGAATTA CACCCTGAGT
GCGCTGGAGT TATTAGCCGA TGGCGCTAAT GCGTTGTTGC TGGCAACCGA AAGCGGTTTG
ACGGCCAACA GAGTACTGAA TGCCGAGCTA TTCGGTGTTG GTGGTTTGGT GGTGGATGCC
CAAAATGGTG CCTTAACCTT GGCTAACGGC AATAACAGTT ACGAGGGAAC GACCACCGTC
AATTTCGGTG AGTTGATCCT CGGGGCGAAT GGGGCTTTCG GCCAGACGTC ATTACTGGAT
ATCGCCAGTG GGGCCAGTGC GAATATTAAT GGCTACCGCC AGACGGTGGG GGCGGTGACC
AATACCGGTA CGGTAACGTT GGGCAGCGGC GGGGTGTTAA CCAGTGGCCT GCTGACCAAT
GGCGGGGTCC TTGATTTAAC GGGGGGCGCA CTCAATCTGA CTGCGGGCGG GGCGTCTACC
GTGGCAGGCG GCTTGACCGG TGCCGGAACC CTGAATATTA ACGGCGGTAA TTTGGCGGTC
AGTGCCGCCA ACAGTGGCTT GAGCGGCCAG ACCCATATTG CCGATGTGGC CTCGGTGACC
ATGACCGGGA CGGGTACGTT GGGTACCAGC GCCGTCGAGG TGCTGGGTGC GCTGAACCTG
AACGGTGCCA ATGCAGCCAT GACCAACGTA CTCAGTGGTG ACGGGACGAT TAATACCAAC
GCGGCAGTCA CGCTGAGCGG GAATAACAGC TTTAGTGGTG CACATCAGAT CGGTGCCAGT
GGCGCACTGA CCGTGGGACA GGCCAGTAAT CTGGGGGCCA GCAGCGCCAC GGTTAATCTG
GGCACCCTCA CTTCTCATCT GATCTTGAAT GGCGTTAGCG AGAGCATTGC CAACGTTCTG
AGCGGTGTGG CGGGTTCAAC GGTAGATATT ATCGGCGGGG CAGATACCGC ACTGACGGCC
AATAACAGCA ACTTCCTTGG CCAATATGCC TTGGCGGGTA ACAGCAAACT GACGGTTGGG
TCAACGAACA ATCTGGGGGC TTCATCCAGC GTGACGCTGG CCGGGGCGGG CGATACTCTG
TCGCTGAGCG GCTTTAACGG CACCTTTGGT AACAGCGTCA CTGGCAACGG TGTACTGCAA
GTGACCGATG ATGCCGAGGT CACTCTGACC AGCAGCAACG GGGTAGGCAG CGCGGTAACC
ATTGATATCG CCGACGCGAC GCTGAATCTG GACGATATTG CTCTGTTTAA TCATGCGTTG
ACCGGTAACG GCTTGCTGAA TGTGGCGAAA AACGATGCCA GCACCGCGTT TGACTTTGGT
TCGACGGTGG GCGGGGCCTT TAGTGGGATC GTCAATCTGA CCAATACCAC CTTTGCCTTA
AGTGCGGATA ACGCAGCGGC ACTGGCCAGT GCCACCTTAA AATTGTCGGA TGACAGTGTG
ACCACGGTAG GCACCACTGA CCGCACCCTG CACGGGCTGG ATTTAAATGG GGGGACGCTG
ATCTTTGATG GTTCGCCGCC ACAATCTCAG GCTAATGGAG TCGTCACGGT CACTGATCTG
GCACTGAACA GCGGAACGGT CAGTATTACC GGGGCGGACA ACTGGGAGAA TGAACATCCG
GTGACGCCAC CGAATGTGTC GCTCCTTGAG CAGGATCGGG GTGACATTCT GCTGCAACTG
ATTGATGCCG ATAATGTGAC CGGCAATGCC AATGATCTGG AGCTGATGAT CAATGGCACC
ACTATTACCC CTGGGCAAGG GGTGCAGTCT ACTGTCCAAC AGGGCGGGTC TACGGTGGCG
AATGCTACGC ATAACTATGG CCTGACCAGC AATGGCGGCA GTGGCCTGTA TGTGAATTAC
ACCCTGAGTG CGCTGGAGTT GTTAGCCGAT GGCGCTAATG CGTTGTTGCT GGCGACCGAA
AGCGGTTTGA CGGCCAACAG AGAACTGAAT GCCGAGTTAT CCGGTGTCGG CGGTTTGGTG
GTGGATGCCC AAAATGGCGC TTTAACCTTG GCTAACGGCA ATAACAGTTA CGAGGGAACG
ACAACCGTCA CTGCGGGGGA ATTGATCCTC GGGGCGAATG GGGCATTCGG CCAGACGTCA
TTACTGAATA TCGCCAGTGG GGCCAGTGCG AATATTAATG GCTACCGCCA GACGGTGGGG
GCGGTGACCA ATACCGGTAC GGTAACGTTG GGCAACGGTG GGGAGTTAAC CAGTACTGAC
ACCTTGATCA ATACCGGAAT TATTAATGTG ACCGATGGCA TCCTGAATCT GGAGAATGGG
GGGACTTCTA GCATTAGCGG CGGCTTAACG GGCAACGGTA TCCTGAATAT CAAGGGTGGC
GATTTCACCA TCAGCATCGA TAACAATGGT CTGGCGGGGC AAACCAATAT TGCCGATGGT
GCATCAGTCA CTCTTGGCAA TGGGGGGACC ATGCTAGGAA CCGGTAATTT GGGCAGCAGC
GTTATTGATG TGCTGGGGGA TCTAAACCTG GTCGCGGATA ATTCACTGGC TAACGTGATC
AGTGGTGACG GGACGATTAA TACCACAGCA ACAGTGACGC TGAGCGGTAA TAGCAGCTTT
AGTGGTGCAC ATCAGATCGG GACCAATGGC GAACTGACCG TGGGTCAGGC CAGTAATCTG
GGGGCCAGCA GCGCCACGGT TAATCTGGGC ACTATCACTT CTCATCTGAT CTTGAATGGC
GTTAGCGAGA GTATTGCCAA CGTTCTGAGC GGTGTGGCGG GTTCAACAGT CGATATTATC
GGCGGAGCAG ATACCGCACT GACGGCCAAT AACAGCGGCT TCCTCGGCCA GTATGCCTTG
GCGGGTAACA GCAAACTGAC GGTTGGGTCA ACGAACAATC TGGGGGCGTC ATCCAGCGTG
ACGCTGGCAG GGGCGGGCGA TACCCTGTCG CTGAGCGGCT TTAACGGCAC CTTTGGTAAC
AGCGTCACTG GCAACGGTGT ACTGCAAGTG ACCGATGATG CCGAGGTCAC CCTGACCAGC
AGCAACGGGG TAGGCAGCGC GGTAACCATT GATATCGCCG ACGCGACGCT GAATCTGGAC
GATATTGCTC TGTTTAATCA TGCGTTGACC GGTAACGGCT TGCTGAATGT GGCGAAAAAC
GATGCCACCA CCGCGTTTGA CTTTGGTTCG ACGGTGGGCG GGGCCTTCAC CGGCACGGTC
AACCTGAACA ATTCTACTTT TGATTTAAGC GGCAATAACA CCACAGCATT GGCCCAGGTC
ACGTTGAAAT TATCCAGCGG TAACCTGACC TCGGTGGGCA ACGGTGTGCA GAATATTGGC
ACATTGGCGA TGAATGGCGG CACGTTGCTG TTTGATAATA TTGTTGATAA CTCAGGCATT
ATCACTTCAG ATGGGACGAT TGCGGCTAAT AGCATCGATA CCACCGGGGG CGGTGAAGTT
CGGGTTAATT TACCGAACAA TCTGGCTCCA AGTCTGGATG GGCTCTCGGT GATGGAACTG
GATGAAGGCG AAATCATTGT CACTCTGGCA ACCGGGACAG CGACAGGGAC AGGCCATGAG
TTGACATTGA CGGATGAGAA TGGTGACCCA ATAAGCGCGG TTACTTATCA GGGCGTCCAT
AACGCTGGCA GTACCTCAGC CGCCGCCACC GGTTCGTTTA ATTACGGCAT GACCACTGGC
GAGGATTATG ATGGCCTGTA TGTCAATTAC GGTCTGACCG CGTTAGAACT ATTGAGCACG
GGCAGCGAGG CGTTGGTATT GACCGCCACC TTGGCGAATA ACGGGACTCA ATCTAACGAT
CTTTCCGCTC AAATTACAGG GAGCGGTGAT CTGGCCTTTG CGTCGGCTAA TGATGGCAGC
ACGGCATCCT TATCTAATAG CACCAACAGC TATACCGGCA CCACTTGGGT CTCTTCAGGT
AATTTACGTC TGGATGCCGA TTCAGCACTG GGACAAACCT CCTTGCTGGC GATGAGTACC
GCCACTCATG TTGATATCAA CGGTACCCAG CAGGTGGTGG GTGAGTTAGC CACCGAAGGG
GGCAGTACAC TGGATCTCAA CGACGGTAAG TTAACCGTAA CGGGGGGCGG CCAAATTGAT
GGTGCATTGA CTGGCGGCGG TGAACTGGTA CTGAGTGGTG GTTTGCTGAA TGTTTCTTAT
GATAACACTG GCTTTACTGG CAGTACGGAT ATTGCCAATG GCGCGGTGGC ACATCTGTCT
CAGGCGCAGG GGCTAGGGAA CGGCACCATT AATAACAACG GCACACTTCA TCTGGATAAT
ACTATCGGGA CACTGTTTAA CGCTTTGACC GGTAGTGATG GCGAGGTATT GCTAAGTAAC
AATGCCAGTG TTCAATTGGC CGGTGATAAC AGCGGTTACT CGGGTCTGTT TACTAATCAG
GCCGGGAGTA TTCTCATTGC CAATAGTGCC GAGCATTTAG GTGGCAGCAG CATCGCCAAT
AGCGGTGCGT TGATCCTGAA TACGGGTTCA GTCTGGGAGT TAACCAATAC CATCAGCGGT
ACGGGTACCT TGGTTAAGCG CGGCAGCGGA ACGGTAAAAA TTGAAGGCGA TACAGTTAGC
GCAGGTCTGA CTACGATTGA AGAGGGTTTG CTGCAATTGG GCAGTTCGGC GGTTACCCAG
ACACTTTCGC TGGAAGAGTC CCTGCAAGAG GATGCACTGC TGGTATCATT CGCATCGAAT
ATGGCAAATC TGACCAGTAA CGTACTGATT ACCGCGAACG GCTCCTTAGG GGGATATGGT
CAGGTGACCG GTAATGTTGA GAACCATGGC AACCTGATTA TGCCAAATGC CTTAACGGGC
GGGGATTTTG GCACCTTCAC CATTGATGGT AATTATACCG GTGATGAGGG GATGATCACC
TTCAATACTA TCCTGGCCGG AGATACATCG GTAACGGATA GACTGGTTAT TACCGGGGGG
ACTGCAGGGC AAAGTTATGT CACGGTAAAC AATATTGGGG GTGTCGGGGC GCGCACCTTT
GAAGGTATCA AAATTATTGA TGTCGGTGGT GATTCTGCCG GGCAGTTTAC CCTGAACGGG
CGCGCCGTTG GCGGTGCTTA TGAGTACTTC TTGTATCAAG GTGGGGCCAG CACTCCAGAT
GACGGCGACT GGTATCTGCG TACTCAGGCA GATGACCGCC GCCCTGAACC GGCGAGTTAC
ACCGCTAACC TGGCGGCGGC CAACAATATG TTTGTTACCA GTTTGTCTGA CAGGATGGGT
GAAACGCTGT ATACCGATGT CTTTACCGGT GAACAGAAGA CCACCAGCCT GTGGCTGCGT
AACGAAGGTA GCCATAATCG CTCCCGCGAT GATAGCGGCG AGTTGCACAC TCAGGATAAC
CGTTATGTGA TGCAACTTGG CGGCGATGTG GCGCAATGGA GCCGCAATGC ACAGGACCTG
TGGCGTGTTG GGGTGATGGC GGGCTATGCC AATAGCAGCA GTTCTACCGT GGCAAAGGTT
GCTGGCTACC GTTCTACTGG CTCGGTGGAT GGCTACAGCG TGGGGATCTA TGGCTCATGG
CTTGCCGATA ACGCCGATGA TACTGGCGCG TATGTCGATT CTTGGGTGCA ATACAGTTGG
TTTGACAACA ACGTTAGTGG GCAGGATTTA GCCGCTGAGA AATATGACTC AAAAGGCTTT
ACCGCGTCAG TGGAAGGGGG CTATGCCTTC AAAGTTGGCG AAAGTGTTAA CCAGAGCTAC
TTTATTCAGC CAAAAGCACA GGTGGTGTGG ATGGGCGTAA AAGCCGATGA CCATACAGAA
ACCAATGGTA CGGTTATCTC TGGTGACGGT AATGGCAATA TCCAGACGCG ACTCGGGGCG
AAGGCCTTTA TCAATCCAAG TGATAAAGCC AAAGTCAGCG GCCCGGCATT CAAGCCTTTT
GTTGAAGCCA ATTGGATCCA TAACACCAAA GATTTTGGCA CGACATTAGA CGGTGTCACG
GTGAAACAAG CCGGGACGGC GAATATTGCG GAGCTGAAAC TGGGCGTTGA TGGGCAGATA
AATAACCAGC TGAATCTTTG GGGAAATATC GGCCAGCAAG TGGGTAACAA GGGCTACAGC
GAAACCAGCG TGGTGTTAGG CGTTAAATAT AATTTCTGA
 
Protein sequence
MNTIFKVIWN ASLNVWVVVS ELAKGRIKTK SSRNLISEGV LPKFEQSMVS KLFRKNLLAL 
SLGSIVFLST GPVFAADITV STQAELSAAL SNGTYDKIIL GADITLIGSL TVNMTSNQVV
IDGQGKFGLT VNNTTNYGLV VSSGSGTLTL QNMSKIDSAN YYSMVVLNGA NTAVNVIYNN
IDFLGSSQLI YMGAYGAATN SIMTFGDILN DVVVNDRAQE IGEVNKLAFT GRFHVTHTGS
SVTSFVSTGG ANNTSTMDFA SGADVKIDRT GSTGDLTSTG VNAFAYTFAD GASFELIANQ
NVFSGTTTNR GLEIGSYNSI DGFGSGVKIV LQSRSDGSII SGNGIDNATT NAAGINNNAS
GDANVIYNLG TGSILKATNT GILATKNANN ASDIYIRSAG GITAATGISA THNGTGTVKI
KNDGTITSTT AGIAISSASI KEISVDNTDG TITATAGTGV NVLASAILNL FGGTINTSAT
ANGITFAGTE GGHTLTDLTI NLLGTGIALS NVAGVNLTLS NVTLNTLNGT ALNSLTGLTL
VDSLNGRNTI NIEGAGIGIA ATNTELNTFD AEALDINVNG AGIGIQATGG GVNLSASNLI
INVANTLGTA LQITDGIDNT TTIGNEIQLN AENATAINFL GSSSKTLNNN GTIKGSVIFA
GVADHIINNN GTLDGTLTTG AGNDTLVLDS SSQSNDVINL GDGNNSVTIQ NGATVSSIIT
GNGNDTFTIN GMSVGSTYLG SLDAGTGLNT LNFNASTDEL AAATSLQGFT NINLVDSHIT
LVSDDNIGSG MVNIDSSSEL LFGSTFDGIL HATLGAGTGS AIVNNSANVS LEQASMFAGT
WQVNQGGALT ASNSNQLGSA KIGLDGTLNL DNIALFNHVL TGNGTLNVAK NLATTAFDFG
STVGGAFSGI VNLTKTTFAL SADNAAALAS ATLKLSDDSV TTVGTTDRTL HGLDLSGGTL
IFDGAVPQSQ TSGVVTVTDL ALNSGTVNIT GSGSWDNTDP LATNVSILEQ DRAGSTLELI
NATNVTGDVD ALDLLVNGTA ITSGTQGVQS AIQQGGSTVA NAIHNYGLTS SNSNGDSGLY
VNYTLSALEL LADGADALLL ATESGLTANR VLNAELFGVG GLVVDAQNGA LTLANGNNSY
EGTTTVNFGE LILGANGAFG QTSLLDIASG ASANINGYRQ TVGAVTNTGT VTLGSGGVLT
SGLLTNGGVL DLTGGALNLT AGGTSTVAGG LTGAGTLNIN GGNLAVSAAN SGLSGQTHIA
DVASVTMTGT GTLGTSAVEV LGTLNLNGAN AAMTNVLSGD GTINTNAAVT LSGNNSFSGA
HQIGASGALT VGQASNLGAS SATVNLGTLT SHLILNGVSE SIANVLSGVA GSTVDIIGGA
DTALTANNSG FLGQYALAGN SKLTVGSTNN LGASSSVTLA GADDTLSLSG FNGTFGNSVT
GSGVLQVTDD AEVTLTSSNG VGNTVKVDIA DATLNLDDIA LFDHVLTGNG TLNVAKSLAT
TAFDFGSTVG GAFSGIVNLT NTTFALSADN AAALASATLK LSDDSVTTVG TTDRTLHGLD
LTGGTLIFDG SPPQSQANGV VTVTDLALNS GTVSITGVGN WENESPVTSP NVSILEQDRA
GTTLELINAT NVTGDVDALG LMINGTAITA DSQGVESAIQ QGGSTVANAI HNYGLTSSNS
NGDSGLYVNY TLSALELLAD GADALLLATE SGLTANRVLN AELFGVGGLV VDAQNGALTL
ANGNNSYEGT TTVNFGELIL GANGAFGQTS LLDIASGASA NINGYRQTVG AVSNTGTVTL
GSGGVLTSGL LTNGGVLDLT GGALNLTAGG TSTVAGGLTG AGTLNINGGN LAVSAANSGL
SGQTHIADVA SVTMTGTGTL GTSAVEVLGT LNLNGANAAM TNVLSGDGTI NTNAAVTLSG
NNSFSGAHQI GASGALTVGQ ASNLGASSAT VNLGTLTSHL ILNGVSESIA NVLSGVAGST
VDIIGGADTA LTANNSNFLG QYALAGNSKL TVGSTNNLGA SSSVTLAGAG DTLSLSGFNG
TFGNSVTGSG VLQVTDDAEV TLTSSNGVGN TVKVDIADAT LNLDDIALFD HVLTGNGTLN
VAKSLASTAF DFGSTVGGAF SGIVNLTNTT FALSADNAAA LASATLKLSD DSVTTVGTTD
RTLHGLDLNG GTLIFDGSPP QSQANGVVTV TDLALNSGTV SITGAGNWEN EHPVTPPNVS
LLEQDRGDIL LELINAANVT GDANDLDLMV DGTAITADSS GVQSAVQQGG STVANAIHNY
GLTSSNGNGG SGLYVNYTLS ALELLADGAN ALLLATESGL TANRVLNAEL FGVGGLVVDA
QNGALTLANG NNSYEGTTTV NFGELILGAN GAFGQTSLLD IASGASANIN GYRQTVGAVT
NTGTVTLGSG GVLTSGLLTN GGVLDLTGGA LNLTAGGAST VAGGLTGAGT LNINGGNLAV
SAANSGLSGQ THIADVASVT MTGTGTLGTS AVEVLGALNL NGANAAMTNV LSGDGTINTN
AAVTLSGNNS FSGAHQIGAS GALTVGQASN LGASSATVNL GTLTSHLILN GVSESIANVL
SGVAGSTVDI IGGADTALTA NNSNFLGQYA LAGNSKLTVG STNNLGASSS VTLAGAGDTL
SLSGFNGTFG NSVTGNGVLQ VTDDAEVTLT SSNGVGSAVT IDIADATLNL DDIALFNHAL
TGNGLLNVAK NDASTAFDFG STVGGAFSGI VNLTNTTFAL SADNAAALAS ATLKLSDDSV
TTVGTTDRTL HGLDLNGGTL IFDGSPPQSQ ANGVVTVTDL ALNSGTVSIT GADNWENEHP
VTPPNVSLLE QDRGDILLQL IDADNVTGNA NDLELMINGT TITPGQGVQS TVQQGGSTVA
NATHNYGLTS NGGSGLYVNY TLSALELLAD GANALLLATE SGLTANRELN AELSGVGGLV
VDAQNGALTL ANGNNSYEGT TTVTAGELIL GANGAFGQTS LLNIASGASA NINGYRQTVG
AVTNTGTVTL GNGGELTSTD TLINTGIINV TDGILNLENG GTSSISGGLT GNGILNIKGG
DFTISIDNNG LAGQTNIADG ASVTLGNGGT MLGTGNLGSS VIDVLGDLNL VADNSLANVI
SGDGTINTTA TVTLSGNSSF SGAHQIGTNG ELTVGQASNL GASSATVNLG TITSHLILNG
VSESIANVLS GVAGSTVDII GGADTALTAN NSGFLGQYAL AGNSKLTVGS TNNLGASSSV
TLAGAGDTLS LSGFNGTFGN SVTGNGVLQV TDDAEVTLTS SNGVGSAVTI DIADATLNLD
DIALFNHALT GNGLLNVAKN DATTAFDFGS TVGGAFTGTV NLNNSTFDLS GNNTTALAQV
TLKLSSGNLT SVGNGVQNIG TLAMNGGTLL FDNIVDNSGI ITSDGTIAAN SIDTTGGGEV
RVNLPNNLAP SLDGLSVMEL DEGEIIVTLA TGTATGTGHE LTLTDENGDP ISAVTYQGVH
NAGSTSAAAT GSFNYGMTTG EDYDGLYVNY GLTALELLST GSEALVLTAT LANNGTQSND
LSAQITGSGD LAFASANDGS TASLSNSTNS YTGTTWVSSG NLRLDADSAL GQTSLLAMST
ATHVDINGTQ QVVGELATEG GSTLDLNDGK LTVTGGGQID GALTGGGELV LSGGLLNVSY
DNTGFTGSTD IANGAVAHLS QAQGLGNGTI NNNGTLHLDN TIGTLFNALT GSDGEVLLSN
NASVQLAGDN SGYSGLFTNQ AGSILIANSA EHLGGSSIAN SGALILNTGS VWELTNTISG
TGTLVKRGSG TVKIEGDTVS AGLTTIEEGL LQLGSSAVTQ TLSLEESLQE DALLVSFASN
MANLTSNVLI TANGSLGGYG QVTGNVENHG NLIMPNALTG GDFGTFTIDG NYTGDEGMIT
FNTILAGDTS VTDRLVITGG TAGQSYVTVN NIGGVGARTF EGIKIIDVGG DSAGQFTLNG
RAVGGAYEYF LYQGGASTPD DGDWYLRTQA DDRRPEPASY TANLAAANNM FVTSLSDRMG
ETLYTDVFTG EQKTTSLWLR NEGSHNRSRD DSGELHTQDN RYVMQLGGDV AQWSRNAQDL
WRVGVMAGYA NSSSSTVAKV AGYRSTGSVD GYSVGIYGSW LADNADDTGA YVDSWVQYSW
FDNNVSGQDL AAEKYDSKGF TASVEGGYAF KVGESVNQSY FIQPKAQVVW MGVKADDHTE
TNGTVISGDG NGNIQTRLGA KAFINPSDKA KVSGPAFKPF VEANWIHNTK DFGTTLDGVT
VKQAGTANIA ELKLGVDGQI NNQLNLWGNI GQQVGNKGYS ETSVVLGVKY NF
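
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the method used to produce this record. It uses only the amino-acid assignments of translation table 11, which match the standard genetic code, and does not model alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored. Codons containing characters other than A, C, G, T raise
    KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGAATACAA TATTTAAAGT GATTTGGAAT"
print(translate(first_bases))  # MNTIFKVIWN
```

The result matches the first ten residues of the protein sequence. Applied to the last nine bases of the gene (AATTTCTGA), the same function returns NF, the final two residues, with TGA read as the stop codon.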