Gene SeSA_A2886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2886 
Symbol 
ID6518991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2780180 
End bp2791654 
Gene Length11475 bp 
Protein Length3824 aa 
Translation table11 
GC content58% 
IMG OID642747918 
Productlarge repetitive protein 
Protein accessionYP_002115700 
Protein GI194737591 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat
[TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.659132 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCTAC TCGCCGTGGT TTCGAAATTG ACTGGCGTCT CCACCACTGT GGAATCCTCA 
TCGGTCACTC TTAACGCCCC GTCAATTGTT AAATTATCAG TGGCCCGGGA AGAGATTAGT
CAACTTACGC GCATTAATCA GGATCTGGTG GTGACGCTCC ATTCCGGCGA AACGATCACG
ATTAAAAACT TTTACGTTAC CAACGATCTG GGCGCAAGCC AGCTGGTACT GGCGGAAAAT
GATGGCACGT TATGGTGGGT AGAAAATCCG CAAGCCGGGC TACATTTTGA ACAAATCGCT
GATATTAATG AGCTGCTGGT CACTTCCGGC ACTTCCCATG AAGCAGGCGG CGCCGTTTGG
CCGTGGGTAC TGGCTGGCGC GGTGGCGGCT GGCGGCATTG CCGCTATCGC GTCTTCCGGC
GGCGGCGATT CCCACCATCA TTCGGATGGC GATAATCCGC CCCCCGATAA CACCAATCCT
GACGGTAATC CCCCTGATAA CAGCAATCCC GGCGGCAGTA ACCCTAACGG CAATACTCCA
GGTAGCAGTA ATCCTGTAGA TACTACCCCG CCTCTCGCTC CCGGCAAGTT GCTGATTTCA
GCGGACGGAA AAACGGTGAG CGGCGAGGCC GAAGCGGGCA GTACCATTAC CATCAAAGAT
CCTTCAGGTA ACGTCGTTGG CGAGGGCAAA GCGGATAGCG ACGGTAAATT TAGTATTGAT
CTAACAGCGC CACAGATTAG CGGCGAACAA CTTACCGTGA CCGCGACTGA CGATGCCGGC
AATACCGGCC CATCCGCAAC CATTGATGCG CCCAACATTC CTCTCCCCGA TACACCAGCT
ATCACCGCCG CTATCGATGA TGCCGCTCCC CTCACCGGCA CGCTGAGCAA TAATCAGTTT
ACGAACGACA ATACCCCCAC GCTGGAGGGA ACCGGCAGCG CTGGCACAGT CATCCATATT
TACGCCAATG GTCAGGAAAT AGGCTCCACA ACGGTTGATA GCAGCGGAAA CTGGCATTTT
GCCATTACCA GCGCGCTAGC GGATGGGGAA AATCATTTCA CCGCCATTGC GACTAACGTT
AAAGGCGAAA GTAGCGAATC AGCCCGCTTT ACACTGACTA TCGACACACT CAGCCCCGAT
GCCCCACGCG TTGAACTGAT TGCCGATAAC ACCGGTTTGC TCACCGGGCC GCTACAGAAT
AATGACCGGA CTGACGAGGC AAAACCGCTA TTTTCCGGGC AGGGAGAGGC AGGCAATACC
ATCACGATTA AAGAGGGTTC AACCGTTATC GGCAGCGCTA CCGTAGACGA AAATGGACGC
TGGACCTTTA CCCCGACTAC ACCGTTAAGC GATGGCGAAC ATACCTTTAC CGTCGAACAA
AGCGACAAAG CCGGAAACAC GAGCCGCGTG ACGACAACGC CTACTATCAT TGTGGACACC
ACGCCGCCTG ACGCCGCTAT CATTGATAAT GTCGCGAAAG ACGGCACAAC CGTTAGCGGC
ACCGCTGAAG CTGGTAGTAC CGTGTCGATC TATGACCCGG CGGGAAATTA CCTGGGCTCC
ACGATTACCG GAGAAAATAA CCACTTCAGC ATCACGCTGA ATCCGGCTCA GACCCACGGC
GAGCGTCTGG AAGCGCGTAT TCAGGACGCC GTTGGTAACA TCGGCCCCGC CACGGAGTTT
ACCGCTTCTG ACTCACAGTA TCCTGCCCAG CCGACTATCC TTACCGTGAC GGATGACGCT
GGCACCGTTA CCGGGCTGCT GAAAAATGGC GATGCCACAG ATGATAACCG CCCAACCCTC
AGCGGTACTG CTGAACCAGG CAGTACGATA TCGATTAACG ATAATGGCTT TCCTGTACCG
ACCTTTCCGC CCATTGTCGC TGACGCTGAC GGCAAATGGA GCTTTACCCC CTCGCTGGCA
CTTGCCGATG GCGACCATGT CTTTACCGCT ACCGCGACCA ACGATCGCGG CACCAGCGGG
CAGTCCGCCT CCTTTACCAT TGATATCGAC ACGCAGCCGC CGGTGCTGGA AGGCCTGGCG
GTTAGCGACG TCGGCGACAG ACTTACCGGC ACTACGGAAG CTGGCAGTAC TGTGGTTATC
AAAGACAGCC AGGGAAATAC GCTCGGTAGT GGAACGGCAG GCGACGACGG TACCTTCTCA
ATAGGTATTA GCCCGGCGAA AATTAACGGC GAAACATTAA GCATTAGCGT TACCGATAAA
GCCACGAATA GCGGTCCGGT AGAAACGCTG AACGCGCCGG ATAAAACTGC GCCAGCGGCA
CCGGACGGCC TTATCGTGGC GACCGACGGT CTGTCCGTAA GCGGTCAGGC GGAAGCCGGG
GCAACGGTCA CTATCCGCGA CAGTAGCAAC ACCGTACTTG GCAGCGCCGT CGCTAACGGC
AACGGACAAT TTATCGTTCC GCTGAATACG GCGCAGACTA ACGGCCAGGC GCTTATCGCT
ACCGCCACCG ATGTCGCGAA AAACGAAAGC GCCGCCGCGA CGGTTATCGC GCCGGACAGT
ACCGCGCCGG AAATGCCAAA AAACGTGGTA ATTAGTGAGG ATGGCACTAG TATCAGCGGC
ACCGCCGAAC CGGGTAGCGC CATCACGATC GCCACGCCGG ATGGCACGCC GCTCGGCAGC
GGCAAAGCAG ATGGCGAAGG TCATTTTACC CTTCCCCTCG TCCCCGCACA GACCAACGGC
GAACAGGTTA CCGTCACCGC CACCGACAGC GCCAACAACG TCAGCCCGCC AACCACAGCG
CAAGCGCCCG ATATCACCGC CCCGGATAAG CCCATTATCA CTCAGGTGCT GGACGATGTT
GAAAGCTTCA CCGGGCCGCT GGTTAACGGA CAAACCACCA ACGACAACCG TCCTACCCTT
AGCGGTACGA CGGAGGCCGG CGCGCGTGTC GAAATTTTTG ATAACGGCGT TTCGCTGGGA
CTCGCCACGC TACAGCCCAA CGGTGGCTGG ACGTTTACGC CGTCGCAAAA TTTAGGTGAA
GGCGCGCATC AACTGACAGT AATCGCAACC GACGCTAAAG GCAATGCCAG TCCGGCCGCG
TCATTCGACC TGGTGGTCGA TACGCAATCG CCGCAGCAAC CGGTAATCAC CTTCATTACA
GATGATGCGC CAGGTATTCT CGGTAGCGTC GCGCATCTGG GGCTCACTAA CGACAGCACG
CCAACGATTA ACGGTACAGG TGAACCGGGT TCCACAGTAC ACCTGTATCA GAATGGTGCC
CGGATAGCAG ATATTATCGT CGGTAATTCC GGCGTCTGGA GCTACGCTTA CACCACGGCC
TCGCCACTGG CGGACGACAC CTACACCTTT ACCGTGACGG CCAGCGACAG TAACGGCAAC
ACCACGCCTT TTTCGACCGA TTTTACGATT ACCATTGATA CCCAGGCCCC TGCCGCCCCC
GGCGTTATCG GCGTAGCTGA CGGCGACGGA AATACGATTG ATACCAATCA GATTACCCAG
GAATCCCAGC CCCGGTTGAG CGGTAGCGGC ACCGCAGGCG ACACAATCAT CCTTTACGAT
AACGGCAATG CCATAGGTCA GGCGCTGGTC GGCACGGACG GGCGCTGGCA GTTTACGCCG
CCTGCCGCGC TGGGCGACGG CGACCACCTT CTGACCGCTC GCGCCAACGA TCCGGCGGGG
AACGAAAGTC CCGAATCCCT CAGCTTTACC CTGCGCGTCG ATACCCAGGC GCCGGATGCG
CCGCAGATCG TGTCAGCCGC CATCACAGGC GGAGAGGGCG AGGTGCTACT GGCAAACGGC
AGTATTACCA ATCAGCGTAT GCCGGCCCTC AGCGGCACCG GCGAACCCGG CGCCATCATC
ACCCTGTACA ATAACGGCGT AGAACTGGCT ACCGTCCAGG TCAATCCACA GGGTAGCTGG
ACCTATCCGC TAACCCGTAA TCTGAGCGAA GGGTTAAACA TCCTGACGGC CACCGCCACG
GATGCCGCAG GCAATAGTAG CCAGACCTCC GGCGTTTTCT CCGTTACCCT TGATACCCAG
CCTCCAGCGC AGCCTGACGC GCCGCTAATC AGCGATAACG TCGCGCCGGT TATCGGCAAC
ATCGGCAATA ATGGCGCAAC GAACGATACC ACGCCGACCT TCAGCGGCAC GGGAGAGATC
GGCAGCACGA TTATTCTCTA CAATAATGGC AGTGAAATTG GTCGCACAAC GGTAGGCGAT
AACGGAAGCT GGAGCTTTAC GCCTGCGGCA CTGACGCCAG AAACCTATAC CATTACCGTC
ACGGAAACCG ATATAGCGGG CAATATCAGT CCACCTTCCG CCTCAGTCAC TTTTACGCTA
GACACCACTG CGCCCGCCAA TCCAGTTATC ACTTTTGCCG AAGATAACGT CGGCGAAGTC
CAGGATACTA TTGTCAGCGG CGCAACCACT GACGACAATA CACCGGTCAT TCACGGCACT
GGCGACATCG GCAGCATTAT TACGCTCTAT AACGGCAGCA GCGTTTTAGG CGTAGTCACC
GTCGATGAGA CCGGCACCTG GACGCTGCCG GTGACCAGCG CGTTGCCGGA TGGCGTCTAC
ACCCTGACCA CCATTGCCGC CGATGCCGCC GGAAACAGCA GCGGCGTATC GAACAGCTTT
ACCTTCACCG TCGACACCGT TCCGTTGCAG CCGCCCGTCG TCAATGAAAT CCTCGACGAT
GTTGCGCCAG TGACCGGGCC ATTAACCGAT GGCGCCTTTA CTAACGATCG GACGCTGACT
ATCAACGGCA GCGGCGAAAA CGGCAGCACC GTCACGATTT ACGACAATGG CGTGGCAATC
GGTACGGCGC TCGTCACCGA CGGGGTCTGG ACATTCAATA CGCCCGAATT ATCAGAAGCC
AGCCATGCGC TAACCTTCAG CGCGACTGAC GATGCTGGAA ATACCACGGC GCAAACCCAA
CCGATCACCA TTACTGTGGA TATCACCGCC CCGCCCGCGC CAACGATCCA GACGGTGGAC
GACGATGGCA CGCGCGTCGC CGGACTTGCC GATCCTTACG CTACCGTTGA AATTCATCAT
GCCGATGGCA CCCTGGTCGG CAGCGCTGTC GCTAATGGCA CCGGTGAATT CGTCGTTACG
CTCAGTCCGG CGCAAACCGA TGGCGGTACG CTGACGGCAA TTGCTATCGA TCGCGCGGGG
AATAACGGCC CGGCTACGAA TTTTCCCGCT TCCGACAGCG GTCTGCCCGC CGTCCCGGCC
ATCACGGCGA TTGAAGATAA TGTCGGGAGC GTACAGGGGA ATATTGCGGC GGGCGGCGCC
ACGGACGACA CCACGCCGAC GCTGCGCGGC ACCACGGATA TCGGCTCTAC CGTTGAAGTT
TTCATTGATG GCGATTCGGC AGGCTTTGCC ACCGTTGACG CCAGCGGGAA CTGGATCTTT
GAGATCGCGA CGCCATTAAG CGAAAGCACA CATTACTTCA CCGTCCAGGC AACCAATGCG
AATGGCCCGG GCGGCCTGTC CGCACCGGTC GGGATCACTG TCGATCTTAG CGCGCCGGCG
CAACCGGTTA TTACCAGCGC AACGGATGAT GTCCCCGGCA TGACCGGTAC GCTGGATAAC
GGCGCGCTCA CCAATGATTC ACGCCCGACG CTCAACGGAA CGGGAGAAGC AGGCGCCACC
ATCCGCATTC TGGATAACGG CGTAGAAATC GGTTCCGCCA CGATAGATCA AAGCGGCAAC
TGGCGCTTCA CCCCGAACGC GCCGCTGGAG AGCATCGCAC ATATCTTTAC CGCCGTGGCG
ACCGATCCCG CCGGCAATAG CGGCCAGCCT TCGGACGGCT TTACGCTGAA CATTGACGCG
CAGGCGCCAG ATGTGCCGGT TATCACGTCC GTGATTGACG ATAACAATCA ACCGACCGTC
CCGGTGTTAC CGGGGCAATC CACCGACGAT CGGCAGCCAA TACTGAACGG AACTGGCGAA
CCTGGCGCGA CAATCACCAT TTTTGATAAC GGTACGCCGC TTGGCACGGC TCAGGTAAGC
GAAAACGGTA GCTGGACCTT CCCGGTGCCC CGCAATTTGT CAGAGGGAAG CCATAATCTG
ACAGTTAGCT CTACCGATCC GGCGGGCAAT ACCAGCGCGG TCTCCGCGCC GTGGACGATC
GTGGTCGATA TTACGCCTCC GGCGATCCCG GTTCTCACCT CCGTCGTGGA TGACCAGCCC
GGTATTACCG GCAACCTGGT AAGCGGGCAG CTAACGAACG ATGCGACGCC CACCCTGAAC
GGGCGCGGAG AGGCAGGCGC GACGATTAAT GTCTATCTTG ACGGTAATCC CGCGTCCATC
GGTACCACGA CGGTGAATAG CGACGGCACG TGGAGTTTTA CGCCACAGAC GCCGCTTGCA
AACGGTAGCC ACACGTTCAC CCTTAGCGCC ACCGATCCGG CGGGTAATAG CAGCGCGGTG
TCCAGCGGAT TTGTGCTGAC GATTGACGCC ACACCGCCCG CCGCGCCGGT TATCGCCAGC
GTGGCAGACA ATACGGTGCC GGTGACGGGC ATCGTCCCCA ACGGCGGCTC GACGAACGAA
ACCCGACCAA CACTTTCGGG TACCGGTGAG GCGGGTACAA CCATCTCGAT TTATAATGGC
AGCGCGCTGG TCGGCACGGC GCAAGTTCAG GCCAACGGTA GCTGGAGCTT TACGCCGTCT
ACCTCGCTGG GCGCGGGCGT CTGGAACCTG ACGGCGACAG CAACCGATGC GGCAGGCAAT
ACCAGCGCCG CGTCCGAAAT ACGCTCGTTT ACTATTGATA CCACGGCTCC CGCCGCGCCT
GTTATTGATA CGGTCTACGA CGGTACGGGC CCCATTACCG GCAATCTGAG TTCAGGACAG
ATCACAGACG AGGCGCGCCC TGTCATTAGC GGCACCCGTG AAGCCAACAC AACTATTCGT
CTCTACGATA ACGGCACGCT GCTGGCTGAA ATTCCCGCCG ACAATAGCAG TAGCTGGCGC
TACACGCCCG ATGCCTCTCT GGCGACGGGC AACCATGTAA TTACCGTCAT TGCCGTTGAT
GCCGCAGGCA ACGCCAGTCC CGTTTCAGAC AGCGTTAATT TCGTCGTCGA TACCACGCCG
CCGCTGACGC CGGTAATCAC ATCAGTCAGT GACGATCAGG CGCCAGGCCT CGGCACGATC
GCGAACGGCC AAAATACCAA CGATCCTACG CCAACCTTCA GCGGCACCGC AGAAGCCGGC
GCCACGATCA CGCTCTATGA AAATGGTACG GTCATTGGCA CGACAACGGC TCAATCTGAC
GGCGCGTGGA GCGTCTCCAC CTCAACGCTG GCAAGCGGAA CGCACGTCAT CACCGCCGTC
GCCACCGATG CCGCAGGAAA CAGCAGCCCG AACAGTACGG CTTTCACCCT GACGGTCGAT
ACCACCGCGC CGCAAACGCC AATCCTGACG TCCGTGGTGG ATGACGTCGC GGGCGGGGTC
ACAGGAAATC TCGCTAATGG TCAGATAACC AATGATAACC GCCCCACGCT GAACGGCACT
GCCGAAGCGG GCAGCGTGGT CAGTATCTAT GATGGCAACA CTCTGCTTGG CATCACCTCG
GCTAACGCGG GCGGCGCGTG GAGCTTCACG CCGACGACAG GGTTAAACGA CGGCACGCGC
ACATTAACAG TGACCGCCAC CGACCCGGCA GGCAACGTTA GCCCGGCCAC CAGCGGTTTT
ACTATCGTGG TCGATACCCT TGCGCCAACG GTTCCGCTTA TAACCAGCAT CGTTGATGAT
GTCCCGAACA ACACCGGCGC CATAGGCAAT GGACAATCGA CCAACGACAC ACAGCCGACG
CTCAACGGTA CCGCGGAAGC CAACAGCGCG GTAAGCATCT TCGATAATGG CGCGCTGGTC
GCGACCGTGA ACGCCAATGC CAGCGGCAAC TGGAGCTGGA CGCCAACCGC CGCGCTCGGC
CAGGGAAGTC ACGCCTATAG CGTTAGCGCC GCCGATGCGG CTGGCAACGT TAGCGCCGCT
TCGCCATCGA CAACGATTAT CGTGGATACC ATTGCGCCCG GCGCGCCCGG CAACCTGGTC
ATCAATGCTA CCGGTAATCG GGTGACGGGC ACCGCGGAAG CAGGCAGTAC AGTGACGATT
ACCTCTGAGA CTGGTGTGGT ACTGGGAACC GCCACCGCCG ACGGTACAGG CAGCTTCACC
GCCACACTCA CGCCCGCGCA GACCAATGGT CAGCCGCTAC TGGCATTTGC CCAGGATAAA
GCAGGCAACA CTGGCATTGC CGCCGGATTT ACCGCGCCCG ATACGCGCGT GCCGGAAGCA
CCGATCATCA CCAACGTAGT GGATGATGTG GGTATTTATA CCGGCGCTAT CGCCAACGGT
CAGGTCACTA ATGACGCACA ACCCACATTG AATGGTACCG CTCAGGCGGG CGCCACGGTG
AGCATTTATA ACAACGGGGC GCTGCTCGGC ACCACCACGG CGAACGCCAG CGGAAACTGG
AGCTTTACCC CGACAGGCAA TTTGACCGAA GGCAGCCACG CCTTCACCGC CACCGCGACT
AACGCCAACG GTACAGGCAG CGTCTCCACC GCCGCGACGG TGATTGTCGA TACGCTGGCG
CCCGGTACGC CGTCAGGTAC GCTCAGCGCC GATGGCGGTT CACTTTCCGG ACAGGCTGAG
GCAAACAGCA CCGTAACCGT CACGCTGGCG GGGGGCGTGA CGCTCACCAC CACCGCCGGC
AGCAACGGCG CATGGTCTCT CACCTTGCCG ACAAAACAAA TTGAAGGTCA ACTCATTAAC
GTGACGGCCA CTGACGCTGC GGGTAACGCC TCCGACACGT TAGGCATTAC CGCGCCGGTT
CTGCCGCTGG CGGCAAGGGA TAACATCACC AGCCTTGATC TGACCTCGAC CGCCGTCACC
AGCACGCAAA ACTATTCGGA TTACGGCCTG CTGCTGGTTG GCACGCTTGG CAATGTCGCC
TCGGTTTTGG GTAACGATAC CGCTCAGGTT GAGTTCACCA TTGCTGAAGG TGGTACGGGC
GACGTCACCA TTGATGCCGC CGCAACGGGA ATCGTGCTTT CGCTGCTCAG TACTCAGGAG
ATTGTGGTAC AGCGCTACGA CACCAGCCTC GGCGCCTGGA CGACGATCGT CAACACCGCC
GTTGGCGACT TCGCGAATTT GCTTACCCTG ACCGGGAGCG GCGTTACTCT GAACCTGAGC
GGCCTGGGCG AAGGCCAGTA CCGGGTACTC ACTTATAACA CCAGTCTGCT CGCCACCGGG
TCATATACCA GCCTGGATGT CGATGTACAC CAGACCAGCG CAGGTATTAT TAGCGGGCCA
ACCATCAGTA CCGGCAACGT CATGGCTGAT GATACCGCGC CGACGGGCAC CACGGTCACC
GCCATCACCA ACGCCAACGG CGTCAGTACG CAGGTCGGCG CAGGCGGCGT GGATATCCAG
GGACAATACG GCACGCTGCA CATTAATCAG GATGGCAGTT ACACCTACAC GCTGACTAAG
CCCACGGCAG GATACGGACA TAAAGAGAGC TTCACCTACA CCATCACCCA GAATGGCGTC
GGTAGCAGCG CCGCGCAACT GGTTATCAAT CTGGGTCCCG CTCCTGTACC GGGCAGCGTG
ATAGCGACAG ACAATAACGC CTCACTGGTC TTTGATACTC ACGTTAGCTA CGTCAACAAC
GGTCCCTCGA CACAAAGCGG CGTCACGGTA TTAAGCGTCG GACTTGGTAA TGTACTGAAC
GCGAATCTGC TTGATGATAT GACTAATCCG ATCATCTTTA ACGTTGAAGA AGGCGCTACG
CGAACCATGA CGTTACAGGG AACCGTCGGC GGCGTCTCAC TGGTTTCCAC GTTCGATCTG
TACGTTTATC GCTTCAACGA TGCCATTCAA CAATATGAGC AGTTCCGGGT GCAAAAGGGC
TGGATTAACA CCCTGCTGTT AGCCGGACAG TCCCAGCCGC TGACCCTGAC GTTGCCTGGC
GGCGAATACT TGTTCGTGCT GAATACCGCC AGCGGCATTA GCGTCCTCAC TGGCTATACG
CTGGCGATTT CCCAGGACCA CACCTATGCC GTTGACAGTA TCACCGCCAA CACCACCGGC
AACGTACTGA CCAATGATGT CGCCCCTACG GACGCCCTCC TCACTGAAGT AAACGGCGTG
GCGATTGCGG CGACCGGCAC GACGGAGGTA AACGGGTTGT ATGGCTCGCT CATCATTGAC
GCAAGAGGCA ACTATACCTA CACGCTGAAG AATGGCGTCG GTGCCGACAG CATTAAAACG
CCGGACAGCT TTATCTATAC GGTCAAAGCG CCAAACGGCG ATACCGATAC GGCCTCGCTC
AATATCACGC CAACCGCCAG GGCGCTGGAT GCGATTAATG ATGTCAGCGA TACCCTTAGC
GTCGCCACGC TTCAGGATAC CGCTGCCTGG CTGGACTCCA GCGTCGGCAG CGCCAGTTGG
GGGCTACTCG GTAAATCGGG CAGCGGGAGC GGCACCTTTG ACGTTGCAAC GGGCACCGTA
CTTAAAGGCG CGTCACTGGT CTTTGATGTC TCCACGCTCA TTACGCTGGG CAATCTGAAT
ATTAGCTGGG CCATTCAGGA GAACGGGACC GTCATACGCA ACGGAACCGT TCCGGTGGCG
AATATCACGC TGGGCAGCGC GACGGTGACC GTCAACCTGA GCGGCCTGGA GCTGGATGCC
GGAACGTACA CGCTTAACTT TACCGGCACC AATACCCTGG CCGGGGCGGC GACGATCACG
CCACGCGTCA TCGGCACCAC CGTCGATCTG GATAATTTTG AAACGTCCGG AACGCATACC
GTTCTCGGCA ATATTTTTGA CGGCAGCGAT GCGGCGGGGG CGATGGATCA GCTTAATACG
GTGAATACCC GCCTGAGCAT TAGCGGGTAT AACGGCAGCG CCGCCACGCT GGACGCCGCG
GCGAATACCA CCAGCGCCAC GATTCAGGGA CATTACGGCA CATTGCAAAT TAACCTCGAT
GGCGCTTACA CCTACACCCT GAATAATGGC GTCGCGATGT CGTCCATCAC CAGTAAAGAG
GTCTTTACCT ATCAACTGGA TGACAAGATG GGTCATACGG ATAGCGCCAC ATTGACCATT
GATATGGCGC CGCAAATCGT CAGTACCAAC CAAAACGATG TTCTCATCGG CTCCGCCTAT
GGCGATACGC TGATTTACCA CCTGTTAAAC GGCGCGGACG CGACCGGCGG CAATGGCGCC
GATCGCTGGC AAAACTTCTC CACCGCGCAG GGCGACAAGA TCGATATCCA CGAACTGCTG
ACCGGCTGGG ATCATCAGGC GGCGACGCTG GGTAACTTTG TTCAGGTTCA TACCAGCGGC
GCCAATACGG TGATATCCGT CGATCGCGAC GGCGTCGGCA GCGCGTTTAA ATCCACTGAC
CTTGTCACTC TGGAGAATGT GCAGCTCACG CTAAATGATC TTTTGCAAAA CAACCACCTG
ATAACCGGCG GTTGA
 
Protein sequence
MPLLAVVSKL TGVSTTVESS SVTLNAPSIV KLSVAREEIS QLTRINQDLV VTLHSGETIT 
IKNFYVTNDL GASQLVLAEN DGTLWWVENP QAGLHFEQIA DINELLVTSG TSHEAGGAVW
PWVLAGAVAA GGIAAIASSG GGDSHHHSDG DNPPPDNTNP DGNPPDNSNP GGSNPNGNTP
GSSNPVDTTP PLAPGKLLIS ADGKTVSGEA EAGSTITIKD PSGNVVGEGK ADSDGKFSID
LTAPQISGEQ LTVTATDDAG NTGPSATIDA PNIPLPDTPA ITAAIDDAAP LTGTLSNNQF
TNDNTPTLEG TGSAGTVIHI YANGQEIGST TVDSSGNWHF AITSALADGE NHFTAIATNV
KGESSESARF TLTIDTLSPD APRVELIADN TGLLTGPLQN NDRTDEAKPL FSGQGEAGNT
ITIKEGSTVI GSATVDENGR WTFTPTTPLS DGEHTFTVEQ SDKAGNTSRV TTTPTIIVDT
TPPDAAIIDN VAKDGTTVSG TAEAGSTVSI YDPAGNYLGS TITGENNHFS ITLNPAQTHG
ERLEARIQDA VGNIGPATEF TASDSQYPAQ PTILTVTDDA GTVTGLLKNG DATDDNRPTL
SGTAEPGSTI SINDNGFPVP TFPPIVADAD GKWSFTPSLA LADGDHVFTA TATNDRGTSG
QSASFTIDID TQPPVLEGLA VSDVGDRLTG TTEAGSTVVI KDSQGNTLGS GTAGDDGTFS
IGISPAKING ETLSISVTDK ATNSGPVETL NAPDKTAPAA PDGLIVATDG LSVSGQAEAG
ATVTIRDSSN TVLGSAVANG NGQFIVPLNT AQTNGQALIA TATDVAKNES AAATVIAPDS
TAPEMPKNVV ISEDGTSISG TAEPGSAITI ATPDGTPLGS GKADGEGHFT LPLVPAQTNG
EQVTVTATDS ANNVSPPTTA QAPDITAPDK PIITQVLDDV ESFTGPLVNG QTTNDNRPTL
SGTTEAGARV EIFDNGVSLG LATLQPNGGW TFTPSQNLGE GAHQLTVIAT DAKGNASPAA
SFDLVVDTQS PQQPVITFIT DDAPGILGSV AHLGLTNDST PTINGTGEPG STVHLYQNGA
RIADIIVGNS GVWSYAYTTA SPLADDTYTF TVTASDSNGN TTPFSTDFTI TIDTQAPAAP
GVIGVADGDG NTIDTNQITQ ESQPRLSGSG TAGDTIILYD NGNAIGQALV GTDGRWQFTP
PAALGDGDHL LTARANDPAG NESPESLSFT LRVDTQAPDA PQIVSAAITG GEGEVLLANG
SITNQRMPAL SGTGEPGAII TLYNNGVELA TVQVNPQGSW TYPLTRNLSE GLNILTATAT
DAAGNSSQTS GVFSVTLDTQ PPAQPDAPLI SDNVAPVIGN IGNNGATNDT TPTFSGTGEI
GSTIILYNNG SEIGRTTVGD NGSWSFTPAA LTPETYTITV TETDIAGNIS PPSASVTFTL
DTTAPANPVI TFAEDNVGEV QDTIVSGATT DDNTPVIHGT GDIGSIITLY NGSSVLGVVT
VDETGTWTLP VTSALPDGVY TLTTIAADAA GNSSGVSNSF TFTVDTVPLQ PPVVNEILDD
VAPVTGPLTD GAFTNDRTLT INGSGENGST VTIYDNGVAI GTALVTDGVW TFNTPELSEA
SHALTFSATD DAGNTTAQTQ PITITVDITA PPAPTIQTVD DDGTRVAGLA DPYATVEIHH
ADGTLVGSAV ANGTGEFVVT LSPAQTDGGT LTAIAIDRAG NNGPATNFPA SDSGLPAVPA
ITAIEDNVGS VQGNIAAGGA TDDTTPTLRG TTDIGSTVEV FIDGDSAGFA TVDASGNWIF
EIATPLSEST HYFTVQATNA NGPGGLSAPV GITVDLSAPA QPVITSATDD VPGMTGTLDN
GALTNDSRPT LNGTGEAGAT IRILDNGVEI GSATIDQSGN WRFTPNAPLE SIAHIFTAVA
TDPAGNSGQP SDGFTLNIDA QAPDVPVITS VIDDNNQPTV PVLPGQSTDD RQPILNGTGE
PGATITIFDN GTPLGTAQVS ENGSWTFPVP RNLSEGSHNL TVSSTDPAGN TSAVSAPWTI
VVDITPPAIP VLTSVVDDQP GITGNLVSGQ LTNDATPTLN GRGEAGATIN VYLDGNPASI
GTTTVNSDGT WSFTPQTPLA NGSHTFTLSA TDPAGNSSAV SSGFVLTIDA TPPAAPVIAS
VADNTVPVTG IVPNGGSTNE TRPTLSGTGE AGTTISIYNG SALVGTAQVQ ANGSWSFTPS
TSLGAGVWNL TATATDAAGN TSAASEIRSF TIDTTAPAAP VIDTVYDGTG PITGNLSSGQ
ITDEARPVIS GTREANTTIR LYDNGTLLAE IPADNSSSWR YTPDASLATG NHVITVIAVD
AAGNASPVSD SVNFVVDTTP PLTPVITSVS DDQAPGLGTI ANGQNTNDPT PTFSGTAEAG
ATITLYENGT VIGTTTAQSD GAWSVSTSTL ASGTHVITAV ATDAAGNSSP NSTAFTLTVD
TTAPQTPILT SVVDDVAGGV TGNLANGQIT NDNRPTLNGT AEAGSVVSIY DGNTLLGITS
ANAGGAWSFT PTTGLNDGTR TLTVTATDPA GNVSPATSGF TIVVDTLAPT VPLITSIVDD
VPNNTGAIGN GQSTNDTQPT LNGTAEANSA VSIFDNGALV ATVNANASGN WSWTPTAALG
QGSHAYSVSA ADAAGNVSAA SPSTTIIVDT IAPGAPGNLV INATGNRVTG TAEAGSTVTI
TSETGVVLGT ATADGTGSFT ATLTPAQTNG QPLLAFAQDK AGNTGIAAGF TAPDTRVPEA
PIITNVVDDV GIYTGAIANG QVTNDAQPTL NGTAQAGATV SIYNNGALLG TTTANASGNW
SFTPTGNLTE GSHAFTATAT NANGTGSVST AATVIVDTLA PGTPSGTLSA DGGSLSGQAE
ANSTVTVTLA GGVTLTTTAG SNGAWSLTLP TKQIEGQLIN VTATDAAGNA SDTLGITAPV
LPLAARDNIT SLDLTSTAVT STQNYSDYGL LLVGTLGNVA SVLGNDTAQV EFTIAEGGTG
DVTIDAAATG IVLSLLSTQE IVVQRYDTSL GAWTTIVNTA VGDFANLLTL TGSGVTLNLS
GLGEGQYRVL TYNTSLLATG SYTSLDVDVH QTSAGIISGP TISTGNVMAD DTAPTGTTVT
AITNANGVST QVGAGGVDIQ GQYGTLHINQ DGSYTYTLTK PTAGYGHKES FTYTITQNGV
GSSAAQLVIN LGPAPVPGSV IATDNNASLV FDTHVSYVNN GPSTQSGVTV LSVGLGNVLN
ANLLDDMTNP IIFNVEEGAT RTMTLQGTVG GVSLVSTFDL YVYRFNDAIQ QYEQFRVQKG
WINTLLLAGQ SQPLTLTLPG GEYLFVLNTA SGISVLTGYT LAISQDHTYA VDSITANTTG
NVLTNDVAPT DALLTEVNGV AIAATGTTEV NGLYGSLIID ARGNYTYTLK NGVGADSIKT
PDSFIYTVKA PNGDTDTASL NITPTARALD AINDVSDTLS VATLQDTAAW LDSSVGSASW
GLLGKSGSGS GTFDVATGTV LKGASLVFDV STLITLGNLN ISWAIQENGT VIRNGTVPVA
NITLGSATVT VNLSGLELDA GTYTLNFTGT NTLAGAATIT PRVIGTTVDL DNFETSGTHT
VLGNIFDGSD AAGAMDQLNT VNTRLSISGY NGSAATLDAA ANTTSATIQG HYGTLQINLD
GAYTYTLNNG VAMSSITSKE VFTYQLDDKM GHTDSATLTI DMAPQIVSTN QNDVLIGSAY
GDTLIYHLLN GADATGGNGA DRWQNFSTAQ GDKIDIHELL TGWDHQAATL GNFVQVHTSG
ANTVISVDRD GVGSAFKSTD LVTLENVQLT LNDLLQNNHL ITGG