Gene Shewmr7_3642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr7_3642 
Symbol 
ID4256140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-7 
KingdomBacteria 
Replicon accessionNC_008322 
Strand
Start bp4321997 
End bp4336372 
Gene Length14376 bp 
Protein Length4791 aa 
Translation table11 
GC content53% 
IMG OID638124326 
Productputative outer membrane adhesin like protein 
Protein accessionYP_739679 
Protein GI114049129 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat
[TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATCTTT TGATTAACAC TATCGGGTGT TGGGAGCTTA TTATGGGATC GATCATTACT 
ACAAAGAAAG GTTTGCTCAA ACTGGTCAAA GGGCAAATCG AAGTCGAGGT TAATGGTTCA
AATCAACCCG CTAAGGATGG CGAGCAACTC CCTAAAGGTG CTGTGCTACA TATTGGTGAA
AATGCGACCT ACGAAATCAC CTTTGATGAT GGTACTAAAC TTTCTAATGA AGTTCCCCCT
AACGCAACAG CTGCCGCCCC CAACACGGCA AATGAAGCTG CCCTAGATGA AATTCAAGCA
TTGCAAGATC TTATCGCTTC AGGCGAAGAT CCAACTCAGA ACCTGCCAGA AACCGCAGCA
GGTAATGCTC CTGCGAGTGA TGGTAATTCA GGCTATGTCA CCTTAGCCCG TGATGGTACA
GAAGCCTTAG CAACATCAGG TTATTCAACC TCAGGACAAA CGCTCACAGC AGCCACAACT
GCCGCACCAG AGCAAACCGT AGCAACCGAC TCTCCTTCTG TATTAGCTAA CGACAGCAAT
ACAGTCGATG AAGATACTGT TGCTTCAGGC AATGTGCTCG ATAATGACGT GGATGCCGAT
AATGAATTAA CTGTGGTTGG GTTTAATCTT AATGGAACCG ACTTTGCGGC GGGCACGGTT
GTTACACTCG AAGGTGGCAG CTTTATTCTA AATGCTGATG GTTCTTATAC CTTTACCCCT
AATGAGAACT GGAACGGCCA AGTTCCTGTT ATTACCTACA CCACTAATAC TGGTTCGACT
GCCACATTAA CGATTAACAT CACCCCAGTT GATGACCCAT CGGTATTAGC AAATGATAGC
AATACCATCG CTGAAGATAC CGTCGCCACA GGTAATGTGC TTGATAACGA TTCAGATGTT
GATTCCAGCT TAAGTGTGGT GAGCTTTACA GTATCGGACA GCACCTTTGC AGCAGGCACA
ACTGTCGCTC TCGAAGGTGG CAGTTTAGTA CTAAACGCCG ATGGATCTTA CACCTTTACT
CCTAATGAAA ACTGGAACGG CCAAGTTCCT GTCATTACCT ACACAACCAA TACTGGTTCG
ACTGCCACAT TAACGATTAA CGTCACCCCA GTTGATGACC CATCGGTATT AGCAAACGAC
ACCAATACCA TCGCTGAAGA TACCGTCGCC ACAGGTAATG TGCTTGATAA CGACTCAGAT
GTTGATTCCA GCTTAAGTGT GGTGAGCTTT ACAGTATCGG GCAGCACCTT TGCAGCTGGC
GCAACTGTCG CTCTCGAAGG CGGCAGCTTA GTACTCAACG CAGATGGTTC ATATAGCTTT
ACCCCTAATG CTGATTGGAA TGGCCAAGTT CCTGTTATTA CCTACACCAC CAATACCGGC
TCAACTGCAA CTTTAACCAT TAATGTCACT CCTGTTGATG ATCCATCGGT ATTAGCAAAC
GACACTAATA CCATCGCTGA AGATACCGTC GCCACAGGTA ATGTGCTTGA TAACGACTCA
GATGTTGATT CCAGCTTAAG TGTGGTGAGC TTTACAGTAT CGGGCAGCAC CTTTGCAGCT
GGCACAACTG TCGCTCTAGC AGGCGGCAGC TTAGTACTCA ATGCCGATGG CTCTTACACC
TTTACCCCAA GTGAAAACTG GAATGGCCAA GTTCCTGTTA TTACCTACAC CACCAATACT
AGTTCGACTG CAACCTTAAC CATTAATGTC ACCCCTGTGG CAGATGGCGC CCCAAGCGTC
ACTATTACAA CAGATACAGA CAATGATGGT TTCATTAGTA ATACAGAACT CGGTGGTGCG
ACAGAAGTTA GCGTCACTAT CGGATTAGAT GGCACTGGCG CAAATGCAGG TGACACTCTA
ACAGTTAATG GTGTTGACTA TGTGTTGACG CAAGAAGATA TCAACAATGG TTTTGTCAAT
TTAACACTTC CTGCCCCAGC TGAAGGTGAA ACCATTACTG TAGTGGCAAC AATCACTGAT
CCTGCGGGTA ACACCTCGCC TGAAGGAAGC GATTCCGCAC TGCTAGACAC AACGGCCCCC
ACGATTACCG TCAGCGCGCC TGATGATACC CGAGATACAA CCCCAACCAT TACAGGTACA
ACGGATGCAG CCCCTGGCAG CGTCATCACT ATTGTTGTTA CTGACAGCAA TGGTGTGCAG
CAAACTCTTA CCACCACAGT GAATCCTGAC GGTACATATG CTGTTGATGT GATTAATCCA
ATTCCTCATG GTGGCTATAC TGCAACTGCA ACAGTGACAG ATCCAGCCGG CAATACGGGC
AACGCCACTG ATAATGGTAA TGTCGATATC AAGATCGATG AAGACGGCGA TGGCAACACC
GTGGCCATCA CCAGCATCAC CCAAGACACC GGCAGCTCCA GCAGCGACTT TATTACTAAC
GACAACACCT TAGTGTTCCA CGGTACCGTC GATTTGGATG ATGAAAGCAC TTTAGTCGTG
ACCATCAACG GGGTGGATTA CACCACCGCC AACGGCTTAG TGATTGATGC GCAAGGTAAC
TGGAGTGTCG ACCTGACCGG CACCGCGCTG CCTGACGGCA CCTATCTGGT GGTCGCCACC
GTCACCGACG TGGCCGGTAA CACCACCAGT GCGACCCAAA ATGTGGTGAT TGATACCAAG
ATTGATCAAG ACAGCGACGG CAACACCGTG GCCATCACCA GCATCACCCA AGACACTGGC
AGCTCCAGCA GCGACTTTAT CACCAACGAC AACACCTTAG TGTTCCACGG CACCGTTGAT
TTGGATGATG AAAGCACTTT AGTCGTGACC ATCAACGGTG TGGATTACAC CACCGCCAAC
GGCTTAATGA TTGATGCACA AGGTAACTGG AGTGTTGACC TGACCGGCAC CGCGCTGCCT
GACGGCATCT ATACTGTGGT CGCCACTGTC ACCGACATTG CAGGAAACAC CACCAGTGCA
ACGCAAGATG TGATTGTCGA TACACAAATT GGGCTAGGGC GAGATAACGC TGTAACGATT
ACAAGTATCA CCGAAGATAC TGGTAGCTCC AATACAGACT TTATTACCAA CGACAACACC
TTAGTATTCC AAGGTACCGT CGAATTGGAT GGCAACAGTA ACTTGGTTGT CACGATCAAT
GGCGTAGATT ACACCATAGG CAATGGTCTA GTGATCGATG AGAATGGCCA CTGGAGTATT
GACCTGACTG GCACCGCGCT GCCTGACGGC ACCTACCCGG TGGTCGCCAC CGTCACCGAC
ATCGCAGGTA ATAGCAAAAC TGTCACTCAA GATGTGGTGA TTGATACCAA GATTGATCAA
GACAGCGACG GCAACACCGT GGCCATCACC AGCATCACCC AAGACACCGG TAGCTCCAGC
AGCGACTTTA TTACTAACGA CAACACCTTA GTGTTCCACG GCACCGTTGA TTTGGATGAT
GAAAGCACTT TAGTCGTGAC CATCAACGGG GTGGATTACA CCACCGCCAA CGGCTTAGTG
ATTGATGCCC AAGGTAACTG GAGTGTGGAC CTGACCGGCA CCGCGCTGCC TGACGGCACC
TACCCCGTGG TCGCCACCGT CACCGACGTG GCCGGTAACA CCACCAGTGC GACCCAAAAT
GTGGTGATTG ATACCAAGAT TGATCAAGAC AGCGACGGCA ACACCGTGGC CATCACCAGC
ATCACCCAAG ACACCGGCAG CTCCAGCAGC GACTTTATCA CCAACGACAA CACCCTGGTG
TTCCACGGCA CCGTTGATTT GGATGATGAA AGCACTTTAG TCGTGACCAT CAACGGGGTG
GATTACACCA CCGCCAACGG CTTAGTGATT GATGCCCAAG GTAACTGGAG TGTGGACCTG
ACCGGCACCG CGCTGCCTGA CGGCACCTAC CCCGTGGTCG CCACCGTCAC CGACGTGGCC
GGTAACACCA CCAGTGCGAC CCAAAATGTG GTGATTGATA CCAAGATTGA TCAAGACAGC
GACGGCAACA CCGTGGCCAT CACCAGCATC ACCCAAGACA CCGGCAGCTC CAGCAGCGAC
TTTATTACTA ACGACAACAC CCTGGTGTTC CACGGCACCG TTGATTTGGA TGATGAAAGC
ACTTTAGTCG TGACCATCAA CGGGGTGGAT TACACCACCG CCAACGGCTT AGTGATTGAT
GCCCAAGGTA ACTGGAGTGT GGACCTGACC GGCACCGCGC TGCCTGACGG CACCTACCCC
GTGGTCGCCA CCGTCACCGA CGTGGCCGGT AACACCACCA GTGCGACCCA AAATGTGGTG
ATTGATACCA AGATTGATCA AGACAGCGAC GGCAACACCG TGGCCATCAC CAGCATCACC
CAAGACACCG GCAGCTCCAG CAGCGACTTT ATTACTAACG ACAACACCCT GGTGTTCCAC
GGCACCGTTG ATTTGGATGA TGAAAGCACT TTAGTCGTGA CCATCAACGG GGTGGATTAC
ACCACCGCCA ACGGCTTAGT GATTGATGCC CAAGGTAACT GGAGTGTGGA CCTGACCGGC
ACCGCGCTGC CTGACGGCAC CTACCCCGTA GTCGCCACCG TCACCGACGT GGCCGGTAAC
ACCACCAGTG CCACCCAAAA TGTAGTGATT GACACCACCG CCGATGCGGC GACCCCAGTG
GTCACCATCC TCGATGACGT GAACAACGAT GGCATCATCA ACAAGACCGA GCTGGGCAGC
GACGACGTGC AGTTGCAAGT CAACGTCAAC CACAGCGAGC TGCTCCAAGG CGGCACCATT
AACCTGACCA TTGTCAATGA CGGCGTCAGC AGCAATGTCA GCCTCAAACT GGTGGGCGGC
GTGCTGACCT TCGCCGACGG CACCCCTGCC ACCGGCTTTA GCTATAACAA CGGCGTGATC
ACCTGGACGG CCGTCGTAGC CGAAGGCAAA ACCATCAGCG TCACCGCCAC GCAAACTGAC
AGCGACGGCA ATACCTCTGC TGAAGGCTCT GACAGCGCCA AGGTTGACAC CACCGCCGAT
GCGGCAGCCC CAGTGGTCAC CATCCTCGAT GACGTGAACA ACGATGGCAT CATCAACAAG
ACCGAGCTGG GCAGCGACGA CGTGCAGTTG CAAGTCAACG TCAACCACAG CGAGCTGCTC
CAAGGCGGCA CCATTAACCT GACCATTGTC AATGACGGCG TCAGCAGCAA TGTCAGCCTC
AAACTGGTGG GCGGCGTGCT GACCTTCGCC GACGGCACCC CTGCCACCGG CTTTAGCTAT
AACAACGGTG TGATCACCTG GACGGCCGTC GTAGCCGAAG GCAAAACCAT CAGCGTCACC
GCCACGCAAA CTGACAGCGA CGGCAATACC TCTGCTGAAG GCTCTGACAG CGCCAAGGTT
GACACCACCG CCGATGCGGC AGCCCCAGTG GTCACCATCC TCGATGACGT GAACAACGAT
GGCATCATCA ACAAGACCGA GCTGGGCAGC GACGACGTGC AGTTGCAAGT CAACGTCAAC
CACAGCGAGC TGCTCCAAGG CGGCACCATT AACCTGACCA TTGTCAATGA CGGCGTCAGC
AGCAATGTCA GCCTCAAACT GGTGGGCGGC GTGCTGACCT TCGCCGACGG CACCCCTGCC
ACCGGCTTTA GCTATAACAA CGGTGTGATC ACCTGGACGG CCGTCGTAGC CGAAGGCAAA
ACCATCAGCG TCACCGCCAC GCAAACTGAC AGCGACGGCA ATACCTCTGC TGAAGGCTCT
GACAGCGCCA AGGTTGACAC CACCGCCGAT GCGGCAGCCC CAGTGGTCAC CATCCTCGAT
GACGTGAACA ACGATGGCAT CATCAACAAG ACCGAGCTGG GCAGCGACGA CGTGCAGTTG
CAAGTCAACG TCAACCACAG CGAGCTGCTC CAAGGCGGCA CCATTAACCT GACCATTGTC
AATGACGGCG TCAGCAGCAA TGTCAGCCTC AAACTGGTGA GCGGCGTGCT GACCTTCGCC
GACGGCACCC CTGCCACCGG CTTTAGCTAT AACAACGGTG TGATCACCTG GACGGCCGTC
GTAGCCGAAG GCAAAACCAT CAGCGTCACC GCCACGCAAA CTGACAGCGA CGGCAATACC
TCTGCTGAAG GCTCTGACAG CGCCAAGGTT GACACCACCG CCGATGCGGC AGCCCCAGTG
GTCACCATCC TCGATGACGT GAACAACGAT GGCATCATCA ACAAGACCGA GCTGGGCAGC
GACGACGTGC AGTTGCAAGT CAACGTCAAC CACAGCGAGC TGCTCCAAGG CGGCACCATT
AACCTGACCA TTGTCAATGA CGGCGTCAGC AGCAATGTCA GCCTCAAACT GGTGGGCGGC
GTGCTGACCT TCGCCGACGG CACCCCTGCC ACCGGCTTTA GCTATAACAA CGGTGTGATC
ACCTGGACGG CCGTCGTAGC CGAAGGCAAA ACCATCAGCG TCACCGCCAC GCAAACTGAC
AGCGACGGCA ATACCTCTGC TGAAGGCTCT GACAGCGCCA AGGTTGACAC CACCGCCGAT
GCGGCAGCCC CAGTGGTCAC CATCCTCGAT GACGTGAACA ACGATGGCAT CATCAACAAG
ACCGAGCTGG GCAGCGACGA CGTGCAGTTG CAAGTCAACG TCAACCACAG CGAGCTGCTC
CAAGGCGGCA CCATTAACCT GACCATTGTC AATGACGGCG TCAGCAGCAA TGTCAGCCTC
AAACTGGTGG GCGGCGTGCT GACCTTCGCC GACGGCACCC CTGCCACCGG CTTTAGCTAT
AACAACGGCG TGATCACCTG GACGGCCGTC GTAGCCGAAG GCAAAACCAT CAGCGTCACC
GCCACGCAAA CTGACAGCGA CGGCAATACC TCTGCTGAAG GCTCTGACAG CGCCAAGGTT
GACACCACCG CCGATGCGGC AGCCCCAGTG GTCACCATCC TCGATGACGT GAACAACGAT
GGCATCATCA ACAAGACCGA GCTGGGCAGC GACGACGTGC AGTTGCAAGT CAACGTCAAC
CACAGCGAGC TGCTCCAAGG CGGCACCATT AACCTGACCA TTGTCAATGA CGGCGTCAGC
AGCAATGTCA GCCTCAAACT GGTGGGCGGC GTGCTGACCT TCGCCGACGG CACCCCTGCC
ACCGGCTTTA GCTATAACAA CGGTGTGATC ACCTGGACGG CCGTCGTAGC CGAAGGCAAA
ACCATCAGCG TCACCGCCAC GCAAACTGAC AGCGACGGCA ATACCTCTGC TGAAGGCTCT
GACAGCGCCA AGGTTGACAC CACCGCCGAT GCGGCAGCCC CAGTGGTCAC CATCCTCGAT
GACGTGAACA ACGATGGCAT CATCAACAAG ACCGAGCTGG GCAGCGACGA CGTGCAGTTG
CAAGTCAACG TCAACCACAG CGAGCTGCTC CAAGGCGGCA CCATTAACCT GACCATTGTC
AATGACGGCG TCAGCAGCAA TGTCAGCCTC AAACTGGTGG GCGGCGTGCT GACCTTCGCC
GACGGCACCC CTGCCACCGG CTTTAGCTAT AACAACGGCG TGATCACCTG GACGGCCGTC
GTAGCCGAAG GCAAAACCAT CAGCGTCACC GCCACGCAAA CTGACAGCGA CGGCAATACC
TCTGCTGAAG GCTCTGACAG CGCCAAGGTT GACACCACCG CCGATGCGGC GGCCCCAGTG
GTCACCATCC TCGATGACGT GAACAACGAT GGCATCATCA ACAAGACCGA GCTGGGCAGC
GACGACGTGC AGTTGCAAGT CAACGTCAAC CACAGCGAGC TGCTCCAAGG CGGCACCATT
AACCTGACCA TTGTCAATGA CGGCGTCAGC AGCAATGTCA GCCTCAAACT GGTGGGCGGC
GTGCTGACCT TCGCCGACGG CACCCCTGCC ACCGGCTTTA GCTATAACAA CGGTGTGATC
ACCTGGACGG CCGTCGTAGC CGAAGGCAAA ACCATCAGCG TCACCGCCAC GCAAACTGAC
AGCGACGGCA ATACCTCTGC TGAAGGCTCT GACAGCGCCA AGGTTGACAC CACCGCCGAT
GCGGCAGCCC CAGTGGTCAC CATCCTCGAT GACGTGAACA ACGATGGCAT CATCAACAAG
ACCGAGCTGG GCAGCGACGA CGTGCAGTTG CAAGTCAACG TCAACCACAG CGAGCTGCTC
CAAGGCGGCA CCATTAACCT GACCATTGTC AATGACGGCG TCAGCAGCAA TGTCAGCCTC
AAACTGGTGG GCGGCGTGCT GACCTTCGCC GACGGCACCC CTGCCACCGG CTTTAGCTAT
AACAACGGTG TGATCACCTG GACGGCCGTC GTAGCCGAAG GCAAAACCAT CAGCGTCACC
GCCACGCAAA CTGACAGCGA CGGCAATACC TCTGCTGAAG GCTCTGACAG CGCCAAGGTT
GACACCACCG CCGATGCGGC GGCCCCAGTG GTCACCATCC TCGATGACGT GAACAACGAT
GGCATCATCA ACAAGACCGA GCTGGGCAGC GACGACGTGC AGTTGCAAGT CAACGTCAAC
CACAGCGAGC TGCTCCAAGG CGGCACCATT AACCTGACCA TTGTCAATGA CGGCGTCAGC
AGCAATGTCA GCCTCAAACT GGTGGGCGGC GTGCTGACCT TCGCCGACGG CACCCCTGCC
ACCGGCTTTA GCTATAACAA CGGTGTGATC ACCTGGACGG CCGTCGTAGC CGAAGGCAAA
ACCATCAGCG TCACCGCCAC GCAAACTGAC AGCGACGGCA ATACCTCTGC TGAAGGCTCT
GACAGCGCCA AGGTTGACAC CACCGCCGAT GCGGCAGCCC CAGTGGTCAC CATCCTCGAT
GACGTGAACA ACGATGGCAT CATCAACAAG ACCGAGCTGG GCAGCGACGA CGTGCAGTTG
CAAGTCAACG TCAACCACAG CGAGCTGCTC CAAGGCGGCA CCATTAACCT GACCATTGTC
AATGACGGCG TCAGCAGCAA TGTCAGCCTC AAACTGGTGG GCGGCGTGCT GACCTTCGCC
GACGGCACCC CTGCCACCGG CTTTAGCTAT AACAACGGTG TGATCACCTG GACGGCCGTC
GTAGCCGAAG GCAAAACCAT CAGCGTCACC GCCACGCAAA CTGACAGCGA CGGCAATACC
TCTGCTGAAG GCTCTGACAG CGCCAAGGTT GACACCACCG CCGATGCGGC AGCCCCAGTG
GTCACCATCC TCGATGACGT GAACAACGAT GGCATCATCA ACAAGACCGA GCTGGGCAGC
GACGACGTGC AGTTGCAAGT CAACGTCAAC CACAGCGAGC TGCTCCAAGG CGGCACCATT
AACCTGACCA TTGTCAATGA CGGCGTCAGC AGCAATGTCA GCCTCAAACT GGTGGGCGGC
GTGCTGACCT TCGCCGACGG CACCCCTGCC ACCGGCTTTA GCTATAACAA CGGTGTGATC
ACCTGGACGG CCGTCGTAGC CGAAGGCAAA ACCATCAGCG TCACCGCCAC GCAAACTGAC
AGCGACGGCA ATACCTCTGC TGAAGGCTCT GACAGCGCCA AGGTTGACAC CACCGCCGAT
GCGGCAGCCC CAGTGGTCAC CATCCTCGAT GACGTGAACA ACGATGGCAT CATCAACAAG
ACCGAGCTGG GCAGCGACGA CGTGCAGTTG CAAGTCAACG TCAACCACAG CGAGCTGCTC
CAAGGCGGCA CCATTAACCT GACCATTGTC AATGACGGCG TCAGCAGCAA TGTCAGCCTC
AAACTGGTGG GCGGCGTGCT GACCTTCGCC GACGGCACCC CTGCCACCGG CTTTAGCTAT
AACAACGGTG TGATCACCTG GACGGCCGTC GTAGCCGAAG GCAAAACCAT CAGCGTCACC
GCCACGCAAA CTGACAGCGA CGGCAATACC TCTGCTGAAG GCTCTGACAG CGCCAAGGTT
GACACCACCG CCGATGCGGC AGCCCCAGTG GTCACCATCC TCGATGACGT GAACAACGAT
GGCATCATCA ACAAGACCGA GCTGGGCAGC GACGACGTGC AGTTGCAAGT CAACGTCAAC
CACAGCGAGC TGCTCCAAGG CGGCACCATT AACCTGACCA TTGTCAATGA CGGCGTCAGC
AGCAATGTCA GCCTCAAACT GGTGAGCGGC GTGCTGACCT TCGCCGACGG CACCCCTGCC
ACCGGCTTTA GCTATAACAA CGGCGTGATC ACCTGGACGG CCGTCGTAGC CGAAGGCAAA
ACCATCAGCG TCACCGCCAC GCAAACTGAC AGCGACGGCA ATACCTCTGC TGAAGGCTCT
GACAGCGCCA AGGTTGACAC CACCGCCCCT AATGCACCAA CCGTATTAAT TGTTGATGAT
GGTAATCCAG GTGATGGCCT GCTAACGACT GCTGAGATGG GCAATGATGG CGTACAACTG
ACAGTATCAA TCAACGATAC TGATTTTGAA GCCGGTGGTT ATGTCACGCT TACCATTAAT
GGTGGTGCCG CGATTGAACT TAGCTTTGCC GACTTCACTG ATAATGGCAG TGGTACCCTT
ACCTTCGGTA ACTTTACCTA CGCTAACGGC GTTATCAGCT GGAGCGAATC CGCGCCTACT
GCAGGTCAAA GCATCACTGT TACTGCAACG CAAACCGATG CTGTAGGTAA CATATCTACA
CAAGGCTCTG ACACTGCAAC TGTATATCAA CCTAACTCTT GCAACGTAAT CGTTAATGAA
AGCAGTCTAC GCGATAATAT TCCTGATACC TTCTCTAATA CTATTAGCTT CACTGCGGGT
AATCAGAATA TTACACAATT CCGTTTTAAC GACTCCTCTA TCTCTGCGGC AACTAACCTA
GCTGCAGGTG TCAGCATCAT TTGGGCACTA GCCGCAAATG GTGAGTTAGT AGGAACCATT
GGTGGTATTG AAGTAATTAA AGTCACTCTG AGTGGCACTC CTGTGACGTC AGGACTTGCT
GCTGGCACTA CGGGTGCAGT TACTGTTAAT GTTGAGTTAT TAGACAACAT CAAGCAGGTA
AATGGATTAA GTGGCGAAAA TCTCAGTTCA CTGATCAATG GAATCGTCAT TGAAGCTGTA
GGTGCCGATA ATAGTGTATT GACAGGTAAT GTAACTATTA CCATCAATGA TGACGTCATT
AATATTGACC CAAGTGCAGG CTCTGGAGTC AATAGTGCTA CGGCGGCCGA TATTGTTGGT
GTCCTCAATA TCTTAGGTGC AGATGGTAAT GATCATACTC CGACTGATAA CTACAGCGTA
AGCTTGAGTG CAAACGTGAC TGGTTGGAAC GGCACAAGCA CAACCTTCGC CGATTCAGGT
ATTACTGCGG GCGGACTCAA GGTTTACTAC TATGTCGATC CTGATCATCC AAATATCCTG
ATCGCCTATA CCGATACCAA CGCCACCACT TCCGTCTATA CTGGTGGTGC TAATCAGGCA
TTAATCTTTA CCTTAACAAC CAATGCGACT ACAGGTCAGT ACACTCTCGA TATGAATCAG
GGTATTGATA AGCTTTCAAC AATCCAAATC GCTGGACTCG TTGGAGGACA AGGCGGTATT
GGTGAAGCGG TATATGTCAC TTATGATCCT GCAACCCATG GTTATGGTGT TTATAACGAT
ATCACTAAGA TCCCCGCGGA TGCGGACATT GCCTTCACCC TTACTGCGCG TGACGGTAAT
AATAACGTTG CTCAAGTGAA CGGGACTAAT AATGGTTTCG GTGTTGCAAA TCCATTTGTG
CAAGGTAATG AAGTATTAAT TGTCGATTAC TCAGAAGATG CGGCGACTGC GAGCTTTAGC
TTTACGGGGG CCGCACAAAT CCACTATAAA GCCTATGACG ATCAAGGTAA CTTGTTGCAT
GAAGGTAATA TTACTAGTGG TCAGCTCATC CAAAATGTGG GTTCTATCGC ATACATTGAG
CTATCAGCCC TAAGCGGAAC CAGCTTCCAG TTTACGGGCA CCACTGCACA AACCATTGTC
AGTTCAAGCC AAAGCCTCGA TTTGAGCTTT GTTGTGACTG CAACTGACAG TGATGGGGAT
AATTCATCAG GTAATCTGAA TGTACATTTA GATCCACCAA GTACAACACC GCTTGCGCCA
GTAGCACTGA CGCCGAATAC TTTTGCAACC TTGAATGAAG CTGATCTACA GGCGGGTGCT
CCTGATTCAA GTGTTCAAAC CCTGAGCTTT AAGTCTGGTA GCAACTCGAT TGGCAGCTTC
CAGTTTGGTG ATATTAGTAA TATTTCTGTG GTGGGCATCA ACGCTCAAAT TCATTGGGCA
CTCAATGATC AAGGCCAATT GATCGGTACG GTATACGGGC GTGAAGCGAT TCGCTTAACC
CTAGATTGGG ACCGTATTAA TGCGGGTGAG CAAGGTGATG TGACTGTTAC CGCAGAATTG
TTAACGAACC TGCCACACAG CGTAAATACC AACAACCTAA CTGTAAATGG TATTCAGGTG
ATTGCCGTCG ACGGCGCTGG CAATACTGCA CATTCCAACG TGACCGTAAC TGTCGCAGAC
GATGTAAACC TAGCTCAGAA CGACACGGCT CAGCTTGATG TGGTGGTTGA TTCCTTCAAC
TTCTCAGGCA TTGTTGCCAA CTGGACTGGT GCAACAGGCG GAACCTATGT CAACAAATAT
GATGGTCCTG ATAACGATAG CGGTGACGAC CAACTCCGCT GGGGCACAAC CAATGGTAGT
CAGTCTGGCT ATGGCTTTGC CGATAATGAT GCCGCGCTAA ATGGCTCACT ATCCTTAAAC
CAAGACATAG TGTTGGGCAC ATTTACGCAC TATAACTATT CAATTAGTTC GGGCACTTCT
ATCACTGCAG CCACAATGAA CGTGACTTTT AACGTGACAG ATGCCTATGG CGTAGTAACA
CCGGTCACAT TAACACTTAA CTTTAGCCAT AATGAGACTC CAAATAGTAA CGACCCGATG
GCGTCTCGCG ATATCGTTAC TGTCGGCCAA ACCAATGTGA CCTTCAACTA CGAAGGTCAA
ATGTACACAA TGCAAGTTAT TGGCTTCAAA GACAGCAATG GCAACATTGT GACTTCTATC
TACACCAATG AGAATGCTTC AACAAGCTAT GAGTTAGTTG TACGCATGGT CGCGGGCACA
GGCTACACCC TGCCGCATAC AGATGGTAAT GTGCTCACCA ATGATGTTGC CGGTGCAGAT
GGTGTTCTAG CCGTAATAGG CGTCGCAAGT GGAGATCACA CAAATACTGG CGTGTCTGGC
CAGGTGGGCA CTACTATTAC TGGTACTTAT GGCAACTTAA TCCTTTACGC AGATGGTACT
TACCACTACC AGTTAACGGC GAGTGCAAAC ACCATTCCAA GCGCTGGTGC AATCGAAACC
TTCACCTATA CCACTCAAGA TGGTGACGGT GACAAAGCAA GTGCCACCTT GAAGATTGAT
GTCAATCCGG TCAATGCAGA TGGTATTAAT ATTGCTGACG CTAACTTGAT TTCAACTCAA
GGCTCTAGCT TAAATGACAC TATCGTTGTA ATGGGCGGTG AAAAAGCATC TGATGACAAT
CAAAAAATAC TTAATGTCAC ATTTGGTGGA GGTCAAAGCG GCATAATTAC GAATAGCTCA
GGTCACGAAG TCGTTGCTTC TGGTGCTAAC AACAAGAGCT ATACCAATAG CAGCGCTCAA
GTGGTGAATG GCGGCGACGG AAATGATCAT ATCGAAACTG GTAAAGGTGA CGATGTTATC
TACGCAGGTA AAACTGGTTC CGCTAACTAT GGCACCGATG ATCAACTCGA GTTATCAGTT
AATACTCTAT TAACGCACCA CATTATGACC GGCAATATCA CTGGTAATGA TCGCATGGTG
GATAATGATG GCCTACTTTT AGCCAATGAC GTGTCCTCTC AAAGAGCTGA TGTGGTCAAT
GGCGGTAGTG GTAACGATCA AATCTATGGT CAATCTGGTT CAGATATTCT CTATGGACAC
TCTGGAAACG ATTACATTGA TGGTGGCAAT CACAACGATG CACTTCGTGG TGGCGAAGGT
AACGATACCT TGATTGGAGG TCTCGGTGAT GACGTTCTAC GTGGAGATAG CGGTAATGAT
ACCTTCCTGT GGCGTTATGC TGATGCCGAT CAAGGTACTG ACCACATCAT GGACTTTAAT
GTTCGCGACG ATAAACTCGA TCTGAGTGAT TTACTCCAAG GAGAAACGGC CAATACTCTA
GAAAGCTATC TGAACTTTAG CCTAGATAAT GGTTCGACAG TCATTGATAT CGATGCCAAT
AAGGACGGCG TGTTTGATCA ACATATAGTG CTAGATGGAG TAAATCTTTT TGACCAATAT
AGCGCAACTG ATAATGCTGG CGTCATCAAT GGCCTACTAG GTTCCAATGG TAACGGTCCT
CTGATCATCG ATGCAGCGCC CGTCACACCA GAAGCTCCAC AGGGCGTAAC ATCACTGACA
GACCCTCATC ACAATAACGG TACTATCATT CCTTAA
 
Protein sequence
MHLLINTIGC WELIMGSIIT TKKGLLKLVK GQIEVEVNGS NQPAKDGEQL PKGAVLHIGE 
NATYEITFDD GTKLSNEVPP NATAAAPNTA NEAALDEIQA LQDLIASGED PTQNLPETAA
GNAPASDGNS GYVTLARDGT EALATSGYST SGQTLTAATT AAPEQTVATD SPSVLANDSN
TVDEDTVASG NVLDNDVDAD NELTVVGFNL NGTDFAAGTV VTLEGGSFIL NADGSYTFTP
NENWNGQVPV ITYTTNTGST ATLTINITPV DDPSVLANDS NTIAEDTVAT GNVLDNDSDV
DSSLSVVSFT VSDSTFAAGT TVALEGGSLV LNADGSYTFT PNENWNGQVP VITYTTNTGS
TATLTINVTP VDDPSVLAND TNTIAEDTVA TGNVLDNDSD VDSSLSVVSF TVSGSTFAAG
ATVALEGGSL VLNADGSYSF TPNADWNGQV PVITYTTNTG STATLTINVT PVDDPSVLAN
DTNTIAEDTV ATGNVLDNDS DVDSSLSVVS FTVSGSTFAA GTTVALAGGS LVLNADGSYT
FTPSENWNGQ VPVITYTTNT SSTATLTINV TPVADGAPSV TITTDTDNDG FISNTELGGA
TEVSVTIGLD GTGANAGDTL TVNGVDYVLT QEDINNGFVN LTLPAPAEGE TITVVATITD
PAGNTSPEGS DSALLDTTAP TITVSAPDDT RDTTPTITGT TDAAPGSVIT IVVTDSNGVQ
QTLTTTVNPD GTYAVDVINP IPHGGYTATA TVTDPAGNTG NATDNGNVDI KIDEDGDGNT
VAITSITQDT GSSSSDFITN DNTLVFHGTV DLDDESTLVV TINGVDYTTA NGLVIDAQGN
WSVDLTGTAL PDGTYLVVAT VTDVAGNTTS ATQNVVIDTK IDQDSDGNTV AITSITQDTG
SSSSDFITND NTLVFHGTVD LDDESTLVVT INGVDYTTAN GLMIDAQGNW SVDLTGTALP
DGIYTVVATV TDIAGNTTSA TQDVIVDTQI GLGRDNAVTI TSITEDTGSS NTDFITNDNT
LVFQGTVELD GNSNLVVTIN GVDYTIGNGL VIDENGHWSI DLTGTALPDG TYPVVATVTD
IAGNSKTVTQ DVVIDTKIDQ DSDGNTVAIT SITQDTGSSS SDFITNDNTL VFHGTVDLDD
ESTLVVTING VDYTTANGLV IDAQGNWSVD LTGTALPDGT YPVVATVTDV AGNTTSATQN
VVIDTKIDQD SDGNTVAITS ITQDTGSSSS DFITNDNTLV FHGTVDLDDE STLVVTINGV
DYTTANGLVI DAQGNWSVDL TGTALPDGTY PVVATVTDVA GNTTSATQNV VIDTKIDQDS
DGNTVAITSI TQDTGSSSSD FITNDNTLVF HGTVDLDDES TLVVTINGVD YTTANGLVID
AQGNWSVDLT GTALPDGTYP VVATVTDVAG NTTSATQNVV IDTKIDQDSD GNTVAITSIT
QDTGSSSSDF ITNDNTLVFH GTVDLDDEST LVVTINGVDY TTANGLVIDA QGNWSVDLTG
TALPDGTYPV VATVTDVAGN TTSATQNVVI DTTADAATPV VTILDDVNND GIINKTELGS
DDVQLQVNVN HSELLQGGTI NLTIVNDGVS SNVSLKLVGG VLTFADGTPA TGFSYNNGVI
TWTAVVAEGK TISVTATQTD SDGNTSAEGS DSAKVDTTAD AAAPVVTILD DVNNDGIINK
TELGSDDVQL QVNVNHSELL QGGTINLTIV NDGVSSNVSL KLVGGVLTFA DGTPATGFSY
NNGVITWTAV VAEGKTISVT ATQTDSDGNT SAEGSDSAKV DTTADAAAPV VTILDDVNND
GIINKTELGS DDVQLQVNVN HSELLQGGTI NLTIVNDGVS SNVSLKLVGG VLTFADGTPA
TGFSYNNGVI TWTAVVAEGK TISVTATQTD SDGNTSAEGS DSAKVDTTAD AAAPVVTILD
DVNNDGIINK TELGSDDVQL QVNVNHSELL QGGTINLTIV NDGVSSNVSL KLVSGVLTFA
DGTPATGFSY NNGVITWTAV VAEGKTISVT ATQTDSDGNT SAEGSDSAKV DTTADAAAPV
VTILDDVNND GIINKTELGS DDVQLQVNVN HSELLQGGTI NLTIVNDGVS SNVSLKLVGG
VLTFADGTPA TGFSYNNGVI TWTAVVAEGK TISVTATQTD SDGNTSAEGS DSAKVDTTAD
AAAPVVTILD DVNNDGIINK TELGSDDVQL QVNVNHSELL QGGTINLTIV NDGVSSNVSL
KLVGGVLTFA DGTPATGFSY NNGVITWTAV VAEGKTISVT ATQTDSDGNT SAEGSDSAKV
DTTADAAAPV VTILDDVNND GIINKTELGS DDVQLQVNVN HSELLQGGTI NLTIVNDGVS
SNVSLKLVGG VLTFADGTPA TGFSYNNGVI TWTAVVAEGK TISVTATQTD SDGNTSAEGS
DSAKVDTTAD AAAPVVTILD DVNNDGIINK TELGSDDVQL QVNVNHSELL QGGTINLTIV
NDGVSSNVSL KLVGGVLTFA DGTPATGFSY NNGVITWTAV VAEGKTISVT ATQTDSDGNT
SAEGSDSAKV DTTADAAAPV VTILDDVNND GIINKTELGS DDVQLQVNVN HSELLQGGTI
NLTIVNDGVS SNVSLKLVGG VLTFADGTPA TGFSYNNGVI TWTAVVAEGK TISVTATQTD
SDGNTSAEGS DSAKVDTTAD AAAPVVTILD DVNNDGIINK TELGSDDVQL QVNVNHSELL
QGGTINLTIV NDGVSSNVSL KLVGGVLTFA DGTPATGFSY NNGVITWTAV VAEGKTISVT
ATQTDSDGNT SAEGSDSAKV DTTADAAAPV VTILDDVNND GIINKTELGS DDVQLQVNVN
HSELLQGGTI NLTIVNDGVS SNVSLKLVGG VLTFADGTPA TGFSYNNGVI TWTAVVAEGK
TISVTATQTD SDGNTSAEGS DSAKVDTTAD AAAPVVTILD DVNNDGIINK TELGSDDVQL
QVNVNHSELL QGGTINLTIV NDGVSSNVSL KLVGGVLTFA DGTPATGFSY NNGVITWTAV
VAEGKTISVT ATQTDSDGNT SAEGSDSAKV DTTADAAAPV VTILDDVNND GIINKTELGS
DDVQLQVNVN HSELLQGGTI NLTIVNDGVS SNVSLKLVGG VLTFADGTPA TGFSYNNGVI
TWTAVVAEGK TISVTATQTD SDGNTSAEGS DSAKVDTTAD AAAPVVTILD DVNNDGIINK
TELGSDDVQL QVNVNHSELL QGGTINLTIV NDGVSSNVSL KLVGGVLTFA DGTPATGFSY
NNGVITWTAV VAEGKTISVT ATQTDSDGNT SAEGSDSAKV DTTADAAAPV VTILDDVNND
GIINKTELGS DDVQLQVNVN HSELLQGGTI NLTIVNDGVS SNVSLKLVSG VLTFADGTPA
TGFSYNNGVI TWTAVVAEGK TISVTATQTD SDGNTSAEGS DSAKVDTTAP NAPTVLIVDD
GNPGDGLLTT AEMGNDGVQL TVSINDTDFE AGGYVTLTIN GGAAIELSFA DFTDNGSGTL
TFGNFTYANG VISWSESAPT AGQSITVTAT QTDAVGNIST QGSDTATVYQ PNSCNVIVNE
SSLRDNIPDT FSNTISFTAG NQNITQFRFN DSSISAATNL AAGVSIIWAL AANGELVGTI
GGIEVIKVTL SGTPVTSGLA AGTTGAVTVN VELLDNIKQV NGLSGENLSS LINGIVIEAV
GADNSVLTGN VTITINDDVI NIDPSAGSGV NSATAADIVG VLNILGADGN DHTPTDNYSV
SLSANVTGWN GTSTTFADSG ITAGGLKVYY YVDPDHPNIL IAYTDTNATT SVYTGGANQA
LIFTLTTNAT TGQYTLDMNQ GIDKLSTIQI AGLVGGQGGI GEAVYVTYDP ATHGYGVYND
ITKIPADADI AFTLTARDGN NNVAQVNGTN NGFGVANPFV QGNEVLIVDY SEDAATASFS
FTGAAQIHYK AYDDQGNLLH EGNITSGQLI QNVGSIAYIE LSALSGTSFQ FTGTTAQTIV
SSSQSLDLSF VVTATDSDGD NSSGNLNVHL DPPSTTPLAP VALTPNTFAT LNEADLQAGA
PDSSVQTLSF KSGSNSIGSF QFGDISNISV VGINAQIHWA LNDQGQLIGT VYGREAIRLT
LDWDRINAGE QGDVTVTAEL LTNLPHSVNT NNLTVNGIQV IAVDGAGNTA HSNVTVTVAD
DVNLAQNDTA QLDVVVDSFN FSGIVANWTG ATGGTYVNKY DGPDNDSGDD QLRWGTTNGS
QSGYGFADND AALNGSLSLN QDIVLGTFTH YNYSISSGTS ITAATMNVTF NVTDAYGVVT
PVTLTLNFSH NETPNSNDPM ASRDIVTVGQ TNVTFNYEGQ MYTMQVIGFK DSNGNIVTSI
YTNENASTSY ELVVRMVAGT GYTLPHTDGN VLTNDVAGAD GVLAVIGVAS GDHTNTGVSG
QVGTTITGTY GNLILYADGT YHYQLTASAN TIPSAGAIET FTYTTQDGDG DKASATLKID
VNPVNADGIN IADANLISTQ GSSLNDTIVV MGGEKASDDN QKILNVTFGG GQSGIITNSS
GHEVVASGAN NKSYTNSSAQ VVNGGDGNDH IETGKGDDVI YAGKTGSANY GTDDQLELSV
NTLLTHHIMT GNITGNDRMV DNDGLLLAND VSSQRADVVN GGSGNDQIYG QSGSDILYGH
SGNDYIDGGN HNDALRGGEG NDTLIGGLGD DVLRGDSGND TFLWRYADAD QGTDHIMDFN
VRDDKLDLSD LLQGETANTL ESYLNFSLDN GSTVIDIDAN KDGVFDQHIV LDGVNLFDQY
SATDNAGVIN GLLGSNGNGP LIIDAAPVTP EAPQGVTSLT DPHHNNGTII P