Gene Sama_3258 details

Gene Information

Locus tag: Sama_3258
Symbol:
ID: 4605505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 3853219
End bp: 3865863
Gene Length: 12645 bp
Protein Length: 4214 aa
Translation table: 11
GC content: 57%
IMG OID: 639782674
Product: putative outer membrane adhesin like protein
Protein accession: YP_929130
Protein GI: 119776390
COG category:
COG ID:
TIGRFAM ID: [TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass)
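
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `feature_lengths` is hypothetical and is not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the Start bp / End bp fields of this record.
gene_len, prot_len = feature_lengths(3853219, 3865863)
print(gene_len, prot_len)  # 12645 4214
```

The result matches the listed Gene Length (12645 bp) and Protein Length (4214 aa).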


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.523735
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTCGT ATATCACACC GAAAAATGGA TTGCTCAAGT CGGTTAATGG GCAAATTAGC 
ATGGTCGCAG AAGGCGCAGA AAAATCAGTT TCTGCTGGTG ATGCCATCCC CGCTGGTGCA
GTGCTTTGGA TTAAGGAAGG CGCTCAGTTT GAATTGGTGC TGGAGGACGG CACCGTTATC
AGCGAATCAA ACACTCCCAC AACGGCCAAC CAGCCCGACG TTGGCGCGGA GCAACTTGAT
CCCAATGCGC TGAATGAAAT CGAAGCGCTA CAGGCACAAA TTGCCGCAGG TGATGACCCA
ACCGCAGACC TGCCCGAAAC CGCAGCGGGC GGCCAGACAG GTAACGAAGG CGGTTCTGGA
TTTGTCAGCC TGGATAGAAC AGGTGGCGAA ACTCTTGCAA GCACGGGTTA TGATTCAACA
TTCGACGCCC AATTCCCAGA ATCAACAATA CTCGAAGCCC AACCCATCGA ACCCCTGCTT
GTCACACCAT TGGTTATTAC TGTTGATGCT CCCGACAACT CAACAGACAC CACTCCCACA
ATCACTGGTA CCACGGATGC AGAACCCGGC AGCACAGTTA CTATCTTAGT AACCGACAGT
AACGGCGATA CCCAGACCCT GACTACCACA GTCAATGAAG ACGGCACTTA CAGTGTTGAT
GTGGTCGAAC CCTTACCTGA AGGTGGTTAC ACCGCCGATG CCAGTGTCAT AGACACTACA
GGCAACACCG GCACCGCAAC TGATGTGGGT GATGTTGTCT CTCCTGAAGT GAGCATCACG
GCCAATCAAA TCGACGTAGA AGAAGGCAGT ACTGCTTCTT TCGTTGTCAG CCTGGATGAA
GCCTCAAACG AAGACATTAC AGTCACCTTT ACTTATTCAG GTGTTGCTGA AGACGGTACC
GACTTTATCG GTGTGGCCAG CGTAGTGATC CCTGCCGGCC AAACCAGCGT TACCATTAAT
ATCAATACCA TTGACGATAA GCTGTACGAA GTTTCTGAAG ACTTCACCAT CACCATTTCC
AGCGTCACCG GTGGCAATGC CGTGATAGGT GCCGACAACA GTGCCAGCAC CAACATCGTC
GATGAAGCTG TACCTGGTCC TGAGGATACT GTCACTGTAA CTCTGAATGG TCCAACTGCA
GTTGCAGAAG GTGATAATAC CGGCACCTAC ACTGTGACCC TGAGTGACCC AGCGCCAGCT
GGCAGCATAG TTACACTCAC CTATACCTAC ATCACCGCAG ATGGCAACGA CATTGTTGAA
ACAGTACAAG CCATCATTGG TGCAGATGGC GTTACCGCGA CGTTCACCAT TGCAACCGTT
AACGATGACA TATTCGAACC AACTGAATCC TTCAGTGTCA GTGTTAGTGG CGTCGTTACG
CCAGATGGCA CCCCTGTGTT TGAAAACCTC GATCTGACCG ATGCAACGGT TACTACTGAC
ATCCTGGATA ATGATCTCTC GGCCAGCATC ACTTTGGATG CCAATATCAC CGCTGATGAC
ATCATTAACA GTGCCGAAGC AGGTCAAACT ATCCCAGTCA CAGGTGTAGT AGGTGCCGAC
GTCAAAGTCG GTGACATCGT CACCCTCACC GTCAATGGCA AGGAATTCAC CGGCGCGGTA
TATGATGACA ACGGCACCCT GCGCTTCAGC ATTGATGTGC CTGGCAGCGA TCTGGCAGCA
GATGCCGATC GTGTCATTGA TGCCTCCGTC ACCGCCACCG ATGACGCCGG CAACAGCGCC
ACCGCCACCG ACACCGAAGG CTATGGCGTT GACACAGACA TCAGTGCCAG CATCACCCTC
GATGCCGACA TCACCGCCGA TGACATCATC AATGCCCAGG AAGCCGGACA GGACATCCCT
GTGACCGGTA TCGTCGGCGG CGACGTCAAG GTCGGTGACA TCGTCACCCT CACCGTCAAT
GGCAAGGAAT TCACCGGCGC GGTATATGAT GACAACGGCA CCCTGCGCTT CAGCATCAAC
GTCCCCGGCG CCGACCTGGT GGCCGATGAA GACCATGTGA TTGATGCCTC CGTCACCGCC
ACCGATGACG CCGGCAACAG CGCCACCGCC ACCGACACCG AAGGCTATGG CGTTGACACA
GACATCAGTG CCAGCATCAC CCTCGATGCC GACATCACCG CCGATGACAT CATCAATGCC
CAGGAAGCCG GACAGGACAT CCCTGTGACC GGTATCGTCG GCGGCGACGT CAAGGTCGGT
GACATCGTCA CCCTCACCGT CAATGGCAAG GAATTCACCG GCGCGGTATA TGATGACAAC
GGCACCCTGC GCTTCAGCAT CAACGTCCCC GGCGCCGACC TGGTGGCCGA TGAAGACCAT
GTGATTGATG CCTCCGTCAC CGCCACCGAT GACGCCGGCA ACAGCGCCAC CGCCACCGAC
ACCGAAGGCT ATGGCGTTGA CACAGACATC AGTGCCAGCA TCACCCTCGA TGCCGACATC
ACCGCCGATG ACATCATCAA TGCCCAGGAA GCCGGACAGG ACATCCCTGT GACCGGTATC
GTCGGCGGCG ACGTCAAGGT CGGTGACATC GTCACCCTCA CCGTCAATGG CAAGGAATTC
ACCGGCGCGG TATATGATGA CAACGGCACC CTGCGCTTCA GCATCAACGT CCCCGGCGCC
GACCTGGTGG CCGATGAAGA CCATGTGATT GATGCCTCCG TCACCGCCAC CGATGACGCC
GGCAACAGCG CCACCGCCAC CGACACCGAA GGCTATGGCG TCGACACGGA CATCAGTGCC
ACCATTGACC TCAACCCCAT TCTGGTCGGC GATGACAACG TCATCAACCA GGCCGAAAGT
GAAGGCAGCG TGACCCTCAG CGGTACCGTC GGCGGCGACG TCAAGCTCGG CGACACCGTG
ACCCTGACCC TCGATGGCAA TGTGATTGCC ACCGTGCAGG TGATTGACCT CGGCGGCGGC
GTACTGGGCT TCAGCACTTC CGTCGATGCC GCCCTGCTGG TCGGCGCTGA CGTGAACAGC
ATCACCGCCT CCGTCACCAC CACCGATGAC GCCGGCAACA CCGCGTCTGC CAGCGATACC
GAAGGCTATG GCGTCGACAC TGAAATCAGT GCCACCATTG ACCTCAACCC CATCCTGGTC
GGCGATGACA ACGTCATCAA CCAGGCCGAA AGTGAAGGCA GCGTGACCCT CAGCGGTACC
GTCGGCGGCG ACGTCAAGCT CGGCGACACC GTGACCCTGA CCCTCGATGG CAATGTGATT
GCCACCGTGC AGGTGATTGA CCTCGGCGGC GGCGTACTGG GCTTCAGCAC TTCCGTCGAT
GCCGCCCTGC TGGTCGGCGC TGACGTGAAC AGCATCACCG CCTCCGTCAC CACCACCGAT
GACGCCGGCA ACAGCGCGTC TGCCAGCGAT ACCGAAGGCT ATGGCGTCGA CACTGAAATC
AGTGCCACCA TTGACCTCAA CCCCATTCTG GTCGGCGATG ACAACGTCAT CAACCAGGCC
GAAAGTGAAG GCAGCGTGAC CCTCAGCGGT ACCGTCGGCG GCGACGTCAA GCTCGGCGAC
ACCGTGACCC TGACCCTCGA TGGCAATGTG ATTGCCACCG TGCAGGTGAT TGACCTCGGC
GGCGGCGTAC TGGGCTTCAG CACTTCCGTC GATGCCGCCC TGCTGGTCGG CGCTGACGTG
AACAGCATCA CCGCCTCCGT CACCACCACC GATGACGCCG GCAACACCGC GTCTGCCAGC
GATACCGAAG GCTATGGCGT CGACACTGAA ATCAGTGCCA CCATCGACCT CGACCCCATC
CTGGTCGGCG ATGACAACGT CATCAACCAG GTTGAAAGTG AAGGCAGCGT GACCCTCAGC
GGTACCGTCG GCGGCGACGT CAAGCTCGGC GACACCGTGA CCCTGACCCT CGATGGCAAT
GTGATTGCCA CCGTGCAGGT GATTGACCTC GGCGGCGGCG TACTGGGCTT CAGCACTTCC
GTCGATGCCG CCCTGCTGGT CGGCGCTGAC GTGAACAGCA TCACCGCCTC CGTCACCACC
ACCGATGACG CCGGCAACAC CGCGTCTGCC AGCGATACCG AAGGCTATGG CGTCGACACT
GAAATCAGTG CCACCATTGA CCTCAACCCC ATCCTGGTCG GCGATGACAA CGTCATCAAC
CAGGCCGAAA GTGAAGGCAG CGTGACCCTC AGCGGTACCG TCGGCGGCGA CGTCAAGCTC
GGCGACACCG TGACCCTGAC CCTCGATGGC AATGTGATTG CCACCGTGCA GGTGATTGAC
CTCGGCGGCG GCGTACTGGG CTTCAGCACT TCCGTCGATG CCGCCCTGCT GGTCGGCGCT
GACGTGAACA GCATCACCGC CTCCGTCACC ACCACCGATG ACGCCGGCAA CACCGCGTCT
GCCAGCGATA CCGAAGGCTA TGGCGTCGAC ACTGAAATCA GTGCCACCAT CGACCTCGAC
CCCATCCTGG TCGGCGATGA CAACGTCATC AACCAGGTTG AAAGTGAAGG CAGCGTGACC
CTCAGCGGTA CCGTCGGCGG CGACGTCAAG CTCGGCGACA CCGTGACCCT GACCCTCGAT
GGCAATGTGA TTGCCACCGT GCAGGTGATT GACCTCGGCG GCGGCGTACT GGGCTTCAGC
ACTTCCGTCG ATGCCGCCCT GCTGGTCGGC GCTGACGTGA ACAGCATCAC CGCCTCCGTC
ACCACCACCG ATGACGCCGG CAACACCGCG TCTGCCAGCG ATACCGAAGG CTATGGCGTC
GACACTGAAA TCAGTGCCAC CATTGACCTC AACCCCATCC TGGTCGGCGA TGACAACGTC
ATCAACCAGG CCGAAAGTGA AGGCAGCGTG ACCCTCAGCG GTACCGTCGG CGGCGACGTC
AAGCTCGGCG ACACCGTGAC CCTGACCCTC GATGGCAATG TGATTGCCAC CGTGCAGGTG
ATTGACCTCG GCGGCGGCGT ACTGGGCTTC AGCACTTCCG TCGATGCCGC CCTGCTGGTC
GGCGCTGACG TGAACAGCAT CACCGCCTCC GTCACCACCA CCGATGACGC CGGCAACACC
GCGTCTGCCA GCGATACCGA AGGCTATGGC GTCGACACTG AAATCAGTGC CACCATTGAC
CTCAACCCCA TCCTGGTCGG CGATGACAAC GTCATCAACC AGGCCGAAAG TGAAGGCAGC
GTGACCCTCA GCGGTACCGT CGGCGGCGAC GTCAAGCTCG GCGACACCGT GACCCTGACC
CTCGATGGCA ATGTGATTGC CACCGTGCAG GTGATTGACC TCGGCGGCGG CGTACTGGGC
TTCAGCACTT CCGTCGATGC CGCCCTGCTG GTCGGCGCTG ACGTGAACAG CATCACCGCC
TCCGTCACCA CCACCGATGA CGCCGGCAAC ACCGCGTCTG CCAGCGATAC CGAAGGCTAT
GGCGTCGACA CTGAAATCAG TGCCACCATT GACCTCAACC CCATCCTGGT CGGCGATGAC
AACGTCATCA ACCAGGCCGA AAGTGAAGGC AGCGTGACCC TCAGCGGTAC CGTCGGCGGC
GACGTCAAGC TCGGCGACAC CGTGACCCTG ACCCTCGATG GCAATGTGAT TGCCACCGTG
CAGGTGATTG ACCTCGGCGG CGTACTGGGC TTCAGCACTT CCGTCGATGC CGCCCTGCTG
GTCGGCGCTG ACGTGAACAG CATCACCGCC TCCGTCACCA CCACCGATGA CGCCGGCAAC
ACCGCGTCTG CCAGCGATAC CGAAGGCTAT GGCGTCGACA CCACCGCCCC GGCCGTCACC
ATCACCATCA CCGAAGACAC CAACAACGAC GGTCTGCTCA GCATTGCCGA GCTCGACGGT
CAGGTGAACT ACCTGGTACA GCTCGGTGCC GGTACCGCCG TGGACGATAC CCTGGTCATC
ACCGATCAGG ATGGCAACGT GCTGTTCAAC GGTCTGGTGA CCCAGGCCAT GCTCGACAAC
GGACTGGCCC TGGCCGTGGA TGCACCGGCT GACGGTGACA CCCTCACCCT GACCGCCACC
GTGACCGACC CGGCCGGCAA CAGTGATTCC GACAGCGACA GTGTCACCAT TGACACCACC
GCCCCGGCCG TCACCATCAC CATCACCGAA GACACCAACA ACGACGGTCT GCTCAGCATT
GCCGAGCTCG ACGGTCAGGT GAACTACCTG GTACAGCTCG GTGCCGGTAC CGCCGTGGAC
GATACCCTGG TCATCACCGA TCAGGATGGC AACGTGCTGT TCAACGGTCT GGTGACCCAG
GCCATGCTCG ACAACGGACT GGCCCTGGCC GTGGATGCAC CGGCTGACGG TGACACCCTC
ACCCTGACCG CCACCGTGAC CGACCCGGCC GGCAACAGTG ATTCCGACAG CGACAGTGTC
ACCATTGACA CCACCGCCCC GGCCGTCACC ATCACCATCA CCGAAGACAC CAACAACGAC
GGTCTGCTCA GCATTGCCGA GCTCGACGGT CAGGTGAACT ACCTGGTACA GCTCGGTGCC
GGTACCGCCG TGGACGATAC CCTGGTTATC ACCGATCAGG ATGGCAACGT GCTGTTCAAC
GGTCTGGTGA CCCAGGCCAT GCTCGACAAC GGACTGGCCC TGGCCGTGGA TGCACCGGCT
GACGGTGACA CCCTCACCCT GACCGCCACC GTGACCGACC CGGCCGGCAA CAGTGATTCC
GACAGCGACA GTGTCACCAT TGACACCACC GCCCCGGCCG TCACCATCAC CATCACCGAA
GACACCAACA ACGACGGTCT GCTCAGCATT GCCGAGCTCG ACGGTCAGGT GAACTACCTG
GTACAGCTCG GTGCCGGTAC CGCCGTGGAC GATACCCTGG TCATCACCGA TCAGGATGGC
AACGTGCTGT TCAACGGTCT GGTGACCCAG GCCATGCTCG ACAACGGACT GGCCCTGGCC
GTGGATGCAC CGGCTGACGG TGACACCCTC ACCCTGACCG CCACCGTGAC CGACCCGGCC
GGCAACAGTG ATTCCGACAG CGACAGTGTC ACCATTGACA CCACCGCCCC GGCCGTCACC
ATCACCATCA CCGAAGACAC CAACAACGAC GGTCTGCTCA GCATTGCCGA GCTCGACGGT
CAGGTGAACT ACCTGGTACA GCTCGGTGCC GGTACCGCCG TGGACGATAC CCTGGTCATC
ACCGATCAGG ATGGCAACGT GCTGTTCAAC GGTCTGGTGA CCCAGGCCAT GCTCGACAAC
GGACTGGCCC TGGCCGTGGA TGCACCGGCT GACGGTGACA CCCTCACCCT GACCGCCACC
GTGACCGACC CGGCCGGCAA CAGTGATTCC GACAGCGACA GTGTCACCAT TGACACCACC
GCCCCGGCCG TCACCATCAC CATCACCGAA GACACCAACA ACGACGGTCT GCTCAGCATT
GCCGAGCTCG ACGGTCAGGT GAACTACCTG GTACAGCTCG GTGCCGGTAC CGCCGTGGAC
GATACCCTGG TTATCACCGA TCAGGATGGC AACGTGCTGT TCAACGGTCT GGTGACCCAG
GCCATGCTCG ACAACGGACT GGCCCTGGCC GTGGATGCAC CGGCTGACGG TGACACCCTC
ACCCTGACCG CCACCGTGAC CGACCCGGCC GGCAACAGTG ATTCCGACAG CGACAGTGTC
ACCATTGACA CCACCGCCCC GGCCGTCACC ATCACCATCA CCGAAGACAC CAACAACGAC
GGTCTGCTCA GCATTGCCGA GCTCGACGGT CAGGTGAACT ACCTGGTACA GCTCGGTGCC
GGTACCGCCG TGGACGATAC CCTGGTTATC ACCGATCAGG ATGGCAACGT GCTGTTCAAC
GGTCTGGTGA CCCAGGCCAT GCTCGACAAC GGACTGGCCC TGGCCGTGGA TGCACCGGCT
GACGGTGGCA CCCTCACCCT GACCGCCACC GTGACCGACC CGGCCGGCAA CAGTGATTCC
GACAGCGACA GTGTCACCAT TGACACCACC GCCCCGGCCG TCACCATCAC CATCACCGAA
GACACCAACA ACGACGGTCT GCTCAGCATT GCCGAGCTCG ACGGTCAGGT GAACTACCTG
GTACAGCTCG GTGCCGGTAC CGCCGTGGAC GATACCCTGG TCATCACCGA TCAGGATGGC
AACGTGCTGT TCAACGGTCT GGTGACCCAG GCCATGCTCG ACAACGGACT GGCCCTGGCC
GTGGATGCAC CGGCTGACGG TGACACCCTC ACCCTGACCG CCACCGTGAC CGACCCGGCC
GGCAACAGTG ATTCCGACAG CGACAGTGTC ACCATTGACA CCACCGCCCC GGCCGTCACC
ATCACCATCA CCGAAGACAC CAACAACGAC GGTCTGCTCA GCATTGCCGA GCTCGACGGT
CAGGTGAACT ACCTGGTACA GCTCGGTGCC GGTACCGCCG TGGACGATAC CTTGGTCATC
ACCGATCAGG ATGGCAACGT GCTGTTCAAC GGTCTGGTGA CCCAGGCCAT GCTCGACAAC
GGACTGGCCC TGGCCGTGGA TGCACCGGCT GACGGTGACA CCCTCACCCT GACCGCCACC
GTGACCGACC CGGCCGGCAA CAGTGATTCC GACAGCGACA GTGTCACCGT TGACACCAGC
ATAGCGGCCC CCACCGTGTG GATCGTTGAC GACGGCACGC CAGGCGATGG TTTGCTCACT
CAGGGCGAAA TCAGCAGTAA CGGCCCTGGG GTTCAGCTGC AGGCGAATGT TAGCCACGCC
GATCTGTTGG AAGGTGGTTT CGTAACCCTG ACAGTGACGA TTGGCAATGC CCAGCCACAA
GAATATGAGC TGGAGTTGGT GAATGGCGTA CTGCAATTTA CCAATGGCGA TCCTGCACCA
GATTTCGACT ACAACAACGG CGTTATCACC TGGACTGAAA ATGCACCCGA TGCAGGGCAA
AGCATTACTG TAACTGTCAC TCAAACCGAT CTTGCTGGTA ATGAATCTGC ACAAGACTCT
GATACAGCAC AGGTATTTGC TCCCGGCAAT AATGAAATGA CCGTTAATGA AAGCGATCTG
CGTGACAATG TGCCCAATGT GGTTTCGCAG CAGATTTCCT TCACCGCTGG CAGTGAGACT
CTGACACAAT TCCGCTTTGG CGATGTAAAC AGCATTCAAG CTGCAACCAA CCTTGCAACA
GGAGTAAGTA TTGCCTGGGC TCTGGCATTG GATGGTAGCT TGGTTGGTAG TATCGGCGGT
GTTCCTGTCA TCAAGTTGAC ACTGACCGAT ACAGATCCAA TCTCTGCCAA CACGACTGGC
AGTATCAGCG TTAACGTGGA ATTGCTGGAT AACATTAAGC AAGTCAATGG CGCAAATGAC
ATCAACCTCA GTTCGCTGAT TGAAGGTATC GTTATTGAAG CTGTTGGTGC CAACAACAGT
GTTATCTCTG CGACTCTGGA TCTCACTATT GACGACGATC TCGTTGAAGC CTCAGCTACT
GACAGCAGTG GTTTGAACGC AGCGGATACC CAAATCAGCG GTACTGTGAC CGTAGCAGGT
GCAGACGGAA ACGATAACGA ATCAGCCGAT CAATACAGCG CGGATCTGTC TGTGAACATT
ACAGGTTGGA GTGAGTCATC AACCTTCGCT GATTCCGGTC TGATGACCTC GGGTAAAGCA
ATTTACTACT ATGTCGATCC TAACGACACG TCCGTGATGA TTGCGTACAC CAGCTCCACT
GCAGCCGAAT GGGGTGCTGT CGGTGCGATT CAGACCAAAA TTTTCACTCT GACACTTGAT
CCAAACAGCG GTGAATATGT GCTGGATCTT GAAACTCCAA TTACCAAGAT CACCACAACC
AACGCCGATT TGACAGGTAA CATTCCAGGC GGTAACGACG CAGATTTGTT CGTCATGCTG
AACGGAGCAG TGAAAGGTGA ACCTGAGTCT GGTGACGTGG TGCTCTGCAC CATTACAGCA
ACTGACGCGG GTGGTGTTAG CACTGTAAAC ACCAGCCCCA ATGGTATTGG TGTTGGTACA
GGTAAGGACA TCGGCTCTGG TGAAACCCTC ACGCTTGATT TTGGCTTCCC AGTCACGAAT
TTTACTGGAA TTAGCCTCTC CTACAATAAC GGTAATCCTT ACACTGGCGT GTTTACCATC
AACATTGTCG GTAAAGATGC GTTTGGAAAT GATCTGGTCA AGACCTTTAT TGCCACGCCT
GCAACTTTGG CCAATTTGAT AGCCTCAGAA GGATTTGCAG AGTTCACAAA GATAGAGCTT
TCTACCGCAG CAGGTGGACA AGACTTCAAC TTGAAGAACT TCACTGCGAA CAGTCTCAGT
GTTGACCCGC TGGGAACCGT TTTGAACTTC AATGTTGCCA TCACTGACAG TGATGGTGAT
ACGGACTCCA GCAATCCGTT TACCGTCACG CTGAATGTGC CTAACACACT CAACGCTGTC
ACACCCGCTG CGTTTACCAG CCTAGCGGAA GCCAAACTGG TATCCGATAC CATGGATGCC
GATACAGATA CTCTGGTATT CAAGGCCGGT AGCAGTGACG TTAACAACGT CAGCTTCAGC
GCAGACGTGA GCGATATTCA GGTTGAAGGC ATTCGCCAGC CTATGAGTTG GAGAATCGAA
GGTGGGGTGC TGATAGGCTC TATGCCAGGC CGCGGCGACC TGCTGAAACT GACCTTGGAT
TGGAACGCAA TCGAAGCTGG TGAGCAGGGT TCTGTTGTGG TTGAGGCAGA GCTGCTTGGT
AAACTGCCTC ACAACATAGA TTATGACTCT CTGACAGTTA CAGGCATTAA GGTTGTTGCA
ACCGATGACA GCGGCGACAC CGCCAAGGCG GACGTCACAG TAACTGTTGC TGATAGTCAG
CATATCGCTG TCGATGACGA CAACCAGGTT GATGTACTGA TAGACGCCTT TGAAGTTCGC
GATATCACAG CCAGGTGGAC TGACTGGACC GCTGAGTCAG AAGAAAAATC CAACGTAACA
ACTTCTGACG GCCCTGATGA CGATAATGGC CACGATATAA TTCGTTGGGG CGACGTTGGC
AACGGACAGC CACGTTCTGG CTACAACTTT GATGAAAACA CCAACATTCA GGTGGGTGAT
GTGGGCTTGA ACCAGAATAT TGTTCTGGGC ACCTTTACGC ACGTAAACCA ACCTATCAAT
GGCTACTCAT CAATCACTGA AGCCACCCTC GAAGTAACCC TGATGATCAA TGGTGTTGCA
GCCAAGGTCA CTCTCGAGTT TAATCACAAT GAAACCGGTG GTTATAACAA CCCACCGGAT
ATTGTGACCG TCGCCAACAC ATCCTCTGAG TTTGTGTACG ATGGCGCTCG CTACACACTG
AAAGTAATGG GCTTCCTCGA TAGCAACGGC GATGTTGTGA CCAGTATCAA GACAGCTGAA
GGTGCCTCCA CCAGCTTCCC ATTGGTTGTT CAGTTGATTC CGGGAGATGG ATTTGAACTG
CCGCTCATCG GTGGCAACGT CCTCCACAAC GATATCCAGG GCGCCGATGG CGATATGGAG
ATCTACGGCG TTTCGCACAG TGGCAACAAC GCCACTGAAT CCAATGGAAC CTTTGTGATT
CAAGGCACTT ACGGAACCCT GACCCTGTAT GGCAATGGCA GTTACAGCTA TCAGGTCACA
ACCGTAGGCA GTCTGATACC GGACAATGCG GTCGACACAT TCAGCTACAC CATTGAAGAC
AGCGATGGCG ATCTTAGCAC TGCGGAATTG AATATTAACC TCAATGCTGT AGAAGAGCTG
CCAATCGTTT ACGAAGGCAC GCAGGGGGTG GATTCGTTCC TGCTGACAAA CTCGACTTCA
GGAAAAGGCC TGTTTGCCGT GTCTTCCAGT GGTTATGATG CCGTCGCTTC AACAGAATCT
GCCAATGTGA TCAATCTGGA TACGGCCTTG CACATCAAGG CAGGTGACAG TAACGACTAT
GTCGATCTGG GTATTTCACG GACTGATAAC ACTGTGGAGA CCGGCAGCTC ATTACCCAAC
GTGCCCAGCC AAATGTCCCA AGATGAAGTG CTTTCCAGTA AGTTTATGTC GGCACGCGAC
ATCTTGGATA ACGATGGCAA GCTGAAACAA AATGTGTTGG AAGAAGTGCA GCCCAAGACC
GATACCGTCA ACCTTGGCGT TGGTGACGAC ACCGTTTATG GCGGTGAAGG TTCCCAAATG
GTGTATGGCG GTGCCGGCAA CGACTTACTC ATCGGCGGCG AGGGTATTGA CGGACTGCGT
GGCGGCGAAG GCAATGACAC CATCATCGGT GGTCTTGGAG ACGACGTTCT GCGAGGTGAT
GGCGGTGCTG ATACCTTCGT TTGGAGAGCC GGTGAAACCG GAACTGACCA TATCATCGAC
TTCAACATCA ATGAAGATAA GCTGGACTTG AGCGACCTGC TGCAAGGAGA AGAAAATGGC
AATCTGGAAG ATTACCTGAG CTTCAGCTTT GAGCGGGGTT CAACCACCAT TGAAATTGAT
GCCAACCGCG ATGGTACTGT TGATCAGCGC ATCGTACTGG ATGGTGTTGA TTTGGCAGAT
GAGTATGGCC TGGCCTCAAC CGATGAATCC GGGATCATCA ACGGTCTGCT GGGTAACGGT
ACTGGCCCAT TGATAGTCGA TACACAAGCC GATACAGGTG CAGCCCAGGC TGTGGGCCGT
ATCTTGTCAC TCGATGAGGA TCATAAGACC GAGTTGATGC CTTAA
 
Protein sequence
MGSYITPKNG LLKSVNGQIS MVAEGAEKSV SAGDAIPAGA VLWIKEGAQF ELVLEDGTVI 
SESNTPTTAN QPDVGAEQLD PNALNEIEAL QAQIAAGDDP TADLPETAAG GQTGNEGGSG
FVSLDRTGGE TLASTGYDST FDAQFPESTI LEAQPIEPLL VTPLVITVDA PDNSTDTTPT
ITGTTDAEPG STVTILVTDS NGDTQTLTTT VNEDGTYSVD VVEPLPEGGY TADASVIDTT
GNTGTATDVG DVVSPEVSIT ANQIDVEEGS TASFVVSLDE ASNEDITVTF TYSGVAEDGT
DFIGVASVVI PAGQTSVTIN INTIDDKLYE VSEDFTITIS SVTGGNAVIG ADNSASTNIV
DEAVPGPEDT VTVTLNGPTA VAEGDNTGTY TVTLSDPAPA GSIVTLTYTY ITADGNDIVE
TVQAIIGADG VTATFTIATV NDDIFEPTES FSVSVSGVVT PDGTPVFENL DLTDATVTTD
ILDNDLSASI TLDANITADD IINSAEAGQT IPVTGVVGAD VKVGDIVTLT VNGKEFTGAV
YDDNGTLRFS IDVPGSDLAA DADRVIDASV TATDDAGNSA TATDTEGYGV DTDISASITL
DADITADDII NAQEAGQDIP VTGIVGGDVK VGDIVTLTVN GKEFTGAVYD DNGTLRFSIN
VPGADLVADE DHVIDASVTA TDDAGNSATA TDTEGYGVDT DISASITLDA DITADDIINA
QEAGQDIPVT GIVGGDVKVG DIVTLTVNGK EFTGAVYDDN GTLRFSINVP GADLVADEDH
VIDASVTATD DAGNSATATD TEGYGVDTDI SASITLDADI TADDIINAQE AGQDIPVTGI
VGGDVKVGDI VTLTVNGKEF TGAVYDDNGT LRFSINVPGA DLVADEDHVI DASVTATDDA
GNSATATDTE GYGVDTDISA TIDLNPILVG DDNVINQAES EGSVTLSGTV GGDVKLGDTV
TLTLDGNVIA TVQVIDLGGG VLGFSTSVDA ALLVGADVNS ITASVTTTDD AGNTASASDT
EGYGVDTEIS ATIDLNPILV GDDNVINQAE SEGSVTLSGT VGGDVKLGDT VTLTLDGNVI
ATVQVIDLGG GVLGFSTSVD AALLVGADVN SITASVTTTD DAGNSASASD TEGYGVDTEI
SATIDLNPIL VGDDNVINQA ESEGSVTLSG TVGGDVKLGD TVTLTLDGNV IATVQVIDLG
GGVLGFSTSV DAALLVGADV NSITASVTTT DDAGNTASAS DTEGYGVDTE ISATIDLDPI
LVGDDNVINQ VESEGSVTLS GTVGGDVKLG DTVTLTLDGN VIATVQVIDL GGGVLGFSTS
VDAALLVGAD VNSITASVTT TDDAGNTASA SDTEGYGVDT EISATIDLNP ILVGDDNVIN
QAESEGSVTL SGTVGGDVKL GDTVTLTLDG NVIATVQVID LGGGVLGFST SVDAALLVGA
DVNSITASVT TTDDAGNTAS ASDTEGYGVD TEISATIDLD PILVGDDNVI NQVESEGSVT
LSGTVGGDVK LGDTVTLTLD GNVIATVQVI DLGGGVLGFS TSVDAALLVG ADVNSITASV
TTTDDAGNTA SASDTEGYGV DTEISATIDL NPILVGDDNV INQAESEGSV TLSGTVGGDV
KLGDTVTLTL DGNVIATVQV IDLGGGVLGF STSVDAALLV GADVNSITAS VTTTDDAGNT
ASASDTEGYG VDTEISATID LNPILVGDDN VINQAESEGS VTLSGTVGGD VKLGDTVTLT
LDGNVIATVQ VIDLGGGVLG FSTSVDAALL VGADVNSITA SVTTTDDAGN TASASDTEGY
GVDTEISATI DLNPILVGDD NVINQAESEG SVTLSGTVGG DVKLGDTVTL TLDGNVIATV
QVIDLGGVLG FSTSVDAALL VGADVNSITA SVTTTDDAGN TASASDTEGY GVDTTAPAVT
ITITEDTNND GLLSIAELDG QVNYLVQLGA GTAVDDTLVI TDQDGNVLFN GLVTQAMLDN
GLALAVDAPA DGDTLTLTAT VTDPAGNSDS DSDSVTIDTT APAVTITITE DTNNDGLLSI
AELDGQVNYL VQLGAGTAVD DTLVITDQDG NVLFNGLVTQ AMLDNGLALA VDAPADGDTL
TLTATVTDPA GNSDSDSDSV TIDTTAPAVT ITITEDTNND GLLSIAELDG QVNYLVQLGA
GTAVDDTLVI TDQDGNVLFN GLVTQAMLDN GLALAVDAPA DGDTLTLTAT VTDPAGNSDS
DSDSVTIDTT APAVTITITE DTNNDGLLSI AELDGQVNYL VQLGAGTAVD DTLVITDQDG
NVLFNGLVTQ AMLDNGLALA VDAPADGDTL TLTATVTDPA GNSDSDSDSV TIDTTAPAVT
ITITEDTNND GLLSIAELDG QVNYLVQLGA GTAVDDTLVI TDQDGNVLFN GLVTQAMLDN
GLALAVDAPA DGDTLTLTAT VTDPAGNSDS DSDSVTIDTT APAVTITITE DTNNDGLLSI
AELDGQVNYL VQLGAGTAVD DTLVITDQDG NVLFNGLVTQ AMLDNGLALA VDAPADGDTL
TLTATVTDPA GNSDSDSDSV TIDTTAPAVT ITITEDTNND GLLSIAELDG QVNYLVQLGA
GTAVDDTLVI TDQDGNVLFN GLVTQAMLDN GLALAVDAPA DGGTLTLTAT VTDPAGNSDS
DSDSVTIDTT APAVTITITE DTNNDGLLSI AELDGQVNYL VQLGAGTAVD DTLVITDQDG
NVLFNGLVTQ AMLDNGLALA VDAPADGDTL TLTATVTDPA GNSDSDSDSV TIDTTAPAVT
ITITEDTNND GLLSIAELDG QVNYLVQLGA GTAVDDTLVI TDQDGNVLFN GLVTQAMLDN
GLALAVDAPA DGDTLTLTAT VTDPAGNSDS DSDSVTVDTS IAAPTVWIVD DGTPGDGLLT
QGEISSNGPG VQLQANVSHA DLLEGGFVTL TVTIGNAQPQ EYELELVNGV LQFTNGDPAP
DFDYNNGVIT WTENAPDAGQ SITVTVTQTD LAGNESAQDS DTAQVFAPGN NEMTVNESDL
RDNVPNVVSQ QISFTAGSET LTQFRFGDVN SIQAATNLAT GVSIAWALAL DGSLVGSIGG
VPVIKLTLTD TDPISANTTG SISVNVELLD NIKQVNGAND INLSSLIEGI VIEAVGANNS
VISATLDLTI DDDLVEASAT DSSGLNAADT QISGTVTVAG ADGNDNESAD QYSADLSVNI
TGWSESSTFA DSGLMTSGKA IYYYVDPNDT SVMIAYTSST AAEWGAVGAI QTKIFTLTLD
PNSGEYVLDL ETPITKITTT NADLTGNIPG GNDADLFVML NGAVKGEPES GDVVLCTITA
TDAGGVSTVN TSPNGIGVGT GKDIGSGETL TLDFGFPVTN FTGISLSYNN GNPYTGVFTI
NIVGKDAFGN DLVKTFIATP ATLANLIASE GFAEFTKIEL STAAGGQDFN LKNFTANSLS
VDPLGTVLNF NVAITDSDGD TDSSNPFTVT LNVPNTLNAV TPAAFTSLAE AKLVSDTMDA
DTDTLVFKAG SSDVNNVSFS ADVSDIQVEG IRQPMSWRIE GGVLIGSMPG RGDLLKLTLD
WNAIEAGEQG SVVVEAELLG KLPHNIDYDS LTVTGIKVVA TDDSGDTAKA DVTVTVADSQ
HIAVDDDNQV DVLIDAFEVR DITARWTDWT AESEEKSNVT TSDGPDDDNG HDIIRWGDVG
NGQPRSGYNF DENTNIQVGD VGLNQNIVLG TFTHVNQPIN GYSSITEATL EVTLMINGVA
AKVTLEFNHN ETGGYNNPPD IVTVANTSSE FVYDGARYTL KVMGFLDSNG DVVTSIKTAE
GASTSFPLVV QLIPGDGFEL PLIGGNVLHN DIQGADGDME IYGVSHSGNN ATESNGTFVI
QGTYGTLTLY GNGSYSYQVT TVGSLIPDNA VDTFSYTIED SDGDLSTAEL NINLNAVEEL
PIVYEGTQGV DSFLLTNSTS GKGLFAVSSS GYDAVASTES ANVINLDTAL HIKAGDSNDY
VDLGISRTDN TVETGSSLPN VPSQMSQDEV LSSKFMSARD ILDNDGKLKQ NVLEEVQPKT
DTVNLGVGDD TVYGGEGSQM VYGGAGNDLL IGGEGIDGLR GGEGNDTIIG GLGDDVLRGD
GGADTFVWRA GETGTDHIID FNINEDKLDL SDLLQGEENG NLEDYLSFSF ERGSTTIEID
ANRDGTVDQR IVLDGVDLAD EYGLASTDES GIINGLLGNG TGPLIVDTQA DTGAAQAVGR
ILSLDEDHKT ELMP