Gene Sputcn32_3591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSputcn32_3591 
Symbol 
ID5078827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella putrefaciens CN-32 
KingdomBacteria 
Replicon accessionNC_009438 
Strand
Start bp4169699 
End bp4182361 
Gene Length12663 bp 
Protein Length4220 aa 
Translation table11 
GC content53% 
IMG OID640500793 
Productputative outer membrane adhesin like proteiin 
Protein accessionYP_001185098 
Protein GI146294674 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat
[TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATCGG TCATTACATC AAAAAAAGGT TTGCTAAAGT TAGTTAAAGG GCAAATCAAT 
ATCGAAGTTG ATGGTACAAA TCAACCCGCT AAGGATGGCG AGCAACTCCC GAAAGGTGCA
GTCCTCCATA TTGGCGAAAA TGCAACATAT GAAATCACCT TCGATGATGG CACTAAATTA
TCGAACGAAG ACGTACCGAA TACCGCTACA GCGACACCGC CAAGTACCGC CAGTGAAGCG
ACGCTAGATG AAATCCAAGC ACTACAAGAT CTCATCGCTT CTGGAGAAGA TCCAACTCAA
AGCCTGCCAG AAACCGCAGC AGGTAATACC CCTGGTAGTG ATGGTAACTC AGGTTATGTA
ACATTAGCCC GTAGCGGCAG TGAAACCCTT GCGTCCTCAG GCTACTCAAC AACAGGTCAA
GCGCCAACAA CCATCGCACC CAATGAGTTA AGCCAAACCA TTGCCACTGA CTCGCCATCC
ATACTCGCGA ATGACACTAA TACTATTGAT GAAGATACAA TTGCCACAGG CAATGTACTC
GACAATGACA GCGATGCAGA TTCTGAATTA ACGGTAGCCA GCTTTGAGGT TGAAGGCACA
AGTTATGCCG CAGGAACTGA AGTGACTCTT GAAAGCGGTG TATTAATCAT CAACGCTGAT
GGCTCTTATA CTTTTACACC CAGTGAAAAC TGGAATGGCC AAGTTCCAGT TATTACCTAC
ACCACCAATA CAGGCGCAAG TGCAACACTG ACTCTAAATG TCACTCCAGT CAACGACCCG
AGTATTGCCA CAGGCGACAC CCAAGGCAGT GGCGCTGAGG ACGCTGGCGT GATCAGTGGC
ACCCTCAGTG CCACCGATGT TGACGGCCTC ACCAATGGCA ATGTCTTTAG CATCACAGGC
GCTGCCGCCA ACGGCACCGC CAGCATCGAC CCTGTCACTG GCGCGTGGAG CTACACCCCT
GTGGCGGACT GGAATGGCAC CGACAGCTTT ACCGTGACCA TCACCGATGA TGACGGCAAC
ACCACAACCC AGGTCATCAA TGTGACCGTG ACTCCGGTTG CCGATATCGT CGCCGACACC
ATGACCACCA ACGAAGATAC CGCGGTCACG CTGAACCTGC TGGCCAATGA CAGCTTTGAA
AATGCCGACG CCGCGGTGAC TGCAGTGACC AATGGCGCCA ACGGCACCGT GATCATCAAT
GCCGACGGCA CGGTCACTTA CACGCCTAAT GCAAACTTCC ATGGCAGTGA CAGCTTTACC
TACACCGTCA CCTCTGGCGG CGTCACCGAA ACGACGACGG TCAATGTGAC GGTAGGTCAA
GTCGATGACC CCACCACCAT CACAGGCGAC ACCCAAGGCA GTGGCGCTGA GGACGCTGGC
GTGATCAGTG GCACCCTCAG TGCCACCGAT GTTGACGGCC TCACCAATGG CAATGTCTTT
AGCATCACAG GCGCTGCCGC CAACGGCACC GCCAGCATCG ACCCTGTCAC CGGCGCGTGG
AGCTACACCC CTGTGGCGGA CTGGAATGGC ACCGACAGCT TTACCGTGAC CATCACCGAT
GATGACGGCA ACACCACCAC CCAGGTCATC AATGTGACCG TGACTCCGGT TGCCGATATC
GTCGCCGACA CCATGACCAC CAACGAAGAT ACCGCGGTCA CGCTGAACCT GCTGGCCAAT
GACAGCTTTG AAAATGCCGA CGCTGCGGTG ACTGCAGTGA CCAATGGCGC CAACGGCACC
GTGATCATCA ATGCCGACGG CACGGTCACT TACACGCCTA ATGCAAACTT CCATGGCAGT
GACAGCTTTA CCTACACCGT CACCTCTGGC GGCGTCACCG AAACGACGAC GGTCAATGTG
ACGGTAGGTC AAGTCGATGA CCCCACCACC ATCACAGGCG ACACCCAAGG CAGTGGCGCT
GAGGACGCTG GCGTGATCAG TGGCACCCTC AGTGCCACCG ATGTTGACGG CCTCACCAAT
GGCAATGTCT TTAGCATCAC AGGCGCTGCC GCCAACGGCA CCGCCAGCAT CGACCCTGTC
ACTGGCGCGT GGAGCTACAC CCCTGTGGCG GACTGGAATG GCACCGACAG CTTTACCGTG
ACCATCACCG ATGATGACGG CAACACCACA ACCCAGGTCA TCAATGTGAC CGTGACTCCG
GTTGCCGATA TCGTCGCCGA CACCATGACC ACCAACGAAG ATACCGCGGT CACGCTGAAC
CTGCTGGCCA ATGACAGCTT TGAAAATGCC GACGCCGCGG TGACTGCAGT GACCAATGGC
GCCAACGGCA CCGTGATCAT CAATGCCGAC GGCACGGTCA CTTACACGCC TAATGCAAAC
TTCCATGGCA GTGACAGCTT TACCTACACC GTCACCTCTG GCGGCGTCAC CGAAACGACG
ACGGTCAATG TGACGGTAGG TCAAGTCGAT GACCCCACCA CCATCACAGG CGACACCCAA
GGCAGTGGCG CTGAGGACGC TGGCGTGATC AGTGGCACCC TCAGTGCCAC CGATGTTGAC
GGCCTCACCA ATGGCAATGT CTTTAGCATC ACAGGCGCTG CCGCCAACGG CACCGCCAGC
ATCGACCCTG TCACCGGCGC GTGGAGCTAC ACCCCTGTGG CGGACTGGAA TGGCACCGAC
AGCTTTACCG TGACCATCAC CGATGATGAC GGCAACACCA CCACCCAGGT CATCAATGTG
ACCGTGACTC CGGTTGCCGA TATCGTCGCC GACACCATGA CCACCAACGA AGATACCGCG
GTCACGCTGA ACCTGCTGGC CAATGACAGC TTTGAAAATG CCGACGCTGC GGTGACTGCA
GTGACCAATG GCGCCAACGG CACCGTGATC ATCAATGCCG ACGGCACGGT CACTTACACG
CCTAATGCAA ACTTCCATGG CAGTGACAGC TTTACCTACA CCGTCACCTC TGGCGGCGTC
ACCGAAACGA CGACGGTCAA TGTGACGGTA GGTCAAGTCG ATGACCCCAC CACCATCACA
GGCGACACCC AAGGCAGTGG CGCTGAGGAC GCTGGCGTGA TCAGTGGCAC CCTCAGTGCC
ACCGATGTTG ACGGCCTCAC CAATGGCAAT GTCTTTAGCA TCACAGGCGC TGCCGCCAAC
GGCACCGCCA GCATCGACCC TGTCACTGGC GCGTGGAGCT ACACCCCTGT GGCGGACTGG
AATGGCACCG ACAGCTTTAC CGTGACCATC ACCGATGATG ACGGCAACAC CACCACCCAG
GTCATCAATG TGACCGTGAC TCCGGTTGCC GATATCGTCG CCGACACCAT GACCACCAAC
GAAGATACCG CGGTCACGCT GAACCTGCTG GCCAATGACA GCTTTGAAAA TGCCGACGCC
GCGGTGACTG CAGTGACCAA TGGCGCCAAC GGCACCGTGA TCATCAATGC CGACGGCACG
GTCACTTACA CGCCTAATGC AAACTTCCAT GGCAGTGACA GCTTTACCTA CACCGTCACC
TCTGGCGGCG TCACCGAAAC GACGACGGTC AATGTGACGG TAGGTCAAGT CGATGACCCC
ACCACCATCA CAGGCGACAC CCAAGGCAGT GGCGCTGAGG ACGCTGGCGT GATCAGTGGC
ACCCTCAGTG CCACCGATGT TGACGGCCTC ACCAATGGCA ATGTCTTTAG CATCACAGGC
GCTGCCGCCA ACGGCACCGC CAGCATCGAC CCTGTCACTG GCGCGTGGAG CTACACCCCT
GTGGCGGACT GGAATGGCAC CGACAGCTTT ACCGTGACCA TCACCGATGA TGACGGCAAC
ACCACCACCC AGGTCATCAA TGTGACCGTG ACTCCGGTTG CCGATATCGT CGCCGACACC
ATGACCACCA ACGAAGATAC CGCGGTCACG CTGAACCTGC TGGCCAATGA CAGCTTTGAA
AATGCCGACG CCGCGGTGAC TGCAGTGACC AATGGCGCCA ACGGCACCGT GATCATCAAT
GCCGACGGCA CGGTCACTTA CACGCCTAAT GCAAACTTCC ATGGCAGTGA CAGCTTTACC
TACACCGTCA CCTCTGGCGG CGTCACCGAA ACGACGACGG TCAATGTGAC GGTAGGTCAA
GTCGATGACC CCACCACCAT CACAGGCGAC ACCCAAGGCA GTGGCGCTGA GGACGCTGGC
GTGATCAGTG GCACCCTCAG TGCCACCGAT GTTGACGGCC TCACCAATGG CAATGTCTTT
AGCATCACAG GCGCTGCCGC CAACGGCACC GCCAGCATCG ACCCTGTCAC CGGCGCGTGG
AGCTACACCC CTGTGGCGGA CTGGAATGGC ACCGACAGCT TTACCGTGAC CATCACCGAT
GATGACGGCA ACACCACCAC CCAGGTCATC AATGTGACCG TGACTCCGGT TGCCGATATC
GTCGCCGACA CCATGACCAC CAACGAAGAT ACCGCGGTCA CGCTGAACCT GCTGGCCAAT
GACAGCTTTG AAAATGCCGA CGCTGCGGTG ACTGCAGTGA CCAATGGCGC CAACGGCACC
GTGATCATCA ATGCCGACGG CACGGTCACT TACACGCCTA ATGCAAACTT CCATGGCAGT
GACAGCTTTA CCTACACCGT CACCTCTGGC GGCGTCACCG AAACGACGAC GGTCAATGTG
ACGGTAGGTC AAGTCGATGA CCCCACCACC ATCACAGGCG ACACCCAAGG CAGTGGCGCT
GAGGACGCTG GCGTGATCAG TGGCACCCTC AGTGCCACCG ATGTTGACGG CCTCACCAAT
GGCAATGTCT TTAGCATCAC AGGCGCTGCC GCCAACGGCA CCGCCAGCAT CGACCCTGTC
ACCGGCGCGT GGAGCTACAC CCCTGTGGCG GACTGGAATG GCACCGACAG CTTTACCGTG
ACCATCACCG ATGATGACGG CAACACCACC ACCCAGGTCA TCAATGTGAC CGTGACTCCG
GTTGCCGATA TCGTCGCCGA CACCATGACC ACCAACGAAG ATACCGCGGT CACGCTGAAC
CTGCTGGCCA ATGACAGCTT TGAAAATGCC GACGCCTCGG TGACTGCAGT GACCAATGGC
GCCAACGGCA CCGTGATCAT CAATGCCGAC GGCACGGTCA CTTACACGCC TAATGCAAAC
TTCCATGGCA GTGACAGCTT TACCTACACC GTCACCTCTG GCGGCGTCAC CGAAACGACG
ACGGTCAATG TGACGGTAGG TCAAGTCGAT GACCCCACCA CCATCACAGG CGACACCCAA
GGCAGTGGCG CTGAGGACGC TGGCGTGATC AGTGGCACCC TCAGTGCCAC CGATGTTGAC
GGCCTCACCA ATGGCAATGT CTTTAGCATC ACAGGCGCTG CCGCCAACGG CACCGCCAGC
ATCGACCCTG TCACCGGCGC GTGGAGCTAC ACCCCTGTGG CGGACTGGAA TGGCACCGAC
AGCTTTACCG TGACCATCAC CGATGATGAC GGCAACACCA CCACCCAGGT CATCAATGTG
ACCGTGACTC CGGTTGCCGA TATCGTCGCC GACACCATGA CCACCAACGA AGATACCGCG
GTCACGCTGA ACCTGCTGGC CAATGACAGC TTTGAAAATG CCGACGCTGC GGTGACTGCA
GTGACCAATG GCGCCAACGG CACCGTGATC ATCAATGCCG ACGGCACGGT CACTTACACG
CCTAATGCAA ACTTCCATGG CAGTGACAGC TTTACCTACA CCGTCACCTC TGGCGGCGTC
ACCGAAACGA CGACGGTCAA TGTGACGGTA GGTCAAGTCG ATGACCCCAC CACCATCACA
GGCGACACCC AAGGCAGTGG CGCTGAGGAC GCTGGCGTGA TCAGTGGCAC CCTCAGTGCC
ACCGATGTTG ACGGCCTCAC CAATGGCAAT GTCTTTAGCA TCACAGGCGC TGCCGCCAAC
GGCACCGCCA GCATCGACCC TGTCACTGGC GCGTGGAGCT ACACCCCTGT GGCGGACTGG
AATGGCACCG ACAGCTTTAC CGTGACCATC ACCGATGATG ACGGCAACAC CACCACCCAG
GTCATCAATG TGACCGTGAC TCCGGTTGCC GATATCGTCG CCGACACCAT GACCACCAAC
GAAGATACCG CGGTCACGCT GAACCTGCTG GCCAATGACA GCTTTGAAAA TGCCGACGCT
GCGGTGACTG CAGTGACCAA TGGCGCCAAC GGCACCGTGA TCATCAATGC CGACGGCACG
GTCACTTACA CGCCTAATGC AAACTTCCAT GGCAGTGACA GCTTTACCTA CACCGTCACC
TCTGGCGGCG TCACCGAAAC GACGACGGTC AATGTGACGG TAGGTCAAGT CGATGACCCC
ACCACCATCA CAGGCGACAC CCAAGGCAGT GGCGCTGAGG ACGCTGGCGT GATCAGTGGC
ACCCTCAGTG CCACCGATGT TGACGGCCTC ACCAATGGCA ATGTCTTTAG CATCACAGGC
GCTGCCGCCA ACGGCACCGC CAGCATCGAC CCTGTCACTG GCGCGTGGAG CTACACCCCT
GTGGCGGACT GGAATGGCAC CGACAGCTTT ACCGTGACCA TCACCGATGA TGACGGCAAC
ACCACCACCC AGGTCATCAA TGTGACCGTG ACTCCGGTTG CCGATATCGT CGCCGACACC
ATGACCACCA ACGAAGATAC CGCGGTCACG CTGAACCTGC TGGCCAATGA CAGCTTTGAA
AATGCCGACG CTGCGGTGAC TGCAGTGACC AATGGCGCCA ACGGCACCGT GATCATCAAT
GCCGACGGCA CGGTCACTTA CACGCCTAAT GCAAACTTCC ATGGCAGTGA CAGCTTTACC
TACACCGTCA CCTCTGGCGG CGTCACCGAA ACGACGACGG TCAATGTGAC GGTAGGTCAA
GTCGATGACC CCACCACCAT CACAGGCGAC ACCCAAGGCA GTGGCGCTGA GGACGCTGGC
GTGATCAGTG GCACCCTCAG TGCCACCGAT GTTGACGGCC TCACCAATGG CAATGTCTTT
AGCATCACAG GCGCTGCCGC CAACGGCACC GCCAGCATCG ACCCTGTCAC TGGCGCGTGG
AGCTACACCC CTGTGGCGGA CTGGAATGGC ACCGACAGCT TTACCGTGAC CATCACCGAT
GATGACGGCA ACACCACCAC CCAGGTCATC AATGTGACCG TGACTCCGGT TGCCGATATC
GTCGCCGACA CCATGACCAC CAACGAAGAT ACCGCGGTCA CGCTGAACCT GCTGGCCAAT
GACAGCTTTG AAAATGCCGA CGCCGCGGTG ACTGCAGTGA CCAATGGCGC CAACGGCACC
GTGATCATCA ATGCCGACGG CACGGTCACT TACACGCCTA ATGCAAACTT CCATGGCAGT
GACAGCTTTA CCTACACCGT CACCTCTGGC GGCGTCACCG AAACGACGAC GGTCAATGTG
ACGGTAGGTC AAGTCGATGA CCCCACCACC ATCACAGGCG ACACCCAAGG CAGTGGCGCT
GAGGACGCTG GCGTGATCAG TGGCACCCTC AGTGCCACCG ATGTTGACGG CCTCACCAAT
GGCAATGTCT TTAGCATCAC AGGCGCTGCC GCCAACGGCA CCGCCAGCAT CGACCCTGTC
ACCGGCGCGT GGAGCTACAC CCCTGTGGCG GACTGGAATG GCACCGACAG CTTTACCGTG
ACCATCACCG ATGATGACGG CAACACCACC ACCCAGGTCA TCAATGTGAC CGTGACTCCG
GTTGCCGATA TCGTCGCCGA CACCATGACC ACCAACGAAG ATACCGCGGT CACGCTGAAC
CTGCTGGCCA ATGACAGCTT TGAAAATGCC GACGCTGCGG TGACTGCAGT GACCAATGGC
GCCAACGGCA CCGTGATCAT CAATGCCGAC GGCACGGTCA CTTACACGCC TAATGCAAAC
TTCCATGGCA GTGACAGCTT TACCTACACC GTCACCTCTG GCGGCGTCAC CGAAACGACG
ACGGTCAATG TGACGGTAGG TCAAGTCGAT GACCCCACCA CCATCACAGG CGACACCCAA
GGCAGTGGCG CTGAGGACGC TGGCGTGATC AGTGGCACCC TCAGTGCCAC CGATGTTGAC
GGCCTCACCA ATGGCAATGT CTTTAGCATC ACAGGCGCTG CCGCCAACGG CACCGCCAGC
ATCGACCCTG TCACCGGCGC GTGGAGCTAC ACCCCTGTGG CGGACTGGAA TGGCACCGAC
AGCTTTACCG TGACCATCAC CGATGATGAC GGCAACACCA CCACCCAGGT CATCAATGTG
ACCGTGACTC CGGTTGCCGA TATCGTCGCC GACACCATGA CCACCAACGA AGATACCGCG
GTCACGCTGA ACCTGCTGGC CAATGACAGC TTTGAAAATG CCGACGCTGC GGTGACTGCA
GTGACCAATG GCGCCAACGG CACCGTGATC ATCAATGCCG ACGGCACGGT CACTTACACG
CCTAATGCAA ACTTCCATGG CAGTGACAGC TTTACCTACA CCGTCACCTC TGGCGGCGTC
ACCGAAACGA CGACGGTCAA TGTGACGGTT TTGGATATTA CCCCCCCACC AGCGCCAACC
GTCTGGATCG TGGATGATGG TGTACCGGGA GATGGTCTGC TCACCCAAAG TGAAATCAAC
AGTAACGGTG TAGGCATTCA GCTTCAGGTT ACAGTGAGCC ATGCGGAACT TTTAATCGGA
GGTGTAGTCA CTATTAACGT CAATAACGGC GGAGATTTAA GCACCTATAC ACTTAAGCTC
GTCGATGGTG CATTACTCTT TAGTGATAAC ACATCAGCAA CAGGTTTCAG CTATAACAAT
GGTGTAATCA GTTGGACAGA AGACGTACCT GTGGCAGGGC AAAACATTAC CGTTACAGCA
ACTCAAACTG ACTCAACTGG TAATACATCA ACACAAGCAT ATGATACAGC AGAGATTTAC
CAACCTAACA ATCAGCAGAT AACGGTTAAT GAAAGTGACC TGCGTGACAA TATTCCTAAT
GTTGTCTCAA GTACAATTAG TTTCACCGCA GGTAACCAAG CACTGACGCA ATTCCGCTTT
AATGAGTCAG CCATTAATGC TGCGACAAAT CTCGCAGCAG GAGTGAGCAT CGTTTGGGCG
ATAGCCGCAA ATGGAGCACT AATAGGTTCC ATTGATGGGG TTGATGTCAT CAAGCTCACT
TTAACAGGTG GAGTGATTGC CGCAGGTACA ACTGCGAATA TTACTGTGAA TGTTGAATTG
CTTGATAATA TCAAACATAT GAATGCTCTA GATGGCACAA ATCTAAATTC ACTGATCAAT
GGTATTGTTA TTGAAGCTGT TAGCGCCGAT GGTAGCGTTT TGACCGGTAA TTTAAGCATC
GTCATTAATG ACGATCTAAT CTCTATCGAC CCTGTAAGTA CCTCTGGAGT AAATAGCTCT
AGCGCTGCCA ATATCATAGG TGCTCTTAAT ATTTTAGGAG CCGATGGTAA TGATCATGAT
CTTAATGATG ATTACAGCAT AAGTCTGACA GCGAATATCA CAGGGTGGAA TGGAACAACC
ACTACGTTTG CTAACTCAGG TATAACCTCA GAAGGGCTAA CCATATTTTA CTACGTAGAC
CCTGCTCATC CCAATGTGCT TATTGCATAT ACGGATACCA ATGCTACCCC CTCAGCCTAT
ACTGGCGCGG CGAACCAAGC ATTAATCTTC ACGTTGACAA CCGATCCGCA TAGCGACCAA
TACACTTTCG ATATCAATCA AAGCATTGAT CAACTTTCAA CTATCCAAAT TGCAGGTTTA
GTTGGAGGCC AAGGCGGCAT TGGTAACGCG GTATATGTTA CAACTAATAA TTCCCCAGCG
GGTTATGGTA TCTACAATGA TATAACTAAA ATTCCTGCTG GTGAGGATAT TGCTTTCACA
TTAACCGCAC GCGCTGAAAA TGGCAGTGTA GGCCGCGTCA ATGGTACCAA CAACGGCTTC
GGAGTGGACA ACCCACATGT TAGTGGTAAA GAAATTCTCA TCATTGATTA TTTAGAAGAT
GCAGCAACAG CAAGCTTTAA CTTTACAGGC GCTACATCCA TCTATTTTAA AGCCTATGAT
GAGCAAGGTA ATTTAATCGG TGAAGGAAAT ATAACAAGTG GTCAAGTTAT TCAAAATTTA
GGTTCTATAG CCTATGTTGA ATTATCTGCC TTAGCAGGTA CCAGTTTCCA ATTTACTGGC
ACTACGGCAC AAACCATTGT GAGCTCCACT CAAAATGTCG ACTTACATTT TGATGTAACG
GTTACCGATA GTGATGGTGA TACGAATACA GGCGGATTTA ACATCCATCT TGAGGCACCA
AATACAACGC CAATCGCACC TGTTGCGCTC ACTACTAACG CCATTGCAAG CCTTAATGAA
GCAGACTTGC AGGCTGGCGC TCCTGATATG AGTGTACAAA CACTTAGCTT TAAATCTGGT
AGTAACTCGA TTGGTAGCTT CCAATTTGGT GATTTTAGTA ATATTTCCGT CACAGGTATT
AATGCGCACA TTCATTGGGC CGTTAATACT GCGGGCCAAT TAATAGGTAC CGTGTTTGGC
CGTGAAGCTC TTCGCTTAAC CTTAGACTGG GATCGAATTA ATGCAGGAGA ACAAGGTGAT
GTGACGGTTA CCGCCGAATT ACTGACTAAT TTACCCCATA GCGTTAATGT GGATAGCCTT
ACGATAAATG GCATTAAAGT CGTGGCTATT GATGGTGCTG GTAATACAGC CCAGTCAACT
GTCACGGTTA CCGTTGCCGA TGATGTTGAT ATCGCTAAAA ATGATACAGC ACAACTCGAT
GTTGTTGTCG ACTCATTTAA GTTTTCTGGA GTAGTTGCTA ATTGGCAAAG TACTATTGGT
GGTACAAATA TCACTAAATA TGATGGTCCA GATAACGATA CAGGTTTAGA TCAAATTCGG
TGGGGAGATC CATCTAGTTG GTATGGAAAC CAATCAGGTT ATGGTTTTAT GGATAACGAT
GCAGGACTTA ATGGAGCATT GTCACTTAAC CAAGATATCG TTTTGGGTAC ATTTACCCAC
TACAATTATT CAATTACGTC AGGTACATCT ATTACTGCCG CCACAATGAA AGTGACCTTT
AACGTCACAG ATGCATATGG AGTAACTACA CCTGTAACGT TAACTCTCAA CTTTAGCCAT
AACGAAACAC CTAATACAAA TGACCCAATA GCTTCTCGCG ACATTGTCAC CGTAGGCCAA
ACCAGTGTCA CCTTCAACTA TGAAGGCCAA ATCTACACAA TGCAAGTGAT TGGATTTAAA
GATACTAATG GCAACGTTGT CACATCTATC TACACTAATG AGGATGCTGC GACCAGTTAT
GAACTAGTAG TTCGTATGGT TGCTGGTAAT GGATATTCCT TACCAAACAC TGAAGGCAAT
GTGTTGACAA ATGATGTCGT TGGAGCTGAT GGTCCATTGA CCATTATTGG GGTAGCTAAG
GGCGATTTTA CCAATACCGG CGGTGTATCT GGGCAAGTAG GGTCAACCAT AACTGGCCTA
TATGGCACTC TGATACTAAA CGCCGATGGA ACCTATAAAT ACCAACTTAC AGCAAATGCA
AGCCAATTGC CTACAACAGG AGCAATAGAA ACCTTTACTT ATACCATCCG TGATAGTGAT
GGCGATGTAT CCAGTGCAAC CTTGAAAATC AACGTTAATC CCGTCAATAG CGATGGTATT
AATATTGCGG ACGCCAACCT TATTACCACT CAAGGTTCTA GCCTGAATGA CACTATGGTC
GTGGTTAATG GTGAAAGTGC TAATAACCCA AATCAAAAAA TATTGAACGT CAGTTTTGGT
GGTGGCCAAA GCGGTATCAT CACAAACAGT AACGGAAAAG AGGTTGTTGC TTCAGGAGCT
AACAATAAGA GCTATAGCAC TACGGATGCC CAATTCGTTA ATGGCGGCGA TGGTAATGAT
CATATTGAAA CAGGCAAAGG CAACGATGTT ATCTATGCGG GTAGAACAGG CTCGACTGGA
TATGGTTCAG ATGATGCACT TGAGCTATCA GTCAATACGC TCCTAAATCA CCATATTATG
ACGGGTGAAT TAACTGGCGC TAACAGAATG GTTGATAGTA ACGGCCTATT ATTGGCTAAT
GATGTCGCTT CGCATAAAGC GGATATTGTA AATGGTGGCA GCGGTGATGA CCGAATTTAT
GGTCAATCTG GCTCTGACAT TCTCTATGGT CATACAGGTA ATGATTATAT TGATGGTGGT
AGCCACAACG ATGCACTCCG TGGTGGAGAG GGCAACGATA CTCTCATCGG TGGCCTTGGT
GATGATGTCC TTCGTGGTGA TAGCGGCGCT GATACCTTCG TCTGGCGATA TGCGGAGTTT
GGCACTGACC ACATTATGGA CTTTAAAGTC ACAGAAGATA AGTTAGACCT GAGTGACTTA
CTCCAAGGCG AATCGGCAAA TAATTTGGAC AGTTACTTAA ACTTTAGCTT AGACAGCACT
GGCTCTACCG TCATCGATAT TGATGCCAAT CTTGATGGTG TATATGAACA GCACATCATT
CTTGATGGCG TGAATTTATT CACCACTTAT GGCGCAACAA ATGATGCCGG AGTGATTAAT
GGTCTACTGG GTACCAATGG TAATGGTCCA TTAATTATCG ATACTCAGCC TATCACTCCA
GAGACACCAC AGGGAGTAAC ACCACTTAAT GATCCTCATA ACAATGGCAC TATGATCCCT
TAA
 
Protein sequence
MGSVITSKKG LLKLVKGQIN IEVDGTNQPA KDGEQLPKGA VLHIGENATY EITFDDGTKL 
SNEDVPNTAT ATPPSTASEA TLDEIQALQD LIASGEDPTQ SLPETAAGNT PGSDGNSGYV
TLARSGSETL ASSGYSTTGQ APTTIAPNEL SQTIATDSPS ILANDTNTID EDTIATGNVL
DNDSDADSEL TVASFEVEGT SYAAGTEVTL ESGVLIINAD GSYTFTPSEN WNGQVPVITY
TTNTGASATL TLNVTPVNDP SIATGDTQGS GAEDAGVISG TLSATDVDGL TNGNVFSITG
AAANGTASID PVTGAWSYTP VADWNGTDSF TVTITDDDGN TTTQVINVTV TPVADIVADT
MTTNEDTAVT LNLLANDSFE NADAAVTAVT NGANGTVIIN ADGTVTYTPN ANFHGSDSFT
YTVTSGGVTE TTTVNVTVGQ VDDPTTITGD TQGSGAEDAG VISGTLSATD VDGLTNGNVF
SITGAAANGT ASIDPVTGAW SYTPVADWNG TDSFTVTITD DDGNTTTQVI NVTVTPVADI
VADTMTTNED TAVTLNLLAN DSFENADAAV TAVTNGANGT VIINADGTVT YTPNANFHGS
DSFTYTVTSG GVTETTTVNV TVGQVDDPTT ITGDTQGSGA EDAGVISGTL SATDVDGLTN
GNVFSITGAA ANGTASIDPV TGAWSYTPVA DWNGTDSFTV TITDDDGNTT TQVINVTVTP
VADIVADTMT TNEDTAVTLN LLANDSFENA DAAVTAVTNG ANGTVIINAD GTVTYTPNAN
FHGSDSFTYT VTSGGVTETT TVNVTVGQVD DPTTITGDTQ GSGAEDAGVI SGTLSATDVD
GLTNGNVFSI TGAAANGTAS IDPVTGAWSY TPVADWNGTD SFTVTITDDD GNTTTQVINV
TVTPVADIVA DTMTTNEDTA VTLNLLANDS FENADAAVTA VTNGANGTVI INADGTVTYT
PNANFHGSDS FTYTVTSGGV TETTTVNVTV GQVDDPTTIT GDTQGSGAED AGVISGTLSA
TDVDGLTNGN VFSITGAAAN GTASIDPVTG AWSYTPVADW NGTDSFTVTI TDDDGNTTTQ
VINVTVTPVA DIVADTMTTN EDTAVTLNLL ANDSFENADA AVTAVTNGAN GTVIINADGT
VTYTPNANFH GSDSFTYTVT SGGVTETTTV NVTVGQVDDP TTITGDTQGS GAEDAGVISG
TLSATDVDGL TNGNVFSITG AAANGTASID PVTGAWSYTP VADWNGTDSF TVTITDDDGN
TTTQVINVTV TPVADIVADT MTTNEDTAVT LNLLANDSFE NADAAVTAVT NGANGTVIIN
ADGTVTYTPN ANFHGSDSFT YTVTSGGVTE TTTVNVTVGQ VDDPTTITGD TQGSGAEDAG
VISGTLSATD VDGLTNGNVF SITGAAANGT ASIDPVTGAW SYTPVADWNG TDSFTVTITD
DDGNTTTQVI NVTVTPVADI VADTMTTNED TAVTLNLLAN DSFENADAAV TAVTNGANGT
VIINADGTVT YTPNANFHGS DSFTYTVTSG GVTETTTVNV TVGQVDDPTT ITGDTQGSGA
EDAGVISGTL SATDVDGLTN GNVFSITGAA ANGTASIDPV TGAWSYTPVA DWNGTDSFTV
TITDDDGNTT TQVINVTVTP VADIVADTMT TNEDTAVTLN LLANDSFENA DASVTAVTNG
ANGTVIINAD GTVTYTPNAN FHGSDSFTYT VTSGGVTETT TVNVTVGQVD DPTTITGDTQ
GSGAEDAGVI SGTLSATDVD GLTNGNVFSI TGAAANGTAS IDPVTGAWSY TPVADWNGTD
SFTVTITDDD GNTTTQVINV TVTPVADIVA DTMTTNEDTA VTLNLLANDS FENADAAVTA
VTNGANGTVI INADGTVTYT PNANFHGSDS FTYTVTSGGV TETTTVNVTV GQVDDPTTIT
GDTQGSGAED AGVISGTLSA TDVDGLTNGN VFSITGAAAN GTASIDPVTG AWSYTPVADW
NGTDSFTVTI TDDDGNTTTQ VINVTVTPVA DIVADTMTTN EDTAVTLNLL ANDSFENADA
AVTAVTNGAN GTVIINADGT VTYTPNANFH GSDSFTYTVT SGGVTETTTV NVTVGQVDDP
TTITGDTQGS GAEDAGVISG TLSATDVDGL TNGNVFSITG AAANGTASID PVTGAWSYTP
VADWNGTDSF TVTITDDDGN TTTQVINVTV TPVADIVADT MTTNEDTAVT LNLLANDSFE
NADAAVTAVT NGANGTVIIN ADGTVTYTPN ANFHGSDSFT YTVTSGGVTE TTTVNVTVGQ
VDDPTTITGD TQGSGAEDAG VISGTLSATD VDGLTNGNVF SITGAAANGT ASIDPVTGAW
SYTPVADWNG TDSFTVTITD DDGNTTTQVI NVTVTPVADI VADTMTTNED TAVTLNLLAN
DSFENADAAV TAVTNGANGT VIINADGTVT YTPNANFHGS DSFTYTVTSG GVTETTTVNV
TVGQVDDPTT ITGDTQGSGA EDAGVISGTL SATDVDGLTN GNVFSITGAA ANGTASIDPV
TGAWSYTPVA DWNGTDSFTV TITDDDGNTT TQVINVTVTP VADIVADTMT TNEDTAVTLN
LLANDSFENA DAAVTAVTNG ANGTVIINAD GTVTYTPNAN FHGSDSFTYT VTSGGVTETT
TVNVTVGQVD DPTTITGDTQ GSGAEDAGVI SGTLSATDVD GLTNGNVFSI TGAAANGTAS
IDPVTGAWSY TPVADWNGTD SFTVTITDDD GNTTTQVINV TVTPVADIVA DTMTTNEDTA
VTLNLLANDS FENADAAVTA VTNGANGTVI INADGTVTYT PNANFHGSDS FTYTVTSGGV
TETTTVNVTV LDITPPPAPT VWIVDDGVPG DGLLTQSEIN SNGVGIQLQV TVSHAELLIG
GVVTINVNNG GDLSTYTLKL VDGALLFSDN TSATGFSYNN GVISWTEDVP VAGQNITVTA
TQTDSTGNTS TQAYDTAEIY QPNNQQITVN ESDLRDNIPN VVSSTISFTA GNQALTQFRF
NESAINAATN LAAGVSIVWA IAANGALIGS IDGVDVIKLT LTGGVIAAGT TANITVNVEL
LDNIKHMNAL DGTNLNSLIN GIVIEAVSAD GSVLTGNLSI VINDDLISID PVSTSGVNSS
SAANIIGALN ILGADGNDHD LNDDYSISLT ANITGWNGTT TTFANSGITS EGLTIFYYVD
PAHPNVLIAY TDTNATPSAY TGAANQALIF TLTTDPHSDQ YTFDINQSID QLSTIQIAGL
VGGQGGIGNA VYVTTNNSPA GYGIYNDITK IPAGEDIAFT LTARAENGSV GRVNGTNNGF
GVDNPHVSGK EILIIDYLED AATASFNFTG ATSIYFKAYD EQGNLIGEGN ITSGQVIQNL
GSIAYVELSA LAGTSFQFTG TTAQTIVSST QNVDLHFDVT VTDSDGDTNT GGFNIHLEAP
NTTPIAPVAL TTNAIASLNE ADLQAGAPDM SVQTLSFKSG SNSIGSFQFG DFSNISVTGI
NAHIHWAVNT AGQLIGTVFG REALRLTLDW DRINAGEQGD VTVTAELLTN LPHSVNVDSL
TINGIKVVAI DGAGNTAQST VTVTVADDVD IAKNDTAQLD VVVDSFKFSG VVANWQSTIG
GTNITKYDGP DNDTGLDQIR WGDPSSWYGN QSGYGFMDND AGLNGALSLN QDIVLGTFTH
YNYSITSGTS ITAATMKVTF NVTDAYGVTT PVTLTLNFSH NETPNTNDPI ASRDIVTVGQ
TSVTFNYEGQ IYTMQVIGFK DTNGNVVTSI YTNEDAATSY ELVVRMVAGN GYSLPNTEGN
VLTNDVVGAD GPLTIIGVAK GDFTNTGGVS GQVGSTITGL YGTLILNADG TYKYQLTANA
SQLPTTGAIE TFTYTIRDSD GDVSSATLKI NVNPVNSDGI NIADANLITT QGSSLNDTMV
VVNGESANNP NQKILNVSFG GGQSGIITNS NGKEVVASGA NNKSYSTTDA QFVNGGDGND
HIETGKGNDV IYAGRTGSTG YGSDDALELS VNTLLNHHIM TGELTGANRM VDSNGLLLAN
DVASHKADIV NGGSGDDRIY GQSGSDILYG HTGNDYIDGG SHNDALRGGE GNDTLIGGLG
DDVLRGDSGA DTFVWRYAEF GTDHIMDFKV TEDKLDLSDL LQGESANNLD SYLNFSLDST
GSTVIDIDAN LDGVYEQHII LDGVNLFTTY GATNDAGVIN GLLGTNGNGP LIIDTQPITP
ETPQGVTPLN DPHNNGTMIP