Gene Smal_0113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_0113 
Symbol 
ID6477662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp138072 
End bp152972 
Gene Length14901 bp 
Protein Length4966 aa 
Translation table11 
GC content67% 
IMG OID642729246 
Productfilamentous haemagglutinin family outer membrane protein 
Protein accessionYP_002026501 
Protein GI194363891 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.352935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAAAG TCTACCGTCT GGTCTTCAAC CGTACGCTCG GGGTCATGCA GGTGGCGTCC 
GAGCTGGTCA ACGCCGCCCG CGGTGGTGCC GACAGTCGAG AGGGCCATGC GGTCGGCACC
CTGCGTCCGA TCAGTTTCGC GCTGTGGCTG GTGCTGGGAT GGGTGGGGCT GGTGCAGCCG
TTGTCGGCGC AGCAGGCGCC GGCCGACCCG GGCCGTATTG CCGCCGATCC CGGCGCACCG
GCAAACCAGC GCCCGACGGT GATCACCTCG GCCAATGGCA CGCCGCAGGT GAATATCACC
ACGCCCAGCG CTGGCGGCGT GTCGCGCAAT TCCTACCAGC AGTTCGATGT CGGCCAGCAG
GGCGTCATCC TCAACAACTC GCGCGGCGAC GTGCAGACCC AGCTGGGCGG CTGGGTACAG
GGCAATCCGT GGCTGGCCAC CGGCACCGCC CGGGTCATCC TCAACGAGGT CAACAGCAGC
AATCCCAGCC ATCTCAATGG CTACGTGGAA GTTGCCGGTA CCCGCGCGCA GGTGGTCATC
GCCAACCCGG CCGGGATCCA GGTCAATGGC GCGGGCTTCC TCAACGCCAG CCGGGTGACC
CTGACCACCG GCACGCCGAT CGTCAGCAAT GGCGTGCTGG AAGGCTATCG CGTGGAAGGC
GGCAGCATCG GCGTCGGTGG CACGGGCCTG GATACCAGCC GCGCCGACTA CACCGACATC
ATCACCCGCT CGCTGCAGGT CAATGCCGGC ATCTGGGCCA ACCAGCTGCA GGCCAGCCTG
GGCAGCAACG TGGTCAGTGC CGACCACAGC AGCGTGCAGC AGCAGAAGCC CACGTCCGCC
GCGCCCACCT TCGCGCTGGA TGTGGGCGCG CTGGGCGGCA TGTATGCCAA CCGCATCTGG
CTGGTCGGCA ACGAACATGG CGTGGGCGTA AGCAATGCCG GCAAGATCGG TGCCCAGGCC
GGCGAGCTGG TGGTCACCGC CGACGGCCGG CTGCAGAACA CCGGCGCGAT GCATGCGCAG
CAGGACCTGC GTATCGACGC CAGCGCGGGC ATCGCCAATG CCGGCACGCT CAGTGCCGAG
CGCGAGCTGC GCGTGCAGAC CCCGGCAGAT GTCGACAATA GTGGCGGCAC GCTCAATGCG
CGACGCATCG AAGTGAATGC CGATGCGCTG CGCAACCGCG GCGGCAGCAT CGAACAGCTT
GGAACGCAGG CGCTGCAGCT GCAGGCCGCC GACCTCAGCA ACCTGCAGGG CCGTATCGGC
ACCGTGTCCA GTGTTTCGCC AGGCACCGGC ACCGGCGGCA CCGGCACCCC GCCGGGTACG
GGTACGCCAG GCACGGGTAC GCCAGGTACC GGCACTCCGG GCACGGGTAC CCCGGGCACT
GGCACTCCCG GCACCGGCGG CGGCACCCTG CCGGTCACAC CACTTCCGCC GCTGGCGGCC
GGCGTGCTCA ACATCGTAGG AACGCTGGGC AACGATGGCG GCCGCATCGA AGCCAGTGGC
GACCTGCAGC TCAGCGCCCG CAACAGCCTG GACAACAGCG ACGGCACGCT GGGGGTAAGC
ACCCTGCTGG TGCAGGGACA GGCGCTGCGC AACATGCGCG GCACCTTGCA GGTCCAGGGC
GCGGCCAGCG CACAGGTGCA GCAGCTGGAC AACAGTGGCG GCCGCATGAC CTTCGCCCGT
GGCTTCGAGC TGCAGGCGCA ATCGGCAATC AATCGTGGCG GCAGCCTCGC CCACGGTGGC
AGTGCCGGCA CGATCTGGAC GATCGGCCAG CTCGACAACA GCGAGGGCAG CATCTCCAGC
AATGCCACGC GGCTCGGCCT CGACATCGGC ACCCTGGTCA ACACCCGGGG CAGCATCAAC
CACGCCGGCA GCGAGGGCCT GCTGCTGCGT GCGGAGGTAC TGGATGGCGT GCAGGGCAGC
ATTGCCACCG CCGGCGCGGC AGACCTGCTG CTGGGCCGCG CCGACCACCG CGGTGCGCAG
CTGATCGCCC GCCAGCTCGG CCTGAGCGCA CAGGACTTCG ACAACCGCGG CGGCCGCGTG
CTGTCCACGG GGCTGCAGGC CAGCACGCTC AACGTCCGCA ATCACCTGGA CAACAGCGAC
GGTGGCCTGC TGGCCAGCAA TGCCGACCTG CAGATCGATG CAGCGTCCTT CGGCAATGCC
GGCGGCACCG TGCAGCAGGC GGGCACCGGC AACCTGCGCG TCACCACCGC CAACCTGCAG
GGCCAGGGCG GCACCGTTCT CAGCAACGGC ACGCTGCAGC TGCACGGCGA CACCCTGAAT
CTGCGCAATG GCACCACCGC CGCGCAGCGC ATCGATGTGC GCGCCGGAGA CCTGACCACC
GCCGGCGGCA CGCTGACCTC GACCGGCGGC GATGCGCTGC AGCTGCAGGT ATCGCGCACC
CTGGACAACA GCGGCGGCAC CGTGGGCGCC AATGGTGCGC TGGACATCAA CAGCGGCATC
CTCATCAACG ACGGCGGCAA GCTGATCGCC GCAGGCACCG GCACCAGCCG CATCCACGCC
AGCCAGCGGC TGCAGAACCA GCGCGGCCTG CTGTCCGGCA ATGGCGATCT GGATATCAAG
ACCGATGGCC TGCTGAACGA GAGCGGCAGC ATCGACCATG CCGGCACCGG CACGCTGCAG
ATCGACGCGA CCACCGTGCA GGGAGCTGCT GGCAGCATCG CCAGCAATGG CCTGCTGCAG
TTGAGCGGGC AGCAGCTGGA CCTGCAGAAG GGCACCACCC GCGCCCGCGA GGTGCAGATC
ACGGCCACCG GCCTGGACAC CAGCGGCGGC AGCCTGTTGT CACTGGGCAA CAACGCGATG
CAGCTGCAGG TGCAGGGCCG GCTGCGCAAC GACGGCGGCA GCATCAGCGC CAATGGCGCG
CAGCAGATCC AGGCGGGTGC GCTGTCCAAC CGCGGCGGTG CGCTCAATTC GGCGGGAACG
GCGGCTTCGC AGATCCGGGT CGACGGCACG CTCGACAACA GTGGCGGCAG CATCGCCAGC
AATGCCGCCC ACCTGCAGCT GAAGAGCGGC GCGCTGGTCA ATGCCGGCGG TACGCTCAGC
CATGCGGGCA GCGACGGCCT GATCATCGAC AGCGGGCGTC TGGACGGAGC AAAGGGGCAG
ATCGCCACTG CCGGCGCCCT GCAGCTCACT GCGGCCGAGG TCGATCACCA GGGAGCGTCG
CTGGCTGCCG CGCAGCTGGA CATCACGGCG ACCGGGTTCG ACAACCGCGG CGGGCGCATC
ATCGCCACCG GTACCGGCGG CAATACGCTG CGCGTGCAGG GCACGCTGGA CAACGGCAAT
GGCGGCACCC TCGCCAGCAA TGGCAACCTG GATATCCACG CGCGTACCTT CGGCAATGCC
GGCGGAACCG TGCAGCAGGC CGGGTCCGGC AGCCTGGTCA TCACCACGCA GGACCTGACC
GGCGCCGGTG GCACCCTGCT CAGCAACGGT TCGCTGGAGC TGCAGGGCGA CACCCTCGAT
CTGCGCAACG GCACCACCAC CGCGCAGCGC ATCAGCATTG ACGGCGACAC CGTGACCACC
GCCGGTGGCC AGCTGAGTGC GCTTGGCACC CAAGCGCTCC AGCTGCAGGC CCGCAGCCTG
CTCGACAATA CCGGCGGAAC GCTGGGCAGC AATGGGGCCG TCGATGTGCA CGCAGGCCGC
TTCGTCAACG ACCATGGCAA GTTGATTGCC GCCGGTGATG CTGCATCCGC AATCCGCGCC
GCGCAGCTGG AGAACCACAG CGGCTCGATC TCCGCCAACG GCAACCTGCG CATCGAGGCG
CAAACGCTGT CGGGCCAGAG CGGCAGCATC GGTGCCGCCC GCGCGCTGTC ACTGCAGGGC
GGGTCATTGG ATCTCCGCGG CGGCAGCGTG GCGGCCGAGC AGCTCGATAT CGTGGCCGAC
AGCCTGGACA ACAGCGCAGG CGCGCTGCGC GCAACCGGCA GCGGGACGCT GGCACTGCAG
GTGAGCGGCC GCTTGGCCAA TGACGCGGGC ACCATCGCCA GCAACGGTGC GCAGCGCATC
GAAGCCGGCG AACTCTCCAA CCGTGGTGGC ACGCTCAGTT CGGCGGGCAG CGCCACGACC
GAACTGCACG TAGCGGGCCT GTTCGACAAC AGCAACGGCG TGCTCGCCAG CAACGGCGCC
GCGCTGAAGA TCGATGCCGG CCACATGGCC AACGTGAAGG GCACGCTCAG CCACGCCGGC
ACTCAGGGCC TGCTGCTGCG CACCGGGCAA CTGGACGGCC AGGGCGGCAG CATCGCCAGC
GCAGGCGGCA TCACCCTGCA GGCGGGCAGC GTGGACCATC GCGGTGCCAC CCTGCAGGCC
GAGCGCATCG CGCTGGACGC ACAGTCCTTC GACAACCGCG CCGGCAAGGT CATCGCGACC
GGTGCCGAGA GCAGCACGAT CAACGTGGTC GGCACGCTGG ACAACGGCGA GGGCGGCCTG
CTGGCCAGCA ATGGCGACCT CAGCATCCAG GCGGCGGTGT TCGGCAATGC CGGTGGCACC
GTGCAGCACG CCGGTGAAGG CGTGCTGTCG ATCGACGCCG GAACGCTCAA CGGCACCGGC
GGCACTCTGG TCAGCAATGG CAGCCTGTTG CTCAAGGGCA CCACCACCGA CCTGCGCGCG
GGTATCACCT CCGCCAAGCG GATCGAGATA GACACCGGCT CCCTGATCAC CGCCGGCGGC
ACGCTCACCG CGACCGGCAA TGACATGCTG CGGCTGAGCG CGCGCGAACG GATCGACAAC
AGCGGCGGTA CCGTGTCCAG CAATGGCGCG CTGGACCTGC GCAGTGCCAC GCTGGTCAAC
GCGGGCGGCA CGCTGCAGTC GGCGGGTACA GCGGCGAGCC AGCTGATCAT CGGCCAGGAC
ATCAACAACC GCGGCGGCCG CATCCTGGCC AACGGTGGGC TTGGCATCAG CTCCGGCAGC
ATCGACAACC AGGGCGGCAC GCTGCACAGC GCCGCACGGC TGGTGGTAAA GGCCGACGGC
CTGCTCGACA ACAGCAACAA GGGCGTCATC GCCAGCGGCG CGGGCATGCA GGTGGCGGCA
ACTTCGCTGG ACAACCGCAG CGGCAGCATC GAGCAGGCCG GTGACGACCT GCTGCAGATC
GATGCCACCA CGCTGCAGGG CCACGCCGGC CGCATCGTCA GCAACGGCGA GCTGCAGCTG
AAGGGCGAGA CCCTGGATCT CAGCGCAGGC ACCACCGCCG CGCAGCAGGT CAGCATCGAG
GCCGGCCAGC TGGACAACAC GGCCGGTACG CTGAACGCCA CCGGCAGCCA GGCGATGTCG
CTGCAGGTGC GTGGTGCGCT CGGCAATGAC GGCGGCACCA TCGCCGCCAA CGGCGCACAG
CAGATCAACG CCGGTTCGCT GTCGAACGCA GGCGGCACCT TGAGTTCGGC AGGGACCTCC
GACAGCCGGA TCACGGTCAC GGGCCGCTTC AACAACGGCG GTGGCACCCT CGCCAGCAAT
GCCGGCAACC TGCGCCTGGA CGCCGGGCAG CTGGTCAATG CCGCAGGCAG CATCGTCCAT
GCCGGGAAGG GCATGCTGAC CGTGAAGGCC ACCCAGCTGG ACGGGGCCGG CGGCAGCATC
GCCACGGCAG GCGCACTGCA GCTGGATGCG GTGAGCGTGG ACCACCGCGG TGCCACGCTC
AATGCCGATC ACTTCACCGT CAACGCCGAC CGCTTCGACA ATCAGAACGG CAAGCTGCTG
GCGACGGGCA CGCAGGCCAG CACCGTGCAG GCCACCACCT CGCTGGACAA CGGCGGCAAC
GGCCTGATCG CCAGCAATGG CGACCTGACC CTGACCAGCG CATTGTTCGG CAATGCCGGC
GGTACCGTGC AGCAGGCCGG TACCGGCACG CTGGCGATCA ACGCGCACAC CTTGAATGGC
CAGGGCGGCA AGCTGCTCAG CAATGGCGCC CTGCAGTTGA CCGGTGAAAC CACCGACCTG
CGTGATGGCA CGCTGTCCGC TGCCCGCATC GCTGTGGATA CCGGCACGCT GCTCAACGCG
GGCGGTTCGA TCATCGCCTC CGGCACTGAT GCGTTGAAGG TGTCTGCGCG CGACCGGCTG
GACAACAGCG GTGGCACGCT TGCCGGCAAT GGCGCGCTGG ATCTGCGCAG CGCGCAACTG
CTCAACAACC TGGGTACGAT CCAGGCCGCC GGCAGCGGCA GCAGCACCTT GGCGATCACC
CATGCGCTGG AGAATCGCGG TGGTCGCATC CTGACCACGG GCGATGCAGG CATCAGCGCG
GGCACGCTCG ACAATCGCGG CGGCACCGTG CACAGCGATG GCAACAGCGC GCTGACCGTG
CGCGTTGACG GCCTGCTCGA CAACAGCGAC AAGGGCACCC TGTCTGCGGG GGGCGCGTTG
CTGGCCGAGG CACAGGCCAT CAACAACAGC AGCGGTACCG TGGCGGCAGG GCAGAACCTG
CAGCTGACCA GTGCCGATCT GCTGCGCAAC GAAGGCGGCC TGGTCCAGGC CGGCAAGCAC
CTGCAGGTCT CGGCCAGCGG CGTGAACAAC AACAGTGGCC GGATCATCGC CAACGACCTG
CAGCTGGATA CCCGCGGGCA GGCGCTGGAC AACCGCAGCG GCATCATTGC CAGCCTTTCC
GGAAATGCAG CGCTGCGCAG TGGTGCGCTG GACAACACCG GTGGCCTGCT GCAGTCGGCG
GCAGCGCTCA GCATCGACAC CTCCGGACAG CGGCTGACCA ATGCCGCTTC CAATGGCAAC
GGCATCGTCA GCAGTGGCAC GCTGCAGATC CGCAGTGGCG ATCTGGACAA TCGAGGCGGC
TCGGTGTTCG CCAAGGCCGG CGTCGACGTG CAGGCCGTCA ACATCGACAA CAGTGGCGGC
GGCTCGCTGG TGAGTGCAGC CGACCTGCTG CTGCGCGCGC AGCAGCTGGC CAACAGCGGT
GGCAGCGTGA CCGCCGGTGG CAATGCCGAC ATCGGTTTGC AGGGCGCCCT GCTCAACAGT
GGCGGGCTGG TGGCGGCGAC CGGCCGCCTC GACCTGCAGG CCGGTTACAT CGACAACCGC
AATACGCTCA GCCAGGCCAA TGGCCCTGCG CTGGGCTTGC AGGGCAAGAA CCTGCAGGTG
ACCACCGGCA ACCTGGACAA CCAGAACGGC CAGGTGATTG CCGACAACCT GCTGCTGCAG
GTGAACCAGC GCCTGGACAA CAGCGGTGGC CAGGTGTCGG CGGCGCTGAC CTCCGATGTC
CGCGCCGATA CCTTGGTCAA CAGCGGCGGC ACCCTGGTGG CCGGCAGGCA GCAGACCCTC
CGCACCCGCG AAATCATCGG TGATGGCCGG CTGATGTCGC AGGGTGACAT GACCCTGGAG
CTGGGCCAGA GCCACACCAA CCGTGGCGAG ATGGTGGCCA ACGGCACGCT GAGCCTGTCC
ATCCACGGCA ACCTGGACAA CAGCGGCAAG CTGGCGGGTG CCAACGTCAA CATCAATGCC
GGCAACATCA CCAACGCCTC CACCGGCGAG ATCAGCAGCA TCGGCCTGAC CCGCCTGGCA
GCGGGCGGCG CACTGGTCAA CTACGGCCTG CTGGACGGCA ACGTCACCCA CATCACCGCC
GCGAGCGTCG ACAACATCGG CAGCGGCCGC ATCTACGGTG ACCGTGTCGC CATCCAGGCG
GGCTCGCTCA ACAACCTCGC CGCCAACATC GGCGGCGTGG ACCGTGCAGG CACCATCGCC
GCGCGCCAGC GTCTGGACCT GGGCGTCGGC ACGCTGACCA ACAGCGGCAA GAGCCTGATC
TTCAGTGATG GCGACGCCGC GATCGGCGGC GCCCTGAACG GCCTGGGCGC GGTAGGCAGT
GCCCAGAAGG TGGACAACAT CGGTTCCACC ATCGAGGTCA GCGGCAATCT GGACCTGTCC
GCGCTGGCGG TGAACAACAT CCGCGAGAAC GTGGTCGTGG AGAAGGTCAC CACCGTGCAT
GCACCGGTCC GGCTGGAGCA GCCCGGCTGG TTCAAGAACG CCACCAACAA CAACCGTGAT
TTCCGCGCGA CGTCGAACTA CCAGCCGTAC GAGATCTACT ACCTCGATCC GTCGGACATC
CTTGAAGACA CCCCCTATGT CACCCCGGAT GGCCAGCAGA TCCGCAAGGC GGTTATCCGG
CTGACCAGCA ACACCAGTGC CTACGCCTTC GCCCGCGGTG GCGCGTACGG TGCGCGCGCG
TCGCGCGAGC GCCTGAACGT GCAGGACGGC ACCGTCACGG TCTACTACGT CGGCCGTGCA
GACAACCGCA GCAACCCCGA TCAGCTCGGC GCTGGTGCGG AAGATCCCTT CCGCGACCTG
TCCACCCGCG GTCCCGGTTC GCCGCCGATC GAGTACGTGA CCGATACCCT CAGGTACAAC
AACGCCTACG GTACCTGCAC CACCACCTGC GTGCAGATCA TCACCTATCC GGACTATGAC
AATCCGGAAT CGATGCTGAT CAACATGCAG CGTCATACCC AGGACACCAG CGGCAACGAG
AAGACCCGCG TTGCCACCCG CACCACCGTG GAAGATGTGG TGGTTTCGGC AGGCAACAAC
GCGGTGATCA ATGCAGGCGG CAACATGCGG ATCAGCACCG ATCGGCTGGA GAACCGGTTC
GCCAACATCG CCGCGGGTCG CGACCTGGCC ATTGTCGGCC TGAACCGTGA CCAGTCCGAG
GTGATCAACG CAGCCGAGCA GCTGACCCGC ACCAGCACGT TCGACAACGT GTCCATCACC
TACGGTGGCT CCTCCAGCCG CTGGAAGGCG GCCCCCATCA CCGAGAAGAC CGGTGCGCTG
GGTTCGTCGA TAACGGCGGG CGGCAAGCTC ACCATTGATG TCGGCAACCT GCGCAACGAC
AACACCGGTG GCAGTAACCC CAACGCCGGT GGTGGCAAGG GCACGGCACA GCTGGATACC
GGCGGCCGCG GTGCGGGTGC GGTCGGTCCG GGCGCGGGCA GCGTGCAGGG GCCTGGCCAG
AGCACCGGCC AGGGTGCGGG CAGCGTGGAC GCGCAGGGGC CGGGCACGGC CGGCGCGGCG
CAGGGCGCCA ATGGCGGCCA CCAGCAGGCG GTAGCCCAGG CCGACGGTGT CCGCGCGGAT
GCAGCCGGTG CACAGGGCCC CGGCCAGGCC GGCAATGCCG GCAGCGGTGC CGGCACGGCG
CAGGACATCA GTGCGGCACA GGCCGGGCAG GCCAATGCCA ATGGCCCGGC GGCGGTCCGC
GCCGGGGAGC ATGCCGGCAG TGGTGGCAAC ACCGCCGTGG AGGGCAAGCA GGCTACAACG
ACAACCGGGA CCGACCCGCG CGTGGTGGTG ACCACCAGCC CGAATGCCAG TGCACCCACC
GCCAGCCTGT TCAACGTGGA CGCCAACCGT GGCAATCACA TCGTGGAAAC CGATCCGCGC
TTTGCCAGCT ACCGTGATTG GCTCAGCTCG GATTACCTGC TGCAGCGCGC AGGCTTTGAT
CCCGCGCAGA CGCAGAAGCG CCTGGGTGAC GGCTTTTATG AGCAGAAGCT GGTACGCGAG
CAGATCGGCG AGCTCACCGG CCGCCGCTTC CTTACCGGCC ACGGCAGCGA CGAAGAGCAG
TACCGCGCGT TGCTGGAGGC AGGTGCCACT GTCGCGAGCG AATGGGGCCT GCGACCCGGC
GTGGCGCTCA CGGCCGAGCA GATGGCACGC CTGACCAGCG ACATCGTCTG GCTGGTGGAG
AAGGACGTCA CCCTGGCTGA TGGCTCGGTG GTCCGCGCGC TGGTCCCGCA GGTCTACCTG
CGGGTGATGC CGGGCGACCT TGGCAACGAC GGCGCACTGC TGGCCGGTGC CGAAGTGGAC
ATCAAGCTGC GCGGCGACCT GGTCAACAGC GGCACGATTG CCGGCCGCCA GCTGGTCAGC
ATCGATGCCG GCAACATCCG CAACCTGTCC GGTGGCCAGA TCAGCGGTGC CCAGGTGGGC
CTGTCGGCGC GGCAGGACAT CGACATCATC GGCAGTACCG TCAAGGCCAC CGATGCCCTC
GCATTGAAGG CCGGCGGCAA CATCACCGTG GCCTCGACCA CTACGGAATG GAAGGACCAG
GGCGATCGCC TGTCGCAGCA GAAGACCACG CTCGATCGGG TGGCTGGCCT GTATGTCACC
AATCCCGGTG GCGATGGCGT GCTGTCGGTG GTGGCTGGCG GCGACATCGG CCTCAAGGCA
GCGGAGATCC GCAATGCCGG CACCCATGGC ATCACCCAGC TGGCTGCCGG TGGCAATCTG
GACCTGGGCG CGCAGACGCT GGGCCAGAGC AGCGCGCTGA ACCACGACAG CCGCAACTAC
ACCAGCAACA GCCAGACCAC GCACGCGGTG TCTTCGATTG AAGGCGCTGG CGATGTGCTG
CTGGTCGCAG GCAAGGACAT CAACCTGGCC GCAACCCGTA TCAAGACCGG CGGTGGCATG
GCTCTGCAGG CTGGCGGCGA CATCAACAGC CAGGTACTGG TGGACAGCAG CAGCAGCGAT
TTCAATGCCG GTGGCAAGCG CAGTTCACTG CAGATCAGCC AGAGTGATGA GATCGTACGT
GGCAGCCAGT TGTCTGCTGG TGACAATGTT GTGCTGAAAG CGACTCGCGA TATCAATCTG
ACCGCGACCC AGGTCGCCAG TTCGGATGGC ACGCTGAGCG TGGTGGCCGG TCGCGACGTC
AACCTGCTCT CGGCCAGCGA GACGCATGAC TTCAGCCTGG ACAGCTACGA CAAGAAAAAG
AAGACCCTGT CGAGTACCAC CACCACCCGC CACGCCGAGA GCAGCGACAG CTACGCCATT
GGCACCGCTT TGCAGGGCAA GGCGGTCAAT GTCAACGCGG GCCATGACCT GACCGCAGTC
GGCACGGTGA TCGACGCCAC CGGCAACGTC ACCCTGGGTG CTGGCAACAA CGTGCTGATC
GCCTCCGCGG AAGACCATCA CAGCAGCGAG TCCAGCCAGA GCAAGAAGAA GTCCGGCTTC
ACCGGTGGCT TCGCCAATGG CACTGCATCG ATCGGCTATG GCAGCTCGAA GAACAGCAGC
AGCACCGCCG AACAGTCGAC CACCCAGGTT GGTTCGGCCA TCGCCTCGCG CGAAGGCAAC
GTGCTGATCA ATGCGGGCAA CCAGCTGACG ATCGCGGCGT CCGACGTGGC CGCCGGCAGG
GACCTGACTC TGGTTGGCAG GGACATCAAC CTGATTGCCC GCCAGGACAC GGTGGATACC
CAGGCCAGCC AGTCGAGCAA GTCCAGCGGG TTCTCGGTGG GCGTCACCTA CGACCCGGCC
AAGGCGTACC GCACCGCACG CGACAACGCT ACCGAAGGCA TGGCCGACAG CGGCACGATG
ATGGGCCGGA TTACGCGTAC CGCCGAAGGC GTGGCGGCCG GCGTGTCGGC GGCCGTGCTA
CCGGCGATCA CGGCCGGCAG CCAGAGGTCC AACAGCAACC AGAGCCACTC CAGCAGTGAT
GCACGGGTCA GTAACCTCAA CGCGGGAGGC AATCTCACAT TGATCGCCAA TGGCGGCTCG
ATCACCAGCC AGGGCGCGCA GATGTCGGCC GAGGGTGATG CCGCGCTGCT GGCCACCAAG
GACATCGTGT TCGACGTGGC GCACAACACG GAGCGCAGCA GCAGTGACAG CCGTGGCAAG
GGCTGGGGTA TTTCCACCAA CTCGGCCGGC CTGCCATTTG GCACCAACAA CTCGCGCAGC
GATGGTGCCG GACAGAGCGA CACCATCACC GGTACCCAGC TGTCGGTGGG CGGTGGCGTG
CGCATGGCCA CCACCGAGGG TGATATCCGC CTGACCGCCG CCAACATCGC GGCCGAGAAG
GACGTCAACA TCCGTGCCGC AGGCGATCTC ACGATCCGTA GTGGCCAGGA TACGGTCGGC
AATGCCAACC GTTCGGATAG CAAGGCGATC GGCACTGTCC AGATTTCCGA CACCGAGAAA
TTCTCCGGTT GGCATCGCGA GCAGCACCGT GACGACAGCG CGCAGGTCTC GCAGGTGGCC
AGCACCGTCG GCAGCCTCGG CGGCAGCGTC AACCTTACCG CGGGCGGAAA ATACACGCAG
ACAGCGAGCA ATGTGGTCGC GGCCCAGGAC GTGAACATCA CCGCCGCCCA GATCGAACTG
CTGACCGCCG ATGAAAGCGG GCACTACTCG CAGAGCGACA AGGACCTGAA GATCGGTGCG
TTCGCACGAG TGAAATCGCC GTTGATCGAT CTGCTCAACA ACGTCGATGC CGCACGCAAA
TCCGACGGTC GCCTGCAGGC GATGCAGGGC CTGGCGGCTA GCGCAAACGC CTACCAGTCG
GCGAGTGCGA TCGCAGACAT GGCCAAGGGC GCTGGCGGCG GCTCCCTGCT CAGTGCGGAA
GCGGGAGTGG GCTTCAAGAC CTCAAGCCGC AGCGCTGATG GCAGCAGCCA GGTCTCGCGC
GGTTCGACCA TCCAGGGCGG CGGCAACGTC AACCTGACCA GCACCCAAGG CGACATCCAC
GTGGTGCAGG GCAGCCTGAG CGCGGGCAGT ACGCTGGCGC TGGACTCGGC GCGCGACATC
CTTCTTGAAG CGGGGCAGGC CAACCTGCAG AGCAAGAGCA AAGGCAGCAA CGCCGGGGCC
GAAGTGGGTG TCGGCGTGTC GGTCGGCGCG CAGACCGGCG TCTATGTCTA CGCCGAGGCC
AGCGTGGGCA GCAGCAAGTC GAATGCGGAA AGCAGCACCT GGCAGAACAC CACGCTGGCC
GGTAGCACCA TCTCGCTGAA GGCGGAAGGA GATACCACCC TGCGTGGTGC CACCGCCACC
GCGGACCGCA TCGACGTCAA GACCGGTGGC ACGCTGACCA TCGAATCGCT GCAGGACATC
GCCGAAAGCA TGTCCAAGGA CAACCAGGTA GGGGGTCGCG TGCAGGTATC TTTCGGCACT
GCCTGGAACG TGAACGGCTA TGCCAGTGCC GGCAAAGCCA ACGGCAGCTA CCAGGGCGTC
GGTCAGCAGA GTGGCCTGTT TGCCGGTGAT GGTGGCTACC ACGTGGATGC CGGCCATGTG
AATCTGATTG GCGGCGCCAT TGCCAGCACC CAGACGGGCA ACAGTGAGTT GACTGCACAG
ACGTTGACCT TCACCGATCT GAAGAACGAG ATGGATTACC GCGCAAGTTC GGTTGGCATC
AGCGGAGGTT TCGGGTCGAC CGGCAAGGTG GCCACCGATG CGGACGGCAA TCCGATAGCG
GCACCGAATG CCGCAGGCCA GTTGAAGGAC ATCGGCAACA CCATCACCAC CGGCGGGTAC
GGCAAAGCCA ACACGACGAC CTTCAGCCCG GGCATCCCGA TGACCGAGAG CGGCCATGAC
AGTTCCACCA CTTACGCCAC CCTGACCGGC GGCAACATCA CCATCGGTGG CAAGAAGGTC
GATGCGGCCG ATCTTGGGAT CAACACCGAC GCCAGTACCG CACACAAAGC GCTCGATACG
CTGCCGGACC TGCAGCAGAT GCTGCAGCAG CAGCGCGCGA TGGCTCAGGC GACCGGCACC
GTGGTGAGTG TAGGGATGCA GATCCGCTCC GATATCAATG CGTCGATCGA CGCAGCAACG
GACCGCCGGG AAGCCGTCAA GGCCATTCTG AACGACCCGG AACAGCGCGC CGCATTGACG
CCGGAACGCG AGGCCCAGCT AATTGCAGTT GGCGTGGCGG CCAGCGGTGA GATTGATCGC
CTGCAGAGGG CCGGTGTGCT GGTTGGGGCG ATCACCGGGG GTCTCGCGGC TTCGTCGGGC
AGTGCGGGAA GCATTGTGGC GGGCACGTTG GCACCTGCGA TTTCCTATCA GATCGGTCAG
TACTTCAAAG AGAACGCCGA CAGGAACATG GCTGATGGAG GAAGCCGCAG TGAAGAGGGA
AGCGCCACGC ATCTGCTGGC CCACGCCTTG CTCGGTGCTG CGGTGGCGTC AGCCGGCGGA
GACAATGCGC TGATAGGTGC GTTGGCGGCC GGGAGTGCCG AAGCGGCAGC CCCTTCCATC
GCCAGGTTCA TGTTCGGCAA GGACAGCAAG GACCTGACCG CCAGCGAGAA AGAAGCGGTC
AGCACCGTTG CCGGAATCGG CGGGGCAATG CTTGGCACGT TCCAGGACGG CATGCTGGGC
GCTGCTGCCG GCAACAACGC CGCAAAGAAC GCGGTCGAAA ACAACTGGGG AGAGGTGGGG
CACTACTCCA CAACGGCGAC GATTCTTTAC CTGAGTGGTT TCACGGAAAG CGACGCAAAG
GGCATCGCGG CTGCAACGTG GGCGCCTGAC ACGGATCGAC GAAACGCCAT CACGCCTTTC
AACGTTGCCT GGGCAAAATT CAAGGGCACG CCACAGCAAC ACAACCATGC CTTGGGTGGC
GAAATGGACC CGGAAGCAGT CAGGGCCATC CAGGCGGAAC TGGGGGAAAA GGTTGCTGTG
ATCCTGGCCG ATATCAAGAG GAACGAGAAC AATCCCGAGG CCAAGAGGGC CATTCTCGAC
AATGTTGAGA CGCAACGGGT GTTGCACCTC TTTGGCGATT CGTTTGCCCA CGTTCAACGC
GATGGCACGC AGTTCAAGCC GGTCCTGGGG CATGCCAAAG CATCGACACA GGATGAAGGC
ATCAACGATC CGGACAACCC GTACACGCAT CGGGATGCCT ATCTGGCCTA TAGTCTCGCC
CTTTACAAGG CAGCGACGGG AGCGTCCAAG GGCAAGGCGC TGGGCGATGC CAACTACATC
GCCGACCTGG CTTCCCGGGT TTCTGCAGTC AATGGGGAGG CGGCGCAGAA GGGTGTGCTG
GATGCTGCGG CGTCGTTCTT CATGCCAACC AGGACATCAG GGTTGGTCGA TGCTCCGATG
TCAGATTGCG GGTGGTACTG CACATACGTG CCTGCAGGGT GGATGGCGAG GCCAGAGCTT
GAAAAAATCT ATCGAAAGCC AGTCGAGGTT CCGCAGACTC CGGTGATCGA CTGGACGAAG
TTCTCCCAAG GGCGATGGTG A
 
Protein sequence
MNKVYRLVFN RTLGVMQVAS ELVNAARGGA DSREGHAVGT LRPISFALWL VLGWVGLVQP 
LSAQQAPADP GRIAADPGAP ANQRPTVITS ANGTPQVNIT TPSAGGVSRN SYQQFDVGQQ
GVILNNSRGD VQTQLGGWVQ GNPWLATGTA RVILNEVNSS NPSHLNGYVE VAGTRAQVVI
ANPAGIQVNG AGFLNASRVT LTTGTPIVSN GVLEGYRVEG GSIGVGGTGL DTSRADYTDI
ITRSLQVNAG IWANQLQASL GSNVVSADHS SVQQQKPTSA APTFALDVGA LGGMYANRIW
LVGNEHGVGV SNAGKIGAQA GELVVTADGR LQNTGAMHAQ QDLRIDASAG IANAGTLSAE
RELRVQTPAD VDNSGGTLNA RRIEVNADAL RNRGGSIEQL GTQALQLQAA DLSNLQGRIG
TVSSVSPGTG TGGTGTPPGT GTPGTGTPGT GTPGTGTPGT GTPGTGGGTL PVTPLPPLAA
GVLNIVGTLG NDGGRIEASG DLQLSARNSL DNSDGTLGVS TLLVQGQALR NMRGTLQVQG
AASAQVQQLD NSGGRMTFAR GFELQAQSAI NRGGSLAHGG SAGTIWTIGQ LDNSEGSISS
NATRLGLDIG TLVNTRGSIN HAGSEGLLLR AEVLDGVQGS IATAGAADLL LGRADHRGAQ
LIARQLGLSA QDFDNRGGRV LSTGLQASTL NVRNHLDNSD GGLLASNADL QIDAASFGNA
GGTVQQAGTG NLRVTTANLQ GQGGTVLSNG TLQLHGDTLN LRNGTTAAQR IDVRAGDLTT
AGGTLTSTGG DALQLQVSRT LDNSGGTVGA NGALDINSGI LINDGGKLIA AGTGTSRIHA
SQRLQNQRGL LSGNGDLDIK TDGLLNESGS IDHAGTGTLQ IDATTVQGAA GSIASNGLLQ
LSGQQLDLQK GTTRAREVQI TATGLDTSGG SLLSLGNNAM QLQVQGRLRN DGGSISANGA
QQIQAGALSN RGGALNSAGT AASQIRVDGT LDNSGGSIAS NAAHLQLKSG ALVNAGGTLS
HAGSDGLIID SGRLDGAKGQ IATAGALQLT AAEVDHQGAS LAAAQLDITA TGFDNRGGRI
IATGTGGNTL RVQGTLDNGN GGTLASNGNL DIHARTFGNA GGTVQQAGSG SLVITTQDLT
GAGGTLLSNG SLELQGDTLD LRNGTTTAQR ISIDGDTVTT AGGQLSALGT QALQLQARSL
LDNTGGTLGS NGAVDVHAGR FVNDHGKLIA AGDAASAIRA AQLENHSGSI SANGNLRIEA
QTLSGQSGSI GAARALSLQG GSLDLRGGSV AAEQLDIVAD SLDNSAGALR ATGSGTLALQ
VSGRLANDAG TIASNGAQRI EAGELSNRGG TLSSAGSATT ELHVAGLFDN SNGVLASNGA
ALKIDAGHMA NVKGTLSHAG TQGLLLRTGQ LDGQGGSIAS AGGITLQAGS VDHRGATLQA
ERIALDAQSF DNRAGKVIAT GAESSTINVV GTLDNGEGGL LASNGDLSIQ AAVFGNAGGT
VQHAGEGVLS IDAGTLNGTG GTLVSNGSLL LKGTTTDLRA GITSAKRIEI DTGSLITAGG
TLTATGNDML RLSARERIDN SGGTVSSNGA LDLRSATLVN AGGTLQSAGT AASQLIIGQD
INNRGGRILA NGGLGISSGS IDNQGGTLHS AARLVVKADG LLDNSNKGVI ASGAGMQVAA
TSLDNRSGSI EQAGDDLLQI DATTLQGHAG RIVSNGELQL KGETLDLSAG TTAAQQVSIE
AGQLDNTAGT LNATGSQAMS LQVRGALGND GGTIAANGAQ QINAGSLSNA GGTLSSAGTS
DSRITVTGRF NNGGGTLASN AGNLRLDAGQ LVNAAGSIVH AGKGMLTVKA TQLDGAGGSI
ATAGALQLDA VSVDHRGATL NADHFTVNAD RFDNQNGKLL ATGTQASTVQ ATTSLDNGGN
GLIASNGDLT LTSALFGNAG GTVQQAGTGT LAINAHTLNG QGGKLLSNGA LQLTGETTDL
RDGTLSAARI AVDTGTLLNA GGSIIASGTD ALKVSARDRL DNSGGTLAGN GALDLRSAQL
LNNLGTIQAA GSGSSTLAIT HALENRGGRI LTTGDAGISA GTLDNRGGTV HSDGNSALTV
RVDGLLDNSD KGTLSAGGAL LAEAQAINNS SGTVAAGQNL QLTSADLLRN EGGLVQAGKH
LQVSASGVNN NSGRIIANDL QLDTRGQALD NRSGIIASLS GNAALRSGAL DNTGGLLQSA
AALSIDTSGQ RLTNAASNGN GIVSSGTLQI RSGDLDNRGG SVFAKAGVDV QAVNIDNSGG
GSLVSAADLL LRAQQLANSG GSVTAGGNAD IGLQGALLNS GGLVAATGRL DLQAGYIDNR
NTLSQANGPA LGLQGKNLQV TTGNLDNQNG QVIADNLLLQ VNQRLDNSGG QVSAALTSDV
RADTLVNSGG TLVAGRQQTL RTREIIGDGR LMSQGDMTLE LGQSHTNRGE MVANGTLSLS
IHGNLDNSGK LAGANVNINA GNITNASTGE ISSIGLTRLA AGGALVNYGL LDGNVTHITA
ASVDNIGSGR IYGDRVAIQA GSLNNLAANI GGVDRAGTIA ARQRLDLGVG TLTNSGKSLI
FSDGDAAIGG ALNGLGAVGS AQKVDNIGST IEVSGNLDLS ALAVNNIREN VVVEKVTTVH
APVRLEQPGW FKNATNNNRD FRATSNYQPY EIYYLDPSDI LEDTPYVTPD GQQIRKAVIR
LTSNTSAYAF ARGGAYGARA SRERLNVQDG TVTVYYVGRA DNRSNPDQLG AGAEDPFRDL
STRGPGSPPI EYVTDTLRYN NAYGTCTTTC VQIITYPDYD NPESMLINMQ RHTQDTSGNE
KTRVATRTTV EDVVVSAGNN AVINAGGNMR ISTDRLENRF ANIAAGRDLA IVGLNRDQSE
VINAAEQLTR TSTFDNVSIT YGGSSSRWKA APITEKTGAL GSSITAGGKL TIDVGNLRND
NTGGSNPNAG GGKGTAQLDT GGRGAGAVGP GAGSVQGPGQ STGQGAGSVD AQGPGTAGAA
QGANGGHQQA VAQADGVRAD AAGAQGPGQA GNAGSGAGTA QDISAAQAGQ ANANGPAAVR
AGEHAGSGGN TAVEGKQATT TTGTDPRVVV TTSPNASAPT ASLFNVDANR GNHIVETDPR
FASYRDWLSS DYLLQRAGFD PAQTQKRLGD GFYEQKLVRE QIGELTGRRF LTGHGSDEEQ
YRALLEAGAT VASEWGLRPG VALTAEQMAR LTSDIVWLVE KDVTLADGSV VRALVPQVYL
RVMPGDLGND GALLAGAEVD IKLRGDLVNS GTIAGRQLVS IDAGNIRNLS GGQISGAQVG
LSARQDIDII GSTVKATDAL ALKAGGNITV ASTTTEWKDQ GDRLSQQKTT LDRVAGLYVT
NPGGDGVLSV VAGGDIGLKA AEIRNAGTHG ITQLAAGGNL DLGAQTLGQS SALNHDSRNY
TSNSQTTHAV SSIEGAGDVL LVAGKDINLA ATRIKTGGGM ALQAGGDINS QVLVDSSSSD
FNAGGKRSSL QISQSDEIVR GSQLSAGDNV VLKATRDINL TATQVASSDG TLSVVAGRDV
NLLSASETHD FSLDSYDKKK KTLSSTTTTR HAESSDSYAI GTALQGKAVN VNAGHDLTAV
GTVIDATGNV TLGAGNNVLI ASAEDHHSSE SSQSKKKSGF TGGFANGTAS IGYGSSKNSS
STAEQSTTQV GSAIASREGN VLINAGNQLT IAASDVAAGR DLTLVGRDIN LIARQDTVDT
QASQSSKSSG FSVGVTYDPA KAYRTARDNA TEGMADSGTM MGRITRTAEG VAAGVSAAVL
PAITAGSQRS NSNQSHSSSD ARVSNLNAGG NLTLIANGGS ITSQGAQMSA EGDAALLATK
DIVFDVAHNT ERSSSDSRGK GWGISTNSAG LPFGTNNSRS DGAGQSDTIT GTQLSVGGGV
RMATTEGDIR LTAANIAAEK DVNIRAAGDL TIRSGQDTVG NANRSDSKAI GTVQISDTEK
FSGWHREQHR DDSAQVSQVA STVGSLGGSV NLTAGGKYTQ TASNVVAAQD VNITAAQIEL
LTADESGHYS QSDKDLKIGA FARVKSPLID LLNNVDAARK SDGRLQAMQG LAASANAYQS
ASAIADMAKG AGGGSLLSAE AGVGFKTSSR SADGSSQVSR GSTIQGGGNV NLTSTQGDIH
VVQGSLSAGS TLALDSARDI LLEAGQANLQ SKSKGSNAGA EVGVGVSVGA QTGVYVYAEA
SVGSSKSNAE SSTWQNTTLA GSTISLKAEG DTTLRGATAT ADRIDVKTGG TLTIESLQDI
AESMSKDNQV GGRVQVSFGT AWNVNGYASA GKANGSYQGV GQQSGLFAGD GGYHVDAGHV
NLIGGAIAST QTGNSELTAQ TLTFTDLKNE MDYRASSVGI SGGFGSTGKV ATDADGNPIA
APNAAGQLKD IGNTITTGGY GKANTTTFSP GIPMTESGHD SSTTYATLTG GNITIGGKKV
DAADLGINTD ASTAHKALDT LPDLQQMLQQ QRAMAQATGT VVSVGMQIRS DINASIDAAT
DRREAVKAIL NDPEQRAALT PEREAQLIAV GVAASGEIDR LQRAGVLVGA ITGGLAASSG
SAGSIVAGTL APAISYQIGQ YFKENADRNM ADGGSRSEEG SATHLLAHAL LGAAVASAGG
DNALIGALAA GSAEAAAPSI ARFMFGKDSK DLTASEKEAV STVAGIGGAM LGTFQDGMLG
AAAGNNAAKN AVENNWGEVG HYSTTATILY LSGFTESDAK GIAAATWAPD TDRRNAITPF
NVAWAKFKGT PQQHNHALGG EMDPEAVRAI QAELGEKVAV ILADIKRNEN NPEAKRAILD
NVETQRVLHL FGDSFAHVQR DGTQFKPVLG HAKASTQDEG INDPDNPYTH RDAYLAYSLA
LYKAATGASK GKALGDANYI ADLASRVSAV NGEAAQKGVL DAAASFFMPT RTSGLVDAPM
SDCGWYCTYV PAGWMARPEL EKIYRKPVEV PQTPVIDWTK FSQGRW