Gene Smal_2206 details

Gene Information

Locus tag: Smal_2206
Symbol:
ID: 6476537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stenotrophomonas maltophilia R551-3
Kingdom: Bacteria
Replicon accession: NC_011071
Strand:
Start bp: 2469696
End bp: 2482094
Gene Length: 12399 bp
Protein Length: 4132 aa
Translation table: 11
GC content: 68%
IMG OID: 642731388
Product: filamentous haemagglutinin family outer membrane protein
Protein accession: YP_002028593
Protein GI: 194365983
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
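
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2469696
END_BP = 2482094
GENE_LENGTH_BP = 12399
PROTEIN_LENGTH_AA = 4132


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


# Both comparisons print True when the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either assumption does not hold for a given record (for example, 0-based coordinates or a partial CDS), the checks would need adjusting.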


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.498823
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCACC CGCATTCGCC CACCGCCCGT TTCCCTGCTG CCTTGTGGCG TAGCCATCCG 
ATGGCCTGGG CCGTCGCAGT GGCCCTGGGC AGCGTGGTGG CTCCGGCCAC TCAGGCCCAA
CAGGCGTTCA GCCCCGGCTG GTTCGCCGAG CGCGGTGCGG CGCAGGGCGC GGCGGCACAG
AGCGGGCGCA TGCCCAATGG CGTGCCGATC CAGTTCCAGC TACCCGCGCA GCAGCAGGAT
GCGGCACGGC AGAAGCTGCA GCAGTCGATC GACAACCTTG GTACGGCCGC GCAGGCCATC
GCCCTGCAGC AGCGCCTGCA GGAACAGGCA CGGCAGGCGC GCCGCGAGGC GGGCTTCGTG
GTCGCCGATG GCCTGGGCAA GGATGGCCTG AAGGTCGACG AGAATCCGCT GACCCGCGGC
TGGATCAACG CCCGCGAGGC GATACAGTCG CAGGCCGCCG ACGGGCGGGT GCTGGTCAAC
ATTGAGCAGA CAGCCGACCA AGCGATCCTG AACTGGGAAA CCTTCAACAT TGGTGGCAAC
ACCACACTGA ATTTCCTGCA GAACCCCGAC TGGGCCGTGT TGAACCGGGT CAACGATCCC
GCCGCAAGGC CCAGCCAGAT CCTTGGCCAG CTGAAGGCCA ATGGCACGGT GTTCGTGGCC
AACCGCAACG GTGTTGTGTT CGGCAACAAC AGCCAGGTCA ATGTCCGCAA TCTGGTGGCA
GCGGCCGCAC GCATCAGCGA TGCGCAGTTC CGTGAGCACG GCCTGTACAG TGCCGACGCG
AACACCTCGG CGCTGACCGA CGCCGTGGGC AAGGTCATGG TCGAGCGCGG TGCACGCATC
ACCACGCACG AACCGACCAC GGCCACCCGC GGCGGTGGCT ATGTACTGCT GGCCGGCCAC
AGTGTGGAGA ACGCCGGACA GATCGAGACT CGAAAAGGCC AGGCCCAGCT CGCCGCCGGT
GACAGTTTCG TGATCCGCCG CGGCATGGGC ACCGCCCAGA ACACCGCGTC CACCACCCGA
GGCAATGAGA TTGCCCCGCG CTTCATCGCT GACAGTACGG CGGGCGGCGT TCGCAACAGT
GGTCTGATCC AGGCGCGCGA AGGTGACATC ACCCTGGCTG GACGCCGGGT CGAACAGGCT
GGCGTTGCAG TCGCGTCCAC CACGCTGAAT CAACGTGGTA CGGTGCACCT GTTGAACTCG
GCGTCCGATG CGGCGGGCAG TGTGACCCTG GCCTCCGGCT CGACCACTGC AGTACTGCTG
GAAGACGATG GCAAGAGCAC GGCGCTGGAC AGCCAGCGCG ACGCGCTCAT CAAGGAATCC
ACCGAGCAGG ACAAGCTGCG CGGTTCCAGC AACAGCGGCA CCTTCGACAA TCTTTCGCGC
CTGCAGGACC GTCGCGAGCA ATCGCGCATC GAAGTCGTCT CCGGCGGCGA TGTGAACTTC
CAGGCCGGCG CACTGGCAGT GGCCACCGGC GGCCAGGTGA TCGCCGACGC CAGCCGGCGC
AGCTATCTGG ATGATGGCGC GCGCGTGGAC GTCTCCGGTG CGGTGGGCGT GCAGGTGGCG
ATGGAAAGCA ACAACGTCAA GGTGAAGGTG CAGGGCAATG AGTTGCGCGA TTCGCCAGAC
AACCGTGACA GCGGCAAGCT GATCAGCAGC GAGGTGTGGA TCGACCGCCG CCAGTTGACC
GAAGTCGCCG CCGGTACCGG CGGCTATGAA GGTACGCGCT GGTATGCCGG CGGCGGCCTG
CTGGAAGTGG GGGGCTACCT GGACAACCAG GGGCATTCGA TCAGCGAATG GGCAGCACAG
GGTGGAACCG TCCAGCTGGC AGGCAAGGAA GTCGTCAGCC ACGCCGGTTC GCGCATCAAT
CTGGCCGGTG GCAGCCTGGA TGTGCAGTCT GGCGTGGTGC AGCAGAGCTG GCTGCGTGGC
CGCGACGGGC AGTTGTACCG GCTTGATGAT GCACCGGCCG AGATGCTGTA CGACGGTCTG
TACCAGGGCT ATGAAGTGAA GCAGGAACGT TGGGGGGTGA CCGAATCGTT CCGCAATCCG
CTGGTGGCAC CCACCCAGCG TTTCGACAAT GGCTATACCG TGGGACGCGA TGCAGGCCGT
CTGCTGATCT CGGCGCCCAC GGCTGTCCTG CAGGGCCAGG TGGATACGGT GGCCTTCCAG
GGCCTGCAGC AGACCCGCCG CCCGGATCAG GCGCAGGACG GGTACGTGCA GGCACAGACG
GCCGCGGCCC GCAACGCCCA GCTCTGGCTC GGGCGTTTCG ACAACACCGG CCGAAGTGCA
GTGTTCGATT CGCAGATCCG CATCGGAGCG CTCCAGGCCG ACACCCGGTC ATGGACGTTG
CAGGCGCCGA TCGGGGAAAC ACAGCGCAAT ACCGTGTGGT TGGACAGTGA GGTCCTGAGC
GCCCAGCGTT GGGGGCAGGT GGACCTCGCT TCGGCGGGCC GTATTGACCT GGATGGCACG
CTCAGGTTGC AGGAGGGCGG CCGGCTTGGG CTCACTGCAA GCCGCGTCAG CTTCGGTGGA
ACGGTGCAGA TTGCCGGCGG GCAGGTAGAG GCCGGCAACC TGCTTCCGGC GCTCGGCGGG
CCCACGGCCC TGCTGAGGTC CGGACGCGGC GCGGTGGATG TTGCGGCGGG CGCACGGATC
GATCTTGGCG GCAGTTGGAG TAACGCGCTT GCTGCGCCGG ATGGCGACCC CGCGCAGGCG
TGGATCGACG GTGGACAGAT GCGCTTCAAC AGCAGTCACG ACATCCGTGT GGGTGAAGGG
GCAGCGATTC TTGTCGATGC CGGCGGCGCG ATCGATGCCA GTGGCAAAGG GCGCGTAGGC
AAGGGCGGCA GTGTCAGCCT GCTCGCGGCC AGCACCGAGG TTGCCACCGA CGGCAGCGGG
CGGATCCGTA TCGGCGATGG CACACGCTTC AGCGCGCTCG GCAATGGAAG CGGCAATTTC
ACGCTGGCAA CAGGCGGTGC GGTGTCTATC GGCGCGCCTG GCACGGATGC ATCGGCGGCG
TCGTTGCGAT TGCAGTCAGC ACTGTTCAGG AGTGGCTTCG CCGGCTATGA CATCGCCGGT
CACAACGGTT TGACGGTTGA AGAAGGTACC CAGCTGCTGG TGGAACGACC GGCGCTGCGC
CTGGCTGAGG GCGCACAGCA GGCACTGGCC CGCGAGCAGG GCGTTCAGGC CTGGACGCCG
TCGCTGTATG AGGCGGATCC GGCCAGCGGG CGTGTATCGC AGCGCGGCGG CGCGAGCCTG
GCACTGCGCG CAGGGCATCC GCTGTCCACG GCCGATCTGC GCGTTGGCAA GGGCGCAAGG
ATCGAGGTCG ATGCCGGCCA GACGATCGAA CTGTTCAGCG CCGGCAACCT GGACGTGGCT
GGCAGTCTGA AGGCGGCCGG CGGCCGTATC CGCCTGGACG AGGCATTCGA TCCGACCGGT
GTGCGTGGCG ATCAACGCCG CGAACGTCGC TGGACGATTG CCGAGGGCGC CCTGCTGGAC
GTGAGCGGCG ATAGCGTTTC CCTGCCCGAC GCGCGCGGGA CATTGCGCGG ACAGGTGCGC
AAGGGCGGAA CGATCGAGAT CGGGGGGGCG TTGGACTGGG AAGACCAGGA CAACGTACGC
CACTTGCCGC CCGATACCTT CGTGGTGGTG GAGCGCGGTG CGCGGCTGGA CGCGTCCGGT
GCTTCGGCGC TGCTGGACAT TGATGGCAGC GGGCGCACCC GGGTGGACAG CGACGGCGGC
AGCATCGTGC TGCGCGCCGG CAATGCGCTG TACCTGCAGG GCGACCTGGA CGCAGCAGCG
GGCGGCGACG GCGCGCGCGG TGGCACCCTC GGCGTTGCCT TCGGTGGCGG AACCTACGGC
AGAACGGCAA TGAACAAGGA AGTGCTGGCG CCGAGGGTGA TCTCGCTGAC CCAGCACGCC
GCGCAGACGA CCGACCTGCC TGCACGCCTT GAGTACGGCC ATGCGGCACT GTCGGTGGAG
CAGATCGAGG CCGGTGGTTT CGATCATCTG GCCTTGTTCG GCGATATCCG CGCCCAGGGA
GATGTCACTC TGCGCATGGA CCAGAGCCTG CGCCTCCAGG GCTTGAACGA GCGTTACCTC
GGCTTCGTCC CGGGCAACAG CGAGGGCAAC CGTCTGCAAC TGGACGCGCC GTATGTCCGC
CTTGCGCAGG GGCGCTGGTG GCAGCCGGCC GGGGAAGGCA CGTTGCGCCC GCAGGAAGCG
GGGTTCGACA GCGGCCGCCA GCATCGCCTG GACGTGAGCG CCGATCTGAT TGACCTGCGC
GACGTCACCT GGCTGGCGGG ATTTGATCAG GTCGCGCTGC GCAGCCGCGG TGACATTCGC
ATGCTGGCCG CGGTCGCCAG TAGTTCACGG GAATCCACGC TTGCAAGCCC GGGTTCGATC
GATATCAGCG CCGCACGCAT GTATCCGGCG GCCCGGGCCA AGGGACGCAT CGTGGCGGGC
GTCCCCGACA TCGGTCCTTC AGGCCTGCCG TTCTGGCAGA ACCCCGACGC CGTACTGCGC
ATCCATGGCA TTGCAGGCGG CGCGCAACCG GCACCGGACT CTGCGTTCGG AAGCCTGGAG
CTGGTGGCAG CGACGGTGGT GCAGGGCGGC AATGTGCAGG CGCCTTGGGG CCATGTGCAG
CTGGGTGGCA GGGAATTCAA CGCTGATGCG GCCAGCCGCG TCGACCTGCT TGCAGGCAGC
GTGACGTCGG TGAGCGGTGC CGGCCTGCAG CTGCCGTTTG GCGGTACCGT GGATGGTGTG
GGCTGGCGCC GCAACGGTGC CGACTTCGAT GTGCTGGGCC CCGGCTCGAC CAACATCCCG
ATCGGTATCG ACATCGTCGC CAACGCCGTC GATGGCGTGG TGGGCAGCGT GCTGGACGTG
TCCGGCGGAG GCGAGCTGTC CGGCGCTGCA TTCGTGGCCG GGCGCGGTGG CTCGGTGGAC
ATCCTGCGGC ATGCGCTGGC GGATGCGAAC CCGCGCTACC GCTTCAGCGG CAGTGACAAT
GCGGTCTACG CGATCATGCC GGGACGCTCT GGCACCCAGG CGCCACAGGC GGTTGCTGAT
GGCAGCGCCG ATCCGCGCAT CGGCCAGCAG ATCGTCATAC CTGCGGGGGT CCCCGGCCTC
CCGGCCGGAA CCTACACGCT GCTGCCGGCC AGCTACGCAC TGCAGAAGGG CGCATTCCGC
GTCGAAGTCG GCGCCGAACG CGCCGTCGGC AGCAGGCAGG CAGTGGCGAC CGGCACAGGC
TCATGGCGTG TCAGCGGGCA CCGGGCGCAA AGCCTGGGCG GGGCAGTGTC GCCGCTGCTC
ACCGATCTGG TGCTGACGCC CGCGGAAGCG GTGCGCCGCC ATGCCAACTA CAACGAAACG
TCCTACAGCA CGTTCGTACA GGGCGTGGCC GAACGGCGTG GCGAAGCGCT GCGCTGGCGG
CCTACGGACG CCGGCAACCT CAACCTGGCA CTGGGTGAGG GCGCCGGACG TTCGAGTGTG
CCAGCGGCGA TCTTCCAAGG CGTGTCGCGC TTCAACGCCG GCAGCAATGA CGGACGCGGG
GGCACGCTTT CGGTCAATCT GCTCACCTCC AATGACGCGA TGTTGGAGAT TGTCACCGAA
GGCGGCAGCG CCGGCAGCGG TAGCGGCGCC ACCGTCTTCG ACTCGGCGCT CAATGCGTTC
CGCCCGGAGA CGATGCTGAT CGGCGGCGTG CTGCGGCGTG ACGCGACGAC CCATTCGCTG
GAGGGCAGGG CGCAGCACAT CGTGGTGCGC AACGGCGTCA ACCTGACGGC GCAGGAAGTG
CTGCTGTCGG CCGCGTTCGG AGGCAAGGGA ATCCTGGTGG AGCAGGGCGC TTCGATCGAT
ACCTTGACCG GGGCAAATAC CTCGCGCGTG GCGCAGCCCA CGACGCCCTA TCTGGTGTCG
GGCGGCCTGC TTGCCGTGTC CAACCAGCGC TTGACGGCGC TCAGTGCGCA GGGCGGCAGT
GCCGCCGGTC CGGTGGCGAT CGACATCGGC GGATGCGTGG TGGACTGCAA TGGCCAGACC
CGGTTGCTGT CGGCAGGCAG CATCAATGTG GTCACCGACG GTGCCTTGAA CATTGGTGAT
TCGGTGAGCT ACGGCACGCG CCAGCTCGGC CTGGGCATGT CGGCGCTCAA TCTCGGCAGC
GCCGAGGCCA TTGCGGCGGC AGCGGCGGCC GGCGCCCTGC CGGCGGGCAT GACCATGAAC
CAGGAGGTGT TGCAGCGATT GCTGCGGGGA AACACCGCCA CCGGCGCGCC GGCGCTGGAG
GCGCTGAGCC TGACCGCGCG CGACGCCATC AATGTGTTCG GCAGCGTGGA CCTGGATACG
CGTGACGGTG CGACCGGTCG CAGCAGCCTG CGCAGCCTGG TGCTGGGCGC CCCTGCCATT
CATGGCTACG GCTCTGCGGC CGACCACGCG CGCATCTTCG CCGATACGCT GGTCTGGGAC
GGTACGCTGG CCGGCACCAC GTTGCCCGGC GGTGAGCAGA CCCAGCCGGC CGGCGAGGCG
ATGGTGGGCC GGCTTGGCCA AGGGCAGCTG GAAATCAATG CACGGGTGCT GGAGCTGGGG
CGTGCGCCCT TTACCCGGCC CAGTTCGTCG GTGGCCGCGG ATCGCCAGGT GCTCGGCTTC
GCCGGCGTTA CCCTGGCAGC ATCGGATCGC ATGTTGTTCT CCGGCAAGGG CAGTCTGGAT
GTCTTCCAGC GGCAGGGCGA TTACGTGGCA GGCAGCGGCT GGCAGTTCAG CGGCGGCGCG
CTGGATATCG TTACCCCGTT GCTGACCGGC AGTGCGGGCG CGCAGCTGGC AATCCGCAAT
GGCGGTACGG TGCAGTTGCG CGGTGCTGCG GCAACCGCGG GCAGCGATGC GTTGGGTGCG
GAGTTGTCCA TCACGGCCGA ACGCGTTGTG ATCGACAGCC GGGTGGCGCT GGCGTCCGGC
CGGTTCGAAG CCAATGCCCG CCAGGGCGTG GCGCTTGGCA GCAATGCAGT GCTGGACATG
GCGGGGCGGA AGGTCAGCCT GTTCGACGTG GACAAGTACA GTTGGGGCGG CGACGTAGCG
CTGTCGAGCC GTGACGGTGA CATCGTTGCC GACGCGGCGT CGCGCATCGA TCTTTCCGCC
CGCAACAACC GCGGCGGCCG CTTGACGGTC GCTGCGCTGG GCGCGCAGGG CGGCCGTGTG
GATCTGGCAG GCACGTTGCT TGGCGGAGCC AGCGGCCGCT ACGACGCCGG TGGCACTGAA
GTGCCCTACG ACGGCGGCGA GCTGGTAGTG CGTGCACGCC AGTTGCAGGA CTTCTCCGGC
CTGAACACCC GGCTTACCGC CGGTGGCATT ACCGGTGGCC GGACGTTCCA GTTGAGCGAA
GGCGACCTGG TGATCGGCGA TGAAGTGAAG GCACGCAACG TGGATATCAG CGTCGATGGC
GGCAGCCTGC TCGTCAATGG CCGCATTGAT GCCAGTGGCG AACAGGTCGG CAGCATCCGC
CTGTCCGCCC GCGATGTGTT GCGTATCGAC GGCACATTGG ACGCGCATGG CAGCGCCTTG
CGCGTCGATA GTTACGGCAA GATCATCGAC AGCCCGAACC GGGCGATGAT CGAACTGACC
TCGGCCAGTG GCAGCCTGCA GCTGGGCGCG ACCTCATCGA TGGACCTGCG CGCCGGCACC
TCGGTGGCCA CGGGTAGCAG CCCTGGCCAG AACGATGGCC GCGCACGCGG CAGCGTCAAG
CTCAACGTGC CACGCGTGGG CAGCAACGAC GCTGCGATTG ACGTGGCCAG CGGACTCCGC
ATCGCCGGTG CCGCCGATCT CCAGGTCAAC GCCTTCCGCA GCTATGATTC GGCGCCGTTG
GCAAGCGCTC CGGACGTGCA TGGCCATCGT CCGCAGGTGA TCAACCAGCA TTGGCTGGAC
ACGGTTGTGG ACCCGGACAA CAGCCAATGG ATGAACGCCG CGCTGACCAA TACGGGACTG
CAGCAGCGCA CTGCCGCACT GGGCAGCTAT CGCCTGCGTC CCGGCGTGCA GATCGTGGCG
CGGACCAGCA GTGACAACCC GCGTGGCGAT CTGGTCGTGG CCGGCGATAT CGATCTGTCC
GGCTACCGCT ACGGACCGCA GTCGAACCGG ACCGATCCAG CGCGCCGAGG CTTCGGTGAA
TCGGCAGCGC TGGTGCTGCG CGCCGAAGGC GATATCAACG TCTACGGCAG CATCAACGAT
GGATTCGCGC CGCCGCCTGC CAATCCCGAT GAGGACGGCT GGGTGCTGCT GGAGGGGCGC
AGTGCCGGTA CGCCCAACAC GGCGTTCGGT GGCGACCTGA TCGTGCCCGG CGAAGGTGTA
CAGCTGCAGC GCGGCACCGA ATTCCGTGCT GGCGCGACGC TGAACTATGC ACTGCCCTTC
GAGGCCGTGA CCCTGCCGGC CGGAACCGTG CTGCCCGCGG AAATGCGGTT GTCGGGACCG
CTGCTGCTGC CGGCCGGCAC TGTGCTGGGC GCGGCAGTGA CGACGGCGGG TGGCGAGGTG
GTAGCGGCTG GAACCGTGCT GGCCCAGGCA TTGACGTTGC AGGCGGGTGC GCGCCTGGGG
GCCGGCTTCC GGCTGCGCGC GCCGGCGCCG GTGGCCGCAC AGGTGTGGCC GGCCGGCGTC
GCGCTGCCGA TGGCGATGAA GCTGTCAGAC GCGGTGGACC TGCATGCCGG CGCGATCATC
CCGTCGATGA CCAAGGTGGA GCTGGCCGGC GATGCGCCGG TGAAACTGCG GCCTGCCGAC
GCCAGCGGGC GCCAGGGACG CAACTGGGCG CTGGCGGCGA TGCTGCCCGA AGGCACCACC
TCGTGGGATC TGACTGCAGT GGCCGGTGCC GATACGACCG CTGCCGATCC GCGTACCCGC
AGCTGGGGAA GCGACGGCAG CATCGTGCTG GCCGACAGCC ACTACAGCAC CATCGGGACG
GTGACCAAGA CGACGGAATG GAAGGGTGAT CGAATCGTCA ATTTGGAAGG TTCTCTGTAC
TGGTGGGGCG ATGAGAGTCT GGCCGGAAAG TCACCTGCCG AGGTGGCCGA GATCATCGGG
ACCACCGAGG CGGAGATCTG TGGCGCAGGC GCCTTCTGCG GCCCGGCCCC GCGCCTGGTG
GACAAGGAAG GCTCGCTGGC ATGGTGGGGC GATGAATCCT GGGTCGGCCG CCCGGCTACG
GAACTGGGCG CCGAGATGGG CATGTCGGAG GAAGAGATCT GCGCGGCAAT GGGCTACTGC
TACGGCGGCG GTACGCTTAC CGAGGTGACC ACCTACGGCA AGCGCCTGGG CTCGCCGGCT
TGGAGCGTGC TACGTACCGG CAAGGGTGAT CTGGCACTGC TGGCCGCGCA GGACGTGCGC
ATGAAGTCCG GCTTCGGCGT GTACACCGCA GGTGCGCCGA CGCTACTGGG CGATGGCAGC
GACCCCCAGT TCAATCCGGT GCGTACGGCT GCGCCCTGGC ACACCTCGCT GCTGGGCAGG
AACCAGGTCT CGGGCAACTA TGATGCGGCG CTGGCCAGTT ACCGTGCGTG GTATCCCGAC
CATGGCGGCG ATGTCAGGGT TGAAGCTGGA CGCGACATCA TCGGCGACGC CTGGACCGCC
CGGTCCGAAT CCGGTACGGC GCAGCGCGAC CAGGCTGGGC ATTCAAGCGC AGCCGTGGGC
GGCTGGTTGT GGCGGCAGGG TACGGGCACC ACCGAAGGCG CCCAGGCAAC ACCGACGAGC
TGGTGGATCA ACTTCGGCAC GTACAGCACC GTGGGCGCCG AAGCGGACGC AGCTCCCCGC
ATGGTCGGGT TCACCGGCTT CGGCACGCTG GGAGGCGGCA ATCTGTCACT GGAAGCGGGC
CGCCACGCGG GCGTGCTCGA CCCGATGGGC AATGCCCTGG GTCTGTTGAC CGCGCCTCAT
TCCAGCGCGA TTGTTGCTGC GGTCGGGTCG ACCGGGCGCG TCAGCAATGG TGAGCTGCAT
CTGACCGGTG GCGGCGACCT GACCCTGCGC AGTGGCGGTG CGTTGAATCC TGGCCTGCGC
GCCAGCGCGC AGCAGCCGGA GAGTTTCCTG CAGGATCTGG ATCTCAATGG CGCGGTGACC
AACCTGCGCG GCAGCACCCT GCTGCAGGCG GCACGCATCG GTGGCCTCAG CGCGATACAG
AGCAGCTACG GAACGCTGGG CCTGATGGAT CCGTTCGTGG CGGACGCCCC GGTGGCGATG
GCGGGGCCCC TGCTCATTCT GGGTGATTCC ACTGCCCGCC TGCAGGCCCG TGGTGACATT
GTTCTCGGTG GCGCCGCAGA TGCAGGGCGT GTACCGACGG CCAACTACAA CGACGTGGTT
CTGGGCGATG GCCAGCATGC AGCTGGCACG ACGTGGTTCT CGCTCTGGAC CCGAGGCACG
GCACTGGACC TGTTCTCGGC AGGCGGAAAT CTCGCACCCA GCCTGGCCGG AAGTCTGCAT
GCAACCGGTA GCAACATGTT CGCTGAAGGC ATCGCGCTGC GCGAGCAATC CGTTCCCGCC
AGCATCAATT ACTGGCTGTA TCCGTCCAGG TTCAGCGCGG TGGCAGCGAG TGGTGACATC
ATCAGCGCCC CGCTTCGAGG TCTCGGACCA GGGGCGACTA CGGGCGATGT GATCCTGCTC
GCGCCTTCGG CCAGCGGCCA GCTCGATGTT CTGGCTGGAG GCTCGATCCA TGCCGCGATC
AACGCCACCG GCATTGCACG TTCCGGTAGC GACGCCCGCC TGCCTGGCCC GTTTGATCCT
GCATTCATGG CCCAGCGGGG CGCCAGCCTG ACCCGGGTCG GCCACAATCT CTCCGACGAT
GGGGTGCAGG GAGCGGATCC GTCGGCGATC CTGCCCTTGT TCGCGTTCGG CCCCAGTACT
CCGACCACGG GTGCGCTGAC GTCCCGGCCC ACGTCGCCCA GCCGCTTCTA TGCCGTTGCC
GGTGACATCA TCGGGTTGGG CAGCGGCGGG CGCAGGCAAT TGAGGCGGAA CATCGGCGAG
GAGAGCCGGA CACTGTTCGA CTGGTACGAA GCGGGCTCGG CCGTACAGCT GCGCGCCGGT
CGCGACGTGA TCAATGCGAA CGTCACAGCG CTCAACACCA GCGGTACCGA TATCAGCGGG
ATCGAGGCGG GCCGGGACAT CATCCGCAGC AACCTGACGG TGGCCGGCCC CGGCAACGTT
GAAGTGAGTG CGGGCCGGCA GCTGCGCCAG GAAGAGGCCG GAAGCATCGT GAGCCTGGGT
GGCGTCGTCC AGGGCGATGC CCGTCCCGGT GCCAGCATCG CAGTGACTGC CGGCAATCAG
GGCATTGACT TCGATGCACT GCGCACGCGC TACCTGGACC CGGCCAACCT GGCCGATCCG
GCGCAGTCGC TGGCATCGCA GCCCGGCAAG GCGGTGAAGA TCTATGACAA GGAGTTGAAG
CAGTGGCTGC AGCAGCGCTT CGGCCTGGCC GTCGACGGCG CCGAGGTGCT GGCTGCGTTC
GACCGGTTGC CGAGCGAACA GCAACGGATC TTCCTTCGCC AGGTCTACTA CGCCGAACTG
CGCGAAGGTG GCCGCGAGTA CAACGATCGG AATGGACCAC GTGTTGGTTC TTACCTGCGG
GGCCGTGAGG CCATCGCCAC GCTGATGCCG GACAAGGATG CAGCAGGTGC GACGATCCAA
CGCACCGGCG ATATCCTGAT GTATGGCGGC GCCGGCGTAC GTACCGAAGC CGGTGGCAAC
ATCGAGCTGA TGGCACCGGG CGGACAGATC GTGGTGGGTG TACAGGGCGT GGTGCCACCG
GCCAGTGCCG GGCTGGTGAC CCAAGGGCAG GGTGACATCC GGCTGTTCAG CCAGGACAGC
GTGCTGCTGG GCCTGTCGCG GGTGATGACG ACGTTCGGCG GTGACATCCT GGCATGGTCG
GAGCAGGGTG ACATCAATGC CGGTCGCGGT TCGCAGACCA CCTTGTTGTA CACGCCGCCG
CGTCGCGTCT ATGACGGCTG GGGCAATGTG ATCCTGTCGC CGCAGGCTCC CGCCAGTGGC
GCAGGCATCG CCACCTTGAA CCCCATCGCC GAAGTGCCGC CGGGCGACGT CGACCTGATC
GCGCCGCTGG GCACCATCGA CGCGGGCGAA GCAGGCATAC GAGTCTCGGG CAACATCAAC
CTGGCAGCGC TGCAGGTGCT CAACGCCGCC AACATCCAGG TGCAGGGCGA GAGCAAGGGC
CTGCCGGTGT TGGCCACGGT GAACGTCAAT GCGCTGGCGT CGGCCAGTGC GGCAGCGAAC
AGTGCCAGCC AGGCCGCACA GGACGTGATG CGCAAGAGCC AGGACGACGC ACGGCGCAAC
CAGCCGTCAG TGATCAGCGT GCAGATTCTC GGCTTCGGCA GCGGCACCAG CAGCATTGCG
CCGCCGGCGC GCGGCACGAC CGCCAACAGC GGCTACGACG CCAACAGCGC CTTCCAGTTC
CCGCAGGCAA GCCGCGACGA AGGCACGCAG CGCCGATAG
 
Protein sequence
MSHPHSPTAR FPAALWRSHP MAWAVAVALG SVVAPATQAQ QAFSPGWFAE RGAAQGAAAQ 
SGRMPNGVPI QFQLPAQQQD AARQKLQQSI DNLGTAAQAI ALQQRLQEQA RQARREAGFV
VADGLGKDGL KVDENPLTRG WINAREAIQS QAADGRVLVN IEQTADQAIL NWETFNIGGN
TTLNFLQNPD WAVLNRVNDP AARPSQILGQ LKANGTVFVA NRNGVVFGNN SQVNVRNLVA
AAARISDAQF REHGLYSADA NTSALTDAVG KVMVERGARI TTHEPTTATR GGGYVLLAGH
SVENAGQIET RKGQAQLAAG DSFVIRRGMG TAQNTASTTR GNEIAPRFIA DSTAGGVRNS
GLIQAREGDI TLAGRRVEQA GVAVASTTLN QRGTVHLLNS ASDAAGSVTL ASGSTTAVLL
EDDGKSTALD SQRDALIKES TEQDKLRGSS NSGTFDNLSR LQDRREQSRI EVVSGGDVNF
QAGALAVATG GQVIADASRR SYLDDGARVD VSGAVGVQVA MESNNVKVKV QGNELRDSPD
NRDSGKLISS EVWIDRRQLT EVAAGTGGYE GTRWYAGGGL LEVGGYLDNQ GHSISEWAAQ
GGTVQLAGKE VVSHAGSRIN LAGGSLDVQS GVVQQSWLRG RDGQLYRLDD APAEMLYDGL
YQGYEVKQER WGVTESFRNP LVAPTQRFDN GYTVGRDAGR LLISAPTAVL QGQVDTVAFQ
GLQQTRRPDQ AQDGYVQAQT AAARNAQLWL GRFDNTGRSA VFDSQIRIGA LQADTRSWTL
QAPIGETQRN TVWLDSEVLS AQRWGQVDLA SAGRIDLDGT LRLQEGGRLG LTASRVSFGG
TVQIAGGQVE AGNLLPALGG PTALLRSGRG AVDVAAGARI DLGGSWSNAL AAPDGDPAQA
WIDGGQMRFN SSHDIRVGEG AAILVDAGGA IDASGKGRVG KGGSVSLLAA STEVATDGSG
RIRIGDGTRF SALGNGSGNF TLATGGAVSI GAPGTDASAA SLRLQSALFR SGFAGYDIAG
HNGLTVEEGT QLLVERPALR LAEGAQQALA REQGVQAWTP SLYEADPASG RVSQRGGASL
ALRAGHPLST ADLRVGKGAR IEVDAGQTIE LFSAGNLDVA GSLKAAGGRI RLDEAFDPTG
VRGDQRRERR WTIAEGALLD VSGDSVSLPD ARGTLRGQVR KGGTIEIGGA LDWEDQDNVR
HLPPDTFVVV ERGARLDASG ASALLDIDGS GRTRVDSDGG SIVLRAGNAL YLQGDLDAAA
GGDGARGGTL GVAFGGGTYG RTAMNKEVLA PRVISLTQHA AQTTDLPARL EYGHAALSVE
QIEAGGFDHL ALFGDIRAQG DVTLRMDQSL RLQGLNERYL GFVPGNSEGN RLQLDAPYVR
LAQGRWWQPA GEGTLRPQEA GFDSGRQHRL DVSADLIDLR DVTWLAGFDQ VALRSRGDIR
MLAAVASSSR ESTLASPGSI DISAARMYPA ARAKGRIVAG VPDIGPSGLP FWQNPDAVLR
IHGIAGGAQP APDSAFGSLE LVAATVVQGG NVQAPWGHVQ LGGREFNADA ASRVDLLAGS
VTSVSGAGLQ LPFGGTVDGV GWRRNGADFD VLGPGSTNIP IGIDIVANAV DGVVGSVLDV
SGGGELSGAA FVAGRGGSVD ILRHALADAN PRYRFSGSDN AVYAIMPGRS GTQAPQAVAD
GSADPRIGQQ IVIPAGVPGL PAGTYTLLPA SYALQKGAFR VEVGAERAVG SRQAVATGTG
SWRVSGHRAQ SLGGAVSPLL TDLVLTPAEA VRRHANYNET SYSTFVQGVA ERRGEALRWR
PTDAGNLNLA LGEGAGRSSV PAAIFQGVSR FNAGSNDGRG GTLSVNLLTS NDAMLEIVTE
GGSAGSGSGA TVFDSALNAF RPETMLIGGV LRRDATTHSL EGRAQHIVVR NGVNLTAQEV
LLSAAFGGKG ILVEQGASID TLTGANTSRV AQPTTPYLVS GGLLAVSNQR LTALSAQGGS
AAGPVAIDIG GCVVDCNGQT RLLSAGSINV VTDGALNIGD SVSYGTRQLG LGMSALNLGS
AEAIAAAAAA GALPAGMTMN QEVLQRLLRG NTATGAPALE ALSLTARDAI NVFGSVDLDT
RDGATGRSSL RSLVLGAPAI HGYGSAADHA RIFADTLVWD GTLAGTTLPG GEQTQPAGEA
MVGRLGQGQL EINARVLELG RAPFTRPSSS VAADRQVLGF AGVTLAASDR MLFSGKGSLD
VFQRQGDYVA GSGWQFSGGA LDIVTPLLTG SAGAQLAIRN GGTVQLRGAA ATAGSDALGA
ELSITAERVV IDSRVALASG RFEANARQGV ALGSNAVLDM AGRKVSLFDV DKYSWGGDVA
LSSRDGDIVA DAASRIDLSA RNNRGGRLTV AALGAQGGRV DLAGTLLGGA SGRYDAGGTE
VPYDGGELVV RARQLQDFSG LNTRLTAGGI TGGRTFQLSE GDLVIGDEVK ARNVDISVDG
GSLLVNGRID ASGEQVGSIR LSARDVLRID GTLDAHGSAL RVDSYGKIID SPNRAMIELT
SASGSLQLGA TSSMDLRAGT SVATGSSPGQ NDGRARGSVK LNVPRVGSND AAIDVASGLR
IAGAADLQVN AFRSYDSAPL ASAPDVHGHR PQVINQHWLD TVVDPDNSQW MNAALTNTGL
QQRTAALGSY RLRPGVQIVA RTSSDNPRGD LVVAGDIDLS GYRYGPQSNR TDPARRGFGE
SAALVLRAEG DINVYGSIND GFAPPPANPD EDGWVLLEGR SAGTPNTAFG GDLIVPGEGV
QLQRGTEFRA GATLNYALPF EAVTLPAGTV LPAEMRLSGP LLLPAGTVLG AAVTTAGGEV
VAAGTVLAQA LTLQAGARLG AGFRLRAPAP VAAQVWPAGV ALPMAMKLSD AVDLHAGAII
PSMTKVELAG DAPVKLRPAD ASGRQGRNWA LAAMLPEGTT SWDLTAVAGA DTTAADPRTR
SWGSDGSIVL ADSHYSTIGT VTKTTEWKGD RIVNLEGSLY WWGDESLAGK SPAEVAEIIG
TTEAEICGAG AFCGPAPRLV DKEGSLAWWG DESWVGRPAT ELGAEMGMSE EEICAAMGYC
YGGGTLTEVT TYGKRLGSPA WSVLRTGKGD LALLAAQDVR MKSGFGVYTA GAPTLLGDGS
DPQFNPVRTA APWHTSLLGR NQVSGNYDAA LASYRAWYPD HGGDVRVEAG RDIIGDAWTA
RSESGTAQRD QAGHSSAAVG GWLWRQGTGT TEGAQATPTS WWINFGTYST VGAEADAAPR
MVGFTGFGTL GGGNLSLEAG RHAGVLDPMG NALGLLTAPH SSAIVAAVGS TGRVSNGELH
LTGGGDLTLR SGGALNPGLR ASAQQPESFL QDLDLNGAVT NLRGSTLLQA ARIGGLSAIQ
SSYGTLGLMD PFVADAPVAM AGPLLILGDS TARLQARGDI VLGGAADAGR VPTANYNDVV
LGDGQHAAGT TWFSLWTRGT ALDLFSAGGN LAPSLAGSLH ATGSNMFAEG IALREQSVPA
SINYWLYPSR FSAVAASGDI ISAPLRGLGP GATTGDVILL APSASGQLDV LAGGSIHAAI
NATGIARSGS DARLPGPFDP AFMAQRGASL TRVGHNLSDD GVQGADPSAI LPLFAFGPST
PTTGALTSRP TSPSRFYAVA GDIIGLGSGG RRQLRRNIGE ESRTLFDWYE AGSAVQLRAG
RDVINANVTA LNTSGTDISG IEAGRDIIRS NLTVAGPGNV EVSAGRQLRQ EEAGSIVSLG
GVVQGDARPG ASIAVTAGNQ GIDFDALRTR YLDPANLADP AQSLASQPGK AVKIYDKELK
QWLQQRFGLA VDGAEVLAAF DRLPSEQQRI FLRQVYYAEL REGGREYNDR NGPRVGSYLR
GREAIATLMP DKDAAGATIQ RTGDILMYGG AGVRTEAGGN IELMAPGGQI VVGVQGVVPP
ASAGLVTQGQ GDIRLFSQDS VLLGLSRVMT TFGGDILAWS EQGDINAGRG SQTTLLYTPP
RRVYDGWGNV ILSPQAPASG AGIATLNPIA EVPPGDVDLI APLGTIDAGE AGIRVSGNIN
LAALQVLNAA NIQVQGESKG LPVLATVNVN ALASASAAAN SASQAAQDVM RKSQDDARRN
QPSVISVQIL GFGSGTSSIA PPARGTTANS GYDANSAFQF PQASRDEGTQ RR
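
Both sequences above are printed in space-separated blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch of that clean-up together with a GC-content calculation for the gene sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, chosen for illustration only.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna_text):
    """Percentage of G and C bases in a DNA sequence listing."""
    seq = clean_sequence(dna_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, used here only as sample input.
first_line = "ATGTCGCACC CGCATTCGCC CACCGCCCGT TTCCCTGCTG CCTTGTGGCG TAGCCATCCG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Applied to the complete gene sequence listing, the cleaned length can be compared with the Gene Length field and the GC percentage with the GC content field.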