Gene RS05701 details


Gene Information

Locus tag: RS05701
Symbol: RSp0540
ID: 1222847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia solanacearum GMI1000
Kingdom: Bacteria
Replicon accession: NC_003296
Strand:
Start bp: 673790
End bp: 684448
Gene Length: 10659 bp
Protein Length: 3552 aa
Translation table: 11
GC content: 67%
IMG OID: 637240400
Product: hemagglutinin-related protein
Protein accession: NP_522101
Protein GI: 17548761
COG category: [S] Function unknown
COG ID: [COG2911] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies); [TIGR01901] filamentous haemagglutinin family N-terminal domain
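The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(673790, 684448)
print(length_bp, protein_length(length_bp))  # prints: 10659 3552
```

Both results match the listed Gene Length (10659 bp) and Protein Length (3552 aa).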


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCGA AGTGCTATCG AACCGTTTTC AACGCTGTAC GCGGCATGTT GGTGGCCGTA 
GAGGAATCGG CGAGAAGCAC CGGCAAGGGG CGTCAGACCG GCGGGCAAGC CGGCACGTCG
GCCGCATCGA CCTCCACCGC CGCGCGCTTC GCCGTGCTGC CGGTGGTGTT CGGGGCATGG
TGCGTGTTGG GCTTGCCCTA CACGGTGCAG GCCCAGGTCG TCGCGGCCCC GGGGTCGGGC
GCCCATGTCA TCCAGACGCA GAACGGATTG CAGCAAGTCA ACATCGCACG TCCAAACGGA
AGCGGTGTTT CGCTCAATAC CTATACGCAG TTCAACGTTC CCAGCCAGGG CACCCTCCTC
AACAACGCCC CCGGCATCAC GCAAACGCAG CAGGCCGGCT ACATCAACGG CAACCCGAAC
CTGCTGCCCG GCGGTTCGGC GCGCATCATC GTCAACCAGG TCACCAGCAC GTCGCCGAGT
ACCCTGCAGG GGTATCTGGA GGTGGCCGGC CCGCGTGCGG AAGTGGTGAT CGCCAACCCG
AACGGCATCC TGGTCAACGG CGGGGGCTTC ATCAATACGA GCCGGGCGAC GTTGACGACC
GGGGTGCCGG TGTTCGGCGG CAGCGGCAGC CTGGATGCCT ACCGCGTGAC CGGTGGGCAG
ATCACGGTGC AGGGCGCGGG CCTGAACGCC AGCAACGTCG ACCAGGTGGA CCTGATTGCG
CGGGCGGTGT CGGTCAACGC TTCCGTGTAC GCCAACCAGC TCAACGTGGT GGCTGGCGCG
AACCAGGTCG ACCGTAGCAC CCTCGGCGCG ACGCCGATTG CCGGGGACGG CGCCGCGCCC
GCCAACGGCA TCGACGTGAG CCAGCTGGGC GGCATGTATG CCAACAAGAT TCTGCTGGCC
TCGACCGAGA AAGGTGTTGG GGTTTCCCTG CGCGGTGTTG CCGCTGCCCA GGCGGGCGAT
CTGACGCTGA CCTCCCAGGG CAAGCTGGTG CTCGCCGGCC AGACCAATGC GAGCGGCAAC
CTGTCCGTGT CCGCCCAGGG CGGCATCGAC AACACCGGCA CCACCTACGG CCGGCAATCC
GTCACGGCCA GCACGTCCGG TGACCTGACG AACAGCGGCA CGCTCGCCGC GCAGCAGAAC
CTGGCCGTCA ACGCGCACAA CGTGGCGTCG AGCGGGACGC TGGGGGCGGG TGTCAACAGT
GATGGCTCGC TCGCGCATGC GGGCGACCTC TCCGTGGTTG CCGGCGGATC GATGTCCGCG
ACCGGCCAGA ACGTGGCCGG CGGCAACGCG ACCCTGCAGG GCGCGAGCGT CAATCTTGCC
GGCAGCCAGA CCTCGGCCAA CGGCAACCTG AGTCTGAACG CCCAGACCGG CAATCTGGAT
CTGACCGGTG CGACAGCGAG CGCGGGCGGG GCGTTGAGCG CCAACGCCCA GGGCGCGCTG
ATCAATGACC GTGGCCATCT CGCCAGCCAG GGTGCCACGG CCATCACGGC CGGCAGCCTG
TCCAACCAGA ACGGACAGAT CGTCTCGCAG AATGCGCTGT CGGCCAACAT CGCGGGTGCG
CTCGCCAACC AGGGCGGCAC ATTGCAAGCC GCCGGAGCGC TCAATGCCAA CGCCGGCAGC
CTGGACAACA CGGCGGGCCA CATCGCCTCG CTCAATACCG ACGGCCTGAA CCTCACCACG
ACCGGGCGGC TTACCAACGC CCAGGGCGGC ACGATCGGCG GCAACGGCAA TGTGACCGTG
CAGGCCGGTC AGCTAACGAA CAGTAGCTCC ATCAGCGCCG TACAGAACCT GGGCATCAGC
ACAGCCCAGA GCCTGGCCAA TGCAGGCACG CTCGCGGCCA ACGGCAACAC CACGGTCTCC
GCCGGGACGA CGCTGACCAA TGCCGGCGGC ACGATTGCAT CCGGCCAGCG GACAAACGTT
TCCGCCGCCA CGCTCGACAA CAGCGCGGGG ACCCTCACCG GCAACCAGCT TGCCCTGGCC
GCAGCCAACC TGATGAACCG CAACGGCAGC ATCACGCAGT CCGGAACGGG GCCGATCATG
ATCGGCGTGT CCGGCACGCT CGACAACACC AGCGGCTCCA TCCAGACCAA CAGCGCGGAT
CTGGCCTTGG CGCCGGCCAC GCTGATCAAC GACCGCGGCA CGATCACCGA TTCTGGTACC
GGCACGCTGT CCGTGACGAC CGGAAACCTG TCCAACAACG GCGGCACGAT CGCGACGAAC
GGTGCGCTGG ATGTCCAGGC CGGCGCCGTA TCGAACCAGG GCGGCAAGCT GTCGGCGCAG
TCGCAGGCCA CGCTCAACGT CGCGTCGCTC GATAACAGCG CGGGCGGCTA CGTGGGCGCG
CAAGGCGTCG CCATCACGGA TCAGGGTGCA CTGAACAACG CGGGCGGCAC TGTCGCGGCA
AGCGGCGCGT TGACCGTATC GGCGGGCGCC ATCGCCAATG CCGGCGGGGC CATCAAGAAC
GCCGGCACGC AGGCAACCAG CGTGAGTGCA ACTCAAGCGC TCTCCAACAC CCAGGGCGGC
CTGATCGGCG GCAACGGCGA TGTGTCGGTG TCGGGCGGCA GCGTGGACAA CTCCGGCGGA
ACCGTTGCCG CAGGCGGTGC CGTCACCGTG CAGTCCGGCA GCACCCTGGG CAACGTGGCA
GGGCTGATCC AGGCCAAGGG CAACGCCTCG GTCACTGCGG CGGGGGCCAT CGCCAACACC
GGTGGCCAGA TCGAGGCGGA TGGCACCGCA TCCACGCTTC AGGTGGCTGG TGCGGCTGTC
GACAACACCA ACGGCCGGAT CGCCAACACC GGCACCGGTG CGACCCAGGT CACGGCGGCC
ACGGTCGTCA ACGCCAATAC CGGCGGTGCG GTCGGCGCTG GCACCATCGG CGGCAATGGC
GACGTGACGG TCTCCGGGCG GGCGTTGTCG AACACCCAAG GCGGGCAGAT TGTTGCCGGC
CACAATCTGA CGCTCGCGAC GGCGCAGTCT GTGAATAACA GCACGGGCTC GTTATCGGCC
GCCAACAATC TGACGCTTGA CCAGTCGGGT GCCGTCGTTA TCAACCAGGG GGGCTCGATG
CGCGGCAACG GCGCCGTGAG CCTCAATGCC GCGTCGATCG ACAACACCTC GGGCAAGATC
GGCAACGACG CGGGCAGCGG CGGCAGTGTC GCGATGACGA CCGGTTCACT GGCCAACCAG
GGCGGCGCCA TCGGCAGCGA CCGGAACCTG AGCGTCACGA CCGGGCAACT GAGCGGCGAC
GGCCGGATCA TCGCGGGGGG CGACGGCGCC ATCACGATCA ACGGCAACTA CACCCACTCG
GCCGCCAACC AGATCCAGGC CAACCACAAC CTGACCCTCA CCACCACCGG CACCCTGACC
AACCAGGGCA CGCTGGCCGC CGTCAATGCG CTGACGGTCA ATGCGGCGAA CGTCGACAAC
CGCGCCGGGG CCGATCTGAA CTCGGCGTCC ACCTCGGTCA ATGCCGGCGG TGCGATCACC
AACGCGGGCC GGATCGAAGG CGACACCGTC ACCACGCAAA GCGGCGCGTT CGCCAATACG
GGGACGGTGG TGGGCAACAA CGTGACGTTG AATGCCGGCG CCATCAGCAA CACGGGCGCC
TCGGCGGCGC TTGCCGCGGC GACGCAATTG AACCTGTACG CGTCCGACCG CCTGTCCAAC
ACGGGCGGCG CGACGCTCTT CAGCCTGGGC GACATCAACA TCGCCGCCAA TGGCGCGCGG
GATGGGAACG GCCTGCTGGC CAACCGTTCG AACCTGGTCA CCAACGATCA GTCGACGATC
GAAGCGCAGG GCAACCTGGA GATCGCCACC CAGACGCTGA ACAACACGCG GCCGGAGCCG
ACCGTGCAGA CCGTGACGAC AGGCACGAGC ACCGCGCACG AAACCAAGCG CGGCAAGTAC
ATCGCCTGCG CGACCATGAA CGCCGCACCG CACGGCGGCT GCACGCAGGC GGTCTGGAAC
AGCGGCTACA AGACGCCGAT CGATGCGACC TTCTCCACGG CGCAGATCGT GTCGCAGACC
TCCGGGCCGA ATCCGGTCGA CAACGTGCTG GTGGTCAATG TCAACGGCCA GAACCAGACG
ATCTACTACA ACGCGGTCAC CAACAACGGC AACGGCACGG TGACCGTCAA CTACTGGGAC
GCCTACGACC CGCATACCAA CTACGTCCCC TCCACGGAGT ATGCGACGCG CAGTGACGGC
CACAACGGCT ACCAGCGTGT CGAAATCGCC CGCGACACGA CCACCACGAC CCAGCAGGAC
CAGGTGAGCG GCGGCAGCGC ACCGCAGGCC CAACTGCTGT CCGGCCGCAA CATGACGCTG
GCCAACGTCG GCACCATCAA CAACAACTAC AGCGCCATCG CGGCCGGCGG TTCGATCAGG
ATCGGCAGTT CGCAGCAAGG CGGCGCGGTG GGCAGCGGCA ATTACGGCGG CACCACGGTC
AACAACGTCG GGCGGACGCT CTATCAGTAC CAGACGCAGA ACATCGTCTC GACCTATGCG
TGGAACGAGG GGACGAACGA GGATGTCGGC GCGGTGGCGC AGGCGCCGTC GGTGTTGCCG
CCGGTAGCGA TCGGCGGCAC GGGCGGCACC CTCATCGCCA ACAACGCGGT CCAGATCAAC
GCGACGAACC TGAACAACAC CAACGTGGCC GCCGCCAGTT CCGCGACCGG GGCGACAGGC
GGCACGCTGG GCGCGAACCA GTCCGCTTCC GGCGTGACCG CCTCCGGCCA GCAGGCGGTG
GGGGCGGCCA GCGGCCAGGC ACCGTCCGTC AACGCGCCAC AATCGGTGGC GGGCAGCAAC
GGCGCGCTCA ACATCAGCCT GCCGACCAGC GGCCTGTTCT CGTTCCGCAC GGCGCCCGGC
CAGCCCTACC TGATCGCCAC CGACCCGCGC CTGACCAGCT ATACCAAGTT CATCTCCAGC
GACTACATGC TGAGCGCGCT GAACCTGAAC CCGCAGCAGG TCCAGAAGCG CCTGGGCGAC
GGCTTCTACG AAGAGAAGCT GGTGCGCGAC CAGATCACGC AGCTCACCGG GCGCGTCTAT
CTGCAGGGCT ACGGCAATAA CGAAGACCAG TACCGCGCGC TGATGGCCTC GGGCGTCAAC
GCGGCCAAGC AGTTCAGCCT GGTGCCGGGG ATCGCGCTGA CGGCCGCGCA GATGGATGCC
CTGACCAGCG ACATCGTCTG GCTGGTCAGC CAGACCGTGA AGCTGCCGGA CGGCTCCACG
CAGCAGGTGC TGGCGCCGGT CGTCTATCTG GCACATACGC ACGCGAACGA CCTGCAGCCC
ACCGGCGCGC TGATTGCCGC CGATGATGTG CAGATCCATG CGGTGGGCAG CGCGACCAAC
TCGGGCGTGA TCAAGGGCGG CACGCAGACG GTCATCACCG CCACGGACAT CGTCAATCGG
GGCGGCACCA TCGCGAGCGA CAAGGCCCAC GGCACGACGG TGGTCTCCGC CACCCACGAC
ATCCTCAATG CGTCGGGCGA GATCAGCGGC AACCGGGTGG CAGCGCAGGC GGGCCACGAC
ATCGTCAACA CAACCCTGGT GGATACCGTT GGCGCGACGG CGGTTGCGGG CAACAGCCGG
GCAAACGTAA CGCTGGTCGG CCGTCAGGGT TCGATTGCCT CGACGGGCGA TCTGCTGGTG
CGGGCGGGTA ACGACCTGAC CGTGCACGGC GCAAACATTG CGGCCGGCGG CAACGCCCAG
GTGACGGCGG GGCATGACAT CCTGGTCGAT GCCGTGCAGT CGACCACGTC GCAGTCGGTC
ACCAAGAACA GCCAGCATCA CTGGGAAGCC GACAGTACGA CCCATCAGGG CAGCACGATC
TCGGCCGGCG GCAGCCTGGC CATGCAGAGC GGCAATGACA CCACCTTCAA GGGGGCGAAG
GTCAGTGCCG GGCAGGATCT TGTCGTCGTC GCCGGCGGCG ATCTGACGGC AACAACGGTC
ACCGACACGT CGAAGTACAA CAACGTCGCG GCAGACGACA AAGCGCGGAA AGAGTCCAGC
CGGACCTACG ACGAGACGGT CGCCGGGACC ACCTTCACCG CAGGCCGCGA CGCCACCTTC
GCCGCCGTGA ACGCGAACGC CGGCGGGCAG GCGCGGACCG ACGGCAAGGG GAATGTGACC
TTCATCGGGT CGTCGGTCAC GGCAGGCACC GCGCAGCAGG ACAACACGTC GACCGCCTCC
GCTGCGGGTA ACCCCGAGCA CATGACCGTT GGTCGTGTCG ACCCGACGGG CACCAGGTCT
GGCGCATCGA CCAAGGCGGG CGGCGTCACC ATCGTGGCCG ACCGCAATGT GACCCTGGCC
GAGGCACGCG AGGTGCATGA CAGTACCCGG TCCGTCTCCA GCGAGAGCGG GAGTGCGTTG
TCGTCCAAGT CGGCCTCGTC CAGCGATGCG ATGCACCTGG ACGTGGGGGC GGGGAGTTCA
GTCTCGGGCA ATTCCGTCCG CGTGCAGGCC GGCAATGACC TGACCGTGCG CAACAGTGCC
GTGGTGGGCT CGGGCGATGT CAGCCTGAAC GCCGTGGGCG GCAATGTGCT CATCACGGCC
GGCCAGAACG TCCGCGACGA ATCGCACAGC TTCGAGCAGA AGCAGTCGGG CTTCTCCGGC
ACCGGGGGGG TCGGGATCGC CTACGGCCAC AGCGGCGCCA ACGGTCGTTC CGAACTGCAC
GAGGTCACGC AGAGCGATGC GCGCAGCACG GTCGGCAGCA CCGGCGGCAA CGTGTCGATC
TCGGCCGGCA AGGACGCGGC CATCATCGGC AGCGATGTGA TGGCGGGCTC GACCGGCGGC
GCCACCGGCA ACATCGATGT GCGCGCGCAG AACATCCGGA TCGAAGCGGG GCAGGATCAC
GCGTGGTCGA GCTCGTCCCA GGAGGCGCAC AGCAGCGGCA TTTCCGTCGG GCTGGTGGGC
ACGCCGCTGG ACACCTTGCG CAACCAGCGC GAGGCGCAGC GTGACCCCAG CAAGGTCAAC
CGCGTGCGCA ACTCGCTCAA TGAGGTGGGC GCGGGGGCGC TGGATACGCC GCAACTGGCG
ATCGGGTTCA ACGCCCGCGG CAGCCGCAGC CAGACGTCGA GCGAATCGCT GACGCACAGC
GCCAGCCAGC TGACCGCATC GGGCGACATC CGCCTGCGCG CCACGGGCAA CGGCGCCACC
GATGCCAACG GCCGCGCCGC CAGCGGCGAT ATCACCGTCA CGGGCAGCAC CTTGAGCGCC
GGTGGCACAG CAGCGCTGGA TGCGCAGCGC AGTGTCGTGC TACAGGCGTC CACCGACACC
TACCAGGAGT CGAGTTCGGC CAGCAGCTCC GGCTCGCACT TCAGCACGGC CGGCCCCTCC
TGGGGCGATC TTGGCCGCAA TGTCGGTGGC GGGCCGAACA GCAGCGGGGT GGGGCTTGCA
CCCTACGGCT CCGCTCACAG TGCGGACAAC GCCGCCGGCA ACAGCAGCCG CCAGAACGCG
TCGGTCGTGA TCGGCAAGAG CGTGCAGGTG CAGGCGCGCA CGGGCGACAT CACCGTCTCG
GGCAGTGGCA TCTCGGCGCT GTCGGATGTG GACCTGCTGG CCAAGCAGGG CAAGGTCGAC
ATCGTGGCGG GCAACGACAC CTCCAGTCGC CACGAGGACC ATTCCGACCG CACGATCGGC
GACCTGGGCG GCAACGGTTA CTCCGGCACG GTGGGCGTGC GCAGCGCCAG CAGCACGCTG
GATACCGCCA AGAGCCAGCA GAGCACGATC CGCAGCCAGG TAAGCAGCGC GGCGGGCAAC
GTCACCATCG CCGCGCGAGA CGACGTAACG GTGCACGGTG CCGATGTGTC GGCCGGTGGG
GACCTCAAGG TGACCGGCCG CAACGTGCTG CTCGACGCGG GGCAGGATGC CGAGCGCAGC
CGGCAGACGG AATCGTCGAG CCAGTATGGG GTGACGCTGG CGATGTCGGG CTATGCAGTG
AGCATCGCGC AGTCGGTGGA GCAGGCGGGC CGCGCCGTCG AGCAGCACAA GGATCCGCGC
GTCGCGGCGC TGTATCTGGC GCAGGCGGCG TTGATGGGTT ACAACGCTGC GGGGAATCCA
GGTCTGAACT CCCCCAACGG ATCTGCCATC CAGGTGCAGG CAGCGGCGCA GCCGCAAGCG
CAGTCGTCCG CCATCGTCAA GGCCACGCTC AGCATCGGCG GCGGGTCGTC ATCGAGCGAG
TCGAACGCCA ACGCCACGGT CAACCAGGGC AGCACGCTGC GCGCTGGGCA GAACGTAAGC
ATCACGGCGA CGGGCAAGGA CGCGTCGGGC AAGGTGGTGG ACGGCGACAT CGTGGCGCGT
GGCTCCAGCA TCTCGGGGCG CAATGTCTCG CTTGATGCGG CACGCGACAT CACGCTGGAG
AGCCGGCAGG ACAACACCCA CCAGGACAGC AAGAGCGGCG GCTCGAACGC GAGCATCGGC
GTGGGTGTGG CCCTGGGCGG CAACCAGACC GGCTTCACGC TGGAGCTGGC CGCGGGCTTC
AACCGGGCGC ATGCGGACGG GGATGCGGTC ACGCACGTCA ACTCGTCGGT CAACGCCGCC
GACACGCTGA CGCTCAACGC GGGTCGGGAT GCGAACCTGC GCGGCGCGCA AGCGTCGGGC
AACACGGTGA ACGCCACGGT TGGCCGCAAC CTGAACGTCG AGAGCCGACA GGACACGGAC
AACTACGCCA GCCGTTCGGA GAGCGGCGGC GCGCAGGTGA GTCTGTGCTT TCCGCCGTTC
TGTTACGGGT CGACCTTCAG CGGCAACGCG AACGTCGCGG AGGGCAAGAC CGACAGCACG
TATGCGTCAG TGGTGCATCA GAGCGGCATC GCAGCGGGCA CGGGCGGCTA CAACATCAAC
GTGAAGGGCA ACACGGATCT GGTAGGCGGG GTGATCTCGT CGACAGCGGC GCCGAGCAAG
AACGTGCTGC GCACGGGCAC GCTGACCACG CGGGACGTGG AGAACCACGC GGCGTACTCG
AGCGAGCAGA GCAGCGTCAG CGTGAGCTAC ACCAGCAACA ATCCGCTCAG GCCCGATGCG
GTGCCGACGC CGCTGCAGCA GGGGGTGAGC AACCTGGCGA GCAATGCCGT CGGCAACGCG
CAGGGACCGA TCGCGGGGAA TGCATCGGGC ACGACGCGCT CGGCCATCTC GGCGGGCACG
GTTGTGATCA CGGATAACGC TGGGCAACTG GCGAAGACAG GCAAGGACGC GGAGGCCACG
GTAGCGGGCT TGAACCGGGA CACGGAGCAC GCTAATGATG GGGCGATCGG GAAGATCTTT
GATAAGCAGA AGGTTGAGGA GCAGCAGGAG ATTGCGCGGT TGCAGGCGCA GGTGGTGCAG
CAGGCGGCGC CGCTGCTGTA CACCAAGGTC GGTAGCATGC TGGAGGGGCA GCCGCCCGAG
GTGAAGGTGG CCGTGCACGC GCTGATTGGC GGGCTGATCA GCCGGGCGAT GGGTGGGGAG
TTTGTGGCCG GGGCGGCGGG CGCGGGCGCG GCGACGCTCA TCATGGAGAC CTTCGGCAAG
GAGCTGGAGA GCAGCGACGC GCTGCGCAAG CTGTCGGAAA AGGACCGCAA TGCGCTGATG
CAACTGGTGT CGGGCGCGAT CAGTGGGGTC GTGGCGGGAG CGGTCAGCGG ATCTGGCTCG
GCCGCCGCTG CGGGTGGGGC GGCGTCGCAG ATGGCGGAGC AGTTCAATCG GGAACAGCAC
AAGCACAAGA ATCCGGAGAA GGATGAAAAG ACGGCCCTGG CGCAATTGCA AGAAGGGAAG
TCGCCGGAGG AGCAGCAGGG GCTGGCAGAT GCTGCGTGTG CGTTGATTCA CTGCTCAGCC
GGTCTTTCTG ACAACGATCC AGACAAGGCG GCACTCGAGG CATCGGAGCG GCGTGGTGCA
CAGAACTTGG TTCAGCAGGG GCAACTCAAG GCTACCGGGC TGTTTACATA TTCCTGGGGC
GACGCTGGAT CCGATGTGCG GAGCCGGGAG CTGGACTGGA TCAAGCACCT ACTCATCAAG
GCCAACCAAG GTCTCGATGA CGCCTCGGCG AGTGTCAGCC GAGGGATGCA GAACTCCGGC
AATCCCGGCT CTCATGTAAC GCCGAGTGAT CTGGACAACC TGACTGGCGG TAGTGGCGGT
GGTGGTGGTC CGAGCGGCCC AGCCAGGGCG GTCGTTACGC CGGGCGTTAC CATGTGCGCA
CCGGGCGTTC TTTGTCCGAC CGTCAATGTT GCACCGGGAG CACCGAGTTC GGGGTTGCCT
AGCAATGCCA TTGCGTCGAG CGGCGGCGAC GACAGTACGA CTACGGGGAG TAAGAGTAAT
GACAGCAATA CAACAAAACG TCCAACTCCG CGCAATTCTG AGACGGATGT TGGTTCTGAT
CTTGGCTCAG CCTACAGGCC TCAAGTAAGC TACAAAGATG GCAAAGAGGT CCCGTATGGA
ACAAAAGGGA GTGTTCGCCC CGATTGGTGC AATACGACTT CGTGTAGTGT TGAAGTAAAG
AACTACAATA TAGACTCTAA CTCTAGTGGT CTTATTAAAA ATGTTGCCGA ACAAGCAATC
CAGCGGCAAG CGAATTTGCC TGCAGGAATG GATCAGCAGG TCGTGATTGA TATACGTGGG
CAGACAGTAA CGGATGCGCA GAAGATTTCG ATTATTAAAG GGATTGTGAA GCAATCAAAC
GGAATTATTG GCCCAACCTC CATAAGATTT AAACAGTGA
 
Protein sequence
MNAKCYRTVF NAVRGMLVAV EESARSTGKG RQTGGQAGTS AASTSTAARF AVLPVVFGAW 
CVLGLPYTVQ AQVVAAPGSG AHVIQTQNGL QQVNIARPNG SGVSLNTYTQ FNVPSQGTLL
NNAPGITQTQ QAGYINGNPN LLPGGSARII VNQVTSTSPS TLQGYLEVAG PRAEVVIANP
NGILVNGGGF INTSRATLTT GVPVFGGSGS LDAYRVTGGQ ITVQGAGLNA SNVDQVDLIA
RAVSVNASVY ANQLNVVAGA NQVDRSTLGA TPIAGDGAAP ANGIDVSQLG GMYANKILLA
STEKGVGVSL RGVAAAQAGD LTLTSQGKLV LAGQTNASGN LSVSAQGGID NTGTTYGRQS
VTASTSGDLT NSGTLAAQQN LAVNAHNVAS SGTLGAGVNS DGSLAHAGDL SVVAGGSMSA
TGQNVAGGNA TLQGASVNLA GSQTSANGNL SLNAQTGNLD LTGATASAGG ALSANAQGAL
INDRGHLASQ GATAITAGSL SNQNGQIVSQ NALSANIAGA LANQGGTLQA AGALNANAGS
LDNTAGHIAS LNTDGLNLTT TGRLTNAQGG TIGGNGNVTV QAGQLTNSSS ISAVQNLGIS
TAQSLANAGT LAANGNTTVS AGTTLTNAGG TIASGQRTNV SAATLDNSAG TLTGNQLALA
AANLMNRNGS ITQSGTGPIM IGVSGTLDNT SGSIQTNSAD LALAPATLIN DRGTITDSGT
GTLSVTTGNL SNNGGTIATN GALDVQAGAV SNQGGKLSAQ SQATLNVASL DNSAGGYVGA
QGVAITDQGA LNNAGGTVAA SGALTVSAGA IANAGGAIKN AGTQATSVSA TQALSNTQGG
LIGGNGDVSV SGGSVDNSGG TVAAGGAVTV QSGSTLGNVA GLIQAKGNAS VTAAGAIANT
GGQIEADGTA STLQVAGAAV DNTNGRIANT GTGATQVTAA TVVNANTGGA VGAGTIGGNG
DVTVSGRALS NTQGGQIVAG HNLTLATAQS VNNSTGSLSA ANNLTLDQSG AVVINQGGSM
RGNGAVSLNA ASIDNTSGKI GNDAGSGGSV AMTTGSLANQ GGAIGSDRNL SVTTGQLSGD
GRIIAGGDGA ITINGNYTHS AANQIQANHN LTLTTTGTLT NQGTLAAVNA LTVNAANVDN
RAGADLNSAS TSVNAGGAIT NAGRIEGDTV TTQSGAFANT GTVVGNNVTL NAGAISNTGA
SAALAAATQL NLYASDRLSN TGGATLFSLG DINIAANGAR DGNGLLANRS NLVTNDQSTI
EAQGNLEIAT QTLNNTRPEP TVQTVTTGTS TAHETKRGKY IACATMNAAP HGGCTQAVWN
SGYKTPIDAT FSTAQIVSQT SGPNPVDNVL VVNVNGQNQT IYYNAVTNNG NGTVTVNYWD
AYDPHTNYVP STEYATRSDG HNGYQRVEIA RDTTTTTQQD QVSGGSAPQA QLLSGRNMTL
ANVGTINNNY SAIAAGGSIR IGSSQQGGAV GSGNYGGTTV NNVGRTLYQY QTQNIVSTYA
WNEGTNEDVG AVAQAPSVLP PVAIGGTGGT LIANNAVQIN ATNLNNTNVA AASSATGATG
GTLGANQSAS GVTASGQQAV GAASGQAPSV NAPQSVAGSN GALNISLPTS GLFSFRTAPG
QPYLIATDPR LTSYTKFISS DYMLSALNLN PQQVQKRLGD GFYEEKLVRD QITQLTGRVY
LQGYGNNEDQ YRALMASGVN AAKQFSLVPG IALTAAQMDA LTSDIVWLVS QTVKLPDGST
QQVLAPVVYL AHTHANDLQP TGALIAADDV QIHAVGSATN SGVIKGGTQT VITATDIVNR
GGTIASDKAH GTTVVSATHD ILNASGEISG NRVAAQAGHD IVNTTLVDTV GATAVAGNSR
ANVTLVGRQG SIASTGDLLV RAGNDLTVHG ANIAAGGNAQ VTAGHDILVD AVQSTTSQSV
TKNSQHHWEA DSTTHQGSTI SAGGSLAMQS GNDTTFKGAK VSAGQDLVVV AGGDLTATTV
TDTSKYNNVA ADDKARKESS RTYDETVAGT TFTAGRDATF AAVNANAGGQ ARTDGKGNVT
FIGSSVTAGT AQQDNTSTAS AAGNPEHMTV GRVDPTGTRS GASTKAGGVT IVADRNVTLA
EAREVHDSTR SVSSESGSAL SSKSASSSDA MHLDVGAGSS VSGNSVRVQA GNDLTVRNSA
VVGSGDVSLN AVGGNVLITA GQNVRDESHS FEQKQSGFSG TGGVGIAYGH SGANGRSELH
EVTQSDARST VGSTGGNVSI SAGKDAAIIG SDVMAGSTGG ATGNIDVRAQ NIRIEAGQDH
AWSSSSQEAH SSGISVGLVG TPLDTLRNQR EAQRDPSKVN RVRNSLNEVG AGALDTPQLA
IGFNARGSRS QTSSESLTHS ASQLTASGDI RLRATGNGAT DANGRAASGD ITVTGSTLSA
GGTAALDAQR SVVLQASTDT YQESSSASSS GSHFSTAGPS WGDLGRNVGG GPNSSGVGLA
PYGSAHSADN AAGNSSRQNA SVVIGKSVQV QARTGDITVS GSGISALSDV DLLAKQGKVD
IVAGNDTSSR HEDHSDRTIG DLGGNGYSGT VGVRSASSTL DTAKSQQSTI RSQVSSAAGN
VTIAARDDVT VHGADVSAGG DLKVTGRNVL LDAGQDAERS RQTESSSQYG VTLAMSGYAV
SIAQSVEQAG RAVEQHKDPR VAALYLAQAA LMGYNAAGNP GLNSPNGSAI QVQAAAQPQA
QSSAIVKATL SIGGGSSSSE SNANATVNQG STLRAGQNVS ITATGKDASG KVVDGDIVAR
GSSISGRNVS LDAARDITLE SRQDNTHQDS KSGGSNASIG VGVALGGNQT GFTLELAAGF
NRAHADGDAV THVNSSVNAA DTLTLNAGRD ANLRGAQASG NTVNATVGRN LNVESRQDTD
NYASRSESGG AQVSLCFPPF CYGSTFSGNA NVAEGKTDST YASVVHQSGI AAGTGGYNIN
VKGNTDLVGG VISSTAAPSK NVLRTGTLTT RDVENHAAYS SEQSSVSVSY TSNNPLRPDA
VPTPLQQGVS NLASNAVGNA QGPIAGNASG TTRSAISAGT VVITDNAGQL AKTGKDAEAT
VAGLNRDTEH ANDGAIGKIF DKQKVEEQQE IARLQAQVVQ QAAPLLYTKV GSMLEGQPPE
VKVAVHALIG GLISRAMGGE FVAGAAGAGA ATLIMETFGK ELESSDALRK LSEKDRNALM
QLVSGAISGV VAGAVSGSGS AAAAGGAASQ MAEQFNREQH KHKNPEKDEK TALAQLQEGK
SPEEQQGLAD AACALIHCSA GLSDNDPDKA ALEASERRGA QNLVQQGQLK ATGLFTYSWG
DAGSDVRSRE LDWIKHLLIK ANQGLDDASA SVSRGMQNSG NPGSHVTPSD LDNLTGGSGG
GGGPSGPARA VVTPGVTMCA PGVLCPTVNV APGAPSSGLP SNAIASSGGD DSTTTGSKSN
DSNTTKRPTP RNSETDVGSD LGSAYRPQVS YKDGKEVPYG TKGSVRPDWC NTTSCSVEVK
NYNIDSNSSG LIKNVAEQAI QRQANLPAGM DQQVVIDIRG QTVTDAQKIS IIKGIVKQSN
GIIGPTSIRF KQ