Gene PXO_02976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPXO_02976 
SymbolfhaB 
ID6308214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthomonas oryzae pv. oryzae PXO99A 
KingdomBacteria 
Replicon accessionNC_010717 
Strand
Start bp4812409 
End bp4822989 
Gene Length10581 bp 
Protein Length3526 aa 
Translation table11 
GC content66% 
IMG OID642641939 
Productfilamentous haemagglutinin; haemagglutination activity domain protein 
Protein accessionYP_001915776 
Protein GI188578847 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.224102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCTGCG CGCTCGGCCT TGTCAGCCTG GTCGACGCCG CACAGGCGCA GTCTGCTGGA 
CGGATCGTCG GCGCTCCCGC AGCGCCCGGA AACGAGCGCC CGACGGTCAT GACCGCGCCC
AACGGGGTGC CGTTGGTCAA CATCACCACA CCCTCCGCCG CAGGCGTGTC GCGTAACCGC
TACTCGCAAT TCGATGTGGG CCGCGAAGGG GCGATTCTCA ACAACGCACG CACTCAAACC
CAAACCCAAT TGGGCGGCTG GGTGCAAGGC AATCCGTGGC TGGCCACCGG CAGCGCCAAG
GTCATCCTCA ACGAGGTCAA CGGGCCGACC AGTCGGCTCA ATGGCTACCT CGAGGTGGCC
GGCCAGCGTG CCGAAGTCAT CATCGCCAAT TCCGCCGGCA TCCAGGTCGA TGGCGGCGGC
TTCCTCAATG CCAGCCGGGT CACCTTGACC ACCGGCACGC CGATCCTCTC CGGTGGCGCG
CTGGAGGGCT ACCGGGTCAG CGGCGGCGCG ATCCAGATCG GCGGCGCCGG CCTGGATACC
AGCCGGGCCG ACTACACCGA CCTGATCACC CGCTCGCTGC AGGTCAATGC CGGCATTTGG
GCCAACCAGC TGCAGGCCAC CTTGGGCAAC AACGTGGTCA GTGCCGATCA CCGCAGCGTC
GCCACGCAGG CTGCTGATGG CCAGGCACCC ACGTTCGCGC TGGATGTCGG CGCGCTCGGC
GGCATGTTCG CCAACAAGAT CTGGCTGGTC GGCAACGAAC ACGGCGTGGG CGTGCGCAAT
GCCGGCAATC TCGGTGCGCA GGCCGGCGAA CTGGTGGTCA CCGTCGATGG CCGGCTCGAA
AACACCGGTG CCTTGCAGTC GCAGCAGGAC ACGCAGATCG CGGCCAGCGG CGGAATCGCC
AACAGCGGCA CCCTCAGCGC CGCGCGCGAG CTGCGCGTCG GCACCGCGGC GGACCTGGAC
AACCGCGGCG GCACGCTGGA TGCGCAACGG CTGCAGATCG ACGCCGCCAG CCTGCGCAAC
CAGGGCGGCA CGCTGACGCA GACCGGCATG CAAGCGCTCG AGGTCACTGC CGGCAGCGCC
AGCAACCGTA CCGGCGGCAG CATCGGTGCA CTCGCCACCG CGCCCTCGCG CGATGGCGGC
AGTCCGGCTC CGGGGGGCGA CAACACGGGC ACCGACACCA ACAACAACGC CGGCACCGGC
ACCGACGGCG GACGCAGTCC AACACCGACC CCGGGCGGCC AGAGCCCGGT GGCGACGGCG
CCGCTGGCCA CCGGCACGCT GCACATCGCT GGACTGTTGG ACAACGACGG TGGCCGCATC
GGCAACGGCG GCGCGGTGCG CCTGGCGGCG GCCGCTGGCC TGGACAATAG TGCCGGCCAG
TTGGGCGTGG CGGCGCTGCA GGTGCGTGGC GATCTGCGCA ACAGCGCAGG AACGCTGGAG
GTGCAAGGCG ATGCCGATCT GCACCTGGGC GCGCTGGTCA ACGATCAGGG CCGCTTCAGC
GTCGCCAACG CGCTCAACCT GCAGGCGCAA TCGCTGAGCA ACCGCGCCGG CGATCTGCGC
CATGGCGGCA GCGCTGCGAG CACCTGGCAA GTGGCGGGGC TGCTGGACAA CCAGGGCGGC
GTGCTGACCA GCAATGCCGC CGCGCTGCAG CTGCAGGCGC AGACAGTGGT CAATGCCGAC
GGTCGCATCG TCCACACCGG CACGCAAGGG CTGACGCTGG CCGTGCAAAC CTGGTCGGGC
GCCGGCGGCA GCATTGTCAC TCCAGGCGCG CTCACCTGGC AGGCCGGCAG CATCGATCAT
CGCAATGCCA CCCTCACCAC CAGCCAGCTG GTGGTGCAGG CCGCCACGCT GGACAACCGC
GGCGGCACCC TGGTGTCGAG CGGAACCCAG GCGGCGAGCG TGCACGTGGA CGCAGCGCTG
GACAATGGCG CTGGCGGCAC GATCGCCAGC AACGGCGCGT TAGACCTGCG CGCGGCGACG
CTCGGCAATG CCGGTGGCCA GATCCAGCAG GCCGGTCCCG GCTTGCTGCA GATCGCAGCG
CAAGCGATCG ACGGCAACGG CGGACGCCTG CTCAGCAATG GCGGACTGAC GCTCACCGGC
GGACGCATCG ACCTGACCGG CGGCACCACT GCCGCCCAGC AGGTGCAGAT CCAGGCCGAG
ACGCTGACCA CTGCCGGTGG CAGCGTGAGC TCGCTCGGCG ATCAGGCCCT GCAACTGACC
GTGCGCCAGG GCCTGGACAA TCACGCCGGT CACCTGGCGA GCAACGCCGG CCTGGCGATC
GCTGCCGGTG CGCTGAGCAA CCAGGGAGGC GCGATCGCCG CCGCCGGTAC CACCGATGCC
ACCGTCACGG TGGCCGGTCG CCTGGACAAC AGCGGCGGCA GCATCTCCGG CAACGGCCGT
GTCGCGGTGG ATGCCGACAC CTTGATCAAC CAGCATGGCA GCGTCCTGGC CGCCAATGGC
GCGCCGCTGC AACTGCATGC CGCCAGCCTG CTGGACAACA GTGCCGGCGG GCGCCTGGCC
AGCGGTGGCG AGTTCGCAGT GCAGGCCGGC ACGCTCGACA ACCGTGGCGG CGCGATCGAG
CACGGTGGCA ATGGCACCTT GGCGGTGAGC GCCGATACCC TGCAAGGGGC CGGCGGCAGC
TTGCTCAGCC TCGGCAGCTT GCAACTGCGC GGCGGCGTGC TCGATCTCGG GACCGGCAGC
ACGACCCAGG CCGACCGCAT CGACATCGCG GCCGCCAGCC TGCGCACCGC CGGTGGTCAT
CTCAGTGCGA CCGGCAGCGG CCCGCTGCAG GTGCACCTGA GCGACACGCT CGACAATCGC
GCCGGCAGCA TCGCCAGCAA TGGCGCCCTG GACCTGCAGG CAGCCACGCT GCTCAACAGC
GACGGCACGC TCAGCGGCGC CGGCAGCGCC GATGGCCGCA TCACCGTCAG CGGGCAATTG
GACAACACCC GTGGCCGCAT CGCCGCCAAC GGCGCCACCT TGCAGATCGC TGCCGACCAG
CTGATCAACG CGCAGGGCAC GCTCAGCCAC GCCGGCAGCG CCGGCCTGGT GATCGACAGC
CACCGCGTGG ACGGCAGCCA GGGCACCATC GTCACCAGCG GCATGCTGTC GTTGACCGCC
GCCACCGTCG ACCATCGCGG CGCCACCATC GGTGCCGACC GCATCGCGCT CGACGTCCAG
CAGCTGGACA ACAGCGGCGG CCGCATCGTC GCCAGCGGCA CGGGTGCCAG CAGTCTCCAG
GCCGACACGC TGGACAACGC CGGCGGTATC GTCGCCAGCG CTGGCGATCT GACCGTGGTC
AGCACGCTGC TCGACAACAC CCAGGGCACC CTCCAGCACT CCGGCAACGG CCAGCTGCAC
CTCACGGCGC AGACCCTGCT CGGCGACGCC GGCAAGCTGC TCAGCAACGG CGCACTGACC
TTGCATGGGC AGACCACCGA CCTGCGCAAT GCCACCACCT CGGCAGCCGC GCTCAGCATC
GACACTGGCG ATCTCACCAC CGCTGGCGGC ACCCTGGTGG CGACCGGCAG CCAGCTGCTC
ACGCTCACCG CGCGCGGCAC GCTGGACAAC AGCGGCGGCA GCATCGGCGG CAATGGCGTG
ATCGCGCTCA GTGCGCAGAC GCTGCGCAAC GCCCAGGGCA CGCTGCAGGT CGCCGGCACC
GGCGCCTCCA CCGTGGACGT CGCTCAGGCG CTGCAAAACC AGCAGGGCAA ACTGTTGTTG
GGCGGCAGCG GGCGCATCGC CGCCGCCAGC ATCGACAACC AGGGCGGCAC CCTGTCCGCG
GCCGGCACGG CCTTGCAGGT GCAGGTCGCC GGACTGCTGG ATAACCACGC CGCCGGCACC
GTCTCCAGCG CCGGGCAGCT GAGCGTGGGC AGTGGATCGC TCGACAACAG CGCCGGCACG
CTGGTGGCCG GCACCGATTT GAACGTGACC ACGACAGCGG CCATCGGCAA CGATGCGGGC
GCGCTGCAGG CAGGCGGCGG CGTGCAGCTG CAGAGTGCCG GGCTGTCCAA CCGCGCCGGC
AGCGTGATCG GCGGGCAGGT GGTCGTGGAC ACCGGCGGCC AGACGCTGGA CAACAGCGCC
GGCACCATCG GCAGCAAGCT GGGCACGCTG CAGATCCGCA GCGGCACGTT GGCCAATGCG
GGCGGGCGCC TGCAATCGCA GAGCGATCTG ACCCTGCAGA GCAACGGCGC GGCCATCGTC
AACAGCAATT CCGGCAGCAC TGGCGGCATC CTGGCCGGCG CTGCACTGCA GATCGACGGC
GGCACGCTGG ACAACCGCAA CGGCGCGATC GTCGCCCAAG GCCAGGCCCG GCTCGGGCTG
GCCAGTGTCG ACAACAGCAG CGCTGGCGTG CTCAGTGCAG CGGGCAACTT CACCCTGACC
GCGGCCACGC TCGACAACAC CAGCGGTCGC GTGCAGGGCG GGCAGAACCT CACCCTGCAA
CTCAGCGGCG CGCTCGCCAA CCAGGCCGGC CTGGTCACCA CCCGCAACCT GCTCACGCTC
AACGCGGCCA CCGTCGACAA CCGCAATACC CGCGCCAATG CGCTGCAAGG CCTGCAGGCC
GGGCAGCTGC AAGTGCAGGC GCAGGCGCTA GACAACCGCC AGGGGCAGGT GATCAGCGAC
GGTGCCGGCA CCGTGCAGCT GAGCGCGGCG CTGGACAACA GCGGCGGTCA GATTTCCAGC
GGCAGCAGCC TGGACATGCG CGCCGATGCG GTCACCAATA CCGCAGGGCT GTTGCGCTCT
GGCGGCAATC AACGGCTGGA CGCACGCGCG CTCAGCGGCG ATGGCCAGCT GCAGTCGCAG
GGCGATCTCA GCCTCACCCT GCGCGAGGGC CTGACCAATA CCGGCGAGCT GTCGGCCAAT
GGCACCCTGT CCATCCACAC CGATGGCGAT CTGGCCAACC AGGGCGTGCT GCGGGCCGGT
AATCTCGACG TGGCAGCGCG CAATATCGAC AACGCCGCCA ACGGCCAGAT CACCAGCCAG
GGGCTCACCC ACCTGGTCAG CAACGGCCAG TTGCTCAACC GTGGCCTGAT CGATGGCGGC
GTGACCCATC TACAGGCCGC CATGCTGGAC AACCTGGGCA CCGGGCGCAT CTATGGCGAC
CAGCTCGCCA TCGCCGCCGG CAGCGTGCTC AACCACGCCG AGACCGTGGG CGGCGCGACG
CGCGTGGGCA CCATCGCCGC ACGCCAGCGG CTGGACCTGG GCGTAGGGCA GCTGACCAAC
AGCGACCGCG GCTTGATCTA CAGCGATGGC GATGCCGCCA TCGGCGGTGC GCTGGACGGC
AATCGCCACG CCACCGGCGC CGCCGGGCAG GTGGACAACC TCGGTTCCAC CATCGATGTG
GCAGGCAATC TCGATCTGCA GGCGGGGGGC GTCAACAACA TTCGTCAGAA CGTGGCGGTC
ACCCAGACCA CCACCACCCT GGCACCGGTG CGGCTGGATC AGCCGAGCTG GCGCAACAAT
GGCAAAAATG GCAACGCGCC GCTGCGCACC ACCAGCCTCT ACAGCGCGTA CGAGATCTAC
TACCTCAACC CGCGGGACAT CGTTGAGGAC ACGCCTTACA TCACCCCGGA TGGATATCAG
GTGCGACGCG CAGTCATCCG CGTCAACCCG CAGACCAGCG CGTATTTCTT CGCGCGCGGC
GCGCTGTACC GGGCGACCGG CGAGCGCTCG CGGCTGGACC CACGCACCGG AACGCTGACG
CTCTACTACT TCACCCGCGA TGACAACAAT ACCAATCCGG ATCAGGTGAG CAGCGGCGCC
AGCGATCCCT TCGCCGGGAT GACCACCGAC GAAGCGGGCG CACCCGCATT CCACTACGAA
AGCGACACGT TGCGGTATTC CAACGCTTAT GGCACCTGCA CGACGAACTG CGTGCGCTTG
ATTGCGCAAT ACGCCTACAC CGATCCGGAT CACATTCTGG TGAATCCGCA AGGCACCGGC
CCACTCAAGC TGGAGTACAA CGAGCACTAC CGGACCGCCA CCCAAACCGT GGTCGAAGAC
GTGCTGCAAC CCGGTGCCGG CCCGGATGCG GTGATCCGCG CCGGTGGCAG CATGCGCATC
GGCGCCAATG CGCTGCGCAA CGAGTACGCC AGCATCGCGG CCGGCGGCAA TCTGGCCATC
GTCGGCCTGG GTGGCAATCC CAACGCCAAC GTGACCAATC TGGCGTACAC GCTGTACCGC
ACCCACAGTT TCAGCAACGT CACCACCGCC TACAACGGCA CCACCCGCAG CTGGAGCAAT
CCGTCCATCT CCGAGCAGGT CGGGCAGGTC GGCGGCTCCA TCGTCAGCGG TGGGACGTTG
ACCATCGATG TCGGCGATCT GAGCAACCTC AATCAGGGGC GCGACGCGCC CAACGTGCGC
GATGGCGCGG CGATGGCCAA CCTCAACGTG CGTGGCGCGC AAGCAGGCCC CACCGGCGCC
AGCGCCAGCG TTCGTGGCCC GATCAGTGTC GCCAGCCAGG GCGTGGGGCG CATCACCACC
ACGGTGGCGC AGGCCGACAG CGGCAACGCC AGCAACAACG TCGGCAGCAT CGGCAGCGCG
GCCACCACCA CCGGCCCGCG GGTGGTCAAC GCCGCCGGCG GCTCGCCGGA TCGCATCGCC
ATGGGCACGC CCGACACGCG CGCCCCCACC GGCAGCCTGT TCACGCTGCG CCCGGCCAGC
GGCCATTACC TGGTGGAGAC CGACCCGCAG TTCACCGACT ACCGCAGCTG GCTGGGCTCG
GACTACCTGC TCCGGCAGAT GGGCTACTCG GCCGATGCGT TGCAAAAGCG CCTGGGCGAC
GGCTATTACG AGCAGAAGCT GGTGCGCGAG CAGATCGGCC AGCTCACCGG CCGGCGTTTC
CTCGACGGTT ACAAGGACGA CGAAGCGCAA TACCAGGCGC TGCTGGATGC CGGTGCCACC
ATCGGCAAGG CCTGGAACCT GCGTCCGGGC ATTGCGCTGA GCGAGGCGCA GATGGCCCAG
CTCACCAGCG ACATCGTCTG GCTGGTGGAG CAGACCGTCA CCCTGCCCGA TGGCAGCACC
ACCACCGCCC TGGTGCCGCA GGTGTATCTG CGCCTGCGCC CCGGCGATCT GGACAGCGGC
GGCGCGCTGC TGGCCGGTGC CAATGTCGAT GCGCATGCGA GCGGTACGCT GACCAACACC
GGCACCATCG CTGGCCGTCA GTTGGTGTCC TTGGATGCGG GACGGATCGA GCACCTGGGC
GGCAGCATCA GCGGCAATCA AGTAGCGCTG ACCTCGGCTA GCGACATCGA CATCCATGGC
GCCAGCGTGA GCGCGGTGGA TGCGTTGAGC GTGCGTGCGG CGGGCAACAT CGACGTGGCC
TCTACAGTGG AGACGCTGCA AGGCGGCGGG CATCAGGAGG CGATCACGCG CGTGGCCGGG
CTTTACGTGA CCGGAGCCAA TGGCAGCGGC GTGTTGTCGG TGGTGGGCGG TGGCGATGTG
ACCTTGCAGG CGGCGCAGGT GCGCAATGCC GGCAGCGACG GGGTGACCCA ACTGGTGGCC
GGTCACGACC TGACCCTGGG CGCGCAGACG CTCACGCACA GCACGGACGC CACCCACGAT
GCACGCAACT ACCAACGCAG CAGCGAGACC ACGCATGCGG TGAGCAGCGT GCAAGGCGCT
GGCGAGGTGG TGCTGGCGGC GGGCCACGAT ATGACCTTGC AAGCGGCGCA GATCGGCGCG
GGCAAGACGC TGGCGTTGCA GGCCGGTCAC GACCTCGATA GCCAGGCCGT GGTGGACAGC
CGCACGCAGT CCAACAGCAG CGTGAGCAAG CGCCACTCGC TGGTCACCTC GAACTACGAT
GAACACGTAC AAGGCACGCA ACTGGGTGCC GGTGGCGACA TCGTGATGCG CGCAGGCAAC
GACCTGACGC TGGCCTCGAC GGCGGTGGCG AGCCAGAACG GCGGCATCGC TTTGGCGGCG
GGCCACGACG TGGCGCTGAC GGCCACGCAG GAACAGCACG ACAGCGTGGT GGACGAGCAA
ACGCGCAAGC ACCATTTCCT GTCGAACAAG ACCACGACCA CGCACGACGA GAGCCACGAC
AGTCTCGCGG TGACCAGCAG CCTGAGCGGC GACACGGTAC ACATCGCCGC GGGTAACGAC
GTGTTGTCGC AAGGCGCGCA GATCGTCGGC ACCGGCGATG TGGTGCTGGC CGCCGGCAAC
AACCTCACGT TGGAGACAGC GCAGAACACG CACAGCGAGG AACACGACAA ACAGGTCAAG
AAGAGCGGCC TGTTCAGCAG TGGCGGTGCC AGCTTCACCA TCGGCGCCAG CAAGCAGACC
AACACGCTCG CCACCACGCA AGTGAGCCAC ACCGGCAGCA CGGTGGGCAG TATCGACGGC
GCGGTCACCC TCACGGCGGG CAATGCGCTG GCGATCAGCG GCAGCGACGT GCTGAGCAAG
ACCGGCACGG CCATCGTAGG CAAGGACGTG ACCATCGCGG CAGTGGAAGA CACCGTGGAC
ACGGTGGAGA CCTCCAAGCA GCACAGCGCC GGCATCAACG TGGGGCTGAC CGGCGCGGTG
GTGCAGGCAG CCGAAGCGGC GTATGGCATG ACCCGGCATG GCAGCCAAGT GAGCGATGAC
CGGCTCAAGG CGCTGTACGC GGTCAAGGCG GCGTATGCGG CCAAGGACAG CGTGGATGCG
TACCAGGCGG CCGCCGCGCA AGGTGGATCG ATGGGAGGCG TTAGCGTGCG GATTGGCATC
GGCGCCAGCA GCGCATCGAG CAAGAGCGCC ACGCATGAGG AAAGCACGGT GGGCAGCCGC
ATCCAGAGCG AGGGCAACGT CACCATCGCG GCCACGGGTG GCGACCTCAA CGTGATCGGC
AGCAAGATCG ACGGCGAGAA TGTGGCGTTG TCTGCGGCGC ACGACCTGAA CCTGCTGAGT
CAACAGGAAA ACAATACGCA GAAGTCGGAC AACAAGAACG CCGGTGGGGA GATCGGCGTG
AGCGTGGGCA CCACCACGGG GGTATACGCG ACCGCCTACG CAGGCAAGGG CGCTGCCAAG
GGCAACAGCA CGCTACACAC CGAGAGCGTG GTCACTGCGA AGGACACGCT GAGCCTGGTC
AGCGGCAACG ACACCACGAT CAAGGGTGCG CAGGCCATCG GCAATCAAGT GCTGGCCCAG
GTGGGCGGCA ACCTGCTGAT CCAGAGCGAG CAGGACAGCA GCGATTACAA GAGCAAGCAG
CAGCAGGCGA GCGCGACGGT GGTATGGGGG TTCAGTGGCA GCAGCGCCAG CTATAGCCAG
CAGAAGGTCA ACAGCACCTA CACCAGCGTC AAGGAGCAGA GTGGGATCCA GGCGGGCGAT
GGCGGGTTCG CGATCAATGT AGATGGCAAC ACCCACCTGA TTGGCGGGGC CATTGCCAGT
ACGGCGGATC CGGCGTTGAA TCATCTCTCG ACCGGGAGTC TGACGGTGGA AGACCTGCAG
AACATGTCCA AAGCCAGTGC GTCCGGGTTC GGGGTGACGG CGGATGCAAG CATGTTCAGC
GGGAGCAAAT ATGCGGCGTC CAAGGGCGTG GCCAGCAACG CACTGGGGAG CGGAAGCTCC
AGCGAGAGCC ACACGAGCAC GACGCGCAGC GATATTGCTG CCGGTGCGGT GGAGATCGGC
AACCACGACG ATGCCGCGCT GGCTGGCCTG GCGCGAAAGG CGTCTGTCCT GGATGGCAAT
GGCGTCGGGG AAGTCGATCA GAAGAAACTG CAGGAGGACG TGGAGTTCCA GCAGCAGGCG
AAGCGGTTAA TTTATGACCA GGCGGTCAAG ATCACGGATG AGGCTTATGC TGATATGTTC
GCAAGAGAGC ACACATTATA TAAAATTGCA TATGATGAAA AAGGGGGGTT AATTCCTCAT
CAAAAAGTGA CAGGGGAAGA AATGGATAAT CTTCAGCCTG CATCTGACGG TAAGGTGCAT
GTAACGCTAA ATGGTATCTT CAATGGGGAG TACGGAGATG ACGTCCTGGC AGAGAAGTAT
GCAAATCAGC ACAGTACGGT TGCGGGGCCA AAGTATTACA TTCACTTCCC TGAGGCAAGT
AGCGATTTGG CCGAGTTGTT GATTGCGGGT TACCAAAAAT ATTTGGAGAA TGATTTTTTT
GGGTTGACTA ATTCAACGCA AGAAATAAAG GATATCATGC TGAAATATGG CCAAACAGGA
CTTCATATTG ATGCACACAG TCGTGGTTCG ATGACGGATG GCAATGCTGA AGAGTCGATT
GCAAAGATGC CTGATGCGTC TGGATTGCTG AGCAATACCA CGGTCTCCTT CTTTGGGCCA
GCGTATAATG CCAAGAAAGC GGATGATATC TTGAGCTACC TCCAGAATCG CGAGGCGCAG
GATGATCCAG AATCCATGGT GTTGACATTG CAAAATCATA TGGCTGACCC GGTTGGTCGT
CTGATCGGTG GGAATCCAGC AACAGGCGGG ACTATTCCTG ATAGAAGTAG TTTGATTGCT
GAAATGATGC GTGCGCTTCT TGGTGGAAAG GATACATCTC ATAATTGTTA TGGTGCCGGT
TCTGGCTCTG GTTGTGATAA TCTTTGGAAT AATACAGAGC CTAAAAAATC AATGCCGTAT
CCAATTAACT CAATTAAATA A
 
Protein sequence
MLCALGLVSL VDAAQAQSAG RIVGAPAAPG NERPTVMTAP NGVPLVNITT PSAAGVSRNR 
YSQFDVGREG AILNNARTQT QTQLGGWVQG NPWLATGSAK VILNEVNGPT SRLNGYLEVA
GQRAEVIIAN SAGIQVDGGG FLNASRVTLT TGTPILSGGA LEGYRVSGGA IQIGGAGLDT
SRADYTDLIT RSLQVNAGIW ANQLQATLGN NVVSADHRSV ATQAADGQAP TFALDVGALG
GMFANKIWLV GNEHGVGVRN AGNLGAQAGE LVVTVDGRLE NTGALQSQQD TQIAASGGIA
NSGTLSAARE LRVGTAADLD NRGGTLDAQR LQIDAASLRN QGGTLTQTGM QALEVTAGSA
SNRTGGSIGA LATAPSRDGG SPAPGGDNTG TDTNNNAGTG TDGGRSPTPT PGGQSPVATA
PLATGTLHIA GLLDNDGGRI GNGGAVRLAA AAGLDNSAGQ LGVAALQVRG DLRNSAGTLE
VQGDADLHLG ALVNDQGRFS VANALNLQAQ SLSNRAGDLR HGGSAASTWQ VAGLLDNQGG
VLTSNAAALQ LQAQTVVNAD GRIVHTGTQG LTLAVQTWSG AGGSIVTPGA LTWQAGSIDH
RNATLTTSQL VVQAATLDNR GGTLVSSGTQ AASVHVDAAL DNGAGGTIAS NGALDLRAAT
LGNAGGQIQQ AGPGLLQIAA QAIDGNGGRL LSNGGLTLTG GRIDLTGGTT AAQQVQIQAE
TLTTAGGSVS SLGDQALQLT VRQGLDNHAG HLASNAGLAI AAGALSNQGG AIAAAGTTDA
TVTVAGRLDN SGGSISGNGR VAVDADTLIN QHGSVLAANG APLQLHAASL LDNSAGGRLA
SGGEFAVQAG TLDNRGGAIE HGGNGTLAVS ADTLQGAGGS LLSLGSLQLR GGVLDLGTGS
TTQADRIDIA AASLRTAGGH LSATGSGPLQ VHLSDTLDNR AGSIASNGAL DLQAATLLNS
DGTLSGAGSA DGRITVSGQL DNTRGRIAAN GATLQIAADQ LINAQGTLSH AGSAGLVIDS
HRVDGSQGTI VTSGMLSLTA ATVDHRGATI GADRIALDVQ QLDNSGGRIV ASGTGASSLQ
ADTLDNAGGI VASAGDLTVV STLLDNTQGT LQHSGNGQLH LTAQTLLGDA GKLLSNGALT
LHGQTTDLRN ATTSAAALSI DTGDLTTAGG TLVATGSQLL TLTARGTLDN SGGSIGGNGV
IALSAQTLRN AQGTLQVAGT GASTVDVAQA LQNQQGKLLL GGSGRIAAAS IDNQGGTLSA
AGTALQVQVA GLLDNHAAGT VSSAGQLSVG SGSLDNSAGT LVAGTDLNVT TTAAIGNDAG
ALQAGGGVQL QSAGLSNRAG SVIGGQVVVD TGGQTLDNSA GTIGSKLGTL QIRSGTLANA
GGRLQSQSDL TLQSNGAAIV NSNSGSTGGI LAGAALQIDG GTLDNRNGAI VAQGQARLGL
ASVDNSSAGV LSAAGNFTLT AATLDNTSGR VQGGQNLTLQ LSGALANQAG LVTTRNLLTL
NAATVDNRNT RANALQGLQA GQLQVQAQAL DNRQGQVISD GAGTVQLSAA LDNSGGQISS
GSSLDMRADA VTNTAGLLRS GGNQRLDARA LSGDGQLQSQ GDLSLTLREG LTNTGELSAN
GTLSIHTDGD LANQGVLRAG NLDVAARNID NAANGQITSQ GLTHLVSNGQ LLNRGLIDGG
VTHLQAAMLD NLGTGRIYGD QLAIAAGSVL NHAETVGGAT RVGTIAARQR LDLGVGQLTN
SDRGLIYSDG DAAIGGALDG NRHATGAAGQ VDNLGSTIDV AGNLDLQAGG VNNIRQNVAV
TQTTTTLAPV RLDQPSWRNN GKNGNAPLRT TSLYSAYEIY YLNPRDIVED TPYITPDGYQ
VRRAVIRVNP QTSAYFFARG ALYRATGERS RLDPRTGTLT LYYFTRDDNN TNPDQVSSGA
SDPFAGMTTD EAGAPAFHYE SDTLRYSNAY GTCTTNCVRL IAQYAYTDPD HILVNPQGTG
PLKLEYNEHY RTATQTVVED VLQPGAGPDA VIRAGGSMRI GANALRNEYA SIAAGGNLAI
VGLGGNPNAN VTNLAYTLYR THSFSNVTTA YNGTTRSWSN PSISEQVGQV GGSIVSGGTL
TIDVGDLSNL NQGRDAPNVR DGAAMANLNV RGAQAGPTGA SASVRGPISV ASQGVGRITT
TVAQADSGNA SNNVGSIGSA ATTTGPRVVN AAGGSPDRIA MGTPDTRAPT GSLFTLRPAS
GHYLVETDPQ FTDYRSWLGS DYLLRQMGYS ADALQKRLGD GYYEQKLVRE QIGQLTGRRF
LDGYKDDEAQ YQALLDAGAT IGKAWNLRPG IALSEAQMAQ LTSDIVWLVE QTVTLPDGST
TTALVPQVYL RLRPGDLDSG GALLAGANVD AHASGTLTNT GTIAGRQLVS LDAGRIEHLG
GSISGNQVAL TSASDIDIHG ASVSAVDALS VRAAGNIDVA STVETLQGGG HQEAITRVAG
LYVTGANGSG VLSVVGGGDV TLQAAQVRNA GSDGVTQLVA GHDLTLGAQT LTHSTDATHD
ARNYQRSSET THAVSSVQGA GEVVLAAGHD MTLQAAQIGA GKTLALQAGH DLDSQAVVDS
RTQSNSSVSK RHSLVTSNYD EHVQGTQLGA GGDIVMRAGN DLTLASTAVA SQNGGIALAA
GHDVALTATQ EQHDSVVDEQ TRKHHFLSNK TTTTHDESHD SLAVTSSLSG DTVHIAAGND
VLSQGAQIVG TGDVVLAAGN NLTLETAQNT HSEEHDKQVK KSGLFSSGGA SFTIGASKQT
NTLATTQVSH TGSTVGSIDG AVTLTAGNAL AISGSDVLSK TGTAIVGKDV TIAAVEDTVD
TVETSKQHSA GINVGLTGAV VQAAEAAYGM TRHGSQVSDD RLKALYAVKA AYAAKDSVDA
YQAAAAQGGS MGGVSVRIGI GASSASSKSA THEESTVGSR IQSEGNVTIA ATGGDLNVIG
SKIDGENVAL SAAHDLNLLS QQENNTQKSD NKNAGGEIGV SVGTTTGVYA TAYAGKGAAK
GNSTLHTESV VTAKDTLSLV SGNDTTIKGA QAIGNQVLAQ VGGNLLIQSE QDSSDYKSKQ
QQASATVVWG FSGSSASYSQ QKVNSTYTSV KEQSGIQAGD GGFAINVDGN THLIGGAIAS
TADPALNHLS TGSLTVEDLQ NMSKASASGF GVTADASMFS GSKYAASKGV ASNALGSGSS
SESHTSTTRS DIAAGAVEIG NHDDAALAGL ARKASVLDGN GVGEVDQKKL QEDVEFQQQA
KRLIYDQAVK ITDEAYADMF AREHTLYKIA YDEKGGLIPH QKVTGEEMDN LQPASDGKVH
VTLNGIFNGE YGDDVLAEKY ANQHSTVAGP KYYIHFPEAS SDLAELLIAG YQKYLENDFF
GLTNSTQEIK DIMLKYGQTG LHIDAHSRGS MTDGNAEESI AKMPDASGLL SNTTVSFFGP
AYNAKKADDI LSYLQNREAQ DDPESMVLTL QNHMADPVGR LIGGNPATGG TIPDRSSLIA
EMMRALLGGK DTSHNCYGAG SGSGCDNLWN NTEPKKSMPY PINSIK