Gene Caul_2110 details

Gene Information

Locus tag: Caul_2110
Symbol:
ID: 5899565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2257831
End bp: 2270490
Gene Length: 12660 bp
Protein Length: 4219 aa
Translation table: 11
GC content: 70%
IMG OID: 641562599
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_001683736
Protein GI: 167646073
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.328655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCA AATCCAAGCC GCTGTTCTTC CCGGTTCACC GGTCTCCGGT CGAGGGGGCG 
CGGCAGGCGC TGTTCCTGGG CGCCAGCGCC ATGGTCCTGC TCGCGGCGCT GGGGACGGGC
GAGGCGTTCG CCGGCCCGTC GGCGACGACG CGCGCCGTCG CGCGCCTGGC GGCCACGGGG
GCGGGCGCGC CCTCGGTGGC GGTTCCGGTC GTGCCCGGCG TGACGCCCAA CGCCGCGATG
GCGGCCGCCC GCGCCTTGCA AAACGCGACC AGGGTCCAGC AGGCGGTCAA TCTCGCCCAG
CAGGCCCAGG TCGCCGGCCG CCAGGCGGCC AGCGCCTTGA TCAGCAAGGC GCCCAACGGC
CTGGTCGCCG GCGGCTTGGC GCCCGCGACC GACGCCGCCT ACGCCACGGG CGGCATGCTG
CTGTGGCAGG GCGCCAACAC ACCGACCCAA TCGACCTCGA CCTCGGGCCG GATCCTGGTC
GACATCAAGC AGACCGACTC CCGCGCCATC CTGTCGTGGG ACACCTTCAA CGTCGGCGCC
AACACGACGC TGAACTTCGA CCAGAGCCAG AACGGGGTCG CCCAACCCGA CTGGATCGTG
CTCAACCGCG TTGTCGGCCA GCTGAATCCC GTGACCGGCT TGCGGGATCC GTCCAAGACG
CCCGCCCCCA GCCAGATTCT GGGCGCGATC ACGGCGCAGG GCACGGTGCT GGTGATCAAC
CAGAACGGCG TGCTGTTCGG CGGGACCTCA CAGATCAACA CCCGCTCGCT GATCGCCAGC
AGCCTGGAAG TCGGGCGAGG CGTGTCCCGG GACGCGGCCG GCGTCGTCCA TGACCGCACG
ATCGCCGACC GGAACGCCGA CTTCCTGACC GGGGGGCTGT TAGGCGTCAA CGTGTCGGCA
AACGACCTCG GCGTGTCGAC CTTCTCGGCG ATGAACGACC CGACGGGTCT TGGTCCCGCG
ACCATCGAGG GGGCTATCAC CGTCGACGCG GGCGCCCAGA TCACCGCCGG GGACGGCGGA
CTCATCCTTT TGGCCGGACC GCAGGTGGTC AATTCGGGCA TGTTGAGCGC CCAGCGCGGA
CAGGTCAGCC TGGTCTCCGG GCGATCGTTT ATTCTGACTG CCTCGGACGG ATCGGCCACC
AGCCTCGATC CCAATGTTCG CGGCCTGGTC GTCGGTCGGG GCCATGCGGT GACAGGCGGC
GGCGCTGGCG GCTTCTATGT GCGCAACAGC GCCTCGGGCC TGATCCAGTC CCGCGAGGGC
TATGCCTCGC TCTATGGGGC GGTGATCAAC GAGGGGGTGA TCACCGCCAC CACCAGCGTG
TCGCGTAACG GCTCCATCGA CCTGGGCGGC GCGAACGGCG ACGCCATTCA GCTGGCTCCG
GGCAGCGTCA TCGCCATCAC CCCCGACGAC ACCGGTTCCA TTCCGCAGGA CGCCCAGTCC
CTGGCCGCCT TCAAGCCGTC CAGGGTGACG CTGGGCAATG CCGCATCCCA GATCGAGATC
GGGTCGAACA GCCTGATCTA TGCGCCCGGC GCGACGGTGG AGGTCGGGTC CAAGCCCGGC
GTCGACACCG ACTCCACGGC CACGGCGGGC TACGCCAATG CGCGAATCTT CATCGACAGC
GGCGCGACCA TCGACGTTGC CGGCCTGAAG AACGTGATCG TGCCGGCCTC GCGCAACGTC
ATCGTCATCG ATCCGGTCAA GGGCAACGAA CTGGCGGATT CGCCGCTCTA TCGCCAGGGC
TTCCTCAACG GCGCCAAGAT CTATCTGGAC CCGCGGATAT CGGGCGTGCG CGAGGACGGC
GTGGCCTATA TCGGCTCGCC GTTGATCTCG GCCGAAAGCT ACGCCCAGCA GGTGGGTGTG
ACTGCATCCG AACTGTTGAC CAAGGGGGGC GCCGTCACGC TCGGCGTCCC CTCCGCCAGC
CCCACCGCCG GCGTCGTGAC CCAGGCGCCC GACGTGATCG TCAAGCGCGG CGCGACGATC
GACATCTCCG GCGGCTGGCG GACGTTCGAA GCCGGCCGCG TGCGCACCAC CCGGCTGATC
GACGCCAATG GCGGGATCGT GGACATTGGC TACGCCGACC CCAACGCCAC CTATGTCGGC
GTCTATGAGG GCTTCGTCGA CGTCCAGCCG CGGTTTGGCG TCACGCGGAC TTATGTGAGC
CCGATCCTCG ACGGCGGCGA CTTCGTGCCG TCCTACACCG AAGGGACCGA CGCCGGTTCC
CTGACCATCA AGTCGTCCCA GCCGCTGTTC GAAGGCACGC TCTACGCCGA CGCCTTTCCC
GGCCTGGCGC AGAAACAGGC CGGCCAGGTC GGCACGGCCA AGCCGGTGCT CTATGGCGAC
AGGCGCCGGC TCCAGGCCGT CTCATCGCAA CTACCGAGCG GCGGCCTCCT ATCGATCCAG
GAACTGGGCC TGTCGCCCAA TGGCGTCACC GGCGGCGGCG ACATCCGCAT CATCGACGGC
GCGATGCCCG CGACGGCTGA CAGCCTGACC TACGGCCAGT CGCTGGTCAT CGACGGCCAG
GGCAACCTCA GCATGGCGGC CCGGCCAGTG GAATCGATGA TCCCCGCCGC CCAGCGGGGC
GTGCTGACCT TCGGCGCCGA CACGCTGTCG GCCATGGGTC TGGGTCAGCT GTCGTTGTTT
ACGTCCGGCG CCCTGACGGT CGAGTCGGGC GCCGACCTGA CCCTGACCCC GGGAGGAGTG
TTCACGGCCA CCACCGGCCG CGCGATCACC ATCGACGGCG ACATCACGGC CGCCTCGGGA
ACCATCGCGC TGGAGACGGT CGCCACCGGG CGCGGTTCAG TGTTCAAGGC CGATCCCGCC
GGGCCGGGCA GCTATGACGT CACGATCAAC GGCCAATTGT CGACCGCCGG CCTGTGGACC
AACGACCTGG GCGCCAGCAG CGACGACCTG GGCGGGGCGG CCTATGTCGA TGGCGGCAAG
GTGTCGATCT CCGCCGCGCC GCGAGCGCTG CTGACCGACG ATGTCGTGCC GACCAGCTCA
GGATCGGGGC CGGCCACCAA TGTCGATATC AGCGGCAGCA TCCTGATCGA CGGACCCCGG
TCGCGCATCG ACGTGTCGTC GGGCGGCTAT GTCGCGACGG ACGGCGACCT GGACCTGACC
GCCCGTGGCG GCGCCGTCAC CCTCACTAGC GACACGACCT ACTTTCAGCT CACGGCGCCG
CCCGGGCAAG GCTATGTCGC GGGTCAGGCG CCGGGCATCC GGGTCACGGG GCTCGCCAAC
GGCGCGGCGC CGATCGTTCC GGTGAACCCC AGCGAAATCA CGGCGCGAGT CAGCATCGGC
CAGGACACCA TCGTGGGCCA TGGCTTCGCC GGCGGCGGAA CCTTCTCCCT GACCACACCC
GCCATCGCCT TCGGCGACGG CGTGGCCAGC ACCGGCACGG AACTGCCGCT GGACTTCTTC
TCCAAGGCCG GGTTCTCGAC CTACACCATC AAGTCCTACG GGACGGACCT GTCGCCCAAC
ACCTTCAACA ACGGCCTGGG CGGCTACAAC GCGGTGCTGA AGACCCAGGT GCTGACGGTC
GGCGACGGCC AGACCCTGAA CCTGACCCAG AGCGGCTATT CGACCCGGCC CGACGCCGCT
CAGACCGCCG CCCTGCGGAC CTTGCGGACC GGCGGCGCCG TGACGAGCGT CCTGACCGCG
GGCGTCCAGC CGCAAGCCTG GGACCAGGCG CCGATCAGCC TGACCTTCGA CGGCCTGATC
GAGCTGAAGG TCGCCCAGGG CGGCCAGATC ATCGGTGCGC CCGGCGCGCG GATCGGCGCG
TCGCAGATCC TGAACCAGGG CACGATCCGC TTGGCGGGCG GCGCGATCAA CCAGGTCAAG
TCGCTGCCGG CCCTGTATGC GACCACGACG GGGCCTAACG CGGCGCTTTC TGCGGCTAGC
CTTTCCGACA TCTTCACCGT CAAGCCGGAC GGCACGATCG ACGAGGCGGC TCCGTCCAAG
ATCGATCCCA CCCGCACCAA CCGCCAGGTG GCGGCCCAGG GCGGCATCTA CCTGACCGGC
GACCTGCCGG CCGACGTCGG CGTCCAGCTG GACGCGGGCA GCGTGACCGA TCTCTCCGGC
GTCAGCATCC GCAATCCCTA CGCCATCGGC GTTGACGGCC GCCAGATCGT CACCGGTCGC
GTCTATGGCG GCGGGGCGAT CACGACAGCG CCGACACGGC GTCAACAGGG TGCGCTGTTC
GCGGATTCGA CGTTCTCGCG CGGCGTCTAT CGCAACCTGT CGTACCAGAG CGGCAGCTTC
AGCGCGGCGG TGCTCGCCGC CGACGTTCAG GGTTCTGACT TCATCGCCGC GCCGGGCGCG
GCGGTCAATC TGTCGGGGGT CTCGGACACC TTCGATCAGC TCCAGGCGGA CGGGGGTTAC
GCGCCCACCC TGCAATGGAG CAGCGCCGGC GCGCTCTCGC TGGGGGCAGG GGGCGTGCTG
ACCGGCGCGA CCATCACAGC CAAGGGCGGC GGGCCGGCCG CGGCGGGCGG GGTGCTCGTC
CTGTCCAACC CGACCCTGAC CCAGAACGAC CCAACTTCCC CAACCCGAAA TCTCTTTTCG
GCCAATCAGA TCGAAGCCGC GGGCTTCGAT ACGCTGGTGG TGCGCGGCGC CCTGCGCGGC
CAAGGCGACG TGGCCCTGAC GCTGGACGGA GCCTTCGAGC TGACCTCGCC CGTCTATGAC
GGCGTGGCGA GCCTGAACGA TCCTTCCGTC CGCCAGAACC TTTCCCCAGT GGTCGGCGCC
ATTGGTCAAC TGGATATCAC GGCCTCTTAT GTTCGGTTGG ACGGCGCATT CCAAAGCCTC
GCCACACCGG CCGTCGGAAC GGCGGGAACC GGTCAAGTGA CGCTCCATGC CCAGTCCATG
GACGTCGCTG GGGCGGTGCT GTTCGATCGT TCCGTTGCCA ACACCACCTT CGATGTGACC
GGGGATCTGC GGTTCTCTGG CGTCGCCCCT TACCAGGTCG CGTTCGACGT CGGAACCGCC
GCGCCCAGCC TGGCCGGGCA ACTGGCGGTG AACGGCAACC TGCTGCTCCG GGCGGGCCAG
GTCTATGCGA CCACCGGGTC TAGTGTCTTT GTCAGTTCGG CGGCCTCCGA TGGCGTGCTG
ACGGTGGAGC GCGCTTCCTC GGCCACCCCG GCCACACCCT ATTCCGCCGG CAGCAATCTG
ACCTTGCAGG CCGCCAGCAT CGTGCAGAAT GGCGTACTGC GCGCGCCGCT CGGCGTCTTG
ACCCTGGGGG GCAACAGCGC CTCGCTCTTC GCGCCCGCGA CGCGCTCGGT GGTGCTGGGC
GAGGGGGGGA TTACCTCGGT GTCGGCTGCC GGCCTGTCAA TCCCTTACGG CACGACGACC
GACCAGACCG AATATTTCTT CAACCCGACC AACGCCAACC CGCTGACCGC GCCGCCCACC
GGCGTCCTGA CCCTGGCGGC CGGTGCGGTG ACCACGGCGG CGGGCGCGAC GGTCGATATA
AGCGGCGGCG GCGACGTCTA CGCCTATGAG TTCGTCCCCG GTCCGGGCGG CACGCGCGAC
GTGCTCAGCC AGTTCAATCC GGATGTCTTC ACTGGCAACG ACGGCTATCA GTATGCTGAT
CATAGGCAGG TCTACGCGAT CGTGCCGGGC CTGTCGGATG GCTCGATCTC GCCCTATGAC
CCGATCTATT CGTCCAATTA TGGCGAGCTC TATCAAGCCG CCAACGCTGG CCGCCGGGTC
TATCTCGAGG GCGGCCAGGG CCTGGCCGCG GGCTGGTACA CCCTGCTGCC GGCGCAATAC
GCCCTGTTGC CCGGCGGGAT GCGGGTCGTG GAAAACACCG CCGCCAGTGG CGTGGCCGCA
GGCGCGACCG CTGTCCGACG CGACGGGACC CTGGTGACCA CCGGACGCTA CGGCGGGGTC
GGCGGCGTCG AGGAGTCGCG GGTCCGCGTC TTCGAGGTGC AGAGCCAATC GGTGATCCAC
GCAGGCTCCA ACATCGTCCA GACCTCGGCC AACACGGCGT TCGCGGCGGC GGCGGCGAAG
CGGGGCGAGG CCTCGCCGGT CCTGCCGCGG GACGCCGGAC GCCTGGTGTT CGCGCCGCTG
ACCTCGCTGG ACCTCAACGG CCGCCTGGTC ACCACGCCCG GAAAGGGCGG GCGCGGCGGC
CAGGCTGACA TCAGCGGCCA GGCGATCGAG ATCGTCACCC AGCGCGGAAC GCCGACGGCC
GGCGTCATCC AACTGGACGC CGACCAACTC AGCGGCCTAA ACGTCGACAG CCTGCTGATC
GGCGGCGTTC GGACGGACCG GGCCGACGGC TCCACCGGCT TGGCCGTCAC GGCCAACACC
ATCACCGTCG CCAACAACGC GACCGCCCCG CTGACGGGTC CCGAAATCCT GCTCGCCGTC
GATGGCGCCG GGAGCCGGCT GACGATTCAG GACGGCGCGA CGATCACCGC CACGGCGAGC
AGCGCCGTTC AACGGACGGG AGACTATCTG ATCGACGGCG CCGGCGCGTC GATGACCGGC
CAGGGCGCGC TCGTGCGCGT CACCAGCGGC TCCGATCGCG ACGTCGTGCG CACCAATGTC
GATGCGGTCT CGACCGGAGG ACTGGTGGTC GGCGCCGCCA CCCTGATCGG CAAGTCGATG
CTGCTGGATT CCAGCGCTGG CTTCACCATC GCCCCGACCG CCACGCTGGC GGCGGACACC
CTGACCTTGA GCGCTTCGGA GATCCATTTC GCCGACGCGC CTGACAGTCT GGCCGGGCTG
GTGCTGACAC CCGGCCTGCA GGCCGCCCTC GGCCGGGCCC AGGGCCTGCG GCTGCGGACC
GCCAATCGCA TTGACTTCGC GGCCGGTGAC TATCATTTCG GCGACCTGAC CTTGGTCGCC
CCCGGGGTGG CCTTGGCCGG CGGGTCCGGC GACGTGCGGA TCTTCGCCGA CGACCTGCGC
CTGGAGTCCC GCTCCGTCGC GACGGCGGCG TGCGGTGCGA GCGGGCCTCT GGCTTGCGGC
ACGGGGGCAC TTACGCTGGA CGGGCGCACC GTGACCTTAG GCGACGGCGC GCTCCATACC
TACGGCGCCG GCGGCGGCGT GTCCGTGAAC GCCCGCGAGG GCCTGTTCTA TGACGGCAAG
GGCTCGCTCG ACGTCGGCGC GGCCGGCCTT ACGATCCAGA CGCCGTTCCT TGGCGATCGC
GCGGCGCAGG TTGCCTCCAG CGGCGCGACT CCGACCATCC CCAGCCTGTC GCTCGTCTCG
ACCGGCGTGG TGACCATCGC CAACGCCGCG GGCGGCGCGC GGCCGACGGC GGCCGGCGTG
GCCGGCTCCA GCCTGACGAT CGCCGGCCGA AGCCTTTCGG TGAGCGGCGT CGATGTGCGC
GCCACCGCCG GCAAGCTGAC CTTGACGGCG ACGGACGCCC TGACCATCGG CGCGGGCGCG
CTGATCGAGA CCCCCGGCTA TGCCAAGAGC TTCGGCGACG CGGTCGATCC CTATTCGGTC
TCGGCGCCGG GCGGCCTGCT GACCCTGACC TCGGTCAACG GCGACGTGCG GATGGCGGCG
GGCTCGACGC TGTCGGTCGG CGGCGGCCTT GGCGCCAGCG GGACCCTGGC CGTCAACGCC
GGCAAGGGCG AAGCCGTGTT CGGCGGCGCT GTCGATGCAT CGACCCCTGA CGGCGGCGCG
CGGTTCGCGC TGAACCAGGC CGGCGGCTTT GATCTCTCGG GCTTCGTGCG GTCCACCAAG
GGCGGCTTCG ATGGGGGCAT GGACGTTCAG ACCGGGGCAG GCGACCTGGT GCTGGCCGAT
GGCCTGGCGC TGAAGGCGCG CAGCGTCTCG CTGGTCGCCG ACGGCGGTCA GGTGGCGGTC
GATGGCTCGA TCGACACTTC GGGCGCCAAC GGCGGCGACA TCCGACTGTT CGGCGCGACG
GGCGTGACGC TGGGATCTAA GGCCGTGCTC AACGCCCGGG CTCTGGGCTA TGACGACAGC
GCCACCCGCA CCGCCGAGGG CGGAACCGTT CAACTCGGCG TCGGCCAATC CGGCGCGATC
GACGTCGCGA CCGGCGCGAA GATCGATGTC GGCGCGCGGC ATGACAAGGC CCGACTGGTG
ACCACGGTCG AGAACGGCGT GGTCAACTAT CGCCAGGTCG CCGCCGACAC CGGCGGCGCG
CTGGTGTTGC GCGCGCCGGT GCTGGGGCCG GCGGGCGGAC AGACCGTCGA TGTCCACTTT
GCCGGCTCGG TCGTCGGGGC CGACAGCGTC GTGCTCGAGG GCTATCGCGC CTTCGATCTG
GCCGCGATCG CCGCCGATAG CCGCTTCACG GGCGTGACGG TGGCCGGGCA GACGGCCACG
CTGAACCTGG CCGCCACGGC CGCTGGGCGG GAGAACTTCC TGGCGGGCAC GGGAGTGGGA
ACCCTGTCCG ACTTCATCAA GACCTTCGAC GTCTCCAGCA TCTATGGCCG CCTTGGCGGC
CTGGCCGGCC AGGCCAACTT CCACGCCCGC CCGGGCGTGG AACTGAACTA TGACGGTTCG
ATCCTCCTGG CCTCGAACTG GAACCTTGGG GCCGGCACGG TCGATGTGGC CGGCGCCATG
AACGCTGGGC TGATGGCCGC CCATCCTGGA ATCTCGGGCG CGGTCTATGT CGTGCCCGGC
TCGGAAGGCC GCATCCTGGC TAACTACACG GACATGACCT ACCACGTTGG CGGCAAGGCG
ACCGGCGAGG GCGGGGTGCT GACCCTGCGA GCGACCGACG ACGTCACCCT CAACGGCAGC
CTCACCGACG GCTTCTTCAC CTTCGCCGAC CAAAGCGACC CGGCCTATCT CAACCGGGCG
CTCGGCGGCG GGACCCGGAC CTATGACGGC GTGCTGAACT CCACCTGCAC GGGGTCGTGC
GTCGTCGGCG ACTTCACCAC GGGAGCGGCG CCGGCCAATA CCGTCACCGT CAACTTCCCC
GGCGCGACGG GCCTGGGGAA CCAGGAGATC AACACGGCCA ATCCCGCGCC TTACAACGCC
GCGGCCAACA GCCCCGCCGC GCTCGGGGTC GGCGCGGGCG GCAAGGGCGA CGCGATCGGC
AGCGCCGAAC TGTTCCCCTT GATCGAGACC GCCGGGGGCA CGCGCGCGGT CGACTCCTGG
TCCTACCAGA TCACCGCCGG GGCCCGCGCC TCCGACGCGG GGGTGTTCAG CGTTGATCCC
CTGCGGGTCC AGGCGGGCGC CGCCGGAAAC CTGAAAGTGG CGGGAACCGC GACCTATAGC
TATGGCGGGG TCGCGGGAAA CTCGAGCCTG ACCAACACCC TGCTGCTGGG CGTCGCCAAT
GGCGACAAGG TCGCGGCCGA CCAGTGGGTG CAGGCCCAGA TGGCCCAAAA CCCCGGTCTC
ACGACGCAGT CCTACACGCG GCTGCAGTGG ACCAGCGCTC CGGCGGCGCT TCGCACGGTC
CTGGCCCAGC GCGCCCTGGC CTTCCTGGCT CAACACCCCG GCGAGGTCGC CTTGACCGGC
CCCGCCAACG CCCCGACCGG CGTGTCGACC AGCCTCAGCC TGGCCGGCGC CTTCCTGGCC
CAGTTCGCCA ACGACTGGCC GACGCTGAAG GCCAACTATT CCGCCCCGCG GGCATCGACG
CCTTCGCCCA CCACGGTGAC GACCACGACC TTGATGCGCA CCGGCACGGG CTCGATCACC
CTGGCGGCGG CCGGCGACAT CGACCTGCGC AATGGCGAGG CGGTGACCTA TCGCAACATC
TTGACCGGCG CCGACGCGCC CGGCCCCGGC CCGACCGCCT ATCAGGTCGG CGGAGTCGCG
GTCTACACCG CCGGTCACCG GGTGATCCCC GAGGCGATCG ACGCGGTCGA TCCCAAGACT
GGCGCGGCCT TGGTCCTCGA CCCGTCCGCC TACAGCCAGC CGGCGGTCCT GAAGGCCCAG
ACCGCTGGCG GCGGCGCCGT GCGCGGCCTG GCGCTGTCCC AGCCCGTCTA CGCCACCGGC
GGCGGCGACG TGTCGCTGCT CGCTGGCGGG GACGTGCTGA GCCGCCGCGA TCTCTATACG
GCGGCCTGGG TCGACGACCT TCAGATTCAG AATCTCACAG GCCTCGCGGG CTATGTCGGA
ACCGGCGAGC AGCCGTGGCG CATGGGTTTC GTCGGCATGG CGACCGACCT GCGGATCAAT
CCGCAGCTGT TCACCGAGGG CGCGGGCACG CTGGGCGGCG GCGACATCCG CGTGGTGGCC
GGCGGCGACG TGAGCGATCT GTCGGTGGTC GCCGACACCA CCGTGACCAC GGCCAACGTC
GCCGAGGCCG GCGGCGCGGC GCGGCCGGGG CGGACCCTGC TGACGTTCGG CGGCGGCGAC
GTCATGATCG AGGCCGGCGG CGAGATGCTG GGCGGCCGGA TCGACATGGG GGCCGGCCAG
GGCGAGATCC GCGTCGGCGG CGACCTGATC CGCGCCGCCG GAACCGATAA CGGCCTGAGG
CTGCGGCTGA GCGACGCGAC GATCAACCTT TCGGTGCGCG GCGCGGCGTT GGTTGAGGGC
ATAACCGCCC TGGGCGTCAA AAGGAACGCC AGCAGCGGCA ACATCGACGC CAACGCCACC
AGCACGGCTA ACGCCCTGGG CTTCTACGCG CAGGACGCCG GCGTTTCGGT GCTTTCCAAC
GGAAATCTGA CGATCGCCAA CAGCGCGACT CTGCGCGGTG ACGGCGGCGC AACCAACACG
CAGGCCGCAA ACGCTTTGGA GGCGATCTAT CCCGGGTCGT TGTCGGCCGT CAGCCTGTCG
GGGGACCTTG CGTTCGGTTC AAACACCGAG ATCCTGCTGA CCCCCACCGC CAGGGGAACC
CTGACCTTGG CGGCTGGCGG CGATATCGCG CCCGCCACCA TCGCCATGCT GGACAACGAT
CCGGGCGTCA CCCCCGGCGT GTTCTCCAGG TTCATTCGCA GCGGATCGTT CGCGGCCAGC
GGCATGGCGT TCGACTTCCC GGTGGTCCTG CCGACCACCA GCCTGGCCGC GCGCCGGCTG
CTGCACAACC CGCGAGTCCC GCGGAGCGGC GACGCGGCTC CCAACCGAAT CTATGCCGGC
GGCGACATCG GCGCCCTGAG CCTGTCGACG CCAAAACAGA CCCGGATCGG GGCCGGCCGG
GATATCGTCA ACATGATGTT CTTCGGCCAG AACCTGAACG TCGGCGACGT CACCCGCGTC
GTCGCGGGGC GGGACATCAC CGCCACCACC GTGGTGCGCG GCGCGCCGCT CGGATTCGTC
GGTCTGACCG ATCCGTTGCC AGTGCTGCAG GGCAACAGCT TCGTGATCGG CGGCCCGGGC
ACGCTGTCGC TGGAAGCCGG CCGCGACCTG GGGCCGTTCC TCAACTCGGC GATCGACGAC
TTTCAGGCCG CCAGCGACAT CGTTCAACCG CGTCACATCA CCTTCGGCGG CGGCGTGATG
TCGGTCGGCA ATGAGTGGAA TCCCTGGCTG GCGCCGGTCG GGGCCAATCT GGACATCCAG
TTCGGCGTCG CCAAGGGCGC CGACTTCACA GCCCTGCGCG AGACCTATCT CGATCCGACG
AATCTGGCGC AATTTCCCGA CTATCTGTTC GTCCAGGCCG AGGATGAGCA CGGAGCGATC
ATCACCGACC GGACCAAGCC GATCTACGGT CCTGCTCTGG TCGCCTGGAT GCAGGCCAAC
GCCGGCGAGA CGCTCCAGGC GGCGTTCGGT ACGACCAAGG TCGACTACGT CCAAGCCTAC
CAGGCCTTCA CCGGCCTGTC CGCTCTGCGC CAGCGCGGCT TCCTGCAGCA GGTCTATTTC
AACGAACTGA CCGTGACCTC GATCCCAGGT CCGTCATTCC AGAAATATTC CCGCGGCTAC
ACGGCGGTGA ACACCCTGTT CCCGGCCTCG CGGGGTTATA CGGCCAACGA CCTGACCGGC
GGCTCAAACG GCGCCAACCT GCTGGTGCGA ACCGGTGATC TCGACCTGCG CCTGGCGACG
ATCCAGACCG CTCGGGGCGG GGACATCTCG ATCCTGGGTC CCGGCGGGCG GGTGCTGGCC
GGCTCCACCG TCAGCACCGC CCAGCAGGCC GCGCGCCGCA ACTACGCGGG GCGTGGTCTA
TTCAGCGCCT TCGGGGCGCC AGTTGCGCAG ATCAGCGCCA TTCCTATCGG CTTCGAGGGG
GTTTTGACCC TGCGCGGCGG CGACATCTCG TCGTTCACGG ATGGCGACTT CATCCTCAAC
CAGAGCCGGC TGTTCACCGA ACAGGGGGGC GATGTCGCCA TGTGGTCGTC CAACGGAGAC
CTCAATGCCG GCCAGGGTCC CAAGACCTCG CCCAACTTCC CGCCGGTGGT GGTCAAGGTC
AGCGACAACG CCAATTCGGA AGTCGACCAG ACCAGCGCCG TCAGCGGGGC CGGCATCGCC
GCCTTCCAGC CGGCCCCTGG CGTGGCTCCG CCGAATGCCT ATCTGATCGC CCCGCGCGGC
ACGGTCGATG CCGGCGACGC GGGCGTGCGG GTGGCCGGCA ATCTGTTCGT CGCCGCGCTC
AGCGTCGCCA ACGCCGACAA TTTCAAGGCC AGCGGATCGG CGATCGGGGT GCCAACGGCG
GCGGCCGCGC CGGTGGTCGG GGCCGAAACC TCGGCGGCGG GCAACGCCGT GACCCAGGCC
GCCCAGCAGG CGGTTCGGGG CCGCGACCGC CCTGACCGCT CGATCATCAC GGTCGATGTC
CTGGGCTTTG GCGAGGCCGA TTCCTGTCCG ACACCGAACG ATCCGAAATG TCCGCGCTAG
 
Protein sequence
MAVKSKPLFF PVHRSPVEGA RQALFLGASA MVLLAALGTG EAFAGPSATT RAVARLAATG 
AGAPSVAVPV VPGVTPNAAM AAARALQNAT RVQQAVNLAQ QAQVAGRQAA SALISKAPNG
LVAGGLAPAT DAAYATGGML LWQGANTPTQ STSTSGRILV DIKQTDSRAI LSWDTFNVGA
NTTLNFDQSQ NGVAQPDWIV LNRVVGQLNP VTGLRDPSKT PAPSQILGAI TAQGTVLVIN
QNGVLFGGTS QINTRSLIAS SLEVGRGVSR DAAGVVHDRT IADRNADFLT GGLLGVNVSA
NDLGVSTFSA MNDPTGLGPA TIEGAITVDA GAQITAGDGG LILLAGPQVV NSGMLSAQRG
QVSLVSGRSF ILTASDGSAT SLDPNVRGLV VGRGHAVTGG GAGGFYVRNS ASGLIQSREG
YASLYGAVIN EGVITATTSV SRNGSIDLGG ANGDAIQLAP GSVIAITPDD TGSIPQDAQS
LAAFKPSRVT LGNAASQIEI GSNSLIYAPG ATVEVGSKPG VDTDSTATAG YANARIFIDS
GATIDVAGLK NVIVPASRNV IVIDPVKGNE LADSPLYRQG FLNGAKIYLD PRISGVREDG
VAYIGSPLIS AESYAQQVGV TASELLTKGG AVTLGVPSAS PTAGVVTQAP DVIVKRGATI
DISGGWRTFE AGRVRTTRLI DANGGIVDIG YADPNATYVG VYEGFVDVQP RFGVTRTYVS
PILDGGDFVP SYTEGTDAGS LTIKSSQPLF EGTLYADAFP GLAQKQAGQV GTAKPVLYGD
RRRLQAVSSQ LPSGGLLSIQ ELGLSPNGVT GGGDIRIIDG AMPATADSLT YGQSLVIDGQ
GNLSMAARPV ESMIPAAQRG VLTFGADTLS AMGLGQLSLF TSGALTVESG ADLTLTPGGV
FTATTGRAIT IDGDITAASG TIALETVATG RGSVFKADPA GPGSYDVTIN GQLSTAGLWT
NDLGASSDDL GGAAYVDGGK VSISAAPRAL LTDDVVPTSS GSGPATNVDI SGSILIDGPR
SRIDVSSGGY VATDGDLDLT ARGGAVTLTS DTTYFQLTAP PGQGYVAGQA PGIRVTGLAN
GAAPIVPVNP SEITARVSIG QDTIVGHGFA GGGTFSLTTP AIAFGDGVAS TGTELPLDFF
SKAGFSTYTI KSYGTDLSPN TFNNGLGGYN AVLKTQVLTV GDGQTLNLTQ SGYSTRPDAA
QTAALRTLRT GGAVTSVLTA GVQPQAWDQA PISLTFDGLI ELKVAQGGQI IGAPGARIGA
SQILNQGTIR LAGGAINQVK SLPALYATTT GPNAALSAAS LSDIFTVKPD GTIDEAAPSK
IDPTRTNRQV AAQGGIYLTG DLPADVGVQL DAGSVTDLSG VSIRNPYAIG VDGRQIVTGR
VYGGGAITTA PTRRQQGALF ADSTFSRGVY RNLSYQSGSF SAAVLAADVQ GSDFIAAPGA
AVNLSGVSDT FDQLQADGGY APTLQWSSAG ALSLGAGGVL TGATITAKGG GPAAAGGVLV
LSNPTLTQND PTSPTRNLFS ANQIEAAGFD TLVVRGALRG QGDVALTLDG AFELTSPVYD
GVASLNDPSV RQNLSPVVGA IGQLDITASY VRLDGAFQSL ATPAVGTAGT GQVTLHAQSM
DVAGAVLFDR SVANTTFDVT GDLRFSGVAP YQVAFDVGTA APSLAGQLAV NGNLLLRAGQ
VYATTGSSVF VSSAASDGVL TVERASSATP ATPYSAGSNL TLQAASIVQN GVLRAPLGVL
TLGGNSASLF APATRSVVLG EGGITSVSAA GLSIPYGTTT DQTEYFFNPT NANPLTAPPT
GVLTLAAGAV TTAAGATVDI SGGGDVYAYE FVPGPGGTRD VLSQFNPDVF TGNDGYQYAD
HRQVYAIVPG LSDGSISPYD PIYSSNYGEL YQAANAGRRV YLEGGQGLAA GWYTLLPAQY
ALLPGGMRVV ENTAASGVAA GATAVRRDGT LVTTGRYGGV GGVEESRVRV FEVQSQSVIH
AGSNIVQTSA NTAFAAAAAK RGEASPVLPR DAGRLVFAPL TSLDLNGRLV TTPGKGGRGG
QADISGQAIE IVTQRGTPTA GVIQLDADQL SGLNVDSLLI GGVRTDRADG STGLAVTANT
ITVANNATAP LTGPEILLAV DGAGSRLTIQ DGATITATAS SAVQRTGDYL IDGAGASMTG
QGALVRVTSG SDRDVVRTNV DAVSTGGLVV GAATLIGKSM LLDSSAGFTI APTATLAADT
LTLSASEIHF ADAPDSLAGL VLTPGLQAAL GRAQGLRLRT ANRIDFAAGD YHFGDLTLVA
PGVALAGGSG DVRIFADDLR LESRSVATAA CGASGPLACG TGALTLDGRT VTLGDGALHT
YGAGGGVSVN AREGLFYDGK GSLDVGAAGL TIQTPFLGDR AAQVASSGAT PTIPSLSLVS
TGVVTIANAA GGARPTAAGV AGSSLTIAGR SLSVSGVDVR ATAGKLTLTA TDALTIGAGA
LIETPGYAKS FGDAVDPYSV SAPGGLLTLT SVNGDVRMAA GSTLSVGGGL GASGTLAVNA
GKGEAVFGGA VDASTPDGGA RFALNQAGGF DLSGFVRSTK GGFDGGMDVQ TGAGDLVLAD
GLALKARSVS LVADGGQVAV DGSIDTSGAN GGDIRLFGAT GVTLGSKAVL NARALGYDDS
ATRTAEGGTV QLGVGQSGAI DVATGAKIDV GARHDKARLV TTVENGVVNY RQVAADTGGA
LVLRAPVLGP AGGQTVDVHF AGSVVGADSV VLEGYRAFDL AAIAADSRFT GVTVAGQTAT
LNLAATAAGR ENFLAGTGVG TLSDFIKTFD VSSIYGRLGG LAGQANFHAR PGVELNYDGS
ILLASNWNLG AGTVDVAGAM NAGLMAAHPG ISGAVYVVPG SEGRILANYT DMTYHVGGKA
TGEGGVLTLR ATDDVTLNGS LTDGFFTFAD QSDPAYLNRA LGGGTRTYDG VLNSTCTGSC
VVGDFTTGAA PANTVTVNFP GATGLGNQEI NTANPAPYNA AANSPAALGV GAGGKGDAIG
SAELFPLIET AGGTRAVDSW SYQITAGARA SDAGVFSVDP LRVQAGAAGN LKVAGTATYS
YGGVAGNSSL TNTLLLGVAN GDKVAADQWV QAQMAQNPGL TTQSYTRLQW TSAPAALRTV
LAQRALAFLA QHPGEVALTG PANAPTGVST SLSLAGAFLA QFANDWPTLK ANYSAPRAST
PSPTTVTTTT LMRTGTGSIT LAAAGDIDLR NGEAVTYRNI LTGADAPGPG PTAYQVGGVA
VYTAGHRVIP EAIDAVDPKT GAALVLDPSA YSQPAVLKAQ TAGGGAVRGL ALSQPVYATG
GGDVSLLAGG DVLSRRDLYT AAWVDDLQIQ NLTGLAGYVG TGEQPWRMGF VGMATDLRIN
PQLFTEGAGT LGGGDIRVVA GGDVSDLSVV ADTTVTTANV AEAGGAARPG RTLLTFGGGD
VMIEAGGEML GGRIDMGAGQ GEIRVGGDLI RAAGTDNGLR LRLSDATINL SVRGAALVEG
ITALGVKRNA SSGNIDANAT STANALGFYA QDAGVSVLSN GNLTIANSAT LRGDGGATNT
QAANALEAIY PGSLSAVSLS GDLAFGSNTE ILLTPTARGT LTLAAGGDIA PATIAMLDND
PGVTPGVFSR FIRSGSFAAS GMAFDFPVVL PTTSLAARRL LHNPRVPRSG DAAPNRIYAG
GDIGALSLST PKQTRIGAGR DIVNMMFFGQ NLNVGDVTRV VAGRDITATT VVRGAPLGFV
GLTDPLPVLQ GNSFVIGGPG TLSLEAGRDL GPFLNSAIDD FQAASDIVQP RHITFGGGVM
SVGNEWNPWL APVGANLDIQ FGVAKGADFT ALRETYLDPT NLAQFPDYLF VQAEDEHGAI
ITDRTKPIYG PALVAWMQAN AGETLQAAFG TTKVDYVQAY QAFTGLSALR QRGFLQQVYF
NELTVTSIPG PSFQKYSRGY TAVNTLFPAS RGYTANDLTG GSNGANLLVR TGDLDLRLAT
IQTARGGDIS ILGPGGRVLA GSTVSTAQQA ARRNYAGRGL FSAFGAPVAQ ISAIPIGFEG
VLTLRGGDIS SFTDGDFILN QSRLFTEQGG DVAMWSSNGD LNAGQGPKTS PNFPPVVVKV
SDNANSEVDQ TSAVSGAGIA AFQPAPGVAP PNAYLIAPRG TVDAGDAGVR VAGNLFVAAL
SVANADNFKA SGSAIGVPTA AAAPVVGAET SAAGNAVTQA AQQAVRGRDR PDRSIITVDV
LGFGEADSCP TPNDPKCPR