Gene Caul_4162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4162 
Symbol 
ID5901624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4514164 
End bp4528227 
Gene Length14064 bp 
Protein Length4687 aa 
Translation table11 
GC content64% 
IMG OID641564683 
Productouter membrane adhesin like proteiin 
Protein accessionYP_001685784 
Protein GI167648121 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGCTC CGCACAAGCC CGCTGGTCGG ATGGATCTTA CGGGTGAAGT CATCGCGGGC 
TATCAGCGCG ATCACACGAT CCCAAAGGAG CTATTTCTTG GATCGGATTC TGAATTTATC
AGAGAGTTTT TGAAATTAGG CGATGATAGT AAGCAAATAT TCGACTTCGA TTCGAAGTCA
GATAATATTC AGATGCAGCA CGACGGCAGC ATTGTGGATG ATGGTACGCC TCATAACGGG
AGTCATCCCG CGAAAACTCA AGCGATATCT GATATTCTGA AAAATAAATA CAACAATGAC
TATCGATATA TACTTGAACA TCCGGATACA ACTCCGGAAC AAAAGAATCA AATTCGCCGC
GATATGGGTG AGTTTCTAAA GAAAACAGCT AATGAAGTTC AATTCAAATC ATCAGGTTGG
GTTGATCCCG ATCGGCAACT GATCAATAGC GCTACCAAAT TCGATCCGGT GACCAAGAAA
AACGTACCGG TCTCTAAACT GGTCGCGTCA CCTGAATGGC AAAACGCTAC AGCCGCGGAG
CAAAAGCAAC CAGCCAGAAT AGCGGAAAAG TATAACGATC GCATTTTGAA GCAACCATTT
AGTGTTACGG AAGCTCGGGA CTTCAAGATA TTGTCCGATC TTGGCGGCGG TCGCACTATC
GCGCTGACGC CAGGCGATCC AGACGCCGTA AAGAACCTTC GTGCGATCGA CGCCGAGGCC
AAGAGGCTTG GCGTCCCTGA TGACGCCAAT TTCTACGACA AGATCAAGGC GGTCGATAAA
AGCCTGGGCG TCGACACTCT AGTTGAGCAA ACACACGCGC TCACCGACCA GATCCACGCG
GCTTCGCCGG CTGAGCGAGG CCCGCTCGTC GCGCAACGTG ACGAACTGAT CCGACAGACC
GAAGCGAAAA TCAATGCGCG CGCCATCGGC GCCGATGGTG CGAAGGTTGA AGCGCGCAAG
GCCTTTGGCA CACTTCGCAA CGTGGTCGGT GACCTGGCAG AGCGTGGGCT GCTGCGCAAG
GCGGGTCGAT GGGCGGCCGG CGAAGGCGCG GAGTTCCTCG TCAAGCACAT CCTACCCAAG
TTCATCCCTG GCCTAAACGT CGTGTCGACG GCGCTCGATA TCTACGACAC GGCGATGTTC
GCCTATGAAA TCTACAAGCG ACGAGACGAG ATAGCCGCCT TCGCGCGGCG CATCGGCGCG
GAGTTTGCGA GTGTTGCCCA TGCCGAAGAC AAGGAAGGCG GGGCTGTGGA AGTCACCACG
ATCAAGCTTC CGAACGAAGT GGACGAGATC GTGGTTGTCG CACGCCGCGC GCCGGGCACG
AACGTCTTCC AGGTGCTGGG CGCCTACGCG AAGGGAACGT TCAGGCAGGT GGCGTCTGCG
TTGGAGCAAG CCACCGGCAT GTCGTTCCGA CTGAAATCGG GGGAACTCCA AGCTGTTAGC
TACGGATTCG CGAGCGTTTC GCTTGCGCCG AACGGTGGTT CCTCGACGAC GACGACCTCG
ACCTATGATC ATGGTCGAGT CACGCATAGC CAACAGGTCG TTCAGTCGGC GGAAAGCATG
CTCGGCGACG TCCGAAGCAT GCTGGATTCG GACTACGAGG TCCGCCAGGG ACTCTATTTC
AACCTGAATA CGGAGGAAAC CTACGGAGAA GACGGCGCTC TCGAAGAGAC GCGGACGGAT
GTCGACGCCG TCAGATACAA GACGAACGCC AACGGGCGGC TTGTTGCGGA GACGCAGCGT
TCCCGATTCA CGGCCGCGCA ATTTGGCTCC ATTCTCGGCT CCAATATCTC CAATCTCTTG
GGCATCGACA ACCCCTGGCT CAAGTTGTCG GCCGGAACAG TGCTGGGCAC GATTGGCCTG
AACCTGGGGG ACGCGCTGGA TCATGGCGGT ACGCGCGAAG CTTTCTCCGA CGCGTTCAAC
GATCTCGATA TCGATCTCCT AGACGCCGGT ATCGGTGCGG TCAGTTCCTA CCTGTTCGGC
GAGCTGGTCG CCGAACTCGG CATGGATGCG ATTCCCACCC AGGTCATCAC AACGGTAGGC
GGGGCGGCGA TTTCAACCAT CGCCGTCAAT CTGGCCCATT CGCACGTGTG GAACGACGGG
ATGAGCGCCA ATGCGCTGAA TGCCGCCGGC GCGTTCGTGG GCACCTGGCT TGCTTCGCAA
CTTGTCAGTT TCGACAGCAC CGGCGGTCAA CTCGGCTCGA CCTTTGGCTC GGCGATCGGC
GCCGCGTTCG TGGTTGCTCA GTTCGTCGTC GCAGCGACGG CGACTTCGAG CACAACGCTC
TTTGGCGCGC AACTTGGAGC GTTTGCGGGA CCTTTGGGCG CTGCGATCGG CGCCTTCGCG
GGATTCATCA TCGGCGGCCT TATCGGCTCC CTGTTCGGCG GCAAACCGAA GGCCGGCGCG
GACCTGGGAT GGAACGCGGC CGAGGGCCGC TACGCGGTCA CCTCCGCCTG GGCGAAGAAC
GGGGGTTCTA AGGATGGTAT GGCCGCCATG GCCATGTCCG TCGCCGGCAC GCTCAACACG
GTCATCGCCG CCAGCGGTTC GACCATTATT GATCCTACGT CCGTGCGGCT TGGTTCGTAC
AGCAGCAAGG GGAAGGCTTT CCGGTACTCG TCGGTGGGGA CGAGTGGGAT CAGCTACACA
ACGCCCGACG CCGAAGCGTT GATCGCGCAC GGATCTTCGA TCGCGCTGGG CGACCTCATT
TCGCGTCTGA CCGGCGGAGA TGTGCTGACA AAACGCGCCG TCCTCGCGAC GCTCGCGGGA
GCGGGCGGCT TCGATGCGGC CACGCTCTAC GGCAATCTTG CGACGGCCAA GGGCTATGGC
GAATATCTTG CCAACAGGGA TCTCATCACT CCCCTCATGG AGTCGGATCC GGAATCCGTT
TTCGCGGCGG GTTGGACGAT CACCGTGGCT CAGGCCCTGG TCCTGGGACT GGACCGGCGC
GCATCGACCG ACTGGATCGG CGGCTGGACC TCTTTCTTTG ATGAGACCCT CGGTGGAACG
ATCGACGGCC TCGCCTACGC GCCGTCTGTC TTGCGTTTCC AGCTCAGTGA CAAGAACGAA
CGGACCTTCA TTTTTATCGA TGAGAACGGG GATCTGACTG GAGCGTTGGG CGACACGATC
GACACAGCCG GAAAGCTGAA GGTCAACGGG TCCGCCTCGG CCGATATCAT CGAAGTGTCA
GGAAGGGCGC TCACGTCGAC CGGTGGCCTG ACCATCGGGC CGGGATCAGC GACAATAGCG
CTGGACGTTT CAGTGGCCGC CCTGATCGAC GCCGGCGACG GCGACGACAT CGTCCGGGCC
GGCGACCTCG GCAATGACGT GCTAGGCGGG GCCGGGAACG ACAAGATCGT TGGCGGTAAG
CTGGACGACT GGCTGTTCGG CGACGACGGC GACGATACCC TGTTCGCGGG CGACGTAGTC
TCGGCTGGGG TGGTCAGCCA GGCGGCGCTG CAGGCCGGCG GGACGCTGGT CGTCGACGCC
GTGGCGGCCA CGGCCGTCGA CGGCGGCAAT GGCGATCTGC TGGACGGCGG GGAAGGCAAT
GACCGGCTCT ATGGCGGCAA GGGCTCGGAC TGGCTGAAGG GCGGCGAAGG CGTCGACCTG
CTGGTCGGCG GGGCCGGCGG CGACATCCTC GAGGGCGGGG CGGGCGACGA CCAGGGCGCG
TCGGGCGCGG CGGCGGTGCT CGGCGGCGCG GGCTCGGACC AGTATGTGTT CGGCTACGGC
GACGGCAATG ACGTGCTCTT CGACGAGTCC GATCCGGCTG GCGTGGCCGG TTCGACGGGG
GACTCGCTCA ACATCCGGGT CAGTCAGCTC AACGCCGGTA CCCTGGCCAA GAACTGGGCG
GGCGGCGGGT CCTATGAGGT CGACGGTTCG GTCAAAGGCG GCGAGGACGC CATCGTGTTC
GGCGTCGGCG TGACGATGCA GAACCTGATC ATGAAGCGCT CGGGCACCAC GGGCGCCCCG
GGCTCTGACC TGATCATCCA GCTGACCGCC GAGGATCCGA TCGGCGTGCT GGTCAACGGC
CATGTCCGGC AGATCGCCAC CGGCGACAGC CTGACGATCA AGGACTGGTT CGAAAGCACT
CGCAAGATCG AGTGGCTGCG GTTCGCCAAC GGCGACGACA TCCGCATCGG CGACATCACC
TCCTACATCG TCGGCGTGGC CGGGGCCTCG GTGATCCTGG GCACCAACGG CGCCGACTGG
ATCGTCGGCA CCGACGGAGC CGACAAGATC TATGGCCTGA ACGGCGACGA TTTCGGGTTC
GGGGGCCTGG GCAACGACAT GGTGTCGGGC GACGGCAACA ACGACCTGGT GTCGGGCGGG
GCCGGCGACG ACGTGGTGAT CGGCGGGGCG GGCAACGACA CCGTGCTCGG CGACGGCGGC
GACGACCGCG CCTTCGGCGG GCTGGGCGCC GACATCCTGG CCGGCGGCCG GGGCGACGAC
GTGGTCATCG CCGGAGCCGG CGACGACGTG ATCCGCTACG CGCGCGGCGA CGGCCAGGAC
GTGCTGATCG ACGACCTGGT CAACAACTGG GACCTGGTCT GGCAGAACGG CGCCTATGTC
AACGGCTACG CCCTGAACGC CGACGGGACC GTGAGCAAGG GCGGAGTGGT CTATTTCGAC
GGCTCCAAAT GGCTCGACGG GTTCAACTAC GACTATGACG ACGCCGCCAA GACCTTGAAG
CGCCACATGG GCGCGGTGGG CGGGGTGATC TCGGCCAATG CCGGGATCGA TACGCTGGAG
TTCGCGGTCG GGGTGGACAT ACAGGATCTG ATGCTGCGCC GGGTCGGAAC CGACCTGGAA
GTCGTCGTTT CCGAGGTGGA CGCCACGGGC GGCTTCTCGG GCGCCGCCGA CAAGGTCACC
ATCAAGGACT GGTGGTCGGC CACCACCGGC GCCGAAACGC GCCCGATCGA GAAGTTCAGC
TTCGCGGCCA CCGGCACGCT GGCGCTGGGC GGCTACGCGA TCATCGGCGG CGCCACCGAC
GGCGCCGACA CCCTGACCGC GGCGGCGACC GCCAGCTGGA TCACCGGCGG CGGCGGCGAC
GACCTGATAA ATGGCGGGAC GGGCGCCGAC ATCCTTGTTG GCGGCGATGG CTTCGACACG
CTGTCGGGCG GCGCGGCGGG CGACATCCTC TACGGCGGCG CAGGCGACGA CATCCTCGAT
GGCGGCGCGG GCGCCGACCA ACTGTTTGGC GGAACGGGGA CGAATGATAT AGCGTCCTAC
GCCTCGGCCG GCCCGGTTCG CGCCTATCTC GACGCCAGCT TCGCCAACAA CGGCAATGCG
GGCGGCGACG TCTACACCGG CATCGAAGGC CTCGAAGGAT CGAGCTCCTC GGATCGCCTC
GGCGGCAATT TCGGCGCCAA TGTGCTGCGC GGCGGCGGCG CGAACGACAA GCTCTGGGGC
GGGGCGGGCG ACGACACCTA CGAGTTCAAC CGCGGCGATG GTCTGGACAG CGTCTATGAC
GGCGTTCTGG TTGTCGAACA GATCCTCGAC ACCGCCGGAA CCCTAAGCAG CACCTTCACG
GCCTCCTGGC AATTGATGCG CTACGGGACG GCGACCGGCG TTTCGGGCGA CTACTACCAG
TATCAGCTGA CCGTAAAGAG AACGGCCGAC AACGAGGTCG TCTATCAAAG CCGCGACGGC
GTCGACTTCC TCTACACGAC CCCTCAGGCG GCCGTTCCGG GCGGTTCTGC ATGGCCCTAC
GCCAATGGCC AATGGATCAC CGGCGCCACG CGCATCAACG GCGTGATGAC GGTGCTGGAA
CACATCGTCG CCGGGGACGG CGGCGCCGAC ACCCTGCAAA TGGGTTCGAC GATCAGCTTC
TCGGACCTGA CGATCCTGCG CGGATCCAAT TTCCTAAAGG TCAGCCTCGA CGGCGCCAAC
TATGTCGCCC TGTACGACCA GACGATGACG GATCGCGCGG TCGAGACCCT GCAGTTGGCC
GATGGCCAGA CGGCCGACCT GACCCATCTG CGCCTGGCGG GCGAAACCGC CAGCGTCGCG
GCGGACTTCG TCATCGGCGG CGCGGGCAAC GACACCCTCA GCGGTCTGGC CGGCGACGAC
GTCATGTCTG GCGGCGCGGG CAATGACAGT CTGGACGGCG GCGCGGGCAA CGATGTCCTG
GAGGGCGGAG CCGGCGCCGA CACTCTGAAC GGCGGCGCGG ACTCCCAGAC CGACGGCCTG
GCGGTCAGCG CCAGCGATCC TGGCAGCTAT GGCGACACCA TCCGCTACGT CACCTCCGGC
GCGGGCGTGG TCATTGACCT GGCCACTAGA GCGGCGTCGG GCGGCGACGC GCAAGGCGAC
ATCATCGTCC TGGGCGCCAA CGGCTTCGGC TCCATTGAGA ACGTGCTCGG CTCCGACGCC
TATGGTGATC AGTTGTCCGG CGACAATCGC GCCAACCGTC TGTCCGGCCT CGGCGGCAAT
GACGTTCTTG ATGGTCGGGC CGGCGACGAT GTCGTGGTCG GCGGGGCCGG GGACGATACG
CTCTATGGTG GCGACGGCGA CGACGCCTTG TCTGGCGAGG ACGGTATAGA CCGTCTCGAG
GGCGGCCTGG GCAAGGACGT GCTCGGCGGC GGAGCGGGCG GCGACATGCT GCTGGGCCAA
GCAGGCGACG ACCTGCTCAC CGGCGACGAC GGCGACGATG TGCTCTACGG CGGCGACGGC
CTCGACACCC TGGGCGGCGA CGCGGGCGCC GATATCCTCT ATGGCGAAGC CGGCGACGAC
AAGTTGGTGG GCGGCGACGG CGGCGACCAA CTGTTTGGCG GCGACGGCGA CGACGTGCTG
GTCGGCGGCA CGGGCGACGA CCTGCTCGAC GGCGGCGCCG GCGGCGACAT TTACGGCTTC
GACGCCAACA GCGGCGTCGA CCAGATCGTC GATGCGGCCG GCGTCAACCA TATCCAGATC
AGCGGCGTGA CGTCCGATCG CATCTGGATC ACCAGAGACA ACCTCGACCT GGTGATCAAG
GTGATCGGCG GCGACACCCG GATCACGCTG CAGAACTACT ATGCCGCTTC AGCTGGGTCC
CGCGTTCGCG ACATTGCGCT CGGTTCCCAA ACCCTGTTCC TGGACCATGC CGCGCCGATC
ATCACGGCGA TGAGCGCCGC GGGCGCGACG GCGACGGACG CCGCCATCGT GGGCCTGCGG
GCGCGCTACT GGCACGCGGG CGCGACCGCC GCCCCGGTCG TGGCCGCGCA GTCCTACGCC
ATCAACGAAG ACCAACCCCT GGGCGGCCAG GTCGGCGCCG AGGACGACGA CGACAACATC
ACCGCCTACG CCGTGGTCTC GGGGCCAGCC CTGGGCACGC TCAGTTCGAT CTCCGCCACG
GGCGGCTGGA CCTATACGCC CAACGCCAAT GTGTCCGGCC AGGACCACAT GACGTTGTCA
GTCACCGATG CCAGCGGCGT GACCGTCCAG CAGGACGTGG CGATCGCCAT CGCGCCGGTC
AACGACGCGC CGACCAACCT CCAGGCGCCG TTCCTTCTGT CCGTCCCTGA AGGGACGGCG
AGCGGCTATT CCTTGGGGAA ATTCACCAGC CAGGATCCGG ATGGCGCGGG CGAGGAAGCC
CACTACGCCT TCGCCCCCAA CACCGCCCAT GGCCTGTTTT CCATCAGCGA TGATGGCGAA
CTGAAGGTTG CTCTGGGCGC CAATGGTCAA CTGGACAATG AGACCGCGCC GATCCAATCG
ATCACCTTGC AGGTCACCGA CCAGCACGGC GCGGCCAGCC AAAAGACCTT CGACATCGCC
ATCGCCGACG TCAACGAAAG GCCAACGATC CGCTCGCTGA CATCCGCGGC GCCAGTCTTC
GCGGAGACGA CGGGAACAGG GACCTTGAAC GGCGCCACGA TCGCCAGTTT TGAACTGAAG
GATCCCGACA ATATCAACAA CCCGTCCCGG CCCGCCCCGT CATTGGTCTT GACGACGGAG
CGAACTGGCT GGCTGCAGGC CAATGGCGCC CAGCTTCAGT TCAAGCCAGG CACGGCGATC
GATTTCGACA CCCTGAGCCT GTCTGGCGTC ACCATGGGCG ATGTCGACGC TGATGGCGCG
TTGGACATCA AGTACAGCTA TTTGGTCGCG GCCAAGGATG GGCGAGACGG CGCCCTCTAC
TCATCGTCCT ACTGGGCGAG CTTCTATATC GAAGACGTCA ATGAGGCCCC CAACGCGCCG
GTGTTCAGCA ACGCGGTCGC CTCGATCGCC GAGCGCGATC ATCCGTTGGA CGGCGCGTTG
CGACCCGCGC TCGAAGTGGC GCGTCTGGCC TCGACGGATC CCGACAGGTC ACCGATTTTC
GCGGCCCTGT CCTATACGGT CGACAACCCC AATTTCGAGG TGGTGTTCGT GGCGGCCGCC
GGCGCTCAGC CGAGCTATTA TGCATTGCGG CTGAAGGAAA ACGCTTGGAT CGATTTTGAA
ACGGGCGGTT CGATTACGGT GAAGGCGGCA GCCACGGACA TGGCCGGAGC CGGGCTGTCG
TCGGCGCCGA GTGCGATCAC CTTCCTGGTC GAGAACCGAG ACGACTACCT CTATGGCGAC
CAGAACGCCC TGGCCCCCAA CGACAACCTG CTGGGCGGCG CCAACCGCGA TCTGATCTAT
GGAAGGTCTG GCAACGATAC GCTGAACGGC GGCAGCGGCG ACGACTATCT AGAAGGCGAT
GACGGCGATG ACAGTCTGAT CGGCGGCGAT GGCGTTGACC GGCTCCTGGG CGGCAACGGC
GCCGATATCC TCGACGGGGG GGCTGGAGCC GACTCGCTGG TCGGTGAGCT TGGCGACGAT
ATCTTGATCG GCGGATCGGG CGCTGACAGG CTCAGCGGCG GCGACGGTCG CGACAGCCTT
TCCGGCGGCG CGGAAAACGA CGTTCTCAAC GGCGATCTGG GCGATGACAC GCTGGACGGC
GGCGACGGCG TGGATCAACT GGACGGCGGC GACGGCAACG ACCTGCTCGT CGGAGGACTT
GGCGACGATA GCCTGTTTGG CGGCGCGGGC GCCGATACGA TGTCGGGCGG GGCGGGCGCT
GATCAGTTCA CCGGCGGGGC CGACCGCGAC ACCGTGACCT ATGCCAGTGC GACGAGCGGA
GTCATCGCCA GTCTGACGAC CGGCGGCAGC GCGGGTGACG CCTTGGGCGA CACCTTCCTG
GATGCGATCG AGGTCCTGGT CGGCTCCAAC TATGATGACA GCCTGACCGG CACGGCGACC
AATGACACCC TGCAAGGCGG TGATGGGAAC GACTCTCTCC ACGGCGGCGA TGGCGACGAT
GTGCTCGAAG GCGGCGCGGG CGACGATCAG CTGTTCGCCG AGTCGGGCGA CGATCGCCTC
TATGGCGGGA TTGGGCACGA CACGCTGGTC GGCGGAACCG GCAGCGACAC CTATTATCTG
AACGGTCTGT CGGGCGCGGA CGATATCCAG AACTTCAATT CGTCGGCCCA GGATATCGAC
ATCATCGGTT ATCTGGATGG CGACATCGAC CGCACCAAGC TGTGGTTCCG CCGCTCGGTG
CTCGCCGATG GCGTCACCGC GGGCAATGAT CTGATCATCG ACGTGGTCGG GACCACGACG
ACGACGACCA TCGTCAACTG GTATGGCCCC GCGGCCGCCA ACAGCGACAA CAAGATCGAC
TTCATCTTCA CCGGCCAAAA TTCCAGCCGG AAGATCAATG TCGAACAGCT GGTGACCCTG
ATGGCCGCCA ACCGGCCATC CGGCGTGGCG GCCGGCAGCG CGCCTTCGGC GGCGCAATTC
GCCGGCCTGC TGGCCAACAG CAGCTTCAAG ACCGCATGGC AAAACCTCTG GGACAGCAAT
GATCCGCCCG TGGTGGCCGC GCCCGCGCTC CTGACGATCG ACGAGGCGAC GGCTCAGACG
TCCGACATTT CCGTCACGAT CAGCGACAAT CCCTTGTCGG GTCTGTCCCT GCTGATCAGC
GCTGTCAGCG CGACCGATCA CCATGTCTCC GACGCAAGCC TTGTCGGTCT GCTGGACGCT
ACCGATGTTG ACGCCAACGG GAAGGCCACG ATCCACTTCA AGCCTAAGGA CTATCTCAGC
GGGGAGTTTG ACATCCAGGT CATCGCTGTC GACCAGGGCA AATTGGCCAG CGCGGCGCAA
TATGTGCGCG TCAAGATCAA TCCCGTCGGC ACGACGCCGC AACTCGATCC TCCTCAGCCC
GTCTATGGCG CCTTCGCCAA CGGTCCTATC GCCCTGACCA TCCCGGCGGC CCTGGTCGAC
AAGGACGGTT CGGAATATCT GAAGGTCGAA CTGAGCGGGG TACCGGCCGC ACTGATCCTC
AACAAGGGCA CGTCTTCAGG CGGCGATCTC GCCGTGTGGA CCTTGGTGAG AGACAACCAG
ACAGGCCGAG ACGACTTCAA GGATCTGACG ATCAACGGGG CGTCGACCTG GTCGCAGGAT
CTGACCATCG GCATCAAGGC CTATGGCGTT GAATATAATG GCCACAACGC CGAGGAGGCC
GGCGTCATCG CCAGCGCCCC GAAAAAGGGC GAGCTCAACG TCTATATCAA TGGCGCTCCC
TCCAGCATTT CGGCGACCAC GCTCGCCGTG CCGGAGGGCG CCTATTCAAC CGCGACCCTT
GTCGGCTCCT TCTCGGCCAG CGATCCGGAC GGCGACGACT TGGATTTCAC CTTGGTCGAT
AGCGCGGGCG CGGCGATTGT CGGCGGCGTG TTCCGTTTGA GCAAGGCGGT CTATTCCGCC
GCCACCCAGA AATGGACAAC CCAGTTGATC CTGACGGGCG TCGTGGATCA CGAAGCCATC
ACCCAAGACG TCCATGTGAA GGTGTCCGAC GGCAAGCTGG TCTCGGCGGC GCAACAGTTC
AGCGTCACGA TCTCCGACGT TCCGGAAGAC CCCACGACCC CGGCCCCCGG TGTTCAGGCC
CTGTCGGTCA TTTCGGAGAA CACCCATCCG GTCGGCGCCG TCATCACCTA CACCGCGACC
GATGGCGACC TGGTGGCGCC GACCTTGGAG ATCGTCGACG GCAGCGATCC TCTGGGCCTT
TTCGTGCTCA GTCCCATCGC TGGACAGGTT GCGGGAACCG TGAAAGCCGA TCTCGTGCTC
AGACCTGGCG CGGTCTTGGA CTACGAGACC ATATTGGCCG GCCATGTGCC GACTGACCAA
GACGGCGATC AGCGACCCGA CGCCGCCTAT ACCGTGCAGG TGCGCAGTCG CGACGCCGCC
CGGCCAGGCG TCACGTCGCC TGAAACGGTT TCGGTCACGA TCTATGTCGA GGACGTCAAT
GAGGCGCCGA ACGGACTGGT TGAGACGGGC TTTGCAATCG ATGAGAACAA GACTTCGATC
GGAACCCTGC AGGTCAAGGA TGAGGATATC GGCGACGGGG TCACCTATTC CCTGATCAGC
GACCCATCCG GGCTTTTCGC CGTCTCGGGT ACCGGGGACA TCACCGTCGC GCCCGACAAG
GCGCTGGATT TTGAAACCTC GCCCGCTGGC AATCACACCT ATGAGATCAA GGTTCAGGCC
AGGGACACCG GCGGCTTGAC GCTGGCCACG CCCGCCTCGA TCTCGATCAC GGTCAACGAG
GTTGACGAAC CCCCAACCAA GCTCATCGTC ACGACACCTT CGACAATCGT CGAAGGCGCG
ACGGCCGCCG CGATCGATCT GGCGACCCTG TCCTCTGACG ATCCGGAAAA GCACGCGATC
ACCTATGCGA TCGTCTCCGA TCCCTGGGAG CTTCTGCAGG TCAGCGGCGA CAAGCTCCAG
CTGAAGGCAG GGATCGTCGA CTTTGAGACC TTGGCAAAAA AGGCGGCGGG GAACTATTGG
AAGATCGACA GCACGGGGAT TTCTTACGCC GTGACCGTGT CGGCGTCCGA TGGGGCCCAT
GCTCCGGTTT CCAACAAGAT CTGGCTCAAG ATCAGCGACG CCAATGAAGC GCCGACGATC
TGGAACCAGG AGTTCAGCGT TTTAGAAAGC CGGCCTGGCG CCGGCGGGGC GGGGGTGTTC
GGGACCGTCC AGTTTACCGA TCTCGATCTC GCGGGCTCAT ACAATCGCGA CATCCGCTTC
TCTATTAGCG GCGGAGAAAC CGATCTTGTG AGCATCAACC CTCTCACCGG CGAACTGACG
CTGCAGGGCG CGTTGGATTT CGAAACTGCC CAGGCACGCC AGGTGCAGGT GACGGTCAAG
GATCAGGCTG GATCCGGCTT TTCTCGAGAT GCGATGATTA CCTTGAATGT TCAGAATGTC
GTTGAGGCGC CGACGGTTGA GACGGAGATA GGTGTCAATA ACGGCACAAA ATGGTACATG
GCTTATCTCG AAATGAGCGC GAGCCACACC GCTGGATCCG GCGATTTTTT GTGGGAAATA
GTAAATACTG TCAATCGAGG AGGGGGCATT GCTCCCCCGC CACTCGGTGT CTGGTCTGGA
TCTAGTACAA ACTATCGACA GTTGACGATC GAGAGAATGT ACAAGTATGG TGACGGTGAA
TTCTACGAAG CGAATTTCGA CTTCACTCTC CGGATAAGCG ATTCTTCCGG AGAGGCAGGC
CTCTACACCT ATAATGTAGA GAACGGAGGC GGCGGCCGCA TTGCGCCGCT CGTTCTGGAT
CTTGGCGATG ACGGCATATC GCTTGTCGAT CTCGCTCATA GCCCAGTGGT TTTCGATCAA
GACGCCGACG GGGTCGTCAA TCACACAGGC TGGGTCGGCC CGGGCGACGG CCTTTTGGTG
CTCGACCGCA ACCACAACGG TCTGATCGAC GACGGCTCGG AAATCAGCTT CTCCACCGAC
GAGGAAGATG CGGTTTCCGA TCTGGAAGGG CTGCGCGCCT ATGACAGCAA CGCGAACGGC
TTCTTCGACA GCCAGGACGA TCAGTTCTCG GCGTTCAAGG TCTGGCTGGA CGCCGACAGC
GACGGGGTCA GCCAGACGGG CGAACTGCGC TCGCTGGCCG AGTTGGGGAT CACCGCGATC
AACCTGACGC TCACCCTCAA CCCCAACGCC GATCCGGGCG CCGACGAGAA CCACCTCTAT
GGGACCACCC AGTTCGTCCG GGCCGACGGC ACGAGCGGGC AGCTCGGCGA CGTCATGCTC
GCCTATCAGA GCGCGCCGAA GGTCACGCTG ATCTCGTCTA CGTCGAGTTC GTCTGAAACA
GACTTGTTGC CGCCCGTGGT TATCGATCTC GACGGCGACG GCGTCGAGCT TGTCGACCGT
GCAGCCTCGA CGGTGCGCTT CGACGTCGCT GGCTTCGGCC AGGTCGTCCG CACCGGCTGG
GTCGGCGCGG ACGACGCCTT CCTGGCGCTC GACCGGAATG GCGACGGCAA GATCACCACC
GGGGCTGAGA TCTCGTTCAC CGGCGATCTC GAGGGCGCGG TGTCCGATCT CGAAGGCCTG
CGCGCCTTCG ACAGCAACGA CAACGGCTTC CTCGACACCG GCGACGCGCG GTTTGGCGAC
TTCCGGATCT GGCAGGACAG GAACCAGGAC GGCGTCAGCC AGGCCGACGA ACTGCATGGC
CTGAGCGACC TGGGCTTCCA GGCGCTGAAC CTGACCCAGA CCCTGACCGG GGCCAGCGTC
GCAGTCACCG GCGCCAACGT GCTTTATGCG ACCTCCGATC TGATCAGGAG CGATGGGACC
CGCCTGGGCG TCGGCGATGT CATGCTGGCT TTCGACGTGG CGCCCTCGGT TACGGTGACC
CCGCCGGCCG GCGATCTTGA GGATGTCGAT CGCACGGCCG CATCGACCCT GAAGCCGCCA
ACCGCCCATA CCGGCTCGAA GGGTGGCGCC AGGTCCGCGC AACCCGCCGC TGACCCGGGC
GAAGACCCGA TGCTCGGCGC CCAGTTGGCC CAGGCCCTGG CGCCGCGCGC TTGGTCACAG
ATCGACCTGG CGGGCATGCT CGACGCCCGC CTGGCGCTCG GCGCCCTGGC GGCGGTTGAC
TATGGTGACC TGGCGGGCGA CGGATCGCGC TCGGCTCTGG ACGCTGGGCT TGCCTTGGCG
CAACAGACCC GCTTGCAGAT GATCCAGGCG ATGGCTGGGT TCTCGCGCGA GGGGGCGGCT
GATTTTGGTC CCGACGCCCT GCGCCGGGGG CATGCCCAGT CCCTCGCCTT GCTGACGGCC
CTTCCCGATG TTCGAGTGCG ATAG
 
Protein sequence
MNAPHKPAGR MDLTGEVIAG YQRDHTIPKE LFLGSDSEFI REFLKLGDDS KQIFDFDSKS 
DNIQMQHDGS IVDDGTPHNG SHPAKTQAIS DILKNKYNND YRYILEHPDT TPEQKNQIRR
DMGEFLKKTA NEVQFKSSGW VDPDRQLINS ATKFDPVTKK NVPVSKLVAS PEWQNATAAE
QKQPARIAEK YNDRILKQPF SVTEARDFKI LSDLGGGRTI ALTPGDPDAV KNLRAIDAEA
KRLGVPDDAN FYDKIKAVDK SLGVDTLVEQ THALTDQIHA ASPAERGPLV AQRDELIRQT
EAKINARAIG ADGAKVEARK AFGTLRNVVG DLAERGLLRK AGRWAAGEGA EFLVKHILPK
FIPGLNVVST ALDIYDTAMF AYEIYKRRDE IAAFARRIGA EFASVAHAED KEGGAVEVTT
IKLPNEVDEI VVVARRAPGT NVFQVLGAYA KGTFRQVASA LEQATGMSFR LKSGELQAVS
YGFASVSLAP NGGSSTTTTS TYDHGRVTHS QQVVQSAESM LGDVRSMLDS DYEVRQGLYF
NLNTEETYGE DGALEETRTD VDAVRYKTNA NGRLVAETQR SRFTAAQFGS ILGSNISNLL
GIDNPWLKLS AGTVLGTIGL NLGDALDHGG TREAFSDAFN DLDIDLLDAG IGAVSSYLFG
ELVAELGMDA IPTQVITTVG GAAISTIAVN LAHSHVWNDG MSANALNAAG AFVGTWLASQ
LVSFDSTGGQ LGSTFGSAIG AAFVVAQFVV AATATSSTTL FGAQLGAFAG PLGAAIGAFA
GFIIGGLIGS LFGGKPKAGA DLGWNAAEGR YAVTSAWAKN GGSKDGMAAM AMSVAGTLNT
VIAASGSTII DPTSVRLGSY SSKGKAFRYS SVGTSGISYT TPDAEALIAH GSSIALGDLI
SRLTGGDVLT KRAVLATLAG AGGFDAATLY GNLATAKGYG EYLANRDLIT PLMESDPESV
FAAGWTITVA QALVLGLDRR ASTDWIGGWT SFFDETLGGT IDGLAYAPSV LRFQLSDKNE
RTFIFIDENG DLTGALGDTI DTAGKLKVNG SASADIIEVS GRALTSTGGL TIGPGSATIA
LDVSVAALID AGDGDDIVRA GDLGNDVLGG AGNDKIVGGK LDDWLFGDDG DDTLFAGDVV
SAGVVSQAAL QAGGTLVVDA VAATAVDGGN GDLLDGGEGN DRLYGGKGSD WLKGGEGVDL
LVGGAGGDIL EGGAGDDQGA SGAAAVLGGA GSDQYVFGYG DGNDVLFDES DPAGVAGSTG
DSLNIRVSQL NAGTLAKNWA GGGSYEVDGS VKGGEDAIVF GVGVTMQNLI MKRSGTTGAP
GSDLIIQLTA EDPIGVLVNG HVRQIATGDS LTIKDWFEST RKIEWLRFAN GDDIRIGDIT
SYIVGVAGAS VILGTNGADW IVGTDGADKI YGLNGDDFGF GGLGNDMVSG DGNNDLVSGG
AGDDVVIGGA GNDTVLGDGG DDRAFGGLGA DILAGGRGDD VVIAGAGDDV IRYARGDGQD
VLIDDLVNNW DLVWQNGAYV NGYALNADGT VSKGGVVYFD GSKWLDGFNY DYDDAAKTLK
RHMGAVGGVI SANAGIDTLE FAVGVDIQDL MLRRVGTDLE VVVSEVDATG GFSGAADKVT
IKDWWSATTG AETRPIEKFS FAATGTLALG GYAIIGGATD GADTLTAAAT ASWITGGGGD
DLINGGTGAD ILVGGDGFDT LSGGAAGDIL YGGAGDDILD GGAGADQLFG GTGTNDIASY
ASAGPVRAYL DASFANNGNA GGDVYTGIEG LEGSSSSDRL GGNFGANVLR GGGANDKLWG
GAGDDTYEFN RGDGLDSVYD GVLVVEQILD TAGTLSSTFT ASWQLMRYGT ATGVSGDYYQ
YQLTVKRTAD NEVVYQSRDG VDFLYTTPQA AVPGGSAWPY ANGQWITGAT RINGVMTVLE
HIVAGDGGAD TLQMGSTISF SDLTILRGSN FLKVSLDGAN YVALYDQTMT DRAVETLQLA
DGQTADLTHL RLAGETASVA ADFVIGGAGN DTLSGLAGDD VMSGGAGNDS LDGGAGNDVL
EGGAGADTLN GGADSQTDGL AVSASDPGSY GDTIRYVTSG AGVVIDLATR AASGGDAQGD
IIVLGANGFG SIENVLGSDA YGDQLSGDNR ANRLSGLGGN DVLDGRAGDD VVVGGAGDDT
LYGGDGDDAL SGEDGIDRLE GGLGKDVLGG GAGGDMLLGQ AGDDLLTGDD GDDVLYGGDG
LDTLGGDAGA DILYGEAGDD KLVGGDGGDQ LFGGDGDDVL VGGTGDDLLD GGAGGDIYGF
DANSGVDQIV DAAGVNHIQI SGVTSDRIWI TRDNLDLVIK VIGGDTRITL QNYYAASAGS
RVRDIALGSQ TLFLDHAAPI ITAMSAAGAT ATDAAIVGLR ARYWHAGATA APVVAAQSYA
INEDQPLGGQ VGAEDDDDNI TAYAVVSGPA LGTLSSISAT GGWTYTPNAN VSGQDHMTLS
VTDASGVTVQ QDVAIAIAPV NDAPTNLQAP FLLSVPEGTA SGYSLGKFTS QDPDGAGEEA
HYAFAPNTAH GLFSISDDGE LKVALGANGQ LDNETAPIQS ITLQVTDQHG AASQKTFDIA
IADVNERPTI RSLTSAAPVF AETTGTGTLN GATIASFELK DPDNINNPSR PAPSLVLTTE
RTGWLQANGA QLQFKPGTAI DFDTLSLSGV TMGDVDADGA LDIKYSYLVA AKDGRDGALY
SSSYWASFYI EDVNEAPNAP VFSNAVASIA ERDHPLDGAL RPALEVARLA STDPDRSPIF
AALSYTVDNP NFEVVFVAAA GAQPSYYALR LKENAWIDFE TGGSITVKAA ATDMAGAGLS
SAPSAITFLV ENRDDYLYGD QNALAPNDNL LGGANRDLIY GRSGNDTLNG GSGDDYLEGD
DGDDSLIGGD GVDRLLGGNG ADILDGGAGA DSLVGELGDD ILIGGSGADR LSGGDGRDSL
SGGAENDVLN GDLGDDTLDG GDGVDQLDGG DGNDLLVGGL GDDSLFGGAG ADTMSGGAGA
DQFTGGADRD TVTYASATSG VIASLTTGGS AGDALGDTFL DAIEVLVGSN YDDSLTGTAT
NDTLQGGDGN DSLHGGDGDD VLEGGAGDDQ LFAESGDDRL YGGIGHDTLV GGTGSDTYYL
NGLSGADDIQ NFNSSAQDID IIGYLDGDID RTKLWFRRSV LADGVTAGND LIIDVVGTTT
TTTIVNWYGP AAANSDNKID FIFTGQNSSR KINVEQLVTL MAANRPSGVA AGSAPSAAQF
AGLLANSSFK TAWQNLWDSN DPPVVAAPAL LTIDEATAQT SDISVTISDN PLSGLSLLIS
AVSATDHHVS DASLVGLLDA TDVDANGKAT IHFKPKDYLS GEFDIQVIAV DQGKLASAAQ
YVRVKINPVG TTPQLDPPQP VYGAFANGPI ALTIPAALVD KDGSEYLKVE LSGVPAALIL
NKGTSSGGDL AVWTLVRDNQ TGRDDFKDLT INGASTWSQD LTIGIKAYGV EYNGHNAEEA
GVIASAPKKG ELNVYINGAP SSISATTLAV PEGAYSTATL VGSFSASDPD GDDLDFTLVD
SAGAAIVGGV FRLSKAVYSA ATQKWTTQLI LTGVVDHEAI TQDVHVKVSD GKLVSAAQQF
SVTISDVPED PTTPAPGVQA LSVISENTHP VGAVITYTAT DGDLVAPTLE IVDGSDPLGL
FVLSPIAGQV AGTVKADLVL RPGAVLDYET ILAGHVPTDQ DGDQRPDAAY TVQVRSRDAA
RPGVTSPETV SVTIYVEDVN EAPNGLVETG FAIDENKTSI GTLQVKDEDI GDGVTYSLIS
DPSGLFAVSG TGDITVAPDK ALDFETSPAG NHTYEIKVQA RDTGGLTLAT PASISITVNE
VDEPPTKLIV TTPSTIVEGA TAAAIDLATL SSDDPEKHAI TYAIVSDPWE LLQVSGDKLQ
LKAGIVDFET LAKKAAGNYW KIDSTGISYA VTVSASDGAH APVSNKIWLK ISDANEAPTI
WNQEFSVLES RPGAGGAGVF GTVQFTDLDL AGSYNRDIRF SISGGETDLV SINPLTGELT
LQGALDFETA QARQVQVTVK DQAGSGFSRD AMITLNVQNV VEAPTVETEI GVNNGTKWYM
AYLEMSASHT AGSGDFLWEI VNTVNRGGGI APPPLGVWSG SSTNYRQLTI ERMYKYGDGE
FYEANFDFTL RISDSSGEAG LYTYNVENGG GGRIAPLVLD LGDDGISLVD LAHSPVVFDQ
DADGVVNHTG WVGPGDGLLV LDRNHNGLID DGSEISFSTD EEDAVSDLEG LRAYDSNANG
FFDSQDDQFS AFKVWLDADS DGVSQTGELR SLAELGITAI NLTLTLNPNA DPGADENHLY
GTTQFVRADG TSGQLGDVML AYQSAPKVTL ISSTSSSSET DLLPPVVIDL DGDGVELVDR
AASTVRFDVA GFGQVVRTGW VGADDAFLAL DRNGDGKITT GAEISFTGDL EGAVSDLEGL
RAFDSNDNGF LDTGDARFGD FRIWQDRNQD GVSQADELHG LSDLGFQALN LTQTLTGASV
AVTGANVLYA TSDLIRSDGT RLGVGDVMLA FDVAPSVTVT PPAGDLEDVD RTAASTLKPP
TAHTGSKGGA RSAQPAADPG EDPMLGAQLA QALAPRAWSQ IDLAGMLDAR LALGALAAVD
YGDLAGDGSR SALDAGLALA QQTRLQMIQA MAGFSREGAA DFGPDALRRG HAQSLALLTA
LPDVRVR