Gene Bcep18194_B1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B1068 
Symbol 
ID3752833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp1197246 
End bp1210097 
Gene Length12852 bp 
Protein Length4283 aa 
Translation table11 
GC content65% 
IMG OID637765917 
Producthaemagglutinin-like 
Protein accessionYP_371826 
Protein GI78061918 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGC CATTCGCGCT TCGTATGGTC TTCGGATTCC AGCGGCCTGT TTCAGGCGCA 
ACGACATGTG CGGTGCCGGA CCGGCGCCGC GCGCAAATTC CTAGCATCGG GGTGGCAAGA
ATGGCAGCAC GCGCACGTAC GGAACGGCAA CTTCGCGCAA CGGCCTCGCG GTCGGTCCTG
AAATCGGAGC TGAGCCCGCT TTACCGCGCG GTCGTGCTGA TGCTCGCGGC GGGTGTGTCC
GCTCATGCGC ATGCGGCAGG CATCATGAAC CTCGGTCACG TGGCTGCGAG CCCGGTGGCC
GGTGGCGCGG GCGGTGCCGG CGGCGTGCCG GGGATTCCGA ATCTCGGCGT GTCGCCGCAG
CAGGCGCTGC AGGCGAGCCA GCCGTCGATC CGCAACCTCG GCTATGCGGC GCACGCGATC
GCCGCGCAGA TCGCCGCGCA GCAGAATGCG GCGGCGGCCG CGGCGAAGCT GCCGTCGACC
GTGCCGAACG GGCTGGCGCC GGGCGGGCTG CAGGTGGCCG CGGGCGCAAC GGGCGATACG
AACAATCCGG TGCTGTGGGT CAACGCGAAC GCGCCGACGC AGAACGTCGA CGCGAGCGGC
CACGTGAACG TCGACGTGAA GCAGACCGGG CAGAACGCGG TGCTGACGTG GGAGACGATG
AACGTCGGTC GGCAGACGAC GCTGAATTTC GACCAGTCGG GCGGCACGCA GACGAACGGC
GCGAACAACT GGGCCGTGCT GAACCGGATC AACGATCCGA GCGGCAGGCC GAGCCAGATT
CTGGGGAACA TCACCGCGCA GGGGGCGGTG TACGTGATCA ACCGGAACGG GGTGCTGTTC
GGCGCCGGGT CGCAGGTGAA CGTGCATTCG CTGGTGGCGT CGTCGCTCGA CGTGCTGAAC
ATGAACAACT ACAACCAGAC CCTGCAGAAC CGGGTGCCGG TTGTGACGGA TAGCAGTCTC
GCGCAGGATG CGGCGGGGAT CGTCGCAAGC AACAAGCAGT TCCTGAGCCT GCCTGCCGGG
ACGACGGGTG GGCTCGCGTA TCTGGAGTCT GGCAGCTCGG GCGGGTTGAA CGGCAACAGC
CGGCTTCCGA ACGAGGTGCT TGGTCTGGGC AACCAGGTCA ATCTTACGTC CGCGAGCCAG
TATCAACCGT GGGGCGACGT CACCATCGAA CAGGGCGCCT CGATTACCAC GCACGCGAAC
GGCAACGCGA GCGATGGCGG CTTCGTGATG ATCGCCGCAC CGAACGTGAC GAACGCGGGG
CATATCAAGG CGACGGACGG GCAGGTGGTG CTGGCGGCCG GCGTCGGGGT AAGTCTGCGG
CCCAACAGCG GAGCTACGAC GAACCCGCAG GTGCTGCTGC CGGAACTCAG CGGAAAGATC
ACGCTGCTGG GCCCGAACGG CCAGATGACC GACATCACGC CGGCCGGTAC GCTGACCAAT
ACCGGCATCG TCGAGGCTGC GCGCGGCAAT GTGAACCTGC TCGGCAGCCG CGTTGCGCAG
AACGGCGTGG TGGGCGTGAC GACGAGCGTC AATTCGCCCG GCACGATCAC GATTTCCACG
GTCGACGAAT ATTCTTCCAA CAATCCGACC GGAGCGGCCT ATCTCGGGCA GGCGCTTCAG
ACGACGGACG GCACCGGTGG CGCCAACGAC ACGCACCGCG CCGGCTTGCT GAGCTTCGGC
CCCGATTCGG TGACCACGGT GATGCCGGAC GGCAACGGCC AGACGACGCC ATCGACGCCC
GGCGCGACCT TCACGCCTGG CAGCATCGGC ATGACGGCCG GCTCGGTCTG GTTCCAGGGC
GGCTCGTTGA TCGAGGCACC CGGTTCCAAG GTATCTGTCG CGGCGTTGAC GCCCTCTGCA
TCGCGCGCAC AGCCACCGGC CGGCCAGACG GCCGTTCCGG GTCGCATCTA TCTCGACAGC
GGCGCGACGA TCGATGTGTC GGGCCTCGCG AACGTCGAGG TGCCGATCGG GCAGACGCTC
GTGACGGTCG ACCGTATCGG CCAGAACGAA CTCGCCGATT CGCCGCTGCT GCGCAACGGG
TTTCTGCTCG GGTACAAGAA CCTCGTGTTC GACAGCACGC TGACGGGCAC GCGCAGCGAC
GGCGTGCAAT GGGTCGGCAG TCCGATCCTG AACCTGTCGG GTTATGTGAA CCTGATTCCG
CGCACGGTCG ACCAACTGTT GACCAATGGC GGTTCGATCA CGCTGTCGGG CAACGAGGTG
ATGACCGCGA CCGGTTCGTC GATGAACCTG AACGGCGGCT ACGTGCATTA CAACGGCGGG
ATCGTCAACA CGACGCGGCT GGTCGACGCG AACGGCGCGC TCGTGCCGAT CGGACAGGCA
AGCCCGTACG ACACCTACGT CGGCATCGCG GGGCAATTCG TCGAATCGCA TCCGCGCTGG
GGCGTGACGA AGACGTGGTA CAACCCGCTG CTGACGATGG GCGGCTATCA GGGCGACTAT
ATCGTCGGCG GCAACGCGGG GACGCTCGAT ATCTACGGAC TGCAGTCGAC GGTGCTGGAC
GGTGACATCA GTGCGCAGGC GTTCGGTGGC GCAAAGCAGA TGCAGGGCAA CAGCCTGCCG
AGCGGCGGGA CGTTCAACCT CGGCGCCGAT CCGAAGCTCG ACGGCGCCGC GCTGGTGCAC
CTGACGTCGA ACAACAGCGA TGTCGCATCC GGCACCGCGG GCCTGGTGGT GCTGCAGGAC
AATGCGCCGC AGCTCGCCGA CATCGCGCCG GGATTCTCGA TCGATACACC GCTCGATAAG
ACGGCGCTGC AGGCGTTGCC CGCGAACGAT CCGCGCAACG TGCTGACGAC CACCGTGGTG
CCGGTCGCGA CGCTGAACAA CGGCGGTTTC GCGAACCTGA ACGTGACCGA GGACAAGAGC
GGCGGCAAGG GATTCGTCGT GCCGCAAGGC ACGCAGCTGG CCCTGCAGCC GGGCGGTTCC
ATTTCGCTCA ACAGCGTGCC CATCGGTGCG GATGTCAACG TGGCAGGCAA GCTGATCGTA
CCGTCGGGCA CGATTTCGAT TGTGGGCGGC GGGAATATCA TCGTCGGGCC GCAGGCGGTG
ATCAGCGTGG CCGGGCAATG GGTCAACAAC GACGTACAGG CCGCACCCGG CACGACACCG
GGCAATAGCC AGTTCATCAA TGGCGGCAGT ATCGCGTTAT CGACCACCGA ACAGACGGTC
GGAAACTCGA GCGACGGTTT CAGGGACTCG ACGGGATCGA TTCTGCTGCG CCCCGGTAGC
GTACTCGACG TGTCGAGCGG CGGCGAAGTG CTGGCGAACG GCCAGTTGCT GATGCAGAAC
GGCATTCCGG TCGGACGCGG TGGCAACGTT GCGCTGTCGG TCTACGCGGA GCCGGTCACC
CGGCAGTTTG GTCATACGGG CGACGGTGGT CAGATGATGC CGACCACGCA ACCCGTCAAC
GGTACGATCG CACTCGGCGG GACGATCCTC AGCGACGGTT TTTCGGGCGG TGGCACGTTG
GCGTTGCAGG CACTCGGCTT CCAGATCGGC GGCGATCCGA AGGCTGCTGC GCCGTGGGAT
GTCTATTTGC CGGAGCGCTT TTTCGCGCAG CAGGGTTTCG GCAAGTATGT GTTGAACGCG
TACTACGACG CGACCGTGGC GCCCGGTGCG GCCGTCACGC TGACGCAGCG CAATCTGATT
GCCGATGTGC CGGCGCTTCT GCAAACCGCG ACGGGCGCCA ACCTGTCGAC GTCGGGCCTG
ACGACCTCGG GGCGTCTCGA CGACTATCAC CGGCAGCCCA CGAGTCTCGT GTTGACGGGC
GGCAATGACG CGTCATGGAT CACGCCGGCG AATACGGCAC CGTCGTATCC GGGGGTGACG
GGTGCCGTTA CGTTGTCGTC CGGCGCGTCG ATTCATGCGG ATGCCGGCGC GAACATCGGC
TTCGGTTCGC CGACGCAAGT AACGGTGCTC GGCTCGATCG TCGCGCCCGG CGGGTCGATC
ACGCTCAGCG CGGATTCAGG CAGCACGTTC GCGCAGACGG GGCAATACGG CATATTCGTG
CCGAGCGACA GTCGATCGGT CTGGCTCGGC TCGGCTGCCG CGCTCGACGT GTCGGGTACT
GCGCTCGTGA ACCCGCTGGC GGCACCGGTC TGGATCGGCA CGGCTGTCGT CGTGCCGAAC
ACCGGCAAGG TCCTGCCGGG TGGTTCGGTC ACGTTGTCGA GCGACGCAGG TTACGTGGTC
GCTCAGCCGG GGTCGAAAAT TGACGTGTCG GGCACCTCGT CCAACTTCGA CCAGTTGCAG
CCGAACGGCA CGTATGCGTC GCAGCCGGTG TGGAGCGATG CGGGTTCGAT CACGTTGTCC
GCCGGGTATG GGCTCTTTGC CGACGCGACG CTGGCTGCTC ACGCCGGTGC CGCACAGGCG
CGGGGCGGTG TGTTGACGAT TCTGCCGCAA CAGAACAGGG CGGGTACCGG TGAGGCTACG
CTCGTCATCC GGCAAAGCGG CGAATTGACG CCTGTCGGGC TGGGAGTGGG GCAGAGTTTC
AAGACGGCGA TCGATTCGAC GACCGGGTTG CCGATTGGTC AGCCGACCGG TGTCATTCAG
TTCGCCACCG ACCGGCTGGA CGGCTCCGGC ATCGCGAATC TCGTCCTGGG CAGCGCGACA
TCGTTGGCGC CGCTGATTGC GTTCGCCGGG AACGTGAACC TGAGCTTGCC CGAGTCGGTG
ACGCTGAATG CCGGTCGGAT CGCCGCAATC GGGACGGACC AACTTGCCAC GCTCCTTTCT
GCGCCCTCGA CGACGCTTTC GACACTGCTC ACGCAACCGT CGCAGCATAT GCCCGGGACG
ACGGTCGAGA TCGATGCGCC ATATGTTGCA CTGAACGGGC CGACGCTGAC GTCGCGGTCG
CCGGCGTTCG TGCCGGTTGC GACGGTATCT GATGCAACGC TGAACGTGAA CGCGTCGTTC
ATCGATGTGA CGAACCAGTT TCAGCTCAAC AACTTCGGGC AGGCGAACTT CACGAGCAGC
GGGGATATTC GCCTCGGCTC GACGAACGCG ACCCAGACGT CGTCGACCGC CCTCGCTGCG
GCCGGCATGC TCTATACCTC CGGCAACCTG ACTTTCAAGG CGGCGGATCT CTATCCGGCG
ACCGGCAGCA CGTTCATCGT CGATGCGGTC GGGCCGACCG ATCGGGCCAC CGGCAAGCCT
GTCCCGACGA CGATCACGTT CGCGTCGAAC GGTGCGTCGG GCACGCCGCT GTCGGCCGGC
GGCACGTTGC TCGTGGATGC CACGAACATC GTGCAGGGTG GCACGGTGCG CGCGCCGTCG
GGTTCACTCG TGTTCGGTGT CGGGGATCCG GCCAGCACGG CGACGCAGGC GCAATTCAAC
GGCCTGCCGC TCGTTGCGAC GGAATCGGTC AGGCTCGCGA ACGGCAGCGT GACGTCGGTC
TCGAACGACA GCACGGTCAT CCCGTATGGC ACGACGGTCG ACGGCGTCGA ATGGCAATTC
AACCCGATGG CGGGCAACAC CAAAGTCGCC GACCTGACTG CGCCGCCCGC GAAGTACGTC
GGAGTGAACG CAGGCAACGT CGCGCTCGAC AAGGGCGCGA CGATCGACCT GTCCGGCGGC
GGCGATCTGC AGGCGGCCGA GTGGGTGCCG GGCACCGGCG GCACGCGCGA TGTGCTATCG
CAATACAACG TGAGTTATGC GAGCGGAAAG GGCGCCGCGG CTGTACCGGT CAATGCTGGC
GCGGGCAATG TCTACGCGAT TTTGCCGGGC GCGCAGGCGC CCGTGGCGGC CTATGATCCG
GTGTTTGCAC AGACGGTCCA GCCGGCCACC AGCGTCTATG GCACGGCGAC CAAGACGACC
GCGACGCTCG GTGTCGGGCA GGCCGGCCTG AACGACGGGA TCGGCAAGGC GGTGTACCTG
TCCGGTGTGC CGGGACTGGC CGCCGGGTAT TACACGCTGC TGCCGGGCAA GTATGCGACG
CTGCCGGGTG CGTATCGCGT CACGGTTGCG AGTACGGGCG GCGCCGTCGT GCCGGGCGCG
AGCACCGTGC TGCCCGACGG GACGGTCACG ACGGCAGGTT ATTTCGCCGA CGCGCTGACG
GGCGGCCGCA ACGCGATGCC GACGCTGTTC AACGTGCAGT CTGGCCCGGT CTGGCAGCAG
TACTCGCAGT ACACGCTGAC AGGGGCGAAC AAATTCTTTG CCGCGCAGGC GGCTAAGCAA
GGCAGCGTGA CGCCGCCGTT GCCGGTGGAT GGCGGTCAAC TGGTGCTGGC GGCGACGAAG
GCGCTTACGC TTGGCGCGAC GTTGAATACC GCTGCCGGCA TCGGTGGCGC GCCGGCCGAG
GTCGATATCG CGTCGCGGGA CATTCAGATC ACCGGCAGCG GCAGTGCCGC GCTGGCCGGC
TATCTGCAGA TCGGCGCAAG CGATCTCGAT TCGCTGAACG CGGGCAGCCT GCTGATCGGT
GGCACGCGTC AAGCCACGCC GCAGGGCGTT GCGATCACAC CGATCGCGAA CAGCGTCGTC
GTGTCGAACG ACGGGAGCAC GACGCTGAAA GGCCCCGAGA TCCTGCTCGT CACGAAGGCG
GACGGCAGCG GCACCGATCC GAACGCGCCG AACGGCTTGC GCGTCGACGC AGGCGCGTCG
ATCGCCGCGC AGGGCGATTA TCCGGCGGCG AAGGATCAGC CGCTTTCGAT CGCCGGCGAC
GGCGCACTGT TGCGCGTATC GAACGGCACG ATGGCGCCGC TGACGCGTAC GGGCGGGACG
GGCACGGGCC TGTTGACCGT TGGCGCCGGT GCGACGCTCG CAGGTGGCCA GGCACTGATG
CTCGATTCGT CCGGCAATCT GAAGGTCGAT CCGTCCGCGG TGCTGTCGGC CAAGGCGATC
ACGGCCGACG GTTCGGCGAT CACGTTTACG AACGCGAGCG GCGGCGCGGC CGCGAGCCTG
CCGGGATTCG TGGTCGATCC GGCCGGCCTA GCACAGTTCG CGAATGCGCA GCAGGTGGCG
CTGCGCAGCT ACGGCGCGAT CGGCTTCGTC GGCGACGTGA ACGCGACGTT CGGTAATAGC
GTGGACCTGA GCGCGGGCGC GTTCACGAGC GATGGCGGCC GTGTGACGCT GAATGCGCAG
CAGATCGCGT TCACGAACGA GACCGGGGCG CCGAACGGCG TGGCCGCGCC GGGCAACGGC
ACGCTGACGG TCAACGCGAA GGAGATCGAT TTAGGCACCG GCACGAAGAC GTTGAGCGGT
TTCGGGGCGG TGACGATGAA TGCGACCGGC GGCATCGTTG GTCAGGGCAC GGGTACGTTC
GATTTCGGCG CACTGCCGGT GACGCTTGGT GCGCCGGTGT ATCTCGCCGA CACGGGCTCC
GCGTCGATCG TGAAGACGAC CGGTGCGCTG ACGCTGAATG GCGCATCGGG CACGGCGCTG
ACGAAGACAC CGGTCGGCGG TGCGTGGCAT TTCATCGGCG GCGCGATCGC CGACAACGGC
GCGGCGATTG CCGCGCCGGC GGGCAACGTG AGCCTCGAGG CGACCAGCGG CAATCTGACG
ATCGGTAGCG GCTCGACGGT GAGTTCGGCT GGCATGTCGA AGCAGTTCTT CGATGTGACG
CGCTACGCGC CGGCCGGATC GATCACGCTG ACGGCCGACG CCGGTACCGT CGATGTCCAG
GCCGGCTCGA CGGTCGATTT TTCCGGTGCC AATGGCGGTG GCGCGGCAGG CAGCCTTGCG
CTGTCCGCGC CGCAACAGGT CGTGAACCTG AACGGCACGA TCAAGGGCGG CGCGGCGAAC
GGTTATGCGG GCGGCTCGTT CTTGCTGAAT ACAGGCGGTG CGGCGGATCT CGACGCGCTG
TCGAAGACGC TCGCGTCGAG TGGCGTGAAC CAGTCGATCG CGATCCGCTC GAACACGGGC
AACCTGACGC TGTCGCAGGG CAACACGCTG ACCGCGCATA CGGTGCGGCT CACGGCGGAC
GGCGGGGCGG GCAATGCGGC CGACTCGGTC AACGGCAACG TGAACGTGTT CGGCACGATC
GATGCATCCG GCAAGGCGGG TGGGGAAATC GACCTGTACG GCCGCAACGG CGTGGATATC
GAAGGCACGC TGCTGGCGCG CGGCTCCGAT CCGAAGCAGC GTGGCGGCAA GGTCGACATC
GGCACCAGCG CGATGTTCGA TCCGACCGTC GCGAATCCAT ACAACGCGAC CTACGGCTAC
GAGAACATCG CGCGTGCGAA CGCGGGGACG ATCACGCTGG GAGCAAACGC GCTGATCGAC
GTGTCGGGCG GCACGGCCGG CGGGTTATCG GGCGGGACGG TGAATTTCCG TGCGCCGCTG
CTCGCGGACG GCGGCGTCAA CGTCAATCCG CCGGGGGCAT TCAACGACGG CAAGGGTATC
GTCGGGTCGC GCGCGACGAC GCTCGAGGCC TATGCGGTAT GGAGCACGAC CGACGCGACG
ACGCATGCGC AGCACTTCGA CGGCATCATC GATCCGGCGG GCTGGTACGA CAGCAACGGA
CATTTGGTCG CCGGTACGTT CACGGCACAG GGCACGAGCG GCGTAACGTT CGGCTTCACG
CCGAATGCAA GCGGCGACGG TGGCGGCACG CTGACGAACA ACTCGACCGG CGCGCAAGTG
GTGCTGACGG GCAGCGCCGG AGATCAAGCG CAACTGCGCA ACGGCTTGGC TTCGATCGGC
TTCGACGGGA TGAACGGCAC GTACTTCGTG CCGGGCGCGG CAAATGCCGA TCACCAGACG
TTCTACGGCT ATCAGGCTGG CGGTGCGACT GCGACGACGC CCGGTACGCT GATGGGTTTC
GTTCAGCATG GGCTCGACGC GCTGGTGAAT CCGTTCGCGG GGAAGAACGT CGCGAACGTC
CACGTGGTGC CGGGTATCGA ACTGGACAAT CCGAGCCGGG CGATCAACGG CGGCGACATT
CAGGTGCTGA CGAACTGGAA CCTGGGCACC GGCACGTCGC CGACGGATCT CGCATTCCGT
TTCAATGGGG AGGCGCCGGT GGTGACGCTG CGCGCCGAGA ACAACGTGAA GCTCAAGGCG
AGCCTGACGG ACGGGTTCTT CCAGATCGCG AATCCGCTTG GCGGCGGGGG GACGATTCCG
GTGCCGGGGT TGTCCACATT GAGTGCGACG CAGTCGATTT ACGACATGCC GTCCGGTTCA
GGTGGATATC ACTACGGATA CACGCTCGGC TACTTTGGGA AAAACGGAAT TGGGTTCAAT
CGGAATCTGG CTTTTGGGCC TGGTCAGCCG ACGGGCGGGA CTCCGGACGA AATAGCGGAA
TACTATGCAT TGTATTCAGC GTATGCGAAT TATCTGACGG CGACAGCCAC TACGATCAAT
TCCATGTTCG GATGGGATCC GACCAATGAC AACATCAACA TCATCAGCAC TTTTCGAAGC
ACCGGTAAAT CAGTGGCAGG TCTCACTGCA CCGATTGCAC CAACCGCATC GGAGCAGGCG
ATCAATCCAG GCTCCTATCT GATCTATTTG AATCAATACA AGACATATCT TTTAGCGGCT
GCCAACTATC GCAACCAGCA TCAACTCGAT CCGTATCCGT TCGTGGACGT TGTCACACCG
CCGACTGTTC AGCCGGTCGC GGTAATTGCG ACGTCGACGA TCAACTTGCC CGCCGTGGCC
GACAATACGC CATCACCGGT CGCCACTGCG GCGAATCCGA TCCCGCTTCA GTCTGCTTCA
TTGGCTGGAG GAGCAAGCAG TTCTTTCCGC GTGGTTGCGG GTGCGGATCT GGGCAGCACG
AATCCGCTTG CCGTACAGGC CGCGTCGGCT TCGACGAAAT CGCCGGGCAA CGGCAGTGTG
ACCTTCGACG GTCACACTGC TTATGTGGAC GCCAACGGTT TGGCATTGCT TGCGCCGACA
GCGCTACGCA CCGGTACGGG ATCGATCGAT GTGGCGGCCG CGAATGACAT CGCACTGTTC
GATGCGAGTT CGACGCCGGC GAACGATCCG AGCGTGACGG TTGTCCCCGG TGTCATTTAT
ACGGCGGGTG CTCCGGCTGC CGGAGCGCCC GCGCAGGGCA GCGATGTCGC GATCGTGCAT
CCGTTGTGGA CCGGCAAGCA GGACATTCTG GTGACGCCGG CCGTCAACCC CGATTCTGCC
GGCGATATCA CCATTCACGC GCAAGGGAAC ATCGTCGGTG CCGAGCGCTT GACGGATGCG
ACAGGTGAGG TGACAGGCCA GCAAGGCAGC GATATCAGCC AATTCTGGTG GCAGTGGATG
CAGATCGGCA ATCCGACGGG AACCGTCGGC GCAACGCGTC CTGTGATGCA GACCGTTCAA
ACATCGATCA ATTTCGGCGC CTTCGACCAG GGCGTGATGA GTTCTGGCGG CAACATTTCC
GTGTCGGCAG GCGGGAATAT TGCCGATCTG TCGGTGTCGC TGCCGACGAC GTGGTATTTG
ACCGCGGCCA ATACGGATAA CCCGACGGTC AATACGGTCG GTGGCGGCAA CCTTGCGGTA
CACGCCGGTG GGGATATTCT GAGCGGCGGC TATTTCGTCG CGAAGGGCAT CGGCACGATT
ACCGCGGACG GCAAGATCGG CTCGGATATC ACGTTGCCGT CGCTGTTGGT CGGACAACTT
CCAGGTTCGC TGGATACGTT GCTGGCGACG CAGGATGGCA CGCTCGACGT GAGCGCCCGC
CAGGGCGCCA GCATTGGAAG GGTGTTCAAC CCGTCGTACG TGCAGAGCAC TACACTGTTG
AATGCCTATC GGCAATACGC GGACGCGCAA GGCTATTCGA CGAATTCGGC TGTCAATATC
ACGTCAACGA CGGGCGACGT GGCAATCGGC ACGCTGAAGG GGATCGACAC GATAGGGGGA
GGAGCTTCAA GTGCCACATT CGGGCTCAAC GACATGTCGT TCGTGTTACC TGCCCGATTG
AACCTCACTG CGTTCACCGG TGGTATCACG ATAGCGGCCA GCGGAGAACT TGCTCCGTCT
CCGTCGGGCA ATCTGAGCTT GATCGCGGAC CAGTCGATCA ACTTTTCCAG CCTCAATGGG
CTGGCGACCA GTGGCGGGGG GGACGGCACC CCGGTGTTCG GCATGCTCGA TATGGATCCT
GCGTCGATGC CGTCCCCGAC GAATCCTCAT GCGAATGTGC CGCTCCTGAC CGACACGACG
CTGGCCGCTC ATGCGTCCAG CGCGTTGCAC GAAGACGATA CCGTTCCGGC GCGCATCTAT
AGCCTGAATG GCGACATCGT CGACGGGATT TTGCAGACCA CCGGCTTTTA CGACCAACTC
GTTCCCATCT CGATCGACAA GCCCACGTTC ATCCAGGCCG GGCAAGACAT CGTCAACCTC
GCTTTCCAGG GGCAGAACCT GCGCAAGTCC GATGTGACGC GCATCGTGGC CGGGCGCGAC
ATCTACGATA CGCCGTTCGC AGGTACCGTG AACGCCGTCG TTCCGGCGCT GGTACTGGGT
GGCCCCGGCA CGTTCGACAT CGAGGCGGGG CGGAACATCG GACCGCTGAC CAACCAGATT
GAAGTGAATC TTCAAGGTGT ACTTACCGGG ATAGACGCGA TCGGCAACGC GAAAAATCCG
TACCTGCCTC ATGAAAGCGC GAACATCAAC GTGCTGTTCG GTGTGGGGCC GGGCGTCGAT
ACCGCGACGT TCGTTTCGAC TTACATCGAC CCGGCGAGCT CGGTGGCAGG GGTGCCAGGC
ACGACGCCCG CGCTGATCGC ATTCGTACAG CAGTACGAAG CGGGTCAGGT GGTCGACACG
GGGCTGGTGA GCGATCAGCC GGCAAGCGCG CCGCTGACGG CAGCACAAGC GTGGTCGAAA
TTCAAGGCGC TTCCGCAGTA TGTTCAGCAG CTTTTTGCCG AGCAGGTGCT GTTCAACGTG
CTGGCCCGGG TCGGGGAGGA CTACAACAAT CCCGCGAGCC CGTATTTTCA GAAATATGCC
AGAGGCTATC AGGCGCTCAA TACGCTGTTC CCGGCTTCGC TGGGCTACAC GGCCAACAGT
CTCGGTGGCG GTAGCAATGG TGCGAACAAG CAGGTCGACA CGGGCGATCT GGATATGCGC
GGCACGACCA TCCAGACGCA GCAGGGCGGT AACGTGTCGA TTCTCGGCCC GGGTGGGCAG
GCGTTGGTCG GGAGCACGTC AGCGCCGCCG CAGATCGTCA ATGACAAGGG AACCGTGATC
GCTGGCCCGG GGTCGATGGG TATCCTGACG CTCGAGAAGG GCGACGTGAA TATCTTCACG
GATCGCAGCG TGTTGCTCGC GCAGAGCCGG ATCTTCACCG AGCAGGGGGG CGACATGACG
ATCTGGAGTT CGAACGGCGA CATCAACGCC GGCAAGGGCG CGAAGTCGTC GGCCGATACC
CCGGCGCCGC AATATCAGTG CGACGCGAAC CACTATTGCA TGGTCGATGC GCGCGGGCAG
GTGACCGGTG CGGGCATCGC GACGCTGCAG AGCGTACCGG GCGTGTCGCC CGGCACGGTC
AACCTGATTG CGCCGCGCGG CACGGTCGAT GCGGGCGACG CCGGGATTCG GGCGGGCAAC
CTGAACGTCG CGGCGCTGCG CGTGGTGAAC GCGGACAACA TTCAGGTGAC GGGGAAGGCG
ACCGGTATCC CGCTCGTGCA GGCGGTGAAC ACGGGAGCGT TGACGGCCGC GAGCGCGGCT
GCGTCGGCGG CGACGCAGGT CGCGCAGGAT ATGGCGAAAA ACAACGCGTC CGGTGCGTCG
GCGCGGCGCT GGACGATTTC GGTGCAGGTC GAGGGATTCG GCGATGCGGG CGGCGATGGC
GCGAAGAAGC ACAGGCAGCA GGTTGGCTAC GATGCATCAA ATGCGGTGTC GGTATTGGGT
TTCGGTCAGG TGGGGCCGAC GCAGCGTGCG ACATTGACCG AGGATGAGCG TGGGCGGTTA
GGCAAAATAT GA
 
Protein sequence
MSLPFALRMV FGFQRPVSGA TTCAVPDRRR AQIPSIGVAR MAARARTERQ LRATASRSVL 
KSELSPLYRA VVLMLAAGVS AHAHAAGIMN LGHVAASPVA GGAGGAGGVP GIPNLGVSPQ
QALQASQPSI RNLGYAAHAI AAQIAAQQNA AAAAAKLPST VPNGLAPGGL QVAAGATGDT
NNPVLWVNAN APTQNVDASG HVNVDVKQTG QNAVLTWETM NVGRQTTLNF DQSGGTQTNG
ANNWAVLNRI NDPSGRPSQI LGNITAQGAV YVINRNGVLF GAGSQVNVHS LVASSLDVLN
MNNYNQTLQN RVPVVTDSSL AQDAAGIVAS NKQFLSLPAG TTGGLAYLES GSSGGLNGNS
RLPNEVLGLG NQVNLTSASQ YQPWGDVTIE QGASITTHAN GNASDGGFVM IAAPNVTNAG
HIKATDGQVV LAAGVGVSLR PNSGATTNPQ VLLPELSGKI TLLGPNGQMT DITPAGTLTN
TGIVEAARGN VNLLGSRVAQ NGVVGVTTSV NSPGTITIST VDEYSSNNPT GAAYLGQALQ
TTDGTGGAND THRAGLLSFG PDSVTTVMPD GNGQTTPSTP GATFTPGSIG MTAGSVWFQG
GSLIEAPGSK VSVAALTPSA SRAQPPAGQT AVPGRIYLDS GATIDVSGLA NVEVPIGQTL
VTVDRIGQNE LADSPLLRNG FLLGYKNLVF DSTLTGTRSD GVQWVGSPIL NLSGYVNLIP
RTVDQLLTNG GSITLSGNEV MTATGSSMNL NGGYVHYNGG IVNTTRLVDA NGALVPIGQA
SPYDTYVGIA GQFVESHPRW GVTKTWYNPL LTMGGYQGDY IVGGNAGTLD IYGLQSTVLD
GDISAQAFGG AKQMQGNSLP SGGTFNLGAD PKLDGAALVH LTSNNSDVAS GTAGLVVLQD
NAPQLADIAP GFSIDTPLDK TALQALPAND PRNVLTTTVV PVATLNNGGF ANLNVTEDKS
GGKGFVVPQG TQLALQPGGS ISLNSVPIGA DVNVAGKLIV PSGTISIVGG GNIIVGPQAV
ISVAGQWVNN DVQAAPGTTP GNSQFINGGS IALSTTEQTV GNSSDGFRDS TGSILLRPGS
VLDVSSGGEV LANGQLLMQN GIPVGRGGNV ALSVYAEPVT RQFGHTGDGG QMMPTTQPVN
GTIALGGTIL SDGFSGGGTL ALQALGFQIG GDPKAAAPWD VYLPERFFAQ QGFGKYVLNA
YYDATVAPGA AVTLTQRNLI ADVPALLQTA TGANLSTSGL TTSGRLDDYH RQPTSLVLTG
GNDASWITPA NTAPSYPGVT GAVTLSSGAS IHADAGANIG FGSPTQVTVL GSIVAPGGSI
TLSADSGSTF AQTGQYGIFV PSDSRSVWLG SAAALDVSGT ALVNPLAAPV WIGTAVVVPN
TGKVLPGGSV TLSSDAGYVV AQPGSKIDVS GTSSNFDQLQ PNGTYASQPV WSDAGSITLS
AGYGLFADAT LAAHAGAAQA RGGVLTILPQ QNRAGTGEAT LVIRQSGELT PVGLGVGQSF
KTAIDSTTGL PIGQPTGVIQ FATDRLDGSG IANLVLGSAT SLAPLIAFAG NVNLSLPESV
TLNAGRIAAI GTDQLATLLS APSTTLSTLL TQPSQHMPGT TVEIDAPYVA LNGPTLTSRS
PAFVPVATVS DATLNVNASF IDVTNQFQLN NFGQANFTSS GDIRLGSTNA TQTSSTALAA
AGMLYTSGNL TFKAADLYPA TGSTFIVDAV GPTDRATGKP VPTTITFASN GASGTPLSAG
GTLLVDATNI VQGGTVRAPS GSLVFGVGDP ASTATQAQFN GLPLVATESV RLANGSVTSV
SNDSTVIPYG TTVDGVEWQF NPMAGNTKVA DLTAPPAKYV GVNAGNVALD KGATIDLSGG
GDLQAAEWVP GTGGTRDVLS QYNVSYASGK GAAAVPVNAG AGNVYAILPG AQAPVAAYDP
VFAQTVQPAT SVYGTATKTT ATLGVGQAGL NDGIGKAVYL SGVPGLAAGY YTLLPGKYAT
LPGAYRVTVA STGGAVVPGA STVLPDGTVT TAGYFADALT GGRNAMPTLF NVQSGPVWQQ
YSQYTLTGAN KFFAAQAAKQ GSVTPPLPVD GGQLVLAATK ALTLGATLNT AAGIGGAPAE
VDIASRDIQI TGSGSAALAG YLQIGASDLD SLNAGSLLIG GTRQATPQGV AITPIANSVV
VSNDGSTTLK GPEILLVTKA DGSGTDPNAP NGLRVDAGAS IAAQGDYPAA KDQPLSIAGD
GALLRVSNGT MAPLTRTGGT GTGLLTVGAG ATLAGGQALM LDSSGNLKVD PSAVLSAKAI
TADGSAITFT NASGGAAASL PGFVVDPAGL AQFANAQQVA LRSYGAIGFV GDVNATFGNS
VDLSAGAFTS DGGRVTLNAQ QIAFTNETGA PNGVAAPGNG TLTVNAKEID LGTGTKTLSG
FGAVTMNATG GIVGQGTGTF DFGALPVTLG APVYLADTGS ASIVKTTGAL TLNGASGTAL
TKTPVGGAWH FIGGAIADNG AAIAAPAGNV SLEATSGNLT IGSGSTVSSA GMSKQFFDVT
RYAPAGSITL TADAGTVDVQ AGSTVDFSGA NGGGAAGSLA LSAPQQVVNL NGTIKGGAAN
GYAGGSFLLN TGGAADLDAL SKTLASSGVN QSIAIRSNTG NLTLSQGNTL TAHTVRLTAD
GGAGNAADSV NGNVNVFGTI DASGKAGGEI DLYGRNGVDI EGTLLARGSD PKQRGGKVDI
GTSAMFDPTV ANPYNATYGY ENIARANAGT ITLGANALID VSGGTAGGLS GGTVNFRAPL
LADGGVNVNP PGAFNDGKGI VGSRATTLEA YAVWSTTDAT THAQHFDGII DPAGWYDSNG
HLVAGTFTAQ GTSGVTFGFT PNASGDGGGT LTNNSTGAQV VLTGSAGDQA QLRNGLASIG
FDGMNGTYFV PGAANADHQT FYGYQAGGAT ATTPGTLMGF VQHGLDALVN PFAGKNVANV
HVVPGIELDN PSRAINGGDI QVLTNWNLGT GTSPTDLAFR FNGEAPVVTL RAENNVKLKA
SLTDGFFQIA NPLGGGGTIP VPGLSTLSAT QSIYDMPSGS GGYHYGYTLG YFGKNGIGFN
RNLAFGPGQP TGGTPDEIAE YYALYSAYAN YLTATATTIN SMFGWDPTND NINIISTFRS
TGKSVAGLTA PIAPTASEQA INPGSYLIYL NQYKTYLLAA ANYRNQHQLD PYPFVDVVTP
PTVQPVAVIA TSTINLPAVA DNTPSPVATA ANPIPLQSAS LAGGASSSFR VVAGADLGST
NPLAVQAASA STKSPGNGSV TFDGHTAYVD ANGLALLAPT ALRTGTGSID VAAANDIALF
DASSTPANDP SVTVVPGVIY TAGAPAAGAP AQGSDVAIVH PLWTGKQDIL VTPAVNPDSA
GDITIHAQGN IVGAERLTDA TGEVTGQQGS DISQFWWQWM QIGNPTGTVG ATRPVMQTVQ
TSINFGAFDQ GVMSSGGNIS VSAGGNIADL SVSLPTTWYL TAANTDNPTV NTVGGGNLAV
HAGGDILSGG YFVAKGIGTI TADGKIGSDI TLPSLLVGQL PGSLDTLLAT QDGTLDVSAR
QGASIGRVFN PSYVQSTTLL NAYRQYADAQ GYSTNSAVNI TSTTGDVAIG TLKGIDTIGG
GASSATFGLN DMSFVLPARL NLTAFTGGIT IAASGELAPS PSGNLSLIAD QSINFSSLNG
LATSGGGDGT PVFGMLDMDP ASMPSPTNPH ANVPLLTDTT LAAHASSALH EDDTVPARIY
SLNGDIVDGI LQTTGFYDQL VPISIDKPTF IQAGQDIVNL AFQGQNLRKS DVTRIVAGRD
IYDTPFAGTV NAVVPALVLG GPGTFDIEAG RNIGPLTNQI EVNLQGVLTG IDAIGNAKNP
YLPHESANIN VLFGVGPGVD TATFVSTYID PASSVAGVPG TTPALIAFVQ QYEAGQVVDT
GLVSDQPASA PLTAAQAWSK FKALPQYVQQ LFAEQVLFNV LARVGEDYNN PASPYFQKYA
RGYQALNTLF PASLGYTANS LGGGSNGANK QVDTGDLDMR GTTIQTQQGG NVSILGPGGQ
ALVGSTSAPP QIVNDKGTVI AGPGSMGILT LEKGDVNIFT DRSVLLAQSR IFTEQGGDMT
IWSSNGDINA GKGAKSSADT PAPQYQCDAN HYCMVDARGQ VTGAGIATLQ SVPGVSPGTV
NLIAPRGTVD AGDAGIRAGN LNVAALRVVN ADNIQVTGKA TGIPLVQAVN TGALTAASAA
ASAATQVAQD MAKNNASGAS ARRWTISVQV EGFGDAGGDG AKKHRQQVGY DASNAVSVLG
FGQVGPTQRA TLTEDERGRL GKI