Gene Cpin_3756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3756 
Symbol 
ID8359924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4724568 
End bp4736783 
Gene Length12216 bp 
Protein Length4071 aa 
Translation table11 
GC content49% 
IMG OID644965925 
Producthypothetical protein 
Protein accessionYP_003123419 
Protein GI256422766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00737008 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAAAAGA TTCTAATCCT ATTATATACT ATGATGTGTT GTGCCGTGGC GATGGCCCAG 
AACAACCCTA CACCGTATAT TAATGTGCCG AATGAAGTTT GTATCTCGAC CGCAAGTAGC
ACTGCCGGTC AGGCGAAGTA CCTGAGTATT AACGCCTCGA TACCCAACAA CCTCAACAGC
AATGGCTATC GGCCAACTTC TGAGAAAGCA TCATGGACGA TATCGGGACC ATCGGGGACC
GATGCGGACT ATGAAATTCT GTACACAGCT AATACCACTG CAACGACAAA AGCAACTAAG
CTACAAAAAA CAATGTCGCT GACCTTGCAG TTTATGCAGC CAGGTACTTA TACTGTGAAC
ATTTCCATTC CCTATACGTA CAATAATGGA ACGGTAAGAA CATATACCAC CAGCCGTACG
ATTATAGCGC ATGACTGTAC TATTAATATC TGTGGATCAA ACGAGGCCAA CGATAAACCA
GGCTTCTTTG AAGACTTCGG TACCATGGCC AACGGTGTCA CCCGCAGAAA ATATCCGATT
GATGGCGTAG TAACCTACGA CTATCAGGGT ACGGGAGAAC TTGCAGATAA CTATTATTCA
ATCTCCAACA CGACCCAGTT AAAGGGAGAC TGGGTAAACA ACACTGACCA CACCGGTAAC
AATCGTGGTG CGATGCTTGT TGCGAACTCC GCTTATCTCC CGAAAAGAAT GTATCAGAAA
ACAGTAACAG GTCTCTGTCG CGGTTCTGTA TACAACTTTA GTGCATGGTT GATTAACATC
AACCCTATCG GCGTTTTTGA AAGTGGTTGC GTCAGCAAAT ATCAATATGC TGGTGTGACC
TTCCAGGTGG TGAATGCCGC CAACCCAAGC CAGATTCTGG CAAACTTCCC TACTTATAAC
GTGTCTATGG ACCTCAGTAA CGCTAAATCG AGCTGGCAGA AATATGGTGG CTCATTTACG
GTACCGTCGA ATATCGACTC CGTTAAGGTA ATTATCATGA ACGACATGGA TGGTGGTTGT
GGTAACGATA TCGCGATTGA CGATATCGAA TTCGCATATT GTAGCCCAAG CATCACCGCA
TCTATTAAAG GTAAGACAGA TAACCTTGCG GAAGTAGTCT GCGAAGGCGC TCCGATTACC
CTTACATCCA ATTATACTCC TACTACTTAC TTTACAAATC CGGCTTACCA ATGGGAAATG
AGTGATGACG AAGGTCTCAC TTATGTGAAT GTTCCATTTG GTACCAATAC TTCGAAAGAA
CTGGTAATCG GACCTGGCGA GCTGAAAGGT ACCAGGAACG TTCCCACTTC TTATCGTTTC
CGTGTACGTA TTTATGAAGT CGGGAGTAAC TCGGTCACGT GTGCGGCTCC CTCGGAGTTC
GTCCGACTGA CGATACTACC AATGCCTCAG CTGTACCTGA CGAAGAGCAA GGTCTGCGCA
GGCGCATTTG TTGAGTTACA GGCATCGGGT GGTTTCGACC GATTTACGTG GAGAGACCTT
CCTGGATATG TGGGCGATAC CCGAACTATC CAGGTACTCG GCGACACCAC TATTATGGTG
TACGGTTATG TGGATTATGC GGACGGACAT ACGTGTGTGG ACTCAAACAC CGCGTTTATC
AGTTCTGTAG AATCACCAAT TGTAGAAGTA CTTTCTACTT CACAGAATAT TTGTGAAGGA
TCTTCTGTTG ATATTAAGAT CAATGACGCC ATTAATAATG GTACTAACAT TATTAAATGG
TATCAGGGTC CGAATGCAGG TGGTACACTT ACTCCTTTAC CAGCATACGA TGGTCTGACG
ACGCTGTCTC AGGTGCAGAT AAACAATGCA GCTGAAGGCG TATTCACCGT TATCGTAAAT
ACACCTGGTG ATGTGTGTCG GGTACAATCC GCACCATTCG TTGTTAATGT AACGCCAATA
CCTGTTGCCG AAGCGGGTCC GTCACAATAC GCTTGTGCGA GCACCAACTC TTCCGGTAAC
TTCACCATGG CAGCATTGCT GAATGCTGGT GAAAGCGGTG TATGGACAAT CGACTCCATC
TGGGGTCCGG CTGCACCTCC TGATACAGTA GGTGTAAACC TGAAAGACTA TGCAAACATC
CTGCTGCCTA CGCTTCGTAA CACAAGGGTA ACCCTGAAAA AAGGTGGTAC CAGCGTAAGA
TTCAGATGGA CTGTAAAAAG CTCAGCGAAC AATTCCTGCG CAAGCAGCGA CACCGTAACC
CTGTCTCTCC TGTATGATCC GTCTTACAGC GATGCAGGTC CGGATACTAC ACTCTGTGGT
AATAATAATG TATTCACTAT GCGGGCCAGC CGACCTGACC CTACGCTGAC AGGATTGTAT
GCGGAGACAG GTACCTGGAG ACTGGTAAGC GGTAACGCGA CCATTGCGAA CATACATCAA
TATAATACTA CAGTAACTTC TCTGGTAAAT GAGCAGGATA TCGTACTGGA ATGGACGATT
ACCAATGCAG CCAATTGTAC GCCGAACGCG GATCTGGTTG TATTGCATAA AACAACCAAA
CCAGTTATCA GACTGCAACC AGTACCCGTA GTTTGTAATA CTGCGACTAC CTTCTCCCTG
GATACAATTT CCACAAAAGG TAATCCGAAT GTGTATACAC TGACTACCGG TACGCCAGCG
ATGCCAGGCT TCACGGCTAT CACCAATGAT ACTATCAGAG CATGGCCGAT GACTTTCAAC
ATCCCTGCAA ACACGCCTGC TGGTAACTAT AGCTTCAACA TGAGCTATAA AAACAGCGCT
TCTGCGGGTT GTGACTCTAC GATCACTTTC ACGGTAGGTG TAGCAACGCC ACCAACGGCT
CCGACTTCCG TAACTGTAGG TACTCCAGGT ATCTGTGTAA CCGGTTCCAC TACACTGACT
GTTGTAGGCG GTAACCTGGG TACTCAGCCA AATGGTACGC CGAATGCAGT CTGGAGATGG
TACGCAGGTG GTTGCGGAAC AGGTACTGCT ATCGGTACCG GTGCAACGAT CACCGTAAAT
AACATCACTG CTACCACTAC TTATTATGTA CGTGCTGAAA GCACTGTCGG TGGTTGTGGT
AATACTACCT GTGCAAGCGG TACGGTAACT GTTTACCAGC AGCCAACAGC TTCTAACGCA
GGTCCTAACC AGACAAAATG TAATACGCCT GGTTTCACCA TGGCGGCTAA TACACCAACA
TTAGGTACAG GTGCATGGAC ACTGGTGCCG GTAACTGGTA CAGGCGCTAC CATCACTGGC
AGCGTGTCCA GTCCGACCAC TACCATCAAC GTTCCTGCCG GTGTAACAGT AAATGCAATA
TGGACCATTA CCAATGGTAC ATGTACAAGC AGCAGCACCG TGGTTCTCAG AAACGACGTA
CTGCCAACTT CTAACGCGGG TAGCGATCAG TCTAAATGTA ACACCCCGGG CTTTACCGTA
ACAGGTAATA CGCCTACAGT GGGTTCAGGT ACATGGACAC TGACAGCGGT AACCGGTACT
GGTGCTACTA TCACTGGTAG CGCTACTACT CCGTCTACTA CCATCAACGT TCCTGCCGGT
GTAGTTGTAA GAGCTACATG GACAGTAACA AATGGTAGCT GTACTGTTTC TTCATTTGTT
ACACTGACCA ACTATGCGCA GCCAACTGCT AACGCTGGTT CAAATCAGTC TAAATGTAAC
ACGCCTTCAT TCACAATGGG TGCTACAACA CCAACTGTGG GTACAGGTAC CTGGACACTG
ACGCCGGTAA CCGGTACTGG TGCTACCATC ACTGGCAGCG CAACCAATCC GGCTACTACC
ATCAACGTTC CTGCCGGTGT AGTTGTAACA GCTACATGGA CGGTAACAAA TGGTAGTTGT
ACTGCGTCTT CGTCTGTAAC ACTGACCAAC TTCGCGCCTC CGACTACATC GAACGCAGGT
CCGGATCAGG AACAGTGTAA CGTTTCAGCC TTCACCCTGG CTGGTAACGC ACCAACAACC
GGTACCGGTG CATGGAGCGT AGTTTCTTCT GCTCCTGCAG GTTTCACTTT AACGGCCGCT
CAGATGAGCA ACCGCAACGC TGCTATTACT ATCCCTGTGG GTACTACGGT AACATTGCGC
TGGACCATCA CAAACGGTCT TTGTACATCT ACTGACGACG TCGTCCTGAC CAACAGACCG
CTGCCAACAA CAGCTGCTGC CGGTCCTGAT CAGGCAAAAT GTAACAATAC TTCTTTCACA
CTGTCCGCTA ATACGCCAAC AACAGGTACC GGAGCATGGA GCGTAGTTTC TGCTACTCCA
ACCGGCTTCA CATTCCCTGC TGCAAGCGTA AGTAACCCAA CCGCTACGAT CACTGTTCCT
GTAGGTACAA CGGTGACCCT GCGCTGGACT ATTACTAACG GTACATGTAC TTCTACTGAT
GATGTAGTAT TAAGAAACGA TGCACTTCCA ACTACACCAA ACGCTGGTCC TGACCAGTCT
AAGTGTAATA TTTCTGCTTT CACCCTGGCG GCTAACACCA TCACGGTAGG AACAGGAGCA
TGGAGCGTAG TTTCTTCTAC TCCTGCAGGT TTCACTTTCC CTGCCGGTAG CGTTAACAAC
CCAACTGCAG CGATCACCGT ACCTGCCGGT ACGACTGTGA CCCTGCGCTG GACCGCTACA
AATGGTACTT GTTCTGCTAC GGACGATGTA GTGTTGACAA ACTTTGCAAC ACCAACAACC
TCCAATGCAG GTCCGGACCA GTCAAACTGT AACAATCCGT CCTTCACGCT GGCAGCTAAC
GCGCCGACAA TCGGAACAGG AGCATGGAGC GTAGTTTCTT CTACTCCTGC CGGATTTACT
TTCCCTGCTG CCCAGGTAAA CAACAGAACA GCTGCTATCA GCGTACCTGC CGGTACGACG
GTAACACTGC GCTGGACGAT TACCAATGGT GTTTGTTCTT CTACAGATGA TGTAGTTATT
ACAAACTTCG CAGCACCAAC TGCTGCTAAT GCTGGTCCTG ACCAGTCTAA ATGTAATACG
CCTTCCTTCA CGATGGCAGC TAACGCTGCT TCTGTAGGTA CTGGTACATG GTCACTGGTT
CCAGTGGTTG GTACCGGCGC TACCATCACA GGAAGTGTGA ACAGTCCGAA CGCTGTCATC
AACGTTCCTG CTGGTGTAAC TGTAAACGCG GTATGGACCA TCACAAATGG TACCTGTACG
ACCAGCGATA TAGTTATACT CAGAAACGAT GTACTGCCAA CTGCAAATGC AGGTCCGAGT
CAGACAAAAT GTAACATTTC TACCTTCACC ATGGCGGCTA ACACGCCAGC GGTAGGTGGT
GGTATATGGA CGCTGCCTTC AGGTTCTGCT GCAACGATCA CTGCTGGTCA GCAGAATAAC
CCTGCTGCGG TGATCAACGT TCCTGCCGGT ACTTCTGTAA CCGCGACATG GACGGTATCA
AACGGTACCT GTACTGTTGC TTCTTCTGTG ACACTGACCA ACAACGCATT GCCAACTGCC
AATGCGGGTC CTGCACAGCA GAAATGTAAT ACGCCTTCCT TCACAATGGC GGCTAACACG
CCGACTGTCG GTACCGGCGT ATGGTCACTC CCTGCTGGTA CAACTGCGAC CATCACCGGA
AGCACTACCA GTCCGACCAC TACCATCAAC GTTCCGGTAG GTACTTCCGT AGTAGCTACA
TGGACTGTAA CAAATGGTAG CTGTACGGTT TCTTCTACTG TAACACTGAC CAACTTCGCG
CCTGCTACTG TTTCAAGTGC CGGTCCGGAT CAGGAAAAGT GTAACGTTTC CAGCTTCACA
ATGGCTGCTA ACGCCCCAAC AGTAGGCACC GGCGCGTGGA GCGTAGTTTC TTCTGCTCCT
GCTGGTTTCA CTTTAACCGC CGCTCAGATG AGCAACCGCA CTGCTGCTAT CACTATCCCT
GTGGGTACTA CGGTAACACT GCGCTGGACC ATCACCAACG GCGTTTGTAC ATCTATTGAC
GATGTCGTCC TGACCAACAG ACCGCTGCCA ACAACAGCTG CTGCCGGTCC TGATCAGGCA
AAATGTAACA ATACTTCTTT CACACTGGCC GCTAATACGC CAACAGTAGG AACAGGAGCA
TGGAGCGTAG TTTCTGCTAC ACCTGCCGGC TTCACATTCC CTGCTGCAAG CGTAAGTAAC
CCAGGCGCTG CGATCACTGT TCCTGTAGGT ACAACAGTAA CCCTGCGCTG GACGATTACT
AACGGTGTGT GTACTTCTAC CGACGATGTA GTATTAAGAA ACGATGCACC TCCAACTACA
CCAAACGCTG GTGCTGACCA GTCTAAGTGT AACACATCTG CTTTCACCCT GGCGGCTAAC
ACCATCACGG TAGGAACAGG TGCATGGAGC GTAGTGTCTT CTACTCCTGC CGGATTTACT
TTCCCTGCCG GTAGCGTGAA CAACCCAACT GCGGCGATCA CTGTACCTGC CGGTACGACC
GTGACCCTGC GCTGGACCGC TACAAACGGT ACCTGCTCTG CAACAGATGA TGTAGTGCTG
ACTAACTTCG CAGCACCAAC TCCTGCCAAT GCAGGTCCTG ACCAGCAGGA ATGTAACAAT
ACGTCTACAT TCACACTGGC GGCAAATGCT CCTTCTGTAG GTACAGGTGT ATGGAGCGTA
GTTTCTTCTA CTCCTGCAGG ATTTACTTTC CCTGCTGCTC AGGTGAATAA CCGTACAGCA
AGTATCAGTA TCCCTGCCGG TACCTCAGTA ACCCTGAGAT GGACGATCAC CAATGGTGTT
TGTACCACTT CAGATGATGT AACACTGACC AACTTCCAGG AACCAAATCT GGCGAATGCC
GGTCCTGACC AGGAAAAATG TGCGGGTGCC GACTTCGTAA CTGCTGCGAA TGCGCCAACA
GTTACCGGCG CTACCGGTAT GTGGTCTGTG ATAAGCGGTA ACGCAACGAT CAGAACAGGG
GAAGAAAGTA ATCCGATTGC GCACATCACT GTGCCGAACG GTGAAACAGC TATCCTGCGC
TGGACGTTTA CAAACGGTAC ATGTTCAAAC TTTGACGAGG TTGAACTGAC CAACTACCTG
ACACCATCTC CGGCTAACGC GGGTGTGGAT CAGCGTGAAT GTAATATCTC TACGTTCTAT
ATGGGTGCGA ATGCACCGGA TGTTCCTGGT GCTACCGGTA CCTGGACACT GGCTGCTGGT
TCTCCTGGTT CAATCAATGC AGGTGACGAC AACAATCCTT CTGCGCTGAT CAATGTTCCT
GTGGGTACTA CTGTAACTGC TATCTGGACG ATCACAAACG GTACATGTCC GACTTCAGAC
ACTGTGTTGC TGACGAACGA CATTATGCCA ACTGCTGCAG ATGCAGGTCC TGACCAGGAA
CATTGTAACA TTCCTGCCTT CAGAATGGCT GCTAACACAG CTTCTGTCGG TGTAGGTAAA
TGGAGCCTGA CGCCTGGTAC TACTGCTTCA TTCGCAGTAG CTGACAGTAC TAATCCAAAT
GCGGTAATCA CCGTTCCTGC GGGTGTAACC GTAACAGCTA CATGGACTAT TACGAATGGT
TACTGTGTGA CATCTGACGA TGTCGTACTG ACTAACAGAG TAATGCCAGC TGCTGCTGTT
GCCGGTCCTG ACCAGCAGAA ATGTAACACG CCTGCCTTCA CAATGGCTGC TAACGTCGCC
AACGTAGGTG TAGGTAAATG GAGCCTGGTG CCAGGTACTA CTGCTTCTTT CGCAGTAGCC
GACAGTACCA ATCCGACTGC TGTAATTAAT GTTCCTGTAG GCGTAACAGC ACAGGCTGTG
TGGATCATCA CCAATGGTGT GTGTGAAACC AGAGACACTG TAGCGCTGAC TAACTATGAA
ATGCCAACTG CAGCTGCTGC CGGTCCTGAC CAGGCACAGT GTAACACACC TGCCTTTACA
ATGGCTGCTA ACGCTGCTTC TGTGGGTACA GGTAAATGGA GCCTTGTGCC AGTAACTGGT
ACCGGTGCAT CCTTCGCAGT AGCTGACAGC ACCAGACAGA ATGCGGTAAT CAACGTACCT
GCCGGTGTAA CAGTTGACGC GGTATGGACG ATCTCTAACG GTGTCTGCGT AACAAGAGAT
ACTGTAAGAC TGACCAACTT CGTAATGCCA ACAACCGCTG CTGCCGGTCC TGACCAGGCA
CAGTGTAACA CGCCTGCCTT TACAATGGCT GCTAACACAG CTTCTGTAGG TACAGGTAAA
TGGAGCCTTG TGCCAGTAAC TGGTACCGGT GCATCCTTTG CGGTAGCTGA CAGCACCAGA
CAGAATGCGG TAATCAACGT ACCTGCCGGT GTAACAGTTG ACGCGGTATG GACGATCTCC
AATGGTGTCT GCGTAACAAG AGATACTGTG AGACTGACCA ACTTTGTGGC TCCTGCTGCT
GCTAATGCCG GTGCAGATCA GACACATTGT AACACGCCTG CCTTTACAAT GGCCGCTCAG
GCGCCAACTG TAGGTGTAGG TAAATGGAGT CTGCCTGCTG GTTCTCCGGC TTCCTTCGCG
GTTGCCGACA GCACCAACCC TAACGCTGTC ATCAACGTAC CTGCGGGTCT GACGGTAACG
GCTACATGGA CTATCACAAA TGGTGTTTGT ACCACTTCTG ATAATGTGAT CCTGACCAAC
TACGAAATGC CATCTAACGC GGCTGCTGGT CCTGACCAGG TTCATTGTGA CGATCCGATG
TTCACAATGT CAGCTAACGT TCCTGCTCCT GCTACAGCGA GAGGTATCTG GACAATCGTA
AGCGGTACTG CAACTATCAC AGATCCAAAC AATCCTTCTA CAACTGTAAG AGTAGATCCG
GGTCAGACAG TAACCCTGCG CTGGACGATC TCTAACGGAA CATGTACATC CACTCCTGAT
GATGTGGTGC TGACTAACCA GGCAATGATC CTGGGTAACA CCATCACCGC TGATCAGCTG
CTTTGTGGAA ACGAAACACC TGCAATGCTG CAGGGTGCTA CCCTGAGCGG TGGTAACGGT
ACCTTCACTT ATCAGTGGCA GGTAAGTACT ACCAGCGCAA CTATCGGTTT CGCAAACGTA
ACCGTAGGTG GTACCAATGC AACATTCACT CCTCCGATGA TCACCCGTAA TACCTGGTAC
AGACGTGTGG TAATGTCCGG TGCTTGTACC GGTAACATCA GCAACGCAGT AATGCTGACA
CTGATGAACA TTCCTCCGGT AGTAATATCC GTTCCTGGTC CGCTGACAGT GGATTGCGAA
CAGGGTACTG ACTACACCCA GCAATTCGGT ACACCGGTGT TCAGCCATGC TCCTTATGAC
AATGAGCCAC TGAGCATCAC TTACAACGAC GTAACAGTAA CGGTTGACGC TTGTACCTTC
ACGGTAAGAC GTACATGGAC AGCGACTGAC CGTTGTGGAC TGACCACTCA GGCACAACAG
ACAATCACCG TGGTAGACAC GAAAGCGCCT GTATTCGCAG GTACTGCTCC TGCCAACATC
ACTGTAGATT GTGATAAAGT ACCTGCTGCG GTTACTATCC CTGCAAATGA CGCTTGTAAC
GGCGCTATGA CGATCACTCC GATCGAAGTG AGAATCGACC AGCCAGGTGC TTGTGCAAGC
AACTACCAGC TGATCCGTAA ATGGGTAGCA GTTGATGCTT GTGGTAACGC AAGCGATACG
CTGAGACAGA TCATCACTGT AAGAGACATG ACGCCGCCTG TATTTGACGG TACTGCTCCG
ACCAATATCA CTGTAGATTG TGATAAGGTT CCTGCCGGTA CGCCAATGAC AGCGACTGAC
AATTGTACGC CTGGCGTCCT CACTATCAAT CCGGTAGATA CCCGTAAGAA CATCTCCGGT
AGCAAGTGTG CTGACAACTA CCAGATCATC CGTACCTGGA CGGCTACTGA CCTTTGCGGT
AACAAAACCG TATTGACACA GACCATCACT GTTCAGGATA CTATCAAGCC TAGATTCTCC
ATGGCCGTAC CTCCGGCAAT CACTGTAGAT TGCGACAAGG TACCGTCTGT TGAAACTATA
ACAGCTACAG ACAACTGTAC TTCAACCGTA GCCGTGAGAG TTACAGAGAG AAGAGATAAC
CTTTCTTCTT CCTGTGCAAG CAGCTACAGA CTGACACGTA CCTGGACAGC TACCGATAAC
TGTGGTAATA CTAACGTAAT GCAGCAGGTG ATCACTGTAC AGGATACTAC AAGGCCGGTG
TTTGTTGTAG CACCTCCTGC TGATACAACT GTAAGCTGTG ACGCAGTTCC TGCTCCTCCG
ACCAATCTGA GAGCGACTGA CAATTGCAGC ACTGTGAAGA TCAGTTATGC GCAGACACGC
GAAACGATCC AGGGCGCTTG CGCAAGCAAC TACCGTCTGA TCCGTATCTG GACTGCAAAA
GATGCATGTA ACAACACGGC TATCTTCCGC CAGGTAATCA CCGTAACGGA TACTACAAGA
CCAACGATCG ATCCTGCTCC GGCAAACGTA ACGTTGAATT GTGGAGACGC TATCCCTGCT
GCGGCGACAT TGTATGCGCG TGACAATTGT GATGCTACTT TCCCTAAGAA GGCTATCATG
ATCCAGGATC CGTTTACAGT AGACCTGTGT GCTGGTTATA CCATCACAAG AAGATGGACT
ATCACTGACG CTTGTGGTAA CGCAGCAACT GAACGTGTAC AGACAATCAC GGTGAATCCT
TGTCCGAAAC CGGCCCTGGT TCCATCACTG CCTGCTAACT GTTCTGACAA TACCAAATTT
GCAATCCTGC TGGAGAATAA GGTAAGCAGA CCTAAATTCA CCCTCACGAG CGTAGTTCCG
GCTACTGCGG TTAACACTCC GCTGACACAG AGCAGCAACG TGTTTGACCT GAATGGCGCC
ACTCAGGCTA CCTTCATCGT AACAGACGGT GTAACGGGTT GTGTATCTGA TCCTATCACG
TACGATCTGC AATATGTAAC CAAACCAACG GTTGAACTGG GTAACGACGT AACTATCTGT
CAGGGTAGTA TCGCCACCCT CGATGCCGGT ATCGCAAACG ACGCTTACAC CATCCGCTGG
TCAACTGGTG CTACTACCAG AACCATCGAT GTGAGCACGG CTGGTACTTA TTATGCGACT
GTGACCAACG GTATCTGTTC TGCAACAGAC TCTGTGAAAC TGATCGTAAA TCTGCCACCG
CCGATTAATA TCCGTGACAC TGCTATCTGT CAGGGCGAAT CAGTAACACT GAACGCTTAT
GTAGAAGGTG GCAGCTATGT ATGGTCAACC GGTGAAACCA CTGCTTCTAT CACTGTAAAT
GCTACTGGTA CTTATGGTGT GGATGTGACT GTAAATGGTT GTTCTAATCA CGGTGAAGCA
GTTGTTCTCG TAGGTACGCC GCCAAATATC GTTCTGACTG ACGATACTGA ACTGTGTCCT
AACGAAACAG TTATGCTGAA CGTAGAACCT GATGGAGGTT CTGTACTCTG GAATACCGGT
GAAAACACCA ACTCGATCGT AGTATCAAGA CCAGGCGACT ATACAGTGAC TGTAACCCGC
GACGGTTGCG TGGTAACTGA TAAAGTGACT GTCACTTTAA GACCTGACCT GGGTATTGAC
CTCGGACCAG ACAGAGAGTT CTGTAACGGT GGACGTGTAG TAATTGACGC TAGTCACCCG
GATGCCATCT CTTACCTCTG GAATGACGGG GATACGAATC CGGTGAAAGA AATCACTGGT
GCTGGTAAAT ATGTAGTGTC TGTAATGGAC AGATTCTGTT CAAGAATTAC TATGGATAGT
GTGAATGTAA CTGTGGCTGG TATTCCTGAC TTTGATCTGG GCAGAGACAC CATGTTATGT
ATCGGTGAAG ATCTGACACT GAGAGTAAAC GCTGGTGCGG GTAACACTAT ACGTTGGCAG
GATGGTTCAA CCGCTGCTAC TTATAAGGTG ACAACACCTG GCACTTACAC AGTGACAATC
TCCAATGATT GTGGATCCAT GTCTGACCAG ATTGTGGTTC GCTATCAGCC TTGTGAGGCT
AAACCAGAAT TCCCGACGGG CTTTACACCA AACGGTGACG GACATAACGA CATCTTCAGA
CCTGTCGTTC GCGGTCCGAT GTATGACTAC GACTTACGTA TCTACAACCG TTGGGGTGAA
CTGATCTTCT TAAGTAAAGA TCAGAAGACC GGATGGGATG GCCGATATAA AGGCGCCCTG
GTTGAAAACG GAACTTATGT CTGGATGCTG AGCTATAAGA AATCGCTCGG TGGCAACACA
AATGTTGTGA AAGGCGAAGT AACCGCTATC AGATAA
 
Protein sequence
MKKILILLYT MMCCAVAMAQ NNPTPYINVP NEVCISTASS TAGQAKYLSI NASIPNNLNS 
NGYRPTSEKA SWTISGPSGT DADYEILYTA NTTATTKATK LQKTMSLTLQ FMQPGTYTVN
ISIPYTYNNG TVRTYTTSRT IIAHDCTINI CGSNEANDKP GFFEDFGTMA NGVTRRKYPI
DGVVTYDYQG TGELADNYYS ISNTTQLKGD WVNNTDHTGN NRGAMLVANS AYLPKRMYQK
TVTGLCRGSV YNFSAWLINI NPIGVFESGC VSKYQYAGVT FQVVNAANPS QILANFPTYN
VSMDLSNAKS SWQKYGGSFT VPSNIDSVKV IIMNDMDGGC GNDIAIDDIE FAYCSPSITA
SIKGKTDNLA EVVCEGAPIT LTSNYTPTTY FTNPAYQWEM SDDEGLTYVN VPFGTNTSKE
LVIGPGELKG TRNVPTSYRF RVRIYEVGSN SVTCAAPSEF VRLTILPMPQ LYLTKSKVCA
GAFVELQASG GFDRFTWRDL PGYVGDTRTI QVLGDTTIMV YGYVDYADGH TCVDSNTAFI
SSVESPIVEV LSTSQNICEG SSVDIKINDA INNGTNIIKW YQGPNAGGTL TPLPAYDGLT
TLSQVQINNA AEGVFTVIVN TPGDVCRVQS APFVVNVTPI PVAEAGPSQY ACASTNSSGN
FTMAALLNAG ESGVWTIDSI WGPAAPPDTV GVNLKDYANI LLPTLRNTRV TLKKGGTSVR
FRWTVKSSAN NSCASSDTVT LSLLYDPSYS DAGPDTTLCG NNNVFTMRAS RPDPTLTGLY
AETGTWRLVS GNATIANIHQ YNTTVTSLVN EQDIVLEWTI TNAANCTPNA DLVVLHKTTK
PVIRLQPVPV VCNTATTFSL DTISTKGNPN VYTLTTGTPA MPGFTAITND TIRAWPMTFN
IPANTPAGNY SFNMSYKNSA SAGCDSTITF TVGVATPPTA PTSVTVGTPG ICVTGSTTLT
VVGGNLGTQP NGTPNAVWRW YAGGCGTGTA IGTGATITVN NITATTTYYV RAESTVGGCG
NTTCASGTVT VYQQPTASNA GPNQTKCNTP GFTMAANTPT LGTGAWTLVP VTGTGATITG
SVSSPTTTIN VPAGVTVNAI WTITNGTCTS SSTVVLRNDV LPTSNAGSDQ SKCNTPGFTV
TGNTPTVGSG TWTLTAVTGT GATITGSATT PSTTINVPAG VVVRATWTVT NGSCTVSSFV
TLTNYAQPTA NAGSNQSKCN TPSFTMGATT PTVGTGTWTL TPVTGTGATI TGSATNPATT
INVPAGVVVT ATWTVTNGSC TASSSVTLTN FAPPTTSNAG PDQEQCNVSA FTLAGNAPTT
GTGAWSVVSS APAGFTLTAA QMSNRNAAIT IPVGTTVTLR WTITNGLCTS TDDVVLTNRP
LPTTAAAGPD QAKCNNTSFT LSANTPTTGT GAWSVVSATP TGFTFPAASV SNPTATITVP
VGTTVTLRWT ITNGTCTSTD DVVLRNDALP TTPNAGPDQS KCNISAFTLA ANTITVGTGA
WSVVSSTPAG FTFPAGSVNN PTAAITVPAG TTVTLRWTAT NGTCSATDDV VLTNFATPTT
SNAGPDQSNC NNPSFTLAAN APTIGTGAWS VVSSTPAGFT FPAAQVNNRT AAISVPAGTT
VTLRWTITNG VCSSTDDVVI TNFAAPTAAN AGPDQSKCNT PSFTMAANAA SVGTGTWSLV
PVVGTGATIT GSVNSPNAVI NVPAGVTVNA VWTITNGTCT TSDIVILRND VLPTANAGPS
QTKCNISTFT MAANTPAVGG GIWTLPSGSA ATITAGQQNN PAAVINVPAG TSVTATWTVS
NGTCTVASSV TLTNNALPTA NAGPAQQKCN TPSFTMAANT PTVGTGVWSL PAGTTATITG
STTSPTTTIN VPVGTSVVAT WTVTNGSCTV SSTVTLTNFA PATVSSAGPD QEKCNVSSFT
MAANAPTVGT GAWSVVSSAP AGFTLTAAQM SNRTAAITIP VGTTVTLRWT ITNGVCTSID
DVVLTNRPLP TTAAAGPDQA KCNNTSFTLA ANTPTVGTGA WSVVSATPAG FTFPAASVSN
PGAAITVPVG TTVTLRWTIT NGVCTSTDDV VLRNDAPPTT PNAGADQSKC NTSAFTLAAN
TITVGTGAWS VVSSTPAGFT FPAGSVNNPT AAITVPAGTT VTLRWTATNG TCSATDDVVL
TNFAAPTPAN AGPDQQECNN TSTFTLAANA PSVGTGVWSV VSSTPAGFTF PAAQVNNRTA
SISIPAGTSV TLRWTITNGV CTTSDDVTLT NFQEPNLANA GPDQEKCAGA DFVTAANAPT
VTGATGMWSV ISGNATIRTG EESNPIAHIT VPNGETAILR WTFTNGTCSN FDEVELTNYL
TPSPANAGVD QRECNISTFY MGANAPDVPG ATGTWTLAAG SPGSINAGDD NNPSALINVP
VGTTVTAIWT ITNGTCPTSD TVLLTNDIMP TAADAGPDQE HCNIPAFRMA ANTASVGVGK
WSLTPGTTAS FAVADSTNPN AVITVPAGVT VTATWTITNG YCVTSDDVVL TNRVMPAAAV
AGPDQQKCNT PAFTMAANVA NVGVGKWSLV PGTTASFAVA DSTNPTAVIN VPVGVTAQAV
WIITNGVCET RDTVALTNYE MPTAAAAGPD QAQCNTPAFT MAANAASVGT GKWSLVPVTG
TGASFAVADS TRQNAVINVP AGVTVDAVWT ISNGVCVTRD TVRLTNFVMP TTAAAGPDQA
QCNTPAFTMA ANTASVGTGK WSLVPVTGTG ASFAVADSTR QNAVINVPAG VTVDAVWTIS
NGVCVTRDTV RLTNFVAPAA ANAGADQTHC NTPAFTMAAQ APTVGVGKWS LPAGSPASFA
VADSTNPNAV INVPAGLTVT ATWTITNGVC TTSDNVILTN YEMPSNAAAG PDQVHCDDPM
FTMSANVPAP ATARGIWTIV SGTATITDPN NPSTTVRVDP GQTVTLRWTI SNGTCTSTPD
DVVLTNQAMI LGNTITADQL LCGNETPAML QGATLSGGNG TFTYQWQVST TSATIGFANV
TVGGTNATFT PPMITRNTWY RRVVMSGACT GNISNAVMLT LMNIPPVVIS VPGPLTVDCE
QGTDYTQQFG TPVFSHAPYD NEPLSITYND VTVTVDACTF TVRRTWTATD RCGLTTQAQQ
TITVVDTKAP VFAGTAPANI TVDCDKVPAA VTIPANDACN GAMTITPIEV RIDQPGACAS
NYQLIRKWVA VDACGNASDT LRQIITVRDM TPPVFDGTAP TNITVDCDKV PAGTPMTATD
NCTPGVLTIN PVDTRKNISG SKCADNYQII RTWTATDLCG NKTVLTQTIT VQDTIKPRFS
MAVPPAITVD CDKVPSVETI TATDNCTSTV AVRVTERRDN LSSSCASSYR LTRTWTATDN
CGNTNVMQQV ITVQDTTRPV FVVAPPADTT VSCDAVPAPP TNLRATDNCS TVKISYAQTR
ETIQGACASN YRLIRIWTAK DACNNTAIFR QVITVTDTTR PTIDPAPANV TLNCGDAIPA
AATLYARDNC DATFPKKAIM IQDPFTVDLC AGYTITRRWT ITDACGNAAT ERVQTITVNP
CPKPALVPSL PANCSDNTKF AILLENKVSR PKFTLTSVVP ATAVNTPLTQ SSNVFDLNGA
TQATFIVTDG VTGCVSDPIT YDLQYVTKPT VELGNDVTIC QGSIATLDAG IANDAYTIRW
STGATTRTID VSTAGTYYAT VTNGICSATD SVKLIVNLPP PINIRDTAIC QGESVTLNAY
VEGGSYVWST GETTASITVN ATGTYGVDVT VNGCSNHGEA VVLVGTPPNI VLTDDTELCP
NETVMLNVEP DGGSVLWNTG ENTNSIVVSR PGDYTVTVTR DGCVVTDKVT VTLRPDLGID
LGPDREFCNG GRVVIDASHP DAISYLWNDG DTNPVKEITG AGKYVVSVMD RFCSRITMDS
VNVTVAGIPD FDLGRDTMLC IGEDLTLRVN AGAGNTIRWQ DGSTAATYKV TTPGTYTVTI
SNDCGSMSDQ IVVRYQPCEA KPEFPTGFTP NGDGHNDIFR PVVRGPMYDY DLRIYNRWGE
LIFLSKDQKT GWDGRYKGAL VENGTYVWML SYKKSLGGNT NVVKGEVTAI R