Gene Cpin_5368 details


Gene Information

Locus tag: Cpin_5368
Symbol:
ID: 8361545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6858795
End bp: 6870578
Gene Length: 11784 bp
Protein Length: 3927 aa
Translation table: 11
GC content: 47%
IMG OID: 644967516
Product: outer membrane adhesin like protein
Protein accession: YP_003125000
Protein GI: 256424347
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain; [TIGR01965] VCBS repeat
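
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6858795
end_bp = 6870578

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 11784
print(protein_length)   # 3927
```

Both results agree with the Gene Length (11784 bp) and Protein Length (3927 aa) listed in the record.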


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAC GTATCGTAAA CGTTTTATCA AGCCTGGTCT TGATATTCTG TTCAATGGTG 
ACGTTCGGCC AGATGCCGTT CAGCCCCTGT AACCCCAGTG GGGGAAAGAA GGGAGACATT
CTGGGAATTG AGACCAGGTA CAGAGCGCCC GGTGGGGCAG CACCCGTATT TAACATCCCT
GCAGGTACAA GGTCAATTAC AGTGTATATT TCCTCCGAGA CGGGTATTAC CACGATGTTG
GATAATCCGC AGGGAGATGA GGACTTCATG ACTGTCAATG CAATTATTGA CCTGACCTCC
AACACTTCTT CCGGTTACGT GAACTTTGCC AAGAACACTT TTGTGGATGG TTCAGGTACA
AACCTGTATG GTTGGCAAAA GGTTCCCCTC GGCGCATATA TCCCTAATGG AAGTAAGTTA
GGGGATGCTA CTCCCAATCT GAACAACGTG AACTTCACGG TGAGCGGCAG TACACTGACC
ATCACCGAAA GTGCGAATAC GATCCATTCG TCGTATTATG TTGAGTATGT GTCTCCGTAC
AACAACTCTA TCAACCCCCT TGACCCGCAG GTCAGGGCTT TGCTGCATGG TACCGGTACG
GCGAATACGG ACCTGACGAT CCCTATTCCG ACAGGCGCGA ACCTGATTTG TATATCAGGT
AAAGGTACTA ACTCAAGCGC TGTTGATCTT AATACATCCG CTGGTACAGA GGAAGGTTAT
TCGAATCTCC GTGTTACCAT AGATATGGAT GCTGGTTATA CGGATGGTTT TGTTACACTG
GCGAATGGTG GTTCTGTCGA CAGACGATCC ACTTATGTCA TTAATAACCT CGCGAGTTCG
TCGACGATGA ATTTCCTTTC ATCCGCCGCG ATCACAGGTG ACTATACTTC TAAACTGACC
ACTGCCGGTG CCGTGGGTGT GTATAATCCT CAGATATATG TCAGCGGTAC CAATCTTGTT
ATCAGGCGTG ATGCGAACTA TGCAAGAGAC TTCGATGATG CTTATGTGGT AGAGTTCTAT
CACAGGGTAG GTCAGGGGAT GAGTGCGGAG TTTATCAATT CTGATATTCA GCCGATACCG
AAGGGCGTGA GTTCTACGAC AGGTATCAGC AGAACATTTA ATATACCTCC TGGTACGAAT
GCTATTTACT TCAATGAAAC GGGTAACGCG TGTAATACCG ACAGAGAAAG TAATGAAAAC
TCCATTGCGG CATATGCTTA CATCGATCTG AATACAGAGA CGGCTACCGG TTATTTTTAT
CAGCAGGTAG GTCTTGATGG CGTTAACAGA CGAGATGACA ACTTTGCATT CAAAGGCGTA
TCATTAAATG GCAGCAGCGC CAGAGCGCAT GCGAGCACCG TTGGTTTCAA AGGGCCTAAT
GCTTACGATA TCGTTTTCAC ATTATCCGCA GATAAAACCC AGCTTACAGT GACCAACAGA
ACGGGCCTGG CGAATCCTGA TTACCAGTTC CTGTTGTCAA TGGACTATTA CGGTGCGAGA
CCGGATGTTG CTTTTGATGC TTCCAGTATT GCGCTGGCAA AAGGTCCTTC CTGTAATGTG
ATCAGGGCCA CCTTCAACGT GTGTAACCCG GGAGCTGGTA ACAGCAGCGG GGGGATGCCG
GTATCCTTCT ATCATGGAGA TCCTACTACG GACCCTGCGG CGGTATTATT ATATACCGGT
GCATTTGGAA GTGGTTTACT GGAAGGGGAG TGTAAGGTAT TTACCTACGA TGTTCCGATG
AACGGTTTTG ATGACCTCAA CGTACCGATG ACCATTGTCA TAAACGACAA TGGTAGTTTT
GTTACAGGAG GTGTGGGTAC AGCCGTGGGT ACACCATTTA CATTGGCGTC ACTGGTGAAC
CAGAATTCCC TTTATAAAGA ATGTAACTAC GATAATAACC TCATTACAAG AGTCTTAAAT
GTAAATAACT GTCCGGTTCC CAATCTCGAT GCAGATAACA GTTCCGGAGC GGTTGGCAGA
TATAACTACC TGAACTATTT CAATGCAGGT ACTCCTGGTG GTGTGAAGAT CAATGATGCA
GATCTTGCCG TGGTGGATCC GGGTGGCACG ACAATCGCTT CCGCTACGAT TACATTAACG
AATAGATTAG ATGGAGCTGC TGAGTCGGTT TTCATCAATG GTACATTACC GGCAGGTATT
ACTGCAACCG GTAGTGGAAC AGGTACAATC GTGTTGTCAG GTGTTGCGTC ACAGGCTGCT
TATGTTGCTG CGTTGAGACT GATAGAATAC CAGAACAGCA ATCCTTCTCC TAATACGACT
AACCGTATCA TTACTACTGT GCTGAATGAC GGACTGGAGA CTGGGCCGGC TTCCACTACC
ACGATCGTAA TATTAACAGA TCCGCGTATC AATGTATCAG GTAATGGTAC TACGATCGCA
GACAATAGTA CGATGGTAAA TGCGACAGAC TGGACAGACT TTGGTAATAC GATTTCAGCT
TCCGTAACAC GTACTTTTAG TATCAGCAAC GTTGGTACCG GTGTTATTAA TTTAACAGGT
ACACCAGCGA TAAGTATTGC TTCCGGAGAT GCAGGGTTTA CAATATCAAC ACAACCAGGT
GTTACCGCGC TTCCTGCTGC ACAAAACACC AGCTTTGTAG TGAACTTTAA TCCGGCAGCA
CATGCTGTTG GTGTGTATAC AGCTGTGATA CGTATTGTGA ATAACGATAC CAATACTGAC
AGAGCCGACT TTACTTTTAC GGTATCAATT ACGGTGAATG GTTTACCAAC GGTAACGAAT
TTTACTGTAA ATGGGACGGA AGATAATACG CTGGCATTTA CTGCGGCAAA CTTTACTTCT
AACTATAGTG ATCCGGATGG CGCACCGCTG AATAGTATCA GAATTACCAG TCTGCCGTTA
AACGGTAGCT TCCGTCTGAA TGGTACTGTG ATCACGGTGG GTCAGGATAT TCCGGCTGCA
CAGCTGGGTA ATATCACCTT TATACCTACC GCTAACTGGT CCGGCACTAC AGGATTTGAC
TGGAGAGCTT ATGATGGTAC TTCCTTCTCA GCAGGGACAT CCCATGTGAC GATCAATATC
CAACCGGCGA ACGATGCGCC GCAGATTACT ACGCCAACCA GTATCGCGGT AACAGAAGAT
ATTCCTGCAT CGCTGAAGGA TATTTCCTTC TCTGATATTG ATGCCGGTAC AGGTGTGGTA
ACAGTTACAT TCTCTGTGCC TAACGGAACG CTGAATGCGA CCTCCGGTGC AGGTGTGACT
GTTAGTGGTA CGCCGGCTGC ATTGATATTA ACGGGTACAA TCGCTGATAT TAATGCCTTC
CTGGTAGCGA ATAGAGTCAA TTATTCCTCT ACCCAGAATC CGCCACCAAC CGTTATACTG
ACGGTAAATA TTTCCGACAA TGGTAATACT GGTGCAGGAG GTGCACAACA GGCTACAGCT
ACCGTACCAC TGGGTATTAC AGCTGTGAAT GATGCACCGA CTGGTACAGG TGATACACGT
ACCACAGCGG AAGATACCCC GGTAAACGGG GCAGTCACGG GTAATGATGT GGATGGCGAT
GTGCTGACTT ATACATTGGG GACGCCTCCT ACAAACGGAA CGGCGACAGT TAATGCTTCT
ACGGGCGCTT ATACTTATAC GCCGAATCCG AATTATAATG GTCCTGATAT GTTTACCGTT
GTGATCAGTG ATGGGCATGG CAGTTCAGTG ACTGTGACGG TGAATATTAC TGTTACAGCT
GTAAATGATG CACCGACCGG AACAGGCGAT ACGCGGACGA CATTGGAAGA TACACCAGTA
AACGGTGCTG TATCTGGAGC AGATGTGGAT GGTGATGCGT TGACTTATAC CTTAGGTACA
GCTCCGACCA ACGGTCTGGC GACAGTAAAT GCTACTACGG GTGCTTATAC TTATACACCG
AATCCAAATT ATAATGGTCC TGATGTGTTT ACCGTTGTGA TCAGTGATGG GCATGGCGGT
TCAGTGACTG TGACGGTGAA TATTACAGTG ACCGCTGTAA ATGATGCACC GACCGGAACA
GGAGATACGC GTACGACACT GGAAGATACG CCAGTAAATG GTGCAGTATC TGGAGCAGAT
GCAGACGGCG ATGCGCTGAC TTATACCTTA GGTACAGCTC CGACCAATGG TCTGGCGACA
GTAAATGCTA CTACGGGTGC TTATACTTAT ACACCGAATC CGAATTATAA TGGTCCGGAT
GCGTTTACCG TTGTTATCAG TGATGGTAAT GGTGGTTCAA TAACTGTGAC GGTGAATATT
ACGGTGACCG CTGTGAATGA TGCGCCGACT GGTACAGGAG ATACACAGAC TACACTGGAA
GATACGCCGG TAAACGGCGC AGTATCTGGT GCAGATGTGG ATGGTGACGC GCTGACTTAT
ACTTTAGGTA CAGCTCCAGC AAACGGTCTG GCGACAGTAA ATGCTACTAC AGGTGCTTAT
ACTTATACAC CGAATCCGAA TTATAATGGT CCGGATGCGT TTACCGTTGT GATCAGTGAT
GGTAATGTTG GTTCAGTAAC TGTGACAGTG AATATTACAG TGACCGCTGT AAATGATGCG
CCGACAGGAA CAGGAGATAC ACAGACTACA CTGGAAGATA CCCCAGTAAA TGGTGCAATA
TCTGGTGCAG ATGTGGATGG CGATGCGCTG ACTTATACGG TAGGTACAGC TCCGACCAAT
GGTCTGGCGA CAGTAAATGC TACTACAGGT GCTTATACTT ATACGCCGAA TCCGAATTAT
AATGGTCCGG ATGTGTTTAC TGTTATTATC AGTGATGGTA ATGGTGGTTC AGCAACGGTG
ACGGTGAATA TTACGGTGAC CGCTGTGAAT GATGCGCCGA CTGGTACAGG AGATACACAG
ACTACACTGG AAGATACGCC GGTAAACGGC GCAGTATCTG GTGCAGATGT GGATGGTGAC
GCGCTGACTT ATACCTTAGG TACAGCTCCG ACCAATGGTC TGGCGACAGT AAATGCTACT
ACGGGTGCTT ATACTTATAC ACCGAATCCA AATTATAATG GTCCGGATGC GTTTACCGTT
GTTATCAGTG ATGGTAATGG TGGTTCAGTA ACTGTGACGG TGAATATTAC TGTTACAGCT
GTAAATGATG TACCAACCGG AGCTGACCAG AACCTGACTA CTCCGGAAGA CACTCCGCTC
AATGGGGCAG TAGTAGGAGC AGATGCAGAT GGCGATGCAT TGACTTATGC GATTGGCACA
ACGACACCAG CAAATGGTAG TGTGACAGTA AATACTGACG GTACATTTAT CTATACACCG
AATGCTGATT ATGTAGGTAC CGATGCCTTC ACCGTAGTAA TCAGCGATGG TAATGGCGGT
ACAGCTACTG TGACGATCAA TATTAATGTT ACGGCAGTAA ACGACGCGCC AACTGGTACA
AACCAGAATC TGACTACTCC TGAAGATACT CCGCTCAATG GGGCAGTACT AGGAGTAGAT
GCAGATGGCG ATGCATTGAC TTATGCGATT GGCACAACGA CACCGACGAA TGGTAGCGTG
ATAGTAAATA CAGATGGTAC ATTTATCTAT ACGCCAAATG CTGACTATGT AGGCACTGAT
GCCTTCACCG TAGTAATCAG CGATGGTAAT GGCGGTACAG CTACTGTGAC GATCAATATT
AATATTACGG CAGTAAACGA CGCGCCAACT GGTGCAAACC AGAACCTGAC TACTCCGGAA
GATACTCCGC TTAATGGAGC GGTAGTAGGA GCAGATGCAG ATGGCGATGC ATTGACTTAT
GCGATTGGCA CAACGACACC AGCAAATGGT AGTGTGACAG TAAATACTGA CGGTACGTTT
ATCTATACAC CGAATGCTGA TTATGTAGGT ACCGATGCCT TCACCGTAGT AATCAGCGAT
GGTAATGGCG GTACAGCTAC TGTGACGATC AATATTAATG TTACGGCAGT AAACGACGCG
CCAACTGGTA CAAACCAGAA TCTGACTACT CCTGAAGATA CTCCGCTCAA TGGGGCAGTA
GTAGGAGTAG ATGCAGATGG CGATGCATTG ACTTATGCGA TTGGCACAAC GACACCAACG
AATGGTAGCG TGATAGTAAA TACAGATGGT ACATTTATCT ATACGCCAAA TGCTGACTAT
GTAGGCACTG ATGCCTTCAT CGTAGTAATC AGCGATGGTA ATGGCGGTAC AGTTACTGTG
ACGATCAATA TTAATATTAC GGCAGTAAAC GACGCGCCAA CTGGTGCAAA CCAGAACCTG
ACTACTCCGG AAGATACTCC GCTTAATGGA GCGGTAGTAG GAGCAGATGC AGACGGCGAT
CCATTGACTT ATGCAATAGG TACAACGACA CCGGCAAATG GTAGTGCGAC TGTAAATGCA
GATGGTACAT TTATCTATAC ACCGAATGCT GATTATGCTG GAGCAGATGC CTTTACCGTA
GTAATTAGTG ATGGTAATGG CGGTACAGCT ACTGTGACGA TCAATATTAA TGTTACAGCA
GTAAACGACG CACCAACTGG TACAAACCAG AATCTGACTA CTCCTGAAGA CACTCCGCTC
AATGGGGCAG TACTAGGCGC AGATGCAGAC GGCGACGCAT TGACTTATGC GATTGGCACC
ACGACACCAG CAAATGGTAG TGTGACAGTA AATGCAGATG GTACATTCAT CTATACACCG
AATGCTGATT ATGTAGGCAC CGATGTCTTT ACCGTAGTAA TTAGCGATGG TAATGGCGGT
ACAGCTACTG TGACGATCAA TATTAATGTT ACAGCAGTAA ACGACGCGCC AACTGGTGCA
AACCAGAACC TGACTACTCC TGAAGATACT CCGCTTAACG GAGCAGTAGT GGGCGCAGAT
GCAGACGGCG ATGCATTAAC TTATGCGATT GGCACAACGA CACCAACGAA TGGTAGTGTG
ACAGTAAATG CAAATGGCAC ATTTGTTTAT ACACCGAATG CAGACTTCAA CGGTACCGAT
GCCTTTACAG TAGTGATCAG TGATGGTAAT GGCGGAACGA CAACAGTAAC AGTTAATATT
ACGGTTACAG CAGTAAACGA CGCGCCAACT GGTGCGAACC AGAACCTGAG TACTCCTGAA
GACACTCCGC TTAATGGAGC GGTAGTAGGC GCAGATGCAG ACGGCGACGC ATTGACTTAT
GCGACTGGCA CTACGACACC AGCAAATGGT AGTGTGACAG TAAATGCAGA TGGTACATTC
ATCTATACAC CAAATGCTGA TTATGTAGGC ACCGATGCCT TTACCGTAGT AATTAGCGAT
GGTAATGGCG GTACAGCTAC TGTGACGATC AATATTAATG TTACAGCAGT AAACGACGCG
CCAACTGGTG CAAACCAGAA CCTGACAACT CCTGAAGACA CTCCGCTTAA TGGAGCGGTA
GTAGGCGCAG ATGCAGACGG CGATCCATTG ACTTATGCGA TTGGCACAAC GACACCAACG
AATGGTAGTG TGACAGTAAA TGCAGATGGC ACATTTGTTT ATACACCGAA TGCAGACTTC
AACGGTACCG ATGCCTTTAC AGTAGTGATC AGCGATGGTA ATGGTGGAAC GACAACAGTA
ACAGTTAATA TTACGGTTAC AGCAGTAAAC GACGCGCCAA CTGGTGCAAA TCAGAACCTG
ACAACTCCTG AAGACACTCC GCTTAATGGA GCGGTAGTAG GAGCAGATGC AGACGGCGAT
GCATTGACTT ATGCGATTGG CACAACGACA CCAGCAAATG GTAGTGTGAC AGTAAATGCA
GATGGTACTT TCATCTATAC ACCAAATGCT GATTATGTAG GCACCGATGC TTTTACCGTA
GTAATTAGCG ATGGGAATGG CGGTACAGCT ACTGTGACGA TCAATATTAA TGTTACAGCA
GTAAACGACG CGCCAACTGG TGCAAACCAG AACCTGACTA CTCCTGAAGA TACTCCGCTT
AACGGAGCAG TAGTAGGCGC AGATGCAGAC GGCGATGCAT TAACTTATGC GATTGGCACA
ACAACACCAA CGAATGGTAG TGTGACAGTA AATGCTGACG GTACATTTAT CTATACACCG
AATGCAGACT TCAACGGTAC CGATGCCTTT ACAGTAGTGA TCAGTGATGG TAATGGCGGA
ACGACAACAG TAACCGTTAA TATTACGGTT ACAGCAGTAA ACGACGCGCC AACTGGTGCA
AACCAAAACC TGACAACTCC TGAAGACACT CCGCTTAATG GAGCGGTTGT AGGAGCAGAT
GCAGACGGCG ACGCATTGAC TTATGCGACT GGCACCACGA CACCAGCAAA TGGTAGTGTG
ACAGTAAATG CAGATGGTAC ATTCATCTAT ACACCAAATG CTGATTATGT AGGCACCGAT
GCCTTTACCG TAGTAATTAG CGATGGGAAT GGCGGAACAG CTACTGTGAC GATCAATATT
AATGTTACAG CAGTAAACGA CGCGCCAACT GGTGCAAACC AGAACCTGAC TACTCCGGAA
GACACTCCAC TTAATGGAGC GGTAGTAGGC GCAGATGCAG ACGGCGATCC ATTGACTTAT
GCGATTGGCA CAACGACACC AACGAATGGT AGTGTGACAG TAAATGCAGA TGGCACATTT
GTTTATACAC CGAATGCAGA CTTCAACGGT ACCGATGCCT TTACAGTAGT GATCAGCGAT
GGTAATGGCG GAACGACAAC AGTAACAGTT AATATTACGG TTACAGCAGT AAACGACGCG
CCAACTGGTG CAAACCAGAA CCTGAGTACT CCTGAAGACA CTCCGCTTAA TGGAGCGTTA
GTAGGAGCAG ATGCAGACGG CGACGCATTG ACTTATGCGA TTGGCACAAC GACACCAGCA
AATGGTAGTG TGACAGTAAA TGCAGATGGT ACATTCATCT ATACACCAAA TGCTGATTAT
GTAGGCACCG ATGTCTTTAC CGTAGTAATT AGCGATGGTA ATGGCGGTAC AGCTACTGTG
ACGATCAATA TTAATGTTAC AGCAGTAAAC GACGCGCCAA CTGGTGCAAA CCAGAACCTG
ACTACTCCTG AAGATACTCC GCTTAACGGA GCAGTAGTAG GCGCAGATGC AGACGGCGAT
GCATTAACTT ATGCGATTGG CACAACAACA CCAACGAATG GTAGTGTGAC AGTAAATGCT
GACGGTACAT TTATCTATAC ACCGAATGCA GACTTCAACG GTGCCGATGC CTTTACAGTA
GTAATCAGCG ATGGTAATGG CGGAACGACA ACTGTAACAG TGAATATTAC GGTTACAGCA
GTAAACGACG CACCAACTGG TACAAATCAG AACCTGACTA CTCCTGAAGA TACTCCGCTT
AATGGAGCGG TAGTAGGAGC AGATGCAGAC GGCGATGCAT TGACTTATGC GATTGGCACA
ACGACACCAG CAAATGGTAG CGTGACCGTA AATGCAGATG GCACATTTGC TTACACACCG
AATGCTGATT ATGTAGGCAC CGATGCCTTC ACAGTAGTAA TCAGCGATGG TAATGGCGGA
ACGACAACTG TAACAGTTAA TATTACGGTT ACAGCAGTAA ACGACGCACC AACTGGTACA
AATCAGAACC TGACTACTCC TGAAGATACT CCGCTTAATG GAGCGGTAGT AGGCGCAGAT
GCAGACGGCG ATCCATTGAC TTATGCAATA GGCACAACGA TACCAGCAAA TGGTAGCGTG
ACAGTAAATG CAGATGGCAC ATTTGTTTAT ACACCGAATG CTGATTATGT AGGAGCAGAT
GCCTTTACAG TAGTGATCAG CGATGGTAAT GGCGGAACGA CAACTGTAAC CGTTAATATT
ACGGTTACAG CAGTAAATGA CGCACCGACC GGCACAGATC TTAATCTGAC CATGGCCGAA
GACAATCCGC TCAACGGAAC AGTAGTAGGA GTAGATGCAG ACAACGATCC ACTGACTTAT
GTTATCGGAG CGACAAATCC TGCAAATGGT ATCGTGACAG TAAACACAGA TGGAACGTTT
GTCTATATTC CGAATCCTAA CTTTAACGGC ATAGATGCCT TTACGGTAGT CATCAGTGAT
GGTAATGGTG GTACAGCTAC CGTAACAGTT AATATTACAG TTACACCAGT AAATGATGCA
CCAACCGGTA CCGGCGATAC CCGTACTACG CCAAGAAATA CACCTGTCAA TGGTGCTGTT
AGCGGAACCG ATGTTGACGG CGACGTATTG ACCTATACCC TGGGCACACC TCCTGCAAAC
GGAACAGCGG TCGTAAATAC TGATGGTACT TATACCTATA CGCCAAATGC AGGTTACAGT
GGTCCGGATA GCTTTACAAT CATCATCAGC GATGGTAATG GCGGTACAGT GACAGTAACG
GTTGATATCA CCGTAACAGC TATTAATAAC GCGCCTACAG GTACCGGCGA CTCGCAGACG
ACACTGGAAG ACACTCCTGT AAACGGAGCT GTGACCGGAA ACGACGCAGA TGGGGATCCA
TTGACTTATG TCCTGGGTAC TGCCCCTGCA AATGGCGGCG TGATCGTTAA TGCTGATGGA
ACTTATATCT ATACACCAAA TGCGAACTAT AATGGTCCGG ATAACTTCAC AATCCTGATC
AGCGATGGTA ATGGTGGTTC TATCACCGTA CCGGTAAGTA TCATTGTTAC ACCTGTAAAT
GATGCGCCAA CCGGCACCGG CGATACGCAG TCTACTCCGT TGAATACGCC GGTAAGCGGC
GCCGTAAGCG GTACGGATGT AGATGGGGAT GTCCTGACTT ATACATTGGG TACACCTCCG
GCAAATGGTA CTGCCATTGT AAATGCAGAT GGTTCTTATA CCTATACACC GAATACCGGT
TATGCCGGCC CGGATAGCTT CACAATCATT ATCAGCGATG GTAATGGTGG TACAGTCACA
GTAACCGTAG ATATCATGAT GATTACCGTG GTGCCTAATC CGGCTATTGC ACTGGTGAAA
GTAGGTGCAA GGGATAAAAA TGATATCACT TACATTTTCC ATGTTACTAA CACGGGTAAT
GTTCCGTTGC ACAACATCGT GGTATCTGAT CCGCAGATGG GTGTGACCAA GAATTATAGT
GGTACACTGC AACCTGGCGC CTCTGTAACG ATGACAGCGG TATATCATAT CACCCAACAG
GATAAGGAAG CCGGCAGCGT TACAAATACA GCAACGGTGA CTGGTCTGAC GCCTGCTGAT
GCAACGGTTA CGGATATCTC CGGTACTTCG TTGAACAACG ATACGCCTAC TGAGACAACT
GTTCCGGCTC CACCGCAGGC AAATGATGAT CAGGCTGAGA CGCGCGCAAG TATTGCTGTT
GTGATTCCGG TACTGGATAA TGATGATCCG GTAGAATCAA CCTTCAGTAT TCCGAGCTTA
ACAATCACCA GCATTCCTAA ACATGGTCAG GTGACTATCA ACGAAGATGG TTCTATCCGT
TATATGCCGG ACAACGGTTA TACCGGAGAA GATGACTTCA GTTACCAGGT CAGCGATGTA
GATGGTTACA TCACTAACAT TGCTGTGGTG AAGATCACTA TCGTAGAAAC AGATCTGAGA
ATACCGCCAT TGTTTACACC AAATGGTGAC GGTAAGAATG ATGTGTTTGA AATACGCGGA
CTTAACAAGT ATGTAGAAAA TGAGTTAATA ATGGTCAATC GTTGGGGTAA TGAAGTATAC
AGACAGAAGA ATTATCAGAA TACCTGGAGC GGCAATGGAT TGAACGACGG TACTTACTAT
TACCTGTTGC GTGTTAAAAA GGCCAACGGT GACTGGGAAG TAGTGAAAGG ATACACGACG
ATTATCCGAA AAATGAAAGA TTAA
 
Protein sequence
MKTRIVNVLS SLVLIFCSMV TFGQMPFSPC NPSGGKKGDI LGIETRYRAP GGAAPVFNIP 
AGTRSITVYI SSETGITTML DNPQGDEDFM TVNAIIDLTS NTSSGYVNFA KNTFVDGSGT
NLYGWQKVPL GAYIPNGSKL GDATPNLNNV NFTVSGSTLT ITESANTIHS SYYVEYVSPY
NNSINPLDPQ VRALLHGTGT ANTDLTIPIP TGANLICISG KGTNSSAVDL NTSAGTEEGY
SNLRVTIDMD AGYTDGFVTL ANGGSVDRRS TYVINNLASS STMNFLSSAA ITGDYTSKLT
TAGAVGVYNP QIYVSGTNLV IRRDANYARD FDDAYVVEFY HRVGQGMSAE FINSDIQPIP
KGVSSTTGIS RTFNIPPGTN AIYFNETGNA CNTDRESNEN SIAAYAYIDL NTETATGYFY
QQVGLDGVNR RDDNFAFKGV SLNGSSARAH ASTVGFKGPN AYDIVFTLSA DKTQLTVTNR
TGLANPDYQF LLSMDYYGAR PDVAFDASSI ALAKGPSCNV IRATFNVCNP GAGNSSGGMP
VSFYHGDPTT DPAAVLLYTG AFGSGLLEGE CKVFTYDVPM NGFDDLNVPM TIVINDNGSF
VTGGVGTAVG TPFTLASLVN QNSLYKECNY DNNLITRVLN VNNCPVPNLD ADNSSGAVGR
YNYLNYFNAG TPGGVKINDA DLAVVDPGGT TIASATITLT NRLDGAAESV FINGTLPAGI
TATGSGTGTI VLSGVASQAA YVAALRLIEY QNSNPSPNTT NRIITTVLND GLETGPASTT
TIVILTDPRI NVSGNGTTIA DNSTMVNATD WTDFGNTISA SVTRTFSISN VGTGVINLTG
TPAISIASGD AGFTISTQPG VTALPAAQNT SFVVNFNPAA HAVGVYTAVI RIVNNDTNTD
RADFTFTVSI TVNGLPTVTN FTVNGTEDNT LAFTAANFTS NYSDPDGAPL NSIRITSLPL
NGSFRLNGTV ITVGQDIPAA QLGNITFIPT ANWSGTTGFD WRAYDGTSFS AGTSHVTINI
QPANDAPQIT TPTSIAVTED IPASLKDISF SDIDAGTGVV TVTFSVPNGT LNATSGAGVT
VSGTPAALIL TGTIADINAF LVANRVNYSS TQNPPPTVIL TVNISDNGNT GAGGAQQATA
TVPLGITAVN DAPTGTGDTR TTAEDTPVNG AVTGNDVDGD VLTYTLGTPP TNGTATVNAS
TGAYTYTPNP NYNGPDMFTV VISDGHGSSV TVTVNITVTA VNDAPTGTGD TRTTLEDTPV
NGAVSGADVD GDALTYTLGT APTNGLATVN ATTGAYTYTP NPNYNGPDVF TVVISDGHGG
SVTVTVNITV TAVNDAPTGT GDTRTTLEDT PVNGAVSGAD ADGDALTYTL GTAPTNGLAT
VNATTGAYTY TPNPNYNGPD AFTVVISDGN GGSITVTVNI TVTAVNDAPT GTGDTQTTLE
DTPVNGAVSG ADVDGDALTY TLGTAPANGL ATVNATTGAY TYTPNPNYNG PDAFTVVISD
GNVGSVTVTV NITVTAVNDA PTGTGDTQTT LEDTPVNGAI SGADVDGDAL TYTVGTAPTN
GLATVNATTG AYTYTPNPNY NGPDVFTVII SDGNGGSATV TVNITVTAVN DAPTGTGDTQ
TTLEDTPVNG AVSGADVDGD ALTYTLGTAP TNGLATVNAT TGAYTYTPNP NYNGPDAFTV
VISDGNGGSV TVTVNITVTA VNDVPTGADQ NLTTPEDTPL NGAVVGADAD GDALTYAIGT
TTPANGSVTV NTDGTFIYTP NADYVGTDAF TVVISDGNGG TATVTININV TAVNDAPTGT
NQNLTTPEDT PLNGAVLGVD ADGDALTYAI GTTTPTNGSV IVNTDGTFIY TPNADYVGTD
AFTVVISDGN GGTATVTINI NITAVNDAPT GANQNLTTPE DTPLNGAVVG ADADGDALTY
AIGTTTPANG SVTVNTDGTF IYTPNADYVG TDAFTVVISD GNGGTATVTI NINVTAVNDA
PTGTNQNLTT PEDTPLNGAV VGVDADGDAL TYAIGTTTPT NGSVIVNTDG TFIYTPNADY
VGTDAFIVVI SDGNGGTVTV TININITAVN DAPTGANQNL TTPEDTPLNG AVVGADADGD
PLTYAIGTTT PANGSATVNA DGTFIYTPNA DYAGADAFTV VISDGNGGTA TVTININVTA
VNDAPTGTNQ NLTTPEDTPL NGAVLGADAD GDALTYAIGT TTPANGSVTV NADGTFIYTP
NADYVGTDVF TVVISDGNGG TATVTININV TAVNDAPTGA NQNLTTPEDT PLNGAVVGAD
ADGDALTYAI GTTTPTNGSV TVNANGTFVY TPNADFNGTD AFTVVISDGN GGTTTVTVNI
TVTAVNDAPT GANQNLSTPE DTPLNGAVVG ADADGDALTY ATGTTTPANG SVTVNADGTF
IYTPNADYVG TDAFTVVISD GNGGTATVTI NINVTAVNDA PTGANQNLTT PEDTPLNGAV
VGADADGDPL TYAIGTTTPT NGSVTVNADG TFVYTPNADF NGTDAFTVVI SDGNGGTTTV
TVNITVTAVN DAPTGANQNL TTPEDTPLNG AVVGADADGD ALTYAIGTTT PANGSVTVNA
DGTFIYTPNA DYVGTDAFTV VISDGNGGTA TVTININVTA VNDAPTGANQ NLTTPEDTPL
NGAVVGADAD GDALTYAIGT TTPTNGSVTV NADGTFIYTP NADFNGTDAF TVVISDGNGG
TTTVTVNITV TAVNDAPTGA NQNLTTPEDT PLNGAVVGAD ADGDALTYAT GTTTPANGSV
TVNADGTFIY TPNADYVGTD AFTVVISDGN GGTATVTINI NVTAVNDAPT GANQNLTTPE
DTPLNGAVVG ADADGDPLTY AIGTTTPTNG SVTVNADGTF VYTPNADFNG TDAFTVVISD
GNGGTTTVTV NITVTAVNDA PTGANQNLST PEDTPLNGAL VGADADGDAL TYAIGTTTPA
NGSVTVNADG TFIYTPNADY VGTDVFTVVI SDGNGGTATV TININVTAVN DAPTGANQNL
TTPEDTPLNG AVVGADADGD ALTYAIGTTT PTNGSVTVNA DGTFIYTPNA DFNGADAFTV
VISDGNGGTT TVTVNITVTA VNDAPTGTNQ NLTTPEDTPL NGAVVGADAD GDALTYAIGT
TTPANGSVTV NADGTFAYTP NADYVGTDAF TVVISDGNGG TTTVTVNITV TAVNDAPTGT
NQNLTTPEDT PLNGAVVGAD ADGDPLTYAI GTTIPANGSV TVNADGTFVY TPNADYVGAD
AFTVVISDGN GGTTTVTVNI TVTAVNDAPT GTDLNLTMAE DNPLNGTVVG VDADNDPLTY
VIGATNPANG IVTVNTDGTF VYIPNPNFNG IDAFTVVISD GNGGTATVTV NITVTPVNDA
PTGTGDTRTT PRNTPVNGAV SGTDVDGDVL TYTLGTPPAN GTAVVNTDGT YTYTPNAGYS
GPDSFTIIIS DGNGGTVTVT VDITVTAINN APTGTGDSQT TLEDTPVNGA VTGNDADGDP
LTYVLGTAPA NGGVIVNADG TYIYTPNANY NGPDNFTILI SDGNGGSITV PVSIIVTPVN
DAPTGTGDTQ STPLNTPVSG AVSGTDVDGD VLTYTLGTPP ANGTAIVNAD GSYTYTPNTG
YAGPDSFTII ISDGNGGTVT VTVDIMMITV VPNPAIALVK VGARDKNDIT YIFHVTNTGN
VPLHNIVVSD PQMGVTKNYS GTLQPGASVT MTAVYHITQQ DKEAGSVTNT ATVTGLTPAD
ATVTDISGTS LNNDTPTETT VPAPPQANDD QAETRASIAV VIPVLDNDDP VESTFSIPSL
TITSIPKHGQ VTINEDGSIR YMPDNGYTGE DDFSYQVSDV DGYITNIAVV KITIVETDLR
IPPLFTPNGD GKNDVFEIRG LNKYVENELI MVNRWGNEVY RQKNYQNTWS GNGLNDGTYY
YLLRVKKANG DWEVVKGYTT IIRKMKD
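
Both sequences above are printed in blocks of 10 characters separated by spaces. As a minimal sketch of how the gene sequence relates to the protein sequence, the snippet below joins the blocks of the first gene-sequence line and translates them codon by codon. The function names are hypothetical. The codon lookup assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons), so a plain standard-code table is used here.

```python
# First line of the gene sequence, copied as displayed (10-base blocks).
first_line = "ATGAAAACAC GTATCGTAAA CGTTTTATCA AGCCTGGTCT TGATATTCTG TTCAATGGTG"

# Standard codon table in TCAG order; "*" marks a stop codon.
# Assumption: amino-acid assignments match translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def join_blocks(text: str) -> str:
    """Remove all whitespace so the blocks form one contiguous sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the start of dna; ignore a trailing partial codon."""
    protein = []
    usable = len(dna) - len(dna) % 3
    for i in range(0, usable, 3):
        c1, c2, c3 = dna[i], dna[i + 1], dna[i + 2]
        index = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


dna = join_blocks(first_line)
print(len(dna))        # 60
print(translate(dna))  # MKTRIVNVLSSLVLIFCSMV
```

The 20 residues produced match the first two blocks of the protein sequence (MKTRIVNVLS SLVLIFCSMV).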