Gene Cpin_5094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5094 
Symbol 
ID8361270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6344867 
End bp6356908 
Gene Length12042 bp 
Protein Length4013 aa 
Translation table11 
GC content49% 
IMG OID644967242 
Productconserved repeat domain protein 
Protein accessionYP_003124727 
Protein GI256424074 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGGC TTTTATTTAC ATTCCTATTA GTGGCTTTTT TAAGTCCGCT GGCCATTGCA 
CAGACCTGTA ACAAAGATCT GGCATTAGGC AAAGCAGTTA CCACCTCCTC CGTTACGGCA
GGCAACATTC CCACAAGGGC AACCGATGGC GATCCCACTT CCCGCTGGGA AAGTGCCTGG
AGCAACAACC AGTGGATCAT GGTGGACCTT GGGCAAAGCT ATCCCATTTG TACCATTACA
CTGAACTGGC AGGTACTGGC CACCGGTTAT ACGGTAGAAG TATCTCCGGA CGGCACATCC
TGGACGAACC TTGTCACTGA GACTGCCAAC GCAGGGCTTA ACAAGACCTA CAGCGTATCT
GCCAACGGAC GTTATGTACG TATGACAGGT CTCACCAGGG GTTCTGCTTA TGGCTTCTCT
CTCTATGACT TTATAGTGAT CGGAACTGAA CCGATCAACT ATTGTTCAAA TACCAACGTA
GCGCAGCTAC GTCCTGTTAC TGTTTCCTCT GTAGTAAATG GCAATACCGG CAATTTCGCG
GTGGATAACA ACATGGGCAC CCGCTGGGAA AGTGCGCATA GCGACAACCA GTCTATGTAT
GTAGACCTGG GCGCCAACTA TGACCTTTGC AGGGTGGTAC TGAACTGGGA AGCGGCGTAT
GGCAGAGACT ATACGATCGA TATTTCCAGC AATGGTACTG CCTGGACCAC TGTAAAAACA
GTGACGGGCA ACTATATGAG AGACAATACG CTCGACGTCA GTGGTAATGC CAGGTATGTA
CGTATGACTG GTCAAACCCG TGGAAGTACC TATGGTTTCT CTCTATGGGA ATTTTCAGTA
TATGCTTTAC TACCAGCGGT CAGTGTCACT AAAGTCACTG ACGCCGCAGA ACCGGGAACA
GGCGGCAGCT TCAGATTTAG TCTGCCGTCA GGTATCACCT TCACAGAAGA TATCACCGTT
AACTATAGCA CCTCCGGTAC AGCATCGAGC GGTACGGACT ACACAGCACT GGCCGGCTCA
GTAGTCATCC CCGCAGGGCA AGCAGGTGTC AACGTTCCAC TTACCGTGAT CGATGATCAG
ATTATTGAAG GCAACGAAAC AGTGATAGCT ACTATCACCA ATGCGATCTC CACCACGCGT
CCTGGTTTTC CGATCAGTGC AACTGATCCT AGTGCGACGA TGACGATTAC GGACAATGAT
AATATTGCTG CGAATAAGAT CCTGAGCATT GCCCCAGGTG GTAATGGTGC AGAACCGTCT
ACCAATGGCA GCTACACCAT CAGCCTACCG GCCGGTGTTA CTTCTGCCCA AAGTATCACT
GTCAACTATA CGACCAGTGG AACTGCTACA GCCGGTGCGG ATTATACGCT CCCTGCTACC
TTCACCCTTC CCGCAGGGCA GAACAGTGCA ACGGTTACAC TGACAGTGTC GAACGATCAG
ATAATTGAAG GTACCGAAAC ATCGACGGTA ACCATTACCG GTGGTACAAC CGCTACACTC
GGCGCTTTTA CGGCGAATCT AAGTAATGCT ACAGCCTCAG TGAATATTCT TGATGATGAT
AACAACTGGA CCAATAAAAG CATCAGGATC GCCAGAACGG CACATGCTGC TGAGCCATCT
ACTAATGGCG GCTTCAGAAT ATACCTGCCG ACCGGTATCA CTGCTTCTGA AGACGTCACC
GTGACTGCCG GCGTTACCGG TACTGCTACC CCGGGAACCG ACTATGTTGT ATTAGGCTCC
ACATTTGTTA TTCCGGCTGG CCAAAACTCG GTTGCAGTTC CGGTACAGGT GATCGATGAC
AACGAGATAG AAAGTACAGA AACAGTGACC GTATCACCAA CTGCGGCATC CAGTGCAACC
TTGGGTGGCT TCGCAATCGT TTTTGGCACA ACAGCAAACA TCAACATTGC AGATAATGAT
GACATTCCAG CCAACAGACG ACTGAGCGTC ACCAAAACCA TAGATGCAAT GGAAGGCGCC
AGCAATGGTA ACTTCCGGAT CAGTCTTCCG GCAGGCGTCA GTTATCCGGC AGCTATTACT
GTCAACTACA CTATCAGTGG TACCGCTGTC GGCGGAACAG ATTATACCGC TATTCCAGCT
ACCGCTACCA TCCCTGCCGG ACAAAACGGT GTGGATGTAG CGGTAACCGC ACAGCCTTAC
AATAACAATA TCATTGAGAA TGACAGGACG GTAATAATGA CACTGACCGG TGGTTCCGCT
CCTTCTTTCA GTGCATTTAC ACCTGACCCG GCGAATACTG CAGCTACGGT GATCATCGCT
GATGATGACA GTAATGCCAC TAACAAAAGG CTTTCTGTTT ACAACATTGG TATGGATGCT
GCCGAGCCTT CCACCGCCGG TGCTTTCCAG ATAAGACTCC CGGGTACATT ACGTGCTTCA
GAGCCAATCA CGGTTAGCTA TACTATAACG CCATCAAGTA GTGCAGGGAC GACATCCGCA
ACACCCGGCA CTGACTTTAA CAACTTAACA GGTACTGTTG TCATTCCCGC AGGACAGAAC
GCTGTTAACG TTCCCTATTC ACCTATCGAT GATCAGATCA TTGAGACCAG AGAGGAGTTC
ACCGTACAAC TTACAGGTGG TACAACACCT ACTTTGGGTG CATTTGGTGT GGACGTAGCT
ACGGGTGTTA TGGCCATTGA TGACAACGAT AATATTCCGG CAAACAAAAC ATTCACCATC
ACGCCGACAG CACCTGCAGC TGAATCATCC ACAAACGGAT CTTTCAGTGT GAGTCTGCCG
GCAGGTTATA CCGTTTCGCA GAATGTTACC TTCTACTATA CTGTCGGAGG GACCGCTACT
CCTGGCAGCG ATTATACAGG TATAGGTACG TCTGTTATCT TAACTGCCGG ACAAAACAGC
GTGCAGATCA CCGTACCGGT AATAGATGAT AACATTATTG AAGGCACGGA GTCAATCGAC
GTCACGGCAA CTTCCGGCGC ATCTGCAGAT ATTACCGGAT TTACGAGCAG CGGCACTGCA
ACAAATACTA TTGCAGATGA TGATAATACT GCTACAAATA AAGTGATCAG TATTATTGCT
GCTAATAATG GTGCTGAGCC AGCTACAGAT GCAACATTTA CAATCAGTCT GCCGGCTGGC
ATCACGTCCG CCACTCCTGT AACCGTAAAC TTCTCTGTCT CCGGTACGGC TACCTCCGGC
GCTGACTATA CCGCCTTAGC TACTTCCATC GTCATTCCTG CTAATCAAAA CAGTGTAACA
TTGACTGTTC CGGTACTTGA TGACCAGGCG ATCGAGGGTA CAGAAACAGT TATCGTTACT
GTAACCGGTG GCACTGCACC GCTAGCAGGT ACATTTACGG CAAGTACGAC CAATGCGACA
GCTACCGTGA ATATCGCCGA TGATGATAAT ACCCCTGCCA ACAGGGTGAT CAGCATCGTG
AATGCAAACA ATGGCAGCGA GCCTGCTACT AATGGCGCAT TTACAATCAA TCTGCCGACA
GGTGTCACAC TTACTGAAGA CGTAACTGTT AACTTCTCCG TAGCTGGTAC TGCGACACCA
GGTAATGACT ATACTTCACC GGGTACTTCC ACTATTATCC CGGCGGGACA AAACAGCGTA
ACACTGACTG TTCACGTACT GGATGACCAG CTCATAGAAA CAACAGAAAC AGTTATTGTT
ACCATCACCG GCGGATCGGC AACAACAGCA GGTACGTTTA TAGCGGGTAC CAGTAATGTT
GCCACTGTTG ATATTACGGA CAATGATAAC ACGCCTGCGA ACAAAACGAT CAGTATCACT
ACTGCGAATG ATGGTTCCGA GCCATCCAGC AATGGTGCAT TTAATGTTAG TCTGCCGACA
GGTATAACAA ATACTGAAGA CATAACCGTC AACTTTACTG TCGCAGGTAC AGCGACGGCA
GGAACTGACT ATACAGGCTT AGGTACTTCC GTTACGATTC CTGCCGGACA AAACAGCGTT
ACCCTCACTG TTCCCATACT GGATGATCAA CTGATTGAGG GTACTGAAAC AGTTATTGTG
ACAGTTACAG GTGGTTCTGC TACTGCGGCG GGAGCATTTA GCGCAAGTGC TACCAATGCT
ACTGCCACAG TGAACATCAA CGACGATGAT AATAATGCTG CCAATAAGGT GATCAGCATC
GTTGCTGCGA ACGATGGTTC CGAACCTGCT ATAAATGCTG CATTTACTAT CAGTCTGCCG
ACTGGTGTAA CCGTTAATGA ACCTGTTACT GTCAACTTCA CCGTGGCGGG TTCTGCTACT
TCCAGTACAG ACTATACGGC AATAGGAACT TCCATCATTA TCCCGGCGGG ACAGAACAAC
GTAACACTAA CTGTACCAGT ACTGGACGAT CAGCTGATAG AGGGAACTGA AACAGTTATT
GTGACGGTTA CAGGTGGTGC CGCTACTGCC GCCGGTGTAT TTGCTGCAAG CACAACCAAT
GCTGCTGCGA CTGTAAACAT CAGTGACAAC GATGATGTGG CCGCCAATAA AGTGATCAGC
ATTGCAACTG ACAATGACGG CACAGAACCT GGGACAGCTG GTGCGTTTAC CATCAGTTTA
CCAGCTGGTA TAACGGCAGC AGAACCTATC ACAGTGAATT TCACTGTAGC GGGTACAGCG
ACTGCGGGTA CGGACTACAC CGCATTCGGT ACTTCCACTA TCATCCCGGC GGGTCAGAAT
AGCGTAACGC TGACTGTTAC GGTATTGGAT GATCAGATTA TAGAGTCTAC AGAAACGGTT
ATTGTAACCG TAACAGGTGG CACTGCTACC ACTGCTGGTG CATTTACCGC CGGTACGTCC
AATACGGCTA CGGTTAACAT CAATGACAAC GATAATACTA CTACAAATAA AGTCATCACG
ATCGCTGCTG CCAATGACGG TACAGAGCCG GGTACTAATG CTGCATTTAC TGTTAGCCTC
CCGGCTGGTG TAACAGTCGA TGAAGATGTA ACTGTCAATT TCTCTGTCGC TGGTACAGCT
ACTGCAGGTA CTGACTATAC CGCAATCGGT ACGGCTGTTA TGATCCTTGC AGGACAGAAC
AGCGCTACAC TCGCTGTTCC TGTATTGGAT GATCAGGAAA TAGAAGCGAC TGAAACAGTT
ATTGTAACCG TAACAGGTGG TGCCGCTACG AATGCAGGTG CATTTACTGC CAGCATTGCT
AATGCGACCG CTACTGTCAA CATCAATGAC GATGACAATA CGGCTGCTAA TAAAGTCATC
AGTATTACTA CTGCTAATAA CGGCGATGAA CCTTCCAATA ACGGTGCATT CACGATCAGT
CTGCCGACTG GCATTACTGT CAATGAAGAC GTCACTGTTA ACTTTACAGT GACCGGTACC
GCTACTGCGG GTACTGATTA CACTGCCCTG GGTACTTCCA TCATCATCCC GGCAGGTCAG
AACAGCGTAA CGCTCACTGT TCCGGTACTG GATGACCAGA TCATTGAAGG AACTGAAGCA
GTTACTGTTA CTATAACAGG TGCTGCCGCT ACGAATGCCG GTGCATTTAC TGCTGGTACT
AACAATGTAG CTACTGTGAA CATCAACGAT GATGACAACA CGGCTATTAA TAAAGTGATC
AGCATTGCCG CTGCGAATGA TGGTACGGAG CCTGCTACCA ATGCAGCCTT TACAATCAGT
CTTCCGGCTG GTGTAACGGT AAATGAAGAT GTCACAGTCA ACTTCACCGT CGCTGGTACG
GCTACATCAG GTGCTGACTA TACAACAATC GGTACATCTG CTACCATCCT TGCTGGACAG
AACAGTGTAA TACTGACTGT CCCTGTACTG GATGACCAGA TTATCGAAGC AGCAGAAACG
GTTATCTTGA CTGTCACAGG TGGTGCCGCT ACGAATACAG GTGCATTTAC GGCAAACCTG
ATCAATGCTA CTGCTACCGT CAATATCAAC GACGACGATA ATACTGCCAC TAATAAAGTT
CTCAGTATCG CTGCTGCGAA TGACGGTACA GAGCCGGGTA ATAATGCTGC ATTTACTGTC
AGCCTCCCGA CAGGTGTTAC GGTTGATGAA GATGTTACAG TCAATTTCTC TTTGGCGGGT
ACAGCTACTG CAGGTACTGA CTACACCGCA ATCGGTAGCG CTGTTACGAT CCTCGCTGGA
CAGAACAGCG CTACACTCAC AGTTCCGGTA CTCGATGATC AGGAAATAGA AGCGACTGAA
ACAGTGATCG TGACGGTAAC AGGTGGATCC GCTACGAATG CGGGTGCATT TACGGCCAGT
GTAACGAATA CGACCGCTAC TGTCAACATC AGTGATGACG ACAATACTGC GGCCAATAAA
GTCATCAGTA TTACTACTGC TAATAATGGC GATGAGCCTT CCAACAATGG CGCATTCACT
GTCAGTCTGC CGACTGGCAT TACTGTCAAT GAAGACGTCA CTGTTAACTT TACAGTGACC
GGTACCGCTA CTGCGGGTAC TGACTACACT GCCCTGGGTA CTTCCATCAT CATCCCGGCG
GGACAGAACA GCGTAACGCT GGCAGTGCCT GTACTGGATG ACCAGATCAT TGAAGGAACT
GAATCCGTTA CTGTTACAAT AACAGGCGGT TCGGCTACAG CTGCCGGTGC ATTTACCGCT
GGTACCAACA ATGTAGCTAC TGTGAACATC AATGATGATG ACAACACTGC TACTAACAAA
GTGATCAGCA TTGCTGCTGC GAATGATGGT ACCGAGCCTG CCACTAACGC AGCCTTTACA
ATCAGTCTGC CAGCCGGTAT AACAGTGAAT GAAGACCTCT CTGTCAACTT TACCATCGCT
GGTACGGCTA CATCAGGTGC TGATTATACT CCAATCGGTA CAACTGCTAC CATCCTCGCA
GGACAGAACA GCTTAACACT GACTGTCCCT GTACTGGATG ACCAGATCAT CGAAGCAGCA
GAAACAGTCA TCTTGACTAT AATAGACGGT ACGGCTACAG TTGCGGGTAC CTTTACTGCA
GGCACGGCAA ATACCGCTAC CGTTAACATC AACGATAATG ACAATACCGC TGCCAATAAA
ATCATCAGTA TCGCTGCTGC GAATGATGGC GCAGAACCAG CGGGCAACGC AGCCTTCACT
ATCAGTCTGC CGACAGGTGT AACTGTAGAT GAAGACCTCA CAGTCAACTT TACGGTGGCT
GGTACAGCTA CTGCGGGTAC CGACTACATA TCTCTCGGAA CCTCCATCGT CATTCCAGCT
GGACAAAACA GCGTAACACT CAATGTCCCT ATCCAGGATG ACCAGGTTAT AGAAGCTACA
GAAACCGTTA TTGTAACCGT AACAGGTGGT TCCGCCACTA ATGCGGGTGC ATTTACGGCA
AGTGCGACCA ATTCAACCGC TACCATCAAT ATCATTGACA ACGACAATAC AGCGGCTAAT
AAAGTCATTA GTATTGTTGC TGCGAATGAT GGTTCCGAGC CTGCTACCAA TGCTGCATTC
ACTGTCAGCC TTCCTACTGG TGTGACGTCA GATGAAGTTG TCACTGTCAA CTTCACCGTA
GCAGGTACAG CTACCGCGGG TACTGACTAC ACAGCTTTAG GTACGTCCAT TATCATTCCG
GCAGGACAGA ACAGCGTAAT GCTGACCGTG CCTGTACTCG ATGATCAGAT TATTGAATCA
ACAGAAACCG TCACGGTTAC CATAACAGCG GGTGCTGCTA CCAACGCAGG TGCATTTACC
GCAGGTACAA GCAACACTGC TACTGTCAAC ATCAGCGATG ATGATAACAT TCCTTCTAAT
AAGGTAATCA GCATTACCGC TGCGAATGAT GGCGCTGAGC CGGCAAGCAA CGCTGCTTTC
ACTATCAGTC TGCCAACAGG TGTAACGGCG AATGAAGCGG TCACTGTCAA CTTCACCGTG
GCAGGTACAG CTACTGCTGG TACGGACTAC ACGACGCTGG GGACATCGGT GATTATCCCG
GCTGGACAGA ACAGCGTAAG CCTCAATGTA CCGGTCCTGG ATGACCAGGT AATAGAAGCT
ACAGAAACAG TCATTGTTAC CGTTACAGGT GGTTCCGCTA CTGCGGCTGG TGCATTTACG
GCCAGCGCGA CGAATGCGAC GAGCACCGTT GACATCACCG ATAATGACAA CACCGCTGCC
AATAAAGTGA TCAGCATTAC CCGCACTGCC GACGGCTCTG AGCCTTCCAG CAACGGGGCA
TTCAGTATCA GCCTGCCGGC GGGTGTAAGT ATCAGCGAAG ACCTGAACAT TATTTACAAC
ATAGCGGGAT CCGCAATCAA TGGAACTGAC TACGGCGCCT TAAACGGTAC GATCATATTA
CCGGCTGGAC AGAACAGCAT CTCTCTGCCT GTGATAGTAA CTGACGATGA CATCATAGAA
GGCGCTGAAA GCGTTGTTGT AAGTATCGCC AATGCTGCTG CCACAACGAT CACCGGATTC
ACTGTCAGCA TCACTAATAG CTCTGCTACG GTGATCATCG CTGACGAAGA CAATACAACT
GCCAACAGGG TCATCAGCAT CAATCCTGTA ACAAACGGAT CAGAACCTGC TACAAATGGT
AGCTTCGCGG TGAGCTTGCC GACAGGTATC ACAGCTGCTG AAGACATTAC TGTCAACTAC
ATCGTAGCGG GTACAGCTAC TGTCGGTACT GACTATACCG CGCTGGGTAC TTCAATAATC
ATCCCGGCTG GCCAGAACAG TGTGACGCTG ACGGTGCCGG TGCTGGATGA TCAGATCATT
GAAGCAACAG AAACTGTCAT CGTAACTGTT ACAGGTGGCA CAACGATCAG TATTGGTGCC
TTTACTGCCA GCGCAACAAA TGCGACAACC ACCGTCAATA TCAACGATAA CGATAATACC
CCTGCTAATA AAGTGATCAG TATTGTAGCT GCGAATGATG GGGCTGAGCC GTCTACTAAC
GGGGCGTTTA CAATCAGTCT GCCAACAGGT GTAACTGTCA ACGAAGCGGT CACTGTCAAC
TTTACAACTG CGGGTACCGC TACTGCTGGT CCTGACTATA CCACACTGGC TACCACTATC
ACCATTCCGG CGGGTCAGAA CAGCGTAACG CTGAGTGTTC CGGTACTGGA TGACCAGATC
ATTGAGTCCG CAGAAACAGT GATCGCTACA ATCACCGGTG GTACGGCTAC TGCGGCGGGT
ACCTTTACAG CCAGTGCCAC CAATGCAACG GCTACCATCA ACATTACCGA TAACGATAAC
ACTGCTGCCA ATAAAGTCAT CAGTATCGTT GCTGTGAATG ATGGTGCGGA GCCGTCTACT
AACGGGGCGT TTACAATCAG TCTGCCGGCG GGTGTAAGCG TCGATGAAGA TGTCACCGTC
AACTTCACAG TGATCGGTAC TGCTACCGCT GGCGCTGACT ATACCGCCCT GGGCGCGTCA
GTGATCATCC CTGCCGGACA AAACAGCATC ACACTCGCTG TTCCGGTGAG AGATGACCAG
GCTATTGAAG CCACAGAAAC TGTGATCGTT ACTATCACTA GTGGTACAGC TGCCAACGCC
GGCGCATTCA CTGCCGGTAC AAACAATACA GCGACTGTGA ACATCAGCGA TGATGATAAC
ACGCCTGCCA ATAAAGTGAT CAGTATCGTT GCTGCAAATG ATGGTGCAGA ACCATCTACT
AACGGGGCGT TTACAATCAG TCTGCCGACA GGTGTAAGCG TTAATGAAGA TGTAACGGTG
AACTTTACCG TTGCCGGTAC GGCTACTGCC GGCGCTGACT ACAGCAGCTT AGGCACTACG
GTGATCATCC CTGCAGGACA AAACAGTGCT ATACTGACGG TCCCGGTACT TGATGATCAG
GTCATTGAAT CAACAGAAAC CGTGATCGTT ACTGTAACAG GTGGTACAGG TATTACTACA
GGCGCCTTCA CCGTCAGCAC TTCCGATGCA AGTGCAGCGG CAAATATCAG CGACGACGAC
AATACAGTGG CCAACAGGAT CATGAATGTC ATCAACACGA ATGATGGCAG CGAGCCTGCT
ACGAATGGTA ATTTCTCGAT CAGTCTGCCT GCGGGTATTA TTACTGCCGA AGACATCATC
GTTAGCTACA CTTTAGCGGG TACTGCTACT ACGGTTACAG ATTATACCAC CAGCGGTACA
AACGTCACCA TTCCTGCTGG CCAAAACAGC GTAACGTTGC CGATCGTAGT CAACGACGAC
CTGCTGATGG AAGGAGATGA AACGGTGGTT ATTACCATCA CAGGGGGAAT GACTGCTACA
CTGGGTACTT ATACCCCGGG TACTGCCAGC GCTACCCTGA TCATTGCTGA TGATGAAAAT
ACGGTTACTA ATAAAGTGCT CAGTATCTCC AGGAATGCAG ATGCAGCGGA ACCTGCTACT
AATGGCAGCT TCACGGTAAG TCTGCCGGCC GGTATCAGCG CCACTGAAAA CATCAGCGTA
AGTTATACCA TCAGTGGTAC TGCTACAGCG GGTACTGATT ATGCGACGCT CAGCGGATCA
GTGATCATCC CTGCAGGACA GAACAGTGCG AACATCGATA TGATGGTAAA TGACGATCAG
CTCATTGAGG GGAATGAAAC CGTGATTGCC ACTATTACCG GCGGTACGGG TGCTGCACTT
GGTACATTTA CTGTCAGCAT GACCAATAAT ACCGCTACGG TTACGATCAG CGATGACGAC
AACACAGCGG CGAATACGAC TATCAGCATT ACCACACTGA ATGATGCTGC GGAACCAGGT
ACTACGGGTA ATTTCACTAT CAGCTTACCA ACAGGTATTA CTGCGGTAGA AGACATTACC
GTTACTTACA ATACCAGTGG TACGGCAACA CCGGGTATTG ATTATGCTAC TTTAACCGGC
AGTCTCATTA TCCCTGCCGG ACAGAATAGC GTCACCCTGC CGGTAACTGT TATTGACGAC
GGATTGATGG AGGGTACTGA AACGGTGATG GTGAATCTGA GTGGCGCGTC TGCTGCTACA
CTGGGTACCT TGACGATCAG TACGACCAAT GCACAGGCGA CTGTCAATAT CAATGATGAT
GACAATAGTG TTGTCAGATT TGAAACCTGG AAAACAGCCG CCCTGCCTGC TGGTAATACA
GACGGTAAGA TCGGTCAGGG TGAACAGATC ACTTATACCA TCTTCATCCG CAATACGGGT
AATATCAATA TCCCTCGGCT GTCTGTACAG GATCCGGTGC CTGCTTATAC TTCTTATGTA
AGTGGTGGTG CGCTGATAGG CAATAATGTA CATTTCAGTA TTGTTGACCT GCAACCAAAT
GCTGTCAGCC AGGTGAGCTT TATTGTACAG ACTTACGATA ACCTTAACGG GGTACAGCAG
ATTACCAACG TGGCACAGAT CAGCGATGGT ACGACTACAT TGAATACCCT AGCATGTGAT
CCTTCCGATC CAAATTGTAC CGGAAGTAAC CAGACGATTG TTCCTGTACG TGAACCACAG
GGCGACCTGG TGATCAGCAA AAGTGCGGTG AATCAGCCAA CCAATGGTCA GCATTATATA
CTGGGCGAGA ATATCACTTA TGAGATTGTG GTGAGCAATG TGGGTGAGAA GACATTTACC
AACATCGCCA TTACGGATTC CCTGCCTGCG TCACTGGACA TGCCGACTTA TTATATCAGC
AGCAAGGGTA GTATCATTGC GAGTCCGGCG GCCAAAAAGG TGGTCACAAG TGTTGATCAG
CTGTTGCCGG GTGAGGATGT CACCATTACT ATCACCTGCC GTGTTAATAA TAAGGAGATC
ATCAATACTG CTTATGTGCA GGCAGATGAA ACGGAGACCG ATCTGTCCAA CAATACAGCA
GTTGCTACCG CGGCTGCTTC TATTAAGGAC CTGGCCTTTA TCAATGCATT CAGACCCGGC
AATGGTGCAA ACAATCGTTT TGTGATTGTG GGACTTGAAA AATACCCTGG GTCCAAGCTG
CTTGTTTACA ACAGATGGGG CTCACTGGTG TATCAGTCTA ACGATTATAA GAATGACTGG
AGAGCACTGG ATCTGCCAAT GGGCGGCTAC ATCTATGTAG CCGAAGTGAA GAAACCGGAA
GGCGTAGTAG TGTATAAAGG TGATTTCATT ATCATCAGAT AA
 
Protein sequence
MIRLLFTFLL VAFLSPLAIA QTCNKDLALG KAVTTSSVTA GNIPTRATDG DPTSRWESAW 
SNNQWIMVDL GQSYPICTIT LNWQVLATGY TVEVSPDGTS WTNLVTETAN AGLNKTYSVS
ANGRYVRMTG LTRGSAYGFS LYDFIVIGTE PINYCSNTNV AQLRPVTVSS VVNGNTGNFA
VDNNMGTRWE SAHSDNQSMY VDLGANYDLC RVVLNWEAAY GRDYTIDISS NGTAWTTVKT
VTGNYMRDNT LDVSGNARYV RMTGQTRGST YGFSLWEFSV YALLPAVSVT KVTDAAEPGT
GGSFRFSLPS GITFTEDITV NYSTSGTASS GTDYTALAGS VVIPAGQAGV NVPLTVIDDQ
IIEGNETVIA TITNAISTTR PGFPISATDP SATMTITDND NIAANKILSI APGGNGAEPS
TNGSYTISLP AGVTSAQSIT VNYTTSGTAT AGADYTLPAT FTLPAGQNSA TVTLTVSNDQ
IIEGTETSTV TITGGTTATL GAFTANLSNA TASVNILDDD NNWTNKSIRI ARTAHAAEPS
TNGGFRIYLP TGITASEDVT VTAGVTGTAT PGTDYVVLGS TFVIPAGQNS VAVPVQVIDD
NEIESTETVT VSPTAASSAT LGGFAIVFGT TANINIADND DIPANRRLSV TKTIDAMEGA
SNGNFRISLP AGVSYPAAIT VNYTISGTAV GGTDYTAIPA TATIPAGQNG VDVAVTAQPY
NNNIIENDRT VIMTLTGGSA PSFSAFTPDP ANTAATVIIA DDDSNATNKR LSVYNIGMDA
AEPSTAGAFQ IRLPGTLRAS EPITVSYTIT PSSSAGTTSA TPGTDFNNLT GTVVIPAGQN
AVNVPYSPID DQIIETREEF TVQLTGGTTP TLGAFGVDVA TGVMAIDDND NIPANKTFTI
TPTAPAAESS TNGSFSVSLP AGYTVSQNVT FYYTVGGTAT PGSDYTGIGT SVILTAGQNS
VQITVPVIDD NIIEGTESID VTATSGASAD ITGFTSSGTA TNTIADDDNT ATNKVISIIA
ANNGAEPATD ATFTISLPAG ITSATPVTVN FSVSGTATSG ADYTALATSI VIPANQNSVT
LTVPVLDDQA IEGTETVIVT VTGGTAPLAG TFTASTTNAT ATVNIADDDN TPANRVISIV
NANNGSEPAT NGAFTINLPT GVTLTEDVTV NFSVAGTATP GNDYTSPGTS TIIPAGQNSV
TLTVHVLDDQ LIETTETVIV TITGGSATTA GTFIAGTSNV ATVDITDNDN TPANKTISIT
TANDGSEPSS NGAFNVSLPT GITNTEDITV NFTVAGTATA GTDYTGLGTS VTIPAGQNSV
TLTVPILDDQ LIEGTETVIV TVTGGSATAA GAFSASATNA TATVNINDDD NNAANKVISI
VAANDGSEPA INAAFTISLP TGVTVNEPVT VNFTVAGSAT SSTDYTAIGT SIIIPAGQNN
VTLTVPVLDD QLIEGTETVI VTVTGGAATA AGVFAASTTN AAATVNISDN DDVAANKVIS
IATDNDGTEP GTAGAFTISL PAGITAAEPI TVNFTVAGTA TAGTDYTAFG TSTIIPAGQN
SVTLTVTVLD DQIIESTETV IVTVTGGTAT TAGAFTAGTS NTATVNINDN DNTTTNKVIT
IAAANDGTEP GTNAAFTVSL PAGVTVDEDV TVNFSVAGTA TAGTDYTAIG TAVMILAGQN
SATLAVPVLD DQEIEATETV IVTVTGGAAT NAGAFTASIA NATATVNIND DDNTAANKVI
SITTANNGDE PSNNGAFTIS LPTGITVNED VTVNFTVTGT ATAGTDYTAL GTSIIIPAGQ
NSVTLTVPVL DDQIIEGTEA VTVTITGAAA TNAGAFTAGT NNVATVNIND DDNTAINKVI
SIAAANDGTE PATNAAFTIS LPAGVTVNED VTVNFTVAGT ATSGADYTTI GTSATILAGQ
NSVILTVPVL DDQIIEAAET VILTVTGGAA TNTGAFTANL INATATVNIN DDDNTATNKV
LSIAAANDGT EPGNNAAFTV SLPTGVTVDE DVTVNFSLAG TATAGTDYTA IGSAVTILAG
QNSATLTVPV LDDQEIEATE TVIVTVTGGS ATNAGAFTAS VTNTTATVNI SDDDNTAANK
VISITTANNG DEPSNNGAFT VSLPTGITVN EDVTVNFTVT GTATAGTDYT ALGTSIIIPA
GQNSVTLAVP VLDDQIIEGT ESVTVTITGG SATAAGAFTA GTNNVATVNI NDDDNTATNK
VISIAAANDG TEPATNAAFT ISLPAGITVN EDLSVNFTIA GTATSGADYT PIGTTATILA
GQNSLTLTVP VLDDQIIEAA ETVILTIIDG TATVAGTFTA GTANTATVNI NDNDNTAANK
IISIAAANDG AEPAGNAAFT ISLPTGVTVD EDLTVNFTVA GTATAGTDYI SLGTSIVIPA
GQNSVTLNVP IQDDQVIEAT ETVIVTVTGG SATNAGAFTA SATNSTATIN IIDNDNTAAN
KVISIVAAND GSEPATNAAF TVSLPTGVTS DEVVTVNFTV AGTATAGTDY TALGTSIIIP
AGQNSVMLTV PVLDDQIIES TETVTVTITA GAATNAGAFT AGTSNTATVN ISDDDNIPSN
KVISITAAND GAEPASNAAF TISLPTGVTA NEAVTVNFTV AGTATAGTDY TTLGTSVIIP
AGQNSVSLNV PVLDDQVIEA TETVIVTVTG GSATAAGAFT ASATNATSTV DITDNDNTAA
NKVISITRTA DGSEPSSNGA FSISLPAGVS ISEDLNIIYN IAGSAINGTD YGALNGTIIL
PAGQNSISLP VIVTDDDIIE GAESVVVSIA NAAATTITGF TVSITNSSAT VIIADEDNTT
ANRVISINPV TNGSEPATNG SFAVSLPTGI TAAEDITVNY IVAGTATVGT DYTALGTSII
IPAGQNSVTL TVPVLDDQII EATETVIVTV TGGTTISIGA FTASATNATT TVNINDNDNT
PANKVISIVA ANDGAEPSTN GAFTISLPTG VTVNEAVTVN FTTAGTATAG PDYTTLATTI
TIPAGQNSVT LSVPVLDDQI IESAETVIAT ITGGTATAAG TFTASATNAT ATINITDNDN
TAANKVISIV AVNDGAEPST NGAFTISLPA GVSVDEDVTV NFTVIGTATA GADYTALGAS
VIIPAGQNSI TLAVPVRDDQ AIEATETVIV TITSGTAANA GAFTAGTNNT ATVNISDDDN
TPANKVISIV AANDGAEPST NGAFTISLPT GVSVNEDVTV NFTVAGTATA GADYSSLGTT
VIIPAGQNSA ILTVPVLDDQ VIESTETVIV TVTGGTGITT GAFTVSTSDA SAAANISDDD
NTVANRIMNV INTNDGSEPA TNGNFSISLP AGIITAEDII VSYTLAGTAT TVTDYTTSGT
NVTIPAGQNS VTLPIVVNDD LLMEGDETVV ITITGGMTAT LGTYTPGTAS ATLIIADDEN
TVTNKVLSIS RNADAAEPAT NGSFTVSLPA GISATENISV SYTISGTATA GTDYATLSGS
VIIPAGQNSA NIDMMVNDDQ LIEGNETVIA TITGGTGAAL GTFTVSMTNN TATVTISDDD
NTAANTTISI TTLNDAAEPG TTGNFTISLP TGITAVEDIT VTYNTSGTAT PGIDYATLTG
SLIIPAGQNS VTLPVTVIDD GLMEGTETVM VNLSGASAAT LGTLTISTTN AQATVNINDD
DNSVVRFETW KTAALPAGNT DGKIGQGEQI TYTIFIRNTG NINIPRLSVQ DPVPAYTSYV
SGGALIGNNV HFSIVDLQPN AVSQVSFIVQ TYDNLNGVQQ ITNVAQISDG TTTLNTLACD
PSDPNCTGSN QTIVPVREPQ GDLVISKSAV NQPTNGQHYI LGENITYEIV VSNVGEKTFT
NIAITDSLPA SLDMPTYYIS SKGSIIASPA AKKVVTSVDQ LLPGEDVTIT ITCRVNNKEI
INTAYVQADE TETDLSNNTA VATAAASIKD LAFINAFRPG NGANNRFVIV GLEKYPGSKL
LVYNRWGSLV YQSNDYKNDW RALDLPMGGY IYVAEVKKPE GVVVYKGDFI IIR