Gene Phep_2593 details


Gene Information

Locus tag: Phep_2593
Symbol:
ID: 8253700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3021781
End bp: 3033162
Gene Length: 11382 bp
Protein Length: 3793 aa
Translation table: 11
GC content: 45%
IMG OID: 644936243
Product: conserved repeat domain protein
Protein accession: YP_003092859
Protein GI: 255532487
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
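
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3021781
end_bp = 3033162
gene_length_bp = 11382
protein_length_aa = 3793

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1
assert span == gene_length_bp

# Assumption: the CDS includes one terminal stop codon,
# so the protein has one residue fewer than the codon count.
codons, remainder = divmod(gene_length_bp, 3)
assert remainder == 0
residues = codons - 1
assert residues == protein_length_aa

print(span, residues)  # prints: 11382 3793
```

Under these assumptions the reported gene length and protein length agree with the coordinates.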


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.073398
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAATA CGCTCAGATC TAATGTGCTC TCCTGTTACA GAGGTATATG CTGTCTGGTT 
TTTCTATTAT GCTCATTATT TAATTTATCG AATGTACTGG CAGAGGGGAG TAAAGATTTA
TATCCGGTTG GGGCATCGGG TAATCGTGCA TTTTTATATT CCAATGCTGC ACTCGGAATT
ATTAGTACTT CAGTAGCTTT CCCTTTCAGA ACCCTTGGAA CGCATTATGT TTACGCTGTT
AAAAATGAAT ATATAGCGGT GGCATCCAGT GCACAGGGAG TTGGTAATGG GATAATTATT
ATCACTTCCC CATCAGGGGT ACAGACAATG ACTACTGTTG GAAGCACGGT GGGTAAAATT
TCCAGTAGAA CCGAAGAATT AGCTGGGCCA CGTTTTCCAG GAGAAGGTGC TGGGAATAAT
CGCTATTTAC CTTATTCTTT TCAGGTGGGT AATGATGAGG GTGTTTGGAA AATTGAATTT
ATACCAACAG GTAATAAAGT AGATACAAAT AGCCCTACCG CCTCAGACGT TTTCGCAGAT
GGTACTTGGA GTCAAAGTAA TAATACCCCC CTTATTTCTG CCTGGGATGT TTCAGTTTGT
AATGCTGTCA AAGATAAATG GCTGAGGGGT AGGGTGTATA CAAATGTTCT GAATTTAAAG
TTGTCCCAGA ATTTTGATGA TTCGAATAAG GCCTATAATG GGATAAATTA TGTTTTAACT
AAAGACGGAA GGGCTTACCG TGTAAAAAAT AACGGTAATA ACGGCTTGGG CTTTACTTTT
TTTTCGAATA ATAAAGGATT TGTAGATGCT ACGGGGAATC CACTTTATAA AAGTCTGAAT
ACAACAGACG AGACTGTAAT TGCCAATGCT GTACATGATC CAAGGGCTGT TGATGATTTG
ACAAAAAACC TGGTTACTCA TAAACTGTTT TATGGCCCTC CTTCTCCTGA TCTGCCGGTT
TCCGCTAAGG TTGCAATAGA TGGAACTACT TCTACGACTA CCTGGCTAAA AAATGAACCA
ACTGTTCCGG TTGTTTCGAA TATCGATTTT GTAGGTTCTG AAGGGACTGT TGGGCAGGCC
AGTCAAAAAG GAGGAACAAT AATCTTTAAA ACAAATGTTG CAGGTTCATA CCGGATCACT
ATACCACTTG GCGCTGGTAA AGACAGGGTG CTCAATGGTG CTGCTGCCAA TGGTGTGAAT
ACGATAAAAT GGGATGGAAA AGATGGCTTA GGTAATTTAG TTCCAGGAGG TTTGGTTGTT
CCTGAAGTAA GAGTTAGACT GGCATCAGCG GAGGTACATT TTCCATTTAT AGATATGGAG
ATAAACCCAA AGGGCATTAT TATTGAATTG ACTGAAAACT CAGCGTTTTA TCCTGTAAAT
CCAACGCCTA ATATTATTGA CGAAAGTGTG TATAGTGATA GGGTATATTG GGATGACAGC
GATATTACCG GAGGTAAGGC TGGTGAGGTT TCTAATCCAC AGGTAAATGT AATTAATGGA
ATCTCAAGTA ATTCGAATGG ACATAAATGG GGAAGCTATA GCGGGAGCAG TGGCTCTGGA
AATAGTGGTG CCGGAACAAA TAGCTATGGT AATGAAAAGG CTATGGATAC TTATGCTTAT
ATCCTTAGTA AGGAAGAAGC CCAGCCCTTA AATGTGAATG TTAAAGTTGC AGAGCTGCAG
GTGGTGAGTG TAATACCAAC TTATACTTCT ACAGCTGCAG GTACTCAGGT TACTTATGCA
GTGAAGGTTA AAAATGCCGG TGTAAGTGAT GTGACAGGAG CAGCGTTTCA TTTTAATGTA
CCTACTGGCT TTACCGTCAA TGCTAACGGC ATAAGTAGTT CTTTTACTGG GGCAACGGGT
GCAGAAAGTA ATGCTGTGCT AGATGCAACG GGATTTAGTT CTATGCTGGA TTTGGCTAAT
GGCTGTATAG TTACTTATAC AATTAACGGA ACTATTGGTG CTGGTATTTC GGGAAATGAT
ATTCTTGTCG ATGCAAGTAT AATGAGACCG AAGGATGTTA CGGACCCAGA TGCAACCAAT
ATTATATTTG ATAAGGCACC TACTAATCCA CACGATGAAT GTTTAAATGG TACTGCCAAT
GAGGTTTGTA ACAATATAAA ATACAATACG GCCCATGCTC AGCTCGTATG TGTAAATTCA
TCAATTGTGC CGATTGAGTA TGTTCTTGAA GCAGGTGGAA CAGATGTTCA AATCCTGCCT
GCATTGCCAG CCGGGCTGGT TAAAAATATA ACAGGAGGTA AATTGACAAT TACAGGTACA
CCAACCGCTT CTGGCATATT TACTTTTACA ATAAATACTT TAGGAACTCA ACGGTCAACA
AAAACGGTAG TACTAACGGT AGGTTCCAGG GCTATTGCTT CAGATATTGA AGTTGCTGAT
AAGGCGATTT GTATTGGTCA GAGTGCCGAA TTGACGTCTA ATTTGGCTGC TGGTAGTACT
ATCGTTAATC CGGTTTTCAA TTGGTATAGT GAGGCTGCCC TTACAACATT ACTTCATACG
GGGACAATTT ATACTGTTAA TCCTACTGTT AGCACTACCT ATTATGTAAC GGTTAAAGGT
GATAACAGTT GCGAAAATGC TTCCGGGGCG GGAAAAGCAG TAACGATTAC AGTAAACCCA
TTGCCCTTAC AACCTACGGT TACGGCTTCT GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAATGC AGCTACGAGC TACCAGTGGT ACCGTGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTACTGTTAC GGTAAACCCA
TTGCCACTTC AACCAACAGT TAATGCATCT GCAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTCACGCTGA CCTCATCAGT TGCTGCCAGT TACCAGTGGT ACCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTATAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCGACAGCAG TTACTGTTAC GGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGCGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTTCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCA GGAGGAACCA CCTTCTGTCT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGGGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCA GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ACCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCGACAGCAG TTACTGTTAC GGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTCT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGGGGCAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTTCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCA GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGCGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCA GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ACCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTCACGCTGA CCTCATCAGT TGCTGCCAGT TACCAGTGGT ACCGTGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTTGTAGTT
ACCGATGGTA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTACTGTTAC GGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTTACCTTGA CCTCATCAGC TGCTGCCAGT TACCAGTGGT ATCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGCGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGAACCA CCTTCTGTCT TGGTGGTTCT
GTCACGCTGA CCTCATCAGT TGCTGCCAGT TACCAGTGGT ACCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTTGTAGTT
ACCGATGGTA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTTACCTTGA CCTCATCAGC TGCTGCCAGT TACCAGTGGT ATCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACTGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC GGTAAACCCA
TTGCCACTTC AACCAACAGT TACGGCATCT GCAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGCGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTTCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGCGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGAACCA CCTTCTGTCT TGGTGGTTCT
GTCACGCTGA CCTCATCAGT TGCTGCCAGT TACCAGTGGT ACCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGTA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTACTGTTAC GGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTTACCTTGA CCTCATCAGC TGCTGCCAGT TACCAGTGGT ATCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACTGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC GGTAAACCCA
TTGCCACTAC AACCAACAGT TACGGCTTCT GGAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTTACCTTGA CCTCATCAGC TGCTGCCAGT TACCAGTGGT ACCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCTA
TTGCCACTTC AACCTACGGT TACGGCATCT GCAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTTACACTGA CCTCATCAGT TGCTACGAGC TACCAGTGGT ACCGTGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC ATACACGGCT ACAGTAACTG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCGACAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGCGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGTA ATTCCTGTTC AAGCCCAGCT TCGACAGCAG TTCCGGTTAC GGTAAACCCA
TTGCCACTTC AACCAACAGT TAATGCATCT GGAGGGACAA CCTTCTGTCT TGGTGGTTCT
GTCACGCTGA CCTCATCAGT TGCTGCCAGT TACCAGTGGT ATCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCGACAGCAG TTACTGTTAC GGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGGGGCAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTTCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCA GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGGGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCA GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ACCGGGGCAC CACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCGACAGCAG TTACTGTTAC GGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGGGGCAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTTCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCA GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGGGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC GGTTGTAGTT
ACCGATGGCA ATTCCTGTTC AAGCCCGGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGAACCA CCTTCTGTGT TGGTGGTTCA
GTAGTGTTAA CTTCCAGTGC AGCTACGAGC TACCAGTGGT ATCGCGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTAATCGTT
ACCGATGGCA ATTCCTGTTC AAGCCCAGCT TCGACAGTAG TTCCGGTTAC AGTAAACCCA
TTGCCACTTC AACCTACGGT TACGGCTTCT GGAGGGACAA CTTTCTGTCT TGGTGGTTCT
GTCACGCTGA CCTCATCAGT TGCTGCCAGT TACCAGTGGT ACCGTGGTAC TACTTTGTTA
ACCGGTGAAA CGAACCAAAC TTATACAGCA AGTGAAAGCG GCAATTATAC AGTTGTAGTT
ACCGATGGTA ATTCCTGTTC AAGCCCAGCT TCAGCAGCAG TTCCGGTTAC AGTAAACCCA
TTGCCTTTAC AACCAACGGT AATTGCAGAA AGTAATACCA TTTTCTGTAT TGGAGGCTCT
GTGAAATTAA CTTCCAGTGC TTCAACGGGT AACCAGTGGT ATAAAGACGG CAATCTAATC
TCAGGGGCAG TAAATAAAAC CTATATCGCA ACAGCAACAG GATCATATTT TACGATCGTT
ACCAATGCAA ACGGATGTAG TAGCCTTCCT TCCCAGGCCA TTCCGGTAAC TGTAAGTCCT
TATCCTGAGA TACCCGATAT TTCACCAGCT GGTGCCACAA CTTTTTGCGA AGGGGGCATA
GTTACCTTAA CCTCTTCATC AGCAAATGGA AATCAATGGT ATAAAAATGG AGTGTTGATG
CCAGGCGCAA CCGGAAAAAC CCTGGATGTA AATCAGATAG GGGATTACAC TGTGAAAGTT
ACCAACAGCA CCGGGTGTGC CAGTAACCTA TCGGCTTTAA CAAAAGTAAC TGTGAATCGT
GTGCCTAAGG GATTTGATGA CAACATCAAT TCATTAAGCT GTTCCCAATC CTCATTTAGT
TATAACCTGC AAACAAAAAA TGTAAACAAT ACTTTAAAAG GTGGAAATGG GGTTGCAGCA
TCCTTTACCT GGACTGTAAA TTCAACTGTT TTAGGTGCTA TAAATGGTTC CGGAAAAATA
CTTAATGCAA CACTGATCAA TACTTCAACC ACAGAACAAA CTGTTGTTTA TATAGTAACA
CCTATAGCCG AAACAGGTGG TTGCGCCGGG CAACCCTTTA AAATCACTGT TCGTGTCCCT
GCATGTATAG ATATTTCAAT CAGTAAAACC GCAGATAAAA ATGTAGTCGC TACAGTTGGA
GATAAAATTA ATTATACCAT AACCATAAAA AATACCGGAG ATGCAAACCA CAACAATGTA
AAGGTTAATG ATCCGTTGCT TGGAGGTATG CTCAGCCAGC CAACAGGCGA TAACGGAAAT
GCGATATTAG AAAAAGATGA GACATGGACT TACAGAGGAA CATATACTGT TACTCAGAAT
GATCTTGATC TTAACGGTAT ACCAACCGGA AATACCGGAA AAATCATCAA TACGGCAACA
GTTAGTTCAC AAGCTTATCC TCAATCCAAA TCGGCCATTG CAGAGGTAGA CATTCATACC
AACCCTTCAA TAACACTGGT TAAAACCGGT GCTATGAACC GTGATTTTAA AACCATGACC
TATACCTTCA ATATTACGAA CAATGGAAAT GTAACTTTAA ATAACCTGGT TGTAAGAGAT
CCAAAAATCC CTCAACTTAT TGCATTAAAG CAAACAATTC TTGCTCCGGG CGCTTCAACC
ACTGGTATTG CAGTATATAC CATTACCAAT GAGGAAAAGA TTGCTGGTAC TGTAAGTAAT
ACAGCAACAG TTGCAGGATT TACAAAAACA AATGTTAAGG TTACTGATGC ATCGGGCACC
GCTGAAAATA ACGATGAGCC TACAGTGATC GACATAACAC GATACCCTAT AGCGATAGAT
GATTATGCCA AAACCAAGGC AGAAGAGGAA GTAGCTGTAC CAGTAATAAA CAACGACCGG
CCAGCCCTGT TTCCTTTAAA TGCTGCCACC CTGGAAGTAA AAAGCCAGCC TGTAAATGGT
AAACTATTGG TAAATAAAGA TGGAAAGGTT GTTTATAAAC CAAACAAAGG TTTTTTTGGG
ATAGAGAAGT TTACCTATAA AGTTGATGAT GCAAACGGCT TATCATCTAA TGTTGCCATT
GTAACCATTA ATGTTGCTCC TCCAGATCTG GATATCCCAA ATACATTTAC ACCTAATGGC
GATGGTAAGA ACGATACCTT CCTGATTACA GGCATCGAAA ACTATGATGG GGTAAGCCTT
TTTGTTTATA ACCGTTGGGG CGATGAGGTA TATAAAAATA ACAATTATAA AAACGAATGG
GATGGTAACG GTTTGAATGA TGGTACTTAT TTCTACGTAT TAAAACTAAG AACAGGAAAT
AAAGAAGAGT CGAAACGGAG CTGGGTATTG ATAAAGAGAT AA
 
Protein sequence
MLNTLRSNVL SCYRGICCLV FLLCSLFNLS NVLAEGSKDL YPVGASGNRA FLYSNAALGI 
ISTSVAFPFR TLGTHYVYAV KNEYIAVASS AQGVGNGIII ITSPSGVQTM TTVGSTVGKI
SSRTEELAGP RFPGEGAGNN RYLPYSFQVG NDEGVWKIEF IPTGNKVDTN SPTASDVFAD
GTWSQSNNTP LISAWDVSVC NAVKDKWLRG RVYTNVLNLK LSQNFDDSNK AYNGINYVLT
KDGRAYRVKN NGNNGLGFTF FSNNKGFVDA TGNPLYKSLN TTDETVIANA VHDPRAVDDL
TKNLVTHKLF YGPPSPDLPV SAKVAIDGTT STTTWLKNEP TVPVVSNIDF VGSEGTVGQA
SQKGGTIIFK TNVAGSYRIT IPLGAGKDRV LNGAAANGVN TIKWDGKDGL GNLVPGGLVV
PEVRVRLASA EVHFPFIDME INPKGIIIEL TENSAFYPVN PTPNIIDESV YSDRVYWDDS
DITGGKAGEV SNPQVNVING ISSNSNGHKW GSYSGSSGSG NSGAGTNSYG NEKAMDTYAY
ILSKEEAQPL NVNVKVAELQ VVSVIPTYTS TAAGTQVTYA VKVKNAGVSD VTGAAFHFNV
PTGFTVNANG ISSSFTGATG AESNAVLDAT GFSSMLDLAN GCIVTYTING TIGAGISGND
ILVDASIMRP KDVTDPDATN IIFDKAPTNP HDECLNGTAN EVCNNIKYNT AHAQLVCVNS
SIVPIEYVLE AGGTDVQILP ALPAGLVKNI TGGKLTITGT PTASGIFTFT INTLGTQRST
KTVVLTVGSR AIASDIEVAD KAICIGQSAE LTSNLAAGST IVNPVFNWYS EAALTTLLHT
GTIYTVNPTV STTYYVTVKG DNSCENASGA GKAVTITVNP LPLQPTVTAS GGTTFCVGGS
VVLTSNAATS YQWYRGTTLL TGETNQTYTA SESGNYTVIV TDGNSCSSPA STAVTVTVNP
LPLQPTVNAS AGTTFCLGGS VTLTSSVAAS YQWYRGTTLL TGETNQTYTA SESGNYTVIV
TDGNSCSSPA STAVTVTVNP LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL
TGETNQTYTA SESGNYTVIV TDGNFCSSPA SAAVPVTVNP LPLQPTVTAS GGTTFCLGGS
VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVVV TDGNSCSSPA SAAVPVTVNP
LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVVV
TDGNSCSSPA STAVTVTVNP LPLQPTVTAS GGTTFCLGGS VVLTSSAATS YQWYRGTTLL
TGETNQTYTA SESGNYTVIV TDGNFCSSPA SAAVPVTVNP LPLQPTVTAS GGTTFCVGGS
VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVVV TDGNSCSSPA STAVPVTVNP
LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVVV
TDGNSCSSPA STAVPVTVNP LPLQPTVTAS GGTTFCLGGS VTLTSSVAAS YQWYRGTTLL
TGETNQTYTA SESGNYTVVV TDGNSCSSPA STAVTVTVNP LPLQPTVTAS GGTTFCLGGS
VTLTSSAAAS YQWYRGTTLL TGETNQTYTA SESGNYTVIV TDGNSCSSPA SAAVPVTVNP
LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVIV
TDGNSCSSPA STAVPVTVNP LPLQPTVTAS GGTTFCLGGS VTLTSSVAAS YQWYRGTTLL
TGETNQTYTA SESGNYTVVV TDGNSCSSPA STAVPVTVNP LPLQPTVTAS GGTTFCLGGS
VTLTSSAAAS YQWYRGTTLL TGETNQTYTA SESGNYTVIV TDGNSCSSPA STAVPVTVNP
LPLQPTVTAS AGTTFCVGGS VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVIV
TDGNFCSSPA SAAVPVTVNP LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL
TGETNQTYTA SESGNYTVIV TDGNSCSSPA STAVPVTVNP LPLQPTVTAS GGTTFCLGGS
VTLTSSVAAS YQWYRGTTLL TGETNQTYTA SESGNYTVIV TDGNSCSSPA STAVTVTVNP
LPLQPTVTAS GGTTFCLGGS VTLTSSAAAS YQWYRGTTLL TGETNQTYTA SESGNYTVIV
TDGNSCSSPA STAVPVTVNP LPLQPTVTAS GGTTFCLGGS VTLTSSAAAS YQWYRGTTLL
TGETNQTYTA SESGNYTVVV TDGNSCSSPA SAAVPVTVNL LPLQPTVTAS AGTTFCLGGS
VTLTSSVATS YQWYRGTTLL TGETNQTYTA TVTGNYTVVV TDGNSCSSPA STAVPVTVNP
LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVVV
TDGNSCSSPA STAVPVTVNP LPLQPTVNAS GGTTFCLGGS VTLTSSVAAS YQWYRGTTLL
TGETNQTYTA SESGNYTVIV TDGNSCSSPA STAVTVTVNP LPLQPTVTAS GGTTFCVGGS
VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVIV TDGNFCSSPA SAAVPVTVNP
LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVVV
TDGNSCSSPA SAAVPVTVNP LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL
TGETNQTYTA SESGNYTVVV TDGNSCSSPA STAVTVTVNP LPLQPTVTAS GGTTFCVGGS
VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVIV TDGNFCSSPA SAAVPVTVNP
LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL TGETNQTYTA SESGNYTVVV
TDGNSCSSPA SAAVPVTVNP LPLQPTVTAS GGTTFCVGGS VVLTSSAATS YQWYRGTTLL
TGETNQTYTA SESGNYTVIV TDGNSCSSPA STVVPVTVNP LPLQPTVTAS GGTTFCLGGS
VTLTSSVAAS YQWYRGTTLL TGETNQTYTA SESGNYTVVV TDGNSCSSPA SAAVPVTVNP
LPLQPTVIAE SNTIFCIGGS VKLTSSASTG NQWYKDGNLI SGAVNKTYIA TATGSYFTIV
TNANGCSSLP SQAIPVTVSP YPEIPDISPA GATTFCEGGI VTLTSSSANG NQWYKNGVLM
PGATGKTLDV NQIGDYTVKV TNSTGCASNL SALTKVTVNR VPKGFDDNIN SLSCSQSSFS
YNLQTKNVNN TLKGGNGVAA SFTWTVNSTV LGAINGSGKI LNATLINTST TEQTVVYIVT
PIAETGGCAG QPFKITVRVP ACIDISISKT ADKNVVATVG DKINYTITIK NTGDANHNNV
KVNDPLLGGM LSQPTGDNGN AILEKDETWT YRGTYTVTQN DLDLNGIPTG NTGKIINTAT
VSSQAYPQSK SAIAEVDIHT NPSITLVKTG AMNRDFKTMT YTFNITNNGN VTLNNLVVRD
PKIPQLIALK QTILAPGAST TGIAVYTITN EEKIAGTVSN TATVAGFTKT NVKVTDASGT
AENNDEPTVI DITRYPIAID DYAKTKAEEE VAVPVINNDR PALFPLNAAT LEVKSQPVNG
KLLVNKDGKV VYKPNKGFFG IEKFTYKVDD ANGLSSNVAI VTINVAPPDL DIPNTFTPNG
DGKNDTFLIT GIENYDGVSL FVYNRWGDEV YKNNNYKNEW DGNGLNDGTY FYVLKLRTGN
KEESKRSWVL IKR