Gene Sde_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3233 
Symbol 
ID3965706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4106756 
End bp4120153 
Gene Length13398 bp 
Protein Length4465 aa 
Translation table11 
GC content46% 
IMG OID637922330 
Producthypothetical protein 
Protein accessionYP_528702 
Protein GI90022875 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.226873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAA GGGTGCGCAA AAAAAATCGA CGTCCGTTAA TCGAGCCGCT AGAGCCTCGG 
TTACTTTTCT CTGCCAGCGC AGATATTGTG CTGCTGGATG ATGCAGATAG TGACGTAAAT
TTTTTACAAC AGGCCGCCGC GCAAACGGAT CTTTCTGCCG TATTTAATAC CTCGCCTAGC
AAAACCACAC CTTTTGATAC CTCGCCTTTC GAAACCACGC CATACGAAAC CGAAGACGGG
TTAGCAGCAG ATGTGCATAA TGATGACCGT GTAGAAATTA AAGAGTTGGT TTTTGTCGAT
ACAGGTATCG ACAATTACGA GTCATTACTG AGTGGTTTGT TAGAAGGTAG AAATGCCGAC
GATATTAAAG TTATTTACAT AGACGCAGAG GAAAACGGTG TTGAGCTAGT TACACAAACT
CTGCTTGGGT TTACCGGTGT GCAAAGCGTT CACATGCTCA CCCATGGTAA GGCGGGGGAA
ATTCAATTAG GCAACAGCTT TATAAACAGT CACAGCTTGG CGAACTACGC CGATGCAATA
ACGCAATGGC AGCAGGCGTT AGGCGAAGAT GCCGACATTT TAATTTACGG TTGCGATATT
GCCGCCACCG ACGCAGGGCG AGAACTATTA AAGCAGCTCG CTGAGTTAAC CCAAGCCGAT
ATTGCTGCCA GCGACGATTT AACAGGGCAT GCCTCCTTAG GTGGTGATTG GGATTTAGAA
TTTAACGCCG GTGAAGTTGA AACAGAAATA ATTGTGACTA AAGCCGCGCA GAAACAATGG
CAAGGCTTAT TAGTAGATGT AAATTATATA AATACCGGCG GCGGCGATTT TACCGAGACG
GTATCTAGCT CGGTTAACGT TGGGCAAGAC TTTAGTTATA ACAGCGGCAG CGGTACCTAT
ACGGTAAACG AAATAAGTTT GAATTTGGCT CGTTACGCTA CAGCAGAAAG TCAAACAATT
ACTGTGCAGT TACGTGATGC ATGGAATGGT ACTGTTTTGG CTTCCGATAC CGTCGCTTCG
TCAGAAATAA GTGCCGATGG TTTTGAGTGG CATTCGTTTG GCTTTGGCGA TGTGTCGCTA
ACCGATGGCG CAACCTATTT TGTTCGCGTG TCCTCTGATG CAGACGACAG TGAAATTCTA
ATTCGACGTT ACAATAGCAG TATAATTGGC GGGCATGCAT TTCAAAGTAA CGGTACGCCT
AATGGTGACG GCAATGACCT AGCTTTTAAA ATCGCTTATG AAGACGGTAG CAACTCGGCA
CCTTATGTCG ACAATGCTGT GCCAGATCAA GCCGCCACAG AAGATGTTGC ATTTAACTAT
ACCTTACCGG CAAATACATT TGCAGACCCA GATGCGAACG ATACCATTCG TATTCGCGTT
GAGCTATCTG GTGGCGGTAG TTTAGGTTGG TTGCAATACA ATGAAAGTAC TCGACAATTT
TCTGGTACAC CACGCACCGC CGATGTAGGC ACCATGAGTA TCGATGTAAT TGCTACCGAT
AATCACGGCG CTTCCATCAC AGAAACGTTT GATATTGTGA TCAATGGTGT TCCCACTGCA
GTGGATGACT CCGCAAACGT TTGGTACACC GAATCTGTAA CCGGCAATGT GATTGATGGC
TCTGGTGGCG TGAGTGCAGA TACTATCGCC GATGCACCAG GTACAATTGT TTCTATTACT
TACGATTCTG TTTTATACAA TTCTTTCGAT GGCTCTAACA ATATAAATAT TGCCGCCGAT
GAAGGAACTT TTGTAATTAA TCAAGACGGC ACCTTCACTT ATACTCCCAC TGCTACCCCC
GTATCTGCGG GTGCAAATAC CACAACAGAT TGGGAAACAG CCTATAACCT ATACGGTTAT
ATGAGTAACC AATCTTATTT AGACGGCAGT AGCAATTTAG ATTTAAGTGC AGCTAACGAA
ACCGTTTTAC AAAGCAGCAC GGGCTTGGGT ATTGAATCTG GGCCGCAAGA TCACATTGAA
AATATAGGCC CAAACCCAGA AGCCGTAGTG GTAGATCTGC AAGCGAATTA TCGTGAATTT
GAAGCCATTA CTAGTTATTT GGGGGCTGGG GAAACAGGCA CGTGGGAAGC CTACGACTCA
AGCTTTGTGT TAGTGGACAC AGGTACACTA GTTGGCACGG GTAGCGGTGA CAGTACCGAT
CTATCAACAA ATTACGAAGT GGTGAGCGGC GATTTTAGAT ATTTAGTTTT TACCGCAACA
ACAGGTACCG ATAGCTACCG CGTATATGAG TTAAGCGGTT TTCAAGCAAC TCCTACCGAT
GAAACATTTG ATTATGTGGT TGAAGACAGT AACGGCGACC AAGATACGGG TTTACTGACC
GTTGCTTTTG TCAATGATAA TAATCGCCCA ACCGTTGCCA ATGCCATTCC AGATCAAGTT
GCCCCCGACA ACGAACCCTT TAGCTTTACT TTTGCAGCCA ATACCTTTAA CGACGCCGAT
GGTGAAACTC TCACTTATTC CGCAGAATTA AAAAACGGAG GTGCGTTACC CAGTTGGCTT
TCGTTTGATG CGAACACTCG AACTTTTAGC GGTACCCCAG CTACAAGCGA TGTGGGTACC
ATTGAAATAA GAGTGACAGC CGATGATGGT AATTGGGGTA CACCCGCACA AGATATTTTT
GAAATAGATG TTAACGATAC TAACGACGAC CCGACATTAG ATAATGCATT GGTCGATCAA
AGCGCAGATC AAGATGCAGC GTTCAGCTAC CAGTTTGCAG CAAATACCTT TGGCGATTTA
GATGCTGGCG ATACGTTAAC GTATACCGCA GAGTTATCTG GCGGTGGCGG ATTGCCTGCG
TGGTTAACCT TTACCCCAGG TACGCGTACT TTTAGTGGTA CGCCTGCATC TGGCGATGTA
GGTACCATTA CAATTGAAGT TACCGCAGAT GATTCTAATG GCGGTACGCC AGCAACCGAT
ACGTTTGATA TTGTTGTAGC CCCGCCCAAT CAGGCACCAG TAAATACTGT AATAGCAGAT
TTTTCTAGTG ACGAAGATGT ACCCATTGTG TTCAACCAAG GCAACGGGAT AATAGGTAGC
TATTTTAATA ACACTACTTT AACCGGCCCT GCTGTGGATA CAAATGTCGA TTACACTATC
AACTATTATT GGACAGGCGC GCCAGACAAC GGCGTTACAG GGATAAATGC AGATAACTTC
TCCGTAAGGT GGGAAGGGCA GCTACTTGTA ACTGAAACCG GCAACCATCA ATTTCAAACC
ATGTCGGATG ATGGTATTCG CGTTTATATT GATGGCGATT TAGTTATAGA TAACTGGGCT
AGCCATCCTT CGGGCACAAT AGATACAAGT TCAAACATTG CGTTAGTTGC CGGCCGTACT
TACGATGTTG TAGTAGATTT TTACGAAAAT AATGTTGAAG CCGAAGCTAA GCTTTTTTGG
CAAACACCTA GCTCTGGTGG TTTTAATATT ATCGCGGCAG GCGATGAAGA TAATTTTGCC
GCTGGCTTAT ATCAAGGGAG CGAATTCTCT GTATTTGATG TCGATGCAGG CGCTGATGAA
CTAGAAGTTA CTATTTCAGT TAATTCTGGT ACGTTATATC TCGCGGGTAT TGATGGGCTC
ACATTTACAA CGGGTGACGG AACATCCGAC ACCACCATGG TTTTTACAGG CACAGCGGAT
GATATTAATT CCGCGCTCGC TTTTTTAAGT TTTACTCCCA CCGCAAATTT TAGTGGTGAT
GTAAGCTTAA CGTTTACCAC TAACGACCAA GGTAACAATG GTCTTGGTGG GCCGCTAAGT
GATACCGACG TTGTAACCAT TACAGTAAAC AGTACCGATA ACGACGACCC GCAATTAGAC
AATGCACTAG TCGATCAAAA CGCTACTGAA GATAGCCCCT TTAGTTACCA GTTTGCAGCC
AACAGTTTTA GCGATCCAGA TGTAGGCGAT ACTATTACCT ACTCCGCGCA AATATCTGGC
GGTGGAGCTT TACCAAGTTG GTTAACTTTT ACTGCTGGCA CGCGTACTTT TAGCGGAACG
CCCACCAACG ATGATGTAGG CACCATCTCT ATTGAAGTAA CAGCAGATGA CGGTAATGGT
GGTACAACTG CAACGGATGT TTTTGATATT ACGGTTGCCA ACACTAACGA CCCACCTACT
GTAGCAAATC AAATTCCGAA TCAGGCCGGC GCAGAAGATT TCGCGTTTGA TTATACTTTT
CCTGCCAATA CGTTTAATGA TATTGATGGA GACTCGTTAA CCTACACGGC AAGGCTTTCA
ACCGGTGACC CACTTCCACC GTGGTTAAAT TTTGATGGTG CTAACAGACG TTTTTACGGT
ACACCCGGTG AAGCTGACTC AATAACTTGG ACTGTTGAAG TGACCGCGGA TGATGGAAAT
GGTGAAACCG TTACCGATAC ATTTGATATT GCCATCGCCA ATACAAATGA TGATCCATAT
GTCGCGAATG CCATCCCCGA TTTAAATACA GGGGATAACG AGCCATTTAG TTATCAGTTT
GCGGCCAACA CTTTTGGTGA TTCAGATTTA GATACGCTCA CTTACACAGC AGAATTATTT
ACCGGTGGTG CACTGCCCGC TTGGCTAAAT TTTGATGCAA ATGCTCGAAC GTTTAGTGGT
ACACCTAGTA CGAGCGATAT TGGTACAACG CACGTAAGAT TAATTGCAGA TGACGGTAAC
GGAGGCACAC CAGCAGAAGA TAGTTTTAAT ATAACCGTAA CCGATACCAA TGACGACCCA
TACATTGCCA ATGCAATTGC CGATCAAGCA GCTACCGAAG ATAGCCCATT TAGCTTTCAG
TTTGCGGCAA ATACGTTTGG AGATTACGAC GGCGACACGC TTACTTATAC TGCTCAACTA
AGCGGTGGAG CCTTGTGGCC GGGTTGGTTA ACATTTACGC CCGGTACACG TACATTTAGT
GGTACACCGG ATAATGGCGA CGTGGGTACA ATTACTATCG AGCTAACCGC GGATGATGGT
AACGGTGGTA CGCCTGCGAC AGAAACGTTT GATATTGTAA TAGCCAATAC AGACGATGAT
CCTTACGTTG ATAACGCAAT ACCCGACCAA GCCGCGACAG AAGATAGCCC TTTTAGCTTT
CAGTTTTCTT CAATAGCGTT TGCAGATGAT GACCCTGGCG ATTCGCTAAC TTATACCGCG
CAACTTTCTG GTGGAGGTTC GCTTCCAGCT TGGCTTACAT TCACACCCGC AACACGAACC
TTTAGCGGTA CGCCGGCTAA TGGCGATGTT GGAACAATCA GCGTAGAAGT TGTTGCAACG
GATAACGACG GTGGCACAAC GGCGTCAGAT GTTTTTGATA TTGTTGTATC GAATACGGAT
GACGATCCTA CCTTAGATAA TGCCCTAGTC GATCAAGCGG CAACGGAAGA TACAGCCTTC
AGTTATCAGT TTGCAGCTAA CAGTTTTAGT GATCCTGATG TTGGCGATAC ATTAACTTAC
TCTGCGCAGC TTTCTGGCGG TGGTTCGCTT CCGACTTGGC TTACGTTTAC ACCGGCAACG
CGCACTTTTA GCGGCACACC GGCTAATGGT GATGTTGGAA CAATCAGCGT AGAAATAATT
GCCACTGACG ATGATGGTGG TACTACCGCG TCGGATGTTT TTGATATTAC TGTGTCGAAC
ACAGATGACG ATCCTACTTT AGATAATGCA CTGGTTGATC AAGCAGCTAC GGAAGATACA
GCCTTCAGTT ATCAGTTTGC AGCTAACAGT TTTAGTGATC CCGATGTTGG CGATACATTA
ACTTACTCTG CGCAGCTTTC TGGCGGTGGT TCGCTTCCGA CTTGGCTTAC ATTCACTCCT
GCAACACGAA CCTTTAGCGG CACACCAGCT AATGGTGACG TAGGAACTAT TTCTGTAGAA
GTGATTGCTA CGGATAACGA CGGTGGCACA ACAGCAAGCG ATGTGTTTGA TATTGTTGTT
GCTAATGCCG ACGACGATCC AACTTTAGAT AATGCTTTGG TCGATCAAGT GGCAACGGAA
GATACCGCGT TCAGTTATCA GTTTGCAGCA AACAGTTTTA GTGATTCTGA TGTTGGCGAT
ACATTAACTT ATACGGCACA GCTTTCTGGT GGAGGGGCAC TGCCAACTTG GCTGACGTTT
ACACCAGCAA CACGCACTTT TAGTGGCACA CCAACAAATG GTGATGTAGG AACTATTTCT
GTAGAAGTGA TTGCTACCGA TAACGATGGT GGCACAACCG CTAGTGATGT TTTTGATATT
GTTGTATCGA ACACCGATGA CGATCCAACC TTAGATAATG CTTTAGTCGA TCAAGCGGCA
ACTGAGGATG CAGCGTTCAG CTATCAGTTT GCAGCGAACA GTTTTAGTGA TCCCGATGTT
GGCGATACAT TAACTTACTC TGCGCAGCTT TCTGGCGGTG GTTCGCTGCC AGCGTGGCTT
ACCTTTACAC CTGCAACACG AACCTTTAGC GGTACACCGG CTAACGGAGA TGTAGGAACT
ATTTCTGTTG AAGTAATTGC AACCGATAAC GACGGTGGAA CAACTGCGTC GGATGTTTTT
GATATAACTG TGTCGAACAC CGATGACGAT CCCACACTAG ATAATGCATT AGTCGATCAA
GCAGCAACGG AAGATACAGC ATTCAGCTAT CAGTTTGCAG CTAACAGTTT TAGTGATTCT
GATGTTGGCG ATACATTAAC TTATACGGCA CAGCTTTCTG GTGGAGGGGC ACTGCCAACT
TGGCTGACGT TTACACCAGC AACACGCACT TTTAGTGGCA CACCAACAAA TGGTGATGTA
GGAACTATTT CTGTAGAAGT GATTGCTACC GATAACGATG GTGGCACAAC CGCTAGTGAT
GTTTTTGATA TTGTTGTATC GAACACCGAT GACGATCCAA CCTTAGATAA TGCTTTAGTC
GATCAAGCGG CAACTGAGGA TGCAGCGTTC AGCTATCAGT TTGCAGCGAA CAGTTTTAGT
GATCCTGATG TTGGTGATAC CTTAACTTAT ACGGCACAGC TTTCTGGTGG AGGGGCACTG
CCAACTTGGC TGACGTTTAC ACCAGCAACA CGCACTTTTA GTGGCACACC AACAAATGGT
GATGTAGGAA CTATTTCTGT AGAAGTGATT GCTACCGATA ACGATGGTGG CACAACCGCT
AGTGATGTTT TTGATATTGT TGTATCGAAC ACCGATGACG ATCCAACCTT AGATAATGCT
TTAGTCGATC AAGCGGCAAC TGAGGATGCA GCGTTCAGCT ATCAGTTTGC AGCGAACAGT
TTTAGTGATC CTGATGTTGG TGATACCTTA ACTTACACCG CACAATTATC TGGCGGAGGT
GCGCTACCAG CGTGGCTCAC CTTCACACCC GCAACACGCA CTTTTAGTGG TACGCCTGCG
AATGGCGATG TTGGAACAAT CAGCGTAGAA GTTATTGCGA CTGATAACGA CGGAGGAACA
ACGGCTTCGG ACGTATTTGA TATTGTTGTT GCAAACTCTG ACGATGATCC GACATTAGAT
AACGCACTGG TTGATCAAGC GGCCACTGAG GATACAGTGT TCAGTTATCA GTTTGCAGCT
AACAGTTTTA GTGATCCTGA TGTTGGCGAT ACGTTAACTT ACTCTGCGCA GCTTTCTGGA
GGTGGTTCGC TTCCGGCTTG GCTTACGTTT ACTCCCGCAA CGCGAACCTT TAGTGGTACA
CCTGCGAATG GTGATGTTGG AACTATTTCT GTTGAAGTTA TCGCAACCGA TAATGATGGT
GGTACTACCG CGTCGGATGT TTTTGATATT ACTGTGTCGA ACACCGATGA TGATCCAACC
TTAGATAATG CACTTGTCGA TCAAGCGGCA ACGGAAGATA CTGCGTTCAG TTATCAGTTT
GCCGCTAATA GTTTTAGTGA TCCAGATGTG GGTGATACCT TAACTTACAC TGCGCAACTT
TCTGGTGGCG GTTCATTACC CGCGTGGTTA ACGTTCACTC CAGCAACTCG AACCTTTAGC
GGCACACCAA CAAATGGTGA TGTAGGAACT GTTTCTGTTG AAGTAATTGC AACCGATAAC
GACGGTGGCA CAACGGCAAG CGATGTGTTT GATATTGTTG TCGCTAACTC CGATGATGAT
CCGACGTTAG ATAACGCATT AGTCGATCAA GCGGCAACGG AAGACACGGC GTTCAGCTAT
CAGTTTGCCG CTAACAGTTT TAGTGATCCT GACGTCGGCG ATACATTAAC TTACTCTGCG
CAGCTTTCTG GCGGTGGTTC GCTTCCGACT TGGCTTACAT TCACCCCTGC AACACGAACC
TTTAGCGGCA CACCAACAAA TGGTGACGTA GGAACAATTT CTGTTGAAGT GATTGCTACC
GATAACGATG GTGGCACAAC GGCTAGTGAT ATATTCGATA TTGTTGTCGC TAATGCCGAC
GACGATCCAA CCTTAGATAA TGCATTAGTC GATCAAGCAG CAGCGGAAGA CACAGCGTTC
AGCTATCAGT TTGCAGCTAA CAGTTTTAGT GATCCTGATG TTGGCGATAC GTTAACGTAT
ACTGCGCAAC TTTCTGGTGG AGGCGCGCTA CCAACGTGGC TAACGTTTAC ACCTGCAACG
CGAACCTTTA GTGGTACACC TGCGAATGGT GATGTTGGAA CTATTTCTGT TGAAGTTATC
GCAACCGATA ATGATGGTGG TACTACCGCG TCGGATGTTT TTGATATTAC TGTGTCGAAC
ACCGATGATG ATCCAACCTT AGATAATGCA CTTGTCGATC AAGCGGCAAC GGAAGATACT
GCGTTCAGTT ATCAGTTTGC CGCTAATAGT TTTAGTGATC CTGATGTTGG CGATACGTTA
ACTTACTCTG CGCAGCTTTC TGGCGGTGGT TCGCTTCCGG CTTGGCTTAC GTTTACTCCC
GCAACGCGAA CCTTTAGTGG TACACCTGCG AATGGTGATG TTGGAACTAT TTCTGTTGAA
GTTATCGCAA CCGATAATGA TGGTGGTACT ACCGCGTCGG ATGTTTTTGA TATTACTGTG
TCGAACACCG ATGATGATCC AACCTTAGAT AATGCACTTG TCGATCAAGC GGCAACGGAA
GATACTGCGT TCAGTTATCA GTTTGCCGCT AATAGTTTTA GTGATCCAGA TGTGGGTGAT
ACCTTAACTT ACACTGCGCA ACTTTCTGGT GGCGGTTCAT TACCCGCGTG GTTAACGTTC
ACTCCAGCAA CTCGAACCTT TAGCGGCACA CCAACAAATG GTGATGTAGG AACTGTTTCT
GTTGAAGTAA TTGCAACCGA TAACGACGGT GGCACAACGG CAAGCGATGT GTTTAATATT
GTTGTCGCTA ACTCCGATGA TGATCCGACG TTAGATAACT CATTAGTCGA TCAAGCGGCA
ACGGAAGACA CGGCGTTCAG CTATCAGTTT GCCGCTAACA GTTTTAGTGA TCCTGACGTC
GGCGATACAT TAACTTACTC TGCGCAGCTT TCTGGCGGTG GTTCGTTACC CGCGTGGTTA
ACGTTCACTC CTGCAACGCG CACCTTTAAC GGTACACCGG CTAACGGCGA TGTCGGAACT
ATTTCTGTAG AAGTTATTGC TACCGATAAC GACGGTGGTA CAACTGCTAG TGATGTGTTC
GATATTGTTG TCGCTAATGC CGATGACGCC CCTACGCTGG ATAACGCGTT AGTTGATCAA
GCGGCAACGG AAGATACAGC ATTTAGCTAT CAGTTTGCAG CAAACAGTTT TAGTGATCCA
GATGCTGGTG ATACGCTTAC TTACACAGCG CAACTTTCTG GCGGCGGCGC GCTACCAACG
TGGCTAACGT TTACACCTGC AACGCGTACT TTTAGTGGCA CACCAACAAA TGGTGATGTT
GGAACAATTA GCGTAGAAGT AATTGCTACT GACGATGATG GTGGTACAAC CGCGTCGGAT
GTTTTTGATA TTACTGTGTC GAACACTGAT GACGATCCCA CGCTAGATAA TGCACTTGTC
GATCAGGCGG CCACTGAGGA TGCAGCGTTC AGCTATCAGT TTGCAGCAAA CAGTTTTAGT
GATCCAGACG CTGGTGATAC GCTTACTTAC ACAGCGCAAC TTTCTGGCGG TGGCGCGCTA
CCAACGTGGC TAACGTTTAC ACCTGCAACG CGTACTTTTA GTGGCACGCC TGTTAATGGT
GATGTTGGAA CAATCAGCGT AGAAGTTGTT GCAACGGATA ACGACGGTGG CACAACTGCT
AGTGATGTGT TCGATATTGT AGTTGCTAAT GCCGACGACG ATCCAACTTT AGATAATGCT
TTAGTCGATC AAGCGGCAAC GGAAGATACC GCGTTCAGTT ATCAGTTTGC AGCAAACAGT
TTTAGTGATC CTGATGTTGG TGATACGTTA ACTTATACGG CACAGCTTTC TGGTGGTGGA
TCATTACCCG CGTGGTTAAC GTTCACTCCT GCAACGCGCA CCTTTAGCGG TACACCAGCT
AATAGTGATG TAGGAACTAT TTCTGTTGAA GTGATTGCTA CTGATAACGA CGGTGGTACA
ACCGCTAGCG ATGTGTTCGA TATTGTTGTT GCTAATGCCG ACGACGATCC AACTTTAGAT
AATGCTTTGG TCGATCAAGT GGCAACGGAA GATACCGCGT TCAGTTATCA GTTTGCAGCA
AACAGTTTTA GTGATCCTGA TGTTAGCGAT ACATTAACTT ACTCTGCGCA GCTTTCTGGC
GGTGGTTCGC TTCCGACTTG GCTTACCTTT ACACCTGCAA CACGAACCTT TAGCGGTACA
CCTGCGAATG GTGATGTTGG GACAATCAGC GTAGAAGTAA TTGCTACTGA CGATGATGGT
GGTACAACCG CGTCGGATGT TTTTGATATT ACTGTGTCGA ACACAGATGA CGATCCTACT
TTAGATAATG CACTGGTTGA TCAAGCAGCT ACGGAAGATA CAGCGTTCAG CTATCAGTTT
GCAGCTAACA GTTTTAGTGA TCCCGATGTT GGCGATACAT TAACTTATAC TGCGCAGCTT
TCTGGTGGTG GATCATTACC CGCGTGGTTA ACGTTCACTC CAGCAACGCG AACCTTTAGC
GGCACACCAA CAAATGGTGA TGTAGGAACT ATTTCTGTCG AAGTGATTGC TACCGATAAC
GATGGGGGCA CAACCGCTAG TGATGTTTTT TATATTGTTG TATCGAACAC CGATGACGAT
CCAACCTTAG ATAATGCTTT AGTCGATCAA GCGGCAACTG AGGATGCAGC GTTCAGCTAT
CAGTTTGCAG CTAACAGTTT TAGTGATCCT GATGTTGGCG ATACGTTAAC TTATACGGCA
CAGCTTTCTG GTGGTGGATC ATTACCCGCG TGGTTAACGT TCACTCCTGC AACACGAACC
TTTAGCGGCA CACCGGCTAA CGGCGATGTA GGAACTATTT CTGTAGAAGT TATCGCTACG
GATAACGACG GTGGCACAAC AGCTAGTGAT GCATTCGATA TTGTTGTTGC TAACACCGAT
GACGATCCTA CTTTAGATAA TGCACTGGTC GATCAAGCGG CAACTGAGGA CACAGCGTTC
AACTATCAGT TTGCAGCTAA CAGTTTTAGT GATCCTGATG TGAGTGATAC ATTAACTTAC
ACCGCGCAAC TTTCTGGTGG AGGTTCGCTT CCGGCTTGGC TAACGTTCAC ACCCGCAACA
CGAACCTTTA GCGGTACACC TGCGAATGGA GATGTAGGAA CAATCAGCGT AGAGGTTATT
GCCACGGATA ATGACGGCGG CATAACGGCT AGTGATTCAT TTAATATTAA CGTATTAAAC
GTCAATGATA GCCCCGTAGT AAATATTGCC ATTCCTGATC AGCAAGTGGT TGCTGGTCGT
GATTTTTCGA TGCAATTACC GCCCAGTACT TTTATCGATG TAGATGTTGG TGATTCGCTT
GTTTATACGG CCAATTTAAT TAGTGGAGCA CCATTACCTG CTTGGTTGGT TTTTGATGAG
GTCGCGCAAT CGTTTAGTGG TCGGCCGGTT ACAAGTGATA TTGGTAGCTA TACTGTAGAA
GTAATTGCAA ATGATGGTAA TGGCGGTATT CCGGCGCGGG AAAGTTTTGT GTTGGTCGTG
CATTCTCCAA CAGCTGTGCA GCTGGCAAAC ATTGCTACAC AAGAAGATGC AGCTAATGAT
GAGATCGACC TAGCGACTCA ATTCGCATCC ATCACGCCAA GCGGTGAATC GATAAGCTTC
ACTGTTACCC AAAACTCAAA TACCTCTTTA TTTTCAGAGG TGAGAATTGA CAATACAACC
GGAACGTTAA CACTCACTTA TGCCGCGGAT CAATACGGAG AAAGTGATAT AACAATTTCT
GCCGTTACCA ATACTGGTGT AACGCTAGAA AGTACGTTTA ATGTAACAGT TAGTTCTGTG
AACGATATTC CGCAAGTAAC TCACCAAACG ATTACCGGTG GGGTTATCGA GCCTGATTCA
TTAACTCGTA CTGTTAGTGT AATGGGGGGC TTTTTTGATA TAGAAAACGG TGAAGACCTT
GTGTACACGG TTACCGAAAA CTCTAACCCC GCTATAGCGC AGGTGGTTGC CGTAGATAGC
GAGCGTGGCA CATTCACAAT TAACCGGGCA GGGGCCGAGG GTGGAACGGC TAATATTACA
GTGCGAGCTA CCGATAATGA TGGTGGCTGG GTTGAGCGTA CGGTACAAAT TACTATTCAA
GAAAAGCTAA CCTCGCCGTT AGAGCCGGAA CCACAGCCCG AGCCTCAACC AGAGCCTGAG
CCAGAAACAC CAGAAGAGGT GCAAGAAACA CCGCCCGAGG GTGAACCGAA CGAAACACCT
GTAGCCGAAG CAGAAAAAGA ACCTTCACTT ATTCAGCCAG AGCTTTCGCC CGATTTAGGC
ATTGTTGTAG AGGCCGTAGC TCCGCCGCTA CCGCCAGCAA GTGTGTTTGA GTTTTACGAA
GAGGTAGAGC GAGAGACAGA GCAAAACGAA AAAAATAATG CTAAGCGAGA ATACGAGCGC
CAGCTGAGAG AAGAAGCTGC GCAGGCTTAC CAGTTAATTG GCATATCTGC AGGCCCAGGA
GGCTATTTAA ATGCGCAGGA TATTACCGAT TTTAATATGG CTATAGATGA CGCAAGAAAA
CATATGGAAG AAGTATACGC AGAGCAAAAG CAACGTGAAG GCATGCTGGC AACGGTAACG
CTCTCACTTA CCACAGGGTT AATTATATGG GCGTTGCGCG CCAGTAGCTT GTTGGTTGCT
TTATTTTCGA TGATGCCTCT GTGGCGCGGC GTAGACCCAT TACCCGTTTT GGCGGATGTA
GAAAAACGTA AAAAAGCTTT GGCAGGCATA GAGGACGACA AAGAAGAAGA GGATAAAAAA
GCGGGTGAGG TGGGGTATTT GTTCGATCGC GCTGTAGATA AGCCACAACA GAAAGGAAGC
AAAGGTAAAA AAGTATGA
 
Protein sequence
MSKRVRKKNR RPLIEPLEPR LLFSASADIV LLDDADSDVN FLQQAAAQTD LSAVFNTSPS 
KTTPFDTSPF ETTPYETEDG LAADVHNDDR VEIKELVFVD TGIDNYESLL SGLLEGRNAD
DIKVIYIDAE ENGVELVTQT LLGFTGVQSV HMLTHGKAGE IQLGNSFINS HSLANYADAI
TQWQQALGED ADILIYGCDI AATDAGRELL KQLAELTQAD IAASDDLTGH ASLGGDWDLE
FNAGEVETEI IVTKAAQKQW QGLLVDVNYI NTGGGDFTET VSSSVNVGQD FSYNSGSGTY
TVNEISLNLA RYATAESQTI TVQLRDAWNG TVLASDTVAS SEISADGFEW HSFGFGDVSL
TDGATYFVRV SSDADDSEIL IRRYNSSIIG GHAFQSNGTP NGDGNDLAFK IAYEDGSNSA
PYVDNAVPDQ AATEDVAFNY TLPANTFADP DANDTIRIRV ELSGGGSLGW LQYNESTRQF
SGTPRTADVG TMSIDVIATD NHGASITETF DIVINGVPTA VDDSANVWYT ESVTGNVIDG
SGGVSADTIA DAPGTIVSIT YDSVLYNSFD GSNNINIAAD EGTFVINQDG TFTYTPTATP
VSAGANTTTD WETAYNLYGY MSNQSYLDGS SNLDLSAANE TVLQSSTGLG IESGPQDHIE
NIGPNPEAVV VDLQANYREF EAITSYLGAG ETGTWEAYDS SFVLVDTGTL VGTGSGDSTD
LSTNYEVVSG DFRYLVFTAT TGTDSYRVYE LSGFQATPTD ETFDYVVEDS NGDQDTGLLT
VAFVNDNNRP TVANAIPDQV APDNEPFSFT FAANTFNDAD GETLTYSAEL KNGGALPSWL
SFDANTRTFS GTPATSDVGT IEIRVTADDG NWGTPAQDIF EIDVNDTNDD PTLDNALVDQ
SADQDAAFSY QFAANTFGDL DAGDTLTYTA ELSGGGGLPA WLTFTPGTRT FSGTPASGDV
GTITIEVTAD DSNGGTPATD TFDIVVAPPN QAPVNTVIAD FSSDEDVPIV FNQGNGIIGS
YFNNTTLTGP AVDTNVDYTI NYYWTGAPDN GVTGINADNF SVRWEGQLLV TETGNHQFQT
MSDDGIRVYI DGDLVIDNWA SHPSGTIDTS SNIALVAGRT YDVVVDFYEN NVEAEAKLFW
QTPSSGGFNI IAAGDEDNFA AGLYQGSEFS VFDVDAGADE LEVTISVNSG TLYLAGIDGL
TFTTGDGTSD TTMVFTGTAD DINSALAFLS FTPTANFSGD VSLTFTTNDQ GNNGLGGPLS
DTDVVTITVN STDNDDPQLD NALVDQNATE DSPFSYQFAA NSFSDPDVGD TITYSAQISG
GGALPSWLTF TAGTRTFSGT PTNDDVGTIS IEVTADDGNG GTTATDVFDI TVANTNDPPT
VANQIPNQAG AEDFAFDYTF PANTFNDIDG DSLTYTARLS TGDPLPPWLN FDGANRRFYG
TPGEADSITW TVEVTADDGN GETVTDTFDI AIANTNDDPY VANAIPDLNT GDNEPFSYQF
AANTFGDSDL DTLTYTAELF TGGALPAWLN FDANARTFSG TPSTSDIGTT HVRLIADDGN
GGTPAEDSFN ITVTDTNDDP YIANAIADQA ATEDSPFSFQ FAANTFGDYD GDTLTYTAQL
SGGALWPGWL TFTPGTRTFS GTPDNGDVGT ITIELTADDG NGGTPATETF DIVIANTDDD
PYVDNAIPDQ AATEDSPFSF QFSSIAFADD DPGDSLTYTA QLSGGGSLPA WLTFTPATRT
FSGTPANGDV GTISVEVVAT DNDGGTTASD VFDIVVSNTD DDPTLDNALV DQAATEDTAF
SYQFAANSFS DPDVGDTLTY SAQLSGGGSL PTWLTFTPAT RTFSGTPANG DVGTISVEII
ATDDDGGTTA SDVFDITVSN TDDDPTLDNA LVDQAATEDT AFSYQFAANS FSDPDVGDTL
TYSAQLSGGG SLPTWLTFTP ATRTFSGTPA NGDVGTISVE VIATDNDGGT TASDVFDIVV
ANADDDPTLD NALVDQVATE DTAFSYQFAA NSFSDSDVGD TLTYTAQLSG GGALPTWLTF
TPATRTFSGT PTNGDVGTIS VEVIATDNDG GTTASDVFDI VVSNTDDDPT LDNALVDQAA
TEDAAFSYQF AANSFSDPDV GDTLTYSAQL SGGGSLPAWL TFTPATRTFS GTPANGDVGT
ISVEVIATDN DGGTTASDVF DITVSNTDDD PTLDNALVDQ AATEDTAFSY QFAANSFSDS
DVGDTLTYTA QLSGGGALPT WLTFTPATRT FSGTPTNGDV GTISVEVIAT DNDGGTTASD
VFDIVVSNTD DDPTLDNALV DQAATEDAAF SYQFAANSFS DPDVGDTLTY TAQLSGGGAL
PTWLTFTPAT RTFSGTPTNG DVGTISVEVI ATDNDGGTTA SDVFDIVVSN TDDDPTLDNA
LVDQAATEDA AFSYQFAANS FSDPDVGDTL TYTAQLSGGG ALPAWLTFTP ATRTFSGTPA
NGDVGTISVE VIATDNDGGT TASDVFDIVV ANSDDDPTLD NALVDQAATE DTVFSYQFAA
NSFSDPDVGD TLTYSAQLSG GGSLPAWLTF TPATRTFSGT PANGDVGTIS VEVIATDNDG
GTTASDVFDI TVSNTDDDPT LDNALVDQAA TEDTAFSYQF AANSFSDPDV GDTLTYTAQL
SGGGSLPAWL TFTPATRTFS GTPTNGDVGT VSVEVIATDN DGGTTASDVF DIVVANSDDD
PTLDNALVDQ AATEDTAFSY QFAANSFSDP DVGDTLTYSA QLSGGGSLPT WLTFTPATRT
FSGTPTNGDV GTISVEVIAT DNDGGTTASD IFDIVVANAD DDPTLDNALV DQAAAEDTAF
SYQFAANSFS DPDVGDTLTY TAQLSGGGAL PTWLTFTPAT RTFSGTPANG DVGTISVEVI
ATDNDGGTTA SDVFDITVSN TDDDPTLDNA LVDQAATEDT AFSYQFAANS FSDPDVGDTL
TYSAQLSGGG SLPAWLTFTP ATRTFSGTPA NGDVGTISVE VIATDNDGGT TASDVFDITV
SNTDDDPTLD NALVDQAATE DTAFSYQFAA NSFSDPDVGD TLTYTAQLSG GGSLPAWLTF
TPATRTFSGT PTNGDVGTVS VEVIATDNDG GTTASDVFNI VVANSDDDPT LDNSLVDQAA
TEDTAFSYQF AANSFSDPDV GDTLTYSAQL SGGGSLPAWL TFTPATRTFN GTPANGDVGT
ISVEVIATDN DGGTTASDVF DIVVANADDA PTLDNALVDQ AATEDTAFSY QFAANSFSDP
DAGDTLTYTA QLSGGGALPT WLTFTPATRT FSGTPTNGDV GTISVEVIAT DDDGGTTASD
VFDITVSNTD DDPTLDNALV DQAATEDAAF SYQFAANSFS DPDAGDTLTY TAQLSGGGAL
PTWLTFTPAT RTFSGTPVNG DVGTISVEVV ATDNDGGTTA SDVFDIVVAN ADDDPTLDNA
LVDQAATEDT AFSYQFAANS FSDPDVGDTL TYTAQLSGGG SLPAWLTFTP ATRTFSGTPA
NSDVGTISVE VIATDNDGGT TASDVFDIVV ANADDDPTLD NALVDQVATE DTAFSYQFAA
NSFSDPDVSD TLTYSAQLSG GGSLPTWLTF TPATRTFSGT PANGDVGTIS VEVIATDDDG
GTTASDVFDI TVSNTDDDPT LDNALVDQAA TEDTAFSYQF AANSFSDPDV GDTLTYTAQL
SGGGSLPAWL TFTPATRTFS GTPTNGDVGT ISVEVIATDN DGGTTASDVF YIVVSNTDDD
PTLDNALVDQ AATEDAAFSY QFAANSFSDP DVGDTLTYTA QLSGGGSLPA WLTFTPATRT
FSGTPANGDV GTISVEVIAT DNDGGTTASD AFDIVVANTD DDPTLDNALV DQAATEDTAF
NYQFAANSFS DPDVSDTLTY TAQLSGGGSL PAWLTFTPAT RTFSGTPANG DVGTISVEVI
ATDNDGGITA SDSFNINVLN VNDSPVVNIA IPDQQVVAGR DFSMQLPPST FIDVDVGDSL
VYTANLISGA PLPAWLVFDE VAQSFSGRPV TSDIGSYTVE VIANDGNGGI PARESFVLVV
HSPTAVQLAN IATQEDAAND EIDLATQFAS ITPSGESISF TVTQNSNTSL FSEVRIDNTT
GTLTLTYAAD QYGESDITIS AVTNTGVTLE STFNVTVSSV NDIPQVTHQT ITGGVIEPDS
LTRTVSVMGG FFDIENGEDL VYTVTENSNP AIAQVVAVDS ERGTFTINRA GAEGGTANIT
VRATDNDGGW VERTVQITIQ EKLTSPLEPE PQPEPQPEPE PETPEEVQET PPEGEPNETP
VAEAEKEPSL IQPELSPDLG IVVEAVAPPL PPASVFEFYE EVERETEQNE KNNAKREYER
QLREEAAQAY QLIGISAGPG GYLNAQDITD FNMAIDDARK HMEEVYAEQK QREGMLATVT
LSLTTGLIIW ALRASSLLVA LFSMMPLWRG VDPLPVLADV EKRKKALAGI EDDKEEEDKK
AGEVGYLFDR AVDKPQQKGS KGKKV