Gene OSTLU_17941 details

Gene Information

Locus tag: OSTLU_17941
Symbol:
ID: 5004939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 496735
End bp: 505789
Gene Length: 9055 bp
Protein Length: 3018 aa
Translation table:
GC content: 51%
IMG OID: 640420360
Product: predicted protein
Protein accession: XP_001421016
Protein GI: 145353429
COG category: [R] General function prediction only
COG ID: [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.78771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTTCA TGCGAGAGAT TGTAGAAAAT TGCGGCGAAT TGCCGTCGTT TGCGCACGGC 
GAGGCTACGG CGTTGTTTTT GGAGTTCTTT GAAACGTGCG AGAGCGCGTT GCGGTTATTA
GCGGCGGCGA CGAGGCGATC CGTGGTTGGG CGAACGCCGC GTAACGCGTA CTTTTCGTCG
GACGTGTTTC AGCGGAGAGT GCGATTCATG CTTTCGAAGA TTCACGCGCA CATGCCGTTG
AGCGCGGCGC ACTTGAACGA CGCCGACGAG AACGAGTCGA AAGCCTGTGA AGTCGTTGCG
CACGAGTTAT TCGTTGCCGA AGACCGCGAA GAGGGAAGTG GTCCGTTAGA TTGCACGCGA
TACGAGTTTA CAGTCGACAC GACGTCGAGT GGTAGCGATG AAATAGTCTT CAAGGCGTCT
GGCCGACGGG GGCTCGACCC GTCCGAGATG CACCGGCCGT ATAATCGCGC CTCGAGCCTT
GAGTGCTTAA TCGATGACTT TTCACGACCT CAAATGTTCG ATCCGGGCAA AGAAATGGAC
TCTCTCAAAG ACATCTCCGC GGCGTATTTT GGTGTGTTGG CACAGTTGTG GAGCAAATCG
CTGCAAAGGA CGCGGGCGGG TCGCGAGAAG GCGGCGCGTC TTTCGATCCT CGCGCAGCAA
ATCAACGTCC ATCTAGGGAA AGGGGCGGAA CCGAGCGATT TCGAGCTTTA CGTTGAAGAC
GCGCAGACGA CCGCAATCAA GCTGTTGGAC TTGATGCAGG TAGATTTAGG AGTCACGCCG
TCGTTGCTCC TACGACGAGA CGCGACAACT TTGCTGAGCG CGTACACAAA CGAACGCGCG
AGCGGCTTTC AAGTTTGTCA AATCATGAGA CGGCCAGCCG GTCGAGGAGC GCTTAGTCAA
TGCGTATCAA CGATGACGGC GGCGTTGCTT GCGCCGCTGT CATCAAGTGA TGCGGAAATC
GATTTCGACG TCGTTCGCGA CGCGAGGTGT TTGATGCATC TTGTCGGGAT TTTGAGCACA
ACAACGCCTG GGTGCACGGC GATAAGAGAG GCTGAGCTCC TTCCGCTTTT GATGTCAATT
TTCAAGCTGG AGCGCCTCGA AGCGATCCCG GTGGTCGTGG ACGTGGTGCA TATCGTGGAA
TCGTATCTGG AGTTTCAACC CAACGCGGTC GTAGCGTTTA GAGAGTTGAA CGGAACGGAT
TTGCTTTTGA AGCGCATGCG ATACGAGTCC GTTACGGCGA TCGAGGAGCT GCGGGCGTCG
GGACATCTCG ATGACGTGGC GGATAGAAAA CGAAAAATAG ATAGTCTTTC TAGTTTAGAG
GACGCCGAAA CATCGCTCGC AGCGATGAAA AAGGCACCAA AGTACGTGAA CTTTTATCGA
AGAGTTTTGA TCAAGGTTCT GATGCGAACA TTGGCGTACA CGAGTTTTGC GTACGGAGGA
GCACGCATGC GAGTTCCCGG ACTTTCGGAT GGGACGTTGA CTGATATCTT GGGAAGTGTA
TTGAAATACC CTCTCAAGTT TGGTCCTGGG GTGTCGTCGC TCACCGCGAA TTTAGTGTGC
GACGTCATTC ATAACGAGCC GACGTGTTAC GCCACGCTAG ACGAAGCCGG TATTGTGGAC
GCATTCCTGA AGTTTATCAC TGAAACCATG TGGCCACTCG GCCGTAGTAA GGGTATGGCT
AAAGTACTTT GCGCGATACC GACGACACTT AACGCGATAT GTTTGAACGA GAAAGGGCAG
GAAAAAGTGT TGAAGTCTTC GGCGTTGGTC TGCTTTCGAA AAATGTTTCA AGATAGTTCT
TTCCCGTTGA ACGCCGATAC GTCGAGAATC GTCGGAACTG GGATCAACGA GACGCTGCGC
CACATTCCAG CACTAAAAGA CGTCAGTGTC GAAGTCTTGA ACGACATATT GTCCGATCTC
ACGATACAAC TTCGAAAGAT ATCAGATTCG GAGACGAATC TTGAGCACGC GTTCGTCAAC
AAGGCAGTCG TCGAGCAAAT GCCAATATCG AAAACGCTTC AAAATGTGGC GAGGTTCATG
GATGGTATTT TGCAGACTTC GCATATGTGC GCGCCGCTGG TTCAAGCTGG GGCTCTGGAT
TCCATGCTCA ATATGTTGAC ATGCCCTCTT CCGGTTGAAT TTTCGACATG TGCAAGCTAC
AACTCCATCA GTGTCGCGTG CAAATCGCTC ATTAACCCGC CGGATCCCAG CAACTCGTAT
GCCGGCGCGA ACACGCCGAA TCTGTCAGAC GACGTGATTC TCGCGTGCAC TACCGTCGTT
GAAAAGTATA TGGCCTCTGC TGTTGATATC GGTGTGCAGA TTGAAGAGTT TTACGTCCAC
GAGCTCGACA AGACAGGTGA TGACATCGAT AAATTAGCGA CGTACGATAT TTACAGCAAG
ACATTGAGCG AAAAGCACAA AGGTGACAAA GCAGCGTTGG ACCGCGAAGT TCGGTTCTGT
CGACAGACAA AATCACTGTG CCAGCGCATG GCTGCGCACG AAAGGTTGTG TGAACTTCTC
GCGACTCTCG TGCAGCAAGT TCCGGTGATG TCTGCTGCTA TCATGAATAA GCGACGTGAC
GATCCTCACG GCCCGATGAT GGCCGAAAGC TTGTTCAACG CAGCGTTGGC GGCGTTCAAT
GAAGCGGTAA AGATTGAAGC GAAGCTTCGT TCACTCGCAG TGGAAATTCA AGCGGATGAA
ATGCCGTATT CACCGGTTTG GTGGCTGTAC ACTCGAGCGG ATCTTTTTGT ACGAGTTACA
AGTGGTTTGT TTGCCGCGGT GGCAAAACTT TCGACAACGA TGCGGCGACG TAAAGACTTG
GGTCCAGAAG CTCAAAATTT CATCAAGCTT AGCGGAGAAG CAATAATTGC AGTTGCGCGC
GAGATGACCA AAGCGTTTAC GTCTCGTGTC ACCGCACAGG CTGAGCTACT TCCGCCGAGT
CACGCATCGA ATGCTAAGAT GGATTTTGAA CAGCGCTGCT CGCACAATTT CCTTGCGTTG
ATTTACGAAT CATTCTTCGA TGACAAGAGA AATTCACCGA ACGGGGTCAT GATCAACTAC
GCCGCTCGAT GTGGGTTATT CGAGCAACTT TACACCCATT TTCAAGGCAC GGTGGCTCTG
ACACAGAGTA TATTAGAAAC GTGCACCGGG GAAAAGGCTG AGCAAGGACA TGTCGCAGCG
GAATGTTACA GGACGTTGAG CTGCTTTGTG CGCATTTTCG TCTCTATATC CGACGTCAAA
TTGATGGCAT CGAACAACTC GAATAAGCTC GCGCTTGGTA CACGTCTTCC CGAACGCGTA
GAAGATGATC CTATCTACGA CAAGATGACA AAGTACACAC ATATTTCTGA CGCGTTCACT
AACTCAAGAT CTGCCATGGA CTGGATTCAT TGCTCTCTAA ATGAAGCGTT GAAGTGTATT
TGGCGGCCAG AAACGGACAT TTTTGCGTCC GTAGCGGCAG GTCATGTGGA TTCCGACGCA
CTAGTGCAGT GTCTACTCAA CATCATTAGT GGAACCGAAC AAGCCGTCAT GTCCGAGCAA
CGTCGTGCCG CGTCGGTACC GCGTGCTCCA CCGCGCCCGC CGCCACGGCA GTTTGTTCCA
TCAGAGACGA TGATAGATTC CATCGTTGAG ATGGGATTCA GCAGAGGTCA CGCGCATCAC
GCACTCGCAG CTGTGAATGG ACAGAGTGTG GAGAGAGCGC TCGAGTACAT GCTAACTCGA
CCAATGGAAG ATGTGCCCGA CGAAGCACCG ACGGCGCCTT CAGAACCTGC ACCGTCAGAG
TCTACTGCGC TAGAGGAGAC GGCGGACGCT ATGGAAACAG CTCCTGTTGC TTTCAATCAC
ACGCACGCAG TCCAAGGTCG ACTGCCTGCC ATAAACGAGC TAGTTGCCGC GCTGTACGCC
AATTTCCACA TCAGTGCAGA GAAAGGTGCG ACGCTTTTGC TCAAGGTTAT TGAGAAACAT
GAGCTCATGA GCGGCCTCAG TCGTGATGAG TGCATATACG CTCTCACGGA CGAAGTCGTG
CGCACGCGTA CGCCGAAGAT CGGTAGAGAT GAGGTGTATG CGATCAAGCA CTATGCGTTC
GTTGAGGCGC TCGCTAGGCA AGGTGATGAA GTGTTAAGAA GAGCGCTATG GATAAAGGGT
TCGGTAATTG ATTCACAAGT CGAAACATTT ATCGCCGAGG TCGACAAAAT CTGCGCTGCA
CAAAAGACGG CCTCATTGGA GACGTTTTTA AAAGATGTGC CGTTGTCGTT TGTGCCTTTG
GCAAACTGCA TCAATGCTGC GGCATGCCTG CAAAAGCATC GTGACTCCGA CGCCAGTGCC
AGAAATTACT CCAAGTTCGG TTTTCTGAAC GACGGTCATA AGATTGCGTT GGCCGATGCG
TGCGTCGAGT GCTTGATAGC AATCGTTAAA AATTCAGGCG AACTCGCGAG TGATTATGAA
TCGCACTCCG TCCAAGCCGT GATTCTTGAT CTGCTAGCAA ATCTATCACG TGACAACGTG
ATTGCGGATC GCATCGCTTC GAATTCATTC GCCGACAGCA AGGATGGAGC CGCAATATCT
AAGAGAAGTT TTGCACACAC GCTCTTTACA TGCTTTTGGA ACAACACCAA CGACAGTGTT
TCGGTCATTT TGAGGCACAT GGTGAACGAT CCCCGGACAT TACAATTCGC CATGGAGCTT
GAAATTGTCA AGAGCATCAC CAAGCCTCCG ATGGGTAGAG ACTCCAAGGT TGCACTCAAG
ACATTCTGCG TCGCCATGAA AAATATAATT GAGCGAGATT TTGATTGCTT TGAAGCGGCA
TTCGCAAACG TCGCCGAAAT CAGAACTATG GCGGATACAC CGCTGAGTAC ACCTCGGGAG
TACGTCATCC CGACGGCTGA AGTGAAAGCA ATGCAGTCGC TTCCAACCGT TCGATTGACA
AAGAACTTAA GAATGGTCGT GGAAGTACTT GTTGCAGTAG CGACAGAGCC GATTCATCAA
GCCCCATTGA CGAAGCTCGA GCGACAAGCC GCAGCTTTCC GTCTGCTCAC CGAATTGGTT
GAGGTATATC CATCAAGTGT GAAGGCTTTG ATGGATATGG ACGACCAGAC TGACATTTTC
CGCAACATTC TTCGTTATCA ACTGCCGTCG AGCAGAAGAA GTTCCTCAAG TGAACAGACG
GGAGTTTGGT GCGGGTGCGA AAACGCGGCG TTCTTCCTCG CAACTTTATG CGTCAACAGC
CAAAAAGCGC GAACGAAAAT AATTGAGAAA ATGCTTGTGC TTCTGAAGGA GCCGGACTTA
ACGCAAGAAA CAGCTTCAAC TTGCGCCGAT AAAGGTGATT CTCTAGTTCC AAAGCACGCA
TTTGTGGATT TATTGCACTC TCTTATGGAT TACGGTTTGG CGCGTGGCTA TCAGAGCGGT
GAACGCATTC GCAATCGGGT AGTCTTTAAT CTCCAACAAG GAGTTCTTAA GACGATGTAC
AGTCTAAAAG TCCCAGAGCG GATCATTGAA CTCATTGAAG AGACAAACGT CGTTGGCGCT
GCGCGCGAAA AACGTGGCAA TCCAGGATTT CTTCGAGTAA CGCTCGCAGT ACTCGAAGCT
CTAGTTGTTC GATCTACTAA GAGCAAACGC GAAAGCAGGC TTACTCGCAT CGGTAACATG
GTTCGTGCGG TAAACGCTGC ACGGGTACCC AGCGGTGAGG AAGTTGAAGA GAGACGCGCA
CTAACCGAGC AGATGCTCCA GAATTTGCTT GAACACGCTG CAGGAGGAAA CATTCAAGTC
GCTCGACTCG AAGATGACCT CGACGGAATG CCAGAAATCG AAGGTGTTCA TCCTGTATTC
GGAGGAGACG ATGGCGATGC CGACGGCATC GATGCAAATG ATGGTCACGA GAACTCGGAC
GATGATTCTG AGATGGAGGT TGACGACGAC CACGACCACG ATTCCGATGA AGCCAACGAA
GAAGGAATGC ACGAATCCTC AGACGAAGAA CAATTGAGCG AATCGGGCGA ATCAGGTGAA
TCAAGTGGAA CGAGCGACTC AAGCGAAGAG GACTCTGAAG AGGAAGAAGA AGAAGATGAT
GAAGAAGGCG AAGAAGCCAT GGCAGATGAC TTTTACGGAA GCGACATGGA TCACATCGAT
CCTGATCCTG GAAACGATGT CTTGGAAATG TCTGACGATG CTTTGTATGA CGAAGATGGT
GAGGAGTTCG ATTATGATGA TGGCGGTGGT TCCGATGATG ACGATGACGA CGATGAAGAG
GAAGGCGATT TTGAAGAACC TAGCTCGCCA CAGTCATCAT ATACGTCGCC GTCTGGGGTA
CGCGTGCTGC CTCGCGCTAC ACATTCAGGC AGTCATGTTG AACGCACATC CAATCTTGGT
TCGAGGGATC ACATTTCTGT CTCGAGAGCC CAGGTTGAAG CATGGTTCGC CACAGATTAC
TCTAGCGTCA TGTCAGGTGC CGATCCGAGA GTCGAGGCAC AGACAGTTGG CGAACTGCAG
ATTGCAGGAG GGCAACGAGC ACAAGTTTCC ACAGAGGACG AAAGACAGCA GACACTCAGC
GAATCTCGTT CTGAGGGACA ACTACGGGGA CATAATCGCG AGCACTCTCC TCGTTCTTCT
TCTGTGAATG CTTCCGAACT CTTCAATCGT AGAGATGTGC CTTTTACTTC GTCTCTTCGT
GAACAAACAC AAAGAATTGC ACGCGCGACC CTGGGTGACC CTTCACTTCT TGCGAGAGGA
AATCCTCCGG TGCATGGCAG TCCCTGGCAC ACGTGGTCGC AATCCGAAAC GGGCGTTCCG
CCTGCGTTGA ACCACGAAGG CATTAACGTG CAAAGAGTCG GCACGAGTTT AAATATCACT
CTCTCGACAC CCATGGATTT TAGTTCGAGA TCTCGCTCGT TGCACGACGT GCTCAATTCT
ATCAGCGGGG AAATGGCTCG CTTGGCCGTT CGTCCATCGC ATTTAGAAAT TTTGGATCCT
TTATCGTCGG CAAAAGCAAT CCTGTGGCTC GGGGATGGAT CCTTTCAGGC AACAGCACCG
GCTGAATTCA GCGTTCGGAA TCAAACAATT TGGAGCCTTG ATGGAAGGAA TGCCCGAGCA
AGTTTCTGGT CAAACGCAGT GTGCATGAGT ATCGGCGAGT CTTTGGCGAC TTCTGGTACT
TTTTTGTCCC ATTTGGACAC CGCGGAACTC GCGACATCGT CATTGGTTTC CAGTGAGCGA
GACAAGGAAA ACGTCAACGG AAATCTGGCA GAAATCGTGG ACGAATCGCC CGAAGAGCGG
ACAGCGCGAC AACGCCGAGA AGCGGCGGCA CTGGACATAA GCGCAAATGA GCTCTCCATG
GTTACAGAAG ATTTACGAGA TGGATTTTTG AATGCGAATA GACTAGTTGC CCGTGCGCGC
GAGCAAGCAC CGGAAGCTGC TAGCGACCTC AACTTGATCG GCGCTGAGTT TCTTAGCGCC
ATGCCAGCCG ATATTCAGCG CGAGTTGATA CAACGCACTA CCTCCGGAGC AAACAGATTG
TTGACGAGGA TTCAGCATAC CGAAGCCAGT CCTTCGGTAT CCGCGCCTCG GCCTCGTGCA
AATAGCCCAA CAACTACAGT GATAGACACT GCGCCGTTGG AGATTGGAAA TATGATGTTT
GGTGAAGGCT GGGATGGCCG TCTAGACCGT CACGAGCAAA TGTCATGGAA TGACATCGAG
CGAATCCGTC AGTCGTCAAC GCGCGACGCA GTCGCTCCAG AAGATTTCAA GCCGTTAAAA
GTGAACGTCG AATTGCGCCA TCAGCATATT CGAACGTTGT TGCTTTTGTT CTTTGTTCAG
ACTATCGACT TAAAGGATAC TGTGAACAAA ATATGTGTAA ACTTGTGCGT GGTGGACCAA
AACCGCGAGT CAATTTTGCG CCAGATTATC GAAACTGTGA TCATGAACAC GATGGATCGT
GCTGATCCTC GTTTCATCGA GTCCACTTTG AGTTCTCCTA ACGCCAAGGG TCTTGAGCCA
AGCGCGGCGA TGGAAGTAGC CATGGCGCGC TCGCTGTGCG CGAATCTGAG CGAGCGTAAC
GGAGCACGCA AGGAGATGGT GACGAGACGA CTTTTGACGC TTCTGTCGCA ACTCATTAGT
CGAAACAGGC ACGTACATGG GACATATCGC GTTCGTGGCG TTAATTTGAC GTGCGTGAAA
CTTTTCTTTG CGAAGGATGA GGACGCAAGT ATGGTAAACT TTCTCACAAA TCAGAAGGAG
AATGGCGAAG CTGTGTTGCG ATCTGACGCT GGCGTGAGCA CCGTGGATAT ACCACTCCTT
TCTTTCCTGG TCGGACTTCT CGCACACGCA TCGCAATTTC ATGACCAAAA GAATATTGAG
CTGTGCGCGA CGTGCACTGA CGCTTACCTC ACAAAGTTTC TTGCGGCTAC GCCCAAACCC
AACATCGGGC CCAAAGGCTT GTCAACCGCC GCCACCCCTG AACTTTTGGC GAGTATAGCA
AGCTTTCTCA AGGTTCCATA TCTCGCCCCG AGTGCGTACG AGAAAATCAA CACTATACTA
CAAAAAGTTT TAGGCGACTT CTTTGAAGAG GGATGCGACG CAGTCATACT GAATCGATTC
CATGCCGATG CCCTGCAGTG CTCGGTTGAA GCCATCGCGT GCTTGTCGAA GGCGTCGTCG
ATCAAGTACA GACCAAATGT CGATGAACCA TCCGCACGAC ACGTTGAGTA CGCCGACTCG
CGTCAATCCC TGAAAACGCA TATTCCTTAT TTGCGTCGCC TGACGGATTG TTTTCTTCAG
GTTCTGACGT ACTTGGATGG AAAAGACAGC CGTTTCCAAA AAGCTTTCAA GGATAAAGCG
AAGGAAGAGT TGATACGATA TCATGATGCG CTCCGTCCGT TTTGGCAGAT GATGAACGTC
TATGCAAGTC TCATGCAAGA CATTACTAGC GATGAAAGCG GTGATGCATC GATCCTGTGC
GAACGCTCGC ACGCTAGCGC CGCGCTGACT GAGGGTATAA CGACGTACTT GATGATTGGA
ACGAGCCTCT ATCCATACGT CGAAGGTGAA GTTTCGAAGA GCAGACGAGA ATCCG
 
Protein sequence
MEFMREIVEN CGELPSFAHG EATALFLEFF ETCESALRLL AAATRRSVVG RTPRNAYFSS 
DVFQRRVRFM LSKIHAHMPL SAAHLNDADE NESKACEVVA HELFVAEDRE EGSGPLDCTR
YEFTVDTTSS GSDEIVFKAS GRRGLDPSEM HRPYNRASSL ECLIDDFSRP QMFDPGKEMD
SLKDISAAYF GVLAQLWSKS LQRTRAGREK AARLSILAQQ INVHLGKGAE PSDFELYVED
AQTTAIKLLD LMQVDLGVTP SLLLRRDATT LLSAYTNERA SGFQVCQIMR RPAGRGALSQ
CVSTMTAALL APLSSSDAEI DFDVVRDARC LMHLVGILST TTPGCTAIRE AELLPLLMSI
FKLERLEAIP VVVDVVHIVE SYLEFQPNAV VAFRELNGTD LLLKRMRYES VTAIEELRAS
GHLDDVADRK RKIDSLSSLE DAETSLAAMK KAPKYVNFYR RVLIKVLMRT LAYTSFAYGG
ARMRVPGLSD GTLTDILGSV LKYPLKFGPG VSSLTANLVC DVIHNEPTCY ATLDEAGIVD
AFLKFITETM WPLGRSKGMA KVLCAIPTTL NAICLNEKGQ EKVLKSSALV CFRKMFQDSS
FPLNADTSRI VGTGINETLR HIPALKDVSV EVLNDILSDL TIQLRKISDS ETNLEHAFVN
KAVVEQMPIS KTLQNVARFM DGILQTSHMC APLVQAGALD SMLNMLTCPL PVEFSTCASY
NSISVACKSL INPPDPSNSY AGANTPNLSD DVILACTTVV EKYMASAVDI GVQIEEFYVH
ELDKTGDDID KLATYDIYSK TLSEKHKGDK AALDREVRFC RQTKSLCQRM AAHERLCELL
ATLVQQVPVM SAAIMNKRRD DPHGPMMAES LFNAALAAFN EAVKIEAKLR SLAVEIQADE
MPYSPVWWLY TRADLFVRVT SGLFAAVAKL STTMRRRKDL GPEAQNFIKL SGEAIIAVAR
EMTKAFTSRV TAQAELLPPS HASNAKMDFE QRCSHNFLAL IYESFFDDKR NSPNGVMINY
AARCGLFEQL YTHFQGTVAL TQSILETCTG EKAEQGHVAA ECYRTLSCFV RIFVSISDVK
LMASNNSNKL ALGTRLPERV EDDPIYDKMT KYTHISDAFT NSRSAMDWIH CSLNEALKCI
WRPETDIFAS VAAGHVDSDA LVQCLLNIIS GTEQAVMSEQ RRAASVPRAP PRPPPRQFVP
SETMIDSIVE MGFSRGHAHH ALAAVNGQSV ERALEYMLTR PMEDVPDEAP TAPSEPAPSE
STALEETADA METAPVAFNH THAVQGRLPA INELVAALYA NFHISAEKGA TLLLKVIEKH
ELMSGLSRDE CIYALTDEVV RTRTPKIGRD EVYAIKHYAF VEALARQGDE VLRRALWIKG
SVIDSQVETF IAEVDKICAA QKTASLETFL KDVPLSFVPL ANCINAAACL QKHRDSDASA
RNYSKFGFLN DGHKIALADA CVECLIAIVK NSGELASDYE SHSVQAVILD LLANLSRDNV
IADRIASNSF ADSKDGAAIS KRSFAHTLFT CFWNNTNDSV SVILRHMVND PRTLQFAMEL
EIVKSITKPP MGRDSKVALK TFCVAMKNII ERDFDCFEAA FANVAEIRTM ADTPLSTPRE
YVIPTAEVKA MQSLPTVRLT KNLRMVVEVL VAVATEPIHQ APLTKLERQA AAFRLLTELV
EVYPSSVKAL MDMDDQTDIF RNILRYQLPS SRRSSSSEQT GVWCGCENAA FFLATLCVNS
QKARTKIIEK MLVLLKEPDL TQETASTCAD KGDSLVPKHA FVDLLHSLMD YGLARGYQSG
ERIRNRVVFN LQQGVLKTMY SLKVPERIIE LIEETNVVGA AREKRGNPGF LRVTLAVLEA
LVVRSTKSKR ESRLTRIGNM VRAVNAARVP SGEEVEERRA LTEQMLQNLL EHAAGGNIQV
ARLEDDLDGM PEIEGVHPVF GGDDGDADGI DANDGHENSD DDSEMEVDDD HDHDSDEANE
EGMHESSDEE QLSESGESGE SSGTSDSSEE DSEEEEEEDD EEGEEAMADD FYGSDMDHID
PDPGNDVLEM SDDALYDEDG EEFDYDDGGG SDDDDDDDEE EGDFEEPSSP QSSYTSPSGV
RVLPRATHSG SHVERTSNLG SRDHISVSRA QVEAWFATDY SSVMSGADPR VEAQTVGELQ
IAGGQRAQVS TEDERQQTLS ESRSEGQLRG HNREHSPRSS SVNASELFNR RDVPFTSSLR
EQTQRIARAT LGDPSLLARG NPPVHGSPWH TWSQSETGVP PALNHEGINV QRVGTSLNIT
LSTPMDFSSR SRSLHDVLNS ISGEMARLAV RPSHLEILDP LSSAKAILWL GDGSFQATAP
AEFSVRNQTI WSLDGRNARA SFWSNAVCMS IGESLATSGT FLSHLDTAEL ATSSLVSSER
DKENVNGNLA EIVDESPEER TARQRREAAA LDISANELSM VTEDLRDGFL NANRLVARAR
EQAPEAASDL NLIGAEFLSA MPADIQRELI QRTTSGANRL LTRIQHTEAS PSVSAPRPRA
NSPTTTVIDT APLEIGNMMF GEGWDGRLDR HEQMSWNDIE RIRQSSTRDA VAPEDFKPLK
VNVELRHQHI RTLLLLFFVQ TIDLKDTVNK ICVNLCVVDQ NRESILRQII ETVIMNTMDR
ADPRFIESTL SSPNAKGLEP SAAMEVAMAR SLCANLSERN GARKEMVTRR LLTLLSQLIS
RNRHVHGTYR VRGVNLTCVK LFFAKDEDAS MVNFLTNQKE NGEAVLRSDA GVSTVDIPLL
SFLVGLLAHA SQFHDQKNIE LCATCTDAYL TKFLAATPKP NIGPKGLSTA ATPELLASIA
SFLKVPYLAP SAYEKINTIL QKVLGDFFEE GCDAVILNRF HADALQCSVE AIACLSKASS
IKYRPNVDEP SARHVEYADS RQSLKTHIPY LRRLTDCFLQ VLTYLDGKDS RFQKAFKDKA
KEELIRYHDA LRPFWQMMNV YASLMQDITS DESGDASILC ERSHASAALT EGITTYLMIG
TSLYPYVEGE VSKSRRES
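
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of how the reported Gene Length and GC content could be cross-checked against the record. The helper names (clean_sequence, gc_percent, span_length) are hypothetical and not part of any database tool, and the sketch assumes that the Start bp and End bp values are 1-based, inclusive coordinates, which is consistent with the reported length of 9055 bp.

```python
def clean_sequence(blocks: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocks.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGGAGTTCA TGCGAGAGAT TGTAGAAAAT TGCGGCGAAT TGCCGTCGTT TGCGCACGGC"
dna = clean_sequence(first_line)

print(len(dna))                     # number of bases in the sample line
print(gc_percent(dna))              # GC percentage of the sample line only
print(span_length(496735, 505789))  # 9055, matching the Gene Length field
```

Pasting the full gene sequence into clean_sequence in place of first_line would give the figures for the whole gene, which can then be compared with the Gene Length and GC content fields under Gene Information.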