Gene Ndas_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0494 
Symbol 
ID9244335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp595373 
End bp603373 
Gene Length8001 bp 
Protein Length2666 aa 
Translation table11 
GC content77% 
IMG OID 
ProductAcyl transferase 
Protein accessionYP_003678447 
Protein GI297559473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGCA GATCAGACGG AGCCCCGGCC CACGGCCCCT GGGAGCCCAT CGCAGTCGTC 
GGACTCTCCT GCCGGCTCCC GGGCGCGCGC ACGCCCGACG CCTTCTGGCG GCTGCTGTGC
GAGGGGCGCG AGGCCATCAC CCCGCCGCCC GCCCGGCTGG GCGAGCCCCC CACGGGACGC
GGGAGTACCT GGGGCGGCTA CCTCGACAAC GCCGAGGACT TCGACGCCGC CTTCTTCGGT
ATCTCCCCGC GCGAGGCGCT GGCCATGGAC CCCCAGCAGC GCCTGATGCT CGAACTGAGC
TGGGAGGCGG TCGAGAACTC CCGGACGGCG CCCGGCACGC TGCGGGGCGC GCGGGTGGGC
GTGTTCACCG GGGCCATCGG CTCCGACTAC GCGCTCCTGT ACGACCGCGG CGGGGCCGGG
GCCATCACCC ACCACACGCT GACGGGTACC CACCGCAGCA TGATCGCCAA CCGGGTCTCC
CACGCCCTGG GCCTGGTCGG GCCGAGCCTG ACCGTCGACG CCGGGCAGTC CTCCTCCCTG
GTGGGCGTGC AACTGGCGGT GGAGAGCCTG CGCCGGGGCG ACTCCGACGT CGCGCTGGCG
GGCGGGGTCA ACCTCATCCT GGTCGCCGAG AGCACCGAGA GCGTCGCCAG GTTCGGCGGG
CTCTCCCCGG ACGCGCGCTG CCACACCTTC GACTCCCGGG CCAACGGCTA CGTGCGCGGT
GAGGGCGGGG CCGTGGTCGT GCTCAAACCC CTCGGCCGGG CCCTGGCCGA CGGCGACCGC
GTCCACGCCG TCATCCACGG CGGGGCGGTC AACAACTCCG GCGCGGGCGA GTTCCTCACC
CGCCCCACCG TCGACGGGCA GGAGGCGGTG ATCCGCGCGG CCTACGCCGA CGCGGGCAAG
GACCCCGGCC AGGTCGCCTA CGTGGAGCTG CACGGTACCG GGACTCCGGC GGGCGACCCG
GTGGAGGCGG CGGCCCTGGG CGGCGTCTTC GGCCGGGGCG CCGAGGAACC GCTGCGGGTG
GGCTCGGCGA AGACCAACGT CGGACACCTG GAGGGCGCGG CGGGCGTCGT CGGCCTGGTC
AAGACGGTGC TCAGCCTGGA CCACCAGCAG ATCCCGCCCA CCCTGAACTT CGCCGAACCC
CACCCCGACA TCGACCTGGA CGGCCTCGGG CTGTCGGTGC AGACCCGGCG CGAGCCCTGG
CCGACCCACA CCCCCCTGGC GGGGGTGTCC TCCTTCGGCA TGGGCGGCAC CAACTGCCAC
CTGGTCCTGG GGCCGGGGCC CGTGCCCGAG CGGACCGGCT CCGACCCGGT GACCAAGCGC
GTCCTGGACC CGGTGCCCTG GGTGCTCACC GCCCGCTCGC CCGAGGCCCT GCGCGCCCAG
GCCTCCGCGC TGCACGCCTA CGTCTCCTCC CGGCCCGGCG TCAGGGCGCA CGACGTGGGC
CTGTCCCTGG CCACCACCCG CGACGTCATG GAGCACCGCG CGGTGGTCCT GGCCCCCGGG
GAAGACAGCG ACCGCGAACT GGAGTCGGTC CGCCTGGAGG GCGGTCCCGA CACCGAGCCC
GGCCACGACG GCCCCGTCCT CGTCTTCCCC GGACAGGGCG GCCAGTGGGC GGGGATGGGC
CGCAGCCTGC TCGACGGCGA GGGGGTGCTG GCCGAGGCCT TCACCCGGCG CCTGCGCGAG
TGCGAGAGGG CGCTGGAGCC CCACACGGAC TGGTCGGTCT CGGCGGTGCT GCGCGGCGAC
TCCCAGGCAC CCCCGCTGAC GGGCGAGGGC TCCAGGGCGG ACGTGGTGCA GCCGGCGCTG
TGGGCGGTCA TGGTCTCCCT GGCGGGCGTG TGGGAGGACC TGGGGGTGCG GCCCTCGGCG
GTGGTCGGGC ACTCGCAGGG TGAGCTGGCC GCGGCGTGTG TGTCGGGTGC GCTGTCCCTG
GAGGAGGCGG CCCGTATCGT GGCGCTGCGC AGCCGGGCGC TGACCGCGCT GTCGGGCAGC
GGCGGGATGG CCTCGCTGGG CGTCACGGCC GAGGAGGCCG CCGAGCTCGC GGCCGACATC
CCCGACCTGA ACGTGGCCGC GGTCAACGGC CCCGAGGCGG TGGTGGTCTC GGGGAGCCCG
CGCGCGGTGC GCGAGGCCGT GGAGCGCTGT GTGGACCGGG GTGTGCACGG GGCGCTGGTC
GACGTGGACT ACGCCTCGCA CTCGGCGCAC GTGGAGCGCA TCCGCGACAC GATCGTCCGT
GACCTGGGCG AGGTGCGCTG GCGGGCGCCG CGGATCCCCT TCTACTCCAC GCTCGTCGGC
GCGCGCCTGG ACGGGGACGG CCCCGCCCTG GACGCCGGGT ACTGGTACAC GGCGTTGCGC
GAGCCGGTCC TGTTCGCCTC GGCGGTCGGC GCGCTCATCG CCGACGGGAG GCGCGTCTTC
ATCGAGGTCG GCCCGCACCC GGTGCTCTCC TACGGGGTGC GGCGCTCCCT GGAGGCGGCG
GGTGTGCGGG GGCACGTGCT GGAGACCCTG CGCCGGGGCG ACGGCGACCA GGCCCAGTTG
CTCACGGCGG CGGCGCGGGC CTTCACCACC GGTGTGGACA TCGACTGGCC CATGCTGTTC
GAGGGCACCG GCGCGCGGCG CATCGACCTG CCCCCCTACC AGTTTCAGCG CCGGCGTCAC
TGGCCCGACG ACCTGGCCGC GCGCCGTCCC GCCTCCGGGG CCCCGGCCCC CGAGAGCGAG
GACGACCAGC GGGCGCGGGA GCGCACGGCT CCGGTCCTGG AGCTGGTGTG CGAGCACACC
GCCCGCGTCC TTGACCGGGA GGCACTGGAC CCGGGTGAGG AGAAACGGGC CTTCCGGGAC
CTGGGCTTCG ACTCCATCAT GACGCTCGAA CTGGGCGACG AGCTGGAGGA GCGGACCGGG
GTCCGCCTGG AGGACACCGT CCTGTTCGAG TTGCCCACCC CGGCTGAGCT GGCCGCCTTC
CTGGTCACCG AACTGGGGCG GGAGGAGGTC ACCACCGCCT CCGCGCCGTC GCGGCCGGCT
CCCGTGCCCG CGCCCGCCCC CGCACCGGCC CGGCCCGCCG ACGAGGACGA CCCCGTCGCC
ATCACCGCGA TGGCCTGCCG CCTGCCGGGA GGCGTCTCCT CGCCCGAGCA GCTGTGGCGC
CTGGTCGAGG ACGGCGTGGA CGCCACGGGC GATCTGCCCG ACAACCGATT CTGGGACCTG
GACTCCCTCT ACGACCCCGA GCCGGGCGCG CCCGGGCGCA CCTACACCCG TCGGGGCGGC
TTCCTGTACG ACGCCGACCT CTTCGACGCC GACTTCTTCG GTATCTCCCC GCGCGAGGCC
GACGCCATGG ACCCCCAGCA GCGGCTGCTG CTGGAGACCT CCTGGGAGGC GGTCCGGCGC
GCCGGGCTGT CCGACGAGGC CCTGCGCGCG GAGAACGTGG GCGTCTTCGT GGGGGCCATG
CCCAGTGACT ACGGTCCGCG CCTGGCCGAC CCCGTGCGCG GCAACGACGG CGGGTACCGC
CTCACCGGGT CCACCCTGAG TGTGGCCTCG GGCCGCATCG CCTACGTGCT GGGGCTCAAC
GGCCCGGCCA TGACCATCGA CACCGCCTGC TCCTCCTCCC TGGTCGCCGT CCACCAGGCC
GCCGAGGCCA TCCGCCGGGG CGAGTGCGCC ATGGCACTGG CCGGGGGAGC GACCGTCATG
GCCACGCCCG GCATGCTCCT GGAGTTCGCC GCCCAGCGGG GGCTGTCCGC CGACGGCCGC
TGCCGTGCCT TCGGGGAGGG CGCCGACGGC ACGGCCTGGG CCGAGGGCGC CGGAGTACTG
CTGCTGGAGC GCGCCTCCCG GGCCCGCCGG GCCGGACGCC CCGTCCTGGC CCTGATCCGG
GGCGGCGCGG TCAACTCCGA CGGCGCCAGC AACGGGCTCA CCGCGCCCAA CCCGCAGGCC
CAGCAGCGGC TGATCCGCAG CGCCCTGGCC GACGCCGGAC TGGAGCCGGG CGAGGTGGAC
GCGGTCGAGG CGCACGGCAC CGGCACGCGT CTGGGCGACC CCATCGAGGC CAAGGCGCTC
ATCGCGGTGT ACGGCCGGGA CCGGGACGGG GAACCGCTGC GCCTGGGCTC GCTCAAGTCC
AACATCGGGC ACGCCCAGGC CGCGGCCGGG GTCGCCGGGG TGATCAAGAC CGTGCAGGCC
CTGCGCCACG GACGGCTGCC CCGGACCCTG CACGCCGACG TTCCCACCAC GCAGGTGGAC
TGGGAGGACG CCGGGGTGCG CCTGCTCCGG GAGAACGAGC CCTGGCCCGA CACCGGCCGT
CCCCGGCGCG CCGCCGTGTC CTCCTTCGGC ATCAGCGGCA CCAACGCCCA CCTGGTGCTG
GAGGGGGTCG TCGACGAGGG CGCCCCGGTC AGCATGGTGG ACCTGCCCTT CCAGCGCCGC
AGACACTGGA CCCGGCCCCC CGCCCCGGAG GCGGCCCCCG CCGCGGCCCC GCGGCTGCTG
GACGGACCGG GCCTGGAGCT GGTCGACGGC AGCACCGTGT TCGGCGGCCG GATCGGTGAG
GCCGCCCACC CCTGGACGCG CGACCACCGG CTCCTGGGCC GGGTCGTCCT GCCCGGGACG
GCGCTGGCCG AACTGGCCCT GTACGCCGGA GCGCACACCG GCGCCGCCGC CGTGGCCGAC
CTGACCCTGG AGCGGCCCCT CATCCTGCCC CGGGGCGGGG AGACCTCCGT GCAGGTGACC
GTCGCCGCGC CCGACGCGCG CGGGCGCCGC GCGGTGAGCG TGCACTCGCG CGGCCCCGAC
GCGGGGGAGT GGACCCGGCA CGCCTCGGGC GCGCTGGAAC CCGACACCGG CGGGGAGGCT
CCGTCCGCGG GGGCCTGGCC GCCGCCGGAG GCGGCCCCGG CGCCGTTCGG GTATGACGAG
GACTACGGGC GGCTGGCGCG GCGCGGCTAC GCCTACGGGC CGGTCTTTCG CGGGCTGCGA
TCGGTGTGGC GGGGGCGGGG TGACACGCTG CTGGCCACGG TCGCCCTGCC CGAGGGGCAC
CGCGGGGGAG CCTTCCGCGG CCCCCACCCG GCGCTGCTGG ACGCGGCCCT GCACGCCGTG
CTGCTCTTCG GCGAGAGCCG GGACGGTCCG CCCCTGGTCC CGTTCGCGTG GAGCGGGATG
CGCGTCCCCG ACCGCGACGG CCCGCCGCCG GCCGAGCTGC GCGTGGTCCT GGAGCCGCTG
GGCGGCGACC GCCACCGTCT CACCGCCTTC GACGAACGGG GAGACCTGGC GGTCGGCGTG
GACGAGCTGG CGCTGCGCCC GATCGACGCC GAGGACCTGC CCCCGGCCCC CGACGAGGAG
CCCGGGCTCT ACCGGGTGCG CTGGGACCCG GTTCCGGCGC CGGAACCGGC CGCGCGGGTG
CCCATCGGAC TGGCCGCCCT GGGCGACCTG GCCCCCGACG CGCCCGTCCC CGACACCGTC
TACGCCGAGC TGCCCCCGCG CGCGGTCTAC GCCGACCACG ACCCCGTCCC CTCCTCGGTG
TACGAGGGGC TGCACGCGGT CCTGGACACC GTGCGCACGT GGCTGTCCGA GGACCGGTTC
GCGCACGCCC GCCTGGTTCT GGTGACCTGG CGCTCGGTGT CCACCTCTCC CGAGGACCGG
CTGGGCGGCC CGGCGGCCTC ACCGGTGTGG GGCCTGCTGC GCTCGGTCCA GCGCGAGCAT
CCCGACCGGT TCGTGCTGGT GGACTCCGAC GGGTCAGAGG ACTCCCACCG CGTCCTGCCC
GCGGCCGCGA CGCTCGGCGA GCCTCAGCTG GCCCTGCGCT CGGGTCGGGT CCTGGCGCCG
CGCCTGGTCC CGGCGCATGC GGACGACGTC CCCGTGCCGC CCGGTCCGGC CTGGCGGATC
GAGGTCGGCG GCCCGGACGG TTCCGCTCTG TCGGCGGCAC CCGCCGCCGC GGCGCCGCTG
GAGGGGCACC AGGTGCGTGT CGCGGTCCGC GCCGCCGGTG CGGCCCCCGA CGTGGAGGCC
TCCGTCCGGC GCACCGGCGC ACAGGGCGCA GGCGTGGTCC TGGAGACGGG CGCGCAGGTG
CACGACCTGG TCCCCGGCGA CCGCGTCACC GGCCTGTTCG ACGACGCCTT CGCCACGACG
GCCGTCGCCG ACCGGCGCCT GCTCGCACGG GTTCCGCGCG AGTGGGGCCT GGAGGAGGCC
GCGGCCGTCC CGTTCCCCGC GCTGACGGCC TACCACGCCC TGGTCGACCT GGCGGCCGTG
CGCCCCGGGG AGAGCGTCCT GGTGCACGAC GCGGCCCGCG GCGCCGGACC GGTCGCGGTG
CAGCTCGCCC GCCACATGGG CGCCCACGTC CTGGCCACCG CCGACCCGCG CCAGTGGCCG
CTGCTGGTCG GCCTGGGGCT GTCCGAGGAC TGCCTGGCCT CCAGCCGCGA CCCCGGCTTC
GCCGCCGCCG TGCGCGAGGC CAACGGGGGC AGGGGCGTGG ACGTGGTCCT GGACACGCGT
GCCGGGGACG GTGTCGACGC CCTGATGGGC CTGCTCGTCG CGCCCTCCGA CGGCGGCGGA
GTGGGCGGCC GGTTCGTGGA GATGACCGGG ACCGGTCCCC GCGACGGGGC CGAACCCCCG
GTCGCCGACC ACCCGGGGCG CGGCTACGAG GCCTTCCGAC TGGCGGACGT GGACGCCGAG
CGGGTCGGGC AGATGCTCAC CCGCGTCATG GAGCTGTTCG CCTCCGGTGT CCTCAGCCCG
CTCCCGGTGG ACGTCCACGA CCTGCGTGAC GCGCACCTGG CACCCACCGA CCCGGACGGG
GCCGTGGTCC TGGCCGTCCC GGCGCCCTTC CCCCAGGACG GCACCGTCCT GGTCACCGGG
GGCACCGGCA CCCTGGGCCA CCAGGTGGCC CGGCACCTGG TCACCGGGTA CGGGGTGCGC
CACCTGACCC TGATCAGCCG CAGCGGTGCC GCCGCGCGGG GCCAGGACGG GCGCACCGAC
GAACTGCGGC GCCTGGGCGC GCAGGTGGAG GTGCACGCCT GCGACGCCGC GGACCGCGAC
GCGCTGGCCC GCCTGATCGA CTCCCTGCCC GGGGAGCGGC CGCTGCGCGC CGTCGTCCAC
ACGGCGGGGG TGCTCGACGA CGCCACCGTG CTGTCCCTGG ACGCCCAACG GCTGGAGGCG
GTGCTGCGGC CCAAGGTGGA CGCCGCCTGG AACCTGCACG AGCTGACCGC GGACCTGGAC
CTGGGCGCCT TCGTCCTGTT CTCCTCGGTG ACGGGGACCC TGGGCAGCCC GGGGCAGGCC
AACTACGCGG CGGCCAACGT CTTCCTGGAC GTGCTCGCAC GCCTGCGCCG CGAGCGCGGG
CTCCCCGCCC TGTCGCTGGC CTGGGGCCTG TGGGACGAGG CCAGCGGGAT GACCGCCCAC
CTGGACGGCG GTGACCTGGG CGCCCTGGGC CGCGCCGGAC TGCTCCCGAT GCCCGTCGAC
CGCGCCCTGG CCAAGATGGA CGCCGCCCTC TACCTGGGGG GCGACGTCCT GGTTCCCGCC
CTGCTCAACC TGGGGGACGG CGCCGACCTG CCGATCCTGC GCGAGCTCGC CGGACCCGCC
GGGGAGGATG ACCCGGGCGC GGGCGGCGGC GGGGAGCCGC CGCTGTCGCG GCGGAGTCTG
ACCGGGCTGG CCCCGGCCGA ACGCGGACGG TTGCTGCTCA CCGAGGTGCG CGAGCGCACC
GCGCTCGTCC TGTCCCGGCA GGACGCGGGC CAGGTCCCGG CCGACCGCCC CTTCCGCGAG
CTGGGCCTGG ACTCCCTGAC CGGGGTCGAG CTGCGCAACC GGCTGGGCGC CGCCTCCGGC
CTGCGCCTGC CCGCGACCGC GGTGTTCGAC CACCCCACCC CGCGAGCCCT GGCCCGGTTC
CTGGAGGGCC TGCTCTTCCC CGCCGAACCG GAGGAACCCG GGGACGCGCG GAGGTCCGGG
AACGCCGAAC CGGAGACGGA GCCCGGTGAC GCGGGCAGCA GCATCGACGA CATGGACGTG
GAGGACCTCC TGCACCTGGC CCTGGGCCGG GAGGGGGCCG CGTCCGACGG TGAACAGGAG
ACGGCTGATG GCTACCGATA G
 
Protein sequence
MSGRSDGAPA HGPWEPIAVV GLSCRLPGAR TPDAFWRLLC EGREAITPPP ARLGEPPTGR 
GSTWGGYLDN AEDFDAAFFG ISPREALAMD PQQRLMLELS WEAVENSRTA PGTLRGARVG
VFTGAIGSDY ALLYDRGGAG AITHHTLTGT HRSMIANRVS HALGLVGPSL TVDAGQSSSL
VGVQLAVESL RRGDSDVALA GGVNLILVAE STESVARFGG LSPDARCHTF DSRANGYVRG
EGGAVVVLKP LGRALADGDR VHAVIHGGAV NNSGAGEFLT RPTVDGQEAV IRAAYADAGK
DPGQVAYVEL HGTGTPAGDP VEAAALGGVF GRGAEEPLRV GSAKTNVGHL EGAAGVVGLV
KTVLSLDHQQ IPPTLNFAEP HPDIDLDGLG LSVQTRREPW PTHTPLAGVS SFGMGGTNCH
LVLGPGPVPE RTGSDPVTKR VLDPVPWVLT ARSPEALRAQ ASALHAYVSS RPGVRAHDVG
LSLATTRDVM EHRAVVLAPG EDSDRELESV RLEGGPDTEP GHDGPVLVFP GQGGQWAGMG
RSLLDGEGVL AEAFTRRLRE CERALEPHTD WSVSAVLRGD SQAPPLTGEG SRADVVQPAL
WAVMVSLAGV WEDLGVRPSA VVGHSQGELA AACVSGALSL EEAARIVALR SRALTALSGS
GGMASLGVTA EEAAELAADI PDLNVAAVNG PEAVVVSGSP RAVREAVERC VDRGVHGALV
DVDYASHSAH VERIRDTIVR DLGEVRWRAP RIPFYSTLVG ARLDGDGPAL DAGYWYTALR
EPVLFASAVG ALIADGRRVF IEVGPHPVLS YGVRRSLEAA GVRGHVLETL RRGDGDQAQL
LTAAARAFTT GVDIDWPMLF EGTGARRIDL PPYQFQRRRH WPDDLAARRP ASGAPAPESE
DDQRARERTA PVLELVCEHT ARVLDREALD PGEEKRAFRD LGFDSIMTLE LGDELEERTG
VRLEDTVLFE LPTPAELAAF LVTELGREEV TTASAPSRPA PVPAPAPAPA RPADEDDPVA
ITAMACRLPG GVSSPEQLWR LVEDGVDATG DLPDNRFWDL DSLYDPEPGA PGRTYTRRGG
FLYDADLFDA DFFGISPREA DAMDPQQRLL LETSWEAVRR AGLSDEALRA ENVGVFVGAM
PSDYGPRLAD PVRGNDGGYR LTGSTLSVAS GRIAYVLGLN GPAMTIDTAC SSSLVAVHQA
AEAIRRGECA MALAGGATVM ATPGMLLEFA AQRGLSADGR CRAFGEGADG TAWAEGAGVL
LLERASRARR AGRPVLALIR GGAVNSDGAS NGLTAPNPQA QQRLIRSALA DAGLEPGEVD
AVEAHGTGTR LGDPIEAKAL IAVYGRDRDG EPLRLGSLKS NIGHAQAAAG VAGVIKTVQA
LRHGRLPRTL HADVPTTQVD WEDAGVRLLR ENEPWPDTGR PRRAAVSSFG ISGTNAHLVL
EGVVDEGAPV SMVDLPFQRR RHWTRPPAPE AAPAAAPRLL DGPGLELVDG STVFGGRIGE
AAHPWTRDHR LLGRVVLPGT ALAELALYAG AHTGAAAVAD LTLERPLILP RGGETSVQVT
VAAPDARGRR AVSVHSRGPD AGEWTRHASG ALEPDTGGEA PSAGAWPPPE AAPAPFGYDE
DYGRLARRGY AYGPVFRGLR SVWRGRGDTL LATVALPEGH RGGAFRGPHP ALLDAALHAV
LLFGESRDGP PLVPFAWSGM RVPDRDGPPP AELRVVLEPL GGDRHRLTAF DERGDLAVGV
DELALRPIDA EDLPPAPDEE PGLYRVRWDP VPAPEPAARV PIGLAALGDL APDAPVPDTV
YAELPPRAVY ADHDPVPSSV YEGLHAVLDT VRTWLSEDRF AHARLVLVTW RSVSTSPEDR
LGGPAASPVW GLLRSVQREH PDRFVLVDSD GSEDSHRVLP AAATLGEPQL ALRSGRVLAP
RLVPAHADDV PVPPGPAWRI EVGGPDGSAL SAAPAAAAPL EGHQVRVAVR AAGAAPDVEA
SVRRTGAQGA GVVLETGAQV HDLVPGDRVT GLFDDAFATT AVADRRLLAR VPREWGLEEA
AAVPFPALTA YHALVDLAAV RPGESVLVHD AARGAGPVAV QLARHMGAHV LATADPRQWP
LLVGLGLSED CLASSRDPGF AAAVREANGG RGVDVVLDTR AGDGVDALMG LLVAPSDGGG
VGGRFVEMTG TGPRDGAEPP VADHPGRGYE AFRLADVDAE RVGQMLTRVM ELFASGVLSP
LPVDVHDLRD AHLAPTDPDG AVVLAVPAPF PQDGTVLVTG GTGTLGHQVA RHLVTGYGVR
HLTLISRSGA AARGQDGRTD ELRRLGAQVE VHACDAADRD ALARLIDSLP GERPLRAVVH
TAGVLDDATV LSLDAQRLEA VLRPKVDAAW NLHELTADLD LGAFVLFSSV TGTLGSPGQA
NYAAANVFLD VLARLRRERG LPALSLAWGL WDEASGMTAH LDGGDLGALG RAGLLPMPVD
RALAKMDAAL YLGGDVLVPA LLNLGDGADL PILRELAGPA GEDDPGAGGG GEPPLSRRSL
TGLAPAERGR LLLTEVRERT ALVLSRQDAG QVPADRPFRE LGLDSLTGVE LRNRLGAASG
LRLPATAVFD HPTPRALARF LEGLLFPAEP EEPGDARRSG NAEPETEPGD AGSSIDDMDV
EDLLHLALGR EGAASDGEQE TADGYR