Gene Rpal_3761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3761 
Symbol 
ID6411439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4028694 
End bp4036742 
Gene Length8049 bp 
Protein Length2682 aa 
Translation table11 
GC content63% 
IMG OID642713642 
Productamino acid adenylation domain protein 
Protein accessionYP_001992735 
Protein GI192292130 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase
[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.674004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGACT GCAATGAGAT CGGCTTCGCG ATAGTGGGAC TTTCTGGTCG CTTTCCAGGC 
GCACCCGACG TTCAGAAGTT TTGGCAGAAC ATCAAAGCGG GGCGTACGAG CGTTTCGCGT
TTCTCGGCTG AAGAGCTGGA GGACTCCTTC GCTGATCAGG AGCGTGCCGA TCCCGACTTT
GTCGCGGCCA AGCCTGTTCT GGACGACGTC GATCTGTTCG ACAGCGACTA CTTCGGGATG
TCGCCGCGCG AAGCGGATTT GACCGATCCG CAGCAGCGGA TTTTCCTCGA GATCTGCTCC
GAGGCGTTGG ACGATGCGGG GTACGACCCC GAGCGTTATC AGGGTGCGGT GGGCGTCTTT
GCCGGCACCT CACTGAATAC CTATTTTTTG CGACACGTGT GCCCCGACCG CGCTGCCCTG
GAGCGGTTCA CGTCGAACTT CCAGGTGGGC TCTTATTCGG AGCTGCTCGG CACGCTCAAT
GATTTCATTG CGACGCGGGT CGCCTACAAG CTCAATCTCA AGGGACCAGC GGTTTCTGTC
CAGAGCGCCT GTTCGACGTC GCTGCTCGCT GTTGCGCAGG CCTGCGAGAG TCTCGCGGCT
CTGCAGTGCG ACATGGCGTT GGCCGGTGGC GTCTCGGTTA CGTTCCCCCA GAAGCGAGGA
TACATCTATC AGGATGGCGG GATTGCCTCG CGCGATGGCA CCTGCCGGCC GTTCGATGTC
GACGCGGCCG GGACCGTGTT CGGCTCCGGC GCGGGTGTGG TGGCGATCAA ACGGCTGGCG
GATGCCGTCG CTGATCGCGA CGCGATCTAC GCCGTGATCA GAGGTTACGG CATCAATAAT
GATGGCGCAG CCAAAGTCGG ATTCGCGGCC CCCAGCGTCG GAGGGCAGGC GGAGTGCATC
GCGGCGGCGC TCGCGATGGC CGGATTTGAG GCGGAGACCA TCGGCTACGT CGAGTGTCAC
GGCACCGGCA CGCCGCTGGG TGATCCGATC GAGATCGAGG GGCTGTCGCA AGCATTCGCT
GAAGCTTCCA CCGGTGCGAT CGAAGACCGC AACTGCGCCC TGGGATCGGT GAAGGCCAAT
GTCGGTCACT TGGATGCTGC GGCGGGCGTG ACGGGTCTGA TCAAGACCGC ATTGATACTT
CACCACGGCG TTCTTCCGCC TCAGGCGAAT TTTGCCGAAC CGAACCCACG CCTCGACCTC
GCGAAGCGGG GATTTCGGGT GGCGACGGAA TTGCAAGAGT GGCCTCGCGG AGATGCGCCG
CGCCGCGCCG GCGTCAGCGC CTTCGGGGTC GGAGGCACCA ATGTTCATCT GTGCCTGGAG
GAGTCACCAC CGGCTGCCGC GGAGAGCCTG TTGCCGGTGG AGAATGCGGT GGTTCTCCGC
CTGTCTGCCC GGTCTCCCAC TGCACTGGAG GCGATGCGCC TCCGGCTCGC CGACCATCTT
GAGGCGGCAG CGCAGCGTGG CGAGCGCATT GGCTTGAACG AACTCGCCCA CACCCTGGCC
GCAGGACGGC GAACGTTTGA TCGGCGCTGG GCGGTTGCGG TGAACTCCGT GCAGGGAGCG
GTTCAGTCGT TGCGGGAAGT CCGCTCCGAC CCCCGACCGA CATCGCTTCG CCCCGGTGCG
GTGTTCATGT TTCCCGGCCA GGGAGCTCAA TACCCCGGGA TGGGACGTTG GCTCTTCGAG
TCCGATCGCG AATTCCGCGC CGATGTCGAG CTCGGGGCCG CAATCGTCAA AGAACACCTC
GGCGTCGATA TCCTTGCCGT GCTGTTCGAC GACCGTGACG ATGACGACCA TGCGATCCGA
TCGACGACGT TGGCGCAGCC GGCGCTCTTC CTCATCGAAT ATGCTTTGGC ACAGCGTCTG
CTCCGGTCCG GCATCGAGCC GAAAGCGATG ATCGGCCACA GCCTCGGCGA GTTGGTGGCG
GCGGCGCTTG CGGGCGTGAT GTCTTATCAG GATGCGCTGA AGCTTGTGGT GCATCGCGGG
CGGATCATGC AGGCTCAGCC ACCGGGCGCC ATGCTCAGCG TGCGATTGGC TGCCGGCGAG
CTTGCGGCGC ATCTGCCAGT CGGGGTCGAG ATTGCGTCCG ACAATTCGCC TCGCCTCAGT
GTCGCCAGCG GTCCTTTTGA AGCTGTGGAA GAGCTCGAGC GCCGGTTGGA GAGGGCGGAG
ATCCCTCATC GGCGTCTGCA CACATCCCAC GCCTTCCATT CAGCCATGAT GGATCCGGTG
GCGGTCGAAC TCGAGAAGGT TGCGGCCGGG ATTCCACTCC GGCCTCCGAA GCGTCGGTTG
ATCTCGACCG TCACCGGAAC TTGGATGACC GCGGCGGAGG CCACGTCGCC TGCCTATTGG
GCTCGCCAAT GCCGCGAGCC CGTCAGTTTC CGCGCCGCAC TCACCACGCT GATCGACACG
ATATCCGAGC GGGTGGATTG TCTTTTGGTG GAGGTCGGCC CGGGCCGAAC GCTGGCGACG
TTTGCAGGTG CCTCATCATC CGTTTCTAAG GTTCGGGCGA TCATTACGAC TGCGCCGGAC
TATGTGGAGC GAAACGACGA AGAGCTTACT TTCGCTCAAG CGTTGGGTGA TCTGTGGGCA
CACGGCGCGA GGCTGAAATC CGACGGTCGC CCGGCCGCTG GCGCGGCCCG GCTGCGATTG
CCAAGCTATC CGTTCGAGCG CACGCGCCAT TGGATCGCGC GTCCTGCTCC GGCCAGTCTC
GACATAGCCA TGCCCGTCTC ATCTGCCCAA GGAGACTCGT TCCCTCGGAC CGAGCCTCCG
CGGGAGGCGA GTAGCGATTT GGATGGTCTG CATAGCGTTT CGATGGAGAG TAAAATGTCC
GGTGACGAGG TCTCGAAGGC GTTGATCAAG GCGCTTGCGG CTATTCTGTC AGATGTCTCC
GGACGACCTC TTTCGGATCA GGACGCGGCG ACTTCCTTCA TCTCGCTCGG GTTTGATTCC
TTGCTGCTCG GTCAGGTCGC GCAGCGGGTC AACAAGTCGT TCGGCGTGAA GATTACGTTC
CGGCAATTGA TGCGAGATCT CAATTCTCTG GACGCCCTGG CCGCGATGTT GAAGACCTCG
GCACCGGCCG ACAAATTGCC CCAGGTCGAG CGAGCAGCCG TGAGGCCTCA GGTAGCAATG
ATGCAGAGGC CGGCAGAGGC TGTGCTGCCG AAAGATTCGC CTGTGCCGGT GATGGCGGGA
GCTACGGGAG TCGAGGCGTT GCTTCGCGAG CAGCTGCAAT CGATGGAGCG GATTTTTGCC
GCTCAGCTGG CGGCAGTCGG CAAGGAAGGG CGCGTCACGA CTGAGTTGCC GGTTGCTGCG
GTGCCGGCCA ACAGCGGCGA AGCGCCTTCG CCTGACAACT CGTCGGACAA GGCTCTTGTG
ATGCCGGACA CGGGAGAGGA GGGAGGCGGT CGGTTCAAGA TCTATCGCCC TGGCGCCGGC
GATAGCGATC TCGCACTTGC ACCGGAGCAG CGGGCCTACA TCGAGAACCT CGTCCAGCGT
TATAACGACC GCACTCCTGG ATCGAAGGCC CGTGCTCAGG CGTCGCGCAG AGTTCTTGCC
GATCCCCGGA CGGCGAGCGG CTTCAATGCG CAGTGGAAGG AGATCGTTTA CCCGATCGTG
TGCAAGCGGT CGAAGGGAGC GTCGATCTGG GACGTCGACG GCAACGAGTA CATTGATCTG
GTGAATGGCT ACGGCCAGAC GATGTTCGGC CATGTTCCGC CATTCGTCGC CGCTGCGTTG
CAGGCACAAT TAGACGACGG ATTCGCGATC GGACCGCAGA CCGAGCTCGC CGGGGAGGTT
GCGGCGCGAA TTAGCGCGAT GACCGGCAAT GAGCGGGTCG CCTTCTGCAA TACCGGTTCC
GAAGCTGTCA TGGCCGCCAT CCGGGTGGCG CGCGCCGTGA CCGGACGACA GAAGGTCGTC
GTGTTCAGTG GCGCCTATCA CGGCCAATTC GATGAGGTTC TGATCAAGTC GTGCCGCGCT
GGCAGCGTCC CCGGTGCACT TCCGATCGCG TCCGGCATCC CGGCCGAGAA CGTCGGACAG
ATGATTGTCC TGCCGTACGG CCATCCAGCC AGCCTTGAGA TTGTCAGGCA AATGGCTGAC
GATCTCGCCG CGGTCATCGT CGAGCCGATT CAAAGCCGGC ATCCAGCTTT GCAGCCGCGA
GACTTCGTAG CGTCGCTCCG AGAGTTGACC GCGAAGAGCG GGACGGCGCT GGTATTCGAC
GAGGTGGTGA CGGGCTTCCG AGTGGATCCC GGTGGAATGC AAAAGGTCTT CGGCATCAAG
GCCGACATGG CGACCTATGG CAAAGTGCTT GGCGGAGGAA TGCCAATCGG CGTGGTCGCC
GGCCGCGCGG ATTTCATGGA CGCGCTGGAT GGCGGCTTCT GGCAATACGG TGACGATAGC
GAGCCGGAAG TCGCTCCGAC ATTCTTCGCC GGAACGTTTG TCCGGCATCC GATGGTGCTC
GCCGCCGCGC GTGCCGTGCT CAACCACATC GAACAGAATA AGGCGGAGAT CTATGCGCCC
TTGGCGCAGC GCACAGGGCA GCTTGTCGAT CGGATCAATT CGCACCTCGA CAGCTACGGA
CTTCCGACGC AGGCCGAGAC ATGTGCGAGC TGGTTCTTTG TCGATGGTTC ACCTTTAGGA
CGGTTCGCCG GCCTGCTGTT TGCCGAGCTT CGACTGCAGG GGATTCATGT CCACGAAGGG
TTCCCGTGTT TTCTCACCAC CGCACACGAG GCCGCTCAGT GCCAAGCGAT CGACGGTGCG
TTTGAAAGGG CGATAGGGGC TTTTGCCGAC GCTGGGCTGA GCACGGGAGA AGCCAAGGGT
ATCACCAGGA AGTCGAGCCT TATTGGTGAT GATCAAGCTC AACCGCAGCC GCAAAAACTC
AAAACGGTGC CCCTGACCGA GCCTCAGATG GAGGTGCTGC TCGCAGCGCA AATGGGCGAT
GCAGCCTCGA TGGCGTTCAA CGAGTCGGTC AGCATTCGCT TTAATGGGAC TCTCGTCGAA
AGCGTGTTGA TCAAGGCGGT TGAAGAAGTC GTTCGGCGCC ACGAGGCACT TCGGAGCCGT
GTGGTTGATT TCACCGGAAT GCTTGAGATC AACCCTGATT TCGTGCCTGA GGTTCCGATC
ATCGCTCTCG ACGGGGAGGA TGGCGATACA GAACTGGCGC AGTGGCTTGC CAAGGATGCA
AGAACGCCGT TTGACCTTTT CAACGGCCCG TTGGTTCGTG CCAGCGTTCT GCGTCTGGGG
GCTGACGTCC ACGTGTTGGC CTTCACCGCG CATCACATTA TCTGTGACGG CTGGTCGATG
AACGTCCTGC TCGGGAACCT GGCCGAGATC TACAACGCTT ATCGTGACGG AAAGACGCCG
TCGCTGGTGC CCGCCGCGAG TTTCGCCGAA CACGCTGCGG AGCGACCGCG GCCTTCGGCT
TCCACCATCG AATTCTGGCG GAAGCAGTTC GAAAATGTGC CGGTCGCCTC GGAGCTTCCG
CTCGACCGCC CACGCCCGGA GGTCAAGAAC TTCTTCGGTG CATCGCTGCG GGAGCGGCTG
GGACAGGACC TCTGCCGTGA CGCCAGGGAG CTGGCGAAAC GCCAGGGCGT AACGCTGTTC
ACCGTGCTGT TTGCCTCGGT GCAGACGCTG TTCGCGCGGC TTTGCGACAA CGAGCGCGTT
GTCTTGACCG TCCCGATGGG CGGTCAGGCT TTGCTGGATC GGCAGGATCT GGTCGGGCAT
TGCGTCAACT TCCTGCCGGT GCCAGTCACG GCTCGGTCGG CGGTCTCGTT CTGCGACCAT
CTGAGACACG TCGACGAGAA ACTGAACGAG ATCTTCGATC ACCAGGACTA CACGTTGGGA
TCTCTGGTCC GCGATCTGGA CATACCCCGC GGTATCAACC GGACGCCGCT GTCGGATATC
CAGTTCAACC TTGAACGGGT CGGCTCTCGG CTGACATTCC GCGGCGTCGA CACGCGGGTC
GATACAAATC CAAAGGCCTT CGTCAATTTC GACATCTTCA TCAACATGAT CGAAACCGAC
GACGACATCA CGATCGAGGT CGATTACGCA ACTGATCTTC TGGATCGTGC GACGGTGTCG
CGATGGCTCG ATCAGCTGCG CAAGATCTTG GCGCAAGCGC TCGAGCGGCC TGAGATCAAG
CTGCAGGAAC TGAAGCTGCA GAATCCGGTC GTCATCGGGC TTGAAAGCGG TCAAACCACC
ACGCTGCAGG CCGCGCGTTT CGATATTCCT CTGGTCGAAA TGATCGAGCG TAACTGCGAC
CGCACGCCTG ACGCGATTGC GGTTGAGTAC GAGACCGACG TGTTGCGCTA CCGCGAGCTC
GACTCCAGGA GCAATCGGAT TGCGGATCGG TTGAGAGCGT TGGCTCCGAC CAAGGGAGCC
CGGGTCGCGG TCGCGGTTCA GCGGGGCCTT GATCTGCCGG TCGCGCTGGT CGCCGTAGCG
AAGGCGGGTC TCGCCTACGT CCCGGTCGAT CCCTCGCTGC CCGTGGTTCG GATTCGGCAG
ATGGCTGAAG CCGCCGAGGT TGCGGTCTTC ATCACGTCAG GGGCGGACTG TCCGGTTGCG
GCCGAAATGG GTGTACCGGT CATCGATCTC GAGCGTGATG CTGCACAGAT CGATGCCGCA
TCGTCCGCTC GTCCCGAACC GGCGCCACCG CAAGAGGTGC TGGACTCGAC CGCCTATGTG
ATCTTCACGT CCGGCTCGAC CGGGACACCG AAGGGCGTCG AGATCAGTCA CCGCGCACTC
GCCAACTTCC TCGGCTCGAT GGCGGTCCGC CCTGGCTTCG GTGCCGATGA TCGGATCGTC
GCAGTGACCA CGGTGTCGTT CGACATCGCG GTGCTCGAGC TGCTGCTGCC GCTGTATTGC
GGCGGACGAA CCGTGATCTG CGGCAAGGAC AGGTTGCTTG AACCAGAGAG CGTCGTTCGG
CTGATCGAGA CCAGCTCTGC GACGATTGTT CAGGCCACGC CGACGCTTTG GCGCGTGCTT
CTCGAAGCTG GTTTGCAACC ATCGCGTCCG CTTCGCGCAT TGAGCGGCGG GGAGGCGCTG
CCGCGCGACG TCGCCGAGAA GCTGATCGCT GCAGGCTTCG AGCTGTGGAA CATGTATGGG
CCGACCGAGA CCACGATCTG GTCGGCCTGC GGGCGGATCG TCGATGCCAG CCGCCCGATC
GTGATCGGCG AGCCGGTGGC TCATACCGAT CTCTACATCC TGTCCGACGA TGGAACGCAG
GCGCCGGTCG GCACGCCCGG GGAGCTGTGT ATTGGCGGAC TTGGTCTTGC AAAAGGCTAC
GTCAACCGGC CCGATCTGAC AGCCGCAGCG TTTCCGGTGA TCGCGTTGCA GGATGGCAAA
ACGGTGCGTC TATATCGAAC CGGCGATCTG GCCGTGCGAC TGTCGGACGG CGGCATTCAA
CTGCTCGGCC GACGCGACCA GCAGGTGAAG ATCCGCGGCT TCCGCATCGA GTTGGAAGAG
ATCGAGTCCG TGCTACGGAC CTGTCCAGAG ATCGTGGATG CCGCCGTCAT GGTCGAGAAT
GCCGGCACTG CCGACGCCGC GCTGGTTGCT GCCTTCGTGG CGAAGCCGGG CATGACGGTG
AGCATCGACA GCCTGCAGCA GACGGCGCAG CTCAGCCTGC CTCACTACAT GGTTCCCAAC
CGCTTCGTCG CGGTTGCCGA ATTGCCCAAG ACTGCGAACG GCAAATTGGA CCGAAAGGCG
CTGTCTGCGA CGATCGTCAG TCCGTCGATT GTCGTGAGTT TCATAGAGAA GGCGGAGCTG
CGCGACGAGC CGGCGCCGGA CGTTCTGACC AAGGTCATCG CCATCGTCGA GGCGGCGTTG
AACCGAACCG GGATCAAGCC GGACGACCGG GTTTTCGCGC TCGGTGCGAC CAGTCTGCAC
GTCTTCCGGA TGGCTGCGCG GTTTTCCGAA GCGCAGCTGC CGATCCGGGC ACAAGATCTG
ATGACCAATC CAAGCATGAG CGATTTGGCG CGGCGAGCTT CCGTCGCGGC AAGAGCGCAG
ACGTCCGACA AGAAGACACC ATCGCTCGCC GAATTTCGGC GTCCCTCGAG CCGCGCGAGG
ACAACATGA
 
Protein sequence
MSDCNEIGFA IVGLSGRFPG APDVQKFWQN IKAGRTSVSR FSAEELEDSF ADQERADPDF 
VAAKPVLDDV DLFDSDYFGM SPREADLTDP QQRIFLEICS EALDDAGYDP ERYQGAVGVF
AGTSLNTYFL RHVCPDRAAL ERFTSNFQVG SYSELLGTLN DFIATRVAYK LNLKGPAVSV
QSACSTSLLA VAQACESLAA LQCDMALAGG VSVTFPQKRG YIYQDGGIAS RDGTCRPFDV
DAAGTVFGSG AGVVAIKRLA DAVADRDAIY AVIRGYGINN DGAAKVGFAA PSVGGQAECI
AAALAMAGFE AETIGYVECH GTGTPLGDPI EIEGLSQAFA EASTGAIEDR NCALGSVKAN
VGHLDAAAGV TGLIKTALIL HHGVLPPQAN FAEPNPRLDL AKRGFRVATE LQEWPRGDAP
RRAGVSAFGV GGTNVHLCLE ESPPAAAESL LPVENAVVLR LSARSPTALE AMRLRLADHL
EAAAQRGERI GLNELAHTLA AGRRTFDRRW AVAVNSVQGA VQSLREVRSD PRPTSLRPGA
VFMFPGQGAQ YPGMGRWLFE SDREFRADVE LGAAIVKEHL GVDILAVLFD DRDDDDHAIR
STTLAQPALF LIEYALAQRL LRSGIEPKAM IGHSLGELVA AALAGVMSYQ DALKLVVHRG
RIMQAQPPGA MLSVRLAAGE LAAHLPVGVE IASDNSPRLS VASGPFEAVE ELERRLERAE
IPHRRLHTSH AFHSAMMDPV AVELEKVAAG IPLRPPKRRL ISTVTGTWMT AAEATSPAYW
ARQCREPVSF RAALTTLIDT ISERVDCLLV EVGPGRTLAT FAGASSSVSK VRAIITTAPD
YVERNDEELT FAQALGDLWA HGARLKSDGR PAAGAARLRL PSYPFERTRH WIARPAPASL
DIAMPVSSAQ GDSFPRTEPP REASSDLDGL HSVSMESKMS GDEVSKALIK ALAAILSDVS
GRPLSDQDAA TSFISLGFDS LLLGQVAQRV NKSFGVKITF RQLMRDLNSL DALAAMLKTS
APADKLPQVE RAAVRPQVAM MQRPAEAVLP KDSPVPVMAG ATGVEALLRE QLQSMERIFA
AQLAAVGKEG RVTTELPVAA VPANSGEAPS PDNSSDKALV MPDTGEEGGG RFKIYRPGAG
DSDLALAPEQ RAYIENLVQR YNDRTPGSKA RAQASRRVLA DPRTASGFNA QWKEIVYPIV
CKRSKGASIW DVDGNEYIDL VNGYGQTMFG HVPPFVAAAL QAQLDDGFAI GPQTELAGEV
AARISAMTGN ERVAFCNTGS EAVMAAIRVA RAVTGRQKVV VFSGAYHGQF DEVLIKSCRA
GSVPGALPIA SGIPAENVGQ MIVLPYGHPA SLEIVRQMAD DLAAVIVEPI QSRHPALQPR
DFVASLRELT AKSGTALVFD EVVTGFRVDP GGMQKVFGIK ADMATYGKVL GGGMPIGVVA
GRADFMDALD GGFWQYGDDS EPEVAPTFFA GTFVRHPMVL AAARAVLNHI EQNKAEIYAP
LAQRTGQLVD RINSHLDSYG LPTQAETCAS WFFVDGSPLG RFAGLLFAEL RLQGIHVHEG
FPCFLTTAHE AAQCQAIDGA FERAIGAFAD AGLSTGEAKG ITRKSSLIGD DQAQPQPQKL
KTVPLTEPQM EVLLAAQMGD AASMAFNESV SIRFNGTLVE SVLIKAVEEV VRRHEALRSR
VVDFTGMLEI NPDFVPEVPI IALDGEDGDT ELAQWLAKDA RTPFDLFNGP LVRASVLRLG
ADVHVLAFTA HHIICDGWSM NVLLGNLAEI YNAYRDGKTP SLVPAASFAE HAAERPRPSA
STIEFWRKQF ENVPVASELP LDRPRPEVKN FFGASLRERL GQDLCRDARE LAKRQGVTLF
TVLFASVQTL FARLCDNERV VLTVPMGGQA LLDRQDLVGH CVNFLPVPVT ARSAVSFCDH
LRHVDEKLNE IFDHQDYTLG SLVRDLDIPR GINRTPLSDI QFNLERVGSR LTFRGVDTRV
DTNPKAFVNF DIFINMIETD DDITIEVDYA TDLLDRATVS RWLDQLRKIL AQALERPEIK
LQELKLQNPV VIGLESGQTT TLQAARFDIP LVEMIERNCD RTPDAIAVEY ETDVLRYREL
DSRSNRIADR LRALAPTKGA RVAVAVQRGL DLPVALVAVA KAGLAYVPVD PSLPVVRIRQ
MAEAAEVAVF ITSGADCPVA AEMGVPVIDL ERDAAQIDAA SSARPEPAPP QEVLDSTAYV
IFTSGSTGTP KGVEISHRAL ANFLGSMAVR PGFGADDRIV AVTTVSFDIA VLELLLPLYC
GGRTVICGKD RLLEPESVVR LIETSSATIV QATPTLWRVL LEAGLQPSRP LRALSGGEAL
PRDVAEKLIA AGFELWNMYG PTETTIWSAC GRIVDASRPI VIGEPVAHTD LYILSDDGTQ
APVGTPGELC IGGLGLAKGY VNRPDLTAAA FPVIALQDGK TVRLYRTGDL AVRLSDGGIQ
LLGRRDQQVK IRGFRIELEE IESVLRTCPE IVDAAVMVEN AGTADAALVA AFVAKPGMTV
SIDSLQQTAQ LSLPHYMVPN RFVAVAELPK TANGKLDRKA LSATIVSPSI VVSFIEKAEL
RDEPAPDVLT KVIAIVEAAL NRTGIKPDDR VFALGATSLH VFRMAARFSE AQLPIRAQDL
MTNPSMSDLA RRASVAARAQ TSDKKTPSLA EFRRPSSRAR TT