Gene Sros_4120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4120 
Symbol 
ID8667414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4579280 
End bp4587442 
Gene Length8163 bp 
Protein Length2720 aa 
Translation table11 
GC content75% 
IMG OID 
ProductNon-ribosomal peptide synthetase modules and related protein-like protein 
Protein accessionYP_003339769 
Protein GI271965573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0715236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGG TTCGTCAGGA CCGGATCGCC GAGATGGTCC GGTCCCGGTT CGCCGCCGCC 
CGGGTCGCGG CCGAAACCCC CGGCGCCGCG GTCATTCCGG CACTGTCCAC ACTGGACATG
CCGTTGTCAC CGGCCCAGGA ACGGCTGTGG TTCCTGGCCC AGCTGGAGCA GGACACCCCC
GCCTACAACG TGCCGCGGGC GCTGCGGCTG AGCGGGCCGG TGGACGTCGC GGCCCTTACC
GCCGCGGTGT CCGAGCTGGC CGACCGGCAC TGGATCCTGC GCGGCGTCAT CGACGGGGCG
AGGGTACGGC CGGCGGACGG CGTGCCCGTC TCCGTGGTGG ATGTGGACCC CGCGGCGCTG
GAGCGGGAGC TGGCCGAGCA TGCCTGGCGC CCGTTCCGGC TGGACGCCGA GCCGCCCATG
CGGGCCGCGG TGTTCCGGCT CGGCGAGGAC GAGTACGTGC TCGCGCTGAC CCTGCACCAC
ATCGCCACCG ACGCCTGGTC GGAACAGCTG CTGCTGCGGG ACCTCTCGGC CCTGTACGCC
GCCCGGCTCG GCCTGGCGCC GCAGCCGGAG CCCCCGGCCC TGCAGTACGC CGACGTGGCC
GCCTGGGAGG CCGAGCAGCC CGAGGTCGAC CTGGACTGGT GGACGCAGCG CCTGGCCGGC
CTGCCGCCGG TGCTGGACCT GCCCATCGCC GGGCCCCGCC CGGCCGTCCC CACCTGGCGG
GGGGCCGCGG TCGGGTTCGA GGTGCCGGAG TCCCTGTCCT CCAAGGTGCG CGCGGTGGCG
GGCATGACGC CGTTCATGGT GTTCCTGGCC GGCCTGCAGG CGCTGCTGTC CCGCCTGTCC
GGCAGCGACG ACATCGCCGT CGGGGTGCCG CACGCCGGCC GGCACCATCT GGACGCCGAG
CGGGTGGTGG GCTGCTTCAT CAACACCCTG GCGGTGCGCA CCGACACCTC CGGCGACCCC
ACCGGCGCCG AACTGCTGTC CAGGGCCCGT ACGGCCGCGC TGGACGCGTT CACCCACGCC
CGTACACCGT TCGAGCGGAT CGTGGAGCGG CTGCAACCGG AGCGGAACCT GTCGGTCACC
CCGCTGTTTC AGGTGATGCT GAACGTCTAC GACGCCGCCG CGCCGGTCAG CCTGGCCGGG
GTGGAGGTCC GCGCCGAACC GCTGCCCGTG CCGACGGCCA AGTTCGACCT GAACCTGACG
CTCGGCGACG AAGGGGACCG GTTCGCCGGC GAGCTGCGCT ACCGGGCCGA CCTGTTCGAG
GAGTCCACCG TCCGGCGGCT GGTGGAGTGG TATCTGGCGC TGCTGGAGGG GATGCTCACC
GATCCGGACG CACCGGTCCG GCTGCCCGCC GGCGCCGACC TGCGCGGCCC GGCCGGGGAC
CTGCCGACGG ACGTGCCATT GCACGCGCTG GTCGAGCGGA TGGCCGACGC CGGCCCCGAC
GTGACCGCGG TCGCCTCGCT CAGCTACGCG GAGCTGGACC GGCGGGCCAA CCAGGTGGCG
CACTGGCTGC TGGCCCGCGG GGTAGGGCCG CAGGAGCCGG TCGGGGTGCT GCTGGAGCGG
CGGCCGGAAC TGGTGGTCGC CCTGCTGGGG GTGCTCAAGG CCGGCGCGGC CTACCTGCCG
CTGGATCCGG TCTACCCGGC GCGACGGACC GAGGCCATCC TGGCCGACGC CGGCGCCCGG
ATCGTGCTCA CCGAGTCCGA GATCGCCGCC GCCGCGGACG GCCCCGGGCA CCGCCCGGAC
GTGGCCGTGC GGCCCGACCA CCTGGCGTAC GTGATCTACA CCTCCGGTTC CACGGGGGAG
CCCAAGGGGG TCGCGGTCGA GCACCGGCAG ATCACCCACT ACCTGGGCGC GGTGGCCGAA
CGGATCCCCG CCGGGGTGAC CTCCTTCGCG CTGGTGTCCA CCGCCGCGGC CGACCTGGGG
CTGACCAACG TGCTGTGCGC GCTGACCTCG GGAGCCACCC TGCACCTGAT CGACCACGAG
ACGGCCACCG ACCCGGTCGC GTACGCCGCC TACATGGCCG CGCATCCGGT GGACGTGATC
AAGATGGTGC CCAGCCAGCT GGAGCTGCTG GGCGTGGACG CCCTGCCCCG GAAACTGCTG
ATCCTGGCCG GTGAGGCCGT GCCGTCGGAC CTGGTGGAGC GGGTGAGGGC GGCGCGGCCG
GCCCTGGCGG TGCAGATCCA CTACGGGCCC ACCGAGACCA CGGTGTCGGT GCTGGCCTGC
GACGCGGCCG AGGTGGCGCC CGGTGTGGCC CCGCTGGGCC GGCCGCTGGC CGATGTGGAG
TGCCGGGTGG TGGACTCCGC CGGGCGGCCG CTGCCGGCCG GCGTGCCGGG AGAGTTGTGG
ATCGGCGGCC CCAGCCTGGC CCGCGGCTAT CTGGGCCGGC CCGACCTGAC CGCGCAGCGG
TTCGTGGACG GCTGGTACCG CACCGGCGAC CGGGTGCGGG TGAACCCGGC CGGGCTGGTG
GAGTTCCTGG GCCGGATCGA CGACCAGGTG AAGGTACGCG GATTCCGGGT GGAGCTCGGT
GAGGTCGCCG CGGCGCTGCG GGCCCTTCCG CAGGTGGCGG AGGCCTTCGT GCAGCCGGTC
GGCGCCGGGG CGCAGCGCCG CCTGGCCGCC TGGGTGACCC CCTCCACAGT GGACACGGCA
CAGGTGCGGG CCACGCTGCG GGAGCGGCTG CCCGACTACA TGGTGCCGCC CGCGATCGCC
GCGCTGGAGG CGTTGCCGCT CACCCCGAAC GGCAAGGTGG ACCGGGCGGC GCTGCCGGTT
CCCGAGGCCG GTTCGGCGGT GCGGGTGCCG CTGGGCACGC CGCAGGAGCA CCTGGTCGCC
GAGGTCTGGG CAGAGGTGCT GGACCTGCCA CAAGTGTGGG CGGACGACGA CTTCTTCGCC
CTGGGCGGGC ACTCCTTCGC CGCGACCCGG GCGGTCGGCC GGCTGCGGGA GCGGCTGGGC
GCGCCGGTTC CGGTGCGGCT GCTGTTCGAG CATCCGGTGC TCGCCGACCT GGCCGCCGCC
CTGCCCCGCC CGGTTCAGGT GGTCAGGGCG CGGCGGGAGC GGGCGGACGG GCCGGCCGCG
CTGTCCGGGG TGCAGGCCAG GCTGTGGTTC CTGGCGCAGC TGGAGCCGGA GAGCACGGCC
TACAACGTGC CGGTGGCGCT GCGGCTGCAC GGCCCGCTTC AGGTGGAGGC GCTGCTGGAC
GCGGTACGCG ACCTGGCCGA GCGGCACCAC GTGCTGCGCA GCGTGATCGA CGATTCCGGC
GCCGAGCCGG TTCTGGTGGT ACGGCCGGCC GGAGAGGTGC CGGTGTCCAC GGCGGACATC
GACCGGTCAC GGGTCGAGGA CGCGGTGGCC GCGCAGCTGG CCACACCGTT CGCGCTGGAC
CGGGAGCCGC CGATGAGGGC GGTGCTGTTC GCGGTCGGCG ACCGGGAGCA CGTGCTCTCG
CTCACCTTCC ACCACATCGC CACCGACGCC TGGACCCGTG GGCTGCTGCT GTCCGAGCTG
TCCGCGCTGT ACGCCGCCCG GATCGGCCTG CGCCCGACGC CGGAGCCGCC GCCGGCCCAG
TACGCCGAGG TGGCCCCGGT CCCCGACCTG GCCGACCTGG ACTGGTGGGC CGAGCAGTTG
CGCGGCCTGC CGCCGGTGCT GGACCTGCCC ACCGACCGGC CTCGCCCCGC CGTGGCCGAC
CCGGGCGGCG CCTCGGTGGA CCTGGAGCTG CCGGCGGAGC TGAGCGAGCG GGTCCGGGCG
GTGGCCACCG CCTACCGGGC GACGCCGTTC ATCGTGCTGC TGGCCGGCCT GCAGGCCCTG
CTGGCGCGGC TGTCGGCCGG CACCGACATC GCGGTCGGCG TGCCGGTCGC CGGCCGGGAC
CACCCCGACT CCGAGGGCGT GATCGGCTGC TTCCTCAACA CGGTGGTGGT CCGGACCGAC
GTGGGCGGCG AACCGACCGG CCACGAACTG CTGGCCCGAG TCCGGGAGAC CGCGCTGGGC
GCCTTCGCCC ACGCGAGCGC GCCGTTCGAC CGGGTGGTGG ACCGGCTGCG GCCCGAGCGG
AACCTGGCGG CGACGCCGCT GTTCCAGGTG ATGCTGAACT ACTTCCCCGA CACGGGCCGG
CCGGAGCTGC CCGGCCTGGA GGCGGCCGAG ATCCACCTGC CCGAGCAGAC GGCGAAGTTC
GACCTGAACT GGCATGTGAT CGACAGCGGG CCCGGCCGGC CTCTGCGCGG CGGGCTCGGC
TACCGTACCG ACCTGTTCGA CGGCGCCACC GCGGCCCGGT TCACCCGGTG GTACCTGGCG
CTGCTGGACG GGATGCTGTC CGACCTGGAA GCGCCTGTGG GCGCGCAGCC GCTGGAGCCG
GTCACCGGCC CGATCCTCGC CGGCGAGCCG CTGCCGGCCG TGGCGGACAC CCCGGTGCAC
CGGCTGATCG AGCGCTGGGT GGACACCACC CCCGACGCGC CGGCCGTGGT GGGCGCGGAC
CGCGGCCTGA CCTACGCCGA GCTGGAGACG GCCGCCAACC GGATCGCCCA CTGGCTGCTG
GCCGCCGGGG TGGGCGCCGA CGAGCCGGTC GGCGTACTCC TGGAGCCGGG TGCCGACCTG
GCCTGCGCCC TGTTCGGGAT CCAGAAGTCA GGCGGCGGCT ACCTGCCCAT GGATCCGGCC
TATCCGGCTG CGAGGATCGC CACCATGCTC GACGCCGCCG GGGTGCGGGC CGTGGTCACC
ACGGCCGAGT TCGCCGGCCT GATCGGGCCG GACCGCTGGG TGCTGGCGCT GGACCGGCTC
CCGTCCCTGC CGCGGACCCG GCCGGAGGTC GACGTCCGGC CCGAACACCT GCATCACGTG
ATCTTCACCT CCGGGTCCAC CGGAACGCCC AAAGCCGTGG CCGCCGAGCA CCGCGGCGTG
ATGAGCTACC TGAACGGCAT GCTGCCGCGG ATCGGCGTGC CCGGCGGGTC GTACGCGGTG
GTGTCCACAC CGGCCGCCGA CTTCGGGCTG ACGTGCGTGT TCGGCGCCCT GACCACGGGC
GGCACGGTGC ACCTGGTGCC TCGGGAGACC GCGATGGATC CCGCGGCGTT CGCCGGTTAC
CTGAGCGCGC ACCACGTCGA CGTGGTCAAG TGCGTGCCCA GCCACCTGGA GCTGCTGGCC
TCCGGCGGAG ACCTGGCCGC GGTGCTGCCG GACAGGCTGC TGATCCTGGC CGGGGAGGCG
TGCCCGTGGG ACCTGGTGGA GCGGGCCAGG GCAGCGCGGC CGGGCCTGCG GATCCAGAGC
CACTACGGGC ACACCGAGTC CACGATGATC TGCCTGGTCT GCGACACCGA GGAGATCGCG
GCCGAGCACC GGACCGGGAT CGTGCCGCTG GGCCGGCCCC TTCCGGGGGT GTACGGGCAC
CTGGTGGACG CGAGCAGGCG GCCGGTGCCG GCCGGGGTGC CGGGCGAGCT GGTGGTCGGC
GGCCCGGGGG TCACCCGCGG CTACATCGGC CTGCCCGAGC TGACCGCGGA GCGGTTCGTG
CCCGATCCGC TGACCGGGCA GGGGCGCTGC TACCGCAGCG GCGACCTGCT GCGGGTCACG
GCCGACGGGC GGGTGGAGTT CCGCGGCCGG GTGGACGACC AGGTCAAGGT GCGCGGCTAC
CGGGTGGAGC TGGGCGAGGT CACCACCGCG CTGCGCGCCC TGCCGCAGAT CGCCGACGCC
GTGGTGCTGC CGGTGGGTGA GGGCAAGGCC CGGCAGCTCG CCGCCTGGGT GACGCCGTCC
ACTGTGGACA CCTCGGCGAT CCGGTCCGCG CTGCGGGAGC GGCTGCCCGA CTACATGGTC
CCGGCCCAGT TCGTCGTACT CGACCGGATC CCGCTCAACC CAAACGGGAA GGTGGACCGG
GCCGCCCTGC CCGAACCGCG GCCGGAGACC GCCGAGTTCG TGCCGCCGTC CACGGCCGGC
GAGGAGCTGG TCGCCCGCGC ATGGGCCCAG GTGCTCGGCG TGGCGAGGGT CGGGGCGCAC
GACGACTTCT TCGCCCTGGG CGGGGACTCC TTCGCCGCGG TGCGCGCGGT CAAGGAGATC
GGCTGCGGCC TGCGGGTGAT CGACCTGTTC ACCCGGCCCA CGGTCGCCGA GCTGGCCGCG
TTCCTGGACC GCCGGGACGG CGGCGGGCTG CTGCACCGCC TGGGCGGTGG CCGCACCAGC
GAGTTCACCC TGGTGTGCCT GCCGTACGGC GGCGGGTCGG CGGCGGTGTA CCAGCCGCTG
GCCTGGGCGC TGGGCGAGCG GGTGGAGGTG CTCTCGGCGG AGTTGCCCGG CCACGATCCG
GCCCGGCCCG ACGAGCTGCC GCTGCCGCTG GAGGAGCTGG TGGAGGCCCT GTCCGCGGAG
GTGGCCACGA CCGCGTCCGG CCCGATCGCG ATCTACGGTC ACTGCGTGGG CTCCGCGCCC
GCGGTGGCCC TGGCCCGCAG GCTGGAGGCG GACGGGATCC CCGTGCTGGG CGTGATCGCG
GCCGGCAGCT TCCCCACCGC CCAGCTGCCC GGCCTGGCCC GGCGGATCTT CCGCAGTGAC
CGCTGGGTCT CCGACCGGAT GTTCCAGGAC GCGCTGCGTG CCACGGGCGG GCTGCTGGAC
GACATGGACG AGGCCGCCAA ACAGGTCGCG GTGCGGGCGA TGCGCCACGA CGCCGACCAG
GCGCAGGAGT GGTTCAGCCG CGAGCTGACC GGTGGGGGGC CGCCGCTGCG CGCCCCGATC
CTGTGCGTGG TGGGGGAACG GGACCGGGCC ACCGAGCTGC ACCAGGAGCG GTACGCCGAG
TGGACGGCCT TCGCGCCCAG GGTGGAGCTG GCCGTACTCC CCCATGCCGG CCACTACTTC
CTGCGGCACC AGGCCGAGCC ACTGGCCGCC CTGGTCATCG AGCACCTGCG GAGCTGGGCG
GCCGGTCGGC TGCCCGATCC GGTGCGCCCG CCCGACCGGA CCGGCCTGCG CCCCTTCTAC
ACGGTCGCGG GCGGGCAGTT CGTCTCGGTG GTGGGCACGG CGCTGAGCTC GTTCGCCCTC
GGCGTCTGGG CCTACCAGGA CAGCGGCCGG ATCCTGGACC TGGCCCTGAT CGTGATGCTG
TCCCAGATCC CGGCCGTGCT GCTCACCCCG CTGGGCGGGG CGCTGGCCGA CCGGGTGGAC
CGGCGCCGGA TCATGCTGGT CAGCGACGCG GTCTCCGGGC TGGCCATGGC GGCGCTGGTC
CTGCTGCTGG TCACCGACCG GCTGGCGTTG TGGAACGTCT GTCTCATCGT CGGTGTCACC
TCACTGGCCA CCGCGTTCCA GCAGCCCGCT TATCTGGCCG CGATCGCGCA GCTGGTGCCC
AAGCCGTACC TGCCGCAGGC CAACGCCGTG GCCAACCTGG GTTTCGGGAT AGGCAACGTG
GTGGCCCCGC TCGCGGGCGG CGCGCTGATC GGCATGTTCG GGCTGTCCGC GGTGGTGGCC
ATCGACGTGG CCTCGTTCGG GGTGGGAGTG GCGACCCTGC TGGCGGTCCG GTTCCCCGAC
CGGCTGTTCC ACCGGCAGGA GGAGACGTTC CGGGCGGCGC TGACCGGCGG CTGGCTGTTC
CTGCGCCGGC GCCGGCCGCT GCTGGTGATG GCCGTCTACT TCGCGGTGGT CAACTTCTGC
ACCGCGCTGA TGTGGGTACT GATCACCCCG GTCGTGCTGG CTCTCGGGTC CTCGGCCGCC
CTGGGCGCGG TGACCGCGGT CGGCGGCCTG GGCGCGGCCG TGGGCACCGC GGTGGTGCTG
GTGTGGGGCG GGACCCGGCG CCGGGCCACC GGCATGGTCG GCTTCGTGAT CGGGTCCGGG
ATCGGCGTGG TGCTGATGGG CGTGTGGCCG GCGCTGTGGC TGGTCGCGGC CGGCCTGTTC
CTCCGGCTGG CCTGCATGAG CATCGGCAAC GCGCACTGGC TGTCCATCAT CCAGGTGAAG
GTGGGGCCGG AGCTGCAGGG CCGGGTTCTG GCCGTCAACG TCATGCTGGC CACGGCCATG
CAGCCGCTGG GCTTCCTGGC CGCCGGACCG CTGGCCGACT GGGCCCAGTC GTACACCTCC
GGCCCCGGCC GGGGCGCGGC GGCGGTGCTG CTGGTCAGCG GCGTGTTCCT GGTGGTCTGG
GGGGTGATCG GGTTGCGTTA CCGCCCGCTC CACCACCTGG AGGACCTGGT CCCCGACGCC
GCGCCGCCGC CCGAGGCCGA GGCCGACCTG GACGCCATCC AGGCCAAGGT GCTGAGCGGA
TGA
 
Protein sequence
MTEVRQDRIA EMVRSRFAAA RVAAETPGAA VIPALSTLDM PLSPAQERLW FLAQLEQDTP 
AYNVPRALRL SGPVDVAALT AAVSELADRH WILRGVIDGA RVRPADGVPV SVVDVDPAAL
ERELAEHAWR PFRLDAEPPM RAAVFRLGED EYVLALTLHH IATDAWSEQL LLRDLSALYA
ARLGLAPQPE PPALQYADVA AWEAEQPEVD LDWWTQRLAG LPPVLDLPIA GPRPAVPTWR
GAAVGFEVPE SLSSKVRAVA GMTPFMVFLA GLQALLSRLS GSDDIAVGVP HAGRHHLDAE
RVVGCFINTL AVRTDTSGDP TGAELLSRAR TAALDAFTHA RTPFERIVER LQPERNLSVT
PLFQVMLNVY DAAAPVSLAG VEVRAEPLPV PTAKFDLNLT LGDEGDRFAG ELRYRADLFE
ESTVRRLVEW YLALLEGMLT DPDAPVRLPA GADLRGPAGD LPTDVPLHAL VERMADAGPD
VTAVASLSYA ELDRRANQVA HWLLARGVGP QEPVGVLLER RPELVVALLG VLKAGAAYLP
LDPVYPARRT EAILADAGAR IVLTESEIAA AADGPGHRPD VAVRPDHLAY VIYTSGSTGE
PKGVAVEHRQ ITHYLGAVAE RIPAGVTSFA LVSTAAADLG LTNVLCALTS GATLHLIDHE
TATDPVAYAA YMAAHPVDVI KMVPSQLELL GVDALPRKLL ILAGEAVPSD LVERVRAARP
ALAVQIHYGP TETTVSVLAC DAAEVAPGVA PLGRPLADVE CRVVDSAGRP LPAGVPGELW
IGGPSLARGY LGRPDLTAQR FVDGWYRTGD RVRVNPAGLV EFLGRIDDQV KVRGFRVELG
EVAAALRALP QVAEAFVQPV GAGAQRRLAA WVTPSTVDTA QVRATLRERL PDYMVPPAIA
ALEALPLTPN GKVDRAALPV PEAGSAVRVP LGTPQEHLVA EVWAEVLDLP QVWADDDFFA
LGGHSFAATR AVGRLRERLG APVPVRLLFE HPVLADLAAA LPRPVQVVRA RRERADGPAA
LSGVQARLWF LAQLEPESTA YNVPVALRLH GPLQVEALLD AVRDLAERHH VLRSVIDDSG
AEPVLVVRPA GEVPVSTADI DRSRVEDAVA AQLATPFALD REPPMRAVLF AVGDREHVLS
LTFHHIATDA WTRGLLLSEL SALYAARIGL RPTPEPPPAQ YAEVAPVPDL ADLDWWAEQL
RGLPPVLDLP TDRPRPAVAD PGGASVDLEL PAELSERVRA VATAYRATPF IVLLAGLQAL
LARLSAGTDI AVGVPVAGRD HPDSEGVIGC FLNTVVVRTD VGGEPTGHEL LARVRETALG
AFAHASAPFD RVVDRLRPER NLAATPLFQV MLNYFPDTGR PELPGLEAAE IHLPEQTAKF
DLNWHVIDSG PGRPLRGGLG YRTDLFDGAT AARFTRWYLA LLDGMLSDLE APVGAQPLEP
VTGPILAGEP LPAVADTPVH RLIERWVDTT PDAPAVVGAD RGLTYAELET AANRIAHWLL
AAGVGADEPV GVLLEPGADL ACALFGIQKS GGGYLPMDPA YPAARIATML DAAGVRAVVT
TAEFAGLIGP DRWVLALDRL PSLPRTRPEV DVRPEHLHHV IFTSGSTGTP KAVAAEHRGV
MSYLNGMLPR IGVPGGSYAV VSTPAADFGL TCVFGALTTG GTVHLVPRET AMDPAAFAGY
LSAHHVDVVK CVPSHLELLA SGGDLAAVLP DRLLILAGEA CPWDLVERAR AARPGLRIQS
HYGHTESTMI CLVCDTEEIA AEHRTGIVPL GRPLPGVYGH LVDASRRPVP AGVPGELVVG
GPGVTRGYIG LPELTAERFV PDPLTGQGRC YRSGDLLRVT ADGRVEFRGR VDDQVKVRGY
RVELGEVTTA LRALPQIADA VVLPVGEGKA RQLAAWVTPS TVDTSAIRSA LRERLPDYMV
PAQFVVLDRI PLNPNGKVDR AALPEPRPET AEFVPPSTAG EELVARAWAQ VLGVARVGAH
DDFFALGGDS FAAVRAVKEI GCGLRVIDLF TRPTVAELAA FLDRRDGGGL LHRLGGGRTS
EFTLVCLPYG GGSAAVYQPL AWALGERVEV LSAELPGHDP ARPDELPLPL EELVEALSAE
VATTASGPIA IYGHCVGSAP AVALARRLEA DGIPVLGVIA AGSFPTAQLP GLARRIFRSD
RWVSDRMFQD ALRATGGLLD DMDEAAKQVA VRAMRHDADQ AQEWFSRELT GGGPPLRAPI
LCVVGERDRA TELHQERYAE WTAFAPRVEL AVLPHAGHYF LRHQAEPLAA LVIEHLRSWA
AGRLPDPVRP PDRTGLRPFY TVAGGQFVSV VGTALSSFAL GVWAYQDSGR ILDLALIVML
SQIPAVLLTP LGGALADRVD RRRIMLVSDA VSGLAMAALV LLLVTDRLAL WNVCLIVGVT
SLATAFQQPA YLAAIAQLVP KPYLPQANAV ANLGFGIGNV VAPLAGGALI GMFGLSAVVA
IDVASFGVGV ATLLAVRFPD RLFHRQEETF RAALTGGWLF LRRRRPLLVM AVYFAVVNFC
TALMWVLITP VVLALGSSAA LGAVTAVGGL GAAVGTAVVL VWGGTRRRAT GMVGFVIGSG
IGVVLMGVWP ALWLVAAGLF LRLACMSIGN AHWLSIIQVK VGPELQGRVL AVNVMLATAM
QPLGFLAAGP LADWAQSYTS GPGRGAAAVL LVSGVFLVVW GVIGLRYRPL HHLEDLVPDA
APPPEAEADL DAIQAKVLSG