Gene Gura_4340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4340 
Symbol 
ID5162737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp5021149 
End bp5030214 
Gene Length9066 bp 
Protein Length3021 aa 
Translation table11 
GC content62% 
IMG OID640551821 
Productglycosyltransferase 36 
Protein accessionYP_001233056 
Protein GI148266350 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTTCC GCCGCGCCGT ACTGGTGCAC AAGGCCGCGA ACGGAGATGT TCGCATAACT 
GATCCCTTCC TCTGGCGCCG TGCTTACGGC GTGACCCTGG CTGTTATCCT GTCCGGCGGG
CTCGCCGAAG TCGCTTCCCT CGTCCTGCAC CGGTTGGGCT TGCTTCCGAG CGGGAACCTT
TTTACGCTGA CTTTATTCCC GGCAGTGGCC TCAATCGGCG CCCTGGCAGC TCTACTATGG
CTACGACGCT CCAGGTACGG TGTTGAACCG TGGGTCATGC GTGATCATGC CCGCTGGCTC
GTGTCCGGGG AATCCGTCTT GATTCTCCAG GCACCGGTCG AGTCGTTACA GCGTCCGGTG
GCAATGATGC GGGAAAGCAG CGATATCCCT CCGGCGCTGT TCGTCCTGCA TCCCAAGGCT
GAGCGACGGC TAGAGGCTCG GGGGCCCGCG ATAAAGCTCT CCCCGACACA GATTCAGGAG
CATGCCCGGC GCCATGCCGG CGAACAGCAA GTGGACACAA GGCCCCAGCA TACTACCGAG
CTGCTCAAGC GTCTCAAGCA GTCACGGAAA TGGGCCCATC AAGTCTGTGT GGATCTCACC
GCGGCGAGCC GCCTGGAACA GAAAGCCACG CCGGCCGCCG ACTGGATCCT TGATAACGAG
TACATCCTGG AGGGTAACGC CCGCGATGTC CTGCTTAACC TTCCCCGGCG CTTTTATCAG
CAGCTGCCCG CACTGGCCTC TGATCCCCAC CGGGGGTTGC CGTGCATCTA CGGTCTGGCC
AAGAATCTCG TCTCCCATAA CGAACTGCGC CTGGACCGGG AAAATATCCT GGCGTTCATC
GAGGCCCACC AATCCATACG CACGCTGACG ATAGGCGAAC TCTGGGCAAT TCCCCAGATG
CTGCGCGTTG CACTTATCGA AAGCATCCAG AGTCTGGCCG TTACTGCCCT GGCGGACCTG
CGCGAACGTC AGTTGGCCGA CTTCTGGGCG AACCGGCTGA TCGCTGCCAA TCGCCGCGGT
TCCAACCAAC TCTTCGCGAT CCTGGCGGAA CTTGCGCAAG CGGAGCCGTC TCCCACCCCG
TATTTCGGTA GCCAGCTGGT CGGCCTGCTC TATGACGAGG CGGCAGCATT GGCGCCGGTC
CAGAGCTGGC TTGAGCGCAC GCTCAAAACA CCCCTGCACG ACCTCAACCT GCGTGAGCAG
AACCGGCAGA CCAGGGCGCA GTTGTCTTGC GGCAACGCCT TCACCAGCCT GCGCCAGCTG
GCCCTGCTGG ACTGGCGGGA GATCTTCGAG AAGCTCAGCC GGGTGGAACA GATGCTGCGC
CGCGATCCGT CCGGGGTCTA TGCCGGGATG GATTTCGCAA CCCGTGACCG CTGTCGTCGG
GCGATTGAAG AGGTCGCCCG CGCTGCTGCT CAAAGCGAGG AGCAGGTCGC GGGGCGTGTC
ATCGAGCTGG CAACGCAGGC CCACCGTGGA TCAGCAACCG ATAAACGGGC TAGCCACGTC
GGCACCTGGC TGGTCGGGGA GGGGCGCGCG GAACTGACCC GGCTGCTCGC CTGCCGTGAA
ACGCACCGTT ACCGCGCCCT GGAATGGGTC TATTGCCACC ATGCCGCTGT TTATGCCTTT
GGCATCGGCG GCTTCTCTGC TTTGCTCTTA TCTTTGGTCG GATCAGCCGG ATCGGTCCGA
CTGGCAGGAC TTTTTCTGCT CCTGCTGATC CCGGTCAGCC AGCTGGCGAT CGAGGTGGTC
AATTACCTGA TCACGCGATT TCTGCCGCCC CGGCCCTTGC CCAAGATGGA TTTCGAGGAT
GCGGGGATTC CCGATGCATT CCGCACCCTG GTGGTCGTGC CGATGATGCT GGTGAACGCT
GAGACGGTAC GAGCCGAGGT GGAGAAACTG GAGATCCGCT ACCTGGCCAA CAAGGAAGCA
AATCTGCTCT TCAGCCTGTT CACTGACTAC ACCGATTCTG TTACGCTTTC GCGTGAAGAC
GACAGTCGGC TGCTCAAGAC CGCGAGCGGA TGTCTGGCAG AGCTAAATCT TCGCCATGGT
GCTGAGCGAT TTTTCCTCTT CCATCGGGAA CGCACCTGGA GCGAATCCGA GCAGAAATTT
ATCGGCTGGG AGCGTAAACG GGGGAAGCTG GAGGAACTGA ACCGCCTGAT CGACGGCACG
CGGCCGGAAA CCTCAGCACG CCTGGTCCAT GTGGGCGACC CGGATCATCT TACGGACGTT
CGCTTCATCA TCACCCTGGA CAGTGACACC CAGTTGCCGC ACGCTACGGC CCGCAGGATG
GTCGAGACCC TGGCGCATCC CCTCAATCAG CCGCGCTTCG ATGGGTTGGG TAACATCTTG
GCCGGCTCCT ACACCATCAT CCAGCCAAGG GTGAGCCCGA CCCTGCCGAG CACGAGCGTC
TCGATCTTCA GCCGGCTCTT CGCCGATGCG GTCGGCATCG ACCCCTATAC CCAGGCGGTC
TCCGATGTCT ATCAGGACTT AAGCGGAGAA GGATCGTACC ATGGCAAGGG GATCTATGAC
GTGCGTGCCT TCAGTCGCGT GCTGTCGGGA CGGTTCCCGG ATGAGTGGGT GCTGAGCCAC
GACCTGATCG AAGGGGCTCA TGTCCGGGTG GGACTGGCCA GCGATATTGA GCTCTACGAT
GAATTCCCCC AGGGTTACCA GAGTTACAGC AGCCGTTCCC ATCGCTGGAT TCGCGGCGAT
TGGCAGATTG CGGGGTGGGT TTTCCCGCGC GTTCCACAAA TTTCAGGCGG TCACGGGGTC
AACCCGCTTT CCCTGCTCAA CCGCTGGAAG ATCTTCGACA ACCTGCGCCG CAGCCTGTTG
CCTGCCACGA GCCTGGGCCT GCTGATGGCC TCCTGGCTCA TTTCTCCGAG AACCGGCGGG
ATCGCCGCGC TGGTGGTTGG CATGCAGCTC CTCTTTCACC CCTTGGCGCA GCCCTTCACC
ATGGCTACCA CCCGCAAGGG GTGGAAATAC TTCTCCCCAT CCAAGCTTCT GCACGACCTG
CTGCGGGCAA CCGCAGATGC TGCGCTGTTG CCGCACCAGG CAGCGGTGGC CCTGGATGCC
ATCGCCCGGG TCTGCTACCG TCGCTTGATC TCCCGACGCG GGCTTTTGGA GTGGACGGCC
CAGGCAACCC ATTGGAGCGC CTCCCGCCGG CAACCGCTCT TCGTCGCCAG CCTGGCCATG
GGGAGCATCT TCAGCGCCAT CGTGGGCGGG GCGATCTGGC GCGTTATGCC GGCCAGCCTC
CCACAGGCCG CCCCATGGCT CGTTTTGTGG TTTCTTTCCC CGCTGCTCGG CTGGCTCCTG
AACCTGCGAC CCGTTGAACA GCAGCAGGCA CAACCGCTGC CGGAAACGGA CCGCCGCTTC
CTCAGACAGG TCGCCCGGCG GACCTGGCGC TATTTTTCGT CCTTTATCAG CGCCGACACC
TCCTGGTTGC CGCCCGATAA CTATCAGGTC TCCCATCAGA ACCGCCTGGC CATGCGAACC
AGTCCGACCA ACATCGGCCT CTGGATGACC AGCGCCTTGG GAGCCCATGA CTCCGGCTAC
CTGACCATTA ACCAGGTCGT CGAAAAGCTG ACCAACACCA TGGCAACCAT CGGCCGTATG
GAGCGCTATG AAGGGCATCT CCTGAACTGG TACGATATCC AGACCCTGGT TCCGCTCGAA
CCTCGCTACG TCTCCACCGT TGACAGCGGA AACCTGCTGG GTGCGCTGTG GGCACTGGAG
CAGGGGCTTG ACGAACTGCT GCACGTCCCC CTCCTGGATG GCAAGGCCTT CGCCGGCCTG
GCCGACACCG GCGAAATTAT GAAACAGGAC GCCGTCTTAG AAGGGATAAC CGGCATTTAT
CGCCAGACAC TCGACCAACT ACTGATTGAG TGGCATGCTC CGCCTTCCGG CATCGACGAA
CTGCTATGCC TGCAACGTCG GATGCAGATC AATGTCAGGT CTGTTGTCGC CGCCGCTGGG
GCTGTCCCAT GGGCCGCTGA GCTTGAACAA CAGGTTTCGG CCTGGGTTCA GAACACAGAC
CGCTATCTCA CCTGGATCGA AATCCTGGCT GAAAAAACAG AGCAGGAACT CACGCAGTTT
GGGCCTGCGG CCATGCTTGC CATCCGTCAG GACCTGATGC AGGCGCCATC GCTCTTCGCA
CTTGCCCATG GCCGGCTCGG CTCGATCCAG ATCCTCAAAG CGATCCGGGA GGACTCTCTC
CAGGCCGGTG CACCTATCTG CCCGTGGCTC GACCGGGTTA TCGAGGCCTT CGCCACTGCG
CAGTGGCTGG CCGGGGAAAC CCTGGGGATG GCTGAGCGGT TGATAGTCAA CGTCCGCGAG
CTGTCAGCCG GGATGAGCAT GCGATTTCTC TATGACATCA AGAGCAAGCT GTTCGCCATC
GGCTACAACG TCTCTTTGAA TCGTCCGGAC GTTTCCAGTT ATGATCTCCT GGCCAGTGAG
GCGCGGCTCG GCAGTTTCGT CGCCATTGCC CGGGGCGACG TCCCCCTGGA ACACTGGTTC
TCCCTGGGTC GCCCCTACGG CGCTATCGGC CGGCAGCGGG TGTTGCTCAG CTGGACCGGG
ACCATGTTCG AATATCTGAT GCCGCTCTTG TTCCAGTGCT CCTACGGCAA CTCGCTCCTG
GACAAGGCCG CCCGGGAAGC AGTGACGGTT CAGATCGCCT ACGGCCGCAC GCGGCGTGTG
CCGTGGGGTA TTTCCGAATC CGCTTTTGCC GATCTTGACC TGGACAAGAC TTATCAGTAC
AAGGCCTTCG GCGTGCCAGC GCTCGGTTTG AAACGCGGCC TGGAGGAAAA GCTGGTCGTC
GCTCCCTACG CCACCCTGCT CGCACTGAAC GTGGCGCCGA AAGAAACCGT GCAGAATCTG
AAACAACTGG CCGGGCTGGG GCTGCTCGGT GATTACGGTT ATTACGAGGC CATGGATTTC
AGCCGGCAAC CGCAGCGTGA GGGGAGACGC GGTGTTGTAA TCGAGGCGTA TATGGTTCAT
CACCAGGGGA TGGCGTTCCT GGCGTTGACC AACTTCCTCC ATGGCAACCC GTTTCCGCGC
CGTTTTCATA GCGATTCGCG GGTGCGTGCC TTTGAGGCCC TGCTTCAGGA GCGCATCCCG
ACCCTGCCGC CGTTACACCT GATCTCGACG CGGCAGAGTG AACCTCTGCT TCCGGGCGGA
GACTTGGTCG CGCCGGCAGG AAGCACCTTT ACCACTCCCC ATACCACCAC GCCCAAGAGT
CTCTTGCTCA GCAACGGCCG CTATGGCTTG ATGATCACCA ACAGCGGCGG CGGCTACAGT
CAGTGGGAGA GCCAGGAACT TACCCGCTGG CGGTCGGATC AGACCTGCGA TAGCCAGGGA
ACCTTCTGCT ACATCCATGA GGCCGATCCG GATCGTGTCT GGTCCAGCAC CTGGCACCCG
GTTGGCGGGA AGGTTGAAGG GTATTCCGTA GACTTTGCTC TCGACCGGGC CGTATTCCGC
CGTGCCGATA ACGGTGTCCA CACCGAAACA GAGGTGATCG TCTCGCCGGA AGACGATTTG
GAGGTGCGCC GGATCACCCT GGTCAATCGC ACCAATCGCG TCCGCCGCCT CAACCTTACC
AGTTATGTTG AACTCTCCAT GGCGCCTCAC AATGCGGACC GTCAGCACCC GGCCTTCAAC
AAGCTGTTCA TCCAGACCGA AGCACTCCCC GAGCAGCAAA TACTCCTCGC CTACCGGCGA
CCGCGCAGCG CGAACGAACT GCCGCTGTAT GTGGCCCATT GCTTGACGCT TGAGCACACG
GGGGATGACA GCCCGCACGA AAATGTGTGG CAGTTTGAAA CCGACCGGGG GCGGTTCATC
GGCCGCGGCC GGACCCTGGC CAATCCCATG GGAGCCGTGC AAGAGCTCGG CAACAGTCAG
GGTTTTGTCC TTGATCCTAG CTTAAGCCTG CGACGGAGTC TCACCCTTGA GCCGGGCCAG
CGCGCCCGAC TTTCCCTGGT ACTCGCTGCC GGGGCAACCC GCGAGCAGGT CCTGCTGCTG
ATGGATAAGT ACAGCGACCC CCATGCGACC GAGCGGGCCA TGGATTTCGC CTGGCGCTCG
GCCCAGCAGC AGTTGCAGGT GCTGCACATC CAGCCTGACG AAGCGCGTCG CTTCCAGCAA
CTGGCAAGTC ACCTGCTGTT TCCCAATCAC CTCTTGCGGC CACCAGCCGA ACGCCTGGAA
GAGAACCGCA AGGGCCAGGC AGGATTGTGG CCCTATGCCA TCTCGGGCGA CCTGCCGATA
GCGCTGATCA CCATTAGCGA GGCGCGGGAT ATCAGCCTGG TCCGCCAGAT GCTCCAGGCC
CACACCTATT GGGGTTCGCA TGGCTTGGCG ACCGACCTGG TGATCCTCAA CGAGGAGGCC
GGGGGTTATG AGCGGCCGCT CCAGGAACGG CTGGAGCAGT TGATCCAAGC CCATACCCTT
TCTGCTGCGG CTGATCGGGC GGGGGGGATC TTCCTGAAGA GTGCGGCGCA GATCCAGGAG
GCGGATTTGA ATCTCCTCAA GGCCGCAGCC AGCGTCGTGC TGGTGGCGGC ACGCGGCACG
CTACCCCAGC AATTGGGGAT GCCGGTGGAG GCTCCCGAAC TGCTGGAAAC GCTGGCCCGG
AAGCGTGCTC CCCGGGAGCC CTCGGCCCCT CTCCCCTTCA TGGAGCTGAA CTACTTCAAC
AGCCTGGGCG GTTTTACCCC GGACGGGCGT GAATACGCGA TCTATCTCGG CCCGAACACC
AACACCCCGG CACCTTGGGT GAACGTCATC GCCAACCCGA CCTTCGGCAC CCTGGTCAGC
GAAACCGGGT CCGGTTTCAC CTGGTATGGC AACAGCCAGC GCAATCGCCT GACCGGCTGG
TCGAATGACC CGGTGCTGGA CCCGGCAACA GAGGCGCTCT ATATCCGTGA CGAGGAGAGT
GGCGTCTTCT GGTCGCCGAC CGCGGCGCCG ATCCGTGAGG AGACTGCCTA TCGGGCTCGA
CACGGGGCGG GCTATACGGT CTTCGAACAT AACAGTAACG GCATCGAGCA GGAACTGACC
GTCTTCGTAC CGGTAGATGA GAGCGGGGGG AAGCCGATCA AACTGCAGCG CCTGCGGCTC
ACCAATGCCT CCTCCAGAAG GCGCCGACTC TCGGTAACCT ATTATGTGGA ATTGACCCTC
GGCGAAAACC GCGAAACCTC CCAGATGCAC GTCATGACTA ACTGGGATGA CGAAGCCCAT
GCCTTGCTGG CGCAAAACCG CTACCACCCC GAGTATGGAG AGCGGGTCGC CTTCGCGGCC
ATCACTCCCC ACGCCGACTC ATACGGCGGC GACCGCACCT CTTTCATCGG CCGCAATCGC
TCGCTGGCTA ACCCGGCGGC CCTGGAGTTG ACCCGGCTGT CACAACGGAC CGGGGCGGGG
CTCGACCCGT GCTCGGCACT CAGGGTTTGC CTGGAATTGG CTCCCGGCGA GCGACGTGAT
ATCACCTGCA TGCTGGGCCA GGCCGGGTCG GTAGTGCAGG CCAGGGAGCT GGTGTTCAGC
TACCGGGAGG ATCAGGCATT TGAGGACGCA TTCGATCAGA CCAGGGCCTG GTGGGATGCA
CTGCTCGGCA CGATTGAAGT GCACACCCCC GAACTGGCCG CCGACCTTCT GATCAACCGC
TGGCTTCAGT ACCAGTCCTT GAGCTGCCGT ATCTGGGGAC GTTCAGCCTT TTACCAGTCA
GGCGGCGCCT TCGGTTTTCG CGATCAGTTG CAGGATGTCA TGGCGTTCCT TTACGCCAGC
CCCGACCTTG CGCGCGACCA GATCCTGATG GCCGCCAGCC GGCAGTTCAA GGAGGGGGAT
GTCCAGCATT GGTGGCATGA ACCTGCCGGC GCCGGGATTC GTTCGCGTAT TTCCGACGAC
CTGCTTTGGC TTCCGTACGT GGTCGCACAG TACGTCCGAA CGACCGGCGA TTCCGACATT
CTACAGGTGG AGGTTCCCTT CCTCAACGCC CCCCAACTGG CAGACGATCA GCACGAGGAA
TTCTCCACCC CGGAGGTCAC CTTTGAACGC GCCACGCTCT TTGAACACTG CCGGCGCGCG
GTCAGCCGCG GCCTGACGAT TGGGCCCCAT GGCTTGCCCC TGATGGGGAC GGGGGACTGG
AACGACGGGA TGAATCTGGT GGGCGCGGCA GGGAAGGGGG AGAGCGTATG GCTCGCCTGG
TTCCTGTGTG ATTCTCTGCA GGGGATGGCG GAACTGTCGA GCCTCCTGCA ACAACCTGAG
CTGAGCCGGA CCTATCAAGA AGAACGGATA GCTTTGATTA AGCGGGTCGA GCAGGCCGGT
TGGGACGGGG AATGGTATCT GCGGGGGACT TTCGATGATG GCACCCCGCT TGGCTCCGCC
CTGAACAGGG AGGCGAGGAT CGATTCCCTG CCCCAATCGT GGGCGTGGCT TTCCGGCGCA
GCCGACCCGG AGCGCGCGGA TCAGGCACTG GAGTCGGCCT GGAACCATCT CGTCCGCGAG
GATGAAGACC TGGTGCTGCT CTTCGAGCCC CCCTTCGACA TTGCAGAGCC ATCACCAGGA
TACATCAAGG GGTATCCCCC CGGGGTACGG GAGAACGGGG GTCAGTACAC CCACGCTGCC
CTCTGGATGG CCATGGCCAT GGCCCGCAAG GGGGATGGGG GGCGGGCGGT GCAGCTGCTG
CGCATGCTCA ATCCGATCGA GCACGCCCGC GATGCGGCAG CGGTCTGGCA CTACGGGGTT
GAGCCCTACG TGGTCGCGGC CGACGTCTAC CGGTTGCCCG GCCGGGTTGG TCAGGGGGGG
TGGTCCTGGT ATACCGGTTC GGCAGCATGG ATGTACCGGG CCTGGGTAGA AGAGGTGCTG
GGCTTACAGG TGCGGAGTGG GCAGCTGCGG GTGAACCCGG TCATCCCCGC AGCGTGGCCG
GGATTCAGCA TCAGTTATTG TCACGGTGAA ACGATCTACG CGATCCAGGT GGAGAATCCC
CACGGCTGCG AGCGCGGTGT CGCCTGGGTG GAGATGGACG GCCAGCGCGT GTCCGGTGGA
GTGATCCCCT TGGAGCGGGG ACTGGTAAAG CATCAGGTTG TCGTCCGGAT GGGGAATCGG
GAGTAG
 
Protein sequence
MGFRRAVLVH KAANGDVRIT DPFLWRRAYG VTLAVILSGG LAEVASLVLH RLGLLPSGNL 
FTLTLFPAVA SIGALAALLW LRRSRYGVEP WVMRDHARWL VSGESVLILQ APVESLQRPV
AMMRESSDIP PALFVLHPKA ERRLEARGPA IKLSPTQIQE HARRHAGEQQ VDTRPQHTTE
LLKRLKQSRK WAHQVCVDLT AASRLEQKAT PAADWILDNE YILEGNARDV LLNLPRRFYQ
QLPALASDPH RGLPCIYGLA KNLVSHNELR LDRENILAFI EAHQSIRTLT IGELWAIPQM
LRVALIESIQ SLAVTALADL RERQLADFWA NRLIAANRRG SNQLFAILAE LAQAEPSPTP
YFGSQLVGLL YDEAAALAPV QSWLERTLKT PLHDLNLREQ NRQTRAQLSC GNAFTSLRQL
ALLDWREIFE KLSRVEQMLR RDPSGVYAGM DFATRDRCRR AIEEVARAAA QSEEQVAGRV
IELATQAHRG SATDKRASHV GTWLVGEGRA ELTRLLACRE THRYRALEWV YCHHAAVYAF
GIGGFSALLL SLVGSAGSVR LAGLFLLLLI PVSQLAIEVV NYLITRFLPP RPLPKMDFED
AGIPDAFRTL VVVPMMLVNA ETVRAEVEKL EIRYLANKEA NLLFSLFTDY TDSVTLSRED
DSRLLKTASG CLAELNLRHG AERFFLFHRE RTWSESEQKF IGWERKRGKL EELNRLIDGT
RPETSARLVH VGDPDHLTDV RFIITLDSDT QLPHATARRM VETLAHPLNQ PRFDGLGNIL
AGSYTIIQPR VSPTLPSTSV SIFSRLFADA VGIDPYTQAV SDVYQDLSGE GSYHGKGIYD
VRAFSRVLSG RFPDEWVLSH DLIEGAHVRV GLASDIELYD EFPQGYQSYS SRSHRWIRGD
WQIAGWVFPR VPQISGGHGV NPLSLLNRWK IFDNLRRSLL PATSLGLLMA SWLISPRTGG
IAALVVGMQL LFHPLAQPFT MATTRKGWKY FSPSKLLHDL LRATADAALL PHQAAVALDA
IARVCYRRLI SRRGLLEWTA QATHWSASRR QPLFVASLAM GSIFSAIVGG AIWRVMPASL
PQAAPWLVLW FLSPLLGWLL NLRPVEQQQA QPLPETDRRF LRQVARRTWR YFSSFISADT
SWLPPDNYQV SHQNRLAMRT SPTNIGLWMT SALGAHDSGY LTINQVVEKL TNTMATIGRM
ERYEGHLLNW YDIQTLVPLE PRYVSTVDSG NLLGALWALE QGLDELLHVP LLDGKAFAGL
ADTGEIMKQD AVLEGITGIY RQTLDQLLIE WHAPPSGIDE LLCLQRRMQI NVRSVVAAAG
AVPWAAELEQ QVSAWVQNTD RYLTWIEILA EKTEQELTQF GPAAMLAIRQ DLMQAPSLFA
LAHGRLGSIQ ILKAIREDSL QAGAPICPWL DRVIEAFATA QWLAGETLGM AERLIVNVRE
LSAGMSMRFL YDIKSKLFAI GYNVSLNRPD VSSYDLLASE ARLGSFVAIA RGDVPLEHWF
SLGRPYGAIG RQRVLLSWTG TMFEYLMPLL FQCSYGNSLL DKAAREAVTV QIAYGRTRRV
PWGISESAFA DLDLDKTYQY KAFGVPALGL KRGLEEKLVV APYATLLALN VAPKETVQNL
KQLAGLGLLG DYGYYEAMDF SRQPQREGRR GVVIEAYMVH HQGMAFLALT NFLHGNPFPR
RFHSDSRVRA FEALLQERIP TLPPLHLIST RQSEPLLPGG DLVAPAGSTF TTPHTTTPKS
LLLSNGRYGL MITNSGGGYS QWESQELTRW RSDQTCDSQG TFCYIHEADP DRVWSSTWHP
VGGKVEGYSV DFALDRAVFR RADNGVHTET EVIVSPEDDL EVRRITLVNR TNRVRRLNLT
SYVELSMAPH NADRQHPAFN KLFIQTEALP EQQILLAYRR PRSANELPLY VAHCLTLEHT
GDDSPHENVW QFETDRGRFI GRGRTLANPM GAVQELGNSQ GFVLDPSLSL RRSLTLEPGQ
RARLSLVLAA GATREQVLLL MDKYSDPHAT ERAMDFAWRS AQQQLQVLHI QPDEARRFQQ
LASHLLFPNH LLRPPAERLE ENRKGQAGLW PYAISGDLPI ALITISEARD ISLVRQMLQA
HTYWGSHGLA TDLVILNEEA GGYERPLQER LEQLIQAHTL SAAADRAGGI FLKSAAQIQE
ADLNLLKAAA SVVLVAARGT LPQQLGMPVE APELLETLAR KRAPREPSAP LPFMELNYFN
SLGGFTPDGR EYAIYLGPNT NTPAPWVNVI ANPTFGTLVS ETGSGFTWYG NSQRNRLTGW
SNDPVLDPAT EALYIRDEES GVFWSPTAAP IREETAYRAR HGAGYTVFEH NSNGIEQELT
VFVPVDESGG KPIKLQRLRL TNASSRRRRL SVTYYVELTL GENRETSQMH VMTNWDDEAH
ALLAQNRYHP EYGERVAFAA ITPHADSYGG DRTSFIGRNR SLANPAALEL TRLSQRTGAG
LDPCSALRVC LELAPGERRD ITCMLGQAGS VVQARELVFS YREDQAFEDA FDQTRAWWDA
LLGTIEVHTP ELAADLLINR WLQYQSLSCR IWGRSAFYQS GGAFGFRDQL QDVMAFLYAS
PDLARDQILM AASRQFKEGD VQHWWHEPAG AGIRSRISDD LLWLPYVVAQ YVRTTGDSDI
LQVEVPFLNA PQLADDQHEE FSTPEVTFER ATLFEHCRRA VSRGLTIGPH GLPLMGTGDW
NDGMNLVGAA GKGESVWLAW FLCDSLQGMA ELSSLLQQPE LSRTYQEERI ALIKRVEQAG
WDGEWYLRGT FDDGTPLGSA LNREARIDSL PQSWAWLSGA ADPERADQAL ESAWNHLVRE
DEDLVLLFEP PFDIAEPSPG YIKGYPPGVR ENGGQYTHAA LWMAMAMARK GDGGRAVQLL
RMLNPIEHAR DAAAVWHYGV EPYVVAADVY RLPGRVGQGG WSWYTGSAAW MYRAWVEEVL
GLQVRSGQLR VNPVIPAAWP GFSISYCHGE TIYAIQVENP HGCERGVAWV EMDGQRVSGG
VIPLERGLVK HQVVVRMGNR E