Gene Haur_2036 details


Gene Information

Locus tag: Haur_2036
Symbol:
ID: 5733925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2531184
End bp: 2540405
Gene Length: 9222 bp
Protein Length: 3073 aa
Translation table: 11
GC content: 59%
IMG OID: 641279180
Product: YD repeat-containing protein
Protein accession: YP_001544807
Protein GI: 159898560
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
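
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length counts one terminal stop codon. Both assumptions are consistent with the values listed here, but the record itself does not state them.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate must not precede start coordinate")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
gene_length = span_length(2531184, 2540405)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 9222 3073
```

Both results match the Gene Length (9222 bp) and Protein Length (3073 aa) fields.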


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCTGC TTCGGCGTGG GCGCGTTTGG TACGCGCTCT GTCTTTTTTG CATGCTGGTT 
CCCTTGTTCG TGCATCCGGC CACCATCCAA CCCGTTATGG CTGCGCCGCT TGTCCCAGCA
TCGTCCACGC ACCAGTGGAC GCAAGCAACC CCGACCGCGC TGCTCATCGC CAATACGGCG
GATGGCATCA CCCTCACCAG TGCGGATACC GCCCTTCGCC AATTTCTGGA AACCACTATG
GGCTGGACAG TCACGCTTAA AGATGATGAC CTCGTCCAAG CGAGTGATGC GGCGGCGGTC
AATGTGGTGG TCATTTCGGC ATCGATTGAT TCGTATAAAT TAGACAATGA TTTTACCTAT
GTCACCACGC CGATGGTAGT GAATGAATAT GCGCTCTATG ATCAATTGGG CATGACGAAT
GGCGCATCGG GAACGGATTA TGGGATTAAC TCGGGAACCC CGACTACGAC CATCACGCTG
CTCGATCCAA CCCATCCCTT AGCGGGTGGC AAGAGTGGAA CCATCACCGT CCAAAGTCAG
AGTTTGCGCT TGCCCTATGG CCGTCCGGTC AATCAAGGGG TGAAGGTCGC AAATCTGCCG
GGCAGCACGA GTAAAACGGT GCTGTTTGGC TACGATACTG GGGCGACGAT GACGATCCCC
AGCACCAAGG ATGGCGGCTT AGTGGGGTCA ACCCCGTTAG CGGCCCCGGC ACGTCGGGTT
GGGTTGTTCT TGGATGCCGA TACCCCCTTA TACTTAACCA GCGATGGGTG GGATCTCGTT
CGTTCCGCGT TGCTGTGGGC GATTGGGCAA ACCCCACCGA GTGGCACGCC AACGCCCACT
ACCGTCGCGC CGACGGCAAC GCCTGCGCCG GAAACCGCGA CGGCTGCTCC AGCGACCGCC
ACGGCCACGA GTCTGCCTCC GACGGCAACC GCGACGAATC TGCCCCCAAC CGTCACGCCA
CCGAGTGGTG CGGGGAAAAA CCTGCTCTTG GTGGTGGGCA GTGTGCCCTT GGGGAGTGGT
GATGCGGCGG TACTGCAACG CTTGACCGAT GCCCAGTACA CCGTGCAGGT AATTCCCCAG
ACCAGTCTTA CCCTCGAGGC AACCCAGGGC ATGCATGGCG TGTTGATCTC CTCAACCGTC
AGCGCGTCGG CGATTGGTAT GCTGTTGACC ACCCATCCCG TGCCCGTGTT GACCTGGGAA
TATGCGCTCT ATGATGATCT GCGAATGACG GGCACGAGTT CCGGTACGCA TTTCGGGACG
AGTACGGGCA GTACGCTCAG TATTGCGGCT CCGACACATC CGATTGCCGC CGGGTTGAGT
GGGGTCGTCA GCGTGAGCGG ATCAGCGGTC ACCTTGGCCT TTGGGCAACC ACTTGCCCAA
GCAACGCTGG TCGCCGCCTT ACCCAATGCG CCAGCATCGG CGGGCATTTT TACCTATGCC
CAAGGGGCTG CATTAAGCAG CGGAACGGCT CCCGCACGGC GGGCGGGCTT TTTCTTTGAT
AACACCAGCC CAACAACGGC AACCGCGACC GGCTGGGCGT TGTTTGAGCG GAGTGTGGCG
TGGCTGATTG AGGGCTTTAC CCCCATTCAA CCCGCGACGG CGACCCCGAC CGCGACGGTA
GTCCCCAGTC CCACTCCGAC CGCGAGTCCG ACGATCGTTC CGAGTGTCAC CCCAGTGGTG
CCGCCCAGTG CGACCCCTAT CCCGACTGCG ACTCCCTTGG TCATTCCGCC TGATCCGGCA
GCAGTGGCCC CCGCGTTAAG TGCGTTTACC GCCAGCGATT TGTTTGCGGA TACCGCCTTC
TTGTATACGG GAACGACCCC GATTCAAACG GGGGTGGCCC CAGATACGAT TACCGCGCAG
CGGGTGGCGG TGATTCGTGG ACGGGTGCTA GATCGCACGC GCCAGCCGAT TCCTGGCGTG
CAAATCACGA TTAAGGATCA TCCCGAATTT GGTATGACCC ACACCCGGGC CGATGGCATG
TTTGATTTGG TGGTCAATGG TGGCAGTCGG TTGGTGGTCC AGTATCAGCA TCCACAGTTT
TTGCGGGTGC AACGGGCCGT GACCACCCCC TGGCGTGATA GTGTCTGGAC TCCTGAGGTC
GTACTCACGC CGCTCTCGCC GATCGTCACA ACGATTCAGT TGGGGGGCGC TGCGCAGGCT
GCCGAGGGCA CGGTTAGCAC CGATGATCGG GGGACGCGCC AGGCGCGGCT CTTCTTCCCT
GCGGGCATTA CGGCCACGTT GCGCCTGCCG TCAGGAGCCA CCCAGCCGCT GACCAGCATT
GCGGTGCGGG CGAGCGAATA TACCGTCGGC GATACGGGAC GTGCAGCGAT GCCGGGCGAA
TTGCCGAGTA ATAGTGGCTA TACCTATGCA GCAGAATTCA GCGCCGATGA AGCGTTGGCG
GTCGATGGCT CGGTGGAATT TAGTCAGGCA GTCGCCAACT ATACGGAAAA CTTCATGGGC
TTTCCGGTAG GGATGGCTGT CCCCACGGGC TGGTACGATG CCCAACGGGC GGTCTGGGTT
CCGAGTCCCA ATGGTCGGAT CATCGCAGTC TTAACCACCG CACCATTGAC CCTGGATCTG
ACGGGGGATG GGGTGGCCGA TGACCCAGCC AGCCTCGGGA TTGGTGCGGA TGAACAAGCC
TTTGTGGCCA GCACCTATTC GGCGGGAACC AGCCTGTGGC GGGTGCCGAT GACCCACTTT
ACGCCGTGGG ATCATAACTG GCCCTTTGGC CCACCAGACG ATGCGGTTGG CCCTAATGGT
GGCCCATTAA GCAGTGATCC CCCGATTAGC ATTCCCTGTG AACAACCGGG GTCGGTGATT
ACCTGTGAAA CCCAAAGTGT TGGGGAAGAG GTTGCGCTTG CGGGCACACC ATTCCGATTG
GTCTACACGA GTGATCCTGC GGCGGCCCAT CATCAGGAAG TCGTCATTCC GCTGAGTGGG
GCAACCATTC CGAATTCGCT GACCGGGATT GATGTAACGG TGCAGATAGG TGGGCAACAA
TGGACGCAGG CCTTTTCGCC GAGTACCAAT TTGGCAACGA TCTTTACGTG GGATGGATTG
GATGCCTATG GGCGGATGCT CCAAGGCAGT CAGTCCGCGA CGATCAAGAT CGGGTTTACC
TATCCTGGTC AGTATCGCAC TCCGGCCCAA TTCGGGAGTG CGTTTGGCCA ATTTGGGACA
GGCGGAATCT CCCTAACGGG TGACCGCGCT ACCGAGATGA TTACGGTCGA ACGCGTGTTA
AACACAACGC TTGGCCGACT CCATGTCAGT CAAAATGCGA TTGGCGGATG GTCGTTGGAT
ATTCATCATC AGTATGATAC GCATCAACGT ATCCTCTATC GTGGTGATGG GAGTCGCCAG
CGGGCGGAGG CGCTGACCGC TGTTACCCGA ACAGGAGTCG GCTGGGCCTA TCCGCGGCTT
CATGCGATTG CCCCAACCCG TGATGGGGGA CGCTATGAAG CAGGTGGATC GGGTTCGATC
AGTACAAACC TGTGGGAGTG GTTGCCCGAT GGAACGCGGA GGCAACGATC CGGAATCCTG
AATGGGTTAC CACCAGCTGA TGGGCAATTA CTCAGTCATA CCGAATTTAA TGGCATCCGT
GCGCTGCATA CCACGCCAAC GGGTGAGGTC TATCTGACCG ATGAGGGATC GACGTGGGGT
GGCAACGCAC CTGCAATCTA CCGAATTACG GCTGATCAGC GGCTCGAACG GGTGGTTGGG
GGGACGACCA AAGCGTTGAC GCTCACACCA GATGGTGCTG TGGCGCGAAC GATGCCGATC
GGGACACCCC TTGCCGTAGA TGTGGATACT GATGGAACCG TGTGGTTTAT TGATGCCTAT
ACGGTTGAGG TTCCGCAGGC GAATGGCCCG ATCATCGGCA AGACGGTGAC AGCATTGCGC
GTGGTCCGGA CGGATGGGCA GGTCATCCAT CCCGTCGCAC CAACGCAGCT GCGGTGTGGC
TTTCCCAATT TACGGGATGA GGAGATCGTC CATGTCACGC ATGATACGCG CGGGAATCAT
TATCTCGCCC TGCGTGGCCG CTATCCGGCG GTTGCGGGGC CGACAGGAAC CGGGTGTATC
CTGCGGATTG ATGCGGCAGG AACGATTACG CGGATTGCGG GCCGAGATAC CAACGGTCAA
GCCATTGGAC AATATCCGGA AGGATGGTTG GCGACGGAAT CCCCCCTTAT CGATCCTAAT
GCCGTTCATG TCCATTCAGA TGGACGGTTG ATTGTGCAAG ACATCGGAGC CTTACGCGAG
ATCCTTCCCG ACGGCGAAAT GCGAACCATT GTCGGGAATC CGTCCCCGAG CACTGATCCC
CAACTGGACG AACTCTTTGG GAGTATGGTG GGTCCCAATG GCCAGATTTC GTTTACCTCG
CGTGCGGTGA AACAGGGAAC TGTCGCGCCG CCGTTCCCCG GTGTGGGACA AGCACGCCTT
GAAGTGCCAT CCTCAGATGG GAGCGAAGTC TACGTTTTTG CATCCACAGG CCGCCATTTA
CTGACCCGCG ATGCGATGAC CGGGGTGCTT CGCTATACCT TTCTCTATGA TACAGCAGGA
CGGTTAACGG GGGTGCGGGA TCAGCATGGA CAGACCACGA CGATCACGCG GATAGCCGAT
GGCACCCCAA CGGCGGTGGT GAGCCCCGAT GGGGTTACGA CCACGCTGAC GGTCGGCTCC
GATGGCAGCG TGCAGGCGAT CAATGATCCG ACCCAGAGTC GCTGGCAGCT TCAGTATCAG
GATGGACTAT TGACGCGCTT GATTGATCCA CGCAATCCTG ATTGGCAGCA TCAGTATACG
TATACCGCTG ATGGGCATCT CAGCCAAGAC ATTGGGGCAA CCGGCGGATG GACAGCGCTC
GATCAGACGC GGCTCAGTGC CACGACGACC ATCGTGACCA GTACCACGGC CACGGGGATC
ATGACGGCGC ATCGGATTAG TACCGCCGCC GATGGCACGA CGAGCCGGAT GGTGACGACC
GAAGGTCGCC CAACGATCAC GACCGCGATC GATGCCCATG GCAATCAGGT GCTGATTGAT
GCACGGGGCA TGCGGCGCAC GACGACCATG GCTCCTGATC CCCGTTGGGG GATGACGGCT
CCCTATCCTG CGGCGATCAC GGTGACCAAT CCGGCAGGAC AGGTGGTGGC AACCGCCACG
ATGCACCGGA GTGTAGTGCT GAGTGACCCG ACCGATCCGT GGAGTTTGGT GACGTGGCAG
GAGGTCTTCA CCCAACACGG CTTGACGACC ACGACCATCT ATACCGCCGC AACCCGCACG
CGTCAGGTAA CCTTGCCATC AGGGGTCGTG CGCGTGACCA CCTATAACGC AGCTGGACAG
CCGATTACGG AAACGGCGAC CGATCAGGCC GTGAAATCAT GGACGTATCA GCCCGATGGC
CGACTCGCTA GTGACACGAT CGGGACGGGG CCAACTGCTG CGACAACGAC CTATGCCTAT
ACGCCCCAGG GCTTTATCGC CGCCATCACC AATGCGCTGG GCGAAACGAC CACCTATACC
CATGATGCCG CCGGACGGAT CGCGACCGTA ACCCACCCGA CGGGCCATAC CCGCCAGTAT
TGGTATGATG CGGTGGGTAA TCGGGTGCAG GAACGCGACG AACGGGGCAT GGTGACGACC
CGGGTCTACA ATGCCGACAA CCAACCGATT GCCGAAGTCG TCGATGTGCA CGGTCGGGCA
ATCTTAACGA CCGTCACCTA TGACCTGCAT GGCCGCGTGG TGCGCGAAAG CACGGCCCAA
GGGGCGAATC GTCCGCCAGC GGTCACGACC TATGGCTATG CATCGCTCAG TCCAGTGGCC
AGTGTGCCCA TCAGCGTGAC GAATGCGCTG GGCGAGACCA CGACCTATAC CTATGATGCG
GTGGGGGATC TGGTTGCAGT AACCGATGCC ATGCAGCACA CGACGGTCAT GACCTATACC
CCCGAAGGGT GGCTGGCCGC TGTCCAATCC CCGAGTGGTC AGCTGATGCA ACGGGACTAT
ACCACTGATG GCCAAGTCCT GAGTGAAACC GATGCGCGGG GCAGTATCAC GACCTATACC
TATACGGCAT TGCGCCAACC AGAAACGGTG ACGGTTAACA CCACTGCCGT TGCGGGCTAT
TCAGCGGTCG CAGCCAGCAC CACCACCACC TATGATGGCT GGGGTCGGGT GGTTCGTGTG
CGTGATCCGC GTCAGCAGGT GACGACGTTC CACTATACGG ATCGGGATCA CCTGGCATGG
ACGGAACTGC CAACCGGGGA ACAGACCCAC TATACCTATG ATGCGCTTGG ACGACCCATT
GCGACCGTGG TGGGAGCACA CGACCCCAGT ACGGCGATCA CCACGACGAC CAGCTACGAT
GCTGCTGGGC GGGTGTTGAC CACGGTGGTC GATCCCGACG GCATGGCCCT GACGACCCGC
TATCGCTATA CCGAACCAGG TGCGAGTAAT AGTTGGGATC TGCATGCCGT GGTTGACCCG
AATGGGCACA CCACGCGCTA TCAAACCAAT GCGCTTGGCT GGATGACGGC AACGACCAAT
GCCCTGACCG AAACCTGGAC GATGACCTAC GACCAGCGGG GCAACCAGAC GGGCATTACC
GACCCGCGCG GGAATACGCT TGCGTGGGAG TATGATGCGC TCGGTCGCCG GATCATGGAA
CATGAGGCAG GCCGCACCCA GCGCTGGCAG TATCGGGCGG ATGGACGGCT GGCGGCGCAG
ATCGATATGG CCAATCGCAC CACCAGCTAT GGCTATAGCG TTGATGGCTT CCTGACCGAT
ATTGACTATC CCAGTGGGAC AGCGGATGTC AGCTATACCC ATGACGCGAA TGGCAATGTC
ACGACGATGC AGGATGGTCT GGGGACGACC ACCTATCGCT ATGATGGGCT GAATCGGCTG
CGCGAACGGA CGCGCGATGG GCGCACGGTG GGCTATACCT ATGATGCCGC CTCGTTCCGC
ACCAACACCG ACTATTGGGG TACGGGCAGC GTGACGGCGA CTCCCGATGC CGCTGGACGG
GTCGCGAGTC TGCAACCGTG GGGTGGTTCC ACCACGACCT ATGCCTATGG CGCACGGGGG
CAGATGGCGA CCGCAACCAG CGGCAGTGGC CTCAGCGTGA CCCCCACCTA TGATGCGGCA
GGGCGCGTGC TGGCCACCCA CTATGCCCAG AACGGCACGA CCCTCGCCAA CTTTGGCAGC
ACGGTGGATG CGGTGGGGAA TCGTACCAGC CTTACCGATG CCAATGGCAC GACGACCCTG
ACCTACGATG CGACGGATCG CATGGTGCAG GCTACGGAAC CACAGGCCAG CACGACGTAT
ACCCATGACG CGGTGGGCAA TCGCACCAAT GTGGCAACCA CGGGGCAAAC CCCATTCAGC
TTCAGCTATG ATGCCGCCAA TCGCCTGACG ACCAGCGGCT ATACCCATGA TGCCAATGGC
AACCTGACAA CGACCCCCGA GGCGACCTAC ACCTATGACG CGGCGAATCG GGTGACCAGC
AGTACAACCA GTGCAGGCAC GACGACCTAT GGCTATGATG GCTGGGGTAA TCTCGTGCGG
GTGACGACCA ATGGGCAGGT ACAGGATCTG GTCTTGGATG AAGCGGCAGG GTTGCCCCAG
GTGCTGGGGA CAGTGACCGC GAACGGCACG ACCCGCGTGG CCCGCGATCC CGCAGGGTTG
GTGCATCAAA CCAGTGGCGG GCAAGTCAGC ACCTTGCTGA CCGATCTGGT GGGTAGTGTG
CGGCAGGGGA TCACCCCGAG CGGAACAACC CTGTTCAGCC AAGCCTTTAC GGTCGATGGG
GTGCCAGTGG CCCACAGCGG CAGTGCGACG AGTGCCTTTG GCTTTACGGG GGAATGGACG
AACCCGCTCG ATGGCCTCGT GTATCTGCGA GCACGCCACT ATCTGCCGAC CATGGGACGC
TTCATCCAAC GCGATACCGA TGCAGGCAGC AGCATGGCTC CGGCTTCGTT ACATCGCTAT
GCCTATGTGG ATAACAACCC GGCCACGTGG ACCGATCCAA CGGGTCATCG GAAGGGGTTG
CAACCCGATG GGTCATGGGA GCCATGTCAG ATCATCCTTG GCGATCCCGA GGATCAATGC
TCCTTGGGTG ATTACTTTGA CTTCCCCAAA TCGAAAAGTG GCGATGATCT GATGACAATG
CTCAAACGGT TCGGCAAGCG CATTGAAAAC TTGCCGGGTG ATGCCATCCA TGGGACGATT
GCCTTTGCGA CCGACCCGGT TGGGACGTTA TTGGCGATCC CGGGGTCGGT GGGATCGATG
TCGCAGAATT GTCTCCAAGG GACGTTCTGT TACAACTACG AAGCGTTAGC CGACTGCGTT
TGGGATGTGG GTGGCATGGC GGTTGGGGGT GGCTTCGGCG GAGCAGGCAT GGTTGATGAC
CTCGCACGCT TCCGTGGGGG AGCAGGCTAT GCCGACGACT TGCGCTATCT TGATGATCTG
GCGGATGGTG CCTGTTCCTT TACCCCCGAT ACGCCCGTGG CCACGCCGGA TGGGCCGCAG
CCGATTGCCC GCTTACGCGA AGGCGATACG GTGCTGGGCT ACGACGAGAC GACCCAAACC
ACCGGATCAT ACAGCATCAC CGCTCTCCTG ATCCACGACG ATCCGGTCGT GCTGGATCTG
ACGATTGACG GAGAGACCAT CACCACGACA CCCGAGCATC CCTTCTATGT GCGCGGAATG
GGCTGGGTTC CTGCTGGCGA TCTCGAACCA GGTGCTGCGA TTCGCACGGC GAGTGGCACA
TGGGGATACG TGCAAATCAG CATTGCGCGG AACATGCCGC AGACGATGTA TAACCTTACC
GTGGCGGACG CGCATACCTT CTTTGTCGGG GATGACCAGT GGCTGGTGCA TAACGCCAAG
TGCCCGAAGC CTAGAAACAG TGCGTATGCT GGCACAACCT ACTATCCAAA AGATCCTAAC
GTAAGAGCAC GGTATCCAAA CGGTGTTCCC TTCGATGCCC AAGGGTATCC TGATTTCTCA
GCATATTCTA TTAAGGATGT CCAAATTGAT ATGAGAGGAA ATCGTGGGAG CGACTTTGCC
AAAGCAGATC AAGCCGCAGG TTATACAGTA AGACCCTCTG GTTATACGTG GCATCATCAC
CAAGATAGAA CAACGATGCA GTTAGTACCA ACCGATTTAC ATGGAGATGT AGCACATACA
GGAGGGGTTT CTATGATTAG ATGGTTTGGT ATATTACCGT AG
 
Protein sequence
MPLLRRGRVW YALCLFCMLV PLFVHPATIQ PVMAAPLVPA SSTHQWTQAT PTALLIANTA 
DGITLTSADT ALRQFLETTM GWTVTLKDDD LVQASDAAAV NVVVISASID SYKLDNDFTY
VTTPMVVNEY ALYDQLGMTN GASGTDYGIN SGTPTTTITL LDPTHPLAGG KSGTITVQSQ
SLRLPYGRPV NQGVKVANLP GSTSKTVLFG YDTGATMTIP STKDGGLVGS TPLAAPARRV
GLFLDADTPL YLTSDGWDLV RSALLWAIGQ TPPSGTPTPT TVAPTATPAP ETATAAPATA
TATSLPPTAT ATNLPPTVTP PSGAGKNLLL VVGSVPLGSG DAAVLQRLTD AQYTVQVIPQ
TSLTLEATQG MHGVLISSTV SASAIGMLLT THPVPVLTWE YALYDDLRMT GTSSGTHFGT
STGSTLSIAA PTHPIAAGLS GVVSVSGSAV TLAFGQPLAQ ATLVAALPNA PASAGIFTYA
QGAALSSGTA PARRAGFFFD NTSPTTATAT GWALFERSVA WLIEGFTPIQ PATATPTATV
VPSPTPTASP TIVPSVTPVV PPSATPIPTA TPLVIPPDPA AVAPALSAFT ASDLFADTAF
LYTGTTPIQT GVAPDTITAQ RVAVIRGRVL DRTRQPIPGV QITIKDHPEF GMTHTRADGM
FDLVVNGGSR LVVQYQHPQF LRVQRAVTTP WRDSVWTPEV VLTPLSPIVT TIQLGGAAQA
AEGTVSTDDR GTRQARLFFP AGITATLRLP SGATQPLTSI AVRASEYTVG DTGRAAMPGE
LPSNSGYTYA AEFSADEALA VDGSVEFSQA VANYTENFMG FPVGMAVPTG WYDAQRAVWV
PSPNGRIIAV LTTAPLTLDL TGDGVADDPA SLGIGADEQA FVASTYSAGT SLWRVPMTHF
TPWDHNWPFG PPDDAVGPNG GPLSSDPPIS IPCEQPGSVI TCETQSVGEE VALAGTPFRL
VYTSDPAAAH HQEVVIPLSG ATIPNSLTGI DVTVQIGGQQ WTQAFSPSTN LATIFTWDGL
DAYGRMLQGS QSATIKIGFT YPGQYRTPAQ FGSAFGQFGT GGISLTGDRA TEMITVERVL
NTTLGRLHVS QNAIGGWSLD IHHQYDTHQR ILYRGDGSRQ RAEALTAVTR TGVGWAYPRL
HAIAPTRDGG RYEAGGSGSI STNLWEWLPD GTRRQRSGIL NGLPPADGQL LSHTEFNGIR
ALHTTPTGEV YLTDEGSTWG GNAPAIYRIT ADQRLERVVG GTTKALTLTP DGAVARTMPI
GTPLAVDVDT DGTVWFIDAY TVEVPQANGP IIGKTVTALR VVRTDGQVIH PVAPTQLRCG
FPNLRDEEIV HVTHDTRGNH YLALRGRYPA VAGPTGTGCI LRIDAAGTIT RIAGRDTNGQ
AIGQYPEGWL ATESPLIDPN AVHVHSDGRL IVQDIGALRE ILPDGEMRTI VGNPSPSTDP
QLDELFGSMV GPNGQISFTS RAVKQGTVAP PFPGVGQARL EVPSSDGSEV YVFASTGRHL
LTRDAMTGVL RYTFLYDTAG RLTGVRDQHG QTTTITRIAD GTPTAVVSPD GVTTTLTVGS
DGSVQAINDP TQSRWQLQYQ DGLLTRLIDP RNPDWQHQYT YTADGHLSQD IGATGGWTAL
DQTRLSATTT IVTSTTATGI MTAHRISTAA DGTTSRMVTT EGRPTITTAI DAHGNQVLID
ARGMRRTTTM APDPRWGMTA PYPAAITVTN PAGQVVATAT MHRSVVLSDP TDPWSLVTWQ
EVFTQHGLTT TTIYTAATRT RQVTLPSGVV RVTTYNAAGQ PITETATDQA VKSWTYQPDG
RLASDTIGTG PTAATTTYAY TPQGFIAAIT NALGETTTYT HDAAGRIATV THPTGHTRQY
WYDAVGNRVQ ERDERGMVTT RVYNADNQPI AEVVDVHGRA ILTTVTYDLH GRVVRESTAQ
GANRPPAVTT YGYASLSPVA SVPISVTNAL GETTTYTYDA VGDLVAVTDA MQHTTVMTYT
PEGWLAAVQS PSGQLMQRDY TTDGQVLSET DARGSITTYT YTALRQPETV TVNTTAVAGY
SAVAASTTTT YDGWGRVVRV RDPRQQVTTF HYTDRDHLAW TELPTGEQTH YTYDALGRPI
ATVVGAHDPS TAITTTTSYD AAGRVLTTVV DPDGMALTTR YRYTEPGASN SWDLHAVVDP
NGHTTRYQTN ALGWMTATTN ALTETWTMTY DQRGNQTGIT DPRGNTLAWE YDALGRRIME
HEAGRTQRWQ YRADGRLAAQ IDMANRTTSY GYSVDGFLTD IDYPSGTADV SYTHDANGNV
TTMQDGLGTT TYRYDGLNRL RERTRDGRTV GYTYDAASFR TNTDYWGTGS VTATPDAAGR
VASLQPWGGS TTTYAYGARG QMATATSGSG LSVTPTYDAA GRVLATHYAQ NGTTLANFGS
TVDAVGNRTS LTDANGTTTL TYDATDRMVQ ATEPQASTTY THDAVGNRTN VATTGQTPFS
FSYDAANRLT TSGYTHDANG NLTTTPEATY TYDAANRVTS STTSAGTTTY GYDGWGNLVR
VTTNGQVQDL VLDEAAGLPQ VLGTVTANGT TRVARDPAGL VHQTSGGQVS TLLTDLVGSV
RQGITPSGTT LFSQAFTVDG VPVAHSGSAT SAFGFTGEWT NPLDGLVYLR ARHYLPTMGR
FIQRDTDAGS SMAPASLHRY AYVDNNPATW TDPTGHRKGL QPDGSWEPCQ IILGDPEDQC
SLGDYFDFPK SKSGDDLMTM LKRFGKRIEN LPGDAIHGTI AFATDPVGTL LAIPGSVGSM
SQNCLQGTFC YNYEALADCV WDVGGMAVGG GFGGAGMVDD LARFRGGAGY ADDLRYLDDL
ADGACSFTPD TPVATPDGPQ PIARLREGDT VLGYDETTQT TGSYSITALL IHDDPVVLDL
TIDGETITTT PEHPFYVRGM GWVPAGDLEP GAAIRTASGT WGYVQISIAR NMPQTMYNLT
VADAHTFFVG DDQWLVHNAK CPKPRNSAYA GTTYYPKDPN VRARYPNGVP FDAQGYPDFS
AYSIKDVQID MRGNRGSDFA KADQAAGYTV RPSGYTWHHH QDRTTMQLVP TDLHGDVAHT
GGVSMIRWFG ILP
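
Both sequences above are printed in blocks of 10 characters, 6 blocks per line, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, with a GC-fraction helper and a length consistency check. All function names are hypothetical, and the length check again assumes the nucleotide sequence ends with one untranslated stop codon.

```python
def normalize_blocks(text: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already normalized DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """True if the CDS length equals 3 * (residues + 1 stop codon)."""
    return len(dna) == 3 * (len(protein) + 1)


# The final lines of the gene and protein sequences, as printed above.
last_gene_line = normalize_blocks("GGAGGGGTTT CTATGATTAG ATGGTTTGGT ATATTACCGT AG")
last_protein_line = normalize_blocks("GGVSMIRWFG ILP")
print(len(last_gene_line), len(last_protein_line))  # prints: 42 13
```

Applied to the full sequences, `gc_fraction` can be compared against the GC content field, and `lengths_consistent` against the listed gene and protein lengths.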