Gene Haur_3967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3967 
Symbol 
ID5735828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5037152 
End bp5046433 
Gene Length9282 bp 
Protein Length3093 aa 
Translation table11 
GC content66% 
IMG OID641281117 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001546727 
Protein GI159900480 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGGTT CGACAGCAAC GAACTACAAT GGAGCGGTCG CGATCATTGG GATAACAGGG 
CGCTTCCCCG ATGCCCGTGA CGTGGCTGCC TTCTGGCAGA ATCTGGTCGC GGGGGTGCGG
TCAATCCGCT CTTTCACCGA TGCGGAACTG CTGGCAGCGG GCGTCGATCC CGAGGTCTTG
AACGATCCGA ACTTTGTTAA GCATGGAACC AGGCTCGATG ACATCGAGCT GTTCGACGCG
GCGTTCTTCG GCTACACGCC ACGCGAAGCC GAGGTGATGG ACCCACAGCA CCGCCTGTTC
TTGGAGTGCG CCTGGCAGGC TCTTGAGCAG GCCGGCTACG ATCCGGAAGG GTTCCGCGGG
GCGATCGGTG TGTTCGCCGG CTCGGCGACC TCGACCTACA GGGGCAATAA CCTCCATACC
AATCGCGAGA TCGCCGAGGC GGCCGGCGGC CTGCAACTGG CCGTCGGCAA CGATGTCGAC
TCGCTGGCGT CGACCGTGTC GTACAAGCTG AACCTGCGCG GGCCGAGCGT GGCGGTGCAG
ACGTTCTGTT CGACCTCGCT GGTCGCAGTC CATATGGCCT GCCAGAGCCT GCTCACCTAT
GAGTGTGGCC TGGCACTAGC CGGCGGCGCT GTGATCGCCG TGCCACAGGG CGAGGGGTAC
ATGTACCAGG AAGGCGGCAT CCTGTCGCCC GATGGACACT GCCGCACCTT CGATGCCAAG
GCCCAGGGCA GCGTGATGAG CAATGGCGTT GGGGTGGTGC TGCTGAAGCG CATGACCGAT
GCATTGAAGG ATGGCGACAC GATCTACGCG GTTATCCGCG GGTCGGCGGT CAACAACGAC
GGCATCCGCA AGGTCGGCTA CACCGCCCCT GGCTTGAATG GCCAGTCATC AGTGATTACC
ATCGCCCAGA GCCGGGCCAA GGTCGCCCCG GAAACGGTCA GCTATATTGA GGCCCACGGA
ACGGCCACGC CGCTTGGCGA CTCGATCGAA CTGGCCGCCT TGATCAAAGC GTTTGAGCGC
GGCGCGCCAC GCAAACAATC CTGCGCGCTG GGGTCGGTGA AGCCCAACAT GGGGCACCTC
GACCGGGCGT CGGGGGTGAC GGGCCTGATC AAAACGACCA TGGCTTTGCA CCACCGCCAG
CTGCCGCCCA ACCTTGATTT CGAGGCGCCT AGCCCGGATA TTGATCTGGC CAACAGCCCG
TTCTACGTCA ACACGCAACT GCGCGAGTGG CCGGCCAACG GCGCTGCGCC GCGCCGGGCC
GGCGTCAACT CGTTTGGTCT GGGCGGCACC AACGTCCACG TCGTGCTGGA GGAAGCGCCT
GCACCGGCGC CCGTGGCTCC GGCTCGCCCG GCCCAGTTGC TGGTGCTGTC GGCCAAGACC
GCGACCGCCC TGGAGGCCAT GACCGACAAC TTGGCCGCCT ATCTGGCCGG CGCCCCCGTC
GATCTGGCCG ACGTGGCCTT TACCCTCCAG GTTGGTCGCG CGGGCTTCAA CCACCGCCGC
ATCCTGGTCG GCGACAGCGT GGCCGATGTC CGCGCCGCGT TGGTGGCAAA GGATTCGCGG
CGCGTGCTGA GCGCCACGCA GACTGGACGC AACCGGCCGG TGGCGTTTGT GTTCCCCGGC
GTGGGCGACC ATTACGCCGG CATGGCCGGG ACGCTGTACG CGACCGAGGC GGTCTTCCGC
GAGGCGGTTG ATCAGTGTGC CGAATTGCTG GCCCCGCGCC TTGGCCAGGA TCTGCGCGCC
GCCCTGTATC CCGCCGATCA GCCGGCCGCA ACCGCGGCCC ACACGCTGTT TGCGGCTACT
GCGGCGAGCA GTCGTGTGGC GGGAGCGCTG CATCAGACGG CGCTGGCCCA GCCGGCGGTG
TTTGTGGTCG AGTACGCGCT GGTCCAACTG CTGGCGAGCT GGGGTATCCG GCCGCAAGCG
CTGCTCGGCT ATAGTCTGGG CGAATACGTC GCGGCGACGG TCGCCGGGGT GTTGAGCCTG
GAGGACGCGC TGACCTTGGT GGCCAAACGC GCCCAGTGGA TCCAAGCCCA GCCGCATGGC
GCGATGCTGG CCGTCTCGCT GGGCGTCGAG GCCATCCAGC CCTACCTGAA TACCGAGGTG
GCGCTGGCGG TGGTCAACAG CCCAATGACC TGTGTCTTGG CCGGCTCGCA GCCCGCCTTG
GCGGCGCTGG CGGAGCGCTT CGCGGCCGCC GACGTGGCCA GTCGCTGGCT GGAGACGAGC
CACGCGTTCC ACTCGCCGAT GCTGGCGCCG GTGGCGGCCG AACTGACCGC GCTGGTACGC
ACGCTGCGGC TCCAGACGCC CAAAATTCCC TATATCTCCA ATGTGACTGG CACGTGGATC
ACCGATGCCG AGGCGACCGA CCCGGGCTAC TGGGCGCGGC ATATGGTCGA AACGGTGCAG
TTTGCCGATG GCGTCGGTAC GCTGCTGGCC GATGCTCAGC TGGCGCTGCT CGAAGTGGGA
CCGGGCCAAG CGCTGGGGTC GTTTATCCGC CAGCATCCGA CCTGCGGCCG CACGCGTTTT
GGCCAGATTG TCGCGACCCT ACCGGGCGCA ACGGAGCCGC AGCGAGATTT GATCGCGCTG
TTGGAAGGGC TGGGGCGACT GTGGCTGGCC GGTGTCCCCG TCGATTGGGC CGGTTTCCAC
GGCGGCGCGG CCCGCCGCCG TGTCCCGCTG CCGACGTATC CCTTCGAGCG CAAGCGCTTC
TGGATCGATG CGCCGAAGCG CGAATGGGCC GCGGCAAATC AGCCTGCCAC GGTCGGCAAG
CGGGCGGATA TCGCCGATTG GTTTTATCGG CCCGAATGGG CACCCGCCGC ACTCAATGCG
CCGACCAATG ACTCTAAACA GCGCTGGTTG ATTTTTGTTG ATGCCATGGG GATTGGCGCC
GGCGTACGCC AACGGCTGGC CGAGCATGGC CACGAAGCGG TAACCGTCAC GATCGGCGAA
GGCTTCAAGC GTCTTGATAC CCACCACTAC GAGCTGGCGC CAACCCGACT GGCCGATTAC
CTAGACCTTA TCGCGGCGAT CCAGGGGCAA GACCCGCTGC CGACCCATGT CCTCCATCTG
TGGAGCCTGA CGCCGCTCGC GGGGATCAAG TCCGGAGCGG CGCGCTTCGC AGCCGCGCAG
GACTACTGCT TTACCAGCTT GCTTAATCTT GCCCGCGCCT TTGAAGCGCA GGTGATTACC
GACAGCATCC ACATGCTGGT GGTTTCCAAA GGTATGCAGG CCGTCGATCC GAGCGACATC
CCCGATGCCG ACAACGCGAC CCTCTTGGGA GCCTGCACCG TGATTGGCCA GGAGAACACC
TCGATCATTG TCCGCAGCGT CGACCTGGAT GTGGATCTAG ATGCTCGCCA CTTGGACGGG
TATGCGGCGG GACTGATCGC CGAGGGCCAG AGCGGCAGCC ACGATCTGCA CGTTGCCTAT
CGCGACGGTC AACGCTTTGC CGAACGCTAT GTCCCGCTGC GCCTGGAAGC GCTGTCCCAG
CCGATCCTGC GGCCGGGCGG GGTGTATCTG ATGACGGGCG GTCTGGGCGG GGTCGGGTTA
ATCCTGGCGG AGCATCTGGC GCGGACGGCG CAGGCGAAGC TGGTGCTGGT CGGCCGGCAG
GGCTTGCCGG AGCGGGCGGC ATGGGACGCG TGGCTGCGCG AACACGGCGC GGATGACGCC
ACCAGCCAGC GTATCCAGCG GGTGCGGACG ATCGAGTCCG CCGGCGGCGT GGTTGAGGTA
GTGGCCGCCG ACGTGGCCGA TCCCGAGCAG ATGCGCCGCG CGGTCGCCAT CGCCGAAGCC
CGCTTCGGCC CGATCAATGG CGTCTTGCAC GCCGCCGGCA TTTCGGATGA TACGGCCTTC
CGACTGATCC AGACCCTCGA ACAGGCGACC TACGCCGCTC ACTTCCAGCC GAAGGTGTAC
GGGTTGTACG CGTTGGAAGC GGCGTTGGGC GACCGGCCGC TGGACTTCTG CGTGCTGTGT
TCGTCGGTGT CCTCGGTGTT GGGCGGGCTG GGTTTCGCGG GCTACGCGGC GGCCAACTGC
TTCATCGATG CATTCACCCA GCGCCACAAC CGCACGCACG CAGTGCCTTG GGTCAGCGTC
AACTGGGACA CCTGGCACCT GCGCGCCGGC CAGCACGACG TCACCGGTCT TACGGTTGCC
CAGTACGAGA TGAGCCCGGC CGAGGGCGCG GCGGCCTTCG AGCGCGTCGC CGCGGCCCGG
GGCCACAGCC AGATTATCAA CTCGACCGGC GACCTCGACG CCCGCATCCG CCAGTGGGTC
CGCCTCGAGT CCATTCGCAA CGATGCGGCC GTTGCGCCGG CTCCCGCGAG CAACGGCCGC
CCGGAGCTGT CCACAACCTA TGTTGCGCCG ATCGGCGAGT ATGAGCAACG CGTTTCCGCC
ATCTGGCAGC ATGTGTTGGG CATCGACGAG ATCGGACTTC ATGACAACTT CTTCGACCTG
GGCGGCAACT CGCTGATCGC GCTTCAGCTG ATCGCCCGGC TGAAGAAGGA GTTCAAGACT
CACATTTCAG CGGTGGCGCT CTTCGAGGCG CCGACGGTGA GCACGATGGC CCAATACCTG
CGCCCCGAAG TCGCACCGGA AGAACATGTC GATCAGGAGC GCCTGCTGAT TGAGCACCGC
CGCGAGCAAA CGCGCCAGAC GGTGCAGGCG GATGGTATCG CGATCATCGG CATGGTCGGG
CGTTTTCCCG GTGCCTCCAG CGTGGAGGAG CTCTGGCAAA ATCTGCATAA CGGTGTCGAG
TCGACAACCC ACTTCACCGA CGCAGAGTTG CTGGCGGCCG GCGTCGATCC CATGCAGGTG
TACCACCCCG ACTACGTGAA GTCGCGGCCG ATCCTGCAAG AAGATATCAC CCTCTTTGAT
GCGGCGTTCT TTGGCTACAC GCCGCGCGAG GCGGAGTTTC TCGACCCGCA GCAGCGCTTG
TTCCACGAGT GCGCTTGGGA GGCCCTGGAG CAGGCCGGCT ACGACACTCA GCGCTATCCC
GGCCTGGTCG GCGTTTTTGG CGGCACCAAC ACCAACGCCT ACCTCAACCG CATCGCCCGC
GACCCACGCT CCGACGGCCA TATCACTGAG ATCATCACCC TTGAGAACGA CAAGGACGCG
CTGGCGACCA ACGTCGCCTA CAAGTTGAAC CTGCGCGGGC CAAGCTTCGC GGTGCAGACG
TTCTGCTCGA CCTCGCTGGT CGCCACCCAC TTGGCCTGCC GCAGCCTGCG CCACGGCGAG
TGCGATATCG CGTTGGCGGG TGGCGTGTCG GTCCGTGTCC CGGTCAACAC CGGCTATCTG
TATGAAGAAG GCGATCAGGT GTCACCGGAC GGGCATTGCC GGACGTTCGA TGCCAACGCG
GGCGGAGCGA CCTTCGGCGA CGGGGTGGCG ATCGTGGTGC TGAAGCGGCT GGCGGACGCG
CTGGCCGACG GCGACACTAT CCACGCCGTG ATCCGTGGGT CGGCGATCAA CAACGACGGC
GGCCTCAAGG TCGGCTACAC CGCACCCAGC GTGGTCGGGC AGGCGGCGGT GGTGCAGGCT
GCGCTGGCTG ATGCCAATCT GGCCGCCGAT GCCATCTCGT ATGTCGAGGC CCACGGCACT
GCCACCAAAC TCGGCGACCC GATCGAGGTT GCCTCATTGA CCAAGGCCTA TCGCACGACC
ACCGACAAAG TTGGCTTCTG CGCGATCAGT TCGGTCAAAC CGAACGTCGG CCACCTCGAC
CGCGCTTCGG GGGCGACCGG CTTGATCAAG ACCGTCATGG CGCTCAAGCA CAACGTGATC
CCGGCCACGC TGCACTTCCA GACGCCCAAC CCCGAGATCG ACTTCGCCAG CAGCCCGTTC
TTTGTGCCGA CCGCGCTCAC GCCGTGGACG CGCAATGGCA CACCGCGCCG GGCCGGGGTC
AACTCGCTGG GTGTGGGTGG AACCAATGCG CACGTCATCG TGGAGGAAGC ACCGCAGGTC
GGGCCAAGTG GCCCCGGTCG GGCGGTCGAA CTGCTGGTGC TGTCGGCCAA AACGGCGACC
GCGCTGGAGG CAGCGACCAC GAATCTGGCC GCCTATCTGG AGGAGCAGCC GATGGTGAAT
CTGGCTGATG TGGCTCACAC GCTCCAGGTT GGGCGGCGGG TGTTTGAACA TCGCCGGGTT
GTGGTCGCCC GCGACGTGGC GGACGCGGTG GGCCTGCTGC GGAGCGGCGA TGCGCGGCGG
GTGCTGACGC TGGCACAGAA GCCGACCAGT CGGGGTGTGG CCTTTGTGTT CCCGGGCGTG
GGCGACCACT ACATCGGGAT GGCGGAGGGA TTATACGCGA CCGAGGGAGT ATTCCGCGCG
ACGGTTGACC GCTGCTGTGC GCTGCTGACG CCACTGCTCG GATCGCCCAT TCGGAAGGAA
ATCTACCCCG ATGGTGGTGT TCCCGCCCAG ACCGGCGTCG ACCTGCGTGC TATGCTGCGC
CAAGACGCGA CGCCGAGGTC GGCGGGGCGC TTGCACCAGA CGGCGTGGGC GCAACCGGCG
GTGTTCGTGG TGGAGTATGC GTTGGTGCAG CTGCTGGCGA GCTGGGGCAT CCGGCCGCAG
GCGTTGCTCG GCTACAGCGT GGGTGAGTAC GTGGCGGCGG CGGTCGCTGG GGTGTTGAGC
TTGGAGGATG CCTTGACCCT AGTCGCCAAG CGTGCCCAGT GGATTCAGGC CCAGCCGGCC
GGGTCGATGC TGGCGGTGAG CTTGAGTGCC GAGGCGATCG GTGCGTATGT GGGCGGTGCG
GTGGCGCTGG CGGTGGTCAA CAGCCCGATG ACCTGCGTCC TGGCTGGTCC GCAGGCGGCG
TTGGAGGCGG TGAAAACCCG CTTGGACGGT GATGAGGTGG CCAGCCGCTG GCTGGAGACG
AGCCACGCCT TCCACTCGCC GATGTTGGCG CCGGTGCAGG CCGAGCTGAC GGCGCTGGTG
CGCACGCTGC GGCTCCAGGC ACCGCGCATC CCGTATATCT CCAACATCAC CGGTACGTGG
ATCACCGATG CGGAAGCGAC CGACCCGGGC TACTGGGCAC GGCACATGGT CGAGACGGTG
CAGTTTGCGG ACGGCGTTGG CACGCTGCTG GCCGATGCCC AGCTCGTGGT GCTGGAAGTG
GGGCCGGGGC AGGCGCTGGG GTCGTTTATC CGGCAGCACC CGGCCTGCGG GCGCGACCGG
TTCGGCCAGA TCGTGGCCAC GGTGTGTGGG ATGACGGACA CGAGCGATGA CCTGGAGGTG
CTGTTGAGCG CGCTGGGGCG GCTGTGGCTA CACGATGTGG TGGTCGATTG GGCCAGCTTC
CGTGGCAGTG AAGTCCGCCA GCGTATCCCG CTGCCGACCT ACCCCTTCGA GCGCCAGCGC
TTCTGGATCG AGCTGCCTCC AAACCCGCGC GGGGATAGTG GGCGAAAGGT GCGCCGGTTT
GATGCGGGCG ACTGGTATGC CGTGCCCTCG TGGAAGCGCG CGGTCGCCCA CGACGAGCTT
ATCGACGGTG CGGCCGGCCT GGGCGACCAG GGTAGCTGGC TGGTGCTGGC GGATGGCGAA
GGACTGGCCG CTGGGTTGAC CGCCTGGCTG GAAGAGCGCG GCCAGACCGT GATCACGGTC
ACGCCCGGCG CAGCCTTTGC CCAGCACAGT GCAACGGCGT ATACCGTGCG CGCCGGCAGC
CGCGAGGATT TCACGGCCTT GTTGCAGACA CTGGAGCGTC ATGGCCAGAT GCCTAGTCGC
ATCGTCCATG CCTGGCTAGC GACCCCTAAG GTGGGTGCCG CTCAGCTTGA CGATGTGGGG
CTGGCGGAGT CCTTGGATCT CGGCTTCTAT AGCCTGCTGG CGCTGGCCCA GGCGCTTGGC
GAGCAGGATA TTGAGTGGTG TGAGATCAAT GTCCTCACCT CCGAGATGCA CGACATCAAC
GGCCGCGAGG AGCTCAACGT GGCGGCGGCG GCAGTGATTG GGCCGTGCAA GATTATTCCG
GTCGAGTATC CGAACCTGAC CGCGCGCTCC ATCGATATCC TGTTGCCGGC CAGCCCCGCT
GAGCGGGCGA CCTTGGTCGC GCAGATCGGC GCTGAGCTGG CTACCCCACC GACCGGCGAT
CTGGTCGCCT TCCGCGGCGC CCATCGCTGG GTTCAGATGA TGGAGCCGGT TGCGCTGCCG
ACAGCGCCCG CGTCGCATCC GCGCTTGCGG ACGGGCGGGG TGTACCTGCT GACCGGCGGC
TTGGGCGGGA TCGCCCTCGG CTTGGCCCGC GACCTGGCGG CGACGCTTCA AGCCAAGCTG
GTGCTGGTCA ACCGCTCCAG CCTGCCCGAC CGTGCCACCT GGCCGGCGCT GCTCGAACGC
GACGGTGCCG AGCAGGGCGT GGGGCGGCGC ATTCAGCAGG TGCTGGATCT GGAGGCGCTG
GGCGCGGAGG TGCTGGTCAT TCAGGCCGAC GTCACCGACG CGGTGGCGAT GGCGCGGGCG
GTGGCTGAGG CCCAGGCACG CTTCGGGACG ATCCACGGTG TGCTCCACAC GGCCGGCGTG
CCCGGCGTGG GCTTGATGCA GCTTAAGGAT GCCGCGACGG CAGCGGCTGA GCTGGCGCCC
AAGGTTCAGG GCACGCTGGC GCTGACTCGG GCGTTGGCCG GGGTGCCGCT GGATTTCTTG
GTGTTGTTCT CGTCAGTGAC GTCAGCGACG GGCGGCGGGC CGGGCCAGGT GGCCTACTGT
GCCGCCAACG CCTTCCTCGA CGCCTACGCC CGCAAGCATG CCACCGACCA CGGCCGCACC
GTGGCGGTGA GCTGGGGCGA GTGGTTGTGG GACGCCTGGT CCGACGGCCT GCAAGGTTTT
TCAGCCGAAG ATCAACTGCG CTTTCGGGCC TACCGCCGCA CCTTCGGCAT CACCTTTGAC
GAAGGCGCCG AGGCACTGCG CCGCATCCTG GCCTGCCGCA TCTCGCACCT GTTTGTGACG
ACCGAAGACG TCGTCGCCAT GTTCGAGGAT AGCAAGGGCT CGGCGCTGCA GCGCGCCGCC
CGCCAGGAGG ACGCGGCCCA GCGCTACCCG CGCCCCGAGG TCAGCACCTC GTTTGTCGAG
CCGCAAAGCG AGCTGGAGCA ACAGGTCAGC GCGATCTGGA GCGAAGTGCT CGGCATTGCG
CCGATCGGGG TCAATGACAA CTTCTTTGAT TTGGGCGGGA ACTCGCTGAT CGGGATTCAA
ATTGTGACGC GCTTGCGCCG AACATTCCAG GTCGCACTTC CGCTGACTAT TTTATTTGAC
GCACCCACAG TGGAGGAAAT GTCTATTGCG ATTGAAATGA TGCTGATCGA CGCGATCGAA
CATTCAAGCG AGAGCGCGGC GGAATCTGTT TCCCGGGTCT GA
 
Protein sequence
MTGSTATNYN GAVAIIGITG RFPDARDVAA FWQNLVAGVR SIRSFTDAEL LAAGVDPEVL 
NDPNFVKHGT RLDDIELFDA AFFGYTPREA EVMDPQHRLF LECAWQALEQ AGYDPEGFRG
AIGVFAGSAT STYRGNNLHT NREIAEAAGG LQLAVGNDVD SLASTVSYKL NLRGPSVAVQ
TFCSTSLVAV HMACQSLLTY ECGLALAGGA VIAVPQGEGY MYQEGGILSP DGHCRTFDAK
AQGSVMSNGV GVVLLKRMTD ALKDGDTIYA VIRGSAVNND GIRKVGYTAP GLNGQSSVIT
IAQSRAKVAP ETVSYIEAHG TATPLGDSIE LAALIKAFER GAPRKQSCAL GSVKPNMGHL
DRASGVTGLI KTTMALHHRQ LPPNLDFEAP SPDIDLANSP FYVNTQLREW PANGAAPRRA
GVNSFGLGGT NVHVVLEEAP APAPVAPARP AQLLVLSAKT ATALEAMTDN LAAYLAGAPV
DLADVAFTLQ VGRAGFNHRR ILVGDSVADV RAALVAKDSR RVLSATQTGR NRPVAFVFPG
VGDHYAGMAG TLYATEAVFR EAVDQCAELL APRLGQDLRA ALYPADQPAA TAAHTLFAAT
AASSRVAGAL HQTALAQPAV FVVEYALVQL LASWGIRPQA LLGYSLGEYV AATVAGVLSL
EDALTLVAKR AQWIQAQPHG AMLAVSLGVE AIQPYLNTEV ALAVVNSPMT CVLAGSQPAL
AALAERFAAA DVASRWLETS HAFHSPMLAP VAAELTALVR TLRLQTPKIP YISNVTGTWI
TDAEATDPGY WARHMVETVQ FADGVGTLLA DAQLALLEVG PGQALGSFIR QHPTCGRTRF
GQIVATLPGA TEPQRDLIAL LEGLGRLWLA GVPVDWAGFH GGAARRRVPL PTYPFERKRF
WIDAPKREWA AANQPATVGK RADIADWFYR PEWAPAALNA PTNDSKQRWL IFVDAMGIGA
GVRQRLAEHG HEAVTVTIGE GFKRLDTHHY ELAPTRLADY LDLIAAIQGQ DPLPTHVLHL
WSLTPLAGIK SGAARFAAAQ DYCFTSLLNL ARAFEAQVIT DSIHMLVVSK GMQAVDPSDI
PDADNATLLG ACTVIGQENT SIIVRSVDLD VDLDARHLDG YAAGLIAEGQ SGSHDLHVAY
RDGQRFAERY VPLRLEALSQ PILRPGGVYL MTGGLGGVGL ILAEHLARTA QAKLVLVGRQ
GLPERAAWDA WLREHGADDA TSQRIQRVRT IESAGGVVEV VAADVADPEQ MRRAVAIAEA
RFGPINGVLH AAGISDDTAF RLIQTLEQAT YAAHFQPKVY GLYALEAALG DRPLDFCVLC
SSVSSVLGGL GFAGYAAANC FIDAFTQRHN RTHAVPWVSV NWDTWHLRAG QHDVTGLTVA
QYEMSPAEGA AAFERVAAAR GHSQIINSTG DLDARIRQWV RLESIRNDAA VAPAPASNGR
PELSTTYVAP IGEYEQRVSA IWQHVLGIDE IGLHDNFFDL GGNSLIALQL IARLKKEFKT
HISAVALFEA PTVSTMAQYL RPEVAPEEHV DQERLLIEHR REQTRQTVQA DGIAIIGMVG
RFPGASSVEE LWQNLHNGVE STTHFTDAEL LAAGVDPMQV YHPDYVKSRP ILQEDITLFD
AAFFGYTPRE AEFLDPQQRL FHECAWEALE QAGYDTQRYP GLVGVFGGTN TNAYLNRIAR
DPRSDGHITE IITLENDKDA LATNVAYKLN LRGPSFAVQT FCSTSLVATH LACRSLRHGE
CDIALAGGVS VRVPVNTGYL YEEGDQVSPD GHCRTFDANA GGATFGDGVA IVVLKRLADA
LADGDTIHAV IRGSAINNDG GLKVGYTAPS VVGQAAVVQA ALADANLAAD AISYVEAHGT
ATKLGDPIEV ASLTKAYRTT TDKVGFCAIS SVKPNVGHLD RASGATGLIK TVMALKHNVI
PATLHFQTPN PEIDFASSPF FVPTALTPWT RNGTPRRAGV NSLGVGGTNA HVIVEEAPQV
GPSGPGRAVE LLVLSAKTAT ALEAATTNLA AYLEEQPMVN LADVAHTLQV GRRVFEHRRV
VVARDVADAV GLLRSGDARR VLTLAQKPTS RGVAFVFPGV GDHYIGMAEG LYATEGVFRA
TVDRCCALLT PLLGSPIRKE IYPDGGVPAQ TGVDLRAMLR QDATPRSAGR LHQTAWAQPA
VFVVEYALVQ LLASWGIRPQ ALLGYSVGEY VAAAVAGVLS LEDALTLVAK RAQWIQAQPA
GSMLAVSLSA EAIGAYVGGA VALAVVNSPM TCVLAGPQAA LEAVKTRLDG DEVASRWLET
SHAFHSPMLA PVQAELTALV RTLRLQAPRI PYISNITGTW ITDAEATDPG YWARHMVETV
QFADGVGTLL ADAQLVVLEV GPGQALGSFI RQHPACGRDR FGQIVATVCG MTDTSDDLEV
LLSALGRLWL HDVVVDWASF RGSEVRQRIP LPTYPFERQR FWIELPPNPR GDSGRKVRRF
DAGDWYAVPS WKRAVAHDEL IDGAAGLGDQ GSWLVLADGE GLAAGLTAWL EERGQTVITV
TPGAAFAQHS ATAYTVRAGS REDFTALLQT LERHGQMPSR IVHAWLATPK VGAAQLDDVG
LAESLDLGFY SLLALAQALG EQDIEWCEIN VLTSEMHDIN GREELNVAAA AVIGPCKIIP
VEYPNLTARS IDILLPASPA ERATLVAQIG AELATPPTGD LVAFRGAHRW VQMMEPVALP
TAPASHPRLR TGGVYLLTGG LGGIALGLAR DLAATLQAKL VLVNRSSLPD RATWPALLER
DGAEQGVGRR IQQVLDLEAL GAEVLVIQAD VTDAVAMARA VAEAQARFGT IHGVLHTAGV
PGVGLMQLKD AATAAAELAP KVQGTLALTR ALAGVPLDFL VLFSSVTSAT GGGPGQVAYC
AANAFLDAYA RKHATDHGRT VAVSWGEWLW DAWSDGLQGF SAEDQLRFRA YRRTFGITFD
EGAEALRRIL ACRISHLFVT TEDVVAMFED SKGSALQRAA RQEDAAQRYP RPEVSTSFVE
PQSELEQQVS AIWSEVLGIA PIGVNDNFFD LGGNSLIGIQ IVTRLRRTFQ VALPLTILFD
APTVEEMSIA IEMMLIDAIE HSSESAAESV SRV