Gene PC1_2330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPC1_2330 
Symbol 
ID8133274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePectobacterium carotovorum subsp. carotovorum PC1 
KingdomBacteria 
Replicon accessionNC_012917 
Strand
Start bp2674631 
End bp2682433 
Gene Length7803 bp 
Protein Length2600 aa 
Translation table11 
GC content56% 
IMG OID644865611 
Productfilamentous hemagglutinin family outer membrane protein 
Protein accessionYP_003017898 
Protein GI253688708 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAAGA ACCTCTATCG CATCATTTTC AACAAGGCTC GCGGCCTGTT AATGGTGGTT 
TCCGAAATCA ACCGCGGCCA GGGTAAAAGC GGCGCCAACG GCGTCGGCCA TACGCTGAGC
CAGCTGATTG GCAGCATGAA GCCCGCCGCC TTCCTGACCA TGACCGCACT CGGCCTGGTC
ACGCTCGCGC CACAGGCACT GGCGGCAGGA ATTGTCGCCG ACAAGAGCGC CCCGGGCGGC
CAGCAGCCGA ATGTGATGCA GAGCGCCAAC GGCACGCCGC AGGTCAATAT CCAAACGCCG
AGCGCCGGTG GTGTTTCGCA TAACAAATAC ACCCAGTTTG ACGTCGATAA CAAAGGCGCA
ATCCTTAATA ACTCGCACAA ACAGGTGCAG ACCCAGATGG GCGGCTGGGT GGCGGGTAAC
CCATGGCTGG CGAAAGGCGA AGCCAAAATT ATTCTTAACG AGGTCAACTC GCGTGATCCG
AGCAAGCTGA ACGGCTATGT CGAAGTGGCA GGCCGTAAGG CACAGGTGGT CATCGCCAAC
CCGGCAGGCA TCAGCTGCGA CGGCTGTGGT TTTATTAACG CCAATCGTGC CACTCTGACC
ACCGGCACGC CGCAGATGAC CAACGGTGAA CTGCGCGGTT ACCGCGTGGG CAATGGTGAA
ATCGTGGTGG AAGGTGCTGG CATGGACAGC AGCCGTCAGG ATTACACCGA CCTGATTGCC
CGCACCGTGA AGGTCAATGC CGGTATCTGG GCGAAAGATG TGACCGTCAC CACCGGTAAA
AACGAGGTGG CCGCCGATAA CAGCACCGCC ACAGCTACAG CCACAAGTGC GACGGACAAC
GACGTCAAGC CGACGTTGGC GATCGACGTC GCCCAGTTGG GTGGCATGTA TGCCGGAAAA
ATCCGCCTGG TGGCGACCGA GCAAGGGGTC GGCGTCAGCA ACAAAGGGAC TCTTGGCAGT
CAGGCGGGTG ACATCACCAT TAACGCCAAT GGCGATATCG TCAACAGCGG TACGGTCAAT
GCCGGTCAGG ATCTGCTGCT GGGCGGTAAA AAAGTTGACA ACAGCGGTAC CCTATTCGCC
CAGCGCGATC AGAGCACGAC CGCCAGCGAC GCGGTGACGA ATACCGGCCT TATCGCCGCC
AAAGGCAACA CCCATGTGCG CGGTGCCAGC ATCCGCAACA GCAAAGGTGC GGCCATTGCA
GCCGGGATGA AAACCGATGG CACGCTGGCC AATAGCGGCA ATCTGACACT CGCCAGCGAC
GGTAAACTGA CCAGCAAAGG TCAGGCGCTG GCCGGTGGCG ATCTGAACGC CAGCGCCAGC
CAGATTGATC TCAGCGGCAG CGATACCGCC GCGCATCACG CCACTCTGAC CAGTAGCAGC
GATATCATCA CCGACGACGC ACAGCTGCTG GCGAGTGGCG ATTTGAAACT CTCCGCCGCC
GGTAAACTTA GCAACGATCG CGGCGTGATT AACGCCAATA CGCTGCAGGT CAGCACGCCG
GTTATCAGTA ACCGCGGTGG CCAGTTACTG CAGAGCGGCG AGAGCGATCT GCAACTCAGC
ATTAACAGCA TTGATAACCA GAATGGCCGC ATCGCCGCCA ATGCAAAAAA ATTCAGCGTA
CAGAGCGCCA GCCTGAATAA CCAGGGCGGC ACCATTATGG CGGCCGGTTC CGGCTCGCTG
GATGTTGCGG CTAGCAGTGG ACTGAATAAC CAGAACGGTA CGCTGGCCGC CGCCAGCGAT
CTGGCGATCG CCACTCCGGT GCTGAATAAC AATCAAGGAC AGATCTCCGC CAATAAGCAG
TTGACGCTCG ATCAACAAAA CAGCAGCGCC CTCGCCCGCA CAGCAGCTAC CAGCAGCGAT
CTGCGTATCA GCAATGAAGG TGGCCGTCTG GTGGCAGGTC AGCAGTTGAT CTTCCGCGGT
CGCGAGATTA ATGGCAGCGG TGAAATTCTG TCGCTTGGCG ATATGGATCT CAGCTTCGCC
GACGGCTTCA GCAACACCGG CAAAACGCTG GCGAATGGCG ATCTGACGCT GAATGTTAAC
AACAGCCTGA TCAACACGGC GCTGCTTGGC GCAGGCGGTA AGCTGAATGT GCAGGCCAGC
AATATCGACA ACCAGACCAC TGGCGAACTG AGTTCCCAGC AGACCACGCT GAACGCCAGC
GATACGCTGA ACAACCGCGG CCTGATCGAC GGCGTGCTGA CCCGTATCAA CGCCAGCACC
GTCAACAATA TCGGCACCGG CCGTATTTAT GGCGACGGAC TGGCGATCGG CGCAGCGACG
TTGAATAACC TGGCGGAAAA TAACAGCAGC GCCACCATCG CGGCGCGTCA GCGTCTGGAT
CTGGGCATCG GTACGCTGAA TAACCGCGAC CACTCCCTGA TCTACAGCAA TGGTGATATG
GCCATCGGCG GCGCGCTAGA TGCGGACGCG CTGGCGACCG GCAAGGCTGG CGAAATTAAC
AACCACAGTT CGACTATCGA ATCTGTGGGT AATATGGCGC TGAGCTTCAC CGCGCTGAAC
AATATCAATG ACAACTTCGT CACCAGTATG GTGCAGTTGT CCCAGCAGCA GAAAGAGGAG
TACATGGTGG TGGACCTGAG GAATGGCGTC CACTACAGCC CTGACGACTA CAACATCAGC
TTCTACAAAG ATGAAGTGCG CCATATCTGT ATCGAAGGCG TGGTGTGTGG TCGCGACCAT
TACTATCAGT ACAGCTATAC CAAGACCATC AGCGAAGAGC AGATCACCCA GAGCGACCCG
GCGAAAATCA TCGCGGGCGG TCAGATTTCA CTCTCCGGCG ATAAGCTGCT GAACGACAAG
AGCCAGATTA TTGCCGGTGG CACCCTGCTG ACGGCGGTGA AGGAACTGGT GAATACCGAA
GTCACTGGCC AGAAACTGAC GGAAAAAGTC GGTCAGGTCA TCGAATGGGA TCGTATCCAT
AAAAAAGGCA AAGACAGCCA GAAAGCACGC GCTTCAGCCT ATACACCGCC AACGGAAATT
CAGTCCATCA GCCTCAGCCC GAGCGTGATG AAGGAGAATA CTCAGGCAAC CAGCAGCGCA
CCATCGCTGG CTGAGTATGC CACGCAGCGC GTGGAAGTGG CCGGACAGAA CACCGATGTG
ATTCGTTCCA TGACGCCGGA TGCTTCCCTG CCAACCGGCA GCCTGTTCAC CACCCTGCCG
GATGCCACCA GCAGCTATCT GATCGAAACC GATCCGCGCT TCACCAATCA GAGAACCTGG
CTGAGCTCCG ACTATATGCT GAGCCAGTTG CAGACCGATC CGTCGATCAC GCAAAAACGC
CTCGGCGATG GTTTCTATGA ACAACGTCTG GTGCGCGAGC AGATCGTTGA GCTGGTCGGC
CAGCGCTATC TCGCCGACTA CACCAGCGAT GAAGAGCAGT ATAAGGGGCT GATGGAAGCG
GGCGTCAGCT TCGCGAAGAC ATTCAACCTG GTGCCGGGCG TGGCGCTCAC CGCCGAACAG
ATGAAACAGA TCACCCAGGA TATGGTGTGG CTGGTTGCGC AGGATGTGAA GATGCCGAAT
GGCACGACTC AGCGCGTGCT GGTACCGCAG GTGTATGCAC AGGTGCAGCA AGGCGATATG
GACGGCAGCG GTGCGTTGCT GGCCGGTAAG AACGTTAGCA TTGGCATCAG CGGCGGCATG
CTGAACAGCG GACGCATCAG CGCCACACAG CTGGTTAGTG TTTCCGGCGA TGATATCGTT
AACGTCGGCG GCATTATTGC TGGTAAATCC GTGTCGCTGC AGGCGACCAA CGATATCACT
AACACCGGCG GCACGGTGCG TGCCACCGAT ACCCTGCTAG CGCAGGCCGG ACGCGATATC
ACCGTTGCCA GCGAGACTAA CCACGCGGAA AGTCAGAACG GCAGTAACCG CTTCAGCCGC
GACAATATCG ACCGAGTGGC AGGCATGTAT GTGCAGGGTG ACGATGGCAA ACTGCTGCTG
CAGGCAGGCC GTGATGTCAA TCTGCAGGCC GCGCAGGTGG TCAGCTCCGG TGAAAACAGC
CAGACGCAGA TCGCTGCTAA CCGCGATATC AATATGACCA CCGTCACCAC CGGCAGCAGC
GACAAGGTGG TGTGGGATAA AGATAACCAC ATCACGCAGA CGCTGACTCA GGTACAGGGC
AGTGAAGTCA CCAGCGACGG CAATATCTCG CTGAATGCCG GCAATAATAT TAACGCCCAA
GCGGCAAAAC TGAATGCCGA TCGGCAGCTT GCACTGACCG CCACCAATGA TATCAACCTC
GGCAGCGCCA ACAGCCAGGA ATATCTGGAT ATGAACTCGA AGGTGAAAGG TTCCGGCTTC
CTGTCGAAAC GCACCACTAC CACCCGCGCC GGTTACGATG CCACGCTGGC AAACGGCAGC
AGTCTGGGCG GGGAAAATAT CTCCGTCAGC GCGGGTAACA ACCTGAATAT CACCGGCAGC
GATGTGGCGG CGGATCGGGA TCTGGCGCTG CGTGCCGGTA ATGACCTCAA CGTCACGGCG
GCAGAAGAGA GCCGTGACAG CTGGTCGATG AAGAAAACCA CCAAGTCGGG CCTGATGAGC
AGCGGTGGCA TTGGCTTCTT TGTTGGTTCG ATTAAAGAGA GTTCAACCTC CGATACTGCG
GCGCTGACCC ACAACAACAG CACGCTGGGC AGCGTCGATG GCGATACCAG TCTGGTGGCG
GGCAATAATA TCGCCGTGCA GGGCTCTGAT GTGATCGCCG GTAACAACAT CAATATGGTG
GCGAACAATA TCACCATTGA TGCTGCGAAT AACCAGAGCA CCACCGATAC CACTTACGAG
CGTAAACAGA CGGGTCTGAC GCTGGCGCTG TCCGGCGCGA TCGGCAGCGC TATTAATGCC
GCTTACACCA GCGCGAAAGC CGCCGATGAG CAGCAGGATG GCCGTATGGC ATCGCTGCAG
AAGCTGAAAT CAGGTCTGGC AGGCGTACAG GCCGCTCAGG CAGCGGTACT GGCTTCGCAG
AATACCGCAG ATCAGAATGC CATTGGCGTC AGCTTGTCGC TGAGCACCTC AAAGTCGAAG
AGCGAGAGCC ACAGTGAAGC GGTTAACGCT TCCGGCAGTA CCGTGCAGGC GGGCAACAAT
ATCAACCTGG TAGCGACCGG TTCAGAGAAT GGCACCGATG GCGATCTGAC GATTGGCGGC
AGCCAGTTGA AAGCGGGTAA CGATGTGCTG CTGTCAGCGA ACCGCGATAT CAACCTGCTG
TCGGCGCAGA ATACCCAACT GCAGACAGGT AAAAACAGCA GCAGCGGTGG CGGCATCGGC
GTGAGCATCG GCGCTGGTCA GGGCGGCGCG GGGATCAGTG TGTTTGCTAA CGCCAGTAAA
GGTTCCGGTA ACGAAAATGG CGATGGCCTG ACGCATACCG AAACCACCGT CGACGCCGGT
AATAAGCTGA CGGTTAACAG CGGACGTGAC ACCACCCTGC GCGGCGCACA GCTGAATGCC
GATCAGGTAG TAGCCAATGT CGGCCGTAAC CTGACGCTGC AGAGTGAGCA GGACGTTAAT
AACTACGACT CGAAGCAGAA GAACAGCTCA GCAGGTGCCA GCTTTACCTT TGGCAGCATG
ACTGGCAGCG TCAGCGCCAA CGTAGCGAAA GATAAGATCC ACAGCACCTA TAACAGTGTG
CAGGAGCAGA CCGGTATCTT CGCTGGTGAT GGCGGCTTCG ATATCACTGT CGGTAACCAC
ACACAGCTGG ATGGCGCAGT GATTGGCAGC ACCGCCAGCG AAGATAAAAA CCGTCTCGAC
ACCGGCACGC TGGGCTTCAG CAATATCGAC AACAGTGCCG AGTTTGAGGT GTCGCACAGC
AGCGTCGGCA TCAGCACTGG CGGGCTGGGC GCACAGGATC TGCTGAAAAA TGCCGTACAG
AATCTGGCGG CTAATGGTCT GGGAGCGGAT GGCAGCGATG GGAATGCTTC CGGTACGACT
TATGCCGCCG TTTCGCCGGG ATCGCTGATT ATCCGTGACC AGGCGAACCA GCAGCAGGAT
GTCAGCGAAC TGAGCCGCGA TGTTGAGCAT GCGAATCAAA GCATCAGCCC AATCTTTGAT
AAGGCAAAGG AACAGCAGCG TCTGAGCCAG TTGCGGTTGA TTGGCGACGT TGTTAATCAG
GGCGTGGATA TCACCCTGAA CCAGGGCCAG ATTATGGCAA ATAACGCGGG AATTAAAGCG
GCTGGTGAGT GGGATGAGAC CAAAGAGACA CGTCAGGAAT ACTGGGACCG TGTACAAAAT
ACCGCCGCGT ATAAGGACAT CAACGATCAA TACAAAGCGG GTGGCGAGCT GAATAAAGGT
ATTCGTGCCG CAGCGGCAGC CATCACTGCA CTGGCGGGTG GTGATCCACT GAAAGCGCTG
GCTCAGGGCG CTGCACCTTA TCTGTCAAGC ACGGTGCGCG ATTTAACCCT GCAAAATAGT
GACAATCCAA CCGCCGGGCA GATCGCGGCA AATGCCATTG GTCACGCCAT CGTTGGCGGC
GTGGTGGCCG AGCTTTCTGG TTCAAACGCC ACTGCGGGTG CAGTCGGTGC GGCGGGAGGT
GAACTGGCGG CTCGCGCTAT CGTCAACTAT CTCTATCCAA ACAGAACGGT TGAGTCACTG
ACTGATGATG AGCGGGCGAA AGTCAGTAAT CTGGCATCTC TGGCAGCAAC CATGGCGGCA
GGTCTGGCGA GCGATTCTTC TGCAGGCGCG GTAGCAGGAC ATGATGCAGG TAAAAATGCC
GTTGATTATA ACCTGCTGTC CAATAAGTAC GGCGTTGAGA AACTGAGTAA AGAAGGCCGC
GCGCTGTATG AAAAACTAAA AGCCGCGGGT ATCGGTGGTA TGGATGAACT GCAGGAACGC
TTTACTGCCT GTGGCGCTAA TGGTAAATGC CAGACCGATA TCCGCAATGA GTACCGTAAA
TTGGAAAAAG AAGCGGGCGA AAAGCTGGTC GCGATGTATA AATCTGGGGC AATAACGGCT
GATGAGTTCG GCTATCTCGT AACAGATTAT GCCAGAACGA TGATGTATGG TGCGCGACAG
GGCCAGCTAG ATTCTGATTA CTCTGGATTC ATCGGTGACA TCTATACGCA GACAGGCATT
GACTGGACAC CAATGGGCAT CGCTGGCAAT CCGTATGTCG CCGCGATTAA GGGCAGTGAA
CAACTTGCCG AATGGAAAGC ACAGGGACTG AGCGATGAGA AAATCCGTGA GCTGGCACTG
AAAAACGATA TCATTAGCTC AGCCCTAACA CCTGTAGATA TCAATGGCAT ACTGAGCTTA
TACGATAATG GTGCATCTGC GCAGGACGTT GTAAAATTTG CTTCTGGCAT GGTGTTCAAT
AAAGTCGTAC AAAAAACCCA GGCGGGTACA GGGAAAGGGA ATGTTTTAAA TCAGGCAGAT
AAAGTTGCAG CGCAGGCGCA GCAAGATCTG CTGGATAAAA TCAAGAGCTT CCCGTCGAAA
ACCCAGGCGA ATAAAACCGC CACCATGGTT GGCGCCTATG ACCCAGTGAC AGGTAAAACC
GCGGTAGGAA GCAGTAATGC CAGTATCACA GCAGATGCAC TGGATCCTAA AACTGTTGCC
TACATTGAAA AACAGTTAGG CGTCAAAATC GGTGAGTTTA CGAGTTTTTG TAAGAACAAA
GCAGGAGCCT GTGCTGAAGT TTCCGCAGCA GACCAACTGG TTCGTCAGGG CGTATCTCCG
GAAAATGTTA AATTCACGGA TGCCGTCAGG CCAAGAGCGG TGTACGACGC AGGTACAGTG
ACACCTGAAT CGGTCATAAA ACCTTGCGAA AATTGCCAGG TAACATGGCC TAAAGGAAAA
TAA
 
Protein sequence
MNKNLYRIIF NKARGLLMVV SEINRGQGKS GANGVGHTLS QLIGSMKPAA FLTMTALGLV 
TLAPQALAAG IVADKSAPGG QQPNVMQSAN GTPQVNIQTP SAGGVSHNKY TQFDVDNKGA
ILNNSHKQVQ TQMGGWVAGN PWLAKGEAKI ILNEVNSRDP SKLNGYVEVA GRKAQVVIAN
PAGISCDGCG FINANRATLT TGTPQMTNGE LRGYRVGNGE IVVEGAGMDS SRQDYTDLIA
RTVKVNAGIW AKDVTVTTGK NEVAADNSTA TATATSATDN DVKPTLAIDV AQLGGMYAGK
IRLVATEQGV GVSNKGTLGS QAGDITINAN GDIVNSGTVN AGQDLLLGGK KVDNSGTLFA
QRDQSTTASD AVTNTGLIAA KGNTHVRGAS IRNSKGAAIA AGMKTDGTLA NSGNLTLASD
GKLTSKGQAL AGGDLNASAS QIDLSGSDTA AHHATLTSSS DIITDDAQLL ASGDLKLSAA
GKLSNDRGVI NANTLQVSTP VISNRGGQLL QSGESDLQLS INSIDNQNGR IAANAKKFSV
QSASLNNQGG TIMAAGSGSL DVAASSGLNN QNGTLAAASD LAIATPVLNN NQGQISANKQ
LTLDQQNSSA LARTAATSSD LRISNEGGRL VAGQQLIFRG REINGSGEIL SLGDMDLSFA
DGFSNTGKTL ANGDLTLNVN NSLINTALLG AGGKLNVQAS NIDNQTTGEL SSQQTTLNAS
DTLNNRGLID GVLTRINAST VNNIGTGRIY GDGLAIGAAT LNNLAENNSS ATIAARQRLD
LGIGTLNNRD HSLIYSNGDM AIGGALDADA LATGKAGEIN NHSSTIESVG NMALSFTALN
NINDNFVTSM VQLSQQQKEE YMVVDLRNGV HYSPDDYNIS FYKDEVRHIC IEGVVCGRDH
YYQYSYTKTI SEEQITQSDP AKIIAGGQIS LSGDKLLNDK SQIIAGGTLL TAVKELVNTE
VTGQKLTEKV GQVIEWDRIH KKGKDSQKAR ASAYTPPTEI QSISLSPSVM KENTQATSSA
PSLAEYATQR VEVAGQNTDV IRSMTPDASL PTGSLFTTLP DATSSYLIET DPRFTNQRTW
LSSDYMLSQL QTDPSITQKR LGDGFYEQRL VREQIVELVG QRYLADYTSD EEQYKGLMEA
GVSFAKTFNL VPGVALTAEQ MKQITQDMVW LVAQDVKMPN GTTQRVLVPQ VYAQVQQGDM
DGSGALLAGK NVSIGISGGM LNSGRISATQ LVSVSGDDIV NVGGIIAGKS VSLQATNDIT
NTGGTVRATD TLLAQAGRDI TVASETNHAE SQNGSNRFSR DNIDRVAGMY VQGDDGKLLL
QAGRDVNLQA AQVVSSGENS QTQIAANRDI NMTTVTTGSS DKVVWDKDNH ITQTLTQVQG
SEVTSDGNIS LNAGNNINAQ AAKLNADRQL ALTATNDINL GSANSQEYLD MNSKVKGSGF
LSKRTTTTRA GYDATLANGS SLGGENISVS AGNNLNITGS DVAADRDLAL RAGNDLNVTA
AEESRDSWSM KKTTKSGLMS SGGIGFFVGS IKESSTSDTA ALTHNNSTLG SVDGDTSLVA
GNNIAVQGSD VIAGNNINMV ANNITIDAAN NQSTTDTTYE RKQTGLTLAL SGAIGSAINA
AYTSAKAADE QQDGRMASLQ KLKSGLAGVQ AAQAAVLASQ NTADQNAIGV SLSLSTSKSK
SESHSEAVNA SGSTVQAGNN INLVATGSEN GTDGDLTIGG SQLKAGNDVL LSANRDINLL
SAQNTQLQTG KNSSSGGGIG VSIGAGQGGA GISVFANASK GSGNENGDGL THTETTVDAG
NKLTVNSGRD TTLRGAQLNA DQVVANVGRN LTLQSEQDVN NYDSKQKNSS AGASFTFGSM
TGSVSANVAK DKIHSTYNSV QEQTGIFAGD GGFDITVGNH TQLDGAVIGS TASEDKNRLD
TGTLGFSNID NSAEFEVSHS SVGISTGGLG AQDLLKNAVQ NLAANGLGAD GSDGNASGTT
YAAVSPGSLI IRDQANQQQD VSELSRDVEH ANQSISPIFD KAKEQQRLSQ LRLIGDVVNQ
GVDITLNQGQ IMANNAGIKA AGEWDETKET RQEYWDRVQN TAAYKDINDQ YKAGGELNKG
IRAAAAAITA LAGGDPLKAL AQGAAPYLSS TVRDLTLQNS DNPTAGQIAA NAIGHAIVGG
VVAELSGSNA TAGAVGAAGG ELAARAIVNY LYPNRTVESL TDDERAKVSN LASLAATMAA
GLASDSSAGA VAGHDAGKNA VDYNLLSNKY GVEKLSKEGR ALYEKLKAAG IGGMDELQER
FTACGANGKC QTDIRNEYRK LEKEAGEKLV AMYKSGAITA DEFGYLVTDY ARTMMYGARQ
GQLDSDYSGF IGDIYTQTGI DWTPMGIAGN PYVAAIKGSE QLAEWKAQGL SDEKIRELAL
KNDIISSALT PVDINGILSL YDNGASAQDV VKFASGMVFN KVVQKTQAGT GKGNVLNQAD
KVAAQAQQDL LDKIKSFPSK TQANKTATMV GAYDPVTGKT AVGSSNASIT ADALDPKTVA
YIEKQLGVKI GEFTSFCKNK AGACAEVSAA DQLVRQGVSP ENVKFTDAVR PRAVYDAGTV
TPESVIKPCE NCQVTWPKGK