Gene CHU_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1075 
Symbol 
ID4184352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp1243716 
End bp1251788 
Gene Length8073 bp 
Protein Length2690 aa 
Translation table11 
GC content41% 
IMG OID638071074 
Productbeta-glycosidase-like protein 
Protein accessionYP_677692 
Protein GI110637485 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.243053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAGA AATTTAATCT AAAGCTTATT GTGTTTTTGA TTGCCTGTCT GTACAGCGCG 
ATGCTGTCGG CGCAGATCAA TACACCATCC GGAGCTGTGG TACCATTTAA CTCGAATCCG
AATTATGGTG GTAACGGTAT TATGCCAACC AACCTGCCAA CCACCGGAAC ATATGGTAAA
TCGCAGGATG CGGCAGATGC CTATAATGAG TGGAAAGCTG CCTATACAGA AACATGTACA
GGTGATATGA TCCGTATCAA ATTTGATGAG CCGAATCGTA CCGTGTCTGA AGGTATTGCG
TATGGTATGC AATTAGCTGC CTATGCTGCG GATAAAGCGT TATTTGATGG TTTGTATAAA
TACTGGAAAA ACTTCCAGTC TCCGAATTCA TCCGGTAAAG CCGGTAAGTT AATGAACTGG
AGAATCAATG GCTGTTCAGG CGTGAGTGGT ACAGGCAGTG CTGCCGATGC TGACGTTGAT
GCTGCATGGG CATTAATGAT CGCAGAAACT CAATGGCCGA ATCTGAATAC ACCTTATGAT
TATATCACTG AAGCAAACAA TATGCTTAAT GCCATTAAGG AATTGGAAAT GCTGAATGGT
CAGCTTATCA ATGGTGATGG CTGGGGATTT GCTGATCAGT GCCGCAACCC TTCGTATCAG
TCTCCTGCAT ATTATGAATA CTTCAAAGTT GTAAATGCAG GTAACTCAGG TACATGGAAC
GGTGGAATTA CAGGAGCCTA TAATCTGATT AATGCAAACG CTGACGTTAC TACCGGTTTA
ATTTCGGATT GGTCAAATCC AAGCGGTGTT AGAAATACCT GTAACCCGGG TGGACTTGGA
CAGGCAGCAA CAGATGGTTA TGGCTATGAT GCCTGTCGTA ACCCATGGAG AATGGCACAG
GATGTAATCT GGAACAACAA TGCCAACGCA AAAACGATTT GCGGCAAAAT TACCGATAAG
TATATCAATG TAAAAGGTCC CGGTGGCGTA GGTGGTCCGT TATATCAGAA TGGTAATAAC
TATGCCGGTT TCTCACACAA TGCAACATTC GTTTCCACAT TTGCAATGGC TGTTATGGGT
TCAACCAACC AGGCGTTGAT GAACTCCATG TATACAGAGA CAAAAAATAC AAAAGATGTA
ATTAAAAACT CTACACTTTC CGGTTATTTC GGTAATACAT TGCGCTGCGT GTCTTTATTT
ATGATGACAG GTAACTTCTG GAAATACGGA ACAACTTCTT TCCAGGATAT AAATGTTCGC
AGAGGTACAA CTACTGTAAG TGTATTGTCA GGAACAACGT ATGATTTCCA GACACAGCAA
ATAAACACAG GCGGTAAAGT TGGAAACTTT ACGATTGAAA ACCTTGGTTT TGCATCGTTG
ACATTATCAG GATCACCAAT AGTTGTTTTA AGCGGTACAA ATGCTGCCGA CTTTATTTTA
ACGCAACCAA CAGCTACTTC GTTAGCATTG AGCCAAACAA CAACTTTTAC GGTAACATTT
AAACCAACTA CAGTAGGGGA TAAAACAGCT AAATTAACGA TAGCGAGTAA TGACCCGGAT
GAAGCTTCGT ATGTTATTAA CTTAACCGGT ACAGGTACAT TAAATGCAAC TGCTCCCAAA
ATGTCTGTAT TTGATGCAAG CACCGGAGCA GCCGTAACAA ATAATGGTAC AGTTTCAATG
GGAACTGCGT CTGCAGGTTC AGCCAATTTA AAACGGTTTG GTATTGTAAA CACAGGTGAT
GCTGCGTTAA ATATTCCGCC TACAGGTACT GTTGGTTATA CTGTAACCGG TACAGGTTAT
TCACTTGTAA CAACAACGCC CGGAGCTGTA GCACCAACAA TCGTTGCAAT CGGTGATACA
GGATATGTGT ACGTTGAAGC AACAAGCTTA ACTGCCGCAA ACCTTTCCGG TTCATTAACG
ATTATATCGG ATGACAATAT CAATCCTTCT TTCAAGATTA ACCTGACATC AGCTTTTGTG
GCATGTGCCA CGGCTGTTAC TACAAACGAT GTGTATCAGG ATTATAATGC GAATGCCAAA
AATTCAACTG CCGGTACAAT CTCCGGTTTT ACTGAAAGTG TAGACAATTT ATATATATCA
GAAAAAAACC CTTCCACGAA AGTAGGCAGA TTGGTTCGTA ATACTTCAGA ATACCAAGGG
CCAAGATATA CCTTATGTGG TTCAACAGTA AATCTAACCG CAACTAAATT TGCGATCAGT
ATGCTGGTTT ACTCACCGGC AGCAGGTATT CCTATTAAGA TGGGATTAAA AACGGATGCA
GATGTTGCTG ATCTCAGCTA CCCGGAAAGA AGCGGTGTTA TTGTTACTAC AACAAAAGCA
AATCAGTGGG AGCGACTTTA TTTCTATCAT TCCGGAGCTA TTGGTGTTAC AGGTATCCGC
CACATAGAAA TATACATTAA TCCTACTGCA CCTATTTCAG CCGGAACCTA TTACATTGAT
GATATCAGAT TAGAGTCTTC ACCATGTCTG ACTGATAATT CAGGTATAAT TCAGGATTTT
AATGATCACA ATAACGTAAC CCTGTCTTAT CCGGTTGCCG GTTATGAGCC AGGTGCGGTA
AACGATAAAG CAGTTGGATT AAATACCAGC ACAAACATTG GTAAATTTAC AAAGTCTGCA
GCTACGCTTG CTTACAAAGA CGGTATACGT TACATCGGTT GCGGCGGTAA ATTTGACATA
TCTACTAAGA AGTATGTCAG TATGCTGGTT TATTCAACTG TTGCAAATGC AAAAATTAAT
ATGTCACCTA AAATAGGGGC AAATGATGCA TTGGTTTCAG CTCCATCAAC AGTAACCGTT
TTTGCAAATC AATGGCACCG TATATACTTT GATTTATCTG CAGTTTCTGC TACGGACATT
CCTAATATAA CAGGTATAGA TATTTTCTTT GATCCGTTAA ATGATCTTGG AGCCCAAACA
TATCGATTTG ATGATATTCG TTTTGAAAGT GCTTTACCAT GTATAGCGGG TATTGATGCA
ACAGAGATTT TAAATGACTT TGAAAACAAC AGATTCCTTG GTGTTGCTTT TCCTGGAGAA
ACTCCGGTTA CAGGTGAGGT AACTTACTTT AACACGGTTT CTGCTAACCC AAGTGCTACA
GGTGCTAATA CAAGTACCGT TGTAGGTAAA TTCTTAAGAG GAACTGCTTC TACAGGAACA
TCTTTCCGCT TTACAGCTTG TCAGAGTAAA TTAAGTTTAA CTCCGGGAAG AGCAATAATT
GATTTGAAGA TGTACTCTCC AAATGCAAAT GTTGCAGTTG TAATGTCGTT AAAAAATGCA
GCTGGTGTAT CGCTAAGCGA TGTAACAGAT ACAATTAAGC TTGCTAATAC ATGGACAAAC
TTACGTTTTG ACCACAGTAA ATTATTGAAC TCAACAGAAA TTGCCTTTAT TGATATCATT
GTTGACGGTG CAGTTATTTA TTCAGGTACT TCTTCGACGG CAACAGCCCG AACGTATTAT
GTAGATGATC TGAGATATTC TCTTCCTGCA CCTGAAATTA ATATTCAGGC AAATACAACT
CCTGCATTAA CGGATATTCC TACAAGAGGA GCATTTAATA TGGGTACTGC TTCCATAGGT
GATAGCACTA CGATGGATTT CAAAATCCAG AACAACGGTT TAGAAACATT AACATTGACA
GGCACCGGTG CTGCAGCATT AGTGATAGGC GGTGCAAATC CTGCTGATTT CATTATTTCT
ACAACACAAT CATTCTCAAC TTCTATCAGT GGTTTAAGTG TAACTGGCTT TACCGTTAAA
TTTAAACCTA CAGCAGGAGG CGCACGTTCT GCATCAATCA CAATTGTGAG CAACGATGCA
AATGAAAGTC CATATATCAT TTATTTAACC GGTACAGGAA CAGTACCTGT GGTAAGTGTA
TTAAATGGAG GAACAACGAC TTCAACGGTT ATACCAAATA ATAACCCGAC GGCTATCAAT
GTAGGTACAT CCGCTGTGGG TACACCTGCA ACGCCTGCTT ATACTTTCAG TATTAAAAAT
ACGGGTGTTG GTCCGTTGAC TGTAAAATCC ATTACAGCTT CATCTACATC ATTATTTACA
GTATCCACAT TGTCTTCTTC TACACCAATT CTGACAGGTG CCATTGCTAC CTTTACGGTA
ACAGGTACAC CGGCAGCAAC TGGTTTAAAT ACAGGCTTCC TGACTATTGT AACAAATGAT
CCTGCAACAC CTTCTTATAA AGTAAATATC ACAGTTACGG GCAGTGTTCC GGTAATCTCG
GTTTCAGATG CCACACCTGC AGTTGTTATT AATAATAATA CAACACCGGT TTCAGTTGGT
TTTGCACCAG TAAATACAGA TGCTGCAGCC TATACATTTA CGATCAATAA CACAGGAAGT
GCTCCGTTAA CAATTACTTC CATATCAGGA TTACCTACAA CGGTATTTGC TATTAGTGGT
GTTCCTGCAA CGGCAATTGC TGCAGGCGGA AGCGCAACCT TTAAAGTAAC GGGTAAACCG
GCAGCAGCTG GTGTAAATAC AGGTTCAATT ACAATTGTTA CAAATGATCC TGTAACACCG
TCTTATAAAG TAAATGTAAC GGTTACAGGT ACTGTTCCGG TAATCTCGGT TTCAGATGCC
ACACCTGCAG TTGTAGTGAA TAATAATACA ACGCCGGTTT CAGTTGGTTC AGCACCTGTA
AATACAGATG CACCAATCTA TACCTTTACT ATTAATAACA CAGGTCTGGC TCCATTAACA
ATCACTTCAA TATCAGGGTT ACCTACAACG GTATTCGCTA TTAGTGGTAT TCCTGCAACA
CCAATCGCGG CAGGCGGCAG TGCAACGTTT AAAGTAACGG GCAAACCGGC AGCAGCTGGC
GTAAATACAG GATCTATTAC CATTGTTACA AATGATCCGG TTACAGGCTC ATACAAAGTA
AATGTTTCTG TAACAGGTAC TGTGCCTGCA TTACAGGTAT TAAACAGTGC AACAGCTGTA
ACATCTGACA ATGCACCGGC TATTTCTGTT GGTTCATCTA CGGTTGGTAC TTCTGCAGCA
GCTTATACGG CGTTCTCTGT TAAAAATGCT GGTTTAGCTC CGCTTACATT TACATCGATT
ACAAGTTCTT CTGCAGACTT TGTAATTTCA GCTGTAACAC CTGCAACACC TGCATCTATT
GCAGCGGGCA ATTCAGCAAC ATTCACCGTT ACAGCCAAGC CGGCAGTTGT AGGAGCTAAT
ACTGCTGTTA TAACTATTGT AACAAATGAC CCAACAAACC CTACCTTCAA ATTAAATGTA
TCGGTTACAG GTATTGCTGC ATCAACACCT TCTGTACAGG TATTGAATGG CGCATCTATC
GTAACAGATA ACTCTACGGC AATACTGGTA GGTACTGCTC CTGTAAATAC AACGGCAGCA
CCTTTCACAA CGTTCTCAAT TAAAAATAAC GGAACAGCTG CTTTAGACTT CACATCGATT
ACAAGTTCTT CTGCTGATTT CGTAATTTCA GCGGTAACAC CTGCAGCACC AACATCTCTT
GCAATAGGTG CATCTGCAAC ATTTACAGTA ACAGCCAAAC CTATGGTACT TGGAACTGGA
AATACAGCGG TAATCACTAT TGTTACAAAC GATCCGACAA CAGCATCGTT TAAGTTAAAT
GTAACGGCTA CAGGTACAGC ACCCGCTATT CAGGTATTTG ATGGTGCCGG TACAACGACT
CAGATCACAT CAGATAACGT AACTGCCATT TCATTAGGCA CAGCTACGGT GACTGAAGCG
GCAACGCCTT CGCACACATT TACAATTAAA AATAACGGTA CAGCTCCTTT AACAGGACTT
GCATTAACAG CAACGGCAGG TTTTGAAGTT TCTGCTTTAA CACCGGCAGG TACTTCGCTT
GCACCAAATG CAACGGCTAC GTTTACCGTA ACAGGTACAC CTGCTTTAGT TGGAGCAAAC
ACGGGTTCGA TTACAATTGC TTCAAATGAC GGAACAACTC CGGCATTCAA GATCAATGTA
AGCGTTACAG GTACTGCTGC TCCTGCAAAA CTACAAGTAG TAGATGGAAC AAGTATTTTA
ACATCTAACG GTACTGCAAT TTCACTTGGA ACAGCAGTGG TAGGTTCAGC TGTACCGGAT
TACACTACCA TATCTATTAA AAATACTGGC GGATCACCGC TTACGTTTAC ATCCATTACA
AGTTCTTCTG CTGTATTTGT TTTAGCAGAT GTAACGCCAG CAGCTCCCGG AACAATAGCA
GCAGGTGCTT CTGCAACGTT TACAATAAAA GCGACACCGG TATTAGGTGC TAATACAGGA
AAGATTACGA TTGTAACAAA TGACGCAACT ACACCTTCAT TCGTTATTAA TGTATCTGCA
ACGGGTACAT CTAATCCGGT ACCTGCCGTT CAGGTACTTG ATGGTTCTAC TCAATTGGTG
ACAAACGGAA CAGCCGTATC ACTTGGTACA GCTCCTGTAG GTACAGATGC TCCGGCATAT
ACAACGTTAT CTATAAAAAA TAACGGTACG GCTCCGCTTA GCTTTACATC AATAACAAGT
TCATCTGCCC AATTTGTAAT AACGGATGTC AGTCCGGCTG CACCTACTAC ATTAGCTGTG
GGCGCTTCGG CAACATTTAC AGTAACGGCT AAACCATTAG TGGTTGGTAC TGCAAATACA
GCTAAGCTTA CAATTGTAAC AGACGATCCT GCAACAGCAT CATTTGTTGT AAATGTTAGC
GTAACAGGTA CACCTGCTCC ATTGCCATCA TTACAGGTAT TAAATGGTTC TGCGATTGTT
ACAAATAATA ATGCGACACC TATTTCATTA GGCACGGCTG CGGTAGGTTC ACCTGCTCCG
GCATACACGT CGCTGTCAAT TAAGAATAAC GGATCCGCTC CGCTTACCTT TACGTCAATA
ACAAGTTCAT CTGCACAATT TGTAGTATCT GGTGTAACGC CGGAGGCACC TACAACATTG
GCTGTCGGAG CTTCTGCAAC GTTCACGGTA ACAGCTACTC CACTTGCTGT TGGTACAGGA
AATACAGCGA AGCTTACCAT TGTAACAAAT GATCCTGCAA CACCATCTTT TGATGTTAAC
GTAGCGGTTA CAGGTACGCC AGCTCCGAAA CCGATAATTA CAGTAACCGG TGTCACAAAT
AACGGATCTA CGGTTTCTAT CGGAAATGTG ACAGAAAATA CTGCAACGGC TCCTTATACA
TTTACGATAT CCAATACAGG CTCATTACCG CTGGAAGTTG GATCAATTAC ATCTTCGAAT
CCGGTGTTTG TTGTTACGCA GGTAAATCCT ACTACAATCG CTCCGAATTC TACAGGTACG
TTCACCGTAA TTGGTACACC AACAAGTGTT GGTACAGTAA CCGGAAAAAT TACGATTCCA
AGTAATGATA TCAGTACACC TTCATTTGTA ATCAATGTAA GTGTAACGGG TAAAACAGCC
CCTTCAGGAG CAATACTTGA AATATTGAAT AAATCTGCAA TCGTAATGGT TAACGACGGT
ACTCCAATGA TGATTGGTGC TAATTCAATC AATACGGTTA CAGCGCCTTA CCAGTTTACA
CTTACAAACC CTGGTACAGT GGCAGTAAAC ATTGGAGGCA TAACGGCTAC TACAGGATTT
ATTGCAACTC AAATGAATCC TACCGGTACA ACACTTGCTC CTGGTGCAAT TGCAACATTC
ACCGTAAGAG GTACACCAAA CAGTGTGGGT ACACCAAGAC AGGGTAGTGT TATTATAAAA
TCGAATAATG CAGCGGGTGG AGATTTCATA TTAAACGTTT CAGTTGAAGT AGGTACACCT
ACAGGTGTTG CTTCTGCATT GGCTGCTACA GAAATTGATC TGTTCCCGAA CCCGTCTACA
GGCTCAACGA ACTTAGAATT CAATGGTTCA TTTAATGAGG TAGCTGTAAC AATCTACACT
ATTGATGGAA ACAAAGTATT TGCAAGTGAG TATGCTTCTG TTGATCCGGG TGCATTAAGA
ACACTTAATG TTGAAGAACT ACCTTCAGGT ATTTATATCG TAGAGGTATC TACAGCAGAA
GGTAAATTAG TTAAGCGTTT GATTAAGCAA TAA
 
Protein sequence
MIKKFNLKLI VFLIACLYSA MLSAQINTPS GAVVPFNSNP NYGGNGIMPT NLPTTGTYGK 
SQDAADAYNE WKAAYTETCT GDMIRIKFDE PNRTVSEGIA YGMQLAAYAA DKALFDGLYK
YWKNFQSPNS SGKAGKLMNW RINGCSGVSG TGSAADADVD AAWALMIAET QWPNLNTPYD
YITEANNMLN AIKELEMLNG QLINGDGWGF ADQCRNPSYQ SPAYYEYFKV VNAGNSGTWN
GGITGAYNLI NANADVTTGL ISDWSNPSGV RNTCNPGGLG QAATDGYGYD ACRNPWRMAQ
DVIWNNNANA KTICGKITDK YINVKGPGGV GGPLYQNGNN YAGFSHNATF VSTFAMAVMG
STNQALMNSM YTETKNTKDV IKNSTLSGYF GNTLRCVSLF MMTGNFWKYG TTSFQDINVR
RGTTTVSVLS GTTYDFQTQQ INTGGKVGNF TIENLGFASL TLSGSPIVVL SGTNAADFIL
TQPTATSLAL SQTTTFTVTF KPTTVGDKTA KLTIASNDPD EASYVINLTG TGTLNATAPK
MSVFDASTGA AVTNNGTVSM GTASAGSANL KRFGIVNTGD AALNIPPTGT VGYTVTGTGY
SLVTTTPGAV APTIVAIGDT GYVYVEATSL TAANLSGSLT IISDDNINPS FKINLTSAFV
ACATAVTTND VYQDYNANAK NSTAGTISGF TESVDNLYIS EKNPSTKVGR LVRNTSEYQG
PRYTLCGSTV NLTATKFAIS MLVYSPAAGI PIKMGLKTDA DVADLSYPER SGVIVTTTKA
NQWERLYFYH SGAIGVTGIR HIEIYINPTA PISAGTYYID DIRLESSPCL TDNSGIIQDF
NDHNNVTLSY PVAGYEPGAV NDKAVGLNTS TNIGKFTKSA ATLAYKDGIR YIGCGGKFDI
STKKYVSMLV YSTVANAKIN MSPKIGANDA LVSAPSTVTV FANQWHRIYF DLSAVSATDI
PNITGIDIFF DPLNDLGAQT YRFDDIRFES ALPCIAGIDA TEILNDFENN RFLGVAFPGE
TPVTGEVTYF NTVSANPSAT GANTSTVVGK FLRGTASTGT SFRFTACQSK LSLTPGRAII
DLKMYSPNAN VAVVMSLKNA AGVSLSDVTD TIKLANTWTN LRFDHSKLLN STEIAFIDII
VDGAVIYSGT SSTATARTYY VDDLRYSLPA PEINIQANTT PALTDIPTRG AFNMGTASIG
DSTTMDFKIQ NNGLETLTLT GTGAAALVIG GANPADFIIS TTQSFSTSIS GLSVTGFTVK
FKPTAGGARS ASITIVSNDA NESPYIIYLT GTGTVPVVSV LNGGTTTSTV IPNNNPTAIN
VGTSAVGTPA TPAYTFSIKN TGVGPLTVKS ITASSTSLFT VSTLSSSTPI LTGAIATFTV
TGTPAATGLN TGFLTIVTND PATPSYKVNI TVTGSVPVIS VSDATPAVVI NNNTTPVSVG
FAPVNTDAAA YTFTINNTGS APLTITSISG LPTTVFAISG VPATAIAAGG SATFKVTGKP
AAAGVNTGSI TIVTNDPVTP SYKVNVTVTG TVPVISVSDA TPAVVVNNNT TPVSVGSAPV
NTDAPIYTFT INNTGLAPLT ITSISGLPTT VFAISGIPAT PIAAGGSATF KVTGKPAAAG
VNTGSITIVT NDPVTGSYKV NVSVTGTVPA LQVLNSATAV TSDNAPAISV GSSTVGTSAA
AYTAFSVKNA GLAPLTFTSI TSSSADFVIS AVTPATPASI AAGNSATFTV TAKPAVVGAN
TAVITIVTND PTNPTFKLNV SVTGIAASTP SVQVLNGASI VTDNSTAILV GTAPVNTTAA
PFTTFSIKNN GTAALDFTSI TSSSADFVIS AVTPAAPTSL AIGASATFTV TAKPMVLGTG
NTAVITIVTN DPTTASFKLN VTATGTAPAI QVFDGAGTTT QITSDNVTAI SLGTATVTEA
ATPSHTFTIK NNGTAPLTGL ALTATAGFEV SALTPAGTSL APNATATFTV TGTPALVGAN
TGSITIASND GTTPAFKINV SVTGTAAPAK LQVVDGTSIL TSNGTAISLG TAVVGSAVPD
YTTISIKNTG GSPLTFTSIT SSSAVFVLAD VTPAAPGTIA AGASATFTIK ATPVLGANTG
KITIVTNDAT TPSFVINVSA TGTSNPVPAV QVLDGSTQLV TNGTAVSLGT APVGTDAPAY
TTLSIKNNGT APLSFTSITS SSAQFVITDV SPAAPTTLAV GASATFTVTA KPLVVGTANT
AKLTIVTDDP ATASFVVNVS VTGTPAPLPS LQVLNGSAIV TNNNATPISL GTAAVGSPAP
AYTSLSIKNN GSAPLTFTSI TSSSAQFVVS GVTPEAPTTL AVGASATFTV TATPLAVGTG
NTAKLTIVTN DPATPSFDVN VAVTGTPAPK PIITVTGVTN NGSTVSIGNV TENTATAPYT
FTISNTGSLP LEVGSITSSN PVFVVTQVNP TTIAPNSTGT FTVIGTPTSV GTVTGKITIP
SNDISTPSFV INVSVTGKTA PSGAILEILN KSAIVMVNDG TPMMIGANSI NTVTAPYQFT
LTNPGTVAVN IGGITATTGF IATQMNPTGT TLAPGAIATF TVRGTPNSVG TPRQGSVIIK
SNNAAGGDFI LNVSVEVGTP TGVASALAAT EIDLFPNPST GSTNLEFNGS FNEVAVTIYT
IDGNKVFASE YASVDPGALR TLNVEELPSG IYIVEVSTAE GKLVKRLIKQ