Gene Caul_1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1857 
Symbol 
ID5899312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1981842 
End bp1990388 
Gene Length8547 bp 
Protein Length2848 aa 
Translation table11 
GC content65% 
IMG OID641562347 
Productouter membrane autotransporter 
Protein accessionYP_001683484 
Protein GI167645821 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.150379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACGC GTTCGAGCAG CAAGAGAACG ATCCTCGCCG GTTCATCCCT GCTCGTCATG 
GCGATTGCGG CCGCCCAGCC GGCGCTGGCG CAAGACACCA CAATCTCCAG TTCCATTACC
ACAGGGTCCA CCTGGAACGG CACAACGGCG CCGGGCAGCA ATTTTACGGT CACGTCCAGC
GGCGCGATCA ACACCGCCGA TCCCGCGGCG ATCACGGTCG CCGGGACGGT CGGCGCGCTG
CTCGGCACGC TGCTGAACAG CGGTCAGATC CTTGACGCGG CGGCCTCGGG ATACGGGATC
AGTATCGGGT CGGTCAGCGT CGGCGTCGTC AACAACAGCG GCCTGATCAG CGGCGCGTCG
GCGATCTACA ATACCGGAAC CCTCGGGGCG GTCAGCAACA CCGGCACAGT GTCTGGGACA
GCGCGGGGTA TTTATCTATC CAGCACGGGG TCGATTGGCT CGCTCAACAA TAGCGGGCTG
ATCGCGGCGG GCACACTGGG TTCCATGGCC GCGGGCGGTA TCGCCAACAG TGGCCGCATC
GGCACGATCG TCAATAATGG GACGATCATC AGCGCCCACA GCAGCGGCAT CGGCAATAAC
CAGAGCACCG GTTCAATCGG GCAGCTTGTC AATAATGGCC TGATCAGCGG CGTCACCGGG
GGGATCTGGA ACCAGGGCAC GTTCAGTCAG ATCACCAACA CCGGCCAGAT CGTCGGAACC
CAGGCCGGTA TTTTCCAGGT CGCTGGCGTG CTCAGCGCCC TCGACAACAG CGGCACGATC
AGCGGCGGCT CATACGGTGT CTCCATCGGC GGCGGCAACG TGGGGACCCT TGTCAACAGC
GGTCTTCTTT CGGGACCCAT CGCCCTGACC ATCGCCTCTG GAGCCACACT CGGCGGCGTC
GCCAATAGTG GCGTGATCGC GGGCAATATC GTCAATCGCA GCGTCAATGT TCTCACCATT
ACCGGCGGAA CCGGCACCCA GATTGGCACG CTGACGGGCT CTACGCTCAC CAATCAGGGC
AGCATTATCA ACACGGTGTC GAACCTGGTG TTCGGCAGCG GCAATCTTTT GCTGAACGAC
AACATCAATG TGACCGGTCG CACGGTCAGC AATGTCGGCG CCGCTCTGTC TTTGAACAGC
ATCGTCAACG TTGCCGGCGC CTTCAGCCAG ACGGCGGGCT CGCTTCAGAT CGACCCCATG
ACGGGCGGCC TGATTGTCAG CGGCGGCGCC ACCTTCAGCG GTGGAACAGT CAAGTCCACC
TTCCAAAGCA CCGGCAACTA CCTGGCGGGC AGCTACACCC TTCTCAACGC CTCCAGCCTG
AACCTGACCG GCGCGACGGT TTCGATCAAC AGCCTCACCG GGCTCTACAA GTCGACCAGC
ACCGTGGGCA ACAAGCTGCT GCTGTCGGTG AACAACGACT ATATCGGCGG CGCGCTGGCC
TCGCTGACCA ATGCGGCCGC GCTCACCAGC GCGAGCACCG GCCTTTACGT GGCGACGACG
GGCAGCATCG GCACGCTGAC CAACACCGGC ACCCTGGGCG GCGGCCAGTT CGGGATCAAC
AACCGCGGCA CGATCGGCAC GCTGCTCAAC GCCGGCCGCG TCACGGACAA CAGCTTCACG
GCGCTCTGGA ACCAGGGCAG CATCGGCCAA CTGATCAACA CCGGCACGAT CATCAACAGC
AGCTGGGCGA TCCTCAACAC CGGCAACCTG GCGACCCTGA CCAACAGCGG CCTGATCAGC
GGCGCCGGTA ACGCCATCCA GAACAGCGGC GTCATCACGC TGGTCAACAA CAGCGGCACG
ATGTCGGGCG GCAGCAACGC CATCGCCGGC AATTTCGGGA CCATCGTCAA CAGCGGCGTG
ATCCTCGGCG ACATCAACAA GTATGGCGGC TCGATCGGCG CCATCATCGG CGGAGCCAAC
GGCACGATCG GCACCCTGAC CGGCTTCACC GTTGGGACCC AGGGCACGAT CGGCGCCTCC
AGCAACCTGA CCTTCGCCTC TGGCGCGCTG CTGCTGAACG ATTCGATCAC CGCCGTCGGC
ATCGCCGCCA GCGCCCTGAC CGTCAGCAAC ACCGGCGCTG ACATCACGCT GGCGACGATC
GTCAACGTCA ACGGCAATTT CCGCCAGACC GCCGGCACGC TGACCCTTGG CGGCACCGGC
AAGCTGGTCG TCACCCAGGC CGCCAGCCTC ACCGGCGGCG CCATCACCAC CAGCACCGGC
GGCCTGTCGT CGTCGAGCAC CTATCTGAAG GGGGCTGTTG GCGGCACCTT GGTGGCCGGC
GGCGCGGGTT CCAGCTATTC TGGCGTCAAC GTCTCGATCA CCAGCGGCTT TACCGGTCTG
ACGGGCGCCA GCACGACCAG CGGCAACAAC CTGCTGCTCG CGATCGCCAA CGACTATGTC
GGCGACACCC AGGTGTCGAT CAGCAACAGC GGCTCCATCA GCGGCGTCAG CTATGCTCTC
TATGTCGCGG GCACAGGCAG CATCGGAACC TTCACCAACA GCGGCGTGCT GACCGGTTTG
AGCACGGGCG CCACCAACAA CGGCAGCATC GGGAGCCTGT CCAACAGTGG TTTGATCAGC
GGCGCGATGA AGGGGCTAGG TAACCAGGCC ACCATCGCCA ACCTGACCAA CACCTCGACC
GGTACGATCC TGGGTGGGAC CGCGGCGGGC CTGCAGAACG ACACCTGGCT CGGCACTCTG
ACCAATGCGG GCTATATTTC GTCGGGATCG GCCGGCTTCG CCAACTATGG CACGCTGGGA
TTGCTGACCA ACAGCGGCAG CATCAGCGGC GGTCCGCAGG CCCTCTATCT CGTGGGGACC
GTCGGCACGG TCAGCAACAG CGGTACGATC ACCTCGGGTC AGACCGGAAT CTATGTCGGC
GCCACAATGT CGGCGCTGGT CAATAGCGGC TTCATCGGCA ACACCAGCAA CGCCCTGAAT
CTGGGCGGCA CATTGGACAC CCTGATCAAC GCCGCCGGCG GCACCATCAG CGGCGGCAAT
GCGCTCTATA TGTCGGGCAG TCTCGGCGGC TTCGCCAACA GCGGCGTCGT CAAGGGCAAG
ATCGACAATC AGTCGGTCAA CAATCTTACG ATCACGGGGG GGGCGGGCGC CACCGTAGGC
ACGCTGACTG GTCTGACCGG TGTCGGCACG ATCAACAACA CCCTGTCCAA CCTGATCTTT
GCTTCGGGCA ACCTGCTGCT CAACGACCGG ATTGTCGCGA CCGGCCACAC CGTCAGCAAC
ACCGGCGCCA ACCTGTCGCT GACTAGCAAC ATCAGCATCA CGGGCGCCTA TAGCCAGTCG
GCCGGTACGC TCAACCTCAT TCCCGGCGCC AGCAGCCTGA TCGTCATCGG CGCGGCGAAC
ATCACGGGCG GGACCGTGGC GGCGAGCCTC TCCGCCACCG GCAACTACCT GGCGGGCGTC
CAGACCCTTG TCAGCGCCTC CAGCCTGAAC CTGACTGGCG CGACGGTTTC GATCAACAGC
CTCACCGGGC TCTACAAGTC GACCAGCACC GTGGGCAACA AGCTGCTGCT GTCGGTGAAC
AACGACTATA TCGGCGGCGC GCTGGCCTCG CTGACCAATG CGGCCTCGCT CACCAGCGCG
AGCACCGGCC TTTACGTGGC GACGACGGGC AGCATCGGCA CGCTGACCAA TACCGGCACC
TTGGGCGGCG GCCAGTTCGG GATCAACAAC CGCGGCACGA TCGGCACGCT GCTCAACGCC
GGCCGCGTCA CGGACAACAG TTTCACGGCG CTCTGGAACC AGGGCAGCAT CGGCCAACTG
ATCAACACCG GCACGATCAT CAACAGCAGC TGGGCGATCC TCAACACCGG CAACCTGGCG
ACCCTGACCA ACAGCGGCCT GATCAGCGGC GCCGGTAACG CCATCCAGAA CAACGGCGTC
ATCACGCTGG TCAACAACAG CGGCACGATG TCGGGCGGCA GCAACGCCGT CGCCGGCAAT
TTCGGCACGA TCGTCAACAG CGGCGTCATC CTCGGCGACA TCAACAAGTA TGGCGGCTCG
ATCGGCGCCA TCATCGGCGG GGCCAACGGC ACGATCGGCA CCCTGACCGG CTTCACCGTT
GGGACCCAGG GCACGATCGG CGCCTCCAGC AACCTGACCT TCGCCTCGGG CGCGCTGCTG
CTGAACGACT CGATCACCGC CGCCGGCATC GCCGCCAGCG CCCTGACCGT CAGCAACACC
GGCGCTGACA TCACGCTGGC GACGATCGTC AACGTCAACG GCAATTTCCG CCAGACCGCC
GGCATGCTGA ACCTCGGCGG CACCGGCAAG CTGGTCGTCA CCCAGGCCGC CAGCCTCACC
GGCGGCACGG TATCGACGAG CCTGTCGAGC ACCGGAAACT ATCTGGCGGG CAACACCGCA
GGCACTTTGG TTCAGGGTGG GGCCGGCTCC AGCTATGCCG GCGTCGCCGT GACCGGCGGG
ACCCCCGGCC TGGCGTTAAG CGCTGGGGCG AGCGGCAATA ATCTGGTGGT GACCGCCGAC
AACAACTATG TCGGCGCGAG TCTGGCGAGC CTGTCGAACA CGGGCAGCCT GACGGCAGAC
TATTCGGTCT ACGTGGCCTC GACCGGCAGC CTGGGCAGCC TGACCAACAG CGGCACGCTA
AGCGGATCGA TCGCGGCCCT CTACAATGCG GGAACGCTCG GCGCGATCGC CAACACGGGC
GTCATCGCGG GCAATATCGA GAATGTCGCG GCCCAGGACC TGATCTTCAC GGGTGGCGCT
GGAGCCAGTG TCGGAACGCT GACGGGCCTT GCGGGCGCGC CCGGCACGAT CACAAATACC
GCTGGCGATG TGGTGTTTGC GTCCGGTAAC CTGGTGCTCA ACGACCAGAT CGTCGCGACC
GGTCACACGG TCAGCAACAC CGGCGCAAAC CTGTCGCTGG TCAGCAATAT CAGCATTACG
GGCGACTATA GCCAGTCGGC GGGCACGCTC AACCTCATCC CCGGCGCCAG CAGCCTGATC
GTCAGCGGCG TGGCGAGCAT CACAGGCGGC ACGGTGGCCA CCAGTCTGCC GCTCAACGCC
AACTACCTGG CGGGGGGTGG CGCGGGCACG TTGGTTCAGG GCGGGGCTGG GTCCAGCTAT
GCCGGGGTGA CCTTATCGAT CGCAGCAACG CCTGGACTGG CGCTCACAGC AAGTGTCAGC
GGCAATGATC TGGTGATGAC CGTGGCCAAC AACTATGTTG GCTCGACCCT CGCGACCCTG
TCCAACACGG GCAGTCTGAC GGCGGACTAT CCGGTCTATC TGGCCGCGAC CAGCAACCTG
GGCAGTCTGA GCAACAGCGG CACGCTGAGC GGAGCCGTCG CCGCCCTCTA CAATGCGGGA
ACGCTCGGCG TGATCTCCAA CACGGGCGTG ATCGCGGGCA ATATCGTGAA TGTGGCGGCC
CAGGGCCTGA CCTTCACGGG CGCCGCCGCC TCCGGCGTCG GTACGCTGAC CGGCTTTGGC
GGCGGGCGCG GCACGATCAC GAATACGGGC GGCGACGTGG TGTTCGCGGC GGGCAATCTG
CTGCTCAATG ACGATATCGT GGTGACGGGC CATGGGGTGT CGAACACGGG CGCGGCTCTC
ACCCTCGCCA ACAGTGCGAA CATCACGGGC GACTATAGCC AGACCGCCGG ATCGCTGAAC
CTGGCGTCTG GCCAGAAGCT GATCGTTAGC GGCGCGGCCA GCCTCACCGG CGGCACGGTA
TCGACGAGCC TGTCGAGCAC CGGAAACTAT CTGGCGGGCA GTACGGCGGG CACGCTGGTT
CAGGGTGGGG CCGGCTCCAG CTATGCCGGG GTCGCCGTGA CCACCGGGAC AACCCCCGGC
CTGGCGCTGA GCGCTGGGAC GAGCGGCAAT AATCTCGTGG TGACCGCCGG CAACAACTAT
GTCGGCGCGA GTCTGGCGAG CCTGTCGAAC ACGGGCAGCC TGACGGCGGA CTATCCGGTC
TATGTGGCCT CGACCGGCAG CCTCGCCAGC CTGACCAACA GCGGTACGCT GAGCGGATCG
ATCGCGGCCC TCTACAATGC GGGAACGCTC GGCGTGATCG CCAACACGGG CGTGATCGCC
GGCAATATCG AGAATGTCGC GGCCCAGGAC CTGATCTTCA CGGGTGGCGC TGGTCCGAGC
ATCGGCGCGC CGACGACCTT TGCGGGCAAG AGCGGCTCGG TCGCGATGAC CAGCATGATC
GCGGGCGATA TCGAGACCAG TCCGATGCGG GCGCTGGCTG CTCCAAGCGC CGCCGCCTCT
GGCGTCGGTA CGCTGACCGG CTTTGGCGGC GGGCGCGGCA CGATCACGAA TACGGGCGGC
GACGTGGTGT TCGCGGCGGG CAATCTGCTG CTCAATGACG ATATCGTGGT GACGGGCCAT
GGGGTGTCGA ACACGGGCGC GGCTCTCACC CTCGCCAACA GTGCGAACAT CACGGGCGAC
TATAGCCAGA CCGCCGGATC GCTGAACCTG GCGTCTGGCC AGAAGCTGAT CGTTAGCGGC
GCGGCCAGCC TCACCGGCGG CACGGTGTCG ACGAGCTTGT CCGCCACGGC CAACTATCTG
GCGGGGAGCA CCGCAGGCAC GCTGGTTCAG GGTGGGGTCG GCTCCAGCTA TGCCGGGGTC
GTCGTGACCA CCGGGACAAC CCCCGGCCTG GCGCTGAGCG GCGGGACGAG CGGCGCCAAT
CTCGTGGTGA CCGCGGGCAA CAACTATGTC GGCGCGAGCC TGGCGGGCCT GTCGAACACG
GGCAGCCTGA CGGCGGATTA TCCGGTCTAT GTGGCCTCGA CCGGCAGTCT GGCCAGCCTG
AACAACAGCG GCACGCTGAG TGGAGCCGTC GCCGCCCTCT ACAATGCGGG AACGCTCGGC
ACGATCGCCA ACACGGGCGT GATCGCCGGC AATATCGAAA ACCTGTCGTC CACCGATCTC
CGCATCGCCG GCGGTTCCGG CGCCAGCTTC GGAACGCTCA CCGGCTATGC GGCGAGCAGT
CAGGGCGCCA TCAACAACCT CTCCAGTAAT GTGGTCCTGA CTGGCAACAT TCTGCTCAAC
GACACCTTGA ACCTCGGCGA CCACACGCTG ATCGCCAACG ACGCCACGCT GCAACTGAAC
AGCATAACCC GCGTGAACGG CAACTACAGC CAGTCGGGGG GGATGCTGCT GATCGGGGTG
ACGTCGACGA CGGCCTACGG ACAACTGCAG GTGAGCGGAA CGGCCAGCCT GACGGGAGCC
ACCGTGAAGC TGGTTGCGCT CGGTACTGGC ACGATCTCGG CCGGCGGCGC CTATACGGTG
GTCAAGGCGA CGGGCGGGCT GACCTACAGC AATCTGACGT CCTCCGTGTC GGGTCTTGAG
GGCGCATTTT CGTCGGTCGC CAGCGAAGGC GCGACAGGAC TCGTCCTGAC CATCGACAGG
GCCGTTCAGC CGACCCGCTT CGCCGCGGCG GGTGCGGCCG CGGGCGGGGC CGGCGTCGGG
ACCGGGATGG CGCTCGACGC GATTGCCGAC GCAGGCGGCG CCTTGGCGGC GCCCGTGATC
ACCGACGTTC TGATGCCCCT TGCCATCCTG TCATCGGCAG ACCAGCAGCG TGGCGTCATC
CAGCTCTCTC CGAGCCAGTT GGCGCCTCAA GTCGTCGCGG TCGCCATGTC GCCGGCGCTG
AACGCGATCA CCCGGCACCA GGACGCCCTG ATGGCCAATG CCGGCGGCCC CGCCTCCTGC
GCCCTGACGA GCGATGTCCC GAGCCAGAAC GGAGTCGTCT GGGGACAGTT CCTGTTCAAT
TCGTCCAAGC GCGACGTCGC GGCGGGAGCG ACCTCATACG ACGCGACCAA CTACGGCGTC
GTGGTCGGCG CCGATGTCGT CAACACGGCC AATCTGGTGG TCGGTGGAGC CCTCAACTGG
ACGAAGACCG CCGCCGACGG GCGCGCGGAG CTTGCCGGAA GCACGACGCG CATCAGCAGC
GTTCAGGCGA GCGGCTACTT CACCTGGCGG CCGGGTGACG CCAGATCGCC TGGCTTGTCG
ATGTCCGGTC AATTTGGCGT TGGCTACAAT GCCTATCGTC AGCATCGCCA GATCGACTTC
CTCGGCCGGA CAGCAAGCGC ATCCTTCGAC GGGCGCCAGT ATCTGGGCCA ATTGCGCGCC
GGCTATACGA TCCCTCTGGC TGACGCCATT TCGGTGACGC CGTTCGCCAG CGTTCGTGCG
GTCCATCTGA AGAACGACGG CTATACCGAG AGCGGGGCAG GCGCGGCCAA TCTGCAAGTG
GGCCGGCTGA CCGTGGATTC GGTCAGTCAT GAGATTGGCA TCCAGGGCGG CGCCACGCTC
GACACCGCCG CAGGCCGCAT CTCGCCTTCG CTGAAGATCG GATGGGTGCA TAATTACGAT
AACGGACCGA TCCCGCTAAA CGCCGCGCTT GGCGGCGTCG CATTCTCGAG CAGCTCAGCG
CGCGTGGCTC GCGACGGGGC GACGGTCAAT GCCGGCGTCT CCTTCGTGCA AAACGAACGC
CTCAGGGTTG GCGTGCAATA TGACGGCGAA CTGCGGAGGG CCTTCCAGAG CCACTCGGCC
ACCGTCGAGC TGAGATACAG GTTCTGA
 
Protein sequence
MMTRSSSKRT ILAGSSLLVM AIAAAQPALA QDTTISSSIT TGSTWNGTTA PGSNFTVTSS 
GAINTADPAA ITVAGTVGAL LGTLLNSGQI LDAAASGYGI SIGSVSVGVV NNSGLISGAS
AIYNTGTLGA VSNTGTVSGT ARGIYLSSTG SIGSLNNSGL IAAGTLGSMA AGGIANSGRI
GTIVNNGTII SAHSSGIGNN QSTGSIGQLV NNGLISGVTG GIWNQGTFSQ ITNTGQIVGT
QAGIFQVAGV LSALDNSGTI SGGSYGVSIG GGNVGTLVNS GLLSGPIALT IASGATLGGV
ANSGVIAGNI VNRSVNVLTI TGGTGTQIGT LTGSTLTNQG SIINTVSNLV FGSGNLLLND
NINVTGRTVS NVGAALSLNS IVNVAGAFSQ TAGSLQIDPM TGGLIVSGGA TFSGGTVKST
FQSTGNYLAG SYTLLNASSL NLTGATVSIN SLTGLYKSTS TVGNKLLLSV NNDYIGGALA
SLTNAAALTS ASTGLYVATT GSIGTLTNTG TLGGGQFGIN NRGTIGTLLN AGRVTDNSFT
ALWNQGSIGQ LINTGTIINS SWAILNTGNL ATLTNSGLIS GAGNAIQNSG VITLVNNSGT
MSGGSNAIAG NFGTIVNSGV ILGDINKYGG SIGAIIGGAN GTIGTLTGFT VGTQGTIGAS
SNLTFASGAL LLNDSITAVG IAASALTVSN TGADITLATI VNVNGNFRQT AGTLTLGGTG
KLVVTQAASL TGGAITTSTG GLSSSSTYLK GAVGGTLVAG GAGSSYSGVN VSITSGFTGL
TGASTTSGNN LLLAIANDYV GDTQVSISNS GSISGVSYAL YVAGTGSIGT FTNSGVLTGL
STGATNNGSI GSLSNSGLIS GAMKGLGNQA TIANLTNTST GTILGGTAAG LQNDTWLGTL
TNAGYISSGS AGFANYGTLG LLTNSGSISG GPQALYLVGT VGTVSNSGTI TSGQTGIYVG
ATMSALVNSG FIGNTSNALN LGGTLDTLIN AAGGTISGGN ALYMSGSLGG FANSGVVKGK
IDNQSVNNLT ITGGAGATVG TLTGLTGVGT INNTLSNLIF ASGNLLLNDR IVATGHTVSN
TGANLSLTSN ISITGAYSQS AGTLNLIPGA SSLIVIGAAN ITGGTVAASL SATGNYLAGV
QTLVSASSLN LTGATVSINS LTGLYKSTST VGNKLLLSVN NDYIGGALAS LTNAASLTSA
STGLYVATTG SIGTLTNTGT LGGGQFGINN RGTIGTLLNA GRVTDNSFTA LWNQGSIGQL
INTGTIINSS WAILNTGNLA TLTNSGLISG AGNAIQNNGV ITLVNNSGTM SGGSNAVAGN
FGTIVNSGVI LGDINKYGGS IGAIIGGANG TIGTLTGFTV GTQGTIGASS NLTFASGALL
LNDSITAAGI AASALTVSNT GADITLATIV NVNGNFRQTA GMLNLGGTGK LVVTQAASLT
GGTVSTSLSS TGNYLAGNTA GTLVQGGAGS SYAGVAVTGG TPGLALSAGA SGNNLVVTAD
NNYVGASLAS LSNTGSLTAD YSVYVASTGS LGSLTNSGTL SGSIAALYNA GTLGAIANTG
VIAGNIENVA AQDLIFTGGA GASVGTLTGL AGAPGTITNT AGDVVFASGN LVLNDQIVAT
GHTVSNTGAN LSLVSNISIT GDYSQSAGTL NLIPGASSLI VSGVASITGG TVATSLPLNA
NYLAGGGAGT LVQGGAGSSY AGVTLSIAAT PGLALTASVS GNDLVMTVAN NYVGSTLATL
SNTGSLTADY PVYLAATSNL GSLSNSGTLS GAVAALYNAG TLGVISNTGV IAGNIVNVAA
QGLTFTGAAA SGVGTLTGFG GGRGTITNTG GDVVFAAGNL LLNDDIVVTG HGVSNTGAAL
TLANSANITG DYSQTAGSLN LASGQKLIVS GAASLTGGTV STSLSSTGNY LAGSTAGTLV
QGGAGSSYAG VAVTTGTTPG LALSAGTSGN NLVVTAGNNY VGASLASLSN TGSLTADYPV
YVASTGSLAS LTNSGTLSGS IAALYNAGTL GVIANTGVIA GNIENVAAQD LIFTGGAGPS
IGAPTTFAGK SGSVAMTSMI AGDIETSPMR ALAAPSAAAS GVGTLTGFGG GRGTITNTGG
DVVFAAGNLL LNDDIVVTGH GVSNTGAALT LANSANITGD YSQTAGSLNL ASGQKLIVSG
AASLTGGTVS TSLSATANYL AGSTAGTLVQ GGVGSSYAGV VVTTGTTPGL ALSGGTSGAN
LVVTAGNNYV GASLAGLSNT GSLTADYPVY VASTGSLASL NNSGTLSGAV AALYNAGTLG
TIANTGVIAG NIENLSSTDL RIAGGSGASF GTLTGYAASS QGAINNLSSN VVLTGNILLN
DTLNLGDHTL IANDATLQLN SITRVNGNYS QSGGMLLIGV TSTTAYGQLQ VSGTASLTGA
TVKLVALGTG TISAGGAYTV VKATGGLTYS NLTSSVSGLE GAFSSVASEG ATGLVLTIDR
AVQPTRFAAA GAAAGGAGVG TGMALDAIAD AGGALAAPVI TDVLMPLAIL SSADQQRGVI
QLSPSQLAPQ VVAVAMSPAL NAITRHQDAL MANAGGPASC ALTSDVPSQN GVVWGQFLFN
SSKRDVAAGA TSYDATNYGV VVGADVVNTA NLVVGGALNW TKTAADGRAE LAGSTTRISS
VQASGYFTWR PGDARSPGLS MSGQFGVGYN AYRQHRQIDF LGRTASASFD GRQYLGQLRA
GYTIPLADAI SVTPFASVRA VHLKNDGYTE SGAGAANLQV GRLTVDSVSH EIGIQGGATL
DTAAGRISPS LKIGWVHNYD NGPIPLNAAL GGVAFSSSSA RVARDGATVN AGVSFVQNER
LRVGVQYDGE LRRAFQSHSA TVELRYRF