Gene VEA_000324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVEA_000324 
Symbol 
ID8558629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio sp. Ex25 
KingdomBacteria 
Replicon accessionNC_013457 
Strand
Start bp353000 
End bp368371 
Gene Length15372 bp 
Protein Length5123 aa 
Translation table11 
GC content47% 
IMG OID646407989 
Productautotransporter adhesin 
Protein accessionYP_003287477 
Protein GI262395624 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat
[TIGR03660] T1SS-143 repeat domain
[TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGC AAACCGTGCC GCAGAACGCG ATTGTAAATG CCGTCAAAGG TGAAGTCTTG 
GTTTTAGGTC TGGATGGAAA AGCCAGGTCT ATAAAAGCGG GGGATGAGTT AATATCTGGC
GAAGTGATCA TTACTGAAAA TAACGCTTCT CTTGATGTAC AGATACATAA TGAGCTGTAT
CTGGTTGATG CTAATTGTGT CGCGTGCCTT CCAGAACCTT CTTTTGAACA ACCAGAAACC
TTGCTGCAAT CTCCAGTTAA TGGCCAGGTT ACTTTCGATC CAACAGCTAT CGAAGGTGCT
AACTTTGACG CTAATGATGT CGCGGCCATC CAACAGGCCA TTTTAGATGG TGCTGACCCG
ACTGCTATTT TGGAAGCAAC GGCTGCTGGT GCTGGCGCTG AAGGCTCAGC CAATGCTGGC
TATGTGACCG TTGAATATAA TAATCCGGAA GTACTTGCCT CCACCTTTTT TGAGACGTCA
GCGACTCGTA GCGGTGTAGA AGAACGCGAT GAAGCTGATG ATTTAGATGT CACTATTTTT
GCCGATGGCG GACAATCACT GCAATCTAGC GTAACAGAAG GCTCAATTTC TCTATCAAGC
TATCCGCAAA CCATTTCTTC TAGTGTGCTG ATTGAAGCAG GAGATCTCGC ACTTGATACC
GCTTCATTTG TACCTGAAAC GTCCTCATTA GAGTCTCTGT TAACCGAGCT AAATAGTGAT
ATTACCTCTG CTGGTCAGCC AGTGGCGTTC GTTTATGACG AAGCGCAAAA TGCTATTATC
GGCACTCAAG AAGGTAACGA AGTGCTTCGT ATAGAGATCG AAGCGACATC TTTAGGCCGT
GATTTAGAGT TAGAGGTCGT TACCACGATT AGCCAGGGCA TTGACCATGT CGCGTCTGTT
TCTGATGGCC AGGTATCGAT CGTCAGTGAT CAAATCAATA TCGCTTTTGA CATTACAGGC
GCAGATATTG GTGGTAACAG CATTCAAGCA CCTATCGATT TCACGACAAC TGTTATTGAT
GGCGATGATC CGGCGCCGCA AAACGTTACT TTCGAAAACG TTGAATCTTC TTCTACTCCG
ATTACTGGCA CTTTTGTTGA TATTGGTAGT GATCAATTGG CAACGGTCAC ATTTAATCAA
GAAGGACTTA GTCAATTTGA TGGCCTATTG AGTGATAATC AGGCAACAGA GGCGGTACTC
TCTGAAGACG GCAGCACTAT CACCCTGTCC ATTGCAGGCT CTAATGAAAC CGTTTTAACC
ATCAGTCTGA ATACTGATGG TACCTACCAA TTTGAGCAAT TTAAGCCGTT AGAACAAACA
AACGGTGAAG ATACGATTGA GCTCTCTTTG CCAACCACAA TCGTCGATTT CGACCAAGAT
ACGGCAAGTA ACACCTTTAC GATGACCATC CTTGATGGCG ATAACCCGGT GATTGAAAAC
GTGACGGGAT TGTCTTTAGA CGAGGCGGGG GTTGACCAGG GTTCACAGGA AGGCGCGGTA
GTAACAAGTG GCACTGGGTC GATCACGACA GCGGTCGGTA GTGACATCAT TGATCACTAT
GAGTTGGAGC CTACAGAGTT TAATGTTGGT GGAGAACTCC AATCTCAAGG GCAGGTTGTT
CAACTGGAAC TAGCGTCAGA GAGCAACGGC GTGCGCACTT ACGAAGGCTT CATCGAGCTG
GATGGCGTGC GTATCACAGT ATTTGATGTG GCGATTGATG CGCCTGTATT GGGCGAATAC
CAATTCAACT TGTATGAACA ACTCGACCAT ACGGGTGTTA ACGATGACTC GATTACTTTC
TCATTGCCAG TTTACGCGGT AGATGCAGAT GGCGACCGCT CCTCGATTAC CGACGGTTCG
AACACACCAG AAGCGGCGCA AATTGTCATT CAAGTTCAAG ATGACGTTCC TTCGATTGAT
GGCGTTGATG CGCTAGCCGT CGATGAAGAC GATTTATCAA GCATCGGCTC CGATCAAAGC
GACTCTGTGC TCGCGCAAGG TAGCTTCACG ACGACTCAGG GCTCAGACCG TGTTGTAAGT
TATCAATTAG AGAGCGGCAC TGATCCACTC AATGGGTTGG AATCACAGGG TCGATCTATC
TCTCTGACGG AAACTGTCAA CTCAGATGGC AGCTTTACCT ATTCTGCAAC GGCAGAAGGG
GACGATATCT TCACCTTACA AGTGAACCCT GACGGTTCTT ATTCATTTGC TCTGGAAGGT
CCAATCGATC ATGCTGTGGG TAGCGATAGC TTAACTCTAG ACTTCACTAT TGTCGCAACG
GACTTTGACG GTGACACCAG TTCACTTGTT CTACCGGTAA CCATTACAGA TGATGTGCCA
ACCATTAACG ACGTTGTTGC TCTTACCGTC GATGAAGATG ACTTAAGTTC GGTAGGCTCT
GACCAATCAC AGCCAACGCT GGTTGAAGGT CAGTTCACGA CGACTCAGGG CTCAGATGGC
GTCGTACAGT ATCAGTTGGA TGTAAAAGCT GATCCTTTAA ATGGCCTGCA ATCTCAAGGT
CAAACCGTCT CCATTGCCGA AACGCAAAAT GCAGACGGCA GTTACACCTA CAGCGCAACG
GCAAATGGCA GTGCAGTTTT CACATTAATT TTGAATACCG ACGGCTCGTA CAGTTTTGAG
TTGCAAGGTC CAATCGACCA TGCGGCAAAT AGCGACAGTT TAACTCTTGA TTTTAGCGTT
ATTGCGACAG ACTTCGATGG TGATACAAGC CAAATCGTAC TGCCTGTTAC CATCGTGGAT
GATAAACCGA CCATTACCGA TGTTGATGCG ATTACTGTTG ATGAAGATGA TTTAGGCACG
ATTGGCTCAG ATCAGACAGA CCCGATTTCA ATTGATGGCA ACTTTACTAC CACGCAAGGC
TCGGACCGAG TGGTGAGCTA TCAGCTCGAT GCTTCCGCAA CACCAGTTAA TGGCCTTACA
TCACGAGGCG TGGCGGTCAC GCTGACTGAA ACCGCAAATG GCGACGGCAG CTTTACCTAC
AAAGCAACGG CAGGCACAGA AGCAGTCTTT ACGCTGACGG TGAATACTGA TGGTTCTTAC
AACTTCACGC TAGAAGGGCC AATTGATCAT GCTGTCGATA GCGACGAGTT GACGCTCAAC
TTCCCAATCA TCGCGACCGA CTTTGATGGT GATACCACAA ACGCCACGAT CCCTGTGACG
ATCGTGGACG ATAAGCCAGT TATTACTGAC GTAGATGCCA TTACGGTTGA TGAAGATGAT
CTAGCGTCAA TTGGTTCAGA CCAAAGTAAT CCAATTTCTA TTGATGGCAG CTTCACCACC
ACGCAAGGCT CAGACCGTGT GGTGAGCTAC CAACTTGATA CTTCAGCAAC ACCAGTGGAC
GGTTTAACAT CGCAAGGTGT GGCTGTCACA CTGACAGAAA CCGCAAACGC TGACGGCAGT
TTCACTTATG AAGCGACAGC GGGTGGCAAC CCAGTATTTA CTCTGACAGT AGATACTGAT
GGCTCTTACA ACTTCACGCT AGAAGGGCCA ATTGATCACG CGGTTGATAG TGACGAGCTG
ACACTGAACT TCCCAATTAC CGCAACCGAC TTTGATGGCG ACACCGTTAC TGAAACCATT
CCAGTCACGA TTGTTGATGA TGTACCGATT ATTACAGCGG TTGATGCGCT GAATGTTGAT
GAGGATGACC TTAACGTTAT AGGTTCCGAC CCAGGCGGCG ATCTATTCGT TAAAGGCGCG
TTTACCACCA CACAAGGCTC TGACCGAGTG GTGAGCTACC AACTGGATTC AACCTCTGAT
CCTGTCGCGG GCCTGATGTC TCAAGGTGAG GCGATTACCT TGGTCGAAAC GGCTAATGGT
GACGGCAGCT TTACCTATGT CGCGACCGCA GATGGTAATC CTATTTTTAC CCTAAACGTC
GCAACGGATG GTACGTACGA CTTTACGCTT CAAGGACCAA TCGACCATGC GGCCAACAGC
GACAGCCTAA CCATTGATTT CCCGATTATT GCCACCGATT TTGATGGTGA TACGGCCACG
GCAACTATTC CAGTCACGAT CACGGATGAT GCACCAGTTA TCGATAACGT CGTGCCGCTG
GCTGTGGATG AAGATGATCT CTCAGGTATT GGCTCAGACC AAAGCGACGC TGTTTATGTA
GAAGGTGCGT TCACGACCAC TCAAGGGTCA GACCGTGTGG TGAGCTACCA ACTTGATTCA
GCTTCTGATC CTGTTTCGGG GTTAACCTCC CAAGGTGAGC CTGTTACTTT GGTTGAGACT
GCGAATGCCG ATGGCAGCTT CACTTACGTG GCTACCGCGG ATGGCAATCC GGTCTTCACC
ATGAATGTGA ATGCTGATGG CACATACAAC TTCCGTCTGG AAGGGCCAGT CGATCACGCG
TTAAACAGCG ATGAATTAGT TCTGAACTTC CCAATCATTG CAACAGACTT TGATGGCGAT
ACGACGACGG CGACCATTCC AGTCACGATT AATGATGACG TTCCAACCAT TGATAACGTT
GTTCCACTGA CAGTAGACGA AGATGATCTC GCCTCAATTG GCTCAGACCA AAACAATGAT
GCGTTTATGT CGGGGTCTTT CACCACAACT GAAGGCTCAG ACAGCGTTGT TAAATACCAG
CTTGATGCAA CAGCAGATCC AGTTGCAGGG TTGACCTCTC ATGGTGAGCC TGTCGCGCTT
GCTGAAACTG CTAACGGCGA CGGTAGTTTT ACTTACACAG CCACAGCAGA TGGCAATGCC
GTATTTGAAT TAGTGCTGAA GCCAGATGGC AGCTACACCT TTACTCTGCT AGGTCCACTA
GACCATGCGA TGAATAGCGA CAGCCTACAA ATCGATTTCC CGATTATCGC AACAGACTTT
GATGGCGACA CTTCAACTAA AATACTGCCT GTCACTATTG TTGATGATCA GCCAAGCATC
ATTAATGTTG ATGCGATCAG CGTGGATGAA GATGACTTGG CTACGATTGG CTCTGATCAA
AACGAGTCCG TGTCGATTGA TGGTCACTTT GTCACGATGG GTTCTGATCA TGTTGTGCGT
TATCAGTTAG ACGCGTCCTC TAACCCTATT AATGGCTTAA CCTCACACGG GGTAGTGGTG
ACAATGACGG AAAGCGCGAA TGCGGATGGC AGTTTCACCT ATACCGCTAC CGCAGGATCA
GAAGCCGTAT TTACCTTGAC GGTGAACAGC GACGGCAGTT ACAACTTCAC GCTTGAAGGT
CCAATTGACC ATGCAACAGG TAGTGATGAG CTGACGCTGA ACTTCCCAAT CATCGCGACG
GATTTTGATG GTGATACCAC CACAGAAACG ATCCCAGTCA CCATCGTCGA TGACAAGCCG
ACTATTACTG ATGTCGATGC GATTACCGTT GATGAAGATG ATTTAGCCAC GATTGGCTCG
GATCAAAACG ATCCAATATC AATCGATGGC AACTTCACCA CAACGCAAGG CTCTGACCGA
GTGGTAAGCT ATCAGCTCGA CGCTTCTGCA ACACCAGTGG ATGGTCTGAC ATCACAAGGC
GTGGCGGTCA CGCTGACTGA AACCGCAAAT GGCGACGGCA GCTTTACCTA CAAAGCAACG
GCAGGCACAG AAGCAGTCTT TACGCTGACG GTGAATACCG ATGGTAGTTA CAACTTTACG
CTTGAAGGTC CAATCGACCA CTCGGTAGAT AGCGATGAGT TGACACTCAA CTTCCCAATA
ATCGCAACAG ATTTTGATGG CGACACGTCA GTTGAGACAA TTCCTGTCAA GATTGTTGAT
GACAAGCCAA CACTTGGTGG CATCGAAGCA ACAAGTGTGC AAACGGTTGA TGAGGACGAT
ATCCCAACGG TAGGATCTGA TGGTACGCAG GCGAACAGTA TTGCCGGTAA CTTCATCGCG
ACTGATGGCT CTGATGGCAT TGTAGAGTAT GGTGTTTCTG ACCTAACGAC GCCAGTACAA
GGCCTGACCT CTGGTGGTCA ATCACTGGTC ATGGTTGAAG TATCAAACGC AGGTGGCGTT
TCAGTTTACG AAGCTCGTAT TGATGGCACG ACGACACCAG TATTCCGAGT GACCCTCGAT
GCCTCTGATG ATAGCTACAC GTTCGATTTG CTCGCTCCTC TCGATCATCC AAATGCTGAT
GGCCAAAATG AACTGGTCAT CAATTTGCCG ATAAACGCGA CGGACTTTGA TGGAGACGAG
TCGAACGACA TCACCTTACC AATTATGGTT GTGGATGATG TTCCGACCAT TGATGGTTTA
CTAGCAGGCA GCGAACAAAC CGTTGATGAA GATGACCTAC CAGCAGGTAC TGATGCTGCA
AGTGCAGAAG ATACTGTTAT TTCAGGTACT TTTGATATCA CTGAAGGTGC TGACCAAGTT
GCTTCTATTC AGCTGAGTGA CCTGACCACG CCTGTTGCGT CACTGACTTC AGATGGTGAA
GCTATCACCT TAGTTCTGTC TTCTAGCGCA AACGGTGTGA ACGTATACCA AGGTGTTGCT
GGCAATCCTG CTGAAGTGGT TTTCGAACTG ACGTTGGATG CAACGAACAA CACCTACGAG
TTCGATTTAC AGAAGCCGCT TGATCATCCA GATGGTAATC AGCAAAACGA GATTGTTATT
AATCTCCCTG TGACGGCGAC AGATAATGAT GGTGATACTT CACCAGCCTT CACATTGCCA
ATTACCGTTG TGGACGATGT TCCAGTTGTT ACCAATATTG AAAGTCTACG TGTCGACGAA
GATGACTTAC CTCTTGGTTC TGATGGCACT AAAGAGCCAT TAACGGTATC GGGTGAATTT
GAAGTAACCA GCGCAGATGG TATTGATGCT TTCGAATTGG ATCTGAGCTC AAATCCTATC
CCGAATCTGA AATCGGGTGG GGAAGACGTC ACGCTTTCTC AAGACGTTAG TGCTTCAACT
GCTGACGCGC TTGTTTACGT GGGACAAACT CCGAGCGGAG TGACTGTTTT TACGCTGACA
CTTCATCAAG ATGGTAAATA CGACTTTGAG TTATCTGGTC AGTTAGATCA CGCGGTTAAC
TCTGATGAGA TTTTGTTGAA TCTACCTGTG AAGATTACAG ATGGCGACAA CGATACCATC
ACTGCGACCT TGCCAGTGAC CATCGTAGAT GACAAGCCAA CCATTGATGC TATTAGCTCT
GGTAGTACCT TGTCTGTGGA TGAGGATGAT ATTCCAAGCC AAGGTTCGGA CGACACGCCA
GAATCCAATA TCATTGGCGG CAACTTTGAC GTTACGGATG GTGCTGATTC TATCGTTAGC
TTCCAGTTGG ATAGCTTGAC GTCGCCTGTT TCTGGACTTA CCTCAGGTGG TCAACCGCTT
GAATTGGTTG AGTTCTCAAA TTCAAATGGT GTGATTGAGT ACCGAGCCTA TGTACAAGGT
ACGACGGATA CCGTCTTTAA ACTGACGCTA AATGGTTCAG CAGACAGCTA TGAATTTGAA
CTGTTGGGTG CACTTGATCA TCCTGCGGGT AACGATGAAA ACTCGTTAGT CATCAACTTC
CCAGTCAACG CGACCGATTT CGACGGCGAT GTGTCCAATA ACATCACCTT GCCGATCACC
GTGGTGGACG ATGTTCCAGC TATTGTGAAA GTGACAGACT CTAGCCAACA AACCGTTGAT
GAAGATGATC TTGCCGGTGG CTCTGATACG ACAAGTAATG ATTCTACGGT ATTAAACGGC
GGATTTGAGG TCGTTGCTGG TGCAGACAAG ATTGTCAGCT ACCAAGTTTC TGACCTCGAT
GCAGTCGTGT CAGGCTTAAC GTCGAATGGC AGCAGCATCG AGTTGAATCT AGTAGGCACC
AATGGTGGAG TAACCAGTTA CGAAGCGGTG ATTACTGGCA CATCAACTAA GATCTTCACG
CTGTCTCTTG ACGCGAATAA CGACAGTTAC CAGTTTGAAC TGTTGGGCCC AGTCGACCAT
GACGCTGTTC AAGGTGAAAA CAACCTTGTC ATTGATATCC CAATTACAGT CACAGATTTT
GATGGTGACA CGTCATCATC ACTGAACCTT CCAATCACCA TCGTTGATGA CATTCCCGAA
ATTAAATCAG CAGATGCTTT GGCTGTTGAC GAGGATGACC TCGCTAATGG CTCTCAGGCA
ACCAACAAAG ACAGCCTTGA AGCAACGGGT AATTTCGATA CGGTGGAAGG GGCAGATACC
GTTGTCTCTT ACCAGTTGGA TCTTACTTCT AACCCGATCC CGGGAGTCAC TTCAGGTGGT
CTTGCGGTGA CGTTAGTTCA AACCGCGGTA AGCAACAACA ACTTTACTTA TCAAGGTCAA
ACACCTGATG GTAACTCAGT GTTCACGCTA GTTTTGAGCG CTGACGGCTC TTACAAGTTT
ACACTCGAGG GGGCGCTAGA CCACTCAACC CAAGGCGAAG ATACTCTCAT ACTCGATCTT
CCAGTCTTTG CTACAGATGT CGATGGCGAT ACGGCAGGTA TTAACTTACC AGTGACGATT
ACCGATGATG TTCCAACCTT GTACGATGCA TCTATTTTGC GTGTTGAGGG ACAAGGCAGC
CGCACGGTTC ACTTATTCCA AGATCCTGTA GAAGCTGATG ACGATCTTGG TGCTGACGGC
GCACAAGTGA CTTCATTTTC GGCTGATGAT TCAGGCATTT ACTTTAAGCA AAACGGTGTT
GATTCCGATT CTGTCGATTT GAACGGCAGC GACCAAGTCG TATTTGTTCA CAAGGTTCTT
GACGGAGTGG ATACTGAAAT CGGTCGTTTG GTTGTTCGAA CTGATGGCAG TGTGTCATTC
CGTCCTAATG ACGATCTTGA CCATACCGAA ACCGACTCGA TTGATTTCAC CATCAACGTT
GTTGCAACCG ATGGTGATGG CGATATCGCA GATGCCGATG TGGATATCTC GATAAGAGAC
CGCAATGCGC AAATTGACAC GTCTACCGTT AATGCGTTCG AGGACCAAGG GCGTGATGGT
GTCGTTGTGG GCGTTGACTC TGCAAATACG CAGGACAACT TGTCGACGCT GGATGTGACG
CCGGCGAAAG TCGACTTGGT TATCAATTTA CACGATATCG ACCGTAACGA GAGCTTAGGT
GACATCACGA TTCGAGATGC AAGCACGCAC AACGGTACTT TCTATTACCG AGATGGCAGC
GGTAACTTCA TTGAACTGAC ACCAGTAGGC GATACGGTTG TTCTCGATGC GTCGAATGTC
GAGCAATCAT TTAGTGGCGA GCTTGTCTCG TTAGATAACT TGTACTTCGT ACCAGATCGT
CATACCTCGA CCGATGCTTC TGGTATCGAT CCTCGTATCC GTGTGGAAAT TCTTAACAAT
GGTACGCCAG ATCATACGAT TAATGGTCGT TTAGACATTG AAGTTGCTGC CGTTGCAGAT
ATTGCGACTT GGACGACGAG CAGCGAGTTT AACTATTCTG TCGATGAAGA CGGAAACAAC
GTCGCACTTA ATATCACGGC GGAAACTCAG GATACTAGTA ATCCTGAAGA TATAGTGTAC
GAGCTCGTCT TTACACAAGG GGAAGGTAAC GCAGAGTTAG TATATTCGGA TGGTAGTGCG
ATTCCACAAA CTGGTGGTGT TTATCTCGTA GATGCCAGCC GAATTGGTGA CGTTCAAATT
GACCCAATCG ATAATTTCTC GGGCGAAATC AAGATTGATG TAACCGCGAT TACAACAGAA
AACAATAATC CGTTATCAGG GAAGGAAACG GCACGTTCAG AGACAGAAAC TATCATCATT
GATGTTAACC CAATCGCGGA CCCAGGTAGT TTTACAGTCA ACCGAATCAA TGTCTTTGAG
GATAACGCTC GAACTCAAGA CACCGTCAAT CCTGTCACGG ATCATGATCC ACTTCAGTTA
AGTGAAGTGA TTACCATGAA GCCGTCTGCG GATTTAGATG GTTCTGAAGC ACTGTTTGTG
CGTATTTCTG ACTTCTCAAT TGATGGCGTT ACGTTAGTGT GGCTTGATTC AGCAAACCCA
AGCCAAATAG TGGAAGTGAC CGATTCAAGC GGCAATGTAC TCTATTACGA AGTGCCAGAA
TCTGAGTTAG CCAACGTCGA AGTGCTTCCT CCGTTGCACA GCAATGATGA CTTTACTTTC
AACGTTGAAG GTATCGTAAA AGATACGGCA TCGTTATCTA GTGGGGTTGC CGAGGATATG
TTGTCACTAG GCAGCAAAAC AGTGATCGTC GGTGTGAAAG GCGTAGCTGA CATACCATTT
ATTGAGCTTA ATGATAAATC AGGGATCTGG CATGAGTTTA ATGACGGCAA TGTGAGAGGT
ATTGAGACTA ATATCGATGA GAATGGTCAA GTAGAGTTGG GCTTTAGTGT CATCTCTGGC
GAGTTACCAG ATAACCCGAA TGATCACTCA GAGTCGGTTA CGGTTCTGCT GTCCAATATT
CCGGCAGGTG TCGAGGTATT TGATAGCGAT GGTGCATCCG TCGATCTGAC CTTTGTAGGC
TATGACGCGC AAAATCAGCC AATCTATGAA GCCAATATCA CCACGGCGAA TATTAATTCT
GGAATTGTTA TTAAACCGGA AGCGTCATCG ACGGAGAACA TTCATATTAC AGCGACGACG
ATCGTCACAG AGAATGATGG TCACAGTCGA ACCTCATCGG GTGAGATCCG AATTATTGTC
GCTCCTGTCA TTGATGCGCA AGATAACTAC ACGGTTCAAT CAGAAGGTGA TGAAGATACA
CGCTTTAATA TTGATTGGAA GCCAACGTTG GCTCAAAGCC CTGATACAGA CGAGTTCTTT
AGTGATGTGA CGATTTCTGG ATTTCCTCCA GGCAGCACGG TTTATGTCGA TGGCGTAGCA
CAAACATTGG TTTCTGGAAC ATTAACACTT TCGCCACTAG CGAACGAGTC GGAGCAAGAC
TTTTCTGCTC GAATCTCTCA AAGCGGGTAT GTCCAGGTGC AATTAGAGCA AGACTCTAGT
ACTGACTTTG ATTTGAATAC TACGGTGACG GTGAAAGAGA TTGACCACGA ATATGTCGAT
GCTTCTAATC CAGGGCAGGG AATTGCAGAA AAAACCATCA ATGGCATGGT GCATGTCCAA
GTCAATCCAA TCGTTGAGCC AGAAAATACA ACAGGATCGC TAGACGCCCA GACACGTTTG
CTTGTTACTG AAAGCACTGG CGTTGTCACT GACATCGTGA AGTCAGATGG CCAAGGGAAC
ATTGATTTTA CGATCAATAC TTCAACGGGT GGGGAGTCTG GTGCGAATAT CATCAAATAC
CAAGAGTTTG ACGCCTCTTC AGATGAAGTG GTGACTCAGC TTGTTGTTCA GTTCCACAAT
GTCGACCCTG AGATTCTCAA TCAATTGGTC ATTGTTGGTG CATTGAACGA AGGCGGCGGT
CGATGGACAG TCATTGACGA AGAAAACTTC TCGATTAAAG CGCCTTCGGG TCTTGATTTA
ACGCCAAACG ACGACAGCGA TGATGGTGAC AACGGCGGTC TATCTCAAAT TGGGTTAACC
ATTTATGCCG AGGTCAACGA TTTAGGTGAA GATGCCGTTG AAAAAGACGC AACAGTAGTA
AGAGAAACTG ACGTGACACT TGAGTTTCCG ACGGTTCTCA CGCCGCAAAC GAGTGTTGCC
GCTGAAATTC AGGTGGCTGA TGACGTCCAA ATTGAAGCAT CGGAAGATAA TGCGATCGAC
CTTGGCACTC AATTGACGTC GAAAGTCGAT GCGATTAATG CGGATGGTGT GGAAGATGTT
TTGACTGTTG TTATTGATCC ATCTGCACCC GGTATTCCTC CAGGCTTGGT GATAACAGGC
ACTGATATCG ATTTTGTCAA TGGTAAATAC GTTTTCCAAG CAGATATTGA TGCTAGTGGC
AATATCACGG GTTTGGACGG TTTAACGATG CACTTGGCAG AAGATTACGC TGGTGATTTC
GAATTGCCTG TGAGGTTTGT CACCAAAGAC ACAGAGTCTG GCGACGAAAA AGAAAACAAC
GTGCGAATTC CAGTTCAGGT TTTACCGATT GCAGATGTTC CGAGTTCGGC GGGTGATCAG
CCATTAGATG GTGATGTTAC ACCAAATGTG ACAGTGGATA TCACAGGTAC TCTCGGTTTA
GACGCGAACA AACAACCCGT TGATGACCTA AATAACGATG TACCAACGGC GGATGGTGTG
GGTTATGAAG ATGGTTTAAT CCAGCTTAAT CTGAATGTCG ATTTTGCCGA CCGTTATAAC
AATATCCAAG GTGGTCAGGA AACGCTAACG AATATTAAGT TAGCACTCGA TGACACAACA
CTGGGGGAGT TTGTTGACGC AAATGGCAAT AGCTTAGGGA CAAGTATTGA GTTTAATGAA
GCAGAGATTT TAGCGGGTGC GCTTGATAAT GTTCTGTTTA AGCCGAAAGA AAATTATCCC
GTAGGTGGGG GACAAAACAC CGTCAAGATT AATATCGAAG GAGAGATCAC GGACGAGGCT
GTATTTGATC AGTCTATCCT AAATAACCCT GGCGATAATA TCGATATAAG AACTTTTACT
GATGACGTGA CATTCGAAGT TACGCCAGTG GTGGACGATA TTACGATTAC AGGTGCCGAT
CCTACGAAGC CAATTACCGT GGTTGGTGAC GAAGATACTT TAATTTCGTT GAATCAATCA
GGCTCAGGGG TTTCGATCAG CTTAAACGAT AATGATGGCT CTGAAAGTTT CGTTTCACTC
AAGTTAACAG GGATCCCTGA TGATTTTGTC GTGCAATCCA ATTCCTCTGA TTATGTTGTT
AAGAACAACG GCGGAGGAGA ATGGAGTATT CAGCTAAAAG ACCTTACTCA GACATCCGTT
GATTTGAGTG ACATCCAAAT CAAACCGCCG AAAAACTTCA GTGGCGAGGC AGAGATTGGC
ATTACCGTGT TCATCCAAGA AGAATTGTTA CAAGTACCAA CAGAGCGAAC TAATAACTTT
ACGCTGGTTG TGAATCCTAT TGGCGATGAT GTCGATGTGA ATCCTGATAC TTCAGCTGCG
GGGAATGAGG GAGAAGATAT TGTTATCAAC GTCAATGCGC TGGTGGTTGA TAACAAAGAG
TCCATCGGAG ATGGCGCAAC CTACCAGGAA AATGATCCAG AGACGCTGAG AGTTGAGATT
TCAAATGTCC CTGATGGGGC GAGTGTTTCA TTACCTGATG GAACGGTGTT TACTGACCAA
GGAAATGGCG TCTTCGTATT AGAAATCGAT GCTCAAGATT TGGATCAAGT GGTCTTTAAT
TCTGGTGATC GCAACGATAA CTCATGGGGT GGGTCACTGC ATTTCAAAGT GCAAGCGGTT
GATACCGGGC TCGATGGTCG CCAAAGTTTA GGTTCAGCCG AGGAGTTTGA CGTAGCTGTT
GATGTTGAAG CGGTGAATGA TCGACCGGAG TTCGCGAATG TCATTGACGT TGAAACACCA
GAAGACAACG CTATCCTGCT GGATACATTT GGTATTTCGG ATGTGGATGC GGTACTTGAT
GATCCTACCG CAGAGTATGT CTTGAATATT GCTGTCGATA GCGGTTACTT GGCGCTCGAA
CCATCGATTA TTGCCAATTA CGGATTGACG GTTTCTGGAG ACGGCACGGG TTCAATTGAA
CTGAAAGGAA CCGTATCTGA CCTTAACGCC GCTATCGCTG ATGGTTTAGT GGAGTTTAAC
CCAGCGCTTA ACTTCTTTGG TAACGTCAAT GTCGACATCT CTGTCGATGA CCAAGGCAAC
GAAGGTATCG TGATTAGCGG TGTCGATGAA ACACTCAATT CAAATAGTAG CCAATTTGTA
ATTGAAGTCA CAGAGGTGAA TGATGCGCCA ACAACATCGG AAGTGACGTT GACAAGTATT
GATGAAGACA GCGGCGCTGT TATTGTTACC GCAGCAGACT TGCTGGTTAA TGCGGTTGAT
ATTGAGTCGG ACAATTTAAC CGTTAGCAAT GTTACGCTTG TCGACCCTGC AGCAGGGACG
CTTACTCAAC TGAGCTCAAC GGAGTGGAGC TTTGAACCTG CTCCTGATTT TTACGGGGAT
GTAAGCTTTA ATTACGATAT TACGGATGAT GGCATGACCA ATGGCGTCTC GGATCCGAAG
ACCGTCAGTG GCTCAGCCGT GATGACAGTG CAGGCGATTA ACGATGCTCC GGAGATTGAT
GGCTCAATGG TCACAAATAC GATTGTGGAG TCATCGGATC AAAAAATCTC AGGTATCGAA
ATTACCGATG TTGATTTTGC AGGTATTCAT GAGAATGAAA TCATGACGGT GTCACTCAGC
ATCGATGAGG GTGATATAAG TGTCGTTGTC CCGGCGGGGA GTGGCATTAC TCAAGGCGTT
GGTTTAGCAG GTGAAACTGT ACTGATGGGG ACTTTGTCTC AGCTTAATAG TCTTTTCGCT
TCGACGGATC CGGATGTCGG TGTATTTATT GATGCCAGTG ACGTGAACAG CAATTCTATT
GCGCTTACCG TTACCGCTGA TGACAACGGT ATCTTCTATG ACAACTTAAC GGGTTCCTCA
CTACAAACAT CAGAAACGTT TGATATTAAC GTAACGCCTG TCGCTGACGT ACCAAACTTG
GCCATTGATC AGAACTTTAG TTACATTCAA AAAATTAGTG CGAGTCAAAG TGCAAGCCGT
CAAGGCATCG CGCTGGTGGG TATTATGGCG GCGTTGACCG ATGTCGATGA AGTACTGGCC
TTAGAGCTCA CTGGCGTGCC AAGAGGAGCC ACTATCACGA GTGAAGCTAC AACATCGAAT
ATCAGCTTTG ATGGTACAAC ATGGACTGTT CCAGAAGATG AAATTGATAC GCTACACATC
AATAATGCGA TCCCGGGAGA TTACGATATT ACATTGACTG CGGTTTCTAC TGCATCAAAT
GGTGACCAAG CGTATTCAAC GCCATTAGAT ATTAATCTAA ACGTTACGCT AAATAGTCAA
GATATTGATC AGTCTGCTGA AAGTGAAGAC AGCTATCTTA TTGGTAGTGA TGCAGGCATT
ACTTTAGCAG CAGGTACGGG AGATGATTAT ATCCTAGGAG GAGATGGAGA TGACGTGCTT
ATCGGCGGAC TAGGCTCTGA TATTCTGACT GGTGGTGCGG GTAGCGATAT CTTTAAATGG
ACAGAAGACA CGGTGGATAA TGGTGCGATT GACACCATCA CTGACTTTAG CGTCAATGAG
GACACCATTG ACCTCAAAGA CGTGATCGCT GATTTGAATG ACCCAACGGC GGGTATTGAT
GATTTACTTG CGCACATTCA AGCCGATTAC GATGCATCAA CAGAAAATGT TTCTCTGAAC
ATTACGACGG ATGCAAATGT CCAGCAAACC ATTGTGGTGG AAAACTTAGG TACATCGATA
GACTTCAACG GTCTTAGCTC CAATGAGATT GTAGAGTCAT TGCTGAATCA CGGTGTTATT
GATAACGGAT AA
 
Protein sequence
MSTQTVPQNA IVNAVKGEVL VLGLDGKARS IKAGDELISG EVIITENNAS LDVQIHNELY 
LVDANCVACL PEPSFEQPET LLQSPVNGQV TFDPTAIEGA NFDANDVAAI QQAILDGADP
TAILEATAAG AGAEGSANAG YVTVEYNNPE VLASTFFETS ATRSGVEERD EADDLDVTIF
ADGGQSLQSS VTEGSISLSS YPQTISSSVL IEAGDLALDT ASFVPETSSL ESLLTELNSD
ITSAGQPVAF VYDEAQNAII GTQEGNEVLR IEIEATSLGR DLELEVVTTI SQGIDHVASV
SDGQVSIVSD QINIAFDITG ADIGGNSIQA PIDFTTTVID GDDPAPQNVT FENVESSSTP
ITGTFVDIGS DQLATVTFNQ EGLSQFDGLL SDNQATEAVL SEDGSTITLS IAGSNETVLT
ISLNTDGTYQ FEQFKPLEQT NGEDTIELSL PTTIVDFDQD TASNTFTMTI LDGDNPVIEN
VTGLSLDEAG VDQGSQEGAV VTSGTGSITT AVGSDIIDHY ELEPTEFNVG GELQSQGQVV
QLELASESNG VRTYEGFIEL DGVRITVFDV AIDAPVLGEY QFNLYEQLDH TGVNDDSITF
SLPVYAVDAD GDRSSITDGS NTPEAAQIVI QVQDDVPSID GVDALAVDED DLSSIGSDQS
DSVLAQGSFT TTQGSDRVVS YQLESGTDPL NGLESQGRSI SLTETVNSDG SFTYSATAEG
DDIFTLQVNP DGSYSFALEG PIDHAVGSDS LTLDFTIVAT DFDGDTSSLV LPVTITDDVP
TINDVVALTV DEDDLSSVGS DQSQPTLVEG QFTTTQGSDG VVQYQLDVKA DPLNGLQSQG
QTVSIAETQN ADGSYTYSAT ANGSAVFTLI LNTDGSYSFE LQGPIDHAAN SDSLTLDFSV
IATDFDGDTS QIVLPVTIVD DKPTITDVDA ITVDEDDLGT IGSDQTDPIS IDGNFTTTQG
SDRVVSYQLD ASATPVNGLT SRGVAVTLTE TANGDGSFTY KATAGTEAVF TLTVNTDGSY
NFTLEGPIDH AVDSDELTLN FPIIATDFDG DTTNATIPVT IVDDKPVITD VDAITVDEDD
LASIGSDQSN PISIDGSFTT TQGSDRVVSY QLDTSATPVD GLTSQGVAVT LTETANADGS
FTYEATAGGN PVFTLTVDTD GSYNFTLEGP IDHAVDSDEL TLNFPITATD FDGDTVTETI
PVTIVDDVPI ITAVDALNVD EDDLNVIGSD PGGDLFVKGA FTTTQGSDRV VSYQLDSTSD
PVAGLMSQGE AITLVETANG DGSFTYVATA DGNPIFTLNV ATDGTYDFTL QGPIDHAANS
DSLTIDFPII ATDFDGDTAT ATIPVTITDD APVIDNVVPL AVDEDDLSGI GSDQSDAVYV
EGAFTTTQGS DRVVSYQLDS ASDPVSGLTS QGEPVTLVET ANADGSFTYV ATADGNPVFT
MNVNADGTYN FRLEGPVDHA LNSDELVLNF PIIATDFDGD TTTATIPVTI NDDVPTIDNV
VPLTVDEDDL ASIGSDQNND AFMSGSFTTT EGSDSVVKYQ LDATADPVAG LTSHGEPVAL
AETANGDGSF TYTATADGNA VFELVLKPDG SYTFTLLGPL DHAMNSDSLQ IDFPIIATDF
DGDTSTKILP VTIVDDQPSI INVDAISVDE DDLATIGSDQ NESVSIDGHF VTMGSDHVVR
YQLDASSNPI NGLTSHGVVV TMTESANADG SFTYTATAGS EAVFTLTVNS DGSYNFTLEG
PIDHATGSDE LTLNFPIIAT DFDGDTTTET IPVTIVDDKP TITDVDAITV DEDDLATIGS
DQNDPISIDG NFTTTQGSDR VVSYQLDASA TPVDGLTSQG VAVTLTETAN GDGSFTYKAT
AGTEAVFTLT VNTDGSYNFT LEGPIDHSVD SDELTLNFPI IATDFDGDTS VETIPVKIVD
DKPTLGGIEA TSVQTVDEDD IPTVGSDGTQ ANSIAGNFIA TDGSDGIVEY GVSDLTTPVQ
GLTSGGQSLV MVEVSNAGGV SVYEARIDGT TTPVFRVTLD ASDDSYTFDL LAPLDHPNAD
GQNELVINLP INATDFDGDE SNDITLPIMV VDDVPTIDGL LAGSEQTVDE DDLPAGTDAA
SAEDTVISGT FDITEGADQV ASIQLSDLTT PVASLTSDGE AITLVLSSSA NGVNVYQGVA
GNPAEVVFEL TLDATNNTYE FDLQKPLDHP DGNQQNEIVI NLPVTATDND GDTSPAFTLP
ITVVDDVPVV TNIESLRVDE DDLPLGSDGT KEPLTVSGEF EVTSADGIDA FELDLSSNPI
PNLKSGGEDV TLSQDVSAST ADALVYVGQT PSGVTVFTLT LHQDGKYDFE LSGQLDHAVN
SDEILLNLPV KITDGDNDTI TATLPVTIVD DKPTIDAISS GSTLSVDEDD IPSQGSDDTP
ESNIIGGNFD VTDGADSIVS FQLDSLTSPV SGLTSGGQPL ELVEFSNSNG VIEYRAYVQG
TTDTVFKLTL NGSADSYEFE LLGALDHPAG NDENSLVINF PVNATDFDGD VSNNITLPIT
VVDDVPAIVK VTDSSQQTVD EDDLAGGSDT TSNDSTVLNG GFEVVAGADK IVSYQVSDLD
AVVSGLTSNG SSIELNLVGT NGGVTSYEAV ITGTSTKIFT LSLDANNDSY QFELLGPVDH
DAVQGENNLV IDIPITVTDF DGDTSSSLNL PITIVDDIPE IKSADALAVD EDDLANGSQA
TNKDSLEATG NFDTVEGADT VVSYQLDLTS NPIPGVTSGG LAVTLVQTAV SNNNFTYQGQ
TPDGNSVFTL VLSADGSYKF TLEGALDHST QGEDTLILDL PVFATDVDGD TAGINLPVTI
TDDVPTLYDA SILRVEGQGS RTVHLFQDPV EADDDLGADG AQVTSFSADD SGIYFKQNGV
DSDSVDLNGS DQVVFVHKVL DGVDTEIGRL VVRTDGSVSF RPNDDLDHTE TDSIDFTINV
VATDGDGDIA DADVDISIRD RNAQIDTSTV NAFEDQGRDG VVVGVDSANT QDNLSTLDVT
PAKVDLVINL HDIDRNESLG DITIRDASTH NGTFYYRDGS GNFIELTPVG DTVVLDASNV
EQSFSGELVS LDNLYFVPDR HTSTDASGID PRIRVEILNN GTPDHTINGR LDIEVAAVAD
IATWTTSSEF NYSVDEDGNN VALNITAETQ DTSNPEDIVY ELVFTQGEGN AELVYSDGSA
IPQTGGVYLV DASRIGDVQI DPIDNFSGEI KIDVTAITTE NNNPLSGKET ARSETETIII
DVNPIADPGS FTVNRINVFE DNARTQDTVN PVTDHDPLQL SEVITMKPSA DLDGSEALFV
RISDFSIDGV TLVWLDSANP SQIVEVTDSS GNVLYYEVPE SELANVEVLP PLHSNDDFTF
NVEGIVKDTA SLSSGVAEDM LSLGSKTVIV GVKGVADIPF IELNDKSGIW HEFNDGNVRG
IETNIDENGQ VELGFSVISG ELPDNPNDHS ESVTVLLSNI PAGVEVFDSD GASVDLTFVG
YDAQNQPIYE ANITTANINS GIVIKPEASS TENIHITATT IVTENDGHSR TSSGEIRIIV
APVIDAQDNY TVQSEGDEDT RFNIDWKPTL AQSPDTDEFF SDVTISGFPP GSTVYVDGVA
QTLVSGTLTL SPLANESEQD FSARISQSGY VQVQLEQDSS TDFDLNTTVT VKEIDHEYVD
ASNPGQGIAE KTINGMVHVQ VNPIVEPENT TGSLDAQTRL LVTESTGVVT DIVKSDGQGN
IDFTINTSTG GESGANIIKY QEFDASSDEV VTQLVVQFHN VDPEILNQLV IVGALNEGGG
RWTVIDEENF SIKAPSGLDL TPNDDSDDGD NGGLSQIGLT IYAEVNDLGE DAVEKDATVV
RETDVTLEFP TVLTPQTSVA AEIQVADDVQ IEASEDNAID LGTQLTSKVD AINADGVEDV
LTVVIDPSAP GIPPGLVITG TDIDFVNGKY VFQADIDASG NITGLDGLTM HLAEDYAGDF
ELPVRFVTKD TESGDEKENN VRIPVQVLPI ADVPSSAGDQ PLDGDVTPNV TVDITGTLGL
DANKQPVDDL NNDVPTADGV GYEDGLIQLN LNVDFADRYN NIQGGQETLT NIKLALDDTT
LGEFVDANGN SLGTSIEFNE AEILAGALDN VLFKPKENYP VGGGQNTVKI NIEGEITDEA
VFDQSILNNP GDNIDIRTFT DDVTFEVTPV VDDITITGAD PTKPITVVGD EDTLISLNQS
GSGVSISLND NDGSESFVSL KLTGIPDDFV VQSNSSDYVV KNNGGGEWSI QLKDLTQTSV
DLSDIQIKPP KNFSGEAEIG ITVFIQEELL QVPTERTNNF TLVVNPIGDD VDVNPDTSAA
GNEGEDIVIN VNALVVDNKE SIGDGATYQE NDPETLRVEI SNVPDGASVS LPDGTVFTDQ
GNGVFVLEID AQDLDQVVFN SGDRNDNSWG GSLHFKVQAV DTGLDGRQSL GSAEEFDVAV
DVEAVNDRPE FANVIDVETP EDNAILLDTF GISDVDAVLD DPTAEYVLNI AVDSGYLALE
PSIIANYGLT VSGDGTGSIE LKGTVSDLNA AIADGLVEFN PALNFFGNVN VDISVDDQGN
EGIVISGVDE TLNSNSSQFV IEVTEVNDAP TTSEVTLTSI DEDSGAVIVT AADLLVNAVD
IESDNLTVSN VTLVDPAAGT LTQLSSTEWS FEPAPDFYGD VSFNYDITDD GMTNGVSDPK
TVSGSAVMTV QAINDAPEID GSMVTNTIVE SSDQKISGIE ITDVDFAGIH ENEIMTVSLS
IDEGDISVVV PAGSGITQGV GLAGETVLMG TLSQLNSLFA STDPDVGVFI DASDVNSNSI
ALTVTADDNG IFYDNLTGSS LQTSETFDIN VTPVADVPNL AIDQNFSYIQ KISASQSASR
QGIALVGIMA ALTDVDEVLA LELTGVPRGA TITSEATTSN ISFDGTTWTV PEDEIDTLHI
NNAIPGDYDI TLTAVSTASN GDQAYSTPLD INLNVTLNSQ DIDQSAESED SYLIGSDAGI
TLAAGTGDDY ILGGDGDDVL IGGLGSDILT GGAGSDIFKW TEDTVDNGAI DTITDFSVNE
DTIDLKDVIA DLNDPTAGID DLLAHIQADY DASTENVSLN ITTDANVQQT IVVENLGTSI
DFNGLSSNEI VESLLNHGVI DNG