Gene Avi_9641 details

Gene Information

Locus tag: Avi_9641
Symbol:
ID: 7381859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011991
Strand:
Start bp: 96775
End bp: 108060
Gene Length: 11286 bp
Protein Length: 3761 aa
Translation table: 11
GC content: 53%
IMG OID: 643653302
Product: peptide synthetase
Protein accession: YP_002551473
Protein GI: 222109208
COG category: [J] Translation, ribosomal structure and biogenesis
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0223] Methionyl-tRNA formyltransferase
        [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
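
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(96775, 108060)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the computed values match the Gene Length (11286 bp) and Protein Length (3761 aa) listed in the record.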


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAACA GGCAAGTCTC GGCGATCGCA CGCGATGCGC AAGATCCTTC GGAATCAACA 
CGCATACGGA GCTGCATCAT CGCCGGTGAA GGAACGTTAG CAATTAGATG TGCGCAGCAC
TTAATGGACA GCGGTCATCA TCTCAAAGGG GTCCTCACTT CCGAAAAAGT GCTAGCCGAT
TGGGTTTACG CGAAGAACAT TACTGTCCTG CGGTCAGTCG AGGAGCTTGC TACTCTGCTG
CTCGACGAAG GCCCTGTGGA CTGGCTTTTC TCGATCGTGA ATCCGATCTT GCTCCCTCCT
AACGTGATAG CTCGAGTGAA GGGTGGTGCA TTCAACTATC ATGATGCGCC ATTACCGCGT
TATGCTGGGG TTCACGCCAC GTCATGGGCT ATCTTGGCCG AAGAGAGAGA TTACGCTATA
TCGTGGCACC GTATCAGCAA TTTTGTAGAC GCTGGCGATA TCGTGTTGCA GCGAGCCGTT
CCCATTGTTG ACGACGATAC GGCGCTTTCT CTCAATTTGA AATGCTACCA AGCGGCAGCT
GGAGCTTTTG AGGAACTAAT AAGCCGCCTG ACCCATGGAA AAGTGGAATC GTATCAGCAA
GATCTTTCGC AAAGGACCTT CTTCTCTCGA CACGACCGGC CTGAGGCAGA TGGGATTTTG
CAGTTTAATA GACAGGCTAA GCAGCTGTCC GCAACGGTTC GCGCTTTATC TTTCGGCCCG
TATCGTATCA ATACCTTTTG CAGGCCGAAG CTCCTCGTAG CCGACATCGC TCTGGGGGTC
GGCGCCTTGG ATGTGCTGGA CGAGCGCTCA ACTGAACCAC CTGGCACAGT GATCTGTGTT
GACCATAAAG GTTGGCGGAT TTCTACGTTT GACCACGACA TCTTAGTTAG AAATCTAGTT
GTACTAGAAA CTGATGAAGA TGTATCGCCC CAAATCCTTG CCGCAGAGCT TGAAATTGAG
CCCGGGACGC AGCTACCGAT TCTCTCAGAG TCGGAAATGT CGAGGCTGAG GCGACTTCAC
ATCGAGACAT CAAAGAACAA TAGCTTCTGG CAAAAAAGAT TGGCACACGC GAATTCGACA
AGTTTGCCGT TTGCACCTTC TTGGGCGGAA CAGGGCCGCG AACGTTGGCG CGCCAGTGCG
TGGCATGTTC TGCAGGCATT TGATGCAATC CCCTTAGAGA CGAGCGGGTC GTATATTCTG
ACGTCTTGGC TCGTATATCT CTCACGAGTT ACGGGAAAGA GTACGATCGA GACGGGATGG
GCTTGGCAAC CGGATGATTC CAGTTCGGCA GCGGTAAGGG CCACACTAGC TGACGTCGTG
CCTCTACAGC TCAATGTTGG GTGGGTGGAA TCCTTTGATA GCGTACGCTC TGCTCTGTCG
TCGGAGTGCG CGACAGTCCG GGTTAAGAAT ACCTTCGCTA AGGACACTTT CCCGAGTAAG
ACGAATCTCC GATCGTCTAA GATACTGCGC CCGCGGCGGG CATGGGACAC AGCAGTTGAA
TTAGTCGATA GTGAAGTGCT CCCCCAGGAT CATCCGAAAG CAAATGTGGT TACATTTCAG
GTTGATCCCC GGAAAAGATC TTTCCGCCTG ATTTACAATG CCGAGCGGTT GTCGCCTGCC
GACGTCGACA GGATAACGGA ACATCTTATC GTCTTGTCTC GTTCGATTAT GACACCTGGC
AACGAGTTTG TGCCGGCCCA TGGCTTGGAG ATACTCAGCA GGCAAGAAAC CGAGCTTGTG
CTTGAGGGTT GGAATGAGAC CTCGGTGGAC TTCCCTTCGG AGCGCTGTAT CCATCAGTTA
TTTGAGCGCC AAGTGGCCGA AACGCCTGAC GTGGTAGCGG TCGAGCAGGA TGGCATCAGC
GTGACGTACG CTGAACTGAA TGCAAAGGCG AATATACTTG CCGATAAAAT CCGAATGAGG
ATGACTAGGC CAGACAACAT CGTTGCGATC TGCGTCGAGC GTCGCATCTA TATGATTACT
GCGTTCATTG CCGTCGTAAA AGCTGGCGCT ACCTACCTCC CGCTTGACCC GACTAATCCG
CCTGAACATA TTGCCGATAC TGTGGCCGAT GCCGGTGCGG TTCTTATACT AACCGATACC
GTCGGAGCGA AGGCACTTTC GGCCGCAGTG GCGGAGGGGC CCCACACGCT CGATATCGGT
GAAGCCATGT CGGCTAAACA TCCCGCTGAA ATTGATCTCG TCTCGGGTTT CCCCCTTATT
GCTCCTAGCC AACTGGCTTA CGTGATATAC ACGTCTGGAT CGACGGGGAA ACCCAAGGGG
GTTATGATTG AACACGCGGG ATTGGTTAAC CTCGCGACGT GGCATATAAA GACGTTCGGC
TTGAAGACTG GCAGTCGTTG TACATTGATG TCTCGGCCTG TCTTCGACGC CAGTGTTTGG
GAAATGTGGC CCGCGCTATG CAGCGGCGCC ACACTAGTGC TTCCGCCTGT AAATCTCCTG
GATGACGTCG ACGTTTTATT GAAATGGTGG CATCAGCAGG ACCTGCACGT AAGTTTTCTA
AGCACGCCCC TTGCCTCCAT TGCGTTCCAG GAGAATCTGA CGAACCCCAA CCTCCAAAGT
CTCCTCGTAG GCGGAGACCA ATTGCGAAGT GTCCCTCCGA CACTGCCCCC TAAGCTGACC
CTGGTAAACA ACTATGGACC AACGGAGGCC ACAGTGGTGG CAACATCCGG TGAGGTCAGA
GCAGGAGAAC CAGTTCTTAC TATCGGCCGC CCGATCGCCA ATACACGGGC TTACGTTCTG
GACGAATACC TCCAGCCCGT ACCGCATGGG GTCGTCGGAG AGATATATAT AAGTGGCGCG
GGCGTGGCTC GCGGCTATCT TGGTCGCGCG GGATTAACAG CGGAACACTT TGTAGCTGAT
CCTTTCAGCC GGGTAGCCGG AAAACGGATG TACAAAACCG GAGATCTGGG GAGGCATCTT
CCAGACGGTC GGCTCCATTT CATCGGACGT AACGACGACC AAATCAAGAT CCGCGGTTTC
AGGCTAGAGC CCGGCGAGAT ACAAAGGCAG CTTTGTGAGC ATGCTGGCGT GCGCGATGCG
CTGGTGATCA AACACGACAG CCACGAACTC AACGACTATC TGGCTGCGTA CATCGTGCCG
GAGCAAAAAA CGCTCACCGC CGGCGCGGAT GAACTGCGGG CTTACCTCCG CGGTTACCTT
CCTGAGTACA TGGTTCCAGC CACTTTCACG TTCTTAGAAA CGTTCCCTCT TACGCAGAAC
GGGAAGATTG ATCGAAAAGC GCTGCCAAGC CCTATCGAAG ATCCTGTCTC AAGGCCGCCT
TATGAAGCTC CATCCGGTCA TATCGAGCAA CTACTGGCGA CAACCTGGCA GGACCTACTT
TGCATAACAA GGATCGGCAC ACAGGACAAT TTCTTTGAGC TCGGTGGCCA CTCGCTGCTG
GCGGTCAAAG TCCAAGCACG GTTACGCCAA GAAGGCTTAA ATTTGAAGAC GAGCGATCTG
TTTTCGTTTC CGACAATCGC AAGTTTAGCT GGCAAGGTGG CTGACGCAGA TCCTGTTTTA
GTTGACCACA GCAAGATCGA TCCAAATTCG CCTCTGATGC CCAGCGACCT ACCGCTTATT
GACCTGTCAA AAATTGACAT CGATAGGATC GTTGAACGAG CCCCAGGAGG CAAAGGAAAT
ATTCAGGATA TTTACGCCCT CTCCCCGCTG CAAGATGGAA TTCTTTTCCA TCATATGCTC
GGCGGTATAG GTGACGCGTA TTTGTTGCTC GACCTTATCG GCTTTCGTGA ACGAGTTATT
TTGCAACGTT ATGTCGAAGC CGTCCAAACT GTTGTGGATA GGCACGAGAT TTTGCGCACT
GGTTTCATGT GGCATGGTCT CACGACTCCA GCGCAATTTG TACTGCGGAA AGCTGATATT
TCGGTCGAGT ACGTTGATCT CGATCCGAGT GCCGGTCTGG CTGTTGATCA GTTAATTGAA
CGCTATGATC CTCGCACTTA CCGTCTTGAT CTTTCTGAGC CCCCTCTTAT GCGCTTCTTT
GCCACCAGGG ACTTTCAAAC GGGGCACTGG CTACTTCTGC AGCTGTTCCA CCATTTGATA
GCGGACCATA CCACTCTGGA AATCTTGCAT TCAGAAGTTC GTGAGATCAT GGGCGGCAGC
GGGAGCAATC TTCCCCCAGC AACTCCTTAT CGTAACCTCA TCGCGCAGAC ACGGCTGCGG
GCAGCGGATG ATGAACACCA TCGCCACTTC TTCCGGTCGA TGCTGGCTGA TATCGAGGAT
CCCAGCGCTC CTTTCGACAT GGTCGATGTT TACAATGATG GAAAGGATGT GACGGAGTCT
GTCGTCACAC TGTCTGTCGA CCTTGACCGG AGGCTTAGAA GACATGCGCG GCAACTTGGG
GTCAGCCTCG CTAGCATATG TCATGTTGCT TGGGGGATCG TGGTCGCCCG AACATGCGGT
CGCGATGCAG CTGTGTTTGG GACGGTCCTC CTAGGAAGGA ATCAGGCCGG CGGCGAAGTT
GACCGCGCGA TGGGCCTATT CATTAATACG TTGCCGATCA GAGTTGATTT AGACGCTGTC
AGTGTTGGAG ATGGTATCTG CAGGACCCAT TCTCTCCTCT CGGAACTCAT GCAGCACGAG
TACGCTTCAC TAACGTTAGC CCAAGGCTGT AGTAGCGTTC CGGCATCCAT TCCCTTGTTC
AGCAGCCTCA TCAACTACCG CCATAATGAC CCATTGCAGC ACCAAGCCGA TCTCGGTTGC
GGCATTGAGC GATTGAAGTT CGAAGAGCGT ACAAGCTATC CGATTACCTT ATCTGTCGAC
GACACCGGGG TCAGCTTGGG GCTTACAGTT CAAGCAGTAC CTCCAGTCGT TCCCGGAACG
CTTAGCGGTT ACATGGTAGA AGCTCTTAGT CAGATAGCTT CTGCCCTTGA GGAAACGGCA
TCAGTTGCGC TTGAGCGCTT AGATATTTGT GATCAGCGAA CCCGTCACAA GGTTCTTGTA
GAATGGAATG ACACGCGGCG GCCGATTTCC GAAGCGCTGC TGCCGGAACT CTTTCAGGCT
CAGGTCGCCC GTACACCTGA TGCCATTGCC GTTGAATATG AAGGCGAGCA GATTTCGTAC
GCCGAACTTG AGGCCCGCGC CAATCGGATG GCCCGGTATT TAATTGAGCT GGGCGTGGGA
CCTGAGACGA TTGTCGCACT GGCGCTTCCA CGGTCAATCG ACATGGTTGT GTCTTTGCTG
GCGGTTCTCA AGTCAGGTGG GGCTTATCTG CCTCTCGACC CGCAATATCC GACTGAAAGA
TTAGCGTTCA TGGTGGCTGA CGCGCAGCCC GCCACGGTAA TCGCGGTGGC ATCGACGGCA
GAAAAGCTGC CTGAGGATGC TCCTCTCCTA CTTCTAGATG ACGCTGGAGT CCTCGAACGG
ATCTCCGCAT TTTCCGATTC CGCGGTCATC GAGACAGAAC GGCACGCGCC GCTCGACCCA
CTACATCCCG CCTATGTCAT CTATACATCC GGATCAACCG GAAAACCCAA AGGCGTGGTT
GCGTCACACG AGTCTGCGCG AAACAGGGTA ACGGCGCAAT TGTGGATCAA TCCCCCAACT
CAGGCAGATG TCTGCTGCCA AAAAACTTCG ATCAGCTTCG TTGACAGTGT CTATGAGCTG
CTGCTTCCCC TCCTAGGTGG GGCGCATGTA TGTATCGCAC CGGATGAAAT TGGGACGGAT
TTGCAGGCTC TACTAAATTT CATTGAGGTA CACTCCGTTA CTAGAATCGT GCTCGTGCCC
AGCGTAGCCG AAAAATTACT ATCTCTCGCA GAAGCAAAGC AGAAGTGTAA TACTCTCGCC
GTTTGGACAT TGAGCGGCGA GGCCTTAGAG TCGAATTTGG TCGCCACGAT GAGGCGCGCC
TTTCCTGATG CTTCAGTCTT CAACTTGTAC GGATCATCAG AGATTGCGGC GGATGCGGTA
ATGCATCGGA TCAGCGATGT GGAATCCGAG GGTTCGGTTC CGATTGGGCG CCCGATTTCG
AATACATTTG TTTATGTCCT CAATGATACT TTGCAACTGG TTCCGCCTGG GGTTGTCGGG
GAGCTTTACA TAGCGGGAGC AGGCCTGGCG CGAGGTTATC TGCGTCGGCC CGGCCTGACT
GCCGAACGGT TTGTAGCTGA TCCCTTTGGT GAGGCTGGCA CGCGCATGTA TCGGACTGGA
GACCTCGTCA AATGGCGTAG CGACGGTATT CTGGACTATT TAGGGCGGGC CGACGAGCAG
GTTAAGATCC GTGGTTTCCG TATTGAGCCC GGGGAGATCG AGGCCGCTCT GCTCTCTCAT
CCTAGCGTTT CACAAGCCGT CGTCATCGCG CGCGAGGATA GCTCCGGTGA CAAAAAGCTT
CTGGGATATG TGGTCAGTCA GTCGGGCTCT CTCATAGACG CACAAGCATT GCGGCAATTT
CTCCGCGATC GTCTTCCTGA TTATATGGTT CCAGCGGCGA TCATAGCGCT CGATCGTCTT
CCCCTGTCTC CAAATGGCAA ACTTGATCGC AAGGCTCTGC CGTCGCCTAC TTACGAGGTT
TCAGTCGGCC GGGAACCTTC GACCCCACAG GAGGAGCTTC TTTGTGGCCT TTTCGCTGAA
GTCCTGGGGT TAAATCAGGT AGGTATCGAC GATAACTTCT TCGACCTGGG GGGACATTCA
CTACTTGCTA CTCGCCTAAT ATCTCGAATC CGCTCCACCT TCAGCCGTGA GGTTGCTATC
CGAACGCTTT TCGAGGCCCC GACCGTTGCT CAATTAGATC GATACCTCAA TAAGTCGGAT
ACCGCGCGGA CGGCATTGCA AAAACAGATG CGACCGCCGC GTCTTCCTCT GTCTCCGGCG
CAGCGTCGCC TTTGGCTTCT AGACCGGATT GATGGTCATA GCTCGACATA TAATATTCCG
ATCGCCCTGC ACCTGAACGG CGATATAGAG ATATCTGTCC TAAAGTCATG CTTGAAAGAC
ATCCTACTAA GGCACGAAAG CCTAGCGACT ACTTTCGTAG AGGTTGAAGG AATTCCCCAC
CAAGAGGTGG TATCCGCTGA CAATTTAAAT ATTGATATTA TCACAAAGAA GATTGCCCAT
GAGAATCTCA GTACTGCACT TGTAGCTGCC GCAAACTATA ACTTCGATCT CTCGATAGAT
GTCCCAATAC GACCTTATCT ATTCCAACTC GGCCAAAATG AATATGTGCT CCTGATCCTC
CTGCACCATA TCGCCGCCGA TGGATGGTCG ATGTTTCCGC TGTTGAGAGA TCTTTCTGCT
TCCTACAATG CTCGGAGCAA AGGAAAAGAG CCTGAATGGA CCCCTTTGTC GGTTCAGTAT
GCCGACTATA CACTGTGGCA TCAAGACGTC TTAGGCAGCG AAAGCGACCA GGATAGTCTC
ATTTCAAAGC AAATTGAGTA TTGGCGAGAA GCGCTTAAGG GCCTTCCGGA CGAGATCAAT
CTACCAGTCG ATCGAACCAG GCCCGCAATA GAAAGCCATC GCGGGGCGGA GGTGGAATTT
GCTCTTTCTG GGGAGCTTCA TGCCGGCTTG AAAGCCTTGG CAAGGAACGA GCGTGTAACA
TTATTTATGC TCTTCCATAG CACCCTCGCA GCGCTCCTTT ATCGGTTAGG CGCGGGGACA
GACATTGCGA TTGGGAGCCC GGTAGCGGGA CGCATGGATG CTGCTCTTGA GGACCTCGTC
GGATTTTTCG TCAACACCCT CGTATTCCGT ATAGACACTG CCGATAATCC CACGTTCCGC
GAGCTATGTG GCCGGGTTAG GGATATTACA CTTGATGCTT ATGCACACCA AGATCTACCC
TTCGAGCGGC TGGTCGAGCT TTTGAACCCA AGCCGTTCTT TAGCAAAGCA TCCCCTCTTT
CAAGTCCTTC TAGTGCTTCA CAACAACCTT GAGGCCAATC TCGCATTTGA TGGCGTCGAC
GCCTCAATTG AAAAGGTAAA CGTAACCTCC GCCAAATTCG ACCTCTCTTT TGCCCTCCGA
GAACAGTTCG ACGCGGATAA GGCACCTGCC GGCATCTTCG GCGCAATCGA ATATGCAACA
GACCTTTTTG ATCGCGACAC CGTAGAAGCG TTTGCCGATC GGTTTATCCG CCTTCTGGAG
GCTATTGTCT CCGATCCCGA ACAACGCCTA GACGAAATCG ATCTGTTAGA CCCCGACGAA
CGTCACAAGG TTCTTGTAGA ATGGAATGAC ACGCGGCGGC CGATTTCCGA AGCGCTGCTG
CCGGAACTCT TTCAGGCTCA GGTCGCCCGT ACACCTGATG CCATTGCCCT TGAATGTGAA
GGCGAGCAAA TTTCGTACGC CGAACTTGAG GCCCGCGCCA ACCGGATGGC CCGGTATTTA
ATTGAGCTGG GCGTGGGACC TGAGACGATT GTCGCACTGG CGCTTCCACG GTCAATCGAC
ATGGTTGTGT CTTTGCTGGC GGTTCTCAAG TCAGGTGGGG CTTATCTGCC TCTCGACCCG
CAATATCCGA CCGAAAGATT AGCGTTCATG GTGGCTGACG CGCAGCCCGC CACGGTAATC
GCGGTGGCAT CGACGGCAGA AAAGCTGCCT GAGGATGCTC CTCTCCTACT TCTAGATGAC
GCTGGAGTCC TCGAACGGAT CTCCGCATTT TCCGATTCCG CGGTCATCGA GACAGAACGG
CACGCGCCGC TCGACCCACT ACATCCCGGC TATGTCATCT ATACATCTGG ATCAACCGGA
AAGCCCAAAG GCGTGGTTGT GAATCATTCG GGGTTCACAA ATTACCTTCT CTGGGCTGAG
AGCTACTATC CGAAAAACCG GGGCACCGGT ACGCCACTGA CCACCTCACT CGCTTTCGAC
GCTACGGTAA CTAGCCTCTA CTTACCGCTA CTGAGGGGCA GTCGGGTCAT CCTACACGCA
GAAGGTACCG AGATCGAGCG CGTTCAGAAG GATATCGCTG GCGGTCCTTT GAACTACTCT
TTGATGAAAA TCACCCCAGC CCATTTGGAC TTATTTGGGA GCGTCGTTGC GAGCGATAGG
CTCCATGAGA TTTCTAATTG CCTAATTGTC GGCGGAGAGG CACTTCTAAG TTCGCACATA
AAACCGTGGC TCACCGGGGG ATCTCCGGTG AAGATTTTTA ACGAGTACGG TCCGACGGAG
ACGGTGGTGG GCTGCACAGT CTTCATGGCT GGTTCCGATC GGCAACTCGC AGCTTCGGTT
CCGATTGGGC GCCCGATTTC GAATACATTT GTTTATGTCC TCAATGATAC TTTGCAACTG
GTTCCGCCTG GGGTCGTCGG GGAGCTTTAC ATAGCGGGAG CAGGCCTGGC GCGAGGTTAT
CTGCGTCGGC CCGGCCTGAC TGCCGAACGG TTTGTAGCTG ATCCCTTTGG TGAGGCTGGC
ACGCGCATGT ATCGGACTGG AGACCTCGTC AAATGGCGTA GCGACGGTAT TCTGGACTAT
TTAGGGCGGG CCGACGAACA GGTTAAGATT CGCGGTTTCC GTATTGAGCC CGGGGAGATC
GAGGCTGCTC TGCTCTCTCA TCCAAGCGTT TCACAAGCCG TCGTCATCGC GCGCGAGGAT
AGCCCCGGTG ACAAAAAGCT TCTGGGATAT GTGGTCCATA ATTCGCATGG GCTTGCACCC
GCTGATGTGC GAGAGAGCAA ACTTGCGGAA TGGCATGAAC TGTACGAGAA CGAATATGCA
CTGCCCTCTG ACGCTCCTTT TGGCGAGGAC TTCAGAGGAT GGACAAATAG CTACGATGGG
GCACCGATAC CACGTCAGGA GATGGAAGAT TGGCGAGAGG AGACCGTCAA GCGGATCATC
GCCCTCAATC CATCGAAAGT CTTGGAGATC GGAGTAGGAA CAGGGCTCAT CTTATCCCGT
ATCGCACCAC TTTGCGACGA GTATTGGGGT AGCGATATCT CTGGCACCGC AATATCTAAG
CTCGCCTCTC TACTGTCAGC TCATGACAAG TTGAATGAAA AGGTACGGCT TATAACCGCT
CCGGCCCACG ATCTTTCTAC CATGCCGCTT GGATACTTCG ACACTATTAT CATAAACTCG
GTCACTCAGT ACTTTCCTGA GCTCGGGTAT CTCACCGAGC TCCTAGAGCG TCTCCAAGGA
TACTTAGCTC CTAGGGGAGC GATTTTCATC GGCGATGTTC GCAACCACCG ACTATTTCGT
ATGTTCGAAA CCGCAAAACA GGTTCGGATT GCGGCTCCTG AAGACACTCC TGCGATCATC
TCAGAGCGAA TTGAACGAGC ACTTCAGTCG CCAAGCGAGT TACTCATAGA TCCTTGCTAC
TTCGAGAAAT TGAACAGCCA CGCAGCGGTA AACATGATCG CCGACGTCCG AGTAAAGACT
GGCCGATTCT CCAACGAGTT GACGAAGTAT AGATACGACG TAATATTAAA TTTCCCTGTG
CCTTCGAGTG CTGCGGGCCC CCAAATAAAT TGCGTTCAGT TCAACAGCAG ACATCACGAC
GCGGATGTGA TTGGTCAGCT TCTTGAAGCA AGCCAAAATG CACTGGTTCT TGTCGGGGTT
CCAAATCAAA AGCTCGATGA AGATCGTTTC GTGTTAGCTA CTTTTCGGGC GCGCGTAAAA
CCTTCGAACG CGGATTCCCA CACTACTGCG CACCCTGGCG GCGAGCTTTT ACCTGTCTTT
CAGACGTTAG CTCAAAGGTA CGGCCGGGAC TTGATTCTCA GATGGAACAA TGAATCTACC
ACTGGGGCCA TGGACGTCGC ATTTGTTCCT GCCCACATTT CCGCACAGGC GGTCTCCGGC
CCCAGCTTCC CCGGAAATCA CGATATATCC AGCATCCCTG CCCGGATCTC CTTTGCCGAT
GATATGGCAC ATGAGCTTCG GGAGTATTTA GAGATATTCC TTCCTGCTCA TATGCTTCCC
GCCTCAATAA TAGCCGTTGA GAGCATTCCT TTATCACCCA ACGGGAAAGT CGACCGCGGT
TCTCTCCCCT CCCCCGATCT CAATCCCCGG AAAGGGAGCG ATCCCAAAAC AGCAATCGAA
CATCAACTTT GCCAAATTTT TTCTGACGTA CTCGGTATTC CGGAGATCGG GACAGATGAT
GACTTCTTCC GGCTCGGTGG TAACAGTATA TCAGTCATAC GGTTGATATC ACGTGCACGA
CACGAGATGT TGTCACTCTC CCCTCGAGAC GTGTTTGAAT GCAAGACGGT GGCTAAGTTG
GCACCGCGGG CCGCGCCTGC TCCGCAAGGC AAAACTGCTC AAAATGAGAG AGGTTCGCCC
CAGATGGGCC TAGCCCCTGT GGCGTTTGAT AAGCTTCAAG ATAAATGGGG AGGCCGCAGT
GCCTAA
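
The gene sequence above is printed in space-separated blocks of 10 bases, 60 bases per row. A minimal sketch of how such a listing could be joined into one contiguous string before computing its length or GC fraction is shown below. The helper names are hypothetical, and only the first row of the listing is used as sample input.

```python
def join_sequence_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence listing, copied from this record.
first_row = "ATGTTTAACA GGCAAGTCTC GGCGATCGCA CGCGATGCGC AAGATCCTTC GGAATCAACA"
cleaned = join_sequence_blocks(first_row)
print(len(cleaned), cleaned[:3], round(gc_fraction(cleaned), 2))
```

Applying the same two helpers to the full listing would give values that can be compared with the Gene Length and GC content fields.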
 
Protein sequence
MFNRQVSAIA RDAQDPSEST RIRSCIIAGE GTLAIRCAQH LMDSGHHLKG VLTSEKVLAD 
WVYAKNITVL RSVEELATLL LDEGPVDWLF SIVNPILLPP NVIARVKGGA FNYHDAPLPR
YAGVHATSWA ILAEERDYAI SWHRISNFVD AGDIVLQRAV PIVDDDTALS LNLKCYQAAA
GAFEELISRL THGKVESYQQ DLSQRTFFSR HDRPEADGIL QFNRQAKQLS ATVRALSFGP
YRINTFCRPK LLVADIALGV GALDVLDERS TEPPGTVICV DHKGWRISTF DHDILVRNLV
VLETDEDVSP QILAAELEIE PGTQLPILSE SEMSRLRRLH IETSKNNSFW QKRLAHANST
SLPFAPSWAE QGRERWRASA WHVLQAFDAI PLETSGSYIL TSWLVYLSRV TGKSTIETGW
AWQPDDSSSA AVRATLADVV PLQLNVGWVE SFDSVRSALS SECATVRVKN TFAKDTFPSK
TNLRSSKILR PRRAWDTAVE LVDSEVLPQD HPKANVVTFQ VDPRKRSFRL IYNAERLSPA
DVDRITEHLI VLSRSIMTPG NEFVPAHGLE ILSRQETELV LEGWNETSVD FPSERCIHQL
FERQVAETPD VVAVEQDGIS VTYAELNAKA NILADKIRMR MTRPDNIVAI CVERRIYMIT
AFIAVVKAGA TYLPLDPTNP PEHIADTVAD AGAVLILTDT VGAKALSAAV AEGPHTLDIG
EAMSAKHPAE IDLVSGFPLI APSQLAYVIY TSGSTGKPKG VMIEHAGLVN LATWHIKTFG
LKTGSRCTLM SRPVFDASVW EMWPALCSGA TLVLPPVNLL DDVDVLLKWW HQQDLHVSFL
STPLASIAFQ ENLTNPNLQS LLVGGDQLRS VPPTLPPKLT LVNNYGPTEA TVVATSGEVR
AGEPVLTIGR PIANTRAYVL DEYLQPVPHG VVGEIYISGA GVARGYLGRA GLTAEHFVAD
PFSRVAGKRM YKTGDLGRHL PDGRLHFIGR NDDQIKIRGF RLEPGEIQRQ LCEHAGVRDA
LVIKHDSHEL NDYLAAYIVP EQKTLTAGAD ELRAYLRGYL PEYMVPATFT FLETFPLTQN
GKIDRKALPS PIEDPVSRPP YEAPSGHIEQ LLATTWQDLL CITRIGTQDN FFELGGHSLL
AVKVQARLRQ EGLNLKTSDL FSFPTIASLA GKVADADPVL VDHSKIDPNS PLMPSDLPLI
DLSKIDIDRI VERAPGGKGN IQDIYALSPL QDGILFHHML GGIGDAYLLL DLIGFRERVI
LQRYVEAVQT VVDRHEILRT GFMWHGLTTP AQFVLRKADI SVEYVDLDPS AGLAVDQLIE
RYDPRTYRLD LSEPPLMRFF ATRDFQTGHW LLLQLFHHLI ADHTTLEILH SEVREIMGGS
GSNLPPATPY RNLIAQTRLR AADDEHHRHF FRSMLADIED PSAPFDMVDV YNDGKDVTES
VVTLSVDLDR RLRRHARQLG VSLASICHVA WGIVVARTCG RDAAVFGTVL LGRNQAGGEV
DRAMGLFINT LPIRVDLDAV SVGDGICRTH SLLSELMQHE YASLTLAQGC SSVPASIPLF
SSLINYRHND PLQHQADLGC GIERLKFEER TSYPITLSVD DTGVSLGLTV QAVPPVVPGT
LSGYMVEALS QIASALEETA SVALERLDIC DQRTRHKVLV EWNDTRRPIS EALLPELFQA
QVARTPDAIA VEYEGEQISY AELEARANRM ARYLIELGVG PETIVALALP RSIDMVVSLL
AVLKSGGAYL PLDPQYPTER LAFMVADAQP ATVIAVASTA EKLPEDAPLL LLDDAGVLER
ISAFSDSAVI ETERHAPLDP LHPAYVIYTS GSTGKPKGVV ASHESARNRV TAQLWINPPT
QADVCCQKTS ISFVDSVYEL LLPLLGGAHV CIAPDEIGTD LQALLNFIEV HSVTRIVLVP
SVAEKLLSLA EAKQKCNTLA VWTLSGEALE SNLVATMRRA FPDASVFNLY GSSEIAADAV
MHRISDVESE GSVPIGRPIS NTFVYVLNDT LQLVPPGVVG ELYIAGAGLA RGYLRRPGLT
AERFVADPFG EAGTRMYRTG DLVKWRSDGI LDYLGRADEQ VKIRGFRIEP GEIEAALLSH
PSVSQAVVIA REDSSGDKKL LGYVVSQSGS LIDAQALRQF LRDRLPDYMV PAAIIALDRL
PLSPNGKLDR KALPSPTYEV SVGREPSTPQ EELLCGLFAE VLGLNQVGID DNFFDLGGHS
LLATRLISRI RSTFSREVAI RTLFEAPTVA QLDRYLNKSD TARTALQKQM RPPRLPLSPA
QRRLWLLDRI DGHSSTYNIP IALHLNGDIE ISVLKSCLKD ILLRHESLAT TFVEVEGIPH
QEVVSADNLN IDIITKKIAH ENLSTALVAA ANYNFDLSID VPIRPYLFQL GQNEYVLLIL
LHHIAADGWS MFPLLRDLSA SYNARSKGKE PEWTPLSVQY ADYTLWHQDV LGSESDQDSL
ISKQIEYWRE ALKGLPDEIN LPVDRTRPAI ESHRGAEVEF ALSGELHAGL KALARNERVT
LFMLFHSTLA ALLYRLGAGT DIAIGSPVAG RMDAALEDLV GFFVNTLVFR IDTADNPTFR
ELCGRVRDIT LDAYAHQDLP FERLVELLNP SRSLAKHPLF QVLLVLHNNL EANLAFDGVD
ASIEKVNVTS AKFDLSFALR EQFDADKAPA GIFGAIEYAT DLFDRDTVEA FADRFIRLLE
AIVSDPEQRL DEIDLLDPDE RHKVLVEWND TRRPISEALL PELFQAQVAR TPDAIALECE
GEQISYAELE ARANRMARYL IELGVGPETI VALALPRSID MVVSLLAVLK SGGAYLPLDP
QYPTERLAFM VADAQPATVI AVASTAEKLP EDAPLLLLDD AGVLERISAF SDSAVIETER
HAPLDPLHPG YVIYTSGSTG KPKGVVVNHS GFTNYLLWAE SYYPKNRGTG TPLTTSLAFD
ATVTSLYLPL LRGSRVILHA EGTEIERVQK DIAGGPLNYS LMKITPAHLD LFGSVVASDR
LHEISNCLIV GGEALLSSHI KPWLTGGSPV KIFNEYGPTE TVVGCTVFMA GSDRQLAASV
PIGRPISNTF VYVLNDTLQL VPPGVVGELY IAGAGLARGY LRRPGLTAER FVADPFGEAG
TRMYRTGDLV KWRSDGILDY LGRADEQVKI RGFRIEPGEI EAALLSHPSV SQAVVIARED
SPGDKKLLGY VVHNSHGLAP ADVRESKLAE WHELYENEYA LPSDAPFGED FRGWTNSYDG
APIPRQEMED WREETVKRII ALNPSKVLEI GVGTGLILSR IAPLCDEYWG SDISGTAISK
LASLLSAHDK LNEKVRLITA PAHDLSTMPL GYFDTIIINS VTQYFPELGY LTELLERLQG
YLAPRGAIFI GDVRNHRLFR MFETAKQVRI AAPEDTPAII SERIERALQS PSELLIDPCY
FEKLNSHAAV NMIADVRVKT GRFSNELTKY RYDVILNFPV PSSAAGPQIN CVQFNSRHHD
ADVIGQLLEA SQNALVLVGV PNQKLDEDRF VLATFRARVK PSNADSHTTA HPGGELLPVF
QTLAQRYGRD LILRWNNEST TGAMDVAFVP AHISAQAVSG PSFPGNHDIS SIPARISFAD
DMAHELREYL EIFLPAHMLP ASIIAVESIP LSPNGKVDRG SLPSPDLNPR KGSDPKTAIE
HQLCQIFSDV LGIPEIGTDD DFFRLGGNSI SVIRLISRAR HEMLSLSPRD VFECKTVAKL
APRAAPAPQG KTAQNERGSP QMGLAPVAFD KLQDKWGGRS A