Gene Dd1591_2008 details

Gene Information

Locus tag: Dd1591_2008
Symbol:
ID: 8119325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dickeya zeae Ech1591
Kingdom: Bacteria
Replicon accession: NC_012912
Strand:
Start bp: 2291630
End bp: 2303020
Gene length: 11391 bp
Protein length: 3796 aa
Translation table: 11
GC content: 68%
IMG OID: 644852396
Product: filamentous hemagglutinin family outer membrane protein
Protein accession: YP_003004334
Protein GI: 251789613
COG category: [S] Function unknown
COG ID: [COG2911] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
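
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2291630, 2303020
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 11391
print(protein_length_aa(length))  # 3796
```

Both results agree with the gene length and protein length listed in the record.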


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCGG TGAAGACGTC GCAGCGGGTG AAGGTGTGGG CGCTGGTGTG GCTGACGGTG 
CTGCAGCCGA TGCTGCCGGC GTGGGCAGCG GGGGTGACGG TGGCGTCGGG CAACACGGCG
CTGGAGGCGG CGGGCAACGG GGTGCCGGTG GTGAACATCG CCACGCCGGA CGCGTCGGGG
CTGTCGCACA ACCGCTACCA CGACTTCAAC GTGGACCCCC GCGGACTTAT CCTCAACAAC
GGCACCGCCC GGCTGACGCC GAGTCAGCTG GGCGGGCTGC TCCAGAACAA CCCCAACCTG
AACGGCCGGT CGGCCGCGGC CATCCTCAAC GAAGTGGTGT CGCCGAACCG CAGCCAACTG
GCGGGCTACC TGGAGGTGGC GGGCCCGGCG GCCAATGTGG TGGTGGCCAA TCCGTACGGC
ATCACCTGCA GCGGCTGCGG CTTTCTCAAC ACCCCGCGCA TCACGCTGAC CACCGGCACG
CCGCAGGTCG ACGCGGCGGG CGGGCTGAGC GGTCTGGACG TGCGGGGCGG GGACATCCTG
ATAGACGGGG CGGGGCTGGA TGCGAGCCGC AGCGATTATT TTGCGCTGAT AGCGCGCACG
GCGTCGCTGC AGGCGGGGCT GACTGCGCGG GAGGCGCAGG TGGTGCTGGG GGCGAACCGG
GTGGGGTCGG ACGGCCGGGT GACGGCGCAG GCGGGCGAGG GGACGGCGCC GGTGCTGGCG
CTGGACACCG GGGCGCTGGG GGGGATGTAC GCCAACCGCA TCAGTCTGGT GTCGACGGAG
CGGGGCGTGG GGGTGAACAC GGCGGGGCTG AGCGCGCGGG AGGGCGACAT CCGGCTGTCG
GCGAACGGGC GGCTGCAGGT GGGCGACGCG GTGGCGCAGG GGGCGCTGAC GGCGCAGGGC
GAGACGCTGG CGCTGCAGGG CAGCCAGCAG GCGCAGGGCG CCGTCACGCT GAGCGGGGCG
CAGGGCGTCA TGCTGACGGG CAGCCGGACG CGGGCGGGCC AGGGGCTGAC GCTGGCGAGC
GACGGGCGCC TCACGGCGGC CGACGCGGGG CTGAGCGCGG GGGTGCGGGA AGACGGGACG
GTACAGCCGG GATACGGGCT GCGCCTGACG GGGCGTGAAC TGGCGCTGGG GCAGAGCCAG
CTGGCGGGCG ACCGGGTGAG CCTGACGGCG TCGGGGGCGG TGAGCCAGTC GGCGGGCGGG
GCGTGGCAGG CGGGCAGCGG GCTGACGGTG AACGGCGGCG CGCTGTCGCT GGACGGCGAC
GCGGGGGCGC AGACGCTGAC GGTCAGTGGC GGCAGCCTGA GCGGCGCAGG GCGCTGGCAG
GCGACGGGGG ACCTGACGCT GGACGGACTG ACCACGGCGC AGTGGGACGG GGCGCTGCTG
GCGGGCGGGG CGCTGTCGGT GGGGGCGGCG AGCCTGGTCA ACCGGGGCAC GCTGGCGGGC
GGCACGGTGA CGCTGACGAC GCCGGCGCTG AGCAATACGG GCACGGTGAG CGGGCGGCAG
GTGGCGGTCC GGACGGCGCA GCTGACCAAC GGCGGCACGC TGTCGGCGGA CGATGCGCTG
ACGGTGCAGG CGCAAACGCA ACTGGACAAC GGCGGGTCGC TGCTGTCCGG CGGGGCGCTG
ACGATACAGG CGGGGCAGAC GGACAACCGG GGGGTGCTGG CGGGCGGCGC GGTGACGCTG
AGCGGCGACG GGCTGTGGAA CGGTGGGGTG GTGCAGGGGC GACAGTCGCT CGGGGTGAGT
GCGCTGAGCG GGTTCAGCCA GACGGCGGAC GGGTTGCTGA CCAGCGGCGG CGGGATGACG
CTCTCAACGG GCGACATGGA CACGGCCGGC CGGCTGACGG CGCAGGGGCT GTCGCTGAGC
GCCGGGCGGT GGCGCAATAC GGGCACGGTG AGCCTGACGG GCGACGGGCA GCTGACGGTG
GACGCGCTGG ACAACGACGG GACGCTGCTG TCGGCGGGCG CCTGGGCCAT CCGGGGCGCG
GCGGTGACGA ACCGGGGCAC GCTGCAGGGC GACGGGCTGA CGCTGCAGGG CGAGCGGCTG
GACAACCGCG GCAGCCTGAC CGGTGTACGG TCGCTGACGG TGACCCTGCC CGGCACGCTG
ACCAACACCG GTTTGCTGAC CGGACAGCGG CTGGACCTGA CGGCGGGGTC GCTGGCCAAC
GGCGGCACGC TGCTGGGGGT GGACGCGCTG ACGCTGACGG CGGACGGGGC GCTGACCAAT
ACGGCGACCG GCCGCCTGCT GACGCAGGGG GCGGCGGTGC TGACGGCGGC GTCGGTGGCC
AACGACGGTG AGTGGCAGGC GGGCAGCCTG CAGCTGACGG CCGACCGCCT GGGCAACGGC
GGGCGGATAC AGAGCGGCGG CGCCCTGACG GTGGCGCTCT CCCCGGCGGG GACGTTGACC
AACACCGGCA CGCTGGCGGC GCAGGGCGGG CTGACGCTGA GCGGCAACTA CGCCGGCGCG
GGTGAGCTGT ACAGCGCGGG CGGCCTGACG CTGGAGGGGA CGACTATCGT CAACGACGGC
GGCCACTGGC AGGGCGACAC GCTGGATATC CGCGGCGGGT CACTGACCAA CAGCGGCACG
GTCACCGGGC TGACGGCGCT GACGGTGACG ACGGCGGGGA CGCTGACCAA TACCGGGCGG
CTGGAAGGAC GGCGACTGGA CCTGACGGCG GACCAACTCG ACAACGGCGG CACGCTGCTG
GGGGTGGATG CGCTGACGCT GGCGATAGCC GGCACCGCGC GCAATCAGGC CGGGGGCCGG
TGGCTGAGTC AGGGCGACGG GCGTCTGACG GCTGCCGAGC TGGACAATCA GGGCGACTGG
CAGGGCGACC GTATTGCGGT CACGGCAGGC CGCGTGCGCA ACGCCGGGCA GGTGCTGGGT
ATCGCGGCGC TGACGCTGAC GGCCGATGAC GCGCTCACCA ACACCGCCAC CGGCCGTCTG
CTGACGCAGG GGGCGGCGGT GCTGACGGCG GCGACGGCAG AGAACGACGG TGAGTGGCAG
GCGGGCCGTC TGCAGCTGAC CGCGGACCGC CTGCGCAACG GCGGGCGTAT CCTCAGTGAG
GGCGACCTGC AGATTACGCT ACCGGTCGCG GACGACGGGC TGCTGCGGGC CGTGCGGCAA
CTGGCGCAGG ATGTGCAGGT CGCCGGTCAG GGGACGCTGA GCAACAGCGG TACGCTGGCG
GCGGACGGCG ACAGCCGGAT AACCGGCCGT CAGGTGGACA ATCAGGGACT GCTGGCGACC
GGCGGCGCGC TGACGCTGAC CGCCGGGGAC CTCACCAACG GTGGCCGCGT GGAGAGCCGC
ACCCTGCAAC TGACCGGCGA CCGTCTGGAT AACGGCGGCA CGCTGCTGGC GGAGCAGGGC
GGCGTGCTGC GTCTGACCGA CGGGCTGACG GTGGGCGCGG ACGGACGGCT GCTGAGCAAC
GGCGACTGGC AAATACAGGC CGGCACGGTG ACCAGCCTGG GGCAGTGGCA GGGGAAAAAC
CTGCTGCTGA GCGCGGATTC ACTGACCAAC GACGGCGCCC TGCGGGCGAC GAACGACGTC
ACGCTGACCC TGACGCAGGA CTACACCGGC GGTGCGGGCA GTCAGGTGCG GGGCAACGGC
GCGGTGACGC TGACGGCGGA CCGGGTGACG CAGCAGGGCG ACATCGGCGG CGAGCGGCTG
CAGGTCACCG CCGGCACGCT GACCAACGGC GGGCGGCTGG TGGGGCTGTC GCAGCTGGAC
GTCGCCGGCC GGGGGGCGCT GACCAACACG GCAGACGGGG CGCTGCTCGG CAACGGCGTG
ACCGGTATCA CGGCCGCGAC GCTGGATAAC GCCGGCGTGC TGCAGGGCGA TGCGCTGACG
CTGCAGGCCG GGACGGTGGA CAACCGCGGG CGTATTCAGG GCACGTCGGG CCTGACGCTG
GAGGGGGTGT CGCGCTATAC CGGCACCGAC GGCAGCCGGC TGCTGAGCGG CGGCACGGCC
ATGCTGGCCA TCGACAGCGC CGACAACGCC GGGCTGTGGC AGGCAGGCGA CCTGCGCTTT
AGCGGGACGT CGCTGACCAG CCGGGGACAG ATAACCGGGC TGAACAGCCT GAATATCGAT
GCCGACAGCC TGACCAGCAC CGGCCAGCTG ACCACCCGGA GGGTGGCGAC GCTGCGCGGG
CGGCAGTTCG ACAACGGCGG CACGCTGACG GCGCTGGGCG ACCTGACGGC GGACTTCGGC
GACGGCATCG TCAATCAGGC GGGCGGGCAG CTGCTGAGCG GCGGGGCCGG CCGGCTCACC
ACCGGCACGC TGGACAACCG GGGTCTGTGG CAGGCATCGC GGCTGGCACT GACGGCGGAC
ACCCTGTTCA ATCAGGGGAC CCTGCTGGGG CTGGACGACG GCGACATCCG ACTGACCGGT
GCCTATGTGG GCGGGGTGGA CAGCCGGGTG GGCGGCAACG GCGCGTTCGG CCTGAGTGCG
GCGACGATAG ACCAGGCCGG GCAGTGGCAG GCGCGGGACG TCACCCTGCG GGGCGGGAGC
CTGCGCAATC AGGGCACGAT AACCGCCGGC GGCCAGCTGA ACGCCGTGCT GGATGAGGCG
CTGGAGAACA GCGCCGGGGC GGTGCTGTCG GGCGGCACGG TGTCGCTGGG CGCGGCGACG
GTGAGCAACG GCGGGCAGAT ACAGGGGCGC AATAGCCTGA CGGTAGAGGG CGGCAACCTG
CTGGACAACC TGAGCGGCGG CCAGCTGCTG TCGGGCGGCG CACTGACGCT GAACACGCCA
CACCTGACCA ACGCGGGCTG GGTGCAGGGC ACGGACCTGA CGCTGGGGAC GGCGCAACTG
GACAACAGCG GCACGCTGCA GGCGCAGAAT GGGCTGACGC TGCGCCTGCC GCAGTGGACC
AACCGCGGGA CGGTGCAGGC GGGGCAGCTG GACATCACCA CCGACGGGCA ACTGGAGAAC
CGGGGCACGC TGCTGGGGCT GACGCGGCTG GCGCTGCAAG CGGCGCGTCT CACCAACGCG
GACGGGGCGC GGCTGTACAG CGCAGGCAAC CTGCAACTGC GCACCGGGCA ACTGGTGCAG
GACGGGCAGT TGGCGGCGCT GGGCGACCTG CGGGCGGACA TCGGCAACCC GTTCACCGTC
ACCCGCGCGC TGGCGGCGGG GGGGCAACTG ACGCTGAATG TCACGGGGGA CCTGGTGCAG
GCGGGGACGC TGCAGGGCAA CGGGGTGACG GTGACGAGCA GCGGCACGCT GACGCAGCAG
GGCCGCATCG TGGCGGGCGG TGGTAACAGT ACGCTGTCGG CGGCAACCAT CAACCAGACC
GAGAGCGGCA GTATTCAGGG GGGCGGGCCG CTGAGTCTGC TGACCACGGG CGACATCACC
AACCGGGGCT TTGTGGGGAC AGCGGGCGAC CTGCTGCTGC AGGCGGGCGG GGTGATAGAC
AACGGCAGCC TGCTGTACGG CGGGGGCAAT CTGTGGCTGC TGTCGGATGC GCTGGTCAAC
CGGTTCGGCA ACATTCTGGC GGGCAACAGC CTGTGGATAC AGCGCGATGC GGCGGGCAAT
GCCAGCGGCA GCGTGCTGAA CAGTTCGGGG ACCATCGAGA CGCAGCGGGG GGATATCACG
GTCCGCACCG GGACGCTGAC CAACCAGCGG GAGGGGCTGG TGGTGACCGA AAGCAGCAGC
ACGGCGACGG ACATGCCGGA CTGGGCGGGG GGAACGGAGG CGAAGATTCC AGTGAGTTGG
TTTAATTTCA AGGACGTGAA AACGACGGAA AATGAAGTTT GTTCTGGAGG TGGAGGTAAT
GATAATCACA GTGATCATTG CCGTACTTCT ATTAGTAAAC AACTGTCGCA GGCAGTCAGC
ACGCAGATCG TTACTGTCGA GCAAAAGAAC ATTGTCGTCA GTGCAACCGG GGGCGTTGCA
CAGTTAAATT CGGCAGGCCA CGTCAATATC GCCGCTGATG TGCTGAATAA CAACGCTTCT
GCCATCACCG GCCGGGGTGA TGTGTTCTTG ACTGGTGGGC AGCTTAACAA TCAGTCGGTT
CAGGCCGGTT CATTGGTTAA GCAACTGAAA TATATCGCAC AGTATTACGT AAAGAGGGAT
GATTATTTCC TGTTCAAGTT AATCGGCGAA CCCATCACCG AATACACCCC CGGCCAGACC
TACGCCGCGA CCCTCCAGGC GGGCGGTGCG ATAACCGCCA GCTTCTCGCA GAATATCAGC
AACACCAGCC TGCAACCGGG TAGCGGCGGC TTTATGCCGG CGCTGGCCAC GCCGACACTG
GCGGGCGTGA GCGCGTTAAC CCCGGTGGAG ACGCAGGCCG GGCGGGGGCT GAGCGGCGGC
ACGGCCACGG CCGTCAGCGG CAGTGCGTTG TCGGGCACGG GGAATGGCGT GGCGCTGGCC
GGTCAGGCGG AGCGCCTGAG CGCCGCTGCC GGCGCCGTCA CCCGCGATAC CCCCGCCGGC
GGCGGCACAC TGACCCCGGC GGGTATCGAC AGCGGGCTGG GGACGGCCGC GCCGGTCGTT
CCGGGCGCGC TCGCCCCGGG CGACCTGCAG GCGGCGCTGC GTCAGGGGCT GGCCGCGGTG
GCCGGCCCGT CGCTCACCGA CTACCCGCTG CCGACCAGCC AGAACGGGCT GTTCGTGGCC
GACACCGCCG GCGACAGCCG CTACCTGATA CGCAGCAACC CGACGCTGAG CCGGCTCGGC
CAGGTGGACA ACGCGCTGTT CGGCGACCTG CGCGGCCTGC TGGGGCAGAC GCCGGGCACC
ACGGTGCCGG TGGAGCGCAG CGCGACGCTG ACCGACCCGA CGCAGGTGCT GGGCTCGTCC
TACCTGCTGG GCAAACTGAA CCTGGACGCC GAACACGACT ACCGCTTTCT GGGGGACGCG
GCGTTCGACA CCCGCTACAT CAGCAACGCG GTGCTGAGCC AGACCGGGCA GCGCTACCTC
AATGGCGTGG GCTCGGAGCT GGCGCAGATG CAGCAGCTGA TGGACAACGC AGCGGCGGCG
CAAAGCCGGC TGAGCCTGCA ACTGGGGGTG AGCCTCAGCC CGGAACAGGT GGCGGGGCTG
AGCCGCAGCA TCGTGTGGTG GGAGAACATC ACCGTCGACG GGCAGACGGT GCTGGCGCCG
AAGCTGTATC TGGCGCAGGC GGACAGGAGC AACCTGTCAG GCAGCCGCAT CGTGGCCAAC
AGCGTCAGCC TGAGCGCGGG CGGGGACATC GACAACCGCG GCAGCACGGT GACGGCGCTG
AATGTGCTGA ACATCGCCGG CGGCGGCAAC CTGAGCAACA GTGAAGGCGG GCTGCTGAAC
GCCGGCGGCG CGCTGGATCT GGCGGCGCTG GGCAACCTGA CCAATAGCAG CGCGACGATA
CAGGGCAACA CGGTGACGCT GGCCAGCGTC AACGGCGATA TCGTCAACAC CACGACGAGC
AGCCAGTGGC AATTTGAATC CATAAATGGA CGTGAACGAT TAACCCATAC CGACCTCGGC
CAGACCGGGC TGATAACCGC CTGGAACGGG ATGACGCTGC AGGCGGGGCA CGACATCGTG
CTGAACGGAG CGCAGCTGAG CGCGGGCGGG CCGCTGGCGC TGGCGGCGGG CAACGACCTG
CGGCTGAATG CGCTGACCAC GGTGACGGAC ACGGTGCGCG AGGGCGGCGG GGCCACCACC
GAGCGGCGTA ATCAGGGGCT GGTGCAGAGC ACGGTGGCCG GCGGGGGCGA CCTGAGCCTG
AGCGCCGGGC GTGACCTGCG CGGCACGGCG GCGCAGCTGA GCGCGGCGGG GACGCTGGCG
CTGTCGGCGG GGCGTGACCT GAGCCTGCTG TCGGCCGGCG AGGAGCAGTT CAGCTCGAAC
GCGTGGAGCC GGCATCTGGA CTGGCAGCAG ACGGTGACGC AGCAGGGGAC GGTGCTGAAC
GCGGGGGAAG GGCTGAGCCT GCGGGCGGGG CAGGACCTGA CGCTGCAGGG GGCGCAGGCG
GAAACGCGGG GCGCGCTGAC GGCGCAGGCG GGGCGCGACC TGAGCCTGCT GTCGGCGACG
GAAAGCCGGC ATGACTTCTT TGAAGAAACG ACGGTGAAGA AGAAGACCTT CTCGAAGACC
GTGACGCACA CGGTGCGGGA GACGGCGCAG ACCACGGAGA AGGGGACGCT GCTGTCGGCG
GGCAGCGTGG CGCTGACGGC GGGGCAGGAC ATCGGGGTGC GGGGCTCGTC GGTGGCGGCG
GACGGCGGAG TGGCGCTGAC GGCGGGGCGA GACATCACGA CGGCGGCGAG CGTGGAGAGC
TACCGTCAGT ATGAAGACGT CAGCCGCAAG AAGAGCGGGC TGTTCAGCGG GGGCGGGATA
GGGTTTACTA TCGGCAGCAC GTCGCTGCGC CAGACGCTGG AAGCGGCCGG GACGACGCAA
AGCCAGAGCG TCAGTACGCT GGGGAGCACC GGCGGGTCGG TGAGTCTGCG TGCCGGGCAG
GACGTGGCGC TGACGGGCAC CGATGTGATT GCGGCGCGGG ACATTCAGGT GGCGGGCAAT
ACGGTCACCA TCGATCCGGG CTACGATACC CGCCGGCAGT CGCAGAAGAT GGAGCAGAAA
ACCGCCGGAC TGACCGTTGC GCTGTCCGGC GTGGTCGGGT CGGCGCTCAA CAGTGCGGTA
CAGGCGATAC AGGCGGTACG GGAGCAGAGC GACGGCCGCC TGCAGGCACT GCAGGGCATG
AAGGCGGTAT TGTCGGGGTA TCAGGCATAT CAGGGCACGC AGGTAGATAC CAATAACAAG
GGTGCCTCGT CGTTTGTCGG CATCAGCGTG TCGCTGGGGG CGCAGCGCGT CAGCAGCAGC
CAGACCAGCG AGCAGTCGCA GAGCTTCGCC TCGACGCTGA ACGCGGGCCA CGATATCAGC
GTGGTGGCGC GTCAGGGCGA TATCACCGCC GTCGGCAGCC AGTTGAAAGC GGCCAACAAT
GTGGCGCTGA ATGCCTCCCG CGCGATTAAT CTGTTGTCGG CGCGCAACAC CGAGTCGCTG
ACCGGCAGCA ACAGCAGCAG CGGCGGGAAT ATCGGCGTCA GTTTCGGTCT GAGCAACGGC
GGGGCGGGAT TCAGCGTGTT TGCCAACGTC AATGCGGCCA AAGGCCATGA ACTGGGCAGC
GGCAACAGCT GGTCGGAAAC CACGGTGGAT GCCGGGCAGC AGGTGGGGCT GACCAGCGGC
GGCGACACGC GTCTGACCGG GGCGCAGGTC AGCGGCGAGC GCATTGTGGC CAACGTGGGC
GGCGACCTGC TGCTGAAAAG CCAGCAGGAC AGCAACCGCT ATGACTCGAA GCAGACCAGT
GTGTCGGCGG GCGGTAGCTT TACCTTCGGC AGCATGACCG GCAGCGGCTA CCTGAGCGCC
AGCCAGGACA AAATGCACAG CAGCTTTGAC AGCGTGCAGC AGCAGACCGG GCTGTTTGCC
GGCACCGGCG GTTACGACAT CCGCGTCGGC AACCACACGC AACTGGACGG CGCGGTGATT
GGGTCGACCG CCGGCGCGGA TAAAAACCGG CTGGAGACCG GCACGCTGGG CTTCGGCAAT
ATCGACAACC GGGCCGAGTT TTCGGTGTCG CACAGCGGTG TCGGGCTGAG TGCCAGCCCG
TCGCTGAGTA TGTCGGATAT GCTGAAATCG GCGGCTCTGA CCGCGCCGTC GGCGCTGATG
TCGATGGGCC GCGGCGGCAA TGCCGGCAGC ACCACCTACG CGGCGGTGAG CGACGGGGCG
CTGATTATCC GTAATCAGGC CGGACAGCAG CAGGATATTG CCGCGCTGAG CCGGGATGTG
GCGCACGCCA ATAATGCGCT GAGCCCGATT TTCGACAAGG AAAAAGAGCA GAAGCGCCTG
CAGACGGCGC AGAGGGTGGG TGAGCTGGGC GCGCAGGTGA TGGATGTCAT CCGCACCGAA
GGGGAAATCC GCGCGGTGCG GGCAGCAGAA GCCGGCGGCA AGGTGGATCG CCCGGCGGAT
AACGCGACCG AAAAAGAGTG GGAGAAATAC AAGAAAGACC TGACGCAAAC GGCGGATTAC
AAGGCGGTAA TGCAGTCTTA CGGCACCGGC AGCGACCTGC AACGGGCGGC GCAGGCGGCG
ACGGCGGCGA TTCAGGCGCT GGCGGGCGGC GGCAACCTGC AGCAGGCGCT GGCCGGGGCG
TCGGCGCCGT ATCTGGCGCA GCTGGTGAAA GACGTGACGA TGCCGGCGGA CGAGAAAAAG
GCGACGGCAT CGGACATCGC GGCCAACGCG ATGGGCCATG CACTGATGGG TGCGGTGGTG
GCGCAACTGT CCGGCAAGGA TGCGGTGGCG GGCGCGGTGG GTGCGGCCGG CGGCGAACTG
ACCGCCCGGC TGCTGATCAT GCAGAAGCTG TACCCGGGGC GAGACCCCAG CGACCTGACG
GAAGGGGAAA AGCAGTCGGT CAGCGCGCTG GCCTCGCTGG CGGCGGGTCT GGCGTCGGGG
ATTGCGTCGG GGAATACCAC CGGAGCCGCC ACCGGCGCAC AGGCCGGGCG CAATGCGGTG
GAGAATAACT ATCTGAGTGT CTCTGAGAAA TCGGAGCTTG AGATAGCGAA GCAGAAACTC
CGCGACAGCA AAGACCCGGC AGAACGTGAG CAGGCGGAGA AAGACGTTGC CCGGTTAACG
GAGCTGGATA TCTCAAGGGA TAAGAAAGTT ATTGCCGCCT GCGGTAATGG AAATGCTGCC
AGTGCGGGTT GTGCAGCCGC GCGGCTGGAG GCTTATCAAG CCAAAGTGGA GTATGAAAAC
ACCGGCACCT ATAACTCCAG AGCGAGCCAG CAGTACGGCG ATGCTTACGG CCAGATAGTG
AACCTGCTGA ATATCACCAG TGTGGATGCA CAGAATCAGC AGCAGGTGAA AGATGCGATG
GTTAACTACG CTATGAAACA ACTTGGCGTG GATAGAATAA CAGCAGAAGA ATATGTTTCC
ACTTATGATG GAATGAAAAT AATTGCTGCA TCTGTATCTC CTATAATAAT TGGGGAGGCA
GCTAAAACCA GATTAACTGA TTTGGTTGGT AAGGTTAATT CATCTGAAGT TGGCCAGACA
TCAAGGATTG TTAAAACATA TGGCCCGCAT GAGGAGGGGC CATTAGGAAA CCCTAATGAT
TTAAACTCCG CTGCATCAAC CTTTAGAAGT GGTACATATG CTGAAAAAGT TGCAGAAGAA
GATATGTATC TTTATCGTGA CTATGGTGGT AAGGCAAGGG TAAATGGTCG TTATTGGACA
TTGGAGCCTT CTAAAGGACC TGTACAGTCT CAAATTGATA GCGCTGTATT GCCTGAGTGG
GGTAATTCAT TTGAAAATCA AGCGATTATG AAGATACCTA AAGGTACTAA ATTCTATGAG
GGACCTGCTG CACCACAAAC AGGAACAAAA GGCACACGAC CTGAATTAAT TGGTGGTGGT
ACGCAGGTAT ATTTACCTGG TTTGAAAGAT GAATGGATTA TTAAAAAATG A
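
The gene sequence above is laid out in space-separated blocks of 10 bases. A minimal sketch of how such a block could be joined and its GC fraction computed is shown here. The helper names are hypothetical, and only the first line of the sequence is used, so the result need not match the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 bases).
first_line = "ATGAAGGCGG TGAAGACGTC GCAGCGGGTG AAGGTGTGGG CGCTGGTGTG GCTGACGGTG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would give the gene-wide value for comparison with the GC content field.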
 
Protein sequence
MKAVKTSQRV KVWALVWLTV LQPMLPAWAA GVTVASGNTA LEAAGNGVPV VNIATPDASG 
LSHNRYHDFN VDPRGLILNN GTARLTPSQL GGLLQNNPNL NGRSAAAILN EVVSPNRSQL
AGYLEVAGPA ANVVVANPYG ITCSGCGFLN TPRITLTTGT PQVDAAGGLS GLDVRGGDIL
IDGAGLDASR SDYFALIART ASLQAGLTAR EAQVVLGANR VGSDGRVTAQ AGEGTAPVLA
LDTGALGGMY ANRISLVSTE RGVGVNTAGL SAREGDIRLS ANGRLQVGDA VAQGALTAQG
ETLALQGSQQ AQGAVTLSGA QGVMLTGSRT RAGQGLTLAS DGRLTAADAG LSAGVREDGT
VQPGYGLRLT GRELALGQSQ LAGDRVSLTA SGAVSQSAGG AWQAGSGLTV NGGALSLDGD
AGAQTLTVSG GSLSGAGRWQ ATGDLTLDGL TTAQWDGALL AGGALSVGAA SLVNRGTLAG
GTVTLTTPAL SNTGTVSGRQ VAVRTAQLTN GGTLSADDAL TVQAQTQLDN GGSLLSGGAL
TIQAGQTDNR GVLAGGAVTL SGDGLWNGGV VQGRQSLGVS ALSGFSQTAD GLLTSGGGMT
LSTGDMDTAG RLTAQGLSLS AGRWRNTGTV SLTGDGQLTV DALDNDGTLL SAGAWAIRGA
AVTNRGTLQG DGLTLQGERL DNRGSLTGVR SLTVTLPGTL TNTGLLTGQR LDLTAGSLAN
GGTLLGVDAL TLTADGALTN TATGRLLTQG AAVLTAASVA NDGEWQAGSL QLTADRLGNG
GRIQSGGALT VALSPAGTLT NTGTLAAQGG LTLSGNYAGA GELYSAGGLT LEGTTIVNDG
GHWQGDTLDI RGGSLTNSGT VTGLTALTVT TAGTLTNTGR LEGRRLDLTA DQLDNGGTLL
GVDALTLAIA GTARNQAGGR WLSQGDGRLT AAELDNQGDW QGDRIAVTAG RVRNAGQVLG
IAALTLTADD ALTNTATGRL LTQGAAVLTA ATAENDGEWQ AGRLQLTADR LRNGGRILSE
GDLQITLPVA DDGLLRAVRQ LAQDVQVAGQ GTLSNSGTLA ADGDSRITGR QVDNQGLLAT
GGALTLTAGD LTNGGRVESR TLQLTGDRLD NGGTLLAEQG GVLRLTDGLT VGADGRLLSN
GDWQIQAGTV TSLGQWQGKN LLLSADSLTN DGALRATNDV TLTLTQDYTG GAGSQVRGNG
AVTLTADRVT QQGDIGGERL QVTAGTLTNG GRLVGLSQLD VAGRGALTNT ADGALLGNGV
TGITAATLDN AGVLQGDALT LQAGTVDNRG RIQGTSGLTL EGVSRYTGTD GSRLLSGGTA
MLAIDSADNA GLWQAGDLRF SGTSLTSRGQ ITGLNSLNID ADSLTSTGQL TTRRVATLRG
RQFDNGGTLT ALGDLTADFG DGIVNQAGGQ LLSGGAGRLT TGTLDNRGLW QASRLALTAD
TLFNQGTLLG LDDGDIRLTG AYVGGVDSRV GGNGAFGLSA ATIDQAGQWQ ARDVTLRGGS
LRNQGTITAG GQLNAVLDEA LENSAGAVLS GGTVSLGAAT VSNGGQIQGR NSLTVEGGNL
LDNLSGGQLL SGGALTLNTP HLTNAGWVQG TDLTLGTAQL DNSGTLQAQN GLTLRLPQWT
NRGTVQAGQL DITTDGQLEN RGTLLGLTRL ALQAARLTNA DGARLYSAGN LQLRTGQLVQ
DGQLAALGDL RADIGNPFTV TRALAAGGQL TLNVTGDLVQ AGTLQGNGVT VTSSGTLTQQ
GRIVAGGGNS TLSAATINQT ESGSIQGGGP LSLLTTGDIT NRGFVGTAGD LLLQAGGVID
NGSLLYGGGN LWLLSDALVN RFGNILAGNS LWIQRDAAGN ASGSVLNSSG TIETQRGDIT
VRTGTLTNQR EGLVVTESSS TATDMPDWAG GTEAKIPVSW FNFKDVKTTE NEVCSGGGGN
DNHSDHCRTS ISKQLSQAVS TQIVTVEQKN IVVSATGGVA QLNSAGHVNI AADVLNNNAS
AITGRGDVFL TGGQLNNQSV QAGSLVKQLK YIAQYYVKRD DYFLFKLIGE PITEYTPGQT
YAATLQAGGA ITASFSQNIS NTSLQPGSGG FMPALATPTL AGVSALTPVE TQAGRGLSGG
TATAVSGSAL SGTGNGVALA GQAERLSAAA GAVTRDTPAG GGTLTPAGID SGLGTAAPVV
PGALAPGDLQ AALRQGLAAV AGPSLTDYPL PTSQNGLFVA DTAGDSRYLI RSNPTLSRLG
QVDNALFGDL RGLLGQTPGT TVPVERSATL TDPTQVLGSS YLLGKLNLDA EHDYRFLGDA
AFDTRYISNA VLSQTGQRYL NGVGSELAQM QQLMDNAAAA QSRLSLQLGV SLSPEQVAGL
SRSIVWWENI TVDGQTVLAP KLYLAQADRS NLSGSRIVAN SVSLSAGGDI DNRGSTVTAL
NVLNIAGGGN LSNSEGGLLN AGGALDLAAL GNLTNSSATI QGNTVTLASV NGDIVNTTTS
SQWQFESING RERLTHTDLG QTGLITAWNG MTLQAGHDIV LNGAQLSAGG PLALAAGNDL
RLNALTTVTD TVREGGGATT ERRNQGLVQS TVAGGGDLSL SAGRDLRGTA AQLSAAGTLA
LSAGRDLSLL SAGEEQFSSN AWSRHLDWQQ TVTQQGTVLN AGEGLSLRAG QDLTLQGAQA
ETRGALTAQA GRDLSLLSAT ESRHDFFEET TVKKKTFSKT VTHTVRETAQ TTEKGTLLSA
GSVALTAGQD IGVRGSSVAA DGGVALTAGR DITTAASVES YRQYEDVSRK KSGLFSGGGI
GFTIGSTSLR QTLEAAGTTQ SQSVSTLGST GGSVSLRAGQ DVALTGTDVI AARDIQVAGN
TVTIDPGYDT RRQSQKMEQK TAGLTVALSG VVGSALNSAV QAIQAVREQS DGRLQALQGM
KAVLSGYQAY QGTQVDTNNK GASSFVGISV SLGAQRVSSS QTSEQSQSFA STLNAGHDIS
VVARQGDITA VGSQLKAANN VALNASRAIN LLSARNTESL TGSNSSSGGN IGVSFGLSNG
GAGFSVFANV NAAKGHELGS GNSWSETTVD AGQQVGLTSG GDTRLTGAQV SGERIVANVG
GDLLLKSQQD SNRYDSKQTS VSAGGSFTFG SMTGSGYLSA SQDKMHSSFD SVQQQTGLFA
GTGGYDIRVG NHTQLDGAVI GSTAGADKNR LETGTLGFGN IDNRAEFSVS HSGVGLSASP
SLSMSDMLKS AALTAPSALM SMGRGGNAGS TTYAAVSDGA LIIRNQAGQQ QDIAALSRDV
AHANNALSPI FDKEKEQKRL QTAQRVGELG AQVMDVIRTE GEIRAVRAAE AGGKVDRPAD
NATEKEWEKY KKDLTQTADY KAVMQSYGTG SDLQRAAQAA TAAIQALAGG GNLQQALAGA
SAPYLAQLVK DVTMPADEKK ATASDIAANA MGHALMGAVV AQLSGKDAVA GAVGAAGGEL
TARLLIMQKL YPGRDPSDLT EGEKQSVSAL ASLAAGLASG IASGNTTGAA TGAQAGRNAV
ENNYLSVSEK SELEIAKQKL RDSKDPAERE QAEKDVARLT ELDISRDKKV IAACGNGNAA
SAGCAAARLE AYQAKVEYEN TGTYNSRASQ QYGDAYGQIV NLLNITSVDA QNQQQVKDAM
VNYAMKQLGV DRITAEEYVS TYDGMKIIAA SVSPIIIGEA AKTRLTDLVG KVNSSEVGQT
SRIVKTYGPH EEGPLGNPND LNSAASTFRS GTYAEKVAEE DMYLYRDYGG KARVNGRYWT
LEPSKGPVQS QIDSAVLPEW GNSFENQAIM KIPKGTKFYE GPAAPQTGTK GTRPELIGGG
TQVYLPGLKD EWIIKK
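
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below translates short excerpts to illustrate this. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons); the function and variable names are illustrative only.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard genetic code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 21 bases of the gene sequence above.
head = "ATGAAGGCGG TGAAGACGTC GCAGCGGGTG AAGGTGTGGG CGCTGGTGTG GCTGACGGTG"
tail = "GAATGGATTA TTAAAAAATG A"
print(translate(head))  # MKAVKTSQRVKVWALVWLTV
print(translate(tail))  # EWIIKK
```

The two outputs match the first 20 and last 6 residues of the protein sequence shown above.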