Gene Dbac_1877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_1877 
Symbol 
ID8377549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp2141283 
End bp2155685 
Gene Length14403 bp 
Protein Length4800 aa 
Translation table11 
GC content64% 
IMG OID645001105 
ProductHemolysin-type calcium-binding region 
Protein accessionYP_003158384 
Protein GI256829656 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAGA CCCAGATATT TGTTCAGCAG GCGGACGGCA CTGCGCGTGA AGTGGAAATC 
GCCAGCGGGG GCGTGGTGCT GTTGAATGCG AATGAGCACT TGCTCATCGC GGCCTCATCT
GACCAGGCGC AGGTCGAGGC CGGGGCTCCG GGTGAAGTCA TCATTAAATT GGAGGGTGTC
GGCGAGTTCA CCGTGGAGTC TGCCGGAGAA ATTCCTGAAC AAATAGCCCT GGCTCCCGAA
ATAATGGGCC TCAAGCCCTT GCCGACCATC GTTTTCGAGC CGACCCGCAC CGACCTGTCC
GATGACGCGC CCACGCATGA GCCGCTAGGC GTGCACCAGG CGCGGGTCGA CAGCACCGCA
TTTCTGGACC GCCAGACAGG CGATGACTTC GGCATGGGCC AGCTCCTGGA GATGGCCGAA
TTCAGCGGCA TGGCCGGAGA AGGATTGGCC GGGCGCAAGG AGACGGGCGA CAAGGACGAG
GATCGCATGC TGGACGACTC GCTCGACGAC GGTCTGTCCG GTGATCCTGG TCAGGATCAC
CTAAACCTCC CCCCGGTCAT CGACCTCAAC GCAACCCTCC TGGTGGACGA CGACGAGGCG
GGGAACCTGC TGGACCACCT GCGGGCCACG GACCGCGAGA GCGGCCCGGC GGACTTGAAA
TACACGATCA GCCAGGGGCC GGCCTACGGC GTGCTGATGA TCGACGGCAA GGAGGTCGAC
GCTACCAGGG TGACCTTCAC CCAGGCCGAT CTCGACTCGG GCCGCGTCAC CTTCCGCTTC
GACCCCCACG CCCAGAACAA GGTCCTGGTC ATTGATGATG ACACCTTCGT ATTCACGGTC
TCCGACGGCG TGAACACCAC CGGCCCGGCC ACCTTCCACA TCCAGAACAC CACGGTCCAG
GTCTGGGGCA CGGACAACAA CGACGGCACC GTCGGCGACG ATCTGACGAA GGCGAATGTC
GACTTCAACC GCGACGGGGT GAAGTTCCAC GTCTACGGCT TCAAGGGCAA TGATACCCTG
CGCGGCGGGT CGGGTGCGGA CACCCTGGAC GGCGGCGCCG GCCAGGATTG TGCCGACTAC
AGCGACAGCG ACGGCTGGGT GAAAGTGGAC CTGACCAAGG GGGCCATGGC CCAGTCCGGT
GGAGGCGAAC GCAATCACGC TGCTGGCGAC GTTCTGAAAG GCATTGAGAA CCTCACAGGT
TCAAACTTCA ACGATGTGCT CATCGGCGAC GGCCAAGCCA ATATCCTCAA CGGTCTGGTC
GGCGACGACA CCCTGTACGG CGGCGCGGGC AACGACTCCA TGATGGGCGG CGACGGCAAC
GACTCCATGA TGGGGGACGG CGGCAACGAC ACGCTCATAA GTGAGGCTGG ACATGACATC
CTGTACGGTG GTGGCGGCGA CGATTCCTTG GATGGCGGCG ATGGCAATGA CACCCTGCAC
GGCGGCCAGG GCAACAACAC CCTGTACGGC GGCGCTGGCA ATGATTCTCT GTCCGTAAAC
TTATCTTATA TAAATTATAG GAGCTTTTTA GACGGCGGTG ACGGCGACGA TACGTTGATA
GGCGGCTTCG GCCGAGACAC CCTATACGGT GGCGACGGGA ATGACTCCAT GATTGCCGGC
GGCATGTATG AAGGAGGACA TCATGAAGTC ATGGACGGCG GCGCAGGCAA TGATACCCTG
CTCTCCGACG GCGGCAACGA CACCCTCTAT GGTCGCGACG GCAATGACTT TCTGGACGGT
GGGACTAATT CCGATCACTT AATAGGCGGT GACGGCAACG ACACGCTCTA TGGTGGCGAT
GCTTCAAATA TAGAATGGAT TCTATTCAGC CCCGGGTATT ACACCGATTG GGATCTTCTG
GACGGCGGTG CCGGTGATGA TGTCCTGGAT GGCGGCCGCG GTGCTGACAC CCTGTACGGC
GGCGACGGCA ACGACTCTCT GTACGGCGGT GATTCCACCC CGTACCAGCC CTATAAAACT
TGGCTTCGCG CCGATAACGA CTTGTTGGAC GGCGGAGCGG GCAACGATAC ACTGGACGGT
GGCGCCGGCA AGGACACCCT GATGGGCGGT GAGGATAACG ACTCCATGAT GGGCGGCGAC
GGCAACGACT CCATGATGGG GGACGGCGGC AATGACACGC TCATAAGTGA GGCTGGACAT
GACATCCTGT ACGGTGGTGA CGGCGACGAT TCCTTGGATG GCGGCGATGG CAATGACACC
CTGCACGGCG GCCAGGGCAA CAACACCCTG TACGGCGGCG CTGGCAATGA TTCTCTGTCC
GTAAACTTAT CGTATATAAA TTATAGGAGC TTTTTAGACG GCGGTGACGG CGACGATACG
TTGATAGGCG GCTTCGGCCG AGACACCCTA TACGGTGGCG ACGGGAATGA CTCCATGATT
GCCGGCGGCA TATATGATGG AGGAAATAAT GGAGTCATGG ACGGCGGCGC AGGCAATGAT
ACCCTGCTCT CCGACGGCGG CAACGACACC CTCTATGGTC GCGACGGCAA TGACTTTCTG
GACGGTGGGA CTAATTCCGA TCACTTAATA GGCGGTGACG GCAACGACAC GCTCTATGGT
GGCGATGCTT CAAATATAGA ATGGATCCCC GGGTATTACA CCGATTGGGA TCTTCTGGAC
GGCGGTGCCG GTGATGATGT CCTGGATGGC GGCCGCGGTG CTGACACCCT GTACGGCGGC
GACGGCAACG ACTCTCTGTA CGGTGGTGTT TCCACCCCGT ACCAGCCCTA TAAAACTTGG
CTTCGCGCCG ATAACGACTT GTTGGACGGC GGTGCGGGCA ACGATACCTT GGACGGCGGC
GTCGGCAATG ACACCCTGAT GGGCGGTGAG GATAACGACT CCATGATGGG CGGCGACGGC
AACGATGTTC TGATGGGTGA CGCGGGCGAC GATACCCTGG TTGGCGGGGT TGGAAAAGAC
ACCCTGAACG GCGGCGAAGA CGTCAACACC GTGGACTACA GCGCCAGCAC GAAGGCCGTG
CGCATCGACC TGAACCAGCA AGACGGAAAA GCCGCCCAGA GCGGCGGAGA GGCTGAAAAC
CACGCCGATG GCGACATCCT CCAGAACATC CGGAACGTCA TCGGCACGGC CGGAGCCGAC
ACGCTCATCG GCGACAGCCA GGCCAACGTG CTTGACGGCC GGGACGGCAA CGATATCATC
ACCGCCGGCG CTGGCGACAC TGTCGTCGGC GGGGCTGGCA ACGATGTGCT GCACAGCAGC
GACACAGCCC TGGATATCGC CCACTCCACG CAGATTACAG GCGTTGAACG CCTCGACCTC
ACGGGCAGCG CCAGCAGCCT GACCGTGAAT GGCGACGCCA TCCTGAACAA CGGCGTGGCC
GACCCGGCCG GCAGCGGCCT GATGGCTCTG GTGGTGACCG GGGACCAGGG CGACGCGGTG
ACCAGGGTCT CCGGCGACGG CTGGACCTGG ACCAAAGTGG GACAGGACGT GGCCCTGGGC
GCTGACGGCA ACACCTATGT CCTGTACGAG GCCGTCAAGG ACGGCGAGAC CGTGCGGCTT
TATGTGCAGA CCGGTCTGGG CGAGGCCGAG ATTACGGACG GCGTGGTCAA GATAACCGGA
ACCGAGGGCC CCGACGACCT GACCGTGGGT TGGAACTTCG ACGATCCCCA GTATACATTT
GATGCGCGCG GCCTGGGCGG CGACGACACC CTGCGCGGCG GCGTGCATGC CGACACCCTG
GACGGCGGCC TGGGCACGGA TGCAGTGGAC TACCACGCCA GCGCCACCTG GGTGAAAGTG
GACCTGAACC TGGCCACGGC CCAGGTCGGC GGCGGCGATG GCAATCATGC GCTGGGCGAC
GTGCTCACCG GCATTGAGAA TCTCACCGGC AGCAACGACA CAACCCATGG CGACGTGCTC
TCGGGCAATG CCCTGAACAA CGAACTCCAC GGCCTGGCCG GCAACGACAC CCTGCGTGGT
GGTGCCGGCA ACGACACGCT TATGGGCGGT GCCGGTGCCG ACCTGCTGGA CGGCGGTGCC
GGCTCGAATA CGGCCGACTA TAGCTCCAGC ACGACAGGAG TTTACGTAAA TCTGGACCAA
AATGGCGCTC AGACAACAGG TGGCGGCACG AGTAGCGACG CATGGGGAGA TACGCTCACG
GGCATAACGA CCCTCATTGG CGGTATCGGC AACGACACCC TCATCGGCTT CGATTTCGGT
GGACCGGCCG CAATGCTTTT GGGCGGGGAT GGCGACGATG TCCTGCACGC CAATGCCGTA
ACGTATTATT ATATTGGCGC TAGCAATACG TTGGATGGCG GTGCTGGCAA CGACACACTG
GGCGGTGCTG GTGGGCGTGA CTCCCTGATC GGCGGTACGG GCAACGATTG GCTGCATGGC
TGGACTGGCG CAGACACCCT GAACGGCGGC GACGGTATCG ACACCGTCGA TTACGCCGAC
GCCTATACCT GGGTAAAAAT TGATCTCAAC CTCACTGGAG CCCAAGGACT TGGAGGTAGG
GACGGCGAAG GGCATAACGA TGCCTTAGAC GACGTGCTCG TGAGCATTGA GAATGTGCGG
GGAAGCCGTT ACGGAGATAG TCTTGTTGGC AACGCCGAGG ATAACAAGCT TGAAGGCTTG
GCTGGAAATG ACACGTTGGA CGGCCGGTCC GGCAACGACA CCCTGGACGG CGCCACGGGC
CACGACACCC TGGACGGCGG GGTGGGCAAC GACTCGATCA TCGGCGGCGA GGGCAACGAC
CTCATCATCG CTGGCGCAGG CGACAATGTT GACGGCGGCG CAGGCCTGGA TGTGCTGAAG
AGCACAGACG CCAGCATCGA TCTTGTCAAC GGCATCCACA TGCAGAACGT GGAACGGGTC
GACCTCACGG GGGCTTCCAC CAGCCTGACC GTGAATGGCG ACGCCATCCT GAACAACGGC
GTGGCCGATC CGGCTGGCAG CGGCCTGATG GCTCTGGTGG TGACCGGGGA CCAGGGCGAC
GCGGTGACCA GGGTCTCCGG CGACGGCTGG ACCTGGACCA AAGTGGGACA GGATCTGGCC
CTGGGCGCTG ACGGCAACAC CTATGTCCTG TACGAGGCCG TCAAGGACGG CGAGACCGTG
CGGCTTTATG TGCAGACCGG TCTGGGCGAG GCCGAGATTA CGGACGGCGT GGTCAAGATA
ACCGGAACCG AGGGCCCCGA CGACCTGACC GTGGGTTGGA ACTTCGACGA TCCCCAGTAT
ACATTTGATG CGCGCGGCCT GGGCGGCGAC GACACCCTGC GCGGCGGCGT GCATGCCGAC
ACCCTGGACG GCGGCCTGGG CACGGATGCA GTGGACTACC ACGCCAGCGC CACCTGGGTG
AACGTGGACC TGAACCTGGC CACGGCCCAG ATTGACGGCG GCACGAACAA CCATGCCGCT
GGCGATGTGC TCACCGGCAT TGAGATTCTC ACCGGCACCA ACGACACAAC CCATGGCGAC
GTGCTCTCGG GCAACGGCCT GAACAACGAG CTTCATGGCC TGCTCGGCGA CGACAGCCTG
TACGGCTTTG CTGGCAACGA CTTGCTGCAT GGCGGGGCCG GGGCGGACCT GCTGGACGGC
GGCACCGGCA CAGACATGGC CGACTACCAC GACAGCGCCA CCTGGGTGAA CGTGGACCTG
ACCCTGGCCA CGGCCCAGGT TGGCGGCGGC GAGGGCAACC ACGCCCTGGG CGACACCCTG
GTGAGCATTG AGAACGTCAC CGGTTCGCAG TATGACGATA GCATCACGGG CAACGGATGG
CACAATGTTC TGGATGGAGG CGACGGCGCT GACACCCTGT ACGGCGGCGG TGGGTACGAT
ACCCTGAACG GTGGCGCGGG CAACGACTCC CTGTACGGCG GCGATGGGGC TTTCTACGAT
AACATGAACG GCGGCGCAGG CGACGACACC CTGGATGGCG GCGCTGGCCA CGACTTCCTG
CTGGGCGGCG ACGGCAACGA CTCCCTGTAT GGCGGCGAAA GGAACTTTGC CGACACCCTG
GACGGCGGCG ACGGCGACGA CACCCTGGAT GGCGGCGAAG ACCGCGACTC CCTGCTGGGC
GGCGACGGCA ATGACTCCCT GGATGGCGGC TATGATCGCG ACACCATGGA TGGCGGTGAC
GGCAACGACT CCCTGTACGG CGGCGATGGG GCTTTCTACG ATAACATGAA CGGCGGCGCA
GGCGACGACA CCCTGGATGG CGGCGCTGGC GAGGACTTCC TGCTGGGCGG CGACGGCAAC
GACTCCCTGT ATGGCGGCGA AAGGAACTTT GCCGACACCC TGGACGGCGG CGACGGCGAC
GACACCCTGG ATGGCGGCGA ATACCGCGAC TCCCTGCTGG GCGGCGGCGG CAATGACTCC
CTGGATGGCG GCTATGAACG CGACACCATG GATGGCGGTG ACGGCAACGA CTCCCTGTAC
GGCGGCGATG GGGCTTTCTA CGATAACATG AACGGCGGCG CAGGCGACGA CACCCTGGAT
GGCGGCGCTG GCCAGGACTT CCTGCTGGGC GGCGACGGCA ACGACTCCCT GTATGGCGGC
GAAATGAACT TTGCCGACAC CCTGGACGGC GGCGACGGCG ACGACACCCT GGATGGCGGC
GAATACCGCG ACTCCCTGCT GGGCGGCGAC GGCAACGACT CCCTGTACGG CGGCAATGAA
GCTTGGAACG ACACCCTGGA TGGCGGCGCG GGCAACGACA CCCTGGATGG CGGCGAAGGC
AAAGACTCCC TGGACGGCGG CGCAGGAAAT GACGTGCTCA TCGGCGGAGC CGGGGCGGAC
TTCATTGACG GCGGCAGCGG CGTCAACACC GTGGACTACA GCGCCAGCAC GAAGGCCGTG
CGCATCGACC TGAACCAGCA AGACGGAAAA GCCGCCCAGA GCGGCGGAGA GGCTGAAAAC
CACGCCGATG GCGACATCCT CCAGAACATC CGGAACGTCA TCGGCACGGC CGGAGCCGAC
ACGCTCATCG GCGACAGCCA GGCCAACGTG CTTGACGGCC GGGACGGCAA CGATATCATC
ACCGCCGGCG CTGGCGACAC TGTCGTCGGC GGGGCTGGCA ACGATGTGCT GCACAGCAGC
GACACAGCCC TGGATATCGC CCACTCCACG CAGATTACAG GCGTTGAACG CCTCGACCTC
ACGGGCAGCG CCAGCAGCCT GACCGTGAAT GGCGACGCCA TCCTGAACAA CGGCGTGGCC
GACCCGGCCG GCAGCGGCCT GATGGCTCTG GTGGTGACCG GGGACCAGGG CGACGCGGTG
ACCAGGGTCT CCGGCGACGG CTGGACCTGG ACCAAAGTGG GACAGGACGT GGCCCTGGGC
GCTGACGGCA ACACCTATGT CCTGTACGAG GCCGTCAAGG ACGGCGAGAC CGTGCGGCTT
TATGTGCAGA CCGGTCTGGG CGAGGCCGAA ATCACGGACG GCGTGGTCAA GATAACCGGA
ACCGAGGGCC CCGACGACCT GACCGTGGGT TGGAACTTCG ACGATCCCCA GTATACATTT
GATGCGCGCG GCCTGGGCGG CGACGACACC CTGCGCGGCG GCGTCCATGC CGACACCCTG
GACGGCGGAG CGGGCGTCGA CACGGTGGAC TACCACGCCA GCGCCACCTG GGTGAACGTG
GACCTGAACC TGGCCACGGC CCAGGTCGGC GGCGGCGATG GCAATCATGC GCTGGGCGAC
GTGCTCACCG GCATTGAGAA CCTCACCGGG ACCAATGACA CCCTGCATGG CGATGTGCTC
ACGGGAAATA CCGGAAATAA CCTGCTCTCC GGCCTGGACG GTAACGACAC CATCAACGCT
GGACTTGGCA ACGATACGTT GGTCGGCGGG GCCGGGGCCG ACCTTCTGGA CGGCGGCGCA
GGCACAGACA TGGCCGACTA CCACGACAGC ACCACCTGGG TGAATGTGGA CTTGAGCCTG
GCCACGGCCC AGACTGGCGG TGGCGATGGC AATCACGCGC AGGACGACGT GCTCACCGGC
ATCGAGAACC TTATCGGCAC CAACGACCTC GCCCATGGCG ACGTGCTCAC CGGCAATGGC
GTGAGCAACT TCATGAACGG CGGCGCAGGC AACGACACCA TCTACGGCGG CAATGCCCGG
GACACCATCT ACGGCGGCGA CGGCGACGAC TGGATCGACG GCGGCGTCCA CGATGACTCC
ATCTATGACG ATGACTCCAT CTATGGCGGC GCAGGCAACG ACACCATCTA CGGCGGCAGT
GCCCAGGACA CCATCTACGG CGGCGACGGC GACGACTCTC TGGTGGGCTA CAACCCCGAC
GGCACAAGCG ACCGAAGGCA GGACACCCTC ATCGGCGGCT GGGGCAACGA CACCCTGGAT
GGCAGTCACG AACTCGATAT TATTTTTGAG CACACGGTGC TCGTCGGTGG TTCCGGAGCC
GACCGCATCA TCGGCAACGG CACAAACACC TTCGCCAACT ACCAACTGAC CGGGCACGGC
GACTTCGATG TCAGCTACCA GGGCGTTTAT CTCGACCTGA ATATCCAGGA CGGCGTCACG
GCCCAGACCG GCAAGTCCGG CGGCGACGAC GCCACCGGCG ACATCCTGAC CGGCATCGTC
CACGCCGCGG GCTCGAACGG CAGCGACACG CTTATCGGCA ACCACCAGGC CAACGCGATG
GCAGGAAACG ACGGCAACGA CCTCATGATT GGCGGCGACG GCGACGACAT CCTCTACGGC
CATGGTGGCA ACAACACGCT GGAGGGCGGC CTGGCTGCGG ATGCCCTCTT GGGCGACCTG
GGCGAAGACA TCGCCTCCTA CGAGCATGCG GCCTCGGGCG TGAATGTTAG CCTGGAAATA
CAGGGTCGCC GCCAGGTCGG CACGGGCGAG GAGAACGGGG ATGAACTCTA CCACATGGAC
GGCCTGTACG GCTCCAACCA CAACGACACC CTCACCGGCT GCAATACGCA ATACTCGGAC
AGCGGGAGCG TGCACAACCG CATCCAGGGC CGCGGCGGCG ACGACATCCT GGCTGGCCTG
GCCGGGGCCG ACACCCTGGA CGGCGGCGAC GGCAACGATA CGGCCGACTA CAGCGCCAGC
ACCAGCACCA GCTGGGTTAA TGTGGACCTG ACCCTGGCCA CGGCCCAAAC CGACGGCGGC
GCGGGCAATC ACGCGCTGGG CGATGTGCTC ACCGGCATCG AGAACCTTAT CGGCACCAAC
GACCTCGCCC ATGGCGACGT GCTCACCGGC AACGACCGCG CCAACATCCT CAGCGGCCTG
GACGGCGCCG ACACCCTGAT CGGCGGCAAG GGCAACGACA CCTTGATCGG CGGGGCCGGG
GCCGACTCCC TGGCCGGCGG CTTGGGCATT CTTGACATGG CCGACTACAG CGCCAGCACG
GCCTGGGTGA ACGTGGACCT GCGCATCCAG GACGGCGCCA CCGCCCAGAG CGGCGGCGGC
ACGGACAGTG CCGGCCAGGA CAACCATGCC CTGGGCGACA CGTTGCTGGG TATCGAGGGC
GTGACCGGCT CGAACTACAA TGACGTGCTC TCCGGCAACA AGGACATAAA TCTCATCTAT
GGCCTTGACG GCGACGACAC CATCTACGGC GACCCCGACA ATACAAAACC AAAAGATGAT
GTCGCACATG TCGACACCAT CTACGGCGGG GCCGGCGACG ACTTGATCTA CGGCAGCGCC
TTCGAGGACA TGATCTACGG CGGCGACGGC AACGACACCA TTTCCGGCGG CGGAGACTAC
GACAGCATCT TGGGCGGGGA GGGCAACGAC CGCCTGATAG GCTACTACTC CAGCGACAAG
GACGAGCACC AGGACCTGCT GATCGGCGGG GCCGGCGACG ACACCCTGGA CGGCAGCCGC
AACAGCGACA CTTCCATGGG TATTCTCAAC GGCGGCATGG GGGCGGACCA CATCTACGGC
AACGGCGTCA ATACCATGGT CTCCTACGAG CGGACCGGGG ACGGCGACGG CATGGAGAAG
AACATCCATG GCGTCTACCT CGACCTGCGC ATCCAGGACG GCATCAAGGC CCAGACCGGC
AAGGAGCGTG TGGAGGACGA CGCCACCGAC GCCACCGGCG ACATCCTCAC CGGCATCGTC
AACGCCTTGG GATCGTACGG TCACGACACC CTCATCGGCA ACGACCAGGG CAACGATATG
TACGGCAACG GCGGCAACAA CCTGCTGATC GGCGGGGACG GCAACGACTC CCTGAGGGGC
GACATCAACG ACGACACCTT GGATGGCGGG CTCGGGGCCG ACCTTATCTG GGGCTGGAAG
GGCGACGACA TCGCCTCCTA CGAGAACGCG GCCTCGGGGG TGAACGTCGA CCTGAGGATC
CAGAATGTGA ACGGCTTGAC CCAGCAAGGC AACGGCGAGG AGGACGGCGA CGAGCTCTGG
TACATGGACG GCCTGTACGG CTCCGCCTTC AACGACACCC TGACCGGCCG CGACACCGAC
GACCCCGACT ACATGAGCAT CCACAACCTC CTGCAGGGCC GCGGGGGCGA CGACATCCTG
AGCGGCCTGG CCGGCAACGA CACCCTGGAC GGCGGCACGG GCAACGACAC GGCCGACTAC
AGCACGAGCG GTTCGGGCGT CCACGTCGTC CTGACCATCC AGGACGGGGT CACGGCCCAG
TCCGGCACGG GCGACGCCGC CGGCGACGTG CTCATCGGCA TCGAGAACGT CATCGGCTCC
TCCTACGACG ACACGCTCAT CGGCGACAGC AATGCCAACG TCCTGAGCGG CCTGGGCGGA
GCGGACTCCA TTGATGGCGG CGGCGGCATC GATACGGTGG ACTATAGCGC CAGCACCGCT
GGGGTGCATA TCGACCTGAG CCGCCAGGAT GGCCTCACGG CCCAGAGCGG CGGGGCTGAT
GGCAATCACG CTGATGGCGA CATTCTTCGG AACATCCAGA ACGTCATCGG CTCCTCCTAC
AACGACACGC TCACCGGCGA CATCGGCAAC AACGTGCTGA GCGGCCTGGG CGGAGCGGAC
TCCATTAATG GCGGCGGCGC CATCAATACG GTGGACTATA GCGCCAGCAC CGCTGGGGTG
CATATCGACC TGAGCCGCCA GGATGGCCTC ACGGCCCAGA GCGGCGGGGC TGATGGCAAT
CACGCTGATG GCGACATTCT TCAGAACATC CAGAACGTCA TCGGCTCCTC CTACAACGAC
ACGCTCATCG GCGACGGCAA CGCCAACGTG CTGAGCGGCC TGGGCGGAGC GGACTCCATT
GTCGGCGGCG ACGGCATCGA CACGGTGGAC TACCGCGCGA GCAACGAGGC TGTAACCATG
GACCTGAGGT ACGGCACCTG CAAAGGTGGT GACGCCGAGG GTGACTCCCT TACCGGCATC
GAGAACGTCT TCGGCTCGAA CTTCAACGAC ACCATTACCG GAAGCAACGG ATCGAACTAC
CTGTTCGGTA TGGACGGCAA CGACTCTCTC GCTGCCGCTT ATGGCGACGA CACCCTGGAC
GGCGGCGCGG GCAACGACAC CCTGCGGGGC GCTAGGGACA AAAACCTCCT GCTTGGCGGC
GACGGCAACG ACAGCCTCTA CGGCCAGGGT AACAACGACA CCCTGGATGG CGAGGACGGA
TCGGATCTCC TTGACGGCGG AACCGGGGCC GATTCCCTCA TCGGCGGCGA CGGCATCGAC
ACGGTGGACT ACCGCGCGAG CAACGAGGCT GTAACCATGG ACCTGAGGTA CGGCACCTGC
AAAGGTGGTG ACGCCGAGGG TGACTCCCTT ACCGGCATCG AGAACGTCTT CGGCTCGAAC
TTCAACGACA CCATTACCGG AAGCAACGGA TCGAACTACC TGTTCGGTAT GGACGGCAAC
GACTCTCTCG CTGCCGCTTA TGGCGACGAC ACCCTGGACG GCGGCGCGGG CAACGACACC
CTGTGGGGCG CTAAGGACAA AAACCTCCTG CTTGGCGGCG ACGGCGACGA CAGGCTCTAC
GGCCAGGGCA ACAACGACAC CTTGGCCGGA GGGGCTGGGG CGGACCTCCT GGATGGCGGA
ACGGGCGTGG ACACGGCGGA CTACAGCGCC AACAGCGCTT GGGTGAATAT AGACATGAGT
CTCGCCACTG CGCAGACGGG CGGGGGCGAC GGCAATCACG CCCTGGGCGA CACCCTCGTC
AGTATTGAGC AAATCATTGG CACGAACGAT ACGACGCATG GTGATGTGCT CATTGGTGAC
GGCGGGGACA ACTTGATCAG CGGCGGTGCC GGCGACGACA CCATCCACGG CGGCGCGGGC
TTCGACTCCA TCTTGGGCGG CGACGGCGAC GACTACCTGG AATGCTACCT GCCGAACAGC
GGCATGCAGG AGGCGTACTT CTCGTACCGT AACAAACTGA TCGGCGGCGC TGGCAACGAC
ACCCTGGTCG GCAGCAATTC CGATGGTGAT GACATACTCT GCGGCGGCAT GGGGGCGGAC
CGCATCATCG GCAACGGCAT CAGCACCATA GCCTCCTACG AGCTGACCGG GCACGGCGAC
TTCGACATCA GCTCGCAGGG CGTTTACCTT GACCTGAGGA TCCAGGACGG GCTCACGGCC
CAGACCGGCA AGGAGCATCT GGAGGACGCC GACACCGACG CCACCGGCGA CATCCTCACC
GGCATCGTCA ACGCCGTGGG ATCGTACGGT CACGACACCC TCATCGGCAA CGACCAGGGC
AACGATATGA ATGGCTTGGA CGGCAACGAT CTCATGATCG GCGGCGTGGG CAACGACTCC
CTGTGGGGCG TGAACAACAA CGACACGCTG GAGGGCGGCC TTGGAGAGGA TTTAATCTGG
GGCGGCATGG ACTACGACAT CGCCTCCTAC GCGAATGCCG CCTCGGGGGT GAAGGTCGAC
CTGAGGATCC AGAATGTGAA GGGCTGGGTC CAGCAAGGCG CGGGCGAGGA GGACGGCGAC
GAGTTCTATT ACATGGACGG CCTGTACGGC TCCGCCTACA ACGACACCCT GACCGGCCGC
GACATCGACC AACTTGGTTT CATGAGTGCT CACAACCGGC TGGAGGGTCG CGAGGGCGAT
GACATCCTGG CTGGCCTGGC CGGGGCCGAC ATCCTGGACG GCGGCGACGG CATCGACACC
GCCGACTACA GCCTGAGCGA AGCCAATGTG CGGGTTGACC TAAGCCAGGC CACTGCCCAG
ACCGGCGGCG GGTCGGTCAT GACCGAGACC ATCCTGGACA GGAATGGTGA CGAATGGAAA
ATTATCTACA CGGGCAACCA CGCCGCCGGT GACGTGCTCA TCGGCATCGA GAACGTCATC
GGCTCCTCCT ACGACGACAC GCTCATCGGC GACGGCAATG CCAACGTGCT GAGCGGCCTG
GGCGGAGCGG ACTCCATTGA TGGCGGCGGC GGCATCGACA CGGCCGACTA CAGCGCGAGC
GGCGCGGCTG TAACCATGGA CCTGAGGTAC GGCACCTACA AAGGTGGTGA CGCCGAGGGT
GACTCCCTTA CCGGCATCGA GAACGTCATC GGCTCGAACT TCAACGACAC CATTACCGGA
AGCAACGGAT CGAACTACCT GTACGGTATG GACGGCAACG ACGCTCTCGC TGCCGCTTAT
GGCGCCGACA CCCTGGACGG CGGCGCGGGC AACGACACCC TGCGGGGCGC TAGGGACAAA
AACCTCCTGC TTGGCGGCGA CGGCGACGAC AGGCTCTACG GCCAGGGTAA CAATGACACC
TTGGCCGGAG GGGCTGGGGC GGACCTCCTG GATGGCGGAA CGGGCGTGGA CGCGGCGGAC
TTCAGCGCCA GCAGCGCCTG GGTGAATATA GACATGAATC TCGCCACGGC GCAGACCGGC
GGTGGCGACG GCAATCACGC CCTGGGCGAC ACCCTCGTCA GCATTGAGCA AATCATTGGC
ACGAACGATA CGACGCATGG TGATGTGCTC ATTGGTGACG GCGGGGACAA CTTGATCAGC
GGCGGTGCCG GCGACGACAC CTTGGACGGC GGCCTGGGCA ACGATCTGCT CTTGGGCGGC
GAGGGCGACG ACCTGCTCAT CGGCGGCGTG ACGGACACCG CTCATGGCGG GGACGGTTTC
GACACCTTCC GCCTGGAGGA CAACATCGGC ACCGGAAGCA TCTTCGACCT GAGCGTCATG
AACGACGCGG GCCGGATCAC AGGCATCGAG CGCATCGACA TCTCCGGCGA TGCCGACGAC
GCCAATGCCC TGGCCCTCAA AGCCTCGGAC GTGCTGGACA CGACGGGCGG CGCCGACACC
CTCTGGGTGC GGGGCGACGC GAATGACAGC GTGACAACGA CGGATTCCGG CTGGCAGTTG
CTCGGCGTCG AGACCGGAGC CGATGGACAG GAGTATAATC ACTATTCGGG GTATGCGGGC
TCGACCTTGG TAAATCTGAT GATCGAGTCG GACATGGCAC AGCAGAACGT CGTGCACGCA
TAG
 
Protein sequence
MSKTQIFVQQ ADGTAREVEI ASGGVVLLNA NEHLLIAASS DQAQVEAGAP GEVIIKLEGV 
GEFTVESAGE IPEQIALAPE IMGLKPLPTI VFEPTRTDLS DDAPTHEPLG VHQARVDSTA
FLDRQTGDDF GMGQLLEMAE FSGMAGEGLA GRKETGDKDE DRMLDDSLDD GLSGDPGQDH
LNLPPVIDLN ATLLVDDDEA GNLLDHLRAT DRESGPADLK YTISQGPAYG VLMIDGKEVD
ATRVTFTQAD LDSGRVTFRF DPHAQNKVLV IDDDTFVFTV SDGVNTTGPA TFHIQNTTVQ
VWGTDNNDGT VGDDLTKANV DFNRDGVKFH VYGFKGNDTL RGGSGADTLD GGAGQDCADY
SDSDGWVKVD LTKGAMAQSG GGERNHAAGD VLKGIENLTG SNFNDVLIGD GQANILNGLV
GDDTLYGGAG NDSMMGGDGN DSMMGDGGND TLISEAGHDI LYGGGGDDSL DGGDGNDTLH
GGQGNNTLYG GAGNDSLSVN LSYINYRSFL DGGDGDDTLI GGFGRDTLYG GDGNDSMIAG
GMYEGGHHEV MDGGAGNDTL LSDGGNDTLY GRDGNDFLDG GTNSDHLIGG DGNDTLYGGD
ASNIEWILFS PGYYTDWDLL DGGAGDDVLD GGRGADTLYG GDGNDSLYGG DSTPYQPYKT
WLRADNDLLD GGAGNDTLDG GAGKDTLMGG EDNDSMMGGD GNDSMMGDGG NDTLISEAGH
DILYGGDGDD SLDGGDGNDT LHGGQGNNTL YGGAGNDSLS VNLSYINYRS FLDGGDGDDT
LIGGFGRDTL YGGDGNDSMI AGGIYDGGNN GVMDGGAGND TLLSDGGNDT LYGRDGNDFL
DGGTNSDHLI GGDGNDTLYG GDASNIEWIP GYYTDWDLLD GGAGDDVLDG GRGADTLYGG
DGNDSLYGGV STPYQPYKTW LRADNDLLDG GAGNDTLDGG VGNDTLMGGE DNDSMMGGDG
NDVLMGDAGD DTLVGGVGKD TLNGGEDVNT VDYSASTKAV RIDLNQQDGK AAQSGGEAEN
HADGDILQNI RNVIGTAGAD TLIGDSQANV LDGRDGNDII TAGAGDTVVG GAGNDVLHSS
DTALDIAHST QITGVERLDL TGSASSLTVN GDAILNNGVA DPAGSGLMAL VVTGDQGDAV
TRVSGDGWTW TKVGQDVALG ADGNTYVLYE AVKDGETVRL YVQTGLGEAE ITDGVVKITG
TEGPDDLTVG WNFDDPQYTF DARGLGGDDT LRGGVHADTL DGGLGTDAVD YHASATWVKV
DLNLATAQVG GGDGNHALGD VLTGIENLTG SNDTTHGDVL SGNALNNELH GLAGNDTLRG
GAGNDTLMGG AGADLLDGGA GSNTADYSSS TTGVYVNLDQ NGAQTTGGGT SSDAWGDTLT
GITTLIGGIG NDTLIGFDFG GPAAMLLGGD GDDVLHANAV TYYYIGASNT LDGGAGNDTL
GGAGGRDSLI GGTGNDWLHG WTGADTLNGG DGIDTVDYAD AYTWVKIDLN LTGAQGLGGR
DGEGHNDALD DVLVSIENVR GSRYGDSLVG NAEDNKLEGL AGNDTLDGRS GNDTLDGATG
HDTLDGGVGN DSIIGGEGND LIIAGAGDNV DGGAGLDVLK STDASIDLVN GIHMQNVERV
DLTGASTSLT VNGDAILNNG VADPAGSGLM ALVVTGDQGD AVTRVSGDGW TWTKVGQDLA
LGADGNTYVL YEAVKDGETV RLYVQTGLGE AEITDGVVKI TGTEGPDDLT VGWNFDDPQY
TFDARGLGGD DTLRGGVHAD TLDGGLGTDA VDYHASATWV NVDLNLATAQ IDGGTNNHAA
GDVLTGIEIL TGTNDTTHGD VLSGNGLNNE LHGLLGDDSL YGFAGNDLLH GGAGADLLDG
GTGTDMADYH DSATWVNVDL TLATAQVGGG EGNHALGDTL VSIENVTGSQ YDDSITGNGW
HNVLDGGDGA DTLYGGGGYD TLNGGAGNDS LYGGDGAFYD NMNGGAGDDT LDGGAGHDFL
LGGDGNDSLY GGERNFADTL DGGDGDDTLD GGEDRDSLLG GDGNDSLDGG YDRDTMDGGD
GNDSLYGGDG AFYDNMNGGA GDDTLDGGAG EDFLLGGDGN DSLYGGERNF ADTLDGGDGD
DTLDGGEYRD SLLGGGGNDS LDGGYERDTM DGGDGNDSLY GGDGAFYDNM NGGAGDDTLD
GGAGQDFLLG GDGNDSLYGG EMNFADTLDG GDGDDTLDGG EYRDSLLGGD GNDSLYGGNE
AWNDTLDGGA GNDTLDGGEG KDSLDGGAGN DVLIGGAGAD FIDGGSGVNT VDYSASTKAV
RIDLNQQDGK AAQSGGEAEN HADGDILQNI RNVIGTAGAD TLIGDSQANV LDGRDGNDII
TAGAGDTVVG GAGNDVLHSS DTALDIAHST QITGVERLDL TGSASSLTVN GDAILNNGVA
DPAGSGLMAL VVTGDQGDAV TRVSGDGWTW TKVGQDVALG ADGNTYVLYE AVKDGETVRL
YVQTGLGEAE ITDGVVKITG TEGPDDLTVG WNFDDPQYTF DARGLGGDDT LRGGVHADTL
DGGAGVDTVD YHASATWVNV DLNLATAQVG GGDGNHALGD VLTGIENLTG TNDTLHGDVL
TGNTGNNLLS GLDGNDTINA GLGNDTLVGG AGADLLDGGA GTDMADYHDS TTWVNVDLSL
ATAQTGGGDG NHAQDDVLTG IENLIGTNDL AHGDVLTGNG VSNFMNGGAG NDTIYGGNAR
DTIYGGDGDD WIDGGVHDDS IYDDDSIYGG AGNDTIYGGS AQDTIYGGDG DDSLVGYNPD
GTSDRRQDTL IGGWGNDTLD GSHELDIIFE HTVLVGGSGA DRIIGNGTNT FANYQLTGHG
DFDVSYQGVY LDLNIQDGVT AQTGKSGGDD ATGDILTGIV HAAGSNGSDT LIGNHQANAM
AGNDGNDLMI GGDGDDILYG HGGNNTLEGG LAADALLGDL GEDIASYEHA ASGVNVSLEI
QGRRQVGTGE ENGDELYHMD GLYGSNHNDT LTGCNTQYSD SGSVHNRIQG RGGDDILAGL
AGADTLDGGD GNDTADYSAS TSTSWVNVDL TLATAQTDGG AGNHALGDVL TGIENLIGTN
DLAHGDVLTG NDRANILSGL DGADTLIGGK GNDTLIGGAG ADSLAGGLGI LDMADYSAST
AWVNVDLRIQ DGATAQSGGG TDSAGQDNHA LGDTLLGIEG VTGSNYNDVL SGNKDINLIY
GLDGDDTIYG DPDNTKPKDD VAHVDTIYGG AGDDLIYGSA FEDMIYGGDG NDTISGGGDY
DSILGGEGND RLIGYYSSDK DEHQDLLIGG AGDDTLDGSR NSDTSMGILN GGMGADHIYG
NGVNTMVSYE RTGDGDGMEK NIHGVYLDLR IQDGIKAQTG KERVEDDATD ATGDILTGIV
NALGSYGHDT LIGNDQGNDM YGNGGNNLLI GGDGNDSLRG DINDDTLDGG LGADLIWGWK
GDDIASYENA ASGVNVDLRI QNVNGLTQQG NGEEDGDELW YMDGLYGSAF NDTLTGRDTD
DPDYMSIHNL LQGRGGDDIL SGLAGNDTLD GGTGNDTADY STSGSGVHVV LTIQDGVTAQ
SGTGDAAGDV LIGIENVIGS SYDDTLIGDS NANVLSGLGG ADSIDGGGGI DTVDYSASTA
GVHIDLSRQD GLTAQSGGAD GNHADGDILR NIQNVIGSSY NDTLTGDIGN NVLSGLGGAD
SINGGGAINT VDYSASTAGV HIDLSRQDGL TAQSGGADGN HADGDILQNI QNVIGSSYND
TLIGDGNANV LSGLGGADSI VGGDGIDTVD YRASNEAVTM DLRYGTCKGG DAEGDSLTGI
ENVFGSNFND TITGSNGSNY LFGMDGNDSL AAAYGDDTLD GGAGNDTLRG ARDKNLLLGG
DGNDSLYGQG NNDTLDGEDG SDLLDGGTGA DSLIGGDGID TVDYRASNEA VTMDLRYGTC
KGGDAEGDSL TGIENVFGSN FNDTITGSNG SNYLFGMDGN DSLAAAYGDD TLDGGAGNDT
LWGAKDKNLL LGGDGDDRLY GQGNNDTLAG GAGADLLDGG TGVDTADYSA NSAWVNIDMS
LATAQTGGGD GNHALGDTLV SIEQIIGTND TTHGDVLIGD GGDNLISGGA GDDTIHGGAG
FDSILGGDGD DYLECYLPNS GMQEAYFSYR NKLIGGAGND TLVGSNSDGD DILCGGMGAD
RIIGNGISTI ASYELTGHGD FDISSQGVYL DLRIQDGLTA QTGKEHLEDA DTDATGDILT
GIVNAVGSYG HDTLIGNDQG NDMNGLDGND LMIGGVGNDS LWGVNNNDTL EGGLGEDLIW
GGMDYDIASY ANAASGVKVD LRIQNVKGWV QQGAGEEDGD EFYYMDGLYG SAYNDTLTGR
DIDQLGFMSA HNRLEGREGD DILAGLAGAD ILDGGDGIDT ADYSLSEANV RVDLSQATAQ
TGGGSVMTET ILDRNGDEWK IIYTGNHAAG DVLIGIENVI GSSYDDTLIG DGNANVLSGL
GGADSIDGGG GIDTADYSAS GAAVTMDLRY GTYKGGDAEG DSLTGIENVI GSNFNDTITG
SNGSNYLYGM DGNDALAAAY GADTLDGGAG NDTLRGARDK NLLLGGDGDD RLYGQGNNDT
LAGGAGADLL DGGTGVDAAD FSASSAWVNI DMNLATAQTG GGDGNHALGD TLVSIEQIIG
TNDTTHGDVL IGDGGDNLIS GGAGDDTLDG GLGNDLLLGG EGDDLLIGGV TDTAHGGDGF
DTFRLEDNIG TGSIFDLSVM NDAGRITGIE RIDISGDADD ANALALKASD VLDTTGGADT
LWVRGDANDS VTTTDSGWQL LGVETGADGQ EYNHYSGYAG STLVNLMIES DMAQQNVVHA