Gene Mmc1_2988 details


Gene Information

Locus tag: Mmc1_2988
Symbol:
ID: 4482716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Magnetococcus sp. MC-1
Kingdom: Bacteria
Replicon accession: NC_008576
Strand:
Start bp: 3720008
End bp: 3733567
Gene Length: 13560 bp
Protein Length: 4519 aa
Translation table: 11
GC content: 58%
IMG OID: 639723735
Product: hemolysin-type calcium-binding region
Protein accession: YP_866885
Protein GI: 117926268
COG category:
COG ID:
TIGRFAM ID:
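
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
start_bp, end_bp = 3720008, 3733567
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 13560
print(protein_length_aa(length))  # 4519
```

Both results match the Gene Length (13560 bp) and Protein Length (4519 aa) values listed in the record.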


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGC TGTCGGCCAT TGATCAGGAC ACCACTGGAG CACTGACCTA CGGTGCAGGT 
TTGAAAGATA CCGCTGCGGC GCTGGAAGCC AATACCAACA GCTATGTGAC GGGCAACTAT
GTGGTTTCGA TCACCGATGC GGCCACCATG GCGCAGTTGT CGGCCATTGA TCAGGATACC
ACCGGAGCGC TGACCTACGG TGCGGGTCTG AAGGATGGTG TAGCAGCGCT GGAAGCCAAT
ACCAACAGCT ATGTGACGGG CAGCTATGCG GTCTCCATCA CCGATGTGGT GACCATGGCG
CAGTTGTCGG CCATTGATCA GGATACCACC GGTACCCTGA GCTATGGTGC AGGCTTGAAG
GATGGTGCAG CGGCCCTAGC CGCCAACACC AACAGCTACA TTACGGGCAG CTATGTTGTC
TCCGTTACCG ATGCGGCCAC CATGGCGCAA CTTTCGGTCA TGGATACCGC CACCAGCGGC
ACCCTGACCT ACGGTGCGGG CTTGAAGGAC GGTGTTGCGG CACTGGATGC CAATACCAAC
AGCTACGTGA CCGGTTCTTA CCTCGTTTCC ATCACCGATG CGGCCACCAT GGCGCAGTTG
TCGGCCATTG ATCAGGATAC CACTGGAGCG CTGACCTACG GTGCGGGTTT GAAAGATGCC
GCTGCGGCGC TGGAAGCCAA TACCAACAGC TATGTGACGG GCAGCTATGT GGTCTCCATC
ACCGATGCGG CCACCATGGC GCAGCTGTCG GCCATTGATC AGGACACCAC TGGAGCACTG
ACCTACGGTG CAGGTTTGAA AGATGCCGCT GCGGCGCTGG AAGCCAATAC CAACAGCTAT
GTGACGGGCA GCTATGCGGC CTCGATCACC GATGCGGCTA CCATGGCGCA GTTGTCGGCC
ATTGATCAGG ATACCACCGG AGCGCTGACC TACGGTGCGG GTTTGAAAGA TGCCGCTGCG
GCGCTGGAAG CCAATACCAA CAGCTATGTG ACGGGCAGCT ATGCGGTCTC CATCACCGAT
GTGGTGACCA TGGCGCAGTT GTCGGCCATT GATCAGGATA CCACCGGTAC CCTGAGCTAT
GGTGCAGGCT TGAAGGATGG TGCAGCGGCC CTAGCCGCCA ACACCAACAG CTACATTACG
GGCAGCTATG TTGTCTCCGT TACCGATGCG GCCACCATGG CGCAACTTTC GGTCATGGAT
ACCGCCACCA GCGGCACCCT GACCTACGGT GCGGGCTTGA AGGACGGTGT TGCGGCACTG
GATGCCAATA CCAACAGCTA CGTGACCGGT TCTTACCTCG TTTCGATCAC CGATGCGGCC
ACCATGGCGC AGTTGTCGGC CATTGATCAG GATACCACTG GAGCGCTGAC CTACGGTGCG
GGTTTGAAAG ATGCCGCTGC GGCGCTGGAA GCCAATACCA ACAGCTATGT GACGGGCAGC
TATGTGGTCT CCATCACCGA TGCGGCCACC ATGGCGCAGC TGTCGGCCAT TGATCAGGAC
ACCACTGGAG CACTGACCTA CGGTGCAGGT TTGAAAGATG CCGCTGCGGC GCTGGAAGCC
AATACCAACA GCTATGTGAC GGGCAGCTAT GCGGCCTCGA TCACCGATGC GGCTACCATG
GCGCAGTTGT CGGCCATTGA TCAGGATACC ACCGGAGCGC TGACCTACGG TGCGGGTTTG
AAAGATGCCG CTGCGGCGCT GGAAGCCAAT ACCAACAGCT ATGTGACGGG CAGCTATGCG
GTCTCCATCA CCGATGTGGT GACCATGGCG CAGTTGTCGG CCATTGATCA GGATACCACC
GGTACCCTGA GCTATGGTGC AGGCTTGAAG GATGGTGCAG CGGCCCTAGC CGCCAACACC
AACAGCTACA TTACGGGCAG CTATGTTGTC TCCGTTACCG ATGCGGCCAC CATGGCGCAA
CTTTCGGTCA TGGATACCGC CACCAGCGGC ACCCTGACCT ACGGTGCGGG CTTGAAGGAC
GGTGTTGCGG CACTGGATGC CAATACCAAC AGCTACGTGA CCGGTTCTTA CCTCGTTTCG
ATCACCGATG CGGCCACCAT GGCGCAGTTG TCGGCCATTG ATCAGGATAC CACTGGAGCG
CTGACCTACG GTGCGGGTTT GAAAGATGCC GCTGCGGCGC TGGAAGCCAA TACCAACAGC
TATGTGACGG GCAGCTATGT GGTCTCCATC ACCGATGCGG CCACCATGGC GCAGCTGTCG
GCCATTGATC AGGACACCAC TGGAGCACTG ACCTACGGTG CAGGTTTGAA AGATGCCGCT
GCGGCGCTGG AAGCCAATAC CAACAGCTAT GTGACGGGCA GTTATGTGGT TTCGATCACC
GATGCGGCCA CCATGGCGCA GTTGTCGGCC ATTGATCAGG ATACCACCGG AGCGCTGACC
TACGGTGCGG GTCTGAAGGA TGGGGTAGCA GCGCTGGATG CCAATACCAA CAGCTATGTG
ACGGGCAGCT ATGCGGTCTC CATCACCGAT GTGGTGACCA TGGCGCAGTT GTCGGCCATT
GATCAGGATA CCACCGGTAC CCTGAGCTAT GGTGCAGGCT TGAAGGATGG TGCAGCGGCC
CTAGCCGCCA ACACCAACAG CTACATTACG GGCAGCTATG TTGTCTCCGT TACCGATGCG
GCCACCATGG CGCAACTTTC GGTCATGGAT ACCGCCACCA GCGGCACCCT GACCTACGGT
GCGGGCTTGA AGGACGGTGT TGCGGCACTG GATGCCAATA CCAACAGCTA CGTGACCGGT
TCTTACCTCG TTTCGATCAC CGATGCGGCC ACCATGGCGC AGTTGTCGGC CATTGATCAG
GACACCACCG GAGCGCTGAC CTACGGTGCG GGCTTGAAAG ATGCCGCTGC GGCGCTGGTA
GCCAATACCA ACAGCTACGT GACGGGCAGC TATGTGGTTT CGATCACCGA TGCGGCCACC
ATGGCGCAGC TGTCGGCCAT TGATCAGGAT ACCACCGGTG CGTTGACCTA CGGTGCGGGT
TTGAAAGATG CCGTTGCGGC GCTGGAAGCC AATACCAACA GCTATGTGAC GGGTAGCTAT
GTGGTTTCGA TCACCGATGC GGCTACCATG GCGCAGTTGT CGGCCATTGA TCAGGATACC
ACCGGAGCGC TGACCTACGG TGCGGGTCTG AAGGATGGTG TAGCAGCGCT GGATGCCAAT
ACCAACAGCT ATGTGACGGG CAGCTATGCG GTCTCCATCA CCGATGTGGT GACGATGGCG
CAGTTGTCGG CCATTGATCA GGATACCAGC GGTACCCTGA GCTATGGTGC AGGCTTGAAG
GATGGTGCAG CGGCCCTAGC CGCCAACACC AACAGCTACA TTACGGGCAG CTATGTGGTC
TCCGTTACCG ATGCGGCCAC CATGGCGCAA CTTTCGGTCA TGGATACCGC CACCAGCGGC
ACCCTGACCT ACGGTGCGGG CTTGAAGGAC GGTGTTGCGG CACTGGATGC CAATACCAAC
AGCTACGTGA CCGGTTCTTA CCTCGTTTCG ATCACCGATG CGGCCACCAT GGCGCAGTTG
TCGGCCATTG ATCAGGATAC CACTGGAGCG CTGACCTACG GTGCGGGCTT GAAGGATAGC
GTTGCGGCCC TGGAAGCCAA CACCAACAGC TACGTGACAG GTTCTTACCT CGTTTCCATC
ACCGATGCGG CCACCATGGC GCAGCTGTCG GCCATTGATC AGGATACCAC CGGAGCGCTG
ACCTACGGTG CAGGTTTGAA AGATACCGCT GCGGCGCTGG AAGCCAATAC CAATAGCTAT
GTGACGGGCA GCTATGTTGT CTCCATCACC GATGCGGCTA CCATGGCGCA GTTGTCGGCC
ATTGATCAGG ATACCTCTGG AGCGCTGACC TACGGTGCGG GTCTGAAGGA TGGGGTAGCA
GCGCTGGATG CCAATACCAA CAGCTATGTG ACGGGCAGCT ATGCGGTCTC CATCACCGAT
GTGGTGACGA TGGCACAGCT GTCGGCCATT GATCAGGATA CCACCGGTAC CCTGAGCTAT
GGTGCAGGCT TGAAGGATGG TGCAGCGGCC CTAGCCGCCA ACACCAACAG CTACATTACG
GGCAGCTATG TGGTCTCCGT TACCGATGCG GCCACCATGG CGCAACTTTC GGTCATGGAT
ACCGCCACCA GCGGCACCCT GACCTACGGT GCGGGCTTGA AGGACGGTGT TGCGGCACTG
GATGCCAATA CCAACAGCTA CGTGACCGGT TCTTACCTCG TTTCCATCAC CGATGCGGCC
ACCATGGCGC AGTTGTCGGC CATTGATCAG GACACCACCG GAGCGCTGAC CTACGGTGCG
GGCTTGAAAG ATGCCGCTGC GGCGCTGGTA GCCAATACCA ACAGCTACGT GACGGGCAGC
TATGTGGTTT CGATCACCGA TGCGGCCACC ATGGCGCAGC TGTCGGCCAT TGATCAGGAT
ACCACCGGTG CGTTGACCTA CGGTGCGGGT TTGAAAGATG CCGCTGCGGC GCTGGAAGCC
AATACCAACA GCTATGTGAC GGGCAGCTAT GTGGTTTCGA TCACCGATGC GGCCACCATG
GCGCAGTTGT CGGCCATTGA TCAGGATACC ACCGGAGCGC TGACCTACGG TGCGGGTCTG
AAGGATGGGG TAGCAGCGCT GGATGCCAAT ACCAACAGCT ATGTGACGGG CAGCTATGCG
GTCTCCATCA CCGATGTGGT GACCATGGCG CAGTTGTCGG CCATTGATCA GGATACCACC
GGTACCCTGA GCTATGGTGC AGGCTTGAAG GATGGTGCAG CGGCCCTCGC CGCCAACACC
AATAGCTACA TTACGGGCAG CTATGTGGTC TCCGTTACCG ATGCGGCCAC CATGGCGCAA
CTTTCGGTCA TGGATACCGC CACCAGCGGC ACCCTAAGCT ACGGTGCGGG CTTGAAGGAC
GGTGTTGCGG CACTGGATGC CAATACCAAC AGCTACGTGA CCGGTTCTTA CCTCGTTTCC
ATCACCGATG CGGCCACCAT GGCGCAGTTG TCGGCCATTG ATCAGGATAC CACTGGAGCG
CTGACCTACG GTGCGGGCTT GAAGGATAGC GTTGCGGCCC TGGAAGCCAA CACCAACAGC
TACGTGACAG GTTCTTACCT CGTTTCCATC ACCGATGCGG CCACCATGGC GCAGTTGTCG
GCCATTGATC AGGATACCAC CGGAGCGCTG ACCTACGGTG CGGGCTTGAA GGATAGCGTT
GCAGCGCTGG ATGCCAATAC CAACAGCTAT GTGACGGGCA GCTATGCGGT CTCCATCACC
GATGTGGTGA CCATGGCGCA GCTGTCGGCC ATTGATCAGG ATACCACCGG TACCCTGAGC
TACGGTGCGG GTCTGAAGGA TGGTGCAGCG GCCCTCGCCG CCAACACCAA CAGCTACATC
ACGGGCAGCT ATGTGGTCTC CGTTACCGAT GCGGCCACCA TGGCGCAACT TTCGGTCATG
GATACCGCCA CCAGCGGCAC CCTAAGCTAC GGTGCGGGCT TGAAGGACGG TGTTGCGGCA
CTGGATGCCA ATACCAACAG CTACGTGACC GGTTCTTACC TCGTTTCCAT CACCGATGCG
GCCACCATGG CGCAGTTGTC GGCCATTGAT CAGGATACCA CCGGAGCGCT GACTTACGGT
GCGGGCTTGA AGGATAGCGT TGCGGCGCTG GAAGCCAATA CCAACAGCTA TGTGACGGGC
AGCTATGTGG TTTCGATCAC CGATGCGGCC ACCATGGCGC AGCTGTCGGC CATTGATCAG
GATACCACCG GTGCGTTGAC CTACGGTGCG GGTCTGAAAG ATGCCGCTGC GGCCCTGGAA
GCCAATACCA ACAGCTACGT GACGGGTAGT TATGTGGTCT CCATCACCGA TGCGGCCACC
ATGGCGCAGT TGTCGGCCAT TGATCAGGAT ACCACCGGAG CGCTGACCTA CGGTGCGGGT
TTGAAAGATG CCGCTGCGGC GCTGGAAGCC AATACCAACA GCTATGTGAC GGGTAGCTAT
GCGGTCTCGA TAACCGATGC GGCCACCATG GCGCAGCTGT CGGCCATTGA TCAGGATACC
ACGGGTGCGC TGACCTACGG TGCGGGCTTG AAGGATGGGG TAGCGGCGCT TGATGCCAAC
ACCAACAGCT ACGTGACGGG CAGCTATGTG GTTTCGATCA CCGATGCGGC CACCATGGCG
CAGCTGTCGG CCATTGATCA GGATACCACC GGTGCGTTGA CCTACGGTGC AGGTCTGAAA
GATGGCGTCG CGGCGCTGGA AGCCAATACC AACAGCTATG TGACCGGTTC TTACCTCGTT
TCCATCACCG ATGTGGTGAC CATGGCGCAG TTGTCGGCCA TTGATCAGGA CACCACCGGT
ACCCTGACCT ACGGTGCGGG CTTGAAGGAT AGCGTTGCGG CCCTGGAAGC CAATACCAAC
AGCTATGTGA CGGGCAGCTA TGTGGTTTCG ATCACCGATG CGGCCACCAT GGCGCAGCTG
TCGGCCATTG ATCAGGACAC CACCGGAGCG CTGACCTACG GTGCAGGTTT GAAAGATACC
GCTGCGGCGC TGGAAGCCAA TACCAACAGC TATGTGACGG GCAGCTATGC GGTCTCCATC
ACCGATGCGG CCACCATGGC GCAGTTGTCG GCCATTGATC AGGATACCAC CGGAGCGCTG
ACCTACGGTG CGGGTCTGAA GGATAGCGTT GCAGCGCTGG AAGCCAATAC CAACAGCTAT
GTGACGGGCA GCTATGCGGT CTCCATCACC GATGTGGTGA CGATGGCACA GCTGTCGGCC
ATTGATCAGG ATACCACCGG TACCCTGAGC TATGGTGCAG GCTTGAAGGA TGGTGCAGCG
GCCCTAGCCG CCAACACCAA TAGCTACATT ACGGGCAGCT ATGTGGTCTC CGTTACCGAT
GCAGCCACCA TGGCGCAACT TTCGGTCATG GATACCGCCA CCAGCGGCAC CCTGAGCTAC
GGTGCGGGCT TGAAGGACGG TGTTGCGGCA CTGGATGCCA ATACCAACAG CTACGTGACC
GGTTCTTACC TCGTTTCCAT CACCGATGCG GCCACCATGG CGCAGTTGTC GGCCATTGAT
CAGGATACCA CTGGAGCGCT GACCTACGGT GCGGGCTTGA AAGATGCCGC TGCGGCGCTG
GTAGCCAATA CCAACAGCTA TGTGACGGGT AGCTATGTGG TTTCGATCAC CGATGCGGCC
ACCATGGCGC AGTTGTCGGC CATTGATCAG GATACCACCG GTGCGTTGAC TTACGGTGCG
GGTTTGAAAG ATGCCGCTGC GGCGCTGGAA GCCAATACCA ACAGCTATGT GACGGGCAGT
TATGTGGTTT CGATCACCGA TGCGGCCACC ATGGCGCAGT TGTCGGCCAT TGATCAGGAT
ACCTCCGGAG CGCTGACCTA CGGTGCGGGT CTGAAGGATA GCGTTGCAGC GCTGGAAGCC
AATACCAACA GCTATGTAAC GGGCAGCTAT GCGGTCTCCA TCACCGATGT GGTGACCATG
GCGCAGTTGT CGGCCATTGA TCAGGATACC ACCGGTACCC TGAGCTATGG TGCAGGCTTG
AAGGATGGTG CAGCGGCCCT CGCCGCCAAC ACCAACAGCT ACATTACGGG CAGCTATGTG
GTCTCCGTTA CCGATGCGGC CACCATGGCG CAACTTTCGG TCATGGATAT CGCCACCAGC
GGCACCCTAA GCTACGGTGC GGGCTTGAAG GACGGTGTTG CGGCACTGGA TGCCAATACC
AACAGCTACG TGACCGGTTC TTACCTCGTT TCGATCACCG ATGCGGCCAC CATGGCGCAG
TTGTCGGCCA TTGATCAGGA CACCACTGGA GCGCTGACCT ACGGTGCGGG TTTGAAAGAT
GCCGCTGCGG CGCTGGTAGC CAATACCAAC AGCTATGTGA CGGGTAGCTA TGTGGTTTCG
ATCACCGATG CGGCCACCAT GGCGCAGTTG TCGGCCATTG ATCAGGACAC CACTGGAGCA
CTGACCTACG GTGCAGGTTT GAAAGATGCC GTTGCGGCCC TGGAAGCCAA TACCAACAGC
TATGTGACGG GCAGCTATGT GGTTTCGATC ACCGATGCGG CCACCATGGC GCAGTTGTCG
GCCATTGATC AGGATACCAC CGGAGCGCTG ACCTACGGTG CCGGTCTGAA GGATAGCGTT
GCAGCGCTGG ATGCCAATAC CAACAGCTAT GTGACAGGCA GCTATGCGGT CTCCATCACC
GATGTGGTGA CCATGGCGCA GTTGTCGGCC ATTGATCAGG ATACCACCGG TACCCTGAGC
TACGGTGCGG GCTTGAAGGA TGGTGCAGCG GCCCTCGCCG CCAACACCAA CAGCTACATT
ACGGGCAGCT ATGTGGTCTC CGTTACCGAT GCGGCCACCA TGGCGCAACT TTCGGTCATG
GATACCGCCA CCAGCGGCAC CCTGACCTAC GGTGCGGGCT TGAAAGACAG CGTTGCGGCA
CTGGATGCCA ATACCAACAG CTACGTGACG GGCAGCTATC TGGTATCCAT CACCGATGCG
GCCACCATGG CGCAGTTGTC GGCCATTGAT CAGGATACCT CCGGAGCGCT GACCTACGGT
GCGGGCTTGA AAGATGCCGC TGCGGCGCTG GTAGCCAATA CCAACAGCTA TGTGACGGGT
AGCTATGTGG TTTCGATCAC CGATGCGGCC ACCATGGCGC AGTTGTCGGC CATTGATCAG
GATACCACCG GAGCACTGAC CTACGGTGCA GGTTTGAAAG ATGCCGTTGC GGCCCTGGAA
GCCAATACCA ACAGCTATGT GACGGGCAGC TATGTGGTTT CGATCACCGA TGCGGCCACC
ATGGCGCAGT TGTCGGCCAT TGATCAGGAT ACCACCGGAG CGCTGACCTA CGGTGCCGGT
CTGAAGGATA GCGTTGCAGC GCTGGATGCC AATACCAACA GCTATGTGAC AGGCAGCTAT
GCGGTCTCCA TCACCGATGT GGTGACCATG GCGCAGTTGT CGGCCATTGA TCAGGAAACC
ACCGGTACCC TGAGCTACGG TGCGGGCTTG AAGGATGGTG CAGCGGCCCT CGCCGCCAAC
ACCAACAGCT ACATTACGGG CAGCTATGTG GTCTCCGTTA CCGATGCGGC CACCATGGCG
CAACTTTTGG TCATGGATTC CGCCACCAGC GGCACCCTGA CCTACGGTGC GGGCTTGAAA
GACAGCGTTG CGGCACTGGA TGCCAATACC AACAGCTACG TGACGGGCAG CTATGTGGTA
TCCATCACCG ATGCGGCCAC CATGGCGCAG TTGTCGGCCA TTGATCAGGA TACCACCGGA
GCGCTGACCT ACGGTGCGGG CTTGAAAGAT GCCGCTGCGG CGCTGGTAGC CAACACCAAC
AGCTATGTGA CGGGTAGCTA TGTGGTATCC ATCACCGATG CGGCCACCAT GGCGCAGTTG
TCGGCCATTG ATCAGGATAC CACCGGAGCG CTGACTTACG GTGCGGGTTT GAAAGATGCC
GCTGCGGCGC TGGAAGCCAA TACCAACAGC TATGTGACGG GCAGCTATGC GGTCTCCATC
ACCGATGCGG CCACCATGGC GCAGCTGTCG GCCATTGATC AGGATACCAC CGGTGCGTTG
ACCTACGGTG CGGGTTTGAA AGATGCCGCT GCGGCGCTGG AAGCCAATAC CAACAGCTAT
GTGACGGGCA GCTATGCGGT CTCCATCACC GATGCGGCCA CCATGGCGCA GTTGTCGGCC
ATTGATCAGG ATACCACCGG TACCCTGAGC TATGGTGCGG GTTTGAAGGA TGGTGCAGCG
GCCCTAGCCG CCAACACCAA CAGCTACATT ACGGGCAGCT ATGTGGTCTC CGTTACCGAC
GTGGCCACCA TGGCGCAGCT CTCTGCCATG GATACGGCCA CTACCGGCAC CCTGACCTAC
GGGGCAGGCT TGAAGGATAG TGCCGCTGCC TTGGTCGCCA ATACCAACAG CTATGTGACT
GGGGCTGTAA CGGTTACAGT TACGGATGCA GCGACCACGG CTCAGCTGGG TGCAATTGAC
CAAGATACCA CGGGTACTGT GAACTACAGC TTGGCAGGTA TTAAGGATAC GGTTTCCAAT
ATCACCATCG ACTCTGGTAA CTATGTTGCC AATGCGGGTG GGGCCACGAT CACGGTAAAT
GATGGTATCG CTAACCTGAT TACGGACGCC GGTACAGTCG TTACCGGAAC CCGCAATGTT
ACGGTAACGG ATGCAGCCTC CATGGCGCAG CTGTCGCAAA TTGACAACTA CACCACAGGT
GCTCTGAAAT ATGTCACGAT TAAGGATGCG GTGGCGGCCC TGGTTGCTAA TACCAATAGC
TATGTGACAG GTTCCTATGC TGTATCGGTG ACGGATGCCG CCAGCATGGC GCAATTGTCG
GCTATTGATC AGGATACCAC CGGTACCCTG ACCTATACAA AACTGACCGA TGCGGTAGCG
AATTTGGTAA CCAACACCAA CAGCTATGTA ACCGGGTCTG TCAATGTCAC GGTGTCGGAT
ATTGCGACCA TCAGCCAACT CTCAAGCATT GATGCGAACA CTACAGGTTC TGTGACCTAC
ACCCAAATTG GTGATGCCGC AGCAACCTTG GCAACCAATG CTGGCAACTA TGTCAAAGCA
ACAATCCATG TCACGGTGAC GGATGCTGCG ACGATTGCGC AATTGACAAC CATTGATGGG
GATAATACCA CAGGGTCGTT GGTCTACACG GCGGGTGGGG TGAAAGATAG CGCGGCTAAT
TTGGTGGTTA ATACCAACAG CTACGTGACA GGGGCCGTCA ATGTTTCGGT AACGGATACG
GTCTCCATCG CTCAGTTGTC CGCTGTCGAT GAGTACACCA CCGGTACATT GACCTACGGG
GCCGGTGTGA AGGATTCGGT GGCCAACCTG TTGGTCAATA CCAACAGCTA TGTCACGGGT
TCTTATGCGG TCTCGATCAC CGATGTCGCC AGCATGGCGA ATTTGTCGGC GATCGACCAG
TTTACCACGG GCACGCTTAA CTATACGAAG CTGAGCGACA CGGTGTCGGC TTTGGTTGCC
AATACCAACA GCTATGTGAC TGGCTCGGTT AATGTGACCA TTACTGACAA TGCCAGCATG
GCCAACATGT CGGCGATTGA TCAAAACACC ACGGGCACAT TGACCTATAC CAAACTCAGC
GATACCGCCG CTGCTTTGGC GGCCAATACC AACAGCTATG TAACCGGTTC GGTCAATGTA
ACCGTGACGG ATAATGCCAC GGTCGCTCAG TTGACCACCG TGGATGCGGC TACGACGGGG
ACCATTAAGG TCGCCAGCGT GGTGGATAGT GGCTCAAATA TCTCCAGCAA CTTCGCCTAC
GTGGATGGCT TGGGTGTGAG CTATGTAAAC GCCAACGATA ACGTCATGGC CGTGACTGTG
GCTCAGGCGA CGGATGCCAC GGTAACCATA GCCGATGATG ACGTGCTCAC CATCACCGAC
ACCTCCAGTA ACATCCAGGG TTCGACCTTT AGCGCTTTGG TGACTGCGGG TGCTGACGCG
ACCTACCTGG ATGCGACGGA TGACATATTG ACGGTAACAG CCGCACAAGC GGCCACCACC
AATATCAACT TTACCGCCGC AGATGTGGTT ACGGTGAGCG ACACCGGGTC CAACATTCAG
ACCAACATCA CCACGATTTT GGCCAAAGAT GTGGATAAGT TGAGTGCCTC TTCGGCGCTG
GATCTTACCG ATAGTGATGT AACCGGCAAA ACGGTGGACT TCTCTGGTGT TGGGGATACA
ACCATTACCA TGGCCTACGA TAGTTCCAAC GTAGCCTTCG GTAGCCTGAC TTCAACGGGT
GGTGGTGAGC TGATTCTGTC GGTTACAGGT ACGGGTACCT TGGCAACGGT GACGGGCCTA
TCAAACTTCA ATCAACTGTC TGCGGCCAGT GACTTGACCA TCGACTCTGC TCAGGTGGCG
GGCCAAACGT TGGATATGAG TGGTGCTGGT AATGTCACCG TCTCGTTGGC TTCGGCGACC
ACGGCCAGCT ACAGTAGTTT GACATCCACG GGGGCGGGCA CCTTGGGTCT GCAAATTGGT
TCAACCGGAA CCTATACTTC GGTTACGGGT TGGGAGGTGT TTGATAGCAT TACCAGCGAT
TATGCGATCA CGGTTGCGGC CTCGGCCGTT ACCGGAAAAA CGGTGGCCTT TAGTGGTACT
GGCGCGGTCA CCATTGATGG TGCGAGCAGC AGCAATGCAA CCTTCCTAAA CGTGACCCAT
GATTCAGGCA CCTCGGGTGA TTTGATCTTG GCGGTTAGCA GCAGCACGGG TTCGGCTGTG
TTTAGCTCGG GTGCGGCTAA CTTTGATAAG GTTTCGGTGG ATGGTACCCT CTACACCGGT
ATCACCAGTG CCAATGGTCA GATTATTGTC AACGGTTCCA CAAGTGTGGC TGCGGGGATT
TCGGGTACGG CGTACACCGA CTTTATCGTG AACTATGATA CCGACAGCTT GGGAACAACC
TATGCTTCAA CCCAAGGTTT GTTGGGTGGT GCCGGTGACG ACCGCTTGAT TGATACCACC
AGCAGCGGTG TGTTGCTGGC AGGTGAGGCT GGAGACGACA CGCTAACCGG TGGTGCTGGA
AATGACATCT TCTACCATGA TGGTTCCAGC CATGGTTTGG ATACCATTAC CGACTTCGCA
TCAGGGGATC TGTTGCAAAT TGCTTCGGGC AATGCAGGTC AGTGGACGCT GGATTCTACC
GGTCAAGCGC TGACCAAGGC GACCAACAGC AGCCTGGGCT TTAACTTTAA CAGTGGTAAT
GACCTGAATG ATGCGACCTA TGATCTTCAG TTCAACGGTG CCAGTGATGC AGCGGGTCAT
ATCATGACCT TTGACTTGGA TACGGATGCT GATCATACGG TGGCCGGTTC TTCGTATGCA
GATCTGATGG TTAACTTTAG CAGTGGTACC ACCGATGGTT ACACCATGAG TGGTGGTAGC
GGTAACGACC GTTTGATCGA CTACAACGAT AGTGCCTCTA CCTACATGAC GGGTGGTAGT
GGTCAGGATA ACTTTGTCTT CAACGAGAAC ACCACCGGTT TGACAGCGGG TGACTACACG
CTGGCGACAG GCAATGCAGA TATCATTACC GATTGGTCGG TGGATGACTT CTTCATCTGG
GCTTCGAGCA ACCTGATTGC GGGTGGTGTA ACAGCAAACG GAGCTGCGGC AACCAAGACC
TACTTGGATG CTTCGGTTGC GGGTGGTAGC GATGGGTATG ATGAGAGCGC CTCTGCACAC
ATCTACATCC TGCGCGATGC CAACAACGTC GCAACCAGCG GGGATGGAAC AGTGGCAGAT
GCCCAAGCCA CGATTGACGC CATCAACGGG GCTACCTCGG ATGGTACCTT TAATGCGGCG
AGTGCCAGTG ATGTGCTGCT GTTTGCGGTC TATGACAGCA CGACCACCGA CACCCATGTT
TGGTTGGCGA GTGTTGGTGC TAACCAGACG TTGGATGTGG CAGACATGAC CATGGTGGCC
ACTTTGCAAG ATACGAACAT CACAGCGACT GGTGTTATGT ACACAGCCAA CTTTGCGTAA
 
Protein sequence
MAQLSAIDQD TTGALTYGAG LKDTAAALEA NTNSYVTGNY VVSITDAATM AQLSAIDQDT 
TGALTYGAGL KDGVAALEAN TNSYVTGSYA VSITDVVTMA QLSAIDQDTT GTLSYGAGLK
DGAAALAANT NSYITGSYVV SVTDAATMAQ LSVMDTATSG TLTYGAGLKD GVAALDANTN
SYVTGSYLVS ITDAATMAQL SAIDQDTTGA LTYGAGLKDA AAALEANTNS YVTGSYVVSI
TDAATMAQLS AIDQDTTGAL TYGAGLKDAA AALEANTNSY VTGSYAASIT DAATMAQLSA
IDQDTTGALT YGAGLKDAAA ALEANTNSYV TGSYAVSITD VVTMAQLSAI DQDTTGTLSY
GAGLKDGAAA LAANTNSYIT GSYVVSVTDA ATMAQLSVMD TATSGTLTYG AGLKDGVAAL
DANTNSYVTG SYLVSITDAA TMAQLSAIDQ DTTGALTYGA GLKDAAAALE ANTNSYVTGS
YVVSITDAAT MAQLSAIDQD TTGALTYGAG LKDAAAALEA NTNSYVTGSY AASITDAATM
AQLSAIDQDT TGALTYGAGL KDAAAALEAN TNSYVTGSYA VSITDVVTMA QLSAIDQDTT
GTLSYGAGLK DGAAALAANT NSYITGSYVV SVTDAATMAQ LSVMDTATSG TLTYGAGLKD
GVAALDANTN SYVTGSYLVS ITDAATMAQL SAIDQDTTGA LTYGAGLKDA AAALEANTNS
YVTGSYVVSI TDAATMAQLS AIDQDTTGAL TYGAGLKDAA AALEANTNSY VTGSYVVSIT
DAATMAQLSA IDQDTTGALT YGAGLKDGVA ALDANTNSYV TGSYAVSITD VVTMAQLSAI
DQDTTGTLSY GAGLKDGAAA LAANTNSYIT GSYVVSVTDA ATMAQLSVMD TATSGTLTYG
AGLKDGVAAL DANTNSYVTG SYLVSITDAA TMAQLSAIDQ DTTGALTYGA GLKDAAAALV
ANTNSYVTGS YVVSITDAAT MAQLSAIDQD TTGALTYGAG LKDAVAALEA NTNSYVTGSY
VVSITDAATM AQLSAIDQDT TGALTYGAGL KDGVAALDAN TNSYVTGSYA VSITDVVTMA
QLSAIDQDTS GTLSYGAGLK DGAAALAANT NSYITGSYVV SVTDAATMAQ LSVMDTATSG
TLTYGAGLKD GVAALDANTN SYVTGSYLVS ITDAATMAQL SAIDQDTTGA LTYGAGLKDS
VAALEANTNS YVTGSYLVSI TDAATMAQLS AIDQDTTGAL TYGAGLKDTA AALEANTNSY
VTGSYVVSIT DAATMAQLSA IDQDTSGALT YGAGLKDGVA ALDANTNSYV TGSYAVSITD
VVTMAQLSAI DQDTTGTLSY GAGLKDGAAA LAANTNSYIT GSYVVSVTDA ATMAQLSVMD
TATSGTLTYG AGLKDGVAAL DANTNSYVTG SYLVSITDAA TMAQLSAIDQ DTTGALTYGA
GLKDAAAALV ANTNSYVTGS YVVSITDAAT MAQLSAIDQD TTGALTYGAG LKDAAAALEA
NTNSYVTGSY VVSITDAATM AQLSAIDQDT TGALTYGAGL KDGVAALDAN TNSYVTGSYA
VSITDVVTMA QLSAIDQDTT GTLSYGAGLK DGAAALAANT NSYITGSYVV SVTDAATMAQ
LSVMDTATSG TLSYGAGLKD GVAALDANTN SYVTGSYLVS ITDAATMAQL SAIDQDTTGA
LTYGAGLKDS VAALEANTNS YVTGSYLVSI TDAATMAQLS AIDQDTTGAL TYGAGLKDSV
AALDANTNSY VTGSYAVSIT DVVTMAQLSA IDQDTTGTLS YGAGLKDGAA ALAANTNSYI
TGSYVVSVTD AATMAQLSVM DTATSGTLSY GAGLKDGVAA LDANTNSYVT GSYLVSITDA
ATMAQLSAID QDTTGALTYG AGLKDSVAAL EANTNSYVTG SYVVSITDAA TMAQLSAIDQ
DTTGALTYGA GLKDAAAALE ANTNSYVTGS YVVSITDAAT MAQLSAIDQD TTGALTYGAG
LKDAAAALEA NTNSYVTGSY AVSITDAATM AQLSAIDQDT TGALTYGAGL KDGVAALDAN
TNSYVTGSYV VSITDAATMA QLSAIDQDTT GALTYGAGLK DGVAALEANT NSYVTGSYLV
SITDVVTMAQ LSAIDQDTTG TLTYGAGLKD SVAALEANTN SYVTGSYVVS ITDAATMAQL
SAIDQDTTGA LTYGAGLKDT AAALEANTNS YVTGSYAVSI TDAATMAQLS AIDQDTTGAL
TYGAGLKDSV AALEANTNSY VTGSYAVSIT DVVTMAQLSA IDQDTTGTLS YGAGLKDGAA
ALAANTNSYI TGSYVVSVTD AATMAQLSVM DTATSGTLSY GAGLKDGVAA LDANTNSYVT
GSYLVSITDA ATMAQLSAID QDTTGALTYG AGLKDAAAAL VANTNSYVTG SYVVSITDAA
TMAQLSAIDQ DTTGALTYGA GLKDAAAALE ANTNSYVTGS YVVSITDAAT MAQLSAIDQD
TSGALTYGAG LKDSVAALEA NTNSYVTGSY AVSITDVVTM AQLSAIDQDT TGTLSYGAGL
KDGAAALAAN TNSYITGSYV VSVTDAATMA QLSVMDIATS GTLSYGAGLK DGVAALDANT
NSYVTGSYLV SITDAATMAQ LSAIDQDTTG ALTYGAGLKD AAAALVANTN SYVTGSYVVS
ITDAATMAQL SAIDQDTTGA LTYGAGLKDA VAALEANTNS YVTGSYVVSI TDAATMAQLS
AIDQDTTGAL TYGAGLKDSV AALDANTNSY VTGSYAVSIT DVVTMAQLSA IDQDTTGTLS
YGAGLKDGAA ALAANTNSYI TGSYVVSVTD AATMAQLSVM DTATSGTLTY GAGLKDSVAA
LDANTNSYVT GSYLVSITDA ATMAQLSAID QDTSGALTYG AGLKDAAAAL VANTNSYVTG
SYVVSITDAA TMAQLSAIDQ DTTGALTYGA GLKDAVAALE ANTNSYVTGS YVVSITDAAT
MAQLSAIDQD TTGALTYGAG LKDSVAALDA NTNSYVTGSY AVSITDVVTM AQLSAIDQET
TGTLSYGAGL KDGAAALAAN TNSYITGSYV VSVTDAATMA QLLVMDSATS GTLTYGAGLK
DSVAALDANT NSYVTGSYVV SITDAATMAQ LSAIDQDTTG ALTYGAGLKD AAAALVANTN
SYVTGSYVVS ITDAATMAQL SAIDQDTTGA LTYGAGLKDA AAALEANTNS YVTGSYAVSI
TDAATMAQLS AIDQDTTGAL TYGAGLKDAA AALEANTNSY VTGSYAVSIT DAATMAQLSA
IDQDTTGTLS YGAGLKDGAA ALAANTNSYI TGSYVVSVTD VATMAQLSAM DTATTGTLTY
GAGLKDSAAA LVANTNSYVT GAVTVTVTDA ATTAQLGAID QDTTGTVNYS LAGIKDTVSN
ITIDSGNYVA NAGGATITVN DGIANLITDA GTVVTGTRNV TVTDAASMAQ LSQIDNYTTG
ALKYVTIKDA VAALVANTNS YVTGSYAVSV TDAASMAQLS AIDQDTTGTL TYTKLTDAVA
NLVTNTNSYV TGSVNVTVSD IATISQLSSI DANTTGSVTY TQIGDAAATL ATNAGNYVKA
TIHVTVTDAA TIAQLTTIDG DNTTGSLVYT AGGVKDSAAN LVVNTNSYVT GAVNVSVTDT
VSIAQLSAVD EYTTGTLTYG AGVKDSVANL LVNTNSYVTG SYAVSITDVA SMANLSAIDQ
FTTGTLNYTK LSDTVSALVA NTNSYVTGSV NVTITDNASM ANMSAIDQNT TGTLTYTKLS
DTAAALAANT NSYVTGSVNV TVTDNATVAQ LTTVDAATTG TIKVASVVDS GSNISSNFAY
VDGLGVSYVN ANDNVMAVTV AQATDATVTI ADDDVLTITD TSSNIQGSTF SALVTAGADA
TYLDATDDIL TVTAAQAATT NINFTAADVV TVSDTGSNIQ TNITTILAKD VDKLSASSAL
DLTDSDVTGK TVDFSGVGDT TITMAYDSSN VAFGSLTSTG GGELILSVTG TGTLATVTGL
SNFNQLSAAS DLTIDSAQVA GQTLDMSGAG NVTVSLASAT TASYSSLTST GAGTLGLQIG
STGTYTSVTG WEVFDSITSD YAITVAASAV TGKTVAFSGT GAVTIDGASS SNATFLNVTH
DSGTSGDLIL AVSSSTGSAV FSSGAANFDK VSVDGTLYTG ITSANGQIIV NGSTSVAAGI
SGTAYTDFIV NYDTDSLGTT YASTQGLLGG AGDDRLIDTT SSGVLLAGEA GDDTLTGGAG
NDIFYHDGSS HGLDTITDFA SGDLLQIASG NAGQWTLDST GQALTKATNS SLGFNFNSGN
DLNDATYDLQ FNGASDAAGH IMTFDLDTDA DHTVAGSSYA DLMVNFSSGT TDGYTMSGGS
GNDRLIDYND SASTYMTGGS GQDNFVFNEN TTGLTAGDYT LATGNADIIT DWSVDDFFIW
ASSNLIAGGV TANGAAATKT YLDASVAGGS DGYDESASAH IYILRDANNV ATSGDGTVAD
AQATIDAING ATSDGTFNAA SASDVLLFAV YDSTTTDTHV WLASVGANQT LDVADMTMVA
TLQDTNITAT GVMYTANFA
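
Both sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that layout and translate the nucleotide sequence so it can be compared with the listed protein sequence. The helper names (`join_blocks`, `translate`) are hypothetical, and the codon table is built by hand. It relies on NCBI translation table 11 assigning the same amino acids to codons as the standard code, with the two tables differing only in which start codons are permitted.

```python
# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for the ten-character block layout."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and the first two protein blocks from the record.
gene_line = "ATGGCGCAGC TGTCGGCCAT TGATCAGGAC ACCACTGGAG CACTGACCTA CGGTGCAGGT"
protein_start = "MAQLSAIDQD TTGALTYGAG"

translated = translate(join_blocks(gene_line))
print(translated)                                # MAQLSAIDQDTTGALTYGAG
print(translated == join_blocks(protein_start))  # True
```

Applied to the full gene sequence, the same two helpers should reproduce the complete protein sequence, provided the record contains no ambiguous bases (the lookup raises `KeyError` on characters other than A, C, G and T).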