Gene Mmc1_2695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_2695 
Symbol 
ID4482547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp3400104 
End bp3410873 
Gene Length10770 bp 
Protein Length3589 aa 
Translation table11 
GC content54% 
IMG OID639723442 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_866596 
Protein GI117925979 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATATAG ATCATAAAAG AGAGTGTTCT GTCCAAAAAA CAGCTATTTT TCCACGTAGA 
CGTGGGCTTT TATCCCTTCT CACCCTGCTT TTGCTGCCCA CCCCAAGAAC CGATCTTTGG
GCTGCACCAG CAATGGATGC TCTGCCAACG GGAGGAAACA TCACCGTTGG CGAGGGGAGT
ATCTATCAAT CGGGCGTCAA TATGGTCATT GAGCAGACCA GCAGTAAGCT GATCGCCAAT
TGGGAGAGCT TTCAGATCGG GGCGGATGCC CAGGTGACGT TTCAGCAGCC AAGTGCCTCA
AGTATTGCCT TAAACCGGGT CATGAGTGCT GGGGCCAGTG AGATCTTTGG ACGCTTGAAC
GCCAATGGTC AGGTTTTTCT CATCAACCCA TCAGGGGTGA TTTTTGGCCA GAATGCCCGT
ATTGAGGTAG GTGGTCTGGT GGCTTCCACT CTTGACCTCT CCGATGACGA TTTTATGCAG
GAGAATTATC GCTTTTCTGC TGGTGAGAGC ACAGGAAGCA TCGAAAACTG GGGCAGCTTG
AATGCGGCTG AAGGGGGATA TATTGCCTTG CTCTCACCCA GGGTGCGCAA TGAGGGGGAT
ATCTCAGCCC CAAAAGGCAC AGCGGTCCTG GCGGCTGGGG ATGAGGTCAG TGTGGATCTT
TATGGAGACA GTCTGATCAA GCTAACGGTG CATAAAGGGA CCCTGGATGC CCTGGCGGAA
AATAAAGGAC TGGTGATGGC TTCCGGTGGA GTGGCGCTGC TTTCTGCCAA AGGGGTTGAT
GCGGTGCATC GGGCGGTGGT CAACAACAGC GGCACCGTGG AGGCTGGGGG GATCACCACC
CAGGGGGGGC GTATCTTTTT GACCGCCGCT GGTGGGGATG TGACCAATGC TGGAACGCTG
GATGTGACCT CTGAACAGGC CAAAGGGGGC AGCATCACCA TGACCGGTGA GACCACCACG
ATCGCCAGTG GGGCCAGGCT GAATGCCTCC GGTGCTACTG GGGGGGGTGA GATTCAGGTG
GGGGGCAGTT GGCAAAACAG TAATCCGGAT GTGGCCCAAT CCCAGCAGAC GGTTGTGGAA
CAAGGGGCAG AGCTAAAGGC TAATGCAACC GACAGCGGGG ATGGTGGAAC CGTCGTGGTG
TGGTCCGATA TCCACAATGT TGAAGGTAAG ACCCGTGTGC ATGGGGTGGT GGAGGCCAAA
GGGGGAGAGC AGGGTGGTGA TGGGGGCCAA GTTGAGACCT CTGGACACCA TCTGGATGTG
ACCGGCATTC AACTGGCACT GGATGCGCCA AAAGGGGCCG GTGGTTTGTG GTTGCTGGAC
CCCAACAATA TTGAGATTAC CAACGCTACG TCCAATTTGG ATGACTCTTC CGCTCCAACC
TATACCTCGA CTGACGATGG TTCCCAAATT GATGTGGCCA CCATCGAAGC TCAGCTTAAT
GCTGGGGTTG TTGTGAAGAT TCAAACGGCC TCAGCGGGTA CCAATGCAGA GGATGGTGAC
ATCATCATCA ATGCCGACAT CAGCAAAACG TCAGGGGATG ATGCAAGCCT GATCCTGTAC
GCCTACGGCA ATATTGTGAT GGATGGACAC AGCATTACCT CCATATCCAA TGAATTGGGT
GTGGCTTTCT ATGCCGATTA CAACAGCTCC AGCGACGGTG CTATCGTCAT CAGCGGTGAC
AGCAGCATTA CCACCAACGG CGGTTATTTT GCGGCTGGCG GCCAGTCTGA CTATTCGGGT
AATGCCATGG GTGGGGCCGA TTATGCCAAC GGGTTTTACT TAGACGGTAC CATTTCCACC
GGTGGAGGCA GTGTCACGAT CTATGGACAG GGTAAGAGTG GGGGGACAGG TCGGGGCATC
CTGATTGAAG GCGACAGCAC CATTGATGCC GCCGGAGGCA GCATAATGAT GGTTGGAACG
GGGGAAAAAG CCGAAGGCGT TTTGATAAAG AGTAGCACCG GTAGCGCATC GGCCCCTAGC
ACCCGCATCA TCACCAGTGG TGATGGTGTC ATTGCCATTG TCGGTATGGT GGCGGATAAC
GGTGCCAACG CCTCTGCCGA CGAGTCTGGC GTCGCCTTTG GGGATGACAG TGTCATTGGT
GCCCATTTAC AAGCCGAAGA TGGCATGATT GTCCTAACAG GAACCGGGGG GGATTTCAGC
CCTGGGGATG TGACCAGTGA TACCGCCAAC GCTTCAGGGG TTCGAATTAA TGGCTCTTTG
ATCGAAACCA CCGGCGATGG CAGCATCGGC ATAAACGGGA CCGGTCCTAC GGCTGATGCT
GGGGATGGTG GCGACCGCAA CGGGGTACAT CTTGAAGATG ATACCGTGAT ACGTACCGGT
AGCGGAGTGG GGGGCAGTGG CCTAATTGCC ATCATCGGCT ACGCTTATGG TTTGGGGGAG
GGGGTCGATA CCAATGATGG GGGGACCCAA TCCATCATTT CCGGTTCTGG TGGCGTCGTC
GTGGCCGGCC AGAACTACAC GGGTAGTGCT GACGGTCTAC AACTCTCCGG AACCATCAAT
GCTGTGGGTG GAACCATTGA ATTGTCTGGT ATCGGTGGCG ATGGAACCGA TCTGGCCGAT
GGTGGCATAG GTGTTTACGC CTCCGGGTTG ACGCTGGGCT CGGATGACAC CACCAGCATC
ACCATTACCG GTACCGCTAA AAGTGGCGAT GCAGAAGGGC TCTTTTTTAC CAATACCACC
ACATTGGATA CCTCCGCTAC CGGCTCCATC ACCATTACCG GCTATGGCAC CGGGGTGGAG
CAGGGCTTTC AAAGCAATGA CACCAATACC GTCATCACGG CCGGTAGTGG GGGGTTGACC
ATCATTGGGG ATGATGCCGA AGGCAATGGT TATGTCTATT TAAAGGGCGC CTATACAACG
GAAGGTGGGG ATATCACCAT TACTGGCAGC GGTACCAATG GTAAGCAGGG AATCTATGCC
TATAACGCCA CCTTCGACAC GCTGACAAGT GGTCAAATTA CCTTCACCGG AACCTCTGCG
GATTCTAATG GCATTACTCT TGACGGGGTG ACCGTTGGCT CCGATGCCAC GGATTCCATT
ACCCTGACTG GAACCAGCTT AAGTAACGCT GGTAACGGGA TCTTTATTGA TTCGGGTAGC
AGCTTTGATA CGTCGTCTGA TGGTTCGATC ACCATTACCG GTTACGGTTC AGGTGACAAT
GAAGGGTTTC AAAGCAACAT TAGTAGCAAC AGCTTTACGG CGGGCAGTGG TTTGACCATC
ACTGGGGATA ACACCTACGG CAACGGTGCG GTAAATATCA TCGGCACTTT CACCACCGAA
ACGGGCGATA TTGTGATCAC TGCGGCTGGG GAGGATGGTC AAGAGAGTAT GTATGTTTCG
TATGCCACCA TCAATACGAT CAGCAGTGGA ACGATTACCC TTACCGGTGT CAGTGCGGAT
GATGATGGTA TCTATATTAA AAATTCCACC TTTGGCTCAG ATGCCACCAG CGCAATCACC
ATTACCGGTA CCAGCGAAAG TGGCAATAAT GAGGGGGTCT ATGTCTACTC AGACAGCTCT
TTTGATACCT CATCCAGCGG CACCATTACC ATCACCGGCT ACGGGACCGG TGACCAAGAG
GCGGTGCAAA GTAATGACAG CGATAACAGT TTTACGGCAG GTAGTGGTGG CCTGACCATC
ACCGGTGATG ATGCCCAAGG CAACGGCGCT GTTTATCTTA AGGGAACCTT CACCGCCGAA
GCTGGCGATA TTGTGATCAC TGCGGCTGGG GGGGGTGGCA AAGAGGGATT CTATGGCAGC
AGTGCGACCA TCAATACCCT GACCAGCGGC AACATTACCA TTGTTGCCAG TTCTGCCGAT
GCCGAAGGTT TCTACGCCAA CGCTGTTACC ATCGGTTCTG ATACCACCGA CAGCATCACC
ATTACCGGAA CCAGCAACAG CGGCGATAAT GAGGGCATCT ATCTGGTTGG CGAAACGGAT
ATTGACACCT CATCCAGCGG CAGCATCACC CTAACCGGCT ATGGTTCAGG CGGTGATCAG
GGGTTCCAAA GCAACTACAA CGGTATCACC GTTACCGCCG GTAGCGGCGG ATTGACCATC
ACCGGTGATG ATACCCAGGG CAATGGCGCG GTTTCGCTGC GGGGGACATT TTCCTCGGAA
GGGGGGGATG TCGTTATCAC CGGCGTTGGT ACCAACAATT TTGATGGCAT TTATGGCAAC
GATGTCACCA TTGACACAAC GGGCACCATT ACCCTCACCG GCACCTCAGA CGATGGTAAT
GGCATTAACC TCGCCTACAC AAACATCGGT TCTGACTCCA CCAGCAGCAT CACCATTACC
GGAACCAGCT TAAGTGGCGA CAATAAGGGT GTTTACCTGG AAAATTATAG CACATTAGAG
ACCTCTGCCA GTGGCAGTAT AACCATAACG GGCTATGCCG CCGGTGCATG GCATGGCTTT
GAAAGTAATA CATCCACCAA CAGCTTTAGT GCCGGTAGCG GTGGATTGAC CATTGTGGGG
GATGATACCT ACGGTAACGG AGGTGTTTAC CTGCTGGGGA CCTTTACCGC CGATGGGGGA
GATATTAGCA TCACCGGTGA AGGGGCCAAT GGTATCGAGG GTATCTATGC CGAAGATGCC
ACCATCAACA CCACCACCAG TGGTGCGATC ACCCTGACCG GAAGTTCGGC AGATTTTGAT
GGTATCTATC TCACTGGTGT CTCCATCGGT TCGGACAGCA CCGACAGTGT TACCATTACC
GGTACCAGCT TAAGTGGAGA TAACGAAGCG ATCTTTGTCA CTGGCAGCAG CTCATTGGAG
ACCTCATCCA GCGGCAGTAT GACCCTGACC GGCGCTGCGG CCGGGGCTTG GCAGGGGTTT
CAGAGTTACT CTACAGGCAA CACATTTAGT GCCGGTAGCG GTGGGTTGAC CATTGTAGGG
GATGATACCT ACGGCAATGG AGGCGTCACC CTGGTGGGTA GTTTTACAGC CGATGGGGGA
GATATCACCA TCACCGGCAA AGGGGCCAAT GGGTTGGAAG CTATCTATGC CGAAGATGCC
ACCATCAACA CCACAACCAG TGGTTCCATC ACCCTGACCG GTAGTGCGGC GGATTCGGAT
GGTATTTATC TCTCCTCTGT CTCCATCGGT TCCGACACCA CCGACAGTGT TACCCTGACC
GGCACCAGCC TGAGTGGTGA TAACAGAGGT ATCTATCTAA CCTCCGCCAC GAGCATGGAT
ACCTCTGCCA GTGGTAGCAT GACCATCACC GGCTACGGTT CCGGCTCCAA ACAAGGATTT
CGGAGTCACC ATGTCGACAA CAGCATGACC GCAGGCAGCG GTGGACTGAC CATTGTTGGG
GATAATACCT ACGGTAATGG CGGGCTCTAT CTTAGGGGGA CCTTCACTGC TGAGGGGGGG
GATATCTCCA TTACCGGAGC GGGGGCCGAT GGTCTAAACG GGATCTATGG GGATAATGTT
ACCATTAATA ATATCACCAG TGGTTCCATT ACACTGACGG GTACTTCGTC AGACAGCGAT
GGCCTGTCCG GTTCGGAGAT GGTCATCGGC TCGGATGCTA CCGACAGTGT CTCCATCACG
GGTTACAGTA CCGCTGACTT CTATACTGGA ATCTATTTTG GCCCCTCCAA TACGGTTGAA
TCCGCTGCGG ATGGAAGCAT AACCGTGACG GGTGCGGGTA CTGGGGTGTT TGGCTACGGT
GTTACCTTTG ATGACAGCAC CAGCACCTTT TCTGCGGGTA GTGGTGGTTT GACCATAGAG
GGGGATGCCA CTCAAGGTAA TGGGGCGCTC TTTTTGGCGG GAAGCTTTAC GGCAGTAGGG
GGGGATATTG TCGTAACTGG TGACGCGACG AGTATACTCG CGGGTATTCA TGCGATAGCC
GCGACCATGA CCACCACCAC AAGCGGTAAT ATCACCCTGA CCGGCACCTC GGAAGGAGGG
GATGGTATCT ATTTTGGTTC CTCTTCGGCA TCGACGCTGA CCACAGCGGC CAGCGGAACC
ATGACGGTAA CTGGGACGGG GAGTGTCGGG GGTGAAGGGG TGGCGATCCA CAACACCACA
CTCTCCGCTG GAAGCGGTGG ATTAACTGTG ACAGGTACAG GTGGTGACTC TGGTCACGGG
GTCTACCTTT GGGAAGACAC CACCCTACAG GCCACCAGTG GCGATATCTC CATTGTGGGT
ACGGCCGGTA GCACCTCGGC CAATGGTGTG ACCATTGATA GCGATGATGG CTTAGTTGTT
GTTCAAACGG TTACGGCTGG CCATATTACC ATCACCGGTT CATCGACTAT GGATGATGGT
ATTGAGGTGC GCTCCGATAC CGGTCACACC GCTACCATCT CTGCGGCGGG TTCAGGGAAT
ACCACCCTGG AGGGAACGGC GTCCAATAGC GATGCCGATG ATCGGGGTAT TGAGCTCAAC
GATGTCGCCA TCACCACGGA ATCGGGTGAT ATTACCCTTA CCGGGCAATC CGCCAGCAGC
AGTGAAGGGA TTGGAATATC CGAAGGCAAT GTGAGCATCT CCTCAACGGA TTCAGGGAGC
ATTACCCTGA TCGCTGATCG GGTGGATCTG ACAGGCTCAT CCAACAGTAT CAGTAGTAGT
GGCGTGTTGC TCATTCAACC CTATAGCGCC AGTAGCGCCA TTGAGATTGG TGGTTCAGGT
GGTGATCTTA ATCTGGCCGC CAGTGTCTTT TCCGATACCC TCGCCGATGG CTTTAGCTAT
ATCCAGATTG GGGATTCTGA CAACAGCGGC GGAATCACTG TGGCAGGTGC AACCTCTGTG
GCGGATAGTC TGCGTCTGAT CCAGGGCAGC GGTAACATTA CCCTCAATGA CAGCCTGACC
ATCAGTACCG CTGGCGATTA TCTACAACTG CACACCACGG GGAGCGGTTC CCAAGGGGGG
GGCGGGGCCA TTGTGGCGGA TAACTTGGAG TTGTTGGGTA GCGGTGGGAG CTATGTCTTG
ACCGCTGCTA CCAGCAGTAC GGGTAATAAT GTGGCCACTT TGGCGGCGGA TACTGGCAGC
GTCACCTATA AGGATCAGGA TGCCCTGACG ATCAACAGTG TCAATACCAC AACAGGTATT
ACGGCCACGG GTCGCATCGC CATTACCACC ATGAGCGATG CCGATGCGGC GGATCTGACC
CTCTCTGGGG ATCTGTCCAC CACCGATACC AGCACTACGG CCATTGTTTT GAATGCAGGA
GAAGATGAAG CTGCGGGACA AACCACCCTG GCCGATAGTC GGGCTGATAT CCTGTATGAC
TATGTTATCA TCTCTACCGG AGCTGATGGA GTTGTCACCT TACTGACGGG CAGTATCAAT
GGCAGCAGTG CCTTGGCCAC GGCGCTTGGG TCGGGCAGTG GTCGCTTCCG TTATGGCAGT
GATGAGACCA CAACCAACTA CATGACGGCA CTGGAGAGTG GCGTTAACGT CGTCTACCGT
GAGCAGCCAA CCTTGACCCT CACGCCAGAT GCCTATGAGA CCAGTTATGG GGATGGAGTG
AACCCCACGG CGTTCTCCAT GAGTTCAGGA ACCTTGGAAA ATGGCGACTC CTTTATAGAT
CCTGACAGCT ACACCATTGC CTCAACGGGT AGTTCCAGCA GCAATGTGGG CAGCTATGAC
CTCTCTTTCT CCAGCTTGTC GTCTACCAAC AGCTTGGGCT ATGCCTTGAG TGGCGCCACC
CGTACAGATG GGCACACCAT TTCTACGGCC GCGTTGACCA TCACCGCCGA GAATGACAGC
AAGACCTATG ATGGCGATGC CTACAGCGGC GGTAATGGGG TGAGCTACAG CGGCTTTGTC
AATGATGAAG AGAGTGCGGT GCTGGGGGGA TCCCTTAGTT ACGGCGGAAC ATCGCAGGAT
GCCACCGATG CCGGAAGCTA CACCATTATT CCCAGCGGTT TGACTTCATC CAACTATGCG
ATCACTTTCA ATAATGGCAC CCTCACGGTC AATAAGGCCG CTTTGAGTGT AACCGCCGAG
AACGACAGCA AGACCTATGA TAGCGAGGCT TACAGCGGCG GTAATGGGGT GAGCTACAGC
GGCTTTGTGG GGGATGAGGA TAGCGCGGTG TTGGGTGGCT CCATTAGTTA CGGCGGAACA
TCGCAGGATG CCACCGACGC CGGGAGTTAC AGCATCACGC CCAGCGGTTT AACCTCGGAT
AACTATAATA TCTCGTTCAA TAATGGCACC CTGACTGTCA ATCAAGCCGC ATTGAGTATC
ACTGCCGAGA ACGACAGCAA GACCTATGAT AGCGAGGCTT ACAGCGGCGG CAATGGGGTG
AGCTACAGCG GCTTTGTGGG GGATGAGGAT AGCGCGGTGT TGGGTGGCTC CATTAGTTAC
GGCGGAACAT CGCAGGATGC CACCGACGCC GGAAGTTACA GCATCACGCC CAGCGGTTTG
ACTTCATCCA ACTATGCGAT CACTTTCAAT AATGGCACCC TCACGGTCAA TAAGGCCGCT
TTGAGTGTGA CCGCCGAGAA CGACAGCAAG ACCTATGATA GCGAGGCTTA CAGCGGCGGC
AATGGGGTGA GCTACAGCGG CTTTGTGGGG GATGAGGATA GCGCGGTGTT GGGTGGCTCC
ATTAGTTACG GCGGAACATC GCAGGATGCC ACCGACGCCG GGAGTTATAC GATTATTCCA
AGCGGCTTAA CCTCTTCAAA CTATGAGATA ACATTCAATA ATGGCACCTT GACGGTCAAT
AAGGCCGCTT TGAGTGTGAC CGCCGAGAAC GACAGCAAAA CCTATGATAG CGAGGCTTAC
AGCGGCGGTA ATGGGGTGAG CTACAGCGGC TTTGTGGGGG ATGAGGATAG CGCGGTGTTG
GGTGGCTCCA TTAGTTACGG CGGAACATCG CAGGATGCCA CCGACGCCGG GAGTTATACG
ATTATTCCAA GCGGCTTAAC CTCTTCAAAC TATGAGATAA CATTCAATAA TGGCACCTTG
ACGGTCAATA AGGCCGCTTT GAGTGTGACC GCCGAGAACG ACAGCAAAAC CTATGATAGC
GAGGCTTACA GCGGCGGTAA TGGGGTGAGC TACAGCGGCT TTGTGGGGGA TGAGGATAGC
GCGGTGTTGG GTGGTTCTCT CAGCTATGGT GGAACATCGC AGGATGCCAC CGACGCTGGA
AGTTACAGCA TCACGCCCAG CGGTTTAACC TCGGATAACT ATGAGATCTC GTTCAATAAT
GGCACCTTGA CGGTCAATAA GGCCGCTTTG AGTGTGACCG CCGAGAACGA CAGCAAGACC
TATGATAGCG AGGCTTACAG CGGCGGTAAT GGGGTGAGCT ACAGCGGCTT TGTGGGGGAT
GAGGATAGCG CGGTGTTGGG TGGCTCCATT AGTTACGGCG GAACATCGCA GGATGCCACC
GACGCCGGGA GTTACAGCAT CACGCCCAGC GGTTTAACCT CGGATAACTA TAATATCTCG
TTCAACAATG GCACCTTGAC GGTCAATAAG GCCGCTTTGA GTGTGACCGC CGAGAACGAC
AGCAAAACCT ATGATAGAGA GGCTTACAGC GGCGGTAATG GGGTGAGCTA CAGCGGCTTT
GTGGGGGATG AGGATAGCGC GGTGTTGGGT GGCTCCCTCA GCTATGGTGG AACATCGCAG
GATGCCACCG ATGCGGGAAG CTATACGATT ATTCCAAGCG GCTTAACCTC TTCAAACTAT
GAGATAACAT TCAATAATGG CACCCTGACT GTCAATCAAG CCGCTTTGAG TGTGACCGCC
GAGAACGATG GCAAGACCTA TGATGGCAAC GCCTACTCTG GCGGTAATGG GGTGAGCTAC
AGCGGCTTTG TGGGGGATGA GGATAGCGCG GTGTTGGGTG GCTCCCTCAG CTATGGTGGA
ACATCGCAGG ATGCCACCGA TGCGGGAAGC TACAGCATCA CCCCCGGCGG TTTGACCTCA
TCCAACTATG CGATCACCTT CCATGATGGC ACCCTGAGCG TCAATCAGGT GGGTTTGACC
GTCACCGCCA ATGATGACAG CAAAACCTAT GATGGAGAGG GGTATACGGA TGGCAATGGG
GTTGTGTATA GCGGTTTTAT TGATGATGAA GACAGTTCGG TGCTGGGGGG CGAACTGACC
TATACGGGAA CCTCTCAAGG GGCGGTTGCT GTGGGTACCT ACGCCATCAT GCCCAGTGGC
TTAAGTGCGA CAAACTATAG CTTTAGCTAT GTGGCGGGCT CCTTGAGCGT GCTCCCCAAA
AGCTATGAGA CCAACACGCC TGATACTGAG GCGGATCTGG TGGTGGCCGC ACCCACCCAG
CTGGAAGAGA TCGCCCTTTC CATCCCCCAA ACAACCCAGT CTATAGGTGA GCAGGATCAA
ATCTTCTCGG TTGTCTCAGG CAATGAAACC GTCGTTTTTC TAAAAGATGC TCAAAATGGT
GACATTGGTG GCGGCATGGA GGATCCCAAG CCGGTACAAG TGGTGGCCTA TCGCGCCGAT
ACACCACCCC GTGTGGAGGA TGGATTTAGC GTCCAAGCGG GACAAAATGC CATTCGCTTG
CGGCCCCTTA ACCGTGTGGA TCAAGAGATC ACCAGCCCTG GTGAGGCGGT TTTTGCACTT
GGTTTTACGG TCCAGGGGGC GCAGGGAGAG GTTTCATTCT CGGTTAATCA GACCCAGCAG
GGGATTGTGA TCAAACCCAA TGGCCAAGCC GCAGCCACCC TGCTGGAGGG TCGACGTGAC
AAGGTGATTG GGGTTGCGCT GTTGCAGTTG CGGCAACAGG GCTCGGTTGC GCTTGAGCAG
CTGAAAACGA TCTATCTGGA TCTGCCATAA
 
Protein sequence
MHIDHKRECS VQKTAIFPRR RGLLSLLTLL LLPTPRTDLW AAPAMDALPT GGNITVGEGS 
IYQSGVNMVI EQTSSKLIAN WESFQIGADA QVTFQQPSAS SIALNRVMSA GASEIFGRLN
ANGQVFLINP SGVIFGQNAR IEVGGLVAST LDLSDDDFMQ ENYRFSAGES TGSIENWGSL
NAAEGGYIAL LSPRVRNEGD ISAPKGTAVL AAGDEVSVDL YGDSLIKLTV HKGTLDALAE
NKGLVMASGG VALLSAKGVD AVHRAVVNNS GTVEAGGITT QGGRIFLTAA GGDVTNAGTL
DVTSEQAKGG SITMTGETTT IASGARLNAS GATGGGEIQV GGSWQNSNPD VAQSQQTVVE
QGAELKANAT DSGDGGTVVV WSDIHNVEGK TRVHGVVEAK GGEQGGDGGQ VETSGHHLDV
TGIQLALDAP KGAGGLWLLD PNNIEITNAT SNLDDSSAPT YTSTDDGSQI DVATIEAQLN
AGVVVKIQTA SAGTNAEDGD IIINADISKT SGDDASLILY AYGNIVMDGH SITSISNELG
VAFYADYNSS SDGAIVISGD SSITTNGGYF AAGGQSDYSG NAMGGADYAN GFYLDGTIST
GGGSVTIYGQ GKSGGTGRGI LIEGDSTIDA AGGSIMMVGT GEKAEGVLIK SSTGSASAPS
TRIITSGDGV IAIVGMVADN GANASADESG VAFGDDSVIG AHLQAEDGMI VLTGTGGDFS
PGDVTSDTAN ASGVRINGSL IETTGDGSIG INGTGPTADA GDGGDRNGVH LEDDTVIRTG
SGVGGSGLIA IIGYAYGLGE GVDTNDGGTQ SIISGSGGVV VAGQNYTGSA DGLQLSGTIN
AVGGTIELSG IGGDGTDLAD GGIGVYASGL TLGSDDTTSI TITGTAKSGD AEGLFFTNTT
TLDTSATGSI TITGYGTGVE QGFQSNDTNT VITAGSGGLT IIGDDAEGNG YVYLKGAYTT
EGGDITITGS GTNGKQGIYA YNATFDTLTS GQITFTGTSA DSNGITLDGV TVGSDATDSI
TLTGTSLSNA GNGIFIDSGS SFDTSSDGSI TITGYGSGDN EGFQSNISSN SFTAGSGLTI
TGDNTYGNGA VNIIGTFTTE TGDIVITAAG EDGQESMYVS YATINTISSG TITLTGVSAD
DDGIYIKNST FGSDATSAIT ITGTSESGNN EGVYVYSDSS FDTSSSGTIT ITGYGTGDQE
AVQSNDSDNS FTAGSGGLTI TGDDAQGNGA VYLKGTFTAE AGDIVITAAG GGGKEGFYGS
SATINTLTSG NITIVASSAD AEGFYANAVT IGSDTTDSIT ITGTSNSGDN EGIYLVGETD
IDTSSSGSIT LTGYGSGGDQ GFQSNYNGIT VTAGSGGLTI TGDDTQGNGA VSLRGTFSSE
GGDVVITGVG TNNFDGIYGN DVTIDTTGTI TLTGTSDDGN GINLAYTNIG SDSTSSITIT
GTSLSGDNKG VYLENYSTLE TSASGSITIT GYAAGAWHGF ESNTSTNSFS AGSGGLTIVG
DDTYGNGGVY LLGTFTADGG DISITGEGAN GIEGIYAEDA TINTTTSGAI TLTGSSADFD
GIYLTGVSIG SDSTDSVTIT GTSLSGDNEA IFVTGSSSLE TSSSGSMTLT GAAAGAWQGF
QSYSTGNTFS AGSGGLTIVG DDTYGNGGVT LVGSFTADGG DITITGKGAN GLEAIYAEDA
TINTTTSGSI TLTGSAADSD GIYLSSVSIG SDTTDSVTLT GTSLSGDNRG IYLTSATSMD
TSASGSMTIT GYGSGSKQGF RSHHVDNSMT AGSGGLTIVG DNTYGNGGLY LRGTFTAEGG
DISITGAGAD GLNGIYGDNV TINNITSGSI TLTGTSSDSD GLSGSEMVIG SDATDSVSIT
GYSTADFYTG IYFGPSNTVE SAADGSITVT GAGTGVFGYG VTFDDSTSTF SAGSGGLTIE
GDATQGNGAL FLAGSFTAVG GDIVVTGDAT SILAGIHAIA ATMTTTTSGN ITLTGTSEGG
DGIYFGSSSA STLTTAASGT MTVTGTGSVG GEGVAIHNTT LSAGSGGLTV TGTGGDSGHG
VYLWEDTTLQ ATSGDISIVG TAGSTSANGV TIDSDDGLVV VQTVTAGHIT ITGSSTMDDG
IEVRSDTGHT ATISAAGSGN TTLEGTASNS DADDRGIELN DVAITTESGD ITLTGQSASS
SEGIGISEGN VSISSTDSGS ITLIADRVDL TGSSNSISSS GVLLIQPYSA SSAIEIGGSG
GDLNLAASVF SDTLADGFSY IQIGDSDNSG GITVAGATSV ADSLRLIQGS GNITLNDSLT
ISTAGDYLQL HTTGSGSQGG GGAIVADNLE LLGSGGSYVL TAATSSTGNN VATLAADTGS
VTYKDQDALT INSVNTTTGI TATGRIAITT MSDADAADLT LSGDLSTTDT STTAIVLNAG
EDEAAGQTTL ADSRADILYD YVIISTGADG VVTLLTGSIN GSSALATALG SGSGRFRYGS
DETTTNYMTA LESGVNVVYR EQPTLTLTPD AYETSYGDGV NPTAFSMSSG TLENGDSFID
PDSYTIASTG SSSSNVGSYD LSFSSLSSTN SLGYALSGAT RTDGHTISTA ALTITAENDS
KTYDGDAYSG GNGVSYSGFV NDEESAVLGG SLSYGGTSQD ATDAGSYTII PSGLTSSNYA
ITFNNGTLTV NKAALSVTAE NDSKTYDSEA YSGGNGVSYS GFVGDEDSAV LGGSISYGGT
SQDATDAGSY SITPSGLTSD NYNISFNNGT LTVNQAALSI TAENDSKTYD SEAYSGGNGV
SYSGFVGDED SAVLGGSISY GGTSQDATDA GSYSITPSGL TSSNYAITFN NGTLTVNKAA
LSVTAENDSK TYDSEAYSGG NGVSYSGFVG DEDSAVLGGS ISYGGTSQDA TDAGSYTIIP
SGLTSSNYEI TFNNGTLTVN KAALSVTAEN DSKTYDSEAY SGGNGVSYSG FVGDEDSAVL
GGSISYGGTS QDATDAGSYT IIPSGLTSSN YEITFNNGTL TVNKAALSVT AENDSKTYDS
EAYSGGNGVS YSGFVGDEDS AVLGGSLSYG GTSQDATDAG SYSITPSGLT SDNYEISFNN
GTLTVNKAAL SVTAENDSKT YDSEAYSGGN GVSYSGFVGD EDSAVLGGSI SYGGTSQDAT
DAGSYSITPS GLTSDNYNIS FNNGTLTVNK AALSVTAEND SKTYDREAYS GGNGVSYSGF
VGDEDSAVLG GSLSYGGTSQ DATDAGSYTI IPSGLTSSNY EITFNNGTLT VNQAALSVTA
ENDGKTYDGN AYSGGNGVSY SGFVGDEDSA VLGGSLSYGG TSQDATDAGS YSITPGGLTS
SNYAITFHDG TLSVNQVGLT VTANDDSKTY DGEGYTDGNG VVYSGFIDDE DSSVLGGELT
YTGTSQGAVA VGTYAIMPSG LSATNYSFSY VAGSLSVLPK SYETNTPDTE ADLVVAAPTQ
LEEIALSIPQ TTQSIGEQDQ IFSVVSGNET VVFLKDAQNG DIGGGMEDPK PVQVVAYRAD
TPPRVEDGFS VQAGQNAIRL RPLNRVDQEI TSPGEAVFAL GFTVQGAQGE VSFSVNQTQQ
GIVIKPNGQA AATLLEGRRD KVIGVALLQL RQQGSVALEQ LKTIYLDLP