Gene Mboo_1172 details

Gene Information

Locus tag: Mboo_1172
Symbol:
ID: 5410441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1171606
End bp: 1183845
Gene Length: 12240 bp
Protein Length: 4079 aa
Translation table: 11
GC content: 57%
IMG OID: 640868398
Product: TPR repeat-containing protein
Protein accession: YP_001404333
Protein GI: 154150715
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
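
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1171606
end_bp = 1183845
gene_length_bp = 12240
protein_length_aa = 4079

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. The final codon is
# the stop codon, which is not counted in the protein length.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 12240 0 4079
```

Under these assumptions the reported gene length (12240 bp) and protein length (4079 aa) agree with the coordinates.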


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGGA GTGACGCTGA AGCGCTGATG CGGCAGGGTA CCGAACTCTA TGACCTGGGC 
CGGCACCAGG AAGCCGTGGT CATGTTTGAC CGGGCCCTCA CCCTCTTTCC CAAGCTGCCA
AAGGCCCATT ATTTCAAGGG AATTGCCCTG TATGACCTTG GGAGGTACGA GGATGCTCTC
GATTCCTACG ATCACGCACT CGCGCTCGAC CCCTCTGATA TCAACTCCTG GTACAACAAG
GCAGCGACAC TTGCGCAGAT CGGCAGGAAT AAGGAAGCGC TCGATGCCTG CGACCGGCTT
ATCGCGCTCC GGTTCGATAA CGCGGAGGCG TGGATCCTCA AGGGAATATC CCTTTATGAA
CTGGGGCGAT TCCGCGATGC AATATCTGCC TATGATCATG CACTTGCCAT CGATCCCACC
TATGCAAAGG TCTATTATAA CAAGGGAATC GCCCTTGCGG ATCTCGGCCG GCATGATGAA
GCGATTGCCG CATACGGGAA AGCTGTCGGG ATCGTTCCTG AATATGCAAA GGCATACTAC
AACATGGGCA TATCCCTGTA CGAGATCGGG AGGTACGATG AGGCCCTTGG TGCATTCGAG
AAGGCGCATG ACCTTGATCC TTCCGATCCA TGGGTCTGGT ACTACCGTGC GTTTATCCTC
GCAAAGCAGG AGCGGTATGC CCAGGCCGCA GAAGCCGCCG GGGTCTTTCT CTCTTTTGAG
CCGGAGCACG CGGATATCTG GGTGATCCAG GGCATTTCCC TGTACCGGCT CCGGCGGCTT
GATGAGGCGG CAGACGCGTT CGATCGCGCT ATCGAGCAGG ACCCCCTCGC ACCCGATGCG
TGGCTTTACA AGGGCTTCTC ATTATTCGAT ATGGAGCGGT ACGAGGACGC CACGTATGCT
CTCGACAAAG CTGCAGAGCT TTCACCGCAA ACCACGAAGA TTTACTATAC CCGGGGCAAA
GCCAACCAGC GTCTTGGCAA ATACCGTGAA GCAGTTGCTG ATTTCGACCG GGCGCTTGCC
GCGGAACCGG AGAACGCCGA TGCCCTGTAC AGCCGCGGAG TTTCCTGTAT CCACCTGAGC
CGGTACGATG AATCCCTCAG TGTGTTTGAC CGGATCCTGG CTTCACAGGG AGATCATGCC
GGTGCCTCCT ATTTCCGGGG GGTGGTTCTT TCCCGTCTTG GCAGGCAGGA TGAGGCCATC
AGTGCATTCG AACACACGCT TGCCATTGAC CCGGGATGCG CATCGGCCGC CTACCAGATA
GGTCTTGCCT CGGCAAGCCT GGGACGTTAC AGCGATGCTG TTGCTGCGTA TGACCGTGCA
CTGAAGATCC GCCCCGATTA CCCGGATGCT GTGTACCACA AGGGCTTTGC GCTTGCAAAA
CTGGGTAACA GCGAAGATGC CCTGCTCGAA TTTGACCGGG CCTTAACGGA GAACCCGGGC
AATGCCCCGG CATACCACCA GAAAGGCCAG CTCCTGGTCA GGACCGGCAG GCTCGAAGAA
GCACTGGAGG CCCTGAATAA GAGTATTGCG CTGAAGCCGG ATAACGCACA GGTGTATTAC
GATAAGGGCA GCGCGCTCCT GAAGGCCGAG CGGTTCGGTC CCGCCCTTGA GGCATTTGAT
CAGGCCATCG GGATCTATCC CAATTACGTG AATGCGTACT ACAACAAGGG GATCGCTTTT
TCCCGCACAG GAATGCGCAA AGAAGCACTT GAGGCGTTCG ATCATGCCAT AGCGATCGAT
CCCACTCATA CGCTTGCCCT GTACCATCGG GGAACCATGC TCTCCGGACT GGGACGGTAC
GCTGATGCGG CCGCTGCGTA CGATGCCGTG CTTGCCCTCT CTCCCCAGAA TACTTCCGCA
CTCTATGAAA AAGGCGTAGC GCTTATGCAA CTTTCCCGGT GGAAAGATGC AGCCGAGGCA
TTCGGGCAGG CAGTCGAACA GGACCCGGGC CTTATTGACG CCTGGCTGGC TTTTGGAACA
TGCAATGCAA ACCTGGGGAA ATTCCCTGAT GCGATTGCTG CTTTTGACCG GGTAATTGCA
CTTTCTCCGA AAAATACGCA GGCATTTATC CACAAGGGTA TTGCGCTAGT AACTACCGGA
AAGTTCGAGG AAGCAATTGC CGCACTCAAC CGCGCGCTGG AAGATGCCCC CCGTGACGAA
CGGGCATGGT ACTACAAGGG CATGTCCCTT GCAGCCCTCC AGCGTTTTGA AGAGGCGGTC
CGGTCATTTG AGCGGGTACT GGAGATCAAC CGCCGGTGCT CCCCGGCATT TTTCCAGAAA
GGAAATGCCC TCGCCCATCT CGGAAAACAG CTGGAAGCCA TTATTTCGTA CGACCAGGCA
CTGGAGATTG ATCCTGATAA CCCGGTCACG CTGTACCAAA AGGGAATAGC GCTTGCACAG
CGTGAGAGGT ACGACGATGC AATAAAAACG TTCGAGCGAT TGCTCACGCT CGAACCGGAG
AATGCCCAGG CACTCTATTA CCTTGGCATC GCGTATGCAG GAAGGCAGCG GTTCGATGAG
GCTATCGTTG CATTCGAACG GTCCCTTGAG ATCGATCCAA AAAATCCCCT TGCCCACCAT
TATATGGGAG TTTCACTGGT GGAGTGCGAT AGGTATGACG ATGCCCTCCG CTCGTTCTCC
GAGGCCCTGT TGCTTGATGC GTCAAATGCA TCCACCTATT ATTACCAGGG GATTGCCTTC
CTGCAGTCCC ACCAATACGA GGAGGCTATC GCCGCGCTTA ATACGGCAAT CCGCATGGAT
ACATCGCTTT CCGATGCGTT CACGTACCTT GGCATTTCCC TTGCCCGGCT CGGACGCCAT
GACGAAGCAG TTGCCGCTCT CAACCGGAGC CTTGCCGCAA ATCCCTCGCA GATGGAAGCG
CTTGTCTGCC GGGGCGAATC CCTCATGGTC CTCCAACGGT ATGCTGATGC CGTAGAGACG
TTTGACCGGA TCCTCTCTTT AAACCCGAAC GTGATTTCGG CATGGATGCA GAAAGGAGCC
GCGCTCGAAC GCCTGGTTAA AAAGCAGGAT GCACTGGCGG TCTATACCCG GGTCCTTGAG
ATCAACCCGG GTAACGCAGA TGCATGGGCC CGCAAGGGTG TCCTGCTCCA GGATCTGGGA
AGAACCGCTG AAGCGGTTAC CGCATTTTCA AAAGCTCTCG ATATCAATGC CGGTATCGGT
GGGATCTGGA TGCACAAGGG CGATGCCCTC AGCACGCTGG GGAAGACTTC GGAAGCTGCA
GAAGCATATG CCGAGGCCTT AAAACTGGAT CCCGATCAGG AAGAGGGATG GATTAAGGGA
GGAAGGGCCC TTTTTGATCT GGGACGGTAC CAGGATGCTA TTGATGCATT CGATAATGCG
ATCGCCCTCA ACCAGCGGAG CACTGTTGCG TTCCTGTACA AGGGATTTTC TCTCGAAAAG
ATCAACCGGG CCGGTGAGGC ACTCCAGGTC TTTGAAGTAC TTCTGGAAAT CGATCCGCAC
AACAGCGAAG CCCATTATCA CATGGGCCTT GCACTTGCCG GATCAGGCAG GCCAAAAGAT
GCACTTGCTG CCTTTGAATC TGCACTAAAG ATCCGGGACA CCTTTGCCCC GGCGTGGTAC
AACAAGGGAA AGATGCTCCT TGACCTGGGA AAATACCAGG AAGCCCTGGC TGCATTTGAC
CAGGCGCTCG AACGCGAGCC GGCCTATACT GAAGTTTTTT ACAGCAGGGG GGTGGCGCTT
TCCAAGCTCG GGCGTTTCCC CGAGGCAATA GAAGCGTTTG AGAGGAACCT GGAAAAAGAT
ACCAGCAACG CCCCGGGCTA CTATTTCAAA GGAATTGCCC TCTCCAAACT GGGGCGGTAC
CAGGAAGCAC TGGATGCATT TGACCGTGCG CTTGTGTACG ATCCCGAGAA TGCCCTTGTC
TATTTCCAGA AAGGCCGGGC GCTGGACGGC CTGAACCGGT TTCAGGAAGC GGTTGCTGCA
TTCGAAAAAA CCCTTGCGCT TAAACCCCGG TATTCTGAAG CGCGGATGCG CAAGGGCATC
TCGCTCTACA ACCTGGGCCG GTATGCTGAT GCTATCCGCG ACTTTGACCG GACCATTGCC
GAGAACCCCC ATAATTTCCA TGCCTGGTAC CAGAAGGGCA GGGCCCTTTT TGACTCTGGC
AGCTATACCG AAGCGATCGA TGCTTATGAT CGTGCACTCG AAGTGGAATC GAGCTACCCA
GAGGCCCATT ACCACAAGGG TCTTGCCCTG TACGAGCTGG GCAGGTACGA GGAGGCCCTC
CTGTCCTATG ATCAGGCGCT TGAGAGCAAC CCCCATCTTG ATTATGCCCT CTTCCACCGG
GGCGCAGCCC TCATGAAACT GGAACGCTAC CGCGAAGCGG TACAGGCGTT TGATGCAGCT
CTTTTGCTCC TTCCGAAGTA TGCACCGGCC CACCACCTCA AGGGTGTATC GCTTGCCGCA
CAGGGACTGT ACCAGGATTC CATTTATGCA TATGACCGGG CGCTTGAATG CGATCCGGGT
AGTGGTGAAT CTGCCCTGAA TAAGGCCATG TCCCTCCACA ACCTGGGGCA GGACGAGGAC
GCACTTGCAG CTGCCGTTAA GGCTATCGAG ATCCAGCCGG ATTTTGCTGA GGCCTGGAGG
TATCGGGGTC TGATCCTTTC CAATCTGGGA AGATACCAGG AATCTGTCGA AGCCCTGGAT
CACGCACTTG CCGGAGATCC AAAGAATGCC CGGGTCAACT ACCAGAAAGG CCGGGCTTTT
GACGGTCTCG GGCAGTACGA AAACGCAATC TCTGCATACG ATGCCGCGCT TCAGGCACAA
CCGGATTGTA TCCCTGCACG TATGCACAAG GGAGAGGCGC TCCTCTTCAT CTCCCGATTC
CGCGACGCAA CCAAGGAGTT CGGGAAGATC CTCACCGAAC ACCCGGATAA TGCCGAAGCC
TGGATTAAGA TGGCCCGGGC CCGGTTTTCA CTTGGCGACT ATACCGAGGT TATTGAGGCA
TGCGATCACG CGCTCCGGTT CAATGCAGAC TCTGCAGAGG CCCTTTTGTA CCGCGGCCTT
GCGCAGTACG AACTTGGGAG GTACGAAGAG GCAGTCGAGT CCCTGGCCCG CGCGGAACAG
ATAGATTCCC ATCTTGAACA GGCGGTTTAC CATCTCGGGG CTGCGCTGCT GAAACTCGAG
CGGTACGGCG ACGCAATCCC GGCGTTTGAC CGGGTGCTTT CCCTTAAGCC GGATCAAGCC
ACTGCCCATC ATCTCCGTGG TGTCGCACTT GCTGCACAGG GTATGTACCC GGAGGCAATT
TCCTCCTTTG AAAATGCACT CCGGTACGAT CCCCGGAGCG CCGAATCGGC GCTTAACAAG
GCCATTGCCC TGCACAGCCT GGGCAGGGAC GAGGAATCGA TCCTGGCTTC AGATATTGCC
CTTGGGATCC AGCCGGATTT TGCCGAGGCC TGGTATTATA AGGGCGTTGC CCTCGAAACC
CTGAAAAGGT ACGCCGATGC CGTGCCGGCG TTCTCACGTT CACTCGAACT GGACAGCACT
ACCACCCATG CCTGGTTTGA GATGGGGCTC TGCCTTGTCG AACTCCAGCG TTACGAAGAA
GCCGCAGGTG CGTTCGATCA TGTCCTTGGC CTGGTGTCCG ATTATCCCCC GGCATACTTC
CACAAGGGGA GAGCACTTGC CCTCCTTGGA AAATACGAAG AGGCTGTTGT GGCATTTGAC
AGCGCCCTTG CAATAACCCC GGGAGATGCG ATCGTCCTCT CTGCAAAAGG ACACGCGCTT
GAATCCCTGA AAAAGTACCG GGAGGCAGCG GCTGCATTTG AGGAAGCGAC ATCTGTCAAT
CCGGCGGCCG CAGACGATTA CTATCACCTG GGTCTTGCAT ATATCGAGCA GCACCGGGAT
GAAAAAGCAA TCGCGGCTTT TGCAAAAACG CTCAGGATTG ACCCGGAGAA TCCCGACGCC
CTTTTCCAGG CCGGTATTGT GCTTGCCCGT CTGGAAAAAT ACGATGAGGC GATCGGGCTT
TTTGACCGGT ATCTTGAATT GGGAAAGGAG AATGCCGGGA TCCTTTACGA GAGAGGCTGT
GCATACTTCG CGCTCCAGAA ATACAGTGAG GCAATTGCCT CGTTTGACCG TGCGCTTGCG
CTGGATGCAA ACCACATCGG TGCCCTGGTC AAGAAAGGCC AGTCACGTGC AAATCTGGGG
CAGTACGAAG AAGCAGTAAC CCTCTTTGAC CGTGTTATCA CGCTCGACCC TGAGAATGTC
ATCGCACATT TTGTCATGGG CACTGCACTG GCCCGCCTGG CCCGGTACGA GGATGCCGTG
GTTGCGCTTG ACCGGGCTCT TGAATATGAC GGGAACAATG CACGCATTTA TGCCTGCAAG
GGATACTCCC TGTACCGCCT TGGCCGGTTC AAGGAGTCTG CAGAATCCTT TGCAAAAGCC
CAGAAACGCG AGCCAAAGGA TCCCTTCAGC CTGAGGTTCC GGGGCAAATC CCTCCTGCAT
AACGGGAAAT GGGAGGAAGG CATAGCGATC TTCGACAAAC TGCTGGGGAT AGAGCCAAAG
AGTGCTGATG CCTGGTATTA CAAAGGGATT GCCTATTCGC ACCTGTCCCT CCATGATGAG
GCGCAGGAAT CCTTTGAGCA GGCCCTGACA ATTGACGGGG AGTGTGCTAC TGCCTGGTAC
CAGAAAGGGC TGGTGCTCTT TGAACGGGAG CGCTTCGAAG AGTCCCTGCC GGCATTTGAA
CGGGCAGCAG AGCTCGCCCC GTCAGTACAG GACTATGCAT TCAGGAACGC ACTTTGCCTC
TTTATGCTGG AGAGGTACCC GGAGGCCATA TCCGCCTTCG ATCGTGCCCT GACTCTCGGT
CCCGAGACTG CGGTTATCCA GTACTACCGC GGCCGGGCCC TTGCCGAGAT GCGGGATTAC
GGTGTGGCAC TTGATGCGCT CAACCGGGCC ATTGGCCTTG ATCCGGAAAA CTCATTTACC
TGGCTTGCCA AAGGCAGCGT GCTGCTGGCC CAGAAAGACG GGGCCGCGGC AGTTGCAGCA
TTCGACCAGG CCCTGGTGCT GGATCCCAAA GCCGCAGACG CAGCGTTCTT TAAAGGCGAG
GCTTTTTCAC TTCTTGGAAA CGATGAGGAG GCCATACATG CGTACGACCT TGCGCTCAGC
CTTGAATCCG CATATCCTGA AGGATCGTTT AAGAAAGGGC TTGCCCTTCT CCGGCTGAAA
AATTATAACG GCGCGATCGA GGCATTTGAT GCGGCGATCC AGTTCGTGCC CGGGCATGCA
CAGGCACATT ACCACAAGGG ACTCGCCCTG TTCGCACTGG GTAAGAATGA GAAAGCGATA
CGCTCCTTTA CTCACGCGCT CGAACACGAT CCCTCCCTTT CCGATGCGTT ATTCCACACC
GGTCTTGCAT ATGCAGCGCT GAGCCGGTAC TCCCCTGCAC TTTCTGCGTT TGATAAACTT
CTTGAATCCG GGCCGCAGAA CGCTGAGGCA TTGTTCCAGA AGGGAAGGAT GCTTGCAAAA
CTGGGCCGGC CTGATGAGGC CCTTGCAGTT CTTGAAACCT CCCTTGGCCT GGAAAACAAC
ATTGCCGATG TCTGGCTGCT TAAGGGAAGC GTACTTCTTG AGCAGGAGCG TCTGGAAGAT
GCACTCGAAG TGTTTGACCG GGCATTGGCC CTCACACCGG AGAATAATGC TGCATGGTAC
CGGAAAGGCA AGGCATTCTC CGGCCTGCAC CGGTACCCCG AAGCAATCCA GTGCTTTGAC
CGCGTGGTTA CTTCCGATAC TGGTTGTGCA CAGGCATGGT TCCGGAAAGG GAGTGCGCTT
CTTTCAAACG GGGATCTGAG GGCTGCTATC GAAGCGCTCA CAAAAGCGCT CGAACTCAAA
CCCGACAATG CAAACGGGTG GTACGACCGG GCAGTTGCAC TTGCGGGCCT GGGAAGGTAT
GAAGAATCCA TCCCCTCATA TGACCGGGCC CTGTCCCTCA ACCCGAAGTA CACAAGCGCC
TATTTTGACA AGGGATCGGC GCTCTCCCGC CTGGGCAGAG ACCGGCAGGC AATCGAGGCA
TTCGAGATGG CCTCGGCAAT AGATCCGGAG TTCGCGGTCG CATATCTGGA GAAAGGCCTT
GCCCTTGCCC GTCTTTCCAA AAACAAGGAA GCGGTTGCAG CATTTGATGC CACTCTTGCC
CTTGATCCGG CAAACGTTCC TGCACTTTTC AACAAGGGAC TTGCCCTCGC AAACCTCAAG
AAATTTGCAG ATGCCATTAC CGTGTTCGAT GCAGCCCTCC GCATTGATGC AAAACACTAC
GAGGCCTGGT TTGCCAAAGG ATATGCCCAG TCCCGGCTCC GGCATTATGA TGATGCTGTC
GGGGCATTTG ACCATGCACT TGCCATCGAT CCCGGGCGGT ACGCGGTATG GTATGAAAAA
GGAGTGGCAC TTGCCCGGGC GGGTAAAAAC GATGAAGCAG TTGCCGCATT CTCTGAAGCA
ATTGCACGTG ATGACAAAAA ACCCGAAGCC CAGTACGAGA AAGGCCGCGC TCTTCTCGAA
CTCGGAGAGG ATGAGCAGGC AGTTACCTCG TTTACCCGGG CCCTTGATCT GGACACATCC
TTTGGGGACG CAGCTTATTA CCTCGGTCTT GCCCTTGAGC GTGTCGGGAA GTTTACCGAT
GCGATAACCG CATACGACCG GATGGTTGCT GCGCGGCCCG ATCATTCCGA TGCCTGGTAC
CACCGCGGTA TCGCATCAGA GCGCCTTGGC AGGGATAACG ATGCGGTCCA GGCGTACGAG
AAGGCCCGGC AGATCGAGCC CCACAATCTT CCGCTCCTCT TTGCCGATGG CAGGGCATGG
GCCAGGCTTG GCCAGTTCGA AGATGCAATC CATCTCTTTG ACATTGCCCT TGGAAAAGAG
CCCGGCAACG GCGAGATCCT CTTTGAGAAG GCAAAAGCAC TTGCTGCCCT TGGCCGCCAT
GATGAGGCAC AGGAGATCTT CCGGCTGGCG TTTACCCAGC TCACCGATAA TTACGAACCG
GCGTACCTTC GCGGACTCTC GCTTCTTGCG CTTGAACGGT ATGAAGATGC GGACATGGCG
TTTGATGCAG CGCTTTCCCT GAGCCCGGAC CTCCCGGAGA TCTGGGAGAA AAAAGGCGGA
GCACTGATGC ATGCCGGCAA TTACGAAGGC GCAGTTGCAG CATTCGATCA CGCCATCTCT
CTTTTGCCTG ACGATCCCGG TGCGTACCTG GAGCGTGGCC GGGCTCTTGC CGCACTCAAC
AGGAACGATG AGGCGGTTGC CTCATTTGAT CAGGTACTTG CCCTTGAGCC GGCAGATCCG
GTGGCAAGCT TCGAACGGGG CCGTGCCCTC TATTACGCGG CAAAGTACGA GCATGCCGTT
GAGGCGCTTG ATACAACCCT TTCCAGCGAT CCCCGGCACC CGGGCGCATT GTACTTCCGG
GCTGCATCTC TTGCAGCCCT GGAAAGATAT GCCGAAGCTG CCGAATCGTT CGAACGCCTT
CTTGTGTATA CTCCGGAGAA TGCGGATGCC TGGTATGAGC AGGGCTGCGT GCTGGCCCGG
CTCCGCCACT ACGATGAAGC AATTGCTGCA TTCGACCATG TCCTTAACCT GGTGCCGGAA
CACTTCGATG CCCTGTTCCA GAAGGCCCGG GCGCTTGACG ACCTGGGGAA GTACAGCGAG
GCTGTAACCA GCTATTCTGC AGCCCTTGCC CTGAAGCCTT CCGATGCAAA GACCCACTAT
TACCGGGGTG TCTCCCTTGC AGAGAACGGG CAGCCAGAGG AGGCAGTCAA GGCCTTTGAT
GCGGCCCTGG AGATCGACCC GGTCTTTTCC GATGCGCTCT TTGCCAAAGG AAAGGCACTG
CTCACCCTCG GGATGTTCCG CGAGGCAGTC AAGACCTTTG ACAAGACCCT GCTCATAGAA
AAGAACTATG CCGGGGTCTA TTTCCACAAG GGTCTTGCTC TTGCCGAACT CGGGCGGCAC
GATGAGGCAA TCACTGCATT TGATAAGGAC ATAGATCTTG ATGCCGGCAA TAACGACGCT
TTCTACCACA AAGGGGTCTC TCTTGCTGCC ACGGGAAAGC TTACTAATGC GATGGAGGCA
TTCGATCACG TGATCCAGGC AGATCCCGGT TCCGTCCAGG GCTGGCTTCA CCGGGGGATG
GCGCTCTTTG ATCTTGGCAG GTTCAATGAT GCCATCTCCT CGTACAAAAA AGCGCTTGAG
ATCGGTCCCA CCAATGCAGA TGCCTGGTAC CTGGTGGGCC GGTCCTATTA CGCACTCAAT
ACGTACGACG AAGCTATAGC AGCATTTGAC CGGGCGCTCG ATCTCCAGGG CGAATTTGCC
GAAGCCTGGT ATTACAAGGG GCGGACCCTT TTTGCAATGG GCAAATACGG CGAAGCCGTT
TCCGCGTACG ACAGTACGCT CGTCCTTCGC CCCAAGCACG ATGAGGCATT CTACCACAAG
GGTATGGCCC TTTTGAAACT CCAGCGGGCA GGGGATGCGG TCTCTGCATT CGACCAGGCA
CTGCGGCTCC GCCCGAACTT TTCGTACATC TGGACCGGGA AAGGAATGGC CCTTGCCGCA
CTGGACCGGC ATAAGGATGC AATCTCCTGC TACACCAAGG CAATCGCCCT TGACCGGAAA
GATTCCCGGG CCTATTACCA GGCCGGGCTC TCGTATCTTT CCCTTGGGCG GTACCAGGAT
GCAATCAGGA ATTTTGAGGC AACGCTTGTC CAGCATCCGT CCTGCGCCCG GGCATTCTAC
GCCAAGGGCC GGGCCCTCTG CGGTGTCTCC ATGTTCCACG AGGCAATCAC CTCGTTTGAC
AAGGCCCTCT CCGAGCAGTC GGATTATCCC GAGGCATGGC TCTACCGGGG AATAGCCGAG
GCAAACCTCG AAGAGTTTGA GGAGGCGCTG GACTGCTACA ACCACGCCCT TGCACAAAAC
GAGTCCTACG CGACAGCCCT CCTCAACAAG GGCCGGGCGC TTATCCACCT GGAGCGGACC
GGTGAAGCGC TTGCGGCGAT TGAAAAAGTG CTCACCATCC AGCCGGAATC CGCGGATGCA
TTTTACTATA AAGGCCGCGC CCACCTGAAC CGCAGGCAGG ATGATGACGC CATTGACGCT
TTTAACCGGG CGCTTGCGAT CAACCGGCAG TTTGCCGAGG CGCATTATTA CAAGGGAACT
GCACTGGCAC GCAAAGGACA GTACGAGGAG GCTGTTGCAG CCTTTGATGC AGCCCTGCGG
ATAAAGAGCG ATTACCCCGA GGCATTTTAC GAGAAAGGCC GGGCACTTTT CCACCTTGAG
CGGTCCAAGG AGGCGCTTGC AGCGTATGAC CAGGCCCTCT CTGCAAATCC CGGGTATGCA
GAAGCAATCT TCCAGAAAGG ACGGACGTAT ATTACCCTCC AGAACCCGGA CGGGGCGATC
CGGTCATTCG ACCGCGCCCT CGAGGTCAAC CCGTCCTGCT TCCAGGCACA CTACTGGAAA
GCGCGGACGT TGTACGATGA GGGCAGTTAT GATGCAGCCA TCACGGAATA TGACCGGGCG
ATTGCAATAA AACCGGATCG GCCCGAGCTC TACCGCGACC GAGGTCTTGC CTATGCGGCA
ATCGATCAGT ACCGCGAGGC CATCAAATCC TATGACAAGG CGCTGGAGCT TGATACCCAC
GGTGCCGACG CATTCTCCCA CAAGGGAAGT TCGCTTGCCG AGCTGGGGAT GTACCGCGAC
GCGCTCGAAG CATTTGAGAA AGCTATCGAG AAGGATCCGG AACTTGCGAC CTCCTGGTTT
GGCAAGGGAA ATGTCCTCTA TGATCTTGGC AAGTTTACCG AAGCCTGCGC GGCGTATGAC
GAGGGCCTCC GCCGCGACCC GGAGAATGCC GTGGGCTGGA CGCGGCGGGG CATGTCCCTT
GCCGGCTTAA ACGATCATAA GGCTGCGATC GAGTCTTATG ACCGGGCCCT GGCAATCGAT
CCCTCGTTCT CGATCGCGTA CTTCACCCGG GGCAGCGCTT TTGAAGCGCT CGGCCAGTTT
GAGGAGGCCG AAGCCTCCTT CCGGGCCATG ATCTCCCTCC AGCCCGACTT TGTGGATGCC
TGGATCCACC AGGGCCGGGC CCTTCAGGAA CAGGAGAAAT ACCAGGAAGC CCTGACCTCG
TTCAAACGCG CCCTTGAGAT CGATCCGTCA AGAAAAGAGA TCTGGAACGA CGTCGGCTCG
ACACTTGACA AACTGGGCAA GCATGAAGAA GCCCAAATCT GTTACGAGAA AGCCCTGTAA
 
Protein sequence
MSRSDAEALM RQGTELYDLG RHQEAVVMFD RALTLFPKLP KAHYFKGIAL YDLGRYEDAL 
DSYDHALALD PSDINSWYNK AATLAQIGRN KEALDACDRL IALRFDNAEA WILKGISLYE
LGRFRDAISA YDHALAIDPT YAKVYYNKGI ALADLGRHDE AIAAYGKAVG IVPEYAKAYY
NMGISLYEIG RYDEALGAFE KAHDLDPSDP WVWYYRAFIL AKQERYAQAA EAAGVFLSFE
PEHADIWVIQ GISLYRLRRL DEAADAFDRA IEQDPLAPDA WLYKGFSLFD MERYEDATYA
LDKAAELSPQ TTKIYYTRGK ANQRLGKYRE AVADFDRALA AEPENADALY SRGVSCIHLS
RYDESLSVFD RILASQGDHA GASYFRGVVL SRLGRQDEAI SAFEHTLAID PGCASAAYQI
GLASASLGRY SDAVAAYDRA LKIRPDYPDA VYHKGFALAK LGNSEDALLE FDRALTENPG
NAPAYHQKGQ LLVRTGRLEE ALEALNKSIA LKPDNAQVYY DKGSALLKAE RFGPALEAFD
QAIGIYPNYV NAYYNKGIAF SRTGMRKEAL EAFDHAIAID PTHTLALYHR GTMLSGLGRY
ADAAAAYDAV LALSPQNTSA LYEKGVALMQ LSRWKDAAEA FGQAVEQDPG LIDAWLAFGT
CNANLGKFPD AIAAFDRVIA LSPKNTQAFI HKGIALVTTG KFEEAIAALN RALEDAPRDE
RAWYYKGMSL AALQRFEEAV RSFERVLEIN RRCSPAFFQK GNALAHLGKQ LEAIISYDQA
LEIDPDNPVT LYQKGIALAQ RERYDDAIKT FERLLTLEPE NAQALYYLGI AYAGRQRFDE
AIVAFERSLE IDPKNPLAHH YMGVSLVECD RYDDALRSFS EALLLDASNA STYYYQGIAF
LQSHQYEEAI AALNTAIRMD TSLSDAFTYL GISLARLGRH DEAVAALNRS LAANPSQMEA
LVCRGESLMV LQRYADAVET FDRILSLNPN VISAWMQKGA ALERLVKKQD ALAVYTRVLE
INPGNADAWA RKGVLLQDLG RTAEAVTAFS KALDINAGIG GIWMHKGDAL STLGKTSEAA
EAYAEALKLD PDQEEGWIKG GRALFDLGRY QDAIDAFDNA IALNQRSTVA FLYKGFSLEK
INRAGEALQV FEVLLEIDPH NSEAHYHMGL ALAGSGRPKD ALAAFESALK IRDTFAPAWY
NKGKMLLDLG KYQEALAAFD QALEREPAYT EVFYSRGVAL SKLGRFPEAI EAFERNLEKD
TSNAPGYYFK GIALSKLGRY QEALDAFDRA LVYDPENALV YFQKGRALDG LNRFQEAVAA
FEKTLALKPR YSEARMRKGI SLYNLGRYAD AIRDFDRTIA ENPHNFHAWY QKGRALFDSG
SYTEAIDAYD RALEVESSYP EAHYHKGLAL YELGRYEEAL LSYDQALESN PHLDYALFHR
GAALMKLERY REAVQAFDAA LLLLPKYAPA HHLKGVSLAA QGLYQDSIYA YDRALECDPG
SGESALNKAM SLHNLGQDED ALAAAVKAIE IQPDFAEAWR YRGLILSNLG RYQESVEALD
HALAGDPKNA RVNYQKGRAF DGLGQYENAI SAYDAALQAQ PDCIPARMHK GEALLFISRF
RDATKEFGKI LTEHPDNAEA WIKMARARFS LGDYTEVIEA CDHALRFNAD SAEALLYRGL
AQYELGRYEE AVESLARAEQ IDSHLEQAVY HLGAALLKLE RYGDAIPAFD RVLSLKPDQA
TAHHLRGVAL AAQGMYPEAI SSFENALRYD PRSAESALNK AIALHSLGRD EESILASDIA
LGIQPDFAEA WYYKGVALET LKRYADAVPA FSRSLELDST TTHAWFEMGL CLVELQRYEE
AAGAFDHVLG LVSDYPPAYF HKGRALALLG KYEEAVVAFD SALAITPGDA IVLSAKGHAL
ESLKKYREAA AAFEEATSVN PAAADDYYHL GLAYIEQHRD EKAIAAFAKT LRIDPENPDA
LFQAGIVLAR LEKYDEAIGL FDRYLELGKE NAGILYERGC AYFALQKYSE AIASFDRALA
LDANHIGALV KKGQSRANLG QYEEAVTLFD RVITLDPENV IAHFVMGTAL ARLARYEDAV
VALDRALEYD GNNARIYACK GYSLYRLGRF KESAESFAKA QKREPKDPFS LRFRGKSLLH
NGKWEEGIAI FDKLLGIEPK SADAWYYKGI AYSHLSLHDE AQESFEQALT IDGECATAWY
QKGLVLFERE RFEESLPAFE RAAELAPSVQ DYAFRNALCL FMLERYPEAI SAFDRALTLG
PETAVIQYYR GRALAEMRDY GVALDALNRA IGLDPENSFT WLAKGSVLLA QKDGAAAVAA
FDQALVLDPK AADAAFFKGE AFSLLGNDEE AIHAYDLALS LESAYPEGSF KKGLALLRLK
NYNGAIEAFD AAIQFVPGHA QAHYHKGLAL FALGKNEKAI RSFTHALEHD PSLSDALFHT
GLAYAALSRY SPALSAFDKL LESGPQNAEA LFQKGRMLAK LGRPDEALAV LETSLGLENN
IADVWLLKGS VLLEQERLED ALEVFDRALA LTPENNAAWY RKGKAFSGLH RYPEAIQCFD
RVVTSDTGCA QAWFRKGSAL LSNGDLRAAI EALTKALELK PDNANGWYDR AVALAGLGRY
EESIPSYDRA LSLNPKYTSA YFDKGSALSR LGRDRQAIEA FEMASAIDPE FAVAYLEKGL
ALARLSKNKE AVAAFDATLA LDPANVPALF NKGLALANLK KFADAITVFD AALRIDAKHY
EAWFAKGYAQ SRLRHYDDAV GAFDHALAID PGRYAVWYEK GVALARAGKN DEAVAAFSEA
IARDDKKPEA QYEKGRALLE LGEDEQAVTS FTRALDLDTS FGDAAYYLGL ALERVGKFTD
AITAYDRMVA ARPDHSDAWY HRGIASERLG RDNDAVQAYE KARQIEPHNL PLLFADGRAW
ARLGQFEDAI HLFDIALGKE PGNGEILFEK AKALAALGRH DEAQEIFRLA FTQLTDNYEP
AYLRGLSLLA LERYEDADMA FDAALSLSPD LPEIWEKKGG ALMHAGNYEG AVAAFDHAIS
LLPDDPGAYL ERGRALAALN RNDEAVASFD QVLALEPADP VASFERGRAL YYAAKYEHAV
EALDTTLSSD PRHPGALYFR AASLAALERY AEAAESFERL LVYTPENADA WYEQGCVLAR
LRHYDEAIAA FDHVLNLVPE HFDALFQKAR ALDDLGKYSE AVTSYSAALA LKPSDAKTHY
YRGVSLAENG QPEEAVKAFD AALEIDPVFS DALFAKGKAL LTLGMFREAV KTFDKTLLIE
KNYAGVYFHK GLALAELGRH DEAITAFDKD IDLDAGNNDA FYHKGVSLAA TGKLTNAMEA
FDHVIQADPG SVQGWLHRGM ALFDLGRFND AISSYKKALE IGPTNADAWY LVGRSYYALN
TYDEAIAAFD RALDLQGEFA EAWYYKGRTL FAMGKYGEAV SAYDSTLVLR PKHDEAFYHK
GMALLKLQRA GDAVSAFDQA LRLRPNFSYI WTGKGMALAA LDRHKDAISC YTKAIALDRK
DSRAYYQAGL SYLSLGRYQD AIRNFEATLV QHPSCARAFY AKGRALCGVS MFHEAITSFD
KALSEQSDYP EAWLYRGIAE ANLEEFEEAL DCYNHALAQN ESYATALLNK GRALIHLERT
GEALAAIEKV LTIQPESADA FYYKGRAHLN RRQDDDAIDA FNRALAINRQ FAEAHYYKGT
ALARKGQYEE AVAAFDAALR IKSDYPEAFY EKGRALFHLE RSKEALAAYD QALSANPGYA
EAIFQKGRTY ITLQNPDGAI RSFDRALEVN PSCFQAHYWK ARTLYDEGSY DAAITEYDRA
IAIKPDRPEL YRDRGLAYAA IDQYREAIKS YDKALELDTH GADAFSHKGS SLAELGMYRD
ALEAFEKAIE KDPELATSWF GKGNVLYDLG KFTEACAAYD EGLRRDPENA VGWTRRGMSL
AGLNDHKAAI ESYDRALAID PSFSIAYFTR GSAFEALGQF EEAEASFRAM ISLQPDFVDA
WIHQGRALQE QEKYQEALTS FKRALEIDPS RKEIWNDVGS TLDKLGKHEE AQICYEKAL
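
As a spot check that the gene and protein sequences correspond, the first line of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the method used to generate the record. It uses the standard codon assignments, which NCBI translation table 11 shares (table 11 differs in which codons may act as start codons). The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G), as also
# used by translation table 11. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record)
    is removed first; any trailing partial codon is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence and first two blocks of the protein
# sequence, copied from this record.
first_gene_line = "ATGAGCAGGA GTGACGCTGA AGCGCTGATG CGGCAGGGTA CCGAACTCTA TGACCTGGGC"
first_protein_blocks = "MSRSDAEALM RQGTELYDLG"

translated = translate(first_gene_line)
print(translated)  # MSRSDAEALMRQGTELYDLG
```

The 60 bases of the first gene line translate to the first 20 residues of the protein sequence. The same helper could be applied to the full gene sequence if it is pasted in as a single string.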