Gene A2cp1_3455 details


Gene Information

Locus tag: A2cp1_3455
Symbol:
ID: 7297471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter dehalogenans 2CP-1
Kingdom: Bacteria
Replicon accession: NC_011891
Strand:
Start bp: 3846068
End bp: 3858382
Gene Length: 12315 bp
Protein Length: 4104 aa
Translation table: 11
GC content: 78%
IMG OID: 643596262
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002493851
Protein GI: 220918547
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
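
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends with a single stop codon that is not counted in Protein Length. Both points are assumptions about the database's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3846068
end_bp = 3858382
gene_length_bp = 12315
protein_length_aa = 4104

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span == gene_length_bp)         # True
print(remainder == 0)                 # True
print(residues == protein_length_aa)  # True
```

Under these assumptions the three listed values agree with one another: 12315 bp is 4105 codons, which is 4104 residues plus the stop codon.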


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCGCAC CCTCCCTGGA CGAGCAGATC CGATCCCACG TCGAGTCGCT CGAGGCGGCC 
CCCGGCGACG ACGAGGCGTT TCACGCGCTC GTGGGACTTT ACGAGCGCGG CGCGCGCTGG
GAGGAGCTGA TCGCGCTCTA CGAGGGGCGC GCGCGTCACG CGCCCGGCCC GGGCCCGCTG
CTCGCCGAGG CTGCCACCCT CGCGCACCGG AAGCTGCGCA ACGTGGCTCG CGCGGAGGAC
CTGTACCGGC AGCTGCTCCA GCTCGAGCCG GCGCACCCCG TGGCGCTTCG CGCGCTGGTG
GAGCTCCTCG AGGAGCGCGG CGACTGGGCG GGGGTGGCCG CCGCGCTGGA GCGCGAGGCG
ACCGCCACGG CCGACCGGGC CGCCGCCGCG GCGGTGACGC TCCGGCTGGG CAAGGTCCAG
GAGGAGCGGC TCGGACGGCG GGACCGCGCC GCGCTGCTCT ACGCGCGCGC GCACCGGCTC
GATCCCTCGC TGGAGGAGGC GCGCGCCCGC GCCATGGAGT GCTTCGGCGC GCTCCGCCGC
TTCGCGCAGG CGAAGCGGCT GCTCGACGGC GCGCGCGATG CCGGCGCCGA TCCGCGCGCG
CTCGCCGCCG AGTACGCGCG GCTGGGCGCC GCGCTGGTGG ACGAGCCGCT CGACCACGCG
CTCGCGACCG ACGCGCTCAT CGACGCGCTC GCGCTCGATC GCGCCGCGCC CGGCGCGGCG
GCCGCGCGGG AGCGGCTGCG CGGCTTCCCG CGCGCCTGGC GCGAGGAGGC CGGCCGGCTG
GAGGCCGAGG CGACGCGCGC GTCCGATCGG CGCACCGCCG CGGCGCTCCA GCTCCGGCTC
GCGCAGATCC ACGCCGCCTA CGATCCCGAG GGCGCCGGCC GCGCGGTGGA GCGGATCGAG
CGCGCCTGGG CGCTCGCCCC GGGCGCGCCG GCCGCGCTGG AGCTGCTGGA GCGGGTGCTG
GCCGAGCGCG GCGATCACCG CGGCCACGCC GACGCGCTGG CGCGCCTCGC GACGCAGACC
CGCGACCGCG CCGCGCAGGT CGCCCTGCAC CTGGAGATGG CGCGGGTGGA TCTGGTGCGC
TTCGGCGACG CCGAGTCCGC GCTGGCGGCG CTGCGCCGCG CGCTGGAGCT CGATCCCGCC
TGCGAGCCCG CCGCGCTGCA GGCGTTCGAG CACGAGACCG ACGCGGGCCG CTTCGCGGAG
GCGCTGGACG TGCTGGAGCG CCACCTCGCC GCCGCGCCGG CGAAGCCCGC CCACGCGCCG
CTGCGCGTGC GCGCCGCCGC CATCGCGCGC GAGAAGCTCG GCGATGCGGG CCGCGCCCGG
CGCCACCTCG AGGCGGCGCT GCGCGCCGAC CCCGGCCACG CCCCCGCCGC CGCCGCGCTG
GCGCCGCTCC TCGCCGAGGC CGGCGAGTGG CAGCCGCTCG CGCGCGTGCT GGAGCTGTCG
GCCCGCCGCG AGCGCTCCCC CGGCGAGCGG GTGCGCCTGC TCGAGCGGCT CGCCGAGATC
CAGCAGGAGC GCCTGGGGCG TCCGCGCGAC GCGCTGCGCA CGCTGGCGGG CGCGCTCGCG
CTCGACCCGG CCCGGCCGGC CACCCGCAAG GCCATGGAGG GCGCCGCCGC CCGCGCCGAC
GCGTTCGCGG AGCTGGCGCG CGCCTACCGC ACCGCCGCGG CGGCCGCGAA CGGCGACCTG
AAGGCGCGCA AGACGCTGCT GCGCCGGGTG GCCGAGATCC TGGACCGCGA CCTGGGCCAG
CCGGAGGAGG CCGTCCGGGC CTGGCGCGCG CTGGTCGAGG TGGACCCGGA GGATCGCGGC
GCCCGCGCGG CGCTGGAGTC GTGCCTGGCG CGGGCGGGCC AGCAGCAGGA GCTGGCGCGC
GAGCTCGAGG CCCGGCGCGA GCACGCGGCC GGCGACGAGC GGCGCGCGCT CGGGGCGAAG
CTGGCGCGGC TCTGGGCCGA GGCCGGCGAG CCGGACCGCG CCGGGGCCGC CTGGCGCGCG
GTGCTCGCCG AGGGCCCGGA GGACGACGAG GCGCTGTGGG GCCTGCACGC GGCGCTCGAG
GCGCAGGGCG GCGCGGCCGC GGCCGAGGAG CGGATCGGGA TCCTGGCGCG GCTCGCGGCG
CGCGCGCGGG GTCCCGCCGA GCGCGCCGTG ATCGAGCTGG CCCGCGCCGA GGCGCTGGCG
GAGCCGCTCG GGCGCCACGC CGACGCCGCC GGGGTGGCGC TGGCCGTGGT CGAGGCCGGC
GGGCTCTCCC CGCCGCAGCG CGCGGAGGCG GTGGCGCTCC TGGAGCGGCT GCTGGCGCGC
GGCGCGGAGC CGCTGCGCGT GGCGCAGGCG CTGGCACGCG CGCACGCCGC CGCGGGCGAC
GCTTCCCGGC AGGCCGCCAT GCTGGAGCGG GTGGCGAAGG AGCTGCCCGC CTCCGCCGAC
CCGCGCGAGC GCGCCCGCCA CCTGCTCGAC GCCTCCGCGG TCCGGGCCGA GCGCCTGGGC
GACCGCCGCG GCGCGCTCTC CGCCGCGGCC GCGGCGCTGC GGGCCTGCCC CGACCACGCC
GAGGCCCGCG CCCGCTGCGA GGCGCTGGCC CGCGAGGTCG GCGCGCACCG CGAGCTGCTC
GCGCTGCTGG TCGAGGTGGC GGGCCGGCTC TCCGGGCGGC CCGAGGAGGA GGCGGCGCTC
CGGATCCGCG CCGCCGCCAC CGCCGAGGAG GACCTGGGCG CGTTCGACGA CGCCGCCGCG
CAGCTCCGGC GCGCGCTGGA GCTGCGGCCC GGCGACCCGG CGGTGCTGGC CGGGCTCACG
CGCGCCGCGC TCGCGTCCGA GCGCTGGGCC GACGCCGACC GCCTGCTGGC GCAGCGCGCC
GCCGCGGCGA CCGGCGCCGA GCAGGTGGCG CTGCTCGCGC AGCGCGCCGA GGTGCTGCAG
GAGCGCCTCG GCGACCCCGC CGCCGCCGCC GAGGCGTGCC GCACCGCCCT GGCCCGCTGC
GCCCCGGAGC AGCGCGCGCG CCTGCTGGCG CGGCTCGCCG GCGCCCTCGG GGCCGCCGGC
GACGAGGCCG GGCGGGCCGA GGCGCTCGGC GACCTGTCGG CGGCCTCGCC GGAGCCGGCC
GAGGCCACCC GGGCCGCGCT GGAGAGCGCG CGCATCCGCG CCGGAATGGG CGACGCGCGC
GCCGCGGTGG AGCGGCTGAC CGCCGCGCTG CGGGCCTCGC CCGACGACGC GGCCGCGCTC
GCGGCGCTGG AGGAGCAGCT CGGGGCGGAG GACCCGGCCG CGGCGCTGAT GGCGGCGCGC
GCGCTGGCCG GCCAGGCGGA CCCGCGCCGG CGGCTGCGCG CGCTGGAGGC CGAGGCCCGG
GCACACCCGG AGCCCGGCGG CCGCGCCGCC GCGCACCGCG CTGCCGCGCG CGTCGCGGAG
CAGGAGCTGG GGCAGGCGTC GCTCGCGTTC GCCGCGCTGG CCGCGGCCGC GCGCGAGCTG
CCCGGCGACG CGGACCTGCG GGCCGAGCTG CGGCGCGTGG CCGGCGAGGC CCAGGAGTGG
GAGGCCTGCG CGCGCGTGCA CGACGCGCTG GTCGACGCGG TGCCGGCCTC GGGGCGGCTC
GCGGTCCTGC GCGAGCGCGC CGAGCTCGCC GAGCGCAGGC TGGACCGGGA CCGCGCCGCC
GCGGCCTGGG CCGAGGTCGC CGGGGCGGCG CCGGGCGATC GCGACGCGCT GGCCGCGCTG
CGCCGCCTGC ACCGCGCGCG CGAGCGCTGG GGCGAGCTCG CCGACGTCTG CGCCGCGCTG
GCGGCCGCGC CGGAGGCCGC GCCGGCGGCG CGCGAGGACG CGCTGCGCGA GGAGGCGGCG
GTCGCGGAGG CGCGCCTCGC CGATCCGGCG CGCGCGGCCG CGGCGTGGGG CGAGGTGGCG
GCGCTCGCGC CGGACGACGC GGAGGCGGCG GCGGCGCTGG AGCGGCTGTA CCAGCGCCTC
GACCGGCCGG AGGCGCTCGC CGCCCTGCTG GAGCGCCGCC TGGCCCGCGC GTTCGACGCG
GACGCCGCCG CCCGGCTCGC CGAGCTGCGG CGCTTCCGGC TGGGCGATCC GGCCGGCGCG
CTGGCGCTCC ACGCCGAGCT GCTGCGCCGG GATCCCTCGC GCGCGGACGT GCGCGACGCG
CTCGCCGAGC TGGCCGCGGT CCCGGGGGCG GTCGGGCGCG AGGCGCTCGA CGCCGCCGAC
GCGTCGCTGC GCGAGGGCGG CGAGCACGCC CGGCGGGTGG CGGCGCGGGA GGCGCGGCTC
GCGGCGGTGG AGGACCGCGG CGAGCGCGCC CGGCTGCACG CGGAGCTGCG CGCCATCCTG
GAGCGCGACC TGGGCGAGCC CGGCCTGGCC TGGGTGGCGG CCTGCCGCGC CTTCGCCGAG
GGCGGGCCGT CCCGCGCCGG CGCCGAGGAG GACCTGGCGC GGCTCGCGCG CGAGACCGGC
GCGGAGGACG AGCTGCCCGA CGTCTACGAG CAGGCGGCGG CTGCGGCGGG GCCGGAGGAG
CGGCTCGCCC TGCTCCGCCA GGCGGCCCGG CTGCGCGAGG CGCGCGTCGG CGGCCGGGGG
GCCGTGGACG CGTGGAACGC GGTGCTGGCG CTCGCGCCGG ACGACGCCGA GGCGCTCGAG
GCCCTCGCGG CGCTGCACGA GGCGGCGCGC TCGGCGCGGG AGATCCTCGA GGTCGCGCGC
CGCCGCGCCG CGCTGGCGGA GGGCGAGGAG CGGATCGGCC ACCTGCTGCA CGCGGCGGTG
GTCGCGGACG AGCTGGGCGA CGGGGGCGTG GCCGCCGAGG CCTACCGCGC GGTGCTCGAG
GAGGCCCCGG ACCGGATCGA GGCGCTGGAG GGGCTCGCGC GCGTGCTGGA GCGGGCGCCG
GGCGCGGAGC CCGGGCCGGC GACGGCGGAG CTGCTCCAGG TGCTGGAGGC GCTCGCCCGC
GCCTGCGGGG CCGATCCCGA TCGCCGGGTG GCGGCGCTGC TCCGGCGCGC GGCGCGCCTG
GAGCGCGACC CGGATCCGCG CCGGGCGGTG GAGGGCTACG CCGAGGTGCT CGCCGAGCGC
CCGCGCGAGC CGCAGGCGGT GGCCGGCCTG GAGCGCCTGC TGAAGCGCCC GGACGCGCGC
GAGGGCGCGG CGCGGCTGCT CGAGGACGTG CTCCGGACCG CGGGCGACGC GGGCCGGCTG
GCCGCGCTGC TCGAGGTCCG GCTGGAGGGC GCCGACGAGG CCGAGCGCGC GCCGCTGCTC
GCCGAGATCG CCGCGCTGCA CGAGCGGCTC GGCGACCGGC GCCGCGCGTT CGAGGCGCGG
GTCCGCGAGC TGGCCGACGC GGCGCGGGCG GGACGCGACG CGCCGTCGGC GCGGGCGGAC
CTGGAGCGGC TCGCGGCGGC GACCGGAGCG TGGGCGGAGC TGGCGGAGGC GCTGCGGGTG
GCGCTCGCCG CCGGGCTGCC GGCGCGCGCG GCGCTGGAGG CGCGGCGGCG GCTGGCCGCG
GTGTGCGCCG ACCGGCTCGG CGATCTCGCC GAGGCGGCCC GGCAGTACGA GGAGGTGGCC
GCGGCGGCGG TCTCCCCCGA GACGCTCGGC GCGCTGGCGC GCGTCTACCG CCGCATGGGC
GCCCACCGCG AGCTGGCCGT CACGCTCTCG CGGCTCGCCG AGGTCGCGCC CGCGGCGGCG
GCCCGCAAGG AACTCCTGCT CGAGGTGGCG AAGATCATGG CTGAGCAGCT CTCGGATCGC
GAAGGCGCGG TGGACGCGTA CCGGAAGATC CTCGCGGTGG ACCCGGAGGA CCCCCAGGCG
CTCCGGCTGC TCGGGCGGCT GCTGGGCGCG GCCGAGCGCT GGGAGGAGCT GGTCCAGATC
CTCGACCGGG AGGTGGCGCT CGCCGACCGG CAGCCCAACT TCGTCGCCGA GGCCGCCGAG
CTGCGCTTCC GGCTCGGCCG CATCCGCCAC CAGCGGCTGG CCGACGCGGA GGGCGCGCTC
GTCGCCTACC GCGAGGTGCT GCACCGCGTG CCCCGCCACC CGGCCGCGCT GTCCGCGCTC
GAGGAGCTGG CCCGCGGCAC CGGTCCCGCC GCGCTCGAGG CGGCGCTGCT GCTCGAGCCG
GTGTACGCGG CCGAGGGCGA GCACGGAAAG GTGGTGGAGA CGCTGGAGGC GCGGGCGGCG
AACGAGACCG AGCCCGCCCG GCGCGCCGCG CTGCTGCGGC GGGTGGCCGA GACGTACGGC
GGGCCGCTGC GCAACGCCGA GATGGCGTTC CTGGCCGCGT CCCGCGCGCT CGCGGCGGAC
CCCGACGCGC CCGAGTCGCT CGAGCTGGCG GTCCGCCACG CGCAGGCGGC GGGGCTGGGT
GACGAGCTGG CCGCGCTGCT GGAGGAGCAC GCCGACCGGG CCCGCGAGCC GGTCGCGCGG
GCCGAGTACC AGCGGCGCAT CGCGCGGCTC GCGCACGGCG AGCCGGCCCG CGCGGCCGCG
GCCTGGCAGA AGGTGCTCGA CCTCGCCCCG GACGATCGCG AGGCGCTGGT CGGCCTCATC
GACGCGCTGG AGGCGGGCGC GGATCCCGAG GCGCTGGCGC AGGCGCTGCG GCGCGGGCTG
GCGATGGAGG AGCTGGCCGA GGGCCGCGCC CACCTGCTGC GGCGGCTGGC GGCGGTGCAG
GACGAGCGGC TCGGCGACGC GGCCGGCGCC ATCCAGAGCC TGAAGCGCCT GCTGGAGCTG
TCGCCGGACG ATCGCGAGGC GCTCGGGCGG CTGGATCGCC TGTGCGTGAA GGCCGAGCGC
TGGGTGGATC TCGGCGACGT GCTGGCGCGC GAGATCGCGG CCGCGGCCGA GGCCGGCGAC
GCGAACGTGC TCGGGGCGGT GCGGCAGCGC CTGGCGGAGC TGAAGGAGAA CCGGCTGCTC
GATCGCGAGG GCGCGCTCGA GCTGTACGAG GAGGTGCTGC GCGCCCGGCC GGACCACCCC
GAGGCGCTGG CGCGGCTCGA GGCCATGTTG CAGAAGGACC CCGGCAACGC GCGCGCCGCG
GTGGCGCTGG AGCGGGCCTA CGCCGCGGCC GGCGACCCGC TGCGCCAGGC GGCGGTGCTG
GAGCAGCGCG CCGGCGAGCG CCCGGATCCG CAGGAGCGCA AGGCGCTCTA CCTCGCGCTG
GCCGAGCTGC GCGAGAAGGG GCTCGGCGAT CCCTCGCTCG CGTTCCTGGC GCTGTGCAAG
GCGTTCCGCG AGGACCCCGC CGACCCGGCG CTCCGCGCGC GCATGGAGGC GCTCGCGGCG
CGGAGCGGCC ACGAGGAGGA GCTGGCCGCC ATCTACGAGG ACGAGCTGGA CCGGCTGCCG
CCGTCCGACA CCGCGCAGGT GGCCCTGCGC CTCGGCGCCC TGTACGAGGA GCACCTGGCC
GAGCCGGCCC GCGCCGCCCA GTTCCTGCGG CGCGCCGCGG CGCTCGACCC GGCCGCCGCG
CCGGCGGCGC TGCCCGCGCT GGAGCGCATC TACCAGAAGC TGGAGAGCTG GCCCGACCTC
GCCGACGCGC TGTCGTCGCT CGCCTCCTCC GCCCACGGCG CCGAGCGCGT CCAGCTCCTG
TTCCGGCTGG GGCAGCTCTG CGAGGAGCGG CTCGCCGCGC CGGATCGGGC CGCCGAGGCC
TACGAGGCCG CGGTGGCGGC CGACCCGCGC CACGTGCCGT CGCTCCGGGC GCTCGAGGCG
CTGTACGACG GCGCCGGCCG GCGCGAGGAC CTGTTCCAGA ACCTGGCGGC ACAGCGGGCC
GCGGCGCAGG AGCCGGCCGC CCGGGAGCGC GTGCTGGCCC GCATGGCCGC GCTGGCGGCC
GAGCTGGGCC GCCTCGACGA GGCGGTGGCG CTCTGGAAGG AGCTGCTCGG GATCCGCCCG
CGCCACGAGG CGGGGCTGGC CGCGCTGGAG GACCTCTACG AGCGGCTGGA GCGCTGGCAG
GACCTGGCGC AGCACCTGCG GCTGCGGGTG TCCGCCACGG TGGACCGCCG CGAGATCGCG
CGCCTCAACG ACAAGCTCGG CCACGTGCTC GGCACGCGGC TCGGCGACGC CGCGCAGGCG
GTGCAGTCGT ACAAGGCGGT GCTGGAGTCG GATCCGCGCA ACCGGCGCGC GCTGGAGGCG
CTGCGGGACA TCCACGCCGC CCAGGGCGAC CAGGACGCGC TGGTCTCGGT CTACCGGCGC
CTGGTCCCGC TCCAGGAGGA CGCGGCCGGG GTGAAGCGCG TCCGGCTCGA GCTCGCGGAC
GTGCTGCTGC GGGCCGGCCA GAAGCGCGAG GCGGTGGAGC AGGCCAAGCT CGCGTTCGAC
ATCGAGCCGC ACACGGCCGG CGACCTGGTC CGCATCGAGG AGACCTTCCG CCAGGGCGGC
GCCGCCCAGG ACGGCGTCCG CGCCGCCGAG GCCCGCGCCG CGCTGCTCGC GGCCGAGGGC
GGCCCCGCCG AGGCGGTGCC GGCCTGGCTG GCGGTGGCCG ACCTGTGGCG CGCCCAGAAG
CGGAACGACG CCGCCGCGGC GGCGCTCGAC AAGGTGCTGG AGCTCGACCC GGCGAACCGC
ACCGCCTACG AGCAGCTCCG GGCGCTGCAC GAGACGGCCG GCAACTGGCG CGCGCTGGCG
CGGGTCTGCG ACCTGTTCGC GCCGCACCTG CCCGACCCGG CGGAGAAGCT GGCGCTGCTG
AAGGAGGTGG CCGGCGTCCA CGAGAAGCGG CTCGGCCAGA AGGAGATGAG CTTCCTCGGC
TGGTGCCGCG CGCTGGCCGA GGGGCCCGGC GACGCCGAGG CGCTCGCCGA GGCGGAGCGG
CTGGCCGCCG AGACCGAGGC GTTCGACGAG CTCGCGGCGG TGCTGGAGCA GGTGGCGGAG
GACGCGAAGG GCATGGTCCG GGCGCGCCTG CTGCTCCGCC TGGGCAAGGT GCGGGACGAG
CGGCTGGACG CGCCCGAGGA GGCGGAGGCC GCCTACCGCA AGGCGCTGGA GGCCGACCCG
GCCAGCCCCG AGGCGCTCGA CGCGCTCACG CAGCTGTTCA AGCGGCGCGG GCGGGTGCGC
GACCTGGTCA TCACGCTGGA GCAGAAGCTC GAGGCGGCCG CGGGGCTGGA GGAGAAGAAG
GCCACGCTGC TGGAGATGGC GCGCATCTAC GACGGCGAGC TGCACGACGT CGAGGAGGCG
GTGAGCGCGC TGCGGCGGGT GCTGGAGCTG GACGGCGCGG ATCCGGCGGC GCTGGAGGCG
CTGTCCGAGC TGCTGCGGCG CGAGCAGCGC TGGGCCGACC TGGCCGGCGT GCTGGCCCGG
GCGCGCGACC TCTCCGCCTC GGACGAGGCG CGCATCGCCT ACCAGCTCCA GATCGCCTCG
CTGCACGAGA ACGAGATCGG CGACGACGAG GCGGCGGTGG AGGGCTACCG GACCGTGCTC
GGCCTGGACG ACCGGAACGC CGACGCGCTG GCGGGGCTGG AGCGCCTGTA CACCAAGCTC
GACCGCTTCG CCGAGCTGAA CCGCGTCTAC GAGCGGCTCA TCGCGCTCAC CCAGGACCCG
CGCGAGCAGG TGCGGGTGCT GTCGCGCAGC GCCGCGATCC ACGACGAGAA GCTGCGCGAC
CCGCGCTCCG CCATCGAGAA GAACGAGGCG GTGCTGCGCA TCGACGGCGC GAACGCGGTG
GCCATCAAGA ACCTCGAGCG GCTCTACCGC GACGAGGCGC TCTGGGACCG GCTCATCAGC
GTGATGCAGC ACCACGTCTC GCTGGTGCAG GACCGGCGCG AGCAGGTGAC GCTCGAGGTG
GCCATCGGCG AGATCTGGTG GAAGGAGCTG CAGCGGGTGG ATCGCGCCGA GGCCATGCTG
AGCCACGCGC TCCAGCTCGA CCCCGACTCG CGCCAGGCGG TGAGCGCGCT CGGGCGCCTG
TACGAGAACA GCGGCAACTG GAACCTCGCG CTCGACATGC TGCGCCGCGA GGCGCGGGTG
GCCGGCGGCT CGCGCGACGC GGTGGACCTG CAGGTCCGCA TGGGCGCGAT CTTCGAGGAC
ATGCTCATGG ACGTCGCCGG GGCCAAGGAG GCCTACGGCC GCGCCCTGCA GCTCGACCCG
GGCAACCTCC CGGCCCTGCG CGCGCTGAAG GGCATCGCGG AGCGCGAGCG CGACCGCGAT
CGGTACCTCG AGCTGCTGGT CGACGAGGCC CGCTACGCGA CCGACGTCCA GGAGAAGACC
GAGCGCTACA CCGAGGCGGC GCGCGTCTAC CAGGAGGAGC GCGACGACCG GGAGAGCGCG
GCCCGCTACT ACGAGGAGGC GCTGAAGCGG ACCCCCGGCC ACCTCGACGC GGCGCGCCCG
CTCTCGGACA TCTACGTGGC GCAGGCGCGC TGGGCCGACG CCGAGCGCGT GCTCGACGTG
ATCGTGGGCG TGCTGGACGC GGGCGGCGAC GCGCGCGAGC TGTGCCGCCA GTGCTACCGC
CAGGGCTACG TGGCGGAGAA GCTCGGGCGC ATGGACAAGG CGCTCGCGTC CTACCGCCGG
GCGTACGAGC TCGACGCGAC CTACCTGCCC GCGCTGGAGG GTCTCGGCAA CCTGCTCGTC
CGGCGGGAGG AGTGGGACGA GGCGCTGCGC ATCTTCACCG CGGTCATCAT CCACCACCGC
GACGGCCTCA CCGACCTCGA GGTGGTCGAG ACGCACTGGC AGATCGGCGA GATCGCCGCG
AAGCTGGGGC AGCTCGACCG GGCCGCCAAC GCGTTCCGCA AGGCGCTCGA GATCGACACG
AACCACGAGC CGTCGCGCCG CAGCCTGGTG CGCGTGCTCG AGGCGGTGGG CGACTGGGAG
GGCGCGGTGG ACCAGCGGCA GCGGCTCCTG CCGCTGCTCG AGGGCAAGGC CCGGTTCGAC
GACTTCGTCG CCATCGGCGA GGCCTGCCGC GACAAGCTCC AGGATCCGTA CCAGGCCATC
GACGCGTTCC TGGGCGCCGC CCGCATCGAC CCCACCAGCC TGCCCGTCAC CGAGGCGCTG
CTCGGCCTCT ACCGCGAGAC GCGCCAGGGC CAGAAGGCGG CCGACGTGCT CGGGCAGATG
GTGGCGCGGC CCGAGGTGCA GGCCGACCCG GTCCGCGCCG CCCGGCTCCA CCTGGCGCGC
GCCGAGGTGC TGCGCGACGA GGTCAAGGAC GAGGACGCCG CGCTGGCCGA GCTGGAGCGC
GCGCTCGATC GCAACCCGCG GCTCGTCCAG GCGTTCGCGG CCATCGAGGA CGCCCTCACC
CGCGGCAAGC GCTGGCAGGA GCTGGAGCAG GCGTACCTCC GGATGATCCA GCGGCTGCCC
AAGACGCCGG ACGTGGCGCA GGCGCGCCTG GCGCTGTGGA AGACGGTCGG CGAGCTGTAC
CGCAACGTGC TGCGGAACGA CGACGGCGCG CGGCTCGCCT ACCAGGTGGT GGCGAAGGCC
GACCCGGACG ACGCGGTGTC GCTGGAGCTG TACGCCGACC TCGCGGCGCG GAAGCCGGGC
GAGGAGGCCG AGGCGGTGGC GGCGTACCGG CAGCTGCTGC GCGGCGGGAG CCGCACGCAG
AAGCCGGCCG CGGCGCTGGT GAAGCTGCAC GCCTCGCGGC GCGAGTACGA CCAGGCGTAC
TCGGCGGCGC AGGTGCTCGT CCACCTGCTC GGCGCCACCG ACGGCGAGGA GGTCCAGGTG
GTGGCGCGGC TGCGCAAGTT CGCGCGCGAG CAGGCGAGCC GCCCGCTCGA CGACGCGCTC
TGGGCGCTGC TGCTCCACGA GCGGGTGAAG GGCCCGCTCG CCGACATCAT GACGCTGCTG
GCGGTCCACG CGCGCCCGAT GTTCCTGCAG CGCGACAAGG ACCTCGGGCT GAACCCGAAG
AAGGACGAGC TCGACGTGCA GGGCTCGATG CTCTTCTTCG CCAACATGTT CAAGTACGTG
GAGCGCACGC TCGGCTTCCG GGGCCTGCGG CTGTTCCGGA AGTCCGGCGC CGCGGCGAAG
CTGGCCATCG TGCCCACCGA CCCGGCGGGC CTGGTCGCCT CCGACGAGCT GTTCGAGGAG
GCGCCGAAGA AGGAGCTGTG GTTCGCGATC GGCAAGGCGA TGGCGTTCGC GCGGCCGGAG
CTGTACCTCG CGCGGCTCAT GCCGCACGAC CAGCTCGACC TGGTGTTCCA GGCGGCGTGC
TCGGTGGGCA CCTCGCGCTT CGTGGTGACC GCCGATCCGC ACCTGGTGGA GAAGCTGAAG
CGCGAGCTGG AGCGCACGCT GCCCGAGGGC GTGCGCAAGA ACACCCTCAA GCTCCTGGCC
CGGAGCTACT GTGAGGTGCA GCACCCGGGC GACGTGCGGT CGTACCTGGA CGGCGCGGAG
CTGACCTCGA ACCGCGCGGG CGCGCTGCTC GCCGGCGACC TCGAGGTGGT GCGGCGGGCG
GCGGCGGCGG AGAAGCCCCA GGTCTCGAAG CTGCGGGAGG AGACCCGCCT GCGCGACCTG
GCGACGTTCT GCGTCTCCGA GGAATATGCG ACGCTGCGGG AGAAGCTCGG CCTCTCCTGC
GTCGTGCCGG CGTGA
 
Protein sequence
MSAPSLDEQI RSHVESLEAA PGDDEAFHAL VGLYERGARW EELIALYEGR ARHAPGPGPL 
LAEAATLAHR KLRNVARAED LYRQLLQLEP AHPVALRALV ELLEERGDWA GVAAALEREA
TATADRAAAA AVTLRLGKVQ EERLGRRDRA ALLYARAHRL DPSLEEARAR AMECFGALRR
FAQAKRLLDG ARDAGADPRA LAAEYARLGA ALVDEPLDHA LATDALIDAL ALDRAAPGAA
AARERLRGFP RAWREEAGRL EAEATRASDR RTAAALQLRL AQIHAAYDPE GAGRAVERIE
RAWALAPGAP AALELLERVL AERGDHRGHA DALARLATQT RDRAAQVALH LEMARVDLVR
FGDAESALAA LRRALELDPA CEPAALQAFE HETDAGRFAE ALDVLERHLA AAPAKPAHAP
LRVRAAAIAR EKLGDAGRAR RHLEAALRAD PGHAPAAAAL APLLAEAGEW QPLARVLELS
ARRERSPGER VRLLERLAEI QQERLGRPRD ALRTLAGALA LDPARPATRK AMEGAAARAD
AFAELARAYR TAAAAANGDL KARKTLLRRV AEILDRDLGQ PEEAVRAWRA LVEVDPEDRG
ARAALESCLA RAGQQQELAR ELEARREHAA GDERRALGAK LARLWAEAGE PDRAGAAWRA
VLAEGPEDDE ALWGLHAALE AQGGAAAAEE RIGILARLAA RARGPAERAV IELARAEALA
EPLGRHADAA GVALAVVEAG GLSPPQRAEA VALLERLLAR GAEPLRVAQA LARAHAAAGD
ASRQAAMLER VAKELPASAD PRERARHLLD ASAVRAERLG DRRGALSAAA AALRACPDHA
EARARCEALA REVGAHRELL ALLVEVAGRL SGRPEEEAAL RIRAAATAEE DLGAFDDAAA
QLRRALELRP GDPAVLAGLT RAALASERWA DADRLLAQRA AAATGAEQVA LLAQRAEVLQ
ERLGDPAAAA EACRTALARC APEQRARLLA RLAGALGAAG DEAGRAEALG DLSAASPEPA
EATRAALESA RIRAGMGDAR AAVERLTAAL RASPDDAAAL AALEEQLGAE DPAAALMAAR
ALAGQADPRR RLRALEAEAR AHPEPGGRAA AHRAAARVAE QELGQASLAF AALAAAAREL
PGDADLRAEL RRVAGEAQEW EACARVHDAL VDAVPASGRL AVLRERAELA ERRLDRDRAA
AAWAEVAGAA PGDRDALAAL RRLHRARERW GELADVCAAL AAAPEAAPAA REDALREEAA
VAEARLADPA RAAAAWGEVA ALAPDDAEAA AALERLYQRL DRPEALAALL ERRLARAFDA
DAAARLAELR RFRLGDPAGA LALHAELLRR DPSRADVRDA LAELAAVPGA VGREALDAAD
ASLREGGEHA RRVAAREARL AAVEDRGERA RLHAELRAIL ERDLGEPGLA WVAACRAFAE
GGPSRAGAEE DLARLARETG AEDELPDVYE QAAAAAGPEE RLALLRQAAR LREARVGGRG
AVDAWNAVLA LAPDDAEALE ALAALHEAAR SAREILEVAR RRAALAEGEE RIGHLLHAAV
VADELGDGGV AAEAYRAVLE EAPDRIEALE GLARVLERAP GAEPGPATAE LLQVLEALAR
ACGADPDRRV AALLRRAARL ERDPDPRRAV EGYAEVLAER PREPQAVAGL ERLLKRPDAR
EGAARLLEDV LRTAGDAGRL AALLEVRLEG ADEAERAPLL AEIAALHERL GDRRRAFEAR
VRELADAARA GRDAPSARAD LERLAAATGA WAELAEALRV ALAAGLPARA ALEARRRLAA
VCADRLGDLA EAARQYEEVA AAAVSPETLG ALARVYRRMG AHRELAVTLS RLAEVAPAAA
ARKELLLEVA KIMAEQLSDR EGAVDAYRKI LAVDPEDPQA LRLLGRLLGA AERWEELVQI
LDREVALADR QPNFVAEAAE LRFRLGRIRH QRLADAEGAL VAYREVLHRV PRHPAALSAL
EELARGTGPA ALEAALLLEP VYAAEGEHGK VVETLEARAA NETEPARRAA LLRRVAETYG
GPLRNAEMAF LAASRALAAD PDAPESLELA VRHAQAAGLG DELAALLEEH ADRAREPVAR
AEYQRRIARL AHGEPARAAA AWQKVLDLAP DDREALVGLI DALEAGADPE ALAQALRRGL
AMEELAEGRA HLLRRLAAVQ DERLGDAAGA IQSLKRLLEL SPDDREALGR LDRLCVKAER
WVDLGDVLAR EIAAAAEAGD ANVLGAVRQR LAELKENRLL DREGALELYE EVLRARPDHP
EALARLEAML QKDPGNARAA VALERAYAAA GDPLRQAAVL EQRAGERPDP QERKALYLAL
AELREKGLGD PSLAFLALCK AFREDPADPA LRARMEALAA RSGHEEELAA IYEDELDRLP
PSDTAQVALR LGALYEEHLA EPARAAQFLR RAAALDPAAA PAALPALERI YQKLESWPDL
ADALSSLASS AHGAERVQLL FRLGQLCEER LAAPDRAAEA YEAAVAADPR HVPSLRALEA
LYDGAGRRED LFQNLAAQRA AAQEPAARER VLARMAALAA ELGRLDEAVA LWKELLGIRP
RHEAGLAALE DLYERLERWQ DLAQHLRLRV SATVDRREIA RLNDKLGHVL GTRLGDAAQA
VQSYKAVLES DPRNRRALEA LRDIHAAQGD QDALVSVYRR LVPLQEDAAG VKRVRLELAD
VLLRAGQKRE AVEQAKLAFD IEPHTAGDLV RIEETFRQGG AAQDGVRAAE ARAALLAAEG
GPAEAVPAWL AVADLWRAQK RNDAAAAALD KVLELDPANR TAYEQLRALH ETAGNWRALA
RVCDLFAPHL PDPAEKLALL KEVAGVHEKR LGQKEMSFLG WCRALAEGPG DAEALAEAER
LAAETEAFDE LAAVLEQVAE DAKGMVRARL LLRLGKVRDE RLDAPEEAEA AYRKALEADP
ASPEALDALT QLFKRRGRVR DLVITLEQKL EAAAGLEEKK ATLLEMARIY DGELHDVEEA
VSALRRVLEL DGADPAALEA LSELLRREQR WADLAGVLAR ARDLSASDEA RIAYQLQIAS
LHENEIGDDE AAVEGYRTVL GLDDRNADAL AGLERLYTKL DRFAELNRVY ERLIALTQDP
REQVRVLSRS AAIHDEKLRD PRSAIEKNEA VLRIDGANAV AIKNLERLYR DEALWDRLIS
VMQHHVSLVQ DRREQVTLEV AIGEIWWKEL QRVDRAEAML SHALQLDPDS RQAVSALGRL
YENSGNWNLA LDMLRREARV AGGSRDAVDL QVRMGAIFED MLMDVAGAKE AYGRALQLDP
GNLPALRALK GIAERERDRD RYLELLVDEA RYATDVQEKT ERYTEAARVY QEERDDRESA
ARYYEEALKR TPGHLDAARP LSDIYVAQAR WADAERVLDV IVGVLDAGGD ARELCRQCYR
QGYVAEKLGR MDKALASYRR AYELDATYLP ALEGLGNLLV RREEWDEALR IFTAVIIHHR
DGLTDLEVVE THWQIGEIAA KLGQLDRAAN AFRKALEIDT NHEPSRRSLV RVLEAVGDWE
GAVDQRQRLL PLLEGKARFD DFVAIGEACR DKLQDPYQAI DAFLGAARID PTSLPVTEAL
LGLYRETRQG QKAADVLGQM VARPEVQADP VRAARLHLAR AEVLRDEVKD EDAALAELER
ALDRNPRLVQ AFAAIEDALT RGKRWQELEQ AYLRMIQRLP KTPDVAQARL ALWKTVGELY
RNVLRNDDGA RLAYQVVAKA DPDDAVSLEL YADLAARKPG EEAEAVAAYR QLLRGGSRTQ
KPAAALVKLH ASRREYDQAY SAAQVLVHLL GATDGEEVQV VARLRKFARE QASRPLDDAL
WALLLHERVK GPLADIMTLL AVHARPMFLQ RDKDLGLNPK KDELDVQGSM LFFANMFKYV
ERTLGFRGLR LFRKSGAAAK LAIVPTDPAG LVASDELFEE APKKELWFAI GKAMAFARPE
LYLARLMPHD QLDLVFQAAC SVGTSRFVVT ADPHLVEKLK RELERTLPEG VRKNTLKLLA
RSYCEVQHPG DVRSYLDGAE LTSNRAGALL AGDLEVVRRA AAAEKPQVSK LREETRLRDL
ATFCVSEEYA TLREKLGLSC VVPA
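
The gene sequence above begins with TTG, while the protein sequence begins with M. This is consistent with TTG acting as an alternative start codon under translation table 11. The sketch below translates the first 30 and last 15 bases of the gene sequence to illustrate the correspondence. It is a simplified illustration, not the database's own method: the function name `translate`, its `first_codon_is_start` parameter, and the reduced set of start codons are all assumptions made for this example.

```python
# Standard codon-to-amino-acid assignments in TCAG order
# (the amino-acid assignments are shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a reduced set of start codons that are read as methionine
# when they occupy the first position of the coding sequence.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate DNA, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_codon_is_start and codon in ALT_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence above.
first_30 = "TTGAGCGCAC CCTCCCTGGA CGAGCAGATC"
last_15 = "GTCGTGCCGG CGTGA"

print(translate(first_30))                             # MSAPSLDEQI
print(translate(last_15, first_codon_is_start=False))  # VVPA
```

The two results match the first ten and last four residues of the protein sequence listed above.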