Gene Achl_1746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1746 
Symbol 
ID7293206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1963184 
End bp1973782 
Gene Length10599 bp 
Protein Length3532 aa 
Translation table11 
GC content69% 
IMG OID643590156 
Productamino acid adenylation domain protein 
Protein accessionYP_002487816 
Protein GI220912507 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000653127 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCACA ACGCACTGAA CTCCCCTGCC CGCCCCGCGT CTCCGCGGCT GGACCTGACC 
CAGGCCCAGC GCGGCATCTG GTACGCGCAG AAGCTGGCAC CGGAGAATCC GATGTTCCAG
ATCGGTCAGT TTGTTGAGAT CACCGGGCCC TTGGACCACG GTGTCCTGGC GGCCGCCCTG
GAGCAGGGGA TTGCGGGCAC CGAGGCCCTC ACCATGGCCT TCAGCGAGGA CAAAGACGGG
CCGTTCCAGT ACCCGCGTCC GCGGCCGGCC ATTCTTGAGG TCACCGACCT GGCAGGGGCC
ACAGACCCGG AAGCACAGGC CCGCGCCCTC ATGGACGCCG ACCTGGCGCT CCCCCGGGAT
CCCTCATCGG ACCATATGCT CCACACGGAG CTCATCCGTC TCGCCCCCGA ACGGCACTTC
TTCTACCAGC GCGTGCACCA CCTCCTGTTG GACGGCTATT CCGCTGTCCT GGTCCTGCGC
CGGGTGGCTG AGCTCTACAA CACCATCCTG GCGAACGACG GCGGCGGACA GGCCACCGGG
GAAGGTGCGG GTTTTGCGCG GTTCGCGGAC CTCCTGGCGG CTGAGGCCGG TTATGCCGGG
TCTGACGGCG CCGCAACAGA CCGGGCCTTC TGGGACGGCG AGCTTGCCGA AGCGCCTCCT
GCCACCGGCC TGGCCGGACG TCCGAACGGT GCCGCGGGGT CCCTGATCCG GGCCTCGGTG
GCCCTCCCGT CCGCCGCGGC CATGGCCCTC GAGGGCGCGC CGGCGAGCGC GCCGGCGCTG
GTCCTGACCA CCGCGTCACT GTACCTGCAC CGGATCACCG GTGAGCGCAC CGTCTCGGTG
GCGCTGCCCG TCACGGCCCG GCGTGGACAC CTGGCTAAAT CAACGCCGTC AATGCTGTCG
AACATCGTGC CTGTGAAGCT GACCGTCGAT CCCGGCGAAA CCGTCGCCGC CACCGCCGCA
GCCGTGGGAC GCACCCTCCG TGGTGCCCTG GTCCACCAGC GCTTCCGCTA TGAGGACCTG
GACACCGGAG GCGGCTACCT GGGCCCAGCG GTCAACATCC TGCCCGTCCT GGACACGATC
GCTTTCGGCG AGGCCCGGGG CGCGATGAAC ATCCTCTCCA CCGGGCCAAT TGACGATCTC
TCCATCGTGG TCCACGGCCT CGGCGCCGGC GGTGCAGCAC CGCAGCCCAC AGTCCAGTTC
GAAGCCAACG CGGCCATGTA CACGCAGGAT GTGCTGGCCG GCCACCTTGA CCGCTTTGTA
CGCCTGCTTG AACACGTGGC CACCCGCCCC GACGCGGTGG TGGCTGACCT GGCGGTCACC
ACCCCGGCAG AGGAACAGCA ACTGCTCGCG GCGGGCCAGG CTCACGGCCA GCACCTGCCG
GGCCACACCA TCGTGGAGGA ATTCCAGCAG AACGCGGCGG CACAGCCTGA TCAGACGGCC
GTCGTGGCCA CGGAGGGCAC CCTCACCTTC GCTGAACTGG AGTCCCGTTC GAACCAGCTG
GCGCGGTTCC TGGCCGGGCA CGGTGCAGGG CCCGGAAGCA CGGTGGCGGT ACGGCTGGAC
CGTGGCCTGT TGCTGCCGAT AGCGCTCCTG GCGATCCTCA AGTGCGGCGC CGCCTACCTT
CCGCTGGATC CGGACTACCC TGCCGGCCGC GTGGAGGGCA TGCTCCACGA TGCCGCTCCC
CTGAGGATCC TCACTTCAGC AGCGTTTACG GCGGCAGGGT CCGCGGAGGG GAACCCGCCG
GCACCCGGAC ACGAGGAGCT GGCCACGGAC GTCCCGGTCA CGGTCCTGGA CTCTGCCCTG
ATGGAGGCCT GCCTCGCCGG CAAGGACGGG TCGGCTCCCC CGCCCGCCGC CGGCCAACAG
GATCTCGCGT ACGTCATTTT CACCTCCGGC TCCACGGGCC GGCCCAAGGG CGTCGGTGTC
GAACACCTGG CCCTGCTGAA CCTCTACACC TCGCACCGCG ACACCATCTT TGCCCCGGCG
GAGGCGCGGC TGGGCCGCAA GCTCAGGGTA GCGCACACCG CGGGGCTGTC CTTTGACGCG
TCCTGGGACC CCATCCTGTG GCTGGTGGCA GGCCACGTAC TTCACCTTGT GGACAACGGC
ACCCGCCGGG ATCCTGAGGC CCTCAGCGCC TATCTGGCCG AGGCCGGCAT TGACTCCATC
GAGACCACGC CCTCCTTCGC CAAGGTACTG GTGGCCGGCG GACTGTTCGA CCAGGAACGG
CACCCGTCCG TGGTGGCCCT GGGCGGCGAG GCTGTAGACA CCCAGCTGTG GGAAGACCTG
GCAGGCCGCA ACGGCGTTGT CGCCTACAAC TTCTACGGCC CCACCGAAAC TACCGTGGAC
TCGCTGACCG CGGTCATGGA AGCCGGCACC GCACCCACCC TGGGCGGTTC GGTGGCAAAC
ACCCGCCACT ACATCCTGGA CTCCGGCCTG AACCCGGTTC CGGACAATGC CGTGGGCGAG
CTCTACGTTG CCGGCATCAA CCTTGCCCGC GGTTACCTGG ACCAGCCCGG GCTCAGCTCC
GAACGATTCG TGGCGGATCC TTTCGCACTG GACGGCTCGC GGATGTACCG CACCGGAGAC
ATTGTGCGCC GCCGGGACGA CGGCACCCTT GAATTCCGGG GGCGGCTGGA TGGCCAGGTC
AAGATCCGGG GGTTCCGCAT TGAACCCGCC GAGATCGAAC AGGTGCTGCG CGCCCTGCCC
GGCGTAGACC AGGCAGCCGT CACCGTAGGT ACCAACCGGG CCGGCTACGA CCAGCTCCTG
GCATACGTCA CGCCAGCGTC CACGTCCGAT GGCGGGCCGC AGCATCCGCT GGACACCGCC
GACCTTCGGC GGCAGGCACG GCGGCACCTG CCCGACTACA TGGTGCCGGC CACCGTCACC
AGCATTCCTG TCCTGCCCCT GACACCCAAC GGAAAACTGG ACGTCCAGGC CTTGCCTGTA
CCGGAACAGG ACACTTCGGC CACTACCCCG CGCAACGAGC CCGAACGGAT CGTGGCCGCT
GCGTTCCGTG ACGTTTTGGG CCTCGAGTCG GTGGGCCTGG AGGACGACTT CTTCGACCTG
GGCGGCCACT CACTGCTTGC CACCCGGCTG GTTGCCCTGC TGCGGGACCG CACCGGCACG
GCTCCGGCAC TCCGCACCGT CTTCGAAAAG ACAACAGTGG CGGCCCTCGC CGAAACCCTG
GACCTGGGCA CGGCTCCGGC CCGGCCGCTG GCTGCCGTCA CCCGGCCGGC AGTCCTGCCG
CTGTCCTTCG CCCAGCGCCG GCTGTGGTTC CTCAACCGGC TGGATCCCGC GGCCGGGACG
TACAACATCC CGGTGGTCCT GGACCTGCGC GGGCCCCTGG ACGTCGCGGC CCTGGCTGCG
GCCCTCGGCG AAGTAACTGA CCGCCACGAG ACGCTCCGCA CGGTGTTCCC CCTGGTTGAC
GGCGAACCCG CCCAACGCAT CCTGCCGCCG GCAGAGGGGC GGCCATCCTT GGTTGCCGTG
GAGTGCTCCA CCACCGGGCT GCCGGACGCG CTTGCCGCGG AAACGGGGCG CGGGTTCGAC
CTGGGCCGGG ACCTGCCGTT GCGGGCGGTC CTGTTCCAGC TGGCGCCGGA CCACCACGTC
CTGGCCGTGA CCCTGCACCA CATCGCAGCC GACGGCTGGT CCCTGGCACC CCTGGCCCGG
GATCTGTCGG TCGCGTACAA CCACCTCGCC GCGGGCACGG ACGTGGAGCT GCCGCCGCTG
CCCGTGCAAT ATGCCGATTA CACGCTCTGG CAGCGTTCCG AACTGGGCAG TGAGTCGGAT
CCGGAGAGCC CCATCTCGCG GCAGCTGGAG TTCTGGGCAC GCGAACTGCG CGGCGCACCG
GAGGAGCTGC TGCTGCCCTT TGACCGTGCC CGCGGCACCA GCGGTGACGC TGCGCACCCG
GACGGCGGTC CGGCGTCGTC CGTTTCCCTG AACATTGGCC GCGAAACGGC GGAGCGGCTC
AACACCCTGG CCCGGGAGCA CAACGCCAGC CTGTTCATGG TGCTGCAGGC AGCCTTTGCC
GCTTTGCTCA CCAAGACCGG CGCCGGCGAC GACATTCCGT TGGGCACCCC CGTGGCCGGG
CGGACGGACA CCCAGCTGGA CAGCCTGGTG GGCTTCTTCG TCAACACGCT GGTTTTGCGC
ACCAATACCT CCGGCAACCC CACCGCCGGT GAACTCGTGG AGAGCGTCCG CTATACCAAC
CTGCATGCCT ACGCCAACCA GGACGCGCCC TTTGAGCGGG TGGTGGAGGA GCTCAACCCT
TCCCGCTCGC AGCACCGGCA CCCGCTGTTC CAGGTCATGC TGACCCTCCA GAACACCGCC
ACCACTCCAC TGGCCATGGA CGGCCTGGAA GCCACGGCGG ACCAGGCCCG GGAAGCGGCC
GGCGCAAAAT TTGACCTGCT CCTGGACCTC GCCGAAAGGT CCGGCGGCCT CACCGGTTCC
CTCGCCTACG ACCCCGCCCT CTTCAACGCG GACACCGCTC AACTGCTGGC CGACGGCTTC
CTGGCCGTTG TGGACCAGTT CGCCGCGGAC CCCGCCGTCA CCTTGGACCG GCTGCGGATC
CAGTCGCCTG AACAGCATGC CCTGGTGCTG GCGCACAGCA CCGCCCACAC CTCCGCGGAC
AGCTACGGGG ACGGGACTGG TCAAACGGTC GTGGATGCCT TCCTGGCCAC CGCCGCCAGG
ACCCCCTCCG CGCCCGCCGT GGTTGACGCA GGCGGTCCGG CCGCATGCCT CACTTTCGGC
CGGCTGCGGC AGCGTGTGGA GTCTGTGGCC AGGGGCCTGG TGGCGCTGGG GGTCCAGCCC
GGCGACAGGG TAGCCGTGGC CCTTCCCCGC ACGGCCGATG TGGCCACGGC CGCACTGGCC
GTCCTTGCTG CGGGAGGGGT CTACATTCCG GTGGACCTCA CCTATCCCGC GGAACGGATC
GCCATGATCC TCGACGACGG CGGCCCCGCC CTGGTGCTCG CGGCACCCGA GGGCCCGGGC
CAGGCGGGCC CCGTACCGGC CGCCCGGACA GTCACCCTGG AGATGCTGAT CGCCGCGGGG
ACCGGCATCG CTCATGCAGA CCTGGCCGGG CGACGGCCCG CTCCGGACGA CCTCGCGTAC
GTCCTCTTCA CCTCAGGATC GACCGGCCGG CCGAAGGGTG TTGCCGTATC CCACGGCGCA
CTGGCAAACC TTTACTCCCA CCATCTCCGG ACGCTCTATA CCCCGCGGTT CGAGGCAGCC
CGGGGCGCCA CGGTGTCCGT GGCCCACATC GCCGGACTTG GATTCGATGC GGCCTGGGAC
CCCATGCTGT GGCTGGTGGC TGGTGCCGAA CTCCACATTG TGGCCGACGA CGTGCGCACC
AACGCCCAGG CGCTGGCCTC CTACTGCCGT GCCCACGCCA TCGGAGTCCT GGAAACCACC
CCCACCTACG CTGGCCAGCT CCTGCAGTTC GGCCTCGGTG CGCAGCCTGC TTCTCCAGCC
GCAGATACCG GGCAGCCACT GCCGCTGCTG TTGCTGCTGG GCGGCGAAGC CGTGCCGGCC
GGGCTGTGGG CAACCCTGTC CGGCATGCCC GGCGTGGACG CCTGGAACTT CTACGGGCCC
ACCGAATTCA CCGTGGATTC CTCCACCGCG CGGATCCAGG GTTCCCGGCC CACCATCGGC
ACGGGAATCG CCAACACCGG CACCCTGGTC CTGGACCAGC ACCTGGCCCT TGTACCCCTG
GGTATCCCGG GAGAGCTGTA CCTGGCCGGA CCCGGAATGG CCCGGGGCTA CCACCAGCGC
CCCGGCGAAA CAGCGTCCCG GTTCGTGGCC AACCCCTACG CCAAGGACGG CAGCCGCATG
TACCGCACCG GCGACCTGGT GCGGCGAACC GGCGACGGCA CGCTCGAGTT CGTTGCCCGC
AACGACGAAC AGGTGAAGGT CCGCGGATTC CGGGTGGAAC CGGGTGAGAT CGAGGCTGCG
CTTGCGGCCC ACGATCAGGT GGACCGGGCC GTGGTCCTTC CCGACGGCGA ACCGGCCCAA
CGGCTGATCG CCTACTACAC CGGAACGGCT GCACCGGAGG ACATCCGGAC CCACGCGGCG
GAACGGTTGC CCGACTACAT GGTCCCGGCC ATCCTCATGC CCATCCCGGA CATTCCGCTG
ACACCGCACG GCAAGCTGGA CCGGAAAGCC CTGCCCGCCC CCGCCGTGTC AACAGCCCGG
GTGGGGGCAG CACCCCATAC TCCCGACGAG CACGCCATGG CCGCCGCTTT TGCCGCCGTC
CTGGCCGTGG ACAACGTCTC CATGGGCGAC GACTTCTTCG CCCTGGGCGG GCATTCACTG
CTCGCCATCG ATCTAATGGC CCGGATACGG ATTGCCTTCG GCCGGGAACT TCCCCTGCGC
ACCCTGTTCG ACGCCCCCAC CCCCGCCGGG CTGCTGGCCG CGCTGCGGCC CGGAACGGCG
GACACCGGGA CGGGCACCAT CGAGGCGGGA AACGCTGACG CGCCCGGCAT AGCCGCGAAC
ACTGCTACCG CCGGCGCCGA AGTCATGCCG CTCGGCGAAT GGCTGGACAC CGCCGGAGCC
ACCCGGCCGG AGCACCTGGA ACTGTCCCAC GCGCAGGCAC GCATGTGGTT CCTGAACCAG
CTTGATCCAG GCGCTGCCGA CTACAACATC TCCCTGGCAG TGCGGCTGAC CGGCGCCCTC
GACGAACAGG CCCTGGCCTC CGCCGTGAAG GCACTGTTCC ACCGGCATGA GGTCCTCCGG
ACGGTCTACC CGGAGACCGG CGGCGTCCCC GGCCAGCACA TCCTCGCCAC CGGCAACGCT
TCCTGGATGA ACCTTGCCCT CAGCACCGCC ACATCCGAGG ACAACCTCCG CGAGGAACTG
CAGGCGCAGG CAAGCCGCGG CTTCGATGTC CGCACCGAGG TGCCGCTGCG GGCCGGACTC
ATCCGGATCC ATGCCGAGGC CGGAGAATCA GCACAGGAAG CAGAACCACA ATGGGTGCTG
CACCTGGTGA TCCATCACAT CGCCGGTGAC GGCGCCTCCC TGGCGCCGCT GGCACGGGAC
CTATCCGCTG CGTACTCAGC CGCGCTCGCA AGCGCAGACG GAGAACCTGC CACGCAGGCC
CTTGCTCCCC TGCCCCTGCA GTACGCCGAC TTCAGCGCCT GGCAGCGGCA GCAACTGGCC
GGTCCTGCCC TGGCGGAAAA GGTTGGCCAC TGGCGGCACG CCCTCGCCGG GCTCCCGCCC
GAGTTGATGC TGCCCGCTGA TCACCAGCGG CCCCGCACCG CACGCCAACC GGGGGGACAG
GTGGGCTTCC GCGTGTCCCC TGAAAGTGTG GCGGCGCTCT CAACCCTCGC GTCGTCGTCG
AACGCCAGCC TGTTCATGGC ACTGCACGCC GGCCTTGCCG GCTACCTGCT CCGGACCGGC
GCCGGTGAGG ACCTGGTGAT CGGATCGCCC ACAGCGGGCC GCGCCGATCC GCTGCTCGCC
CCGATGGTGG GCTTCTTCGT GAACACCCTG CCGCTGCGCG TCAACGCCGC CGGCGATCCT
GGCCTGCGCA CCCTGCTGGA CCGGTCCAGG GCCAGCATCC TGGATGCCTT CGACCATGAC
GATGTGCCTT TCGAACGCCT CGTGGAAGCA GTGGCACCGG AGCGCGAACT GGGCCGGCAT
CCCCTGTTCC AGACCATGCT CACGGTGGAC AGTGACGTGC CGGCGGTCCC CCAGCTTCCC
GGCGTCACCG TGGTTCCCGA GCCGGAGGCT GGTACTGGAG AAGCCAAGTT CGATCTGTCC
TTCACGCTTC GGCCGGACGC CGAAGGGGGC CTTGCCGGAA CGCTGGATTA CAACTCCGCC
ATGTTCGAAT CAGCCACCGC CCAGCGCCTG GCGGCCGGTT TCACCCGGTT GCTGGACCTG
GCCGCTTCCG CTCCGGACGT GCCCTTGTCG GCGCTGCCGC TGCTGGAAGA GCACGAGGCC
CGTGACCTGG TTGCGGCGAC GTCGGGCTCC CACGCCGGAA CTGCAGCCCT GGGTGCCGGG
CCGGCACCCG GACAAGCCCC GGGGCATGCG GACCAGGACA TCCCGGCGGC GTTCGCTGCG
GCAGTCCGTG CCACGCCTGA CGCCGTGGCC CTTGTGTCCG ACGCCGGTTC GCTCAGTTTC
GCCGACCTTG AGGCGTCTGC CGAACGTGTC GCCGCCGGGC TCACCGCAGC CGGCGCGGGC
GGCGGCCTGG TGTCCGTCAT GCTGCCCAGG TCCGCCGGCA CGGTCGAGGG CCTCCTCGGC
GTATTGCTGG CGGGCAGCGC CTACAATCCC ATCGATACCG CGTATCCGGA CAACCGCGTG
GCCGCTATCC TCGAAGATGC CGCTCCCGAG GCAGTGCTCA CCAGCTCCGC CGTGGCGCCC
AGGCTCTCCG GCATCCTGGC GGGGCTGGAT CTTGAACCCC GCCTGCTCCT GATCGAAGAC
CTGGCCCACA GCACTGCGCA GGTGCCAATG CCAAACCGCG CCAGGGACCC GCGTGAGCTT
GCGTCGGTGA TGTTCACCTC GGGCTCCACC GGCCGGCCGA AGGGCGTGGA GGTCAGCCAC
GGTGCCCTGG CAGCGCTGCT GGCCTCCCAC CGGGAGACGC TGCTTGCCGG CATCAGCAGG
CGGAAGGTGG CCCACACAAC GGGCGTCGGC TTCGATGCAT CGTGGGATCC CATCCTCTGG
CTGGCGGCCG GGCATGAACT GCACCTGCTC TCCGACGACC TCCGCCGGGA CCCATCGGCC
CTCGCCGGGT ACTTCGTCCG CCAAGGCATC AGTGCCTGGG AGACAACTCC GGGTTACCTG
CACCAACTCC TGGCCGAACC CGGGTTCGCC GCCCACCTGG CCAACCACCC GCGCGGTCGG
GAAGCCTTCA GCCTGGCCTT GGGCGGTGAA GCGTTCGACG CCGGCCTGTG GACGTCCGTG
GCGGCGCTTG CCGGCGTCCG GGCCTGGAAC CTCTACGGCC CCACCGAAGC CACGGTGGAC
ACCGTGGTGG CTTCGGTGGC CGACAGTGAT GCCCCCGTGT TGGGCCGGCC CACGGCCCGT
ACCCGGCTCT ACGTCCTGGA TGACCGGTTG CAGCACGCCC TGCCCGGCGC AGCCGGTGAA
CTGTATATCG CAGGGGCGCA GTTGGCGCGG GGCTACCGCG GACGTCCCGA CCTTACCGCT
GAACGGTTCG TGCCGGATCC TTTCCACGGC GGCGGCGAAC GGATGTACCG GACCGGTGAC
GTGGTGTACC GCCACCCGGA CGGGCGGCTT GTCTTCGCCG GCCGCAACGA CGGGCAGCTG
AAGATCCGTG GTTTCCGGGT GGAACCTGGC GAAGTGGAGT CGGTCCTGAG GGCTGCCCCG
GGCGTCCGGG CCGCAGTGGT CCGGGCAGTC GGTGAAGGCA GCGCAGCGCG GCTGGCGGGT
TACGTCGTCG CGTCAGGTGA CGCCGCGGAA GACCTGCCCG ACGCCGTGCG CCGCCATGCG
CAGGCGGCAC TGCCGGACTA TATGGTTCCC TCAGTGGTCA TGGTGATCCC GGAGGTGCCG
CTGACGCCCC ACGGCAAGGT GGATGCCAAC GCGCTCCCCG ACGCCTCCAC CGCGGGACGC
TCCGGGGGCC GTGCGCCCCG GTCCCCGCAG GAGAAGACGG TGGCCGGCAT CTTCGCCGAG
GTGCTCACCC TTGACCGGGT TGGCGTGGAC GAATCGTTCT TCGAACTGGG CGGCCACTCA
TTCCTGGCGC AGCCGCTGGT CGCCAAGGTC AACGCTGCCC TGGGCACGGA CCTGCAGGTC
CAGTCGCTCT TCCGGGCACC CACCGTGGAA GGGCTGCTGC TGGAAGCAGC CAACGGCTCG
CGCGCCAGCG TCGCGGACAG CCTGCGGCAG GTGCTTCCCC TACGGACAGC CGGGTCCAAA
GCGCCGCTGT TTGCCGTGCA CCCGGCGTCG GGCATCGGCT GGGGATTTGC TTCCATGCTG
GGGCAACTGG ACCCCGAGCG CCCCCTGATC GGGCTGCAGA TGCCGGGCAT GGAACCGGGG
CGGACTTCTC AGGTGGAGGC GGCAACCCTG ACGGAACTGG CCGATGACTA CATTGCCCGG
ATCCGCAGCG TCCAGCCCGA AGGGCCGTAC CATCTGCTGG GTTGGTCATT TGGCGGCTAT
CTGGCCCACC GGCTGGCCAC CAGGCTGCAG GAACGCGGCC ACGAGGTGGC TTTCCTGGCC
ATCCTGGACG CCTTCCCGGA CAACCAGGAA GGAAACACCG CGCCCGGCGA AGACCAGGCG
CTGTGGGCGA GCTACCTGGA AGCGCAGGGC TACGACCTCA CCGGGGAAGA CCTGGCAGGC
GTGGACGTGC ACCGCGCCCA GGAGATCCTG CGCGCCAACC ACAACCCGGT AGGAACGGTC
CCCACAGATT CGGTGGAGGC CATGGCCCGG AACTTCCCGA TCCTGGCACG CCTGATCAGG
GACGAACGGC CCCAACTCTT CACCGGTGAC CTGCTGTTCT TCCGCGCCAC GGAGCAGGTC
CCGCCGGGTA CTCCGGGCAG CGCTTCCTGG TCCCCGTTCG TCACCGGGGC CATCACCGAT
GTTCCTGTTG GCGAGCGGCA CTCCCAGCTG CTCAGTGACC GGGCTCTCAG GACAATCATG
CCTGCCTTGG CCATCCGCCT GGGCGGCGGA ACCGAATAA
 
Protein sequence
MSHNALNSPA RPASPRLDLT QAQRGIWYAQ KLAPENPMFQ IGQFVEITGP LDHGVLAAAL 
EQGIAGTEAL TMAFSEDKDG PFQYPRPRPA ILEVTDLAGA TDPEAQARAL MDADLALPRD
PSSDHMLHTE LIRLAPERHF FYQRVHHLLL DGYSAVLVLR RVAELYNTIL ANDGGGQATG
EGAGFARFAD LLAAEAGYAG SDGAATDRAF WDGELAEAPP ATGLAGRPNG AAGSLIRASV
ALPSAAAMAL EGAPASAPAL VLTTASLYLH RITGERTVSV ALPVTARRGH LAKSTPSMLS
NIVPVKLTVD PGETVAATAA AVGRTLRGAL VHQRFRYEDL DTGGGYLGPA VNILPVLDTI
AFGEARGAMN ILSTGPIDDL SIVVHGLGAG GAAPQPTVQF EANAAMYTQD VLAGHLDRFV
RLLEHVATRP DAVVADLAVT TPAEEQQLLA AGQAHGQHLP GHTIVEEFQQ NAAAQPDQTA
VVATEGTLTF AELESRSNQL ARFLAGHGAG PGSTVAVRLD RGLLLPIALL AILKCGAAYL
PLDPDYPAGR VEGMLHDAAP LRILTSAAFT AAGSAEGNPP APGHEELATD VPVTVLDSAL
MEACLAGKDG SAPPPAAGQQ DLAYVIFTSG STGRPKGVGV EHLALLNLYT SHRDTIFAPA
EARLGRKLRV AHTAGLSFDA SWDPILWLVA GHVLHLVDNG TRRDPEALSA YLAEAGIDSI
ETTPSFAKVL VAGGLFDQER HPSVVALGGE AVDTQLWEDL AGRNGVVAYN FYGPTETTVD
SLTAVMEAGT APTLGGSVAN TRHYILDSGL NPVPDNAVGE LYVAGINLAR GYLDQPGLSS
ERFVADPFAL DGSRMYRTGD IVRRRDDGTL EFRGRLDGQV KIRGFRIEPA EIEQVLRALP
GVDQAAVTVG TNRAGYDQLL AYVTPASTSD GGPQHPLDTA DLRRQARRHL PDYMVPATVT
SIPVLPLTPN GKLDVQALPV PEQDTSATTP RNEPERIVAA AFRDVLGLES VGLEDDFFDL
GGHSLLATRL VALLRDRTGT APALRTVFEK TTVAALAETL DLGTAPARPL AAVTRPAVLP
LSFAQRRLWF LNRLDPAAGT YNIPVVLDLR GPLDVAALAA ALGEVTDRHE TLRTVFPLVD
GEPAQRILPP AEGRPSLVAV ECSTTGLPDA LAAETGRGFD LGRDLPLRAV LFQLAPDHHV
LAVTLHHIAA DGWSLAPLAR DLSVAYNHLA AGTDVELPPL PVQYADYTLW QRSELGSESD
PESPISRQLE FWARELRGAP EELLLPFDRA RGTSGDAAHP DGGPASSVSL NIGRETAERL
NTLAREHNAS LFMVLQAAFA ALLTKTGAGD DIPLGTPVAG RTDTQLDSLV GFFVNTLVLR
TNTSGNPTAG ELVESVRYTN LHAYANQDAP FERVVEELNP SRSQHRHPLF QVMLTLQNTA
TTPLAMDGLE ATADQAREAA GAKFDLLLDL AERSGGLTGS LAYDPALFNA DTAQLLADGF
LAVVDQFAAD PAVTLDRLRI QSPEQHALVL AHSTAHTSAD SYGDGTGQTV VDAFLATAAR
TPSAPAVVDA GGPAACLTFG RLRQRVESVA RGLVALGVQP GDRVAVALPR TADVATAALA
VLAAGGVYIP VDLTYPAERI AMILDDGGPA LVLAAPEGPG QAGPVPAART VTLEMLIAAG
TGIAHADLAG RRPAPDDLAY VLFTSGSTGR PKGVAVSHGA LANLYSHHLR TLYTPRFEAA
RGATVSVAHI AGLGFDAAWD PMLWLVAGAE LHIVADDVRT NAQALASYCR AHAIGVLETT
PTYAGQLLQF GLGAQPASPA ADTGQPLPLL LLLGGEAVPA GLWATLSGMP GVDAWNFYGP
TEFTVDSSTA RIQGSRPTIG TGIANTGTLV LDQHLALVPL GIPGELYLAG PGMARGYHQR
PGETASRFVA NPYAKDGSRM YRTGDLVRRT GDGTLEFVAR NDEQVKVRGF RVEPGEIEAA
LAAHDQVDRA VVLPDGEPAQ RLIAYYTGTA APEDIRTHAA ERLPDYMVPA ILMPIPDIPL
TPHGKLDRKA LPAPAVSTAR VGAAPHTPDE HAMAAAFAAV LAVDNVSMGD DFFALGGHSL
LAIDLMARIR IAFGRELPLR TLFDAPTPAG LLAALRPGTA DTGTGTIEAG NADAPGIAAN
TATAGAEVMP LGEWLDTAGA TRPEHLELSH AQARMWFLNQ LDPGAADYNI SLAVRLTGAL
DEQALASAVK ALFHRHEVLR TVYPETGGVP GQHILATGNA SWMNLALSTA TSEDNLREEL
QAQASRGFDV RTEVPLRAGL IRIHAEAGES AQEAEPQWVL HLVIHHIAGD GASLAPLARD
LSAAYSAALA SADGEPATQA LAPLPLQYAD FSAWQRQQLA GPALAEKVGH WRHALAGLPP
ELMLPADHQR PRTARQPGGQ VGFRVSPESV AALSTLASSS NASLFMALHA GLAGYLLRTG
AGEDLVIGSP TAGRADPLLA PMVGFFVNTL PLRVNAAGDP GLRTLLDRSR ASILDAFDHD
DVPFERLVEA VAPERELGRH PLFQTMLTVD SDVPAVPQLP GVTVVPEPEA GTGEAKFDLS
FTLRPDAEGG LAGTLDYNSA MFESATAQRL AAGFTRLLDL AASAPDVPLS ALPLLEEHEA
RDLVAATSGS HAGTAALGAG PAPGQAPGHA DQDIPAAFAA AVRATPDAVA LVSDAGSLSF
ADLEASAERV AAGLTAAGAG GGLVSVMLPR SAGTVEGLLG VLLAGSAYNP IDTAYPDNRV
AAILEDAAPE AVLTSSAVAP RLSGILAGLD LEPRLLLIED LAHSTAQVPM PNRARDPREL
ASVMFTSGST GRPKGVEVSH GALAALLASH RETLLAGISR RKVAHTTGVG FDASWDPILW
LAAGHELHLL SDDLRRDPSA LAGYFVRQGI SAWETTPGYL HQLLAEPGFA AHLANHPRGR
EAFSLALGGE AFDAGLWTSV AALAGVRAWN LYGPTEATVD TVVASVADSD APVLGRPTAR
TRLYVLDDRL QHALPGAAGE LYIAGAQLAR GYRGRPDLTA ERFVPDPFHG GGERMYRTGD
VVYRHPDGRL VFAGRNDGQL KIRGFRVEPG EVESVLRAAP GVRAAVVRAV GEGSAARLAG
YVVASGDAAE DLPDAVRRHA QAALPDYMVP SVVMVIPEVP LTPHGKVDAN ALPDASTAGR
SGGRAPRSPQ EKTVAGIFAE VLTLDRVGVD ESFFELGGHS FLAQPLVAKV NAALGTDLQV
QSLFRAPTVE GLLLEAANGS RASVADSLRQ VLPLRTAGSK APLFAVHPAS GIGWGFASML
GQLDPERPLI GLQMPGMEPG RTSQVEAATL TELADDYIAR IRSVQPEGPY HLLGWSFGGY
LAHRLATRLQ ERGHEVAFLA ILDAFPDNQE GNTAPGEDQA LWASYLEAQG YDLTGEDLAG
VDVHRAQEIL RANHNPVGTV PTDSVEAMAR NFPILARLIR DERPQLFTGD LLFFRATEQV
PPGTPGSASW SPFVTGAITD VPVGERHSQL LSDRALRTIM PALAIRLGGG TE