Gene Mvan_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1003 
Symbol 
ID4645788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1038535 
End bp1049625 
Gene Length11091 bp 
Protein Length3696 aa 
Translation table11 
GC content71% 
IMG OID639804504 
Productbeta-ketoacyl synthase 
Protein accessionYP_951847 
Protein GI120402018 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase
[TIGR00517] acyl carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGATC TGCAGAAGCG GGCACTGCAG GCCATCGAGG TCCTGCGGGC GCGGGTGCGT 
GAACTGGAAG GCGCCGGTGA CGGATACGCG ATAGTCGGGT ACGCGCTGCG GTTCCCGGGT
GCCGCCGACG CCGACGAGTT CTGGGATGTC CTGCGCGGCG GGCGGGATGC GATCTCGGAG
GTGCCGCAGG ACCGCTGGGA TGCCGACGAG TTCTTCGACG CCGACCCGGA GGCCGCGGGC
AAGATGGTGA CCCGGCGGGC CGGCTTCGTC GAGGACGTCG CAGGCTTCGA TGCCCCGTTC
TTCGGGGTGT CGGCGCGTGA GGCGATGTTC ATGGACCCGC AACACCGGCT GCTGCTGGAG
ACCGCGTGGC GGGCGGTCGA GCATTCCGGG ACCGCACCCT CTGCACTGGC AGGCACCAGA
ACCGGGGTGT TCATGGGTCT GTCCACCCAT GACTACCTGG GCATGCTCAC CGACAATCTC
ACCCACGACG CCATCGAGGC CTACCTCGGC ACCGGCACCT CACCCGCGGC GGCCATCGGC
CGGATCAGTT ACCGCCTGGG GCTGCAGGGC CCCGCCGTCG CGGTGGACAC GGCGTGCAGT
TCCTCGCTGG TGGCGGTGCA TCAGGCGTGC CAGGCGCTGC GGCTGGCCGA ATGTGACGTC
GCGCTGGCCG GAGGCGTCAA CGTGCTGCTC AGCCCGGCGA CGATGATCAA CTTCTCACGC
GCGCGGATGC TGGCACCCGA CGGCCGGTGC AAAACGTTCG ACGCCACCGC CGACGGCTAT
GTCCGCGGCG AGGGCTGCGG TGTCGTGGTG GTGAAGCGAC TCGCCGACGC GATCCGCCAC
GGTGACCGCA TCCGGGCGGT GATCCGCGGC TCCGCGGTGA ACCAGGACGG CGCCTCCGGT
GGCCTGACCG TTCCCAATGG GGCCGCGCAG CAACGCGTTA TCGCCGACGC CTTGACCAAT
GCGGGCCTCA CCGCCGCCGA CGTCGACTAC CTCGAAGCAC ACGGGACGGG AACCCCGCTG
GGTGATCCGA TCGAAGTCCA GGCCGCCGGT GCGGTGCTGG GCGTGGGACG CGAGCCGGAT
CGGCCGCTGC TGATCGGATC GGTGAAGACC AACATCGGTC ACCTCGAAGC GGCGTCGGGC
ATCGCGGGCC TCCTCAAGGT GATCCTGTCC CTCGAACACC AGGTGCTGCC CAAGCACCTG
AACTTCCGCA GTCCGTCGCC GCACATCCCG TGGAACCGGT TGCCGGTGCG GATCGTCGAC
GACCCGGTCC CGTGGACCCG CAGCGGCCGG TCCCGGGTCG CCGGCATCAG CTCGTTCGGA
TTCTCCGGTA CGAACGCCCA CGTGCTCGTC GAGGAGGCAC CGCGCGCAGA CCGTGCCGGA
GCGGCGCCGC AGCCCGCGGC CCCCGGGGGA TTCGGCATGC TGCCGCTGTC CGCGCGGACC
CCGGACGCGT TGGCGGCGAC CGCGCACCGG CTCCGCACCT GGCTTGCCGC CAACCCCGAC
GCGTCCTGGC CGGATCTGTG CTTCACCGCC GCGGTGGGCC GGTCCCATCT TCGTCATCGC
GCCGCGCTGG TCGTCGACTC CCGGTCGAGC GTGGAGGAGT TGCTCGGCGC GCTCGCCGAG
GACCGTCCGG GCCCGGGACT GGCTCGCGGG GAGTGCGGGG ACCCGCCGAA AACCGCCTGG
CTGTTCACCG GTCAGGGCAG TCAGTACCCC GGTATGGGCC GGGAACTGTT CGACACCGAA
CCCGTGTACC GGGAGGTCAT GACCCGGTGT GCGGAGGCGG TTGCCGACGT CCTCGAACGC
CCACTGCTGG ACGTGGTGTT CGACGCGGAC GGCGGGGAGG CCCTGCGGCA TACCTCCTAC
GCTCAGCCGG CGTTGTACGC GGTGGAGATG GGCCTGGCGC GGCTGTGGCA GTCATGGGGG
ATCGAGCCCG ACGTGGTGCT CGGCCACAGC GTCGGGCAAT ACGCGGCGGC CTGCGTCGCG
GGTGTCATGG GTCTGGAGGA CGGCGCGGTG TTGCTGGCCG AGCGTGGACG CCTGTTCGGC
AGCCTGCCGG AGGGCGGTCG GATGGCGGCG GTGTTCGCCC CCGCCGACCG CGTCGAGAGC
TGCACCGTTG ACTCTCCGCT GCTGTCGGTG GCCGCCTACA ACGGGGACAA CACCGTATTG
TCCGGCCCGG CAGGGGATCT GGAGCGGGCA GTGGCCGGCT TCAGTGCCGC CGGCGTCCGC
TGTGACTGGC TCGACACCAG CCACGCCTTC CATTCCGCGC TGTTGGACCC GGTGCTGGAG
GAATTCGAGT CCTGTGCGGA CCGGTTCGAG TTCGGCACGC CGCAGCGGAC ATTGGTGTGC
AATCGCAGCG GTGAGGTTCT GAGCAGGCAC GCGAGGTTGG ACGGGCAGTA CTGGCGTCGT
CATGCGCGGC AGCCGGTCCG GTTCGCCGAG AGCGTGCGCA CCCTCGCGGA TCTGGGCTGC
CGGCTGCTGA TGGAGATCGG CCCGCAACCG GTGTTGACGG CCGCCGCGAT GCGGACCTGG
CCGGGGACCG TCACTGCGCC GCGCGTCGCC GCGTCGATGC GCCGCAAGGG GTCCGACCGC
CGTCAGATCA CCGAGGCGTT GGCCCAGGCC TACGTCGCCG GACATCGCCC CGAGACCGCC
GCACGGCAAC ACGGGCGGGG CCGACTGCTG GATCTGCCCA CCTACCCGTT TCAGCATCGG
GCCTACTGGT TTCCGCAAGG CCAGGCCCGG ACCGCACCGA CCGACCCGGT GCACACCGAG
ACCGTCCGGT TCCTGGACGA GGAACGCATC GATGAACTCG CAGCGCTGCT GGACGGCCCC
ACCGGCAGCG CGCAGACCGT CGACGTGCTG AGCAAGCTGG CCGCACGGCA CCGGCAGCAG
CGCGGTGCGC AGTCCATCGC CGATGCGCGC TACCAGATCC GGTGGGAGAA ATCCGAGGCT
CCGGGCGCGG CCCGGCCCAC CACCGAGGCG ACCACCTGGC TTCTGGTGAC CCACGATTCC
CGAGCGGCCC AGCCCATTGC CGAGGTGCTG GAGCGGCGGG GACATCCCTA CCACATCGTC
GGGTTACCGG GGTCGGACGC CGACGCGGCG CCGGCCGAGG AGGCGCTGCG CGCGACGGTG
AGCCAACCAC GCCCGCACAT CCTGTATCTC GCGGCCCTCG AGGAACCCGG TGGGTCGGCG
GCAGAGACGC TCGGATGGAT GCAGCATCGG GTCCTGGCCG CGGCCCGGCG GCTCTTCCAA
CTCGCCGCCA CCGCCGGGGC CCGCGCCCCC ATCTGGCTGA TCACCGCTGG GGCACAACGG
ATCACCGGCG ACGAGACCGT CGAACCGGCG CAGACTAGCC TCTGGGGTTT CGGTCGCGCA
GCCGCCCTCG AGCAACCCGA ACTCTGGGGT GGGCTGGCCG ACGTCACGAC GGGTGACGCC
GAGGAGTGGT CAGCCCTGAT CGACCATATC GTGGCGGCGC CGACCAGCGA AGACCAGATC
GCCGTCCGGG GCCGGGAGAT CCATGTCGCA AGGCTCACGC GGTGCGCCGA GAAGGCCAAC
GCCGCCGCGC TGGAATTGCG CAGGGAAGCA ACCTATCTGG TGACCGGTGG GTTGGGCGCA
CTCGGACTGG AGGTCGCCGA GTTTCTGGCC TCGCACGGCG CAGGGCACCT GGTGCTGACG
GGCCGCCGAT CGGCGAGTGA CACCGTACGG CAGCGCATCG ACGCCATGCG GGAACGTTTC
GACTGCCAGG TCCTGGTCGC CACGGCGGAC GTCGCCGACC GCGACGACGT CGCGCGCCTG
TTCAGCACCA TGCGTTCCGA GCTTCCACCG CTGGCCGGCA TCGTGCACGC CGCCGGCGAG
ATCGGGACGA GCCCGTTGCG CACCCTGGAC GATGCCGAGA TGGACCGGGT GTTCGCGGGA
AAGGTCTGGG GAGCAGTTTA TTTGTGTGAG GAAGTCGCAG ACATGGGGCT GGACTTCTTC
CTGGCCACGT CGTCGATCGC CTCGGTCTGG GGCAGCCACG GCCAGACCGC CTACGGCGCG
GCGAACGCCT TCCTCGACGG TCTGGCCGAC AGCCTGCGCG CCCGGGGCAT TCCCGGCATC
AGCGTGAACT TCGGTCCCTG GGCGGCCGGC ATGGCTGATG CGAAAGCACG CGAACAGCTG
AGTCGGCGCG GGGTCAAGAC GTTGGCGCCC GCCGACGCTC TGGCGGGGAT GGCCGACGTC
ATCGCGGCGT CGACGGCCCA CGCCGTGGTG GCCCGGATCG ACTGGGCCAC CTTCCTTCCG
GTCTACCAAC TGCAGCGACG ACGTGCGTTT CTCTCACAGC TGGAGCGTGA GGTGCCCGAC
GTTCCCGACG CCTCGGCACC GGCGCCGTCG GGTACCACCC GCTTCGTCGA GGAGCTCACG
CTCGCGCCCG TCGAGCAACG CCGGAAGCTC GTTCTGGAGT ACCTGCGCGC CGCGGTGGCC
GAGGTGACGC GCGTGGACGC TGCCGAAATC CGGGACGAGG CCGGCTTCTT CGACCTCGGC
ATGGATTCCC TGATGGCCAT CGAACTGCGC CGCCGCCTCG AGCAGGGACT CGGCAAGGAG
TTACCTGTGA CCCTGGCGAT GGATCATCCA CACCTGAGCG ACGCCGCGGA ATACGTGCTC
GGCGATGTCC TCGGCCTCTC GGAACACACC GTGGCCCAGC CCGACACGCC GTCGACGGTG
CGAACGGACG ACCCGATCGC GATCGTGGGA ATGGCGTGCC GCTTTCCGGG CGCAGACGAC
ACGGACGCGT TCTGGTCGGT GCTGGCCGAG GGTGCGGACC TGATCCGGGA GATCCCCGAC
GACCGCTTCG ACATCAACGA GTTCTACGAC CCCGATCCGG AGGCGGCGGG CAAGATCTAC
AGCCGCTACG GCGGATTCCT CGACGGGGTG GACGGATTCG ATCCGGAGTT CTTCGGGATC
TCCCCGCGCG AAGCCGTCTG GATCGATCCT CAGCAGCGGC TGGTCCTGGA GACCGCATGG
GAGGGACTCG AACGTGCCGG GTACTCCGCG GCGGCGCTGC GTGGCAGTCG AAGCGGGGTC
TATGTGGGGG TGGGGGCCAA CGAGTACTCG CACCTGCTGT CGGGTGGCTC CCTCGATGGC
ATCGAGGCGC AGTTCATCAC CGGCAACGCA CTGAACGTCA TCGCCGGGCG GGTGTCGTTC
ACGCTCGGGC TGGAGGGCCC GGCGGTCGCA GTCGACACCG CATGCAGTTC CTCGCTGGTC
GCCGTGCACC AGGCCTGCCA GGGCCTGCAG TCCGGCGACT GCGATCTGGC CCTGGCCGGC
GGGGTCAACG TGTTGCTCAG CCCGGCCACC ACCATCGCCA CCTCGCGGGC CCGGATGCTC
TCGCCCGACG GCCGCTGCAA GACCTTCGAT GCCGCCGCCG ACGGCTACGT GCGCGGCGAG
GGCTGCGGCA TCCTCGTGCT CAAACGGCTC AGCGACGCCG TGCGTGACGG CGACCGGATC
CAGGCGGTGA TCCGCGGCAG CGCGGTGAAC CAGGACGGCG CTTCCGGCGG GCTGACGGTG
CCCAACGGCG GTGCACAGCA ACGGGTCATC GCCGCAGCGC TGACCCGCGC AGGCCTCTCC
GGCAGCGATA TCGACTACCT TGAGGCGCAC GGCACCGGAA CGTCGCTCGG CGACCCGATC
GAGGTCCAGG CCGCCGGGGC GGTACTGGGC GCCGGCCGGG ACCCTGACCG GCCCTTGCTG
ATGGGGTCGG TGAAAACCAA TGTCGGACAC CTCGAGTCGG CGTCCGGCGT CGCCGGCCTG
ATCAAGGTGG TGCTGTCGCT GCAGCACGAG ATGCTGCCCA GACATCTGCA TTTCAAGACG
CCGTCGCCGC ACATTCCGTG GGACCGGCTT GCGGTGCGGG TGGTGGCCGA GCCGACCCCG
TGGCGGGCCG ACGGGCGGCC GAGGCGCGCC GGGGTGAGTT CCTTCGGATT CTCCGGGACC
AACGCGCACG TCGTCCTCGA GGAGCCGCCG GCGCAGTCGG CCGATCTTCG GCAACCCCCG
ACCGACGGGA AGGATGGCGC CGCCGGGCTC GAGTCGGCGA CGGCACCGGA ACCGGCCGGC
GTGCTGCCGA TCTCGGCGCG GTCGCCCGAG GCGCTCACCG CGTTGGCGCG CCGCTACGAA
TCCTGGCTGA CCGCGCATCC CGATGCCGAC ATCGCCGACG TGTGCTACAC CGCCGGCGCC
GGGCGGTCCC ACTTCGAGCA CCGCGCCGCG CTGGTCGTGG ACTCGGTGGA CGGAGCGCGT
GACCTGCTGG CGGGTCTGGC CGAAGACCGG CTGGGCCCGG GTGCGGTGCG CGGTGTGTGC
GGGGACCCGC CGAAGACGGC CTGGCTGTTC ACCGGGCAGG GTAGCCAGTA CCCGGGGATG
GCGCGCGAGC TGTTCGACAC CGAGCCCGTG TTCCGGGACA CGGTGACCCG CTGTGCCGAG
GCGATCGGCG ACACGCTACC GCGCCCGCTG CTGGAGGTGC TCTTCGACAC CGACGGCGAC
AACGGGCAGA CGTTGCGGCA CACCTCGTAT GCCCAGCCGG CGCTGTTCGC GATCGAGATG
GGCCTGGCGC GGCTGTGGCA GGCAAGGGGA ATCGAGCCCG ATGTGGTGCT CGGGCACAGC
GTCGGCCAGT ACGCCGCTGC CTGCGTGGCC GGGGTCTTCG GCCTCGAGGA CGGCGCGCGT
CTGGTCGCCG ACCGCGGCCG GCTGTTCGGT GGTCTGCCCG TGGGCGGCCG AATGGTGGCG
GTGTTCACCG ACGCCGAGGT CGTCGAGGAC TTCGCCGACG AGTTCCCCCA GGTGTCGGTG
GCCGCCTACA ACGGGCCGAA CACCGTGCTG GCCGGCCCGG CGTCCGACCT GGAACAGATC
GTCGCCGGCT GCAGCGGGGA GGGAATCCGC ACCACGTGGT TGGACACCAG TCACGCATTC
CACTCCGCGC TGCTCGACCC GGTGCTCGAC GAATTCGAGT CCTGCGCAGC ACGTCTCGCG
TACGCGGCAC CGACACGCCC GCTGGTCTGC AACCGCACCG GCGCGGTGGT CACCGGGCCC
GGCGTGATCG ATGCGCAGTA CTGGCGACGC CACGCCCGCC AGCCGGTGCA GTTCGCCGAA
AGTGTGCGGA CCCTTGCCGA CCTGGGTTGT TCGGTATTGA TGGAGATCGG GCCGCAACCG
TTCCTCATCG CGGCCGCCCT ACGCGTCTGG CCGGAGTCCG CGGCCACTCC GCGGGCAATC
CCCTCTCTGC GTAAGGGGCC CGATGCGCGG CGGCAGCTCG CCGAGGCGGT GGCCGCTGCC
TATATCGCCG GCCATCGGCT CGATTTCACC GGGCACGACG GGGGGCCGCA TCGCCGGTTG
CCGCTGCCGA CCTACCCATT CCAGCACCGC TCCTACTGGC CGAAGACGGC CGGCATCCGT
TCCGATGGCA GCAGCGGATC CGGACTGCTG GGCTCGGCGC AGGACCTCGC CTCAGGCGAC
ATCGTCCACA CCAGCCGACT GTCGGTGAAG ACGCAGCCAT GGCTGTCCGA CCACGTCATC
TACGGCACGG TGGTGGTTCC CGGCGCCACC TATGCGGCGA TGGCTCTGGC GGCGATGCCG
ACCCCCGCAC GGGTGCAGGA GGTGTTCTTC TACGAGCCGA TCATCCTCGG CGACAAGGAT
TCGCGAGAGG TGCAGCTGAT GCTGCACCGG ACCGACGATG CCGACGGGTG GACATTCGAA
GTGCACAGCC GCCCGTACGG CGATCGCGAC GCCGACTGGT CGCGCAACGC GTCGGGGACC
GTCGCCGCCG GTGTCGGAGA CGCCGCACCC GACGCTGCGG CGGACCCGGT CGACAACGCG
ATCGAGCGGT TGACCCGCAC GCGCCCGCAG CAGCTGTTCG ACGCCTTCGC CGACAACGAT
CTGGCCTGGG GACCCGCCTG GTGCAGCTCA CTGCGCTCGC TGTGGGTCGG CACCGGAGAA
GCGGTGGGCG ACATCGCAGT CGGCGCCGAA CTGGGTGAGC ACCTCGGCCG CGAGCCGATC
CATCCGGTGC TGCTGGATCT GTGCACCGGA ATCGCCGGCG CGGCACTGTT GGCGGCGGCC
GCACAGACCG ACGCCGATCC GGAACTGTTC CTGCCGTTGC GGTACGGGCG GGTGGAAGTG
CGTGAGCGCA TGCCCCGGCG GTTCTACTGC CGGGCCAGGT GGCAGACCGG CGGCATCGAC
AGCGAGACTC AGGTATTCGA CCTCGACTTC CTCGATCACG ATGGCCGAAA CCTCGGCGGT
ATCCGCGAAT TCACGGTGAA GCGCGCCCCC CGGGAGGCGC TGCTGCGCGG GCTCGGCGCC
GACGCGACCC GGCTGATCTA CCGTGTCGGC TGGCGCGAAA CCAGCCCGCC CGTCCGCGAC
GACGGCGCCC TCGGCCCGAG CAGCACCTGG CTGGTCGCCG GACTCGACGC GCTGGCCGAC
GATGTGCCCG GCAGCATCCG GATCGGCTTC GACCGGATCT GGGATCCCGA CCGCGACGGC
CGCGTGGTGC TCGACCCAGC CGACGACGAG CACTGGCGGC GACTCTTCGC GCTTGCCGAC
GAGCGCCGCG CACCGGTGGC GGGCGTCGTA CTCGGCGTCA CCGGGCGGGC ACGGCCGGAG
GAACCGTCGG CTGACTTCGC GGCCCGGCTG GAAGCCCAGA TCCGCGACGT GCTCGGGGCG
GTGCACTCGC TGTCGGCGCT GGGCGCGAAA CCGGCTGCCG GACTGTGGAT CGTCACCGAA
CGCGCCGTCG CCGCCGAAGC CGGCGAACCC GTCGATCCCG TGCAGTCGGC GCTGTGGGGG
CTGGGTCGCA CCCTCATCGC CGAAGAACCG GGCCTGCGCT GCCTGTTGGT CGACCTCGAT
GGATCCGAGG ACGCCGGCCC CGCACTGGCA CGGTTGCTCG GTGCCCCGGG TGACGAACCC
GAACTCGCAC TGCGGCAGGG CAGGCTGCTG GTGCCGCGGC TGTTGCCGTG GTCGCGCAGC
GGTCAACTCG CGATCCCGCG TGGCACCGAT TACGTGCTGG CGCCCACCGA GCGCGGTGTG
CTCGACAACC TGCGCGTGTC CGAGACCGAG GTGACGGCGC CGCCTGAGGG TCACGTCCAG
GTGCGGGTCG AGGCCGCCGG TCTCAACTTC CGCGATGTGC TCAACGCTCT GGGCCTGTAC
CCGGGGGATC CCGGGCCGAT CGGCGGCGAA CTCGCGGGCG TGGTCACGGC GCCGGGCCCC
GATGTCGACG GATTCGATGT CGGGCAGCGG GTTTTCGGCT TCGCGGTGGG CGCGTTCGCC
AGCCGGGTCA ACGTGCCGGT TCAGTTCCTG GCGCCGGTGC CGGAGGGCAT CTCCGCGGTC
GCCGCGGCGA CCACACCCGC TGCGGTGCTG ACCGCCGGAC TGGCGTTCGA CTGGGCGGCG
CTGCGCGCCG GTGACCGCGT GCTGATCCAC GCCGCCAGCG GCGGCGTGGG CCTGGCCGCC
GTCCAGCTGG CCCGGCAGCG TGGTGCGACC GTGTTCGCCA CCGCAAGCGG CTACAAGCGC
CCGATGCTAC GGGCGATGGG AGTCGACCAC GTCTATGACT CGCGCACAAC GGACTTCGCC
GACCAGATCC TCGCCGACAC CGACGGTGCG GGCGTCGACG TGGTGCTCAA CAGCCTGACC
AACGACGGGT TCGTCGCGGC GACCGTGCGG GCGACCGCCC GGAACGGCCG CTTCGCCGAG
ATCGCCAAGC GGGACATCTG GTCGCCCGAG CAGATGGCGG CGGCGCGGCC CGACATCGAC
TACGAGGTCC TGGCGCTGGA CGCCGTCATG CAGCAGGATC CCGACCGGAT CCAGCTCCTG
CTCGCCGACC TCGCCGACAG CCTGGCACGG GGCGAGACAG CGCCGTTGCC GTTCGAGGCC
TACCCGCTGG CCGAGGCCAC AACGGCGTTC CGCCGGATGC AACAGGCGCG GCACACCGGC
AAGATCATCC TGCAGATGCC GAAACCGCTG CAGCCGCACG GCGATCGGAG CTATCTGATC
ACCGGTGGGC TCGGCGCCCT CGGCCTGCAC GCCGCGGCCT ACCTGGCCCA ACTCGGCGCG
GGGGACATCG TGTTGACCGG CCGCCGGATG CCCGACGCCG CGACGGAGCG CGCGATCAGC
GAGATCGCGG AGCGCTACCA CTGCCGGATC CATACCTTCG CCGCCGACGT CGGCGACGAG
TCTGCGGTGC GGGATCTGCT GGAGCGGATC CGGACAGAGT TGCCGCCGCT CGCAGGTGTC
GCCCACCTGG CGGGTGTGCT CGATGATGCG CTGCTGCCGC AGCAGAGCCC GGAACGCTTC
CGGACGACAT TGGCGCCGAA GGCGTTCGGG GCCTGGCATC TGCATCGGCT CACCCGCGAC
ACCGAGCTCG AATTCTTCAT CGTGTACTCG TCGGGGTCGA GCGTGCTCGG TTCCCCCGGC
CAGGCCAACT ACGCCACCGC CAACGCGGTG CTCGACGGGC TGGTGGCGCA CCGCAAGGCG
CGCGGTCTTC CTGCGACCGG TGTCAACTGG GGTCCGTGGG CCCAGGGCGG GATGGCCACC
TCGGACGCGG CACGCGCCAA TCTCGGTGCG CAGGGCCTGA TTCCGCTGGA GCCGACTGCC
GCCTTGAACG CCCTCGGCGA GATCGTCGCA CACGGTATCG CTTCGGCGGT CGTGCTCAAG
GCCAACTGGC AGCGCGCCGC CAAGCTGCTG GGCGCCACCC GTCCACCGAT TCTCGACCAC
GTGCTGCCGA GCGCGGTCGC GGCCGGTCCC GGCGACAGCG CCCTGCTCAA ACAGCTGCAG
GACGTGCCCG AGGCGCAGCG CGGCAGCTTC GTCACCGAAC ATCTGCAGCG GGAACTGCAG
CAGATCCTGG GCCTGGCGCA GCCGCCGGCG GCCACCGGCC GGTTCCTGGA ACTGGGCATG
GACTCGCTGA TGGCGGTCGA GCTCCGAAAC CGGTTGGTGG GACAGTTCGG GAGCGGGTTC
ACCATCACCG CCACGGCGGT GTTCGACTAC CCGACCATCG GGGCGCTCGC CGAGTATCTG
GCCGCGCAGA CGCCCGAATC GACCGAGGAG GGAACCGAGA TGAAGACCGC AGACCCGGAT
GGGCAGCCGG CGACCGTTGG GCACGCGGAT GTCGTTCTCC AGACATCGTG A
 
Protein sequence
MTDLQKRALQ AIEVLRARVR ELEGAGDGYA IVGYALRFPG AADADEFWDV LRGGRDAISE 
VPQDRWDADE FFDADPEAAG KMVTRRAGFV EDVAGFDAPF FGVSAREAMF MDPQHRLLLE
TAWRAVEHSG TAPSALAGTR TGVFMGLSTH DYLGMLTDNL THDAIEAYLG TGTSPAAAIG
RISYRLGLQG PAVAVDTACS SSLVAVHQAC QALRLAECDV ALAGGVNVLL SPATMINFSR
ARMLAPDGRC KTFDATADGY VRGEGCGVVV VKRLADAIRH GDRIRAVIRG SAVNQDGASG
GLTVPNGAAQ QRVIADALTN AGLTAADVDY LEAHGTGTPL GDPIEVQAAG AVLGVGREPD
RPLLIGSVKT NIGHLEAASG IAGLLKVILS LEHQVLPKHL NFRSPSPHIP WNRLPVRIVD
DPVPWTRSGR SRVAGISSFG FSGTNAHVLV EEAPRADRAG AAPQPAAPGG FGMLPLSART
PDALAATAHR LRTWLAANPD ASWPDLCFTA AVGRSHLRHR AALVVDSRSS VEELLGALAE
DRPGPGLARG ECGDPPKTAW LFTGQGSQYP GMGRELFDTE PVYREVMTRC AEAVADVLER
PLLDVVFDAD GGEALRHTSY AQPALYAVEM GLARLWQSWG IEPDVVLGHS VGQYAAACVA
GVMGLEDGAV LLAERGRLFG SLPEGGRMAA VFAPADRVES CTVDSPLLSV AAYNGDNTVL
SGPAGDLERA VAGFSAAGVR CDWLDTSHAF HSALLDPVLE EFESCADRFE FGTPQRTLVC
NRSGEVLSRH ARLDGQYWRR HARQPVRFAE SVRTLADLGC RLLMEIGPQP VLTAAAMRTW
PGTVTAPRVA ASMRRKGSDR RQITEALAQA YVAGHRPETA ARQHGRGRLL DLPTYPFQHR
AYWFPQGQAR TAPTDPVHTE TVRFLDEERI DELAALLDGP TGSAQTVDVL SKLAARHRQQ
RGAQSIADAR YQIRWEKSEA PGAARPTTEA TTWLLVTHDS RAAQPIAEVL ERRGHPYHIV
GLPGSDADAA PAEEALRATV SQPRPHILYL AALEEPGGSA AETLGWMQHR VLAAARRLFQ
LAATAGARAP IWLITAGAQR ITGDETVEPA QTSLWGFGRA AALEQPELWG GLADVTTGDA
EEWSALIDHI VAAPTSEDQI AVRGREIHVA RLTRCAEKAN AAALELRREA TYLVTGGLGA
LGLEVAEFLA SHGAGHLVLT GRRSASDTVR QRIDAMRERF DCQVLVATAD VADRDDVARL
FSTMRSELPP LAGIVHAAGE IGTSPLRTLD DAEMDRVFAG KVWGAVYLCE EVADMGLDFF
LATSSIASVW GSHGQTAYGA ANAFLDGLAD SLRARGIPGI SVNFGPWAAG MADAKAREQL
SRRGVKTLAP ADALAGMADV IAASTAHAVV ARIDWATFLP VYQLQRRRAF LSQLEREVPD
VPDASAPAPS GTTRFVEELT LAPVEQRRKL VLEYLRAAVA EVTRVDAAEI RDEAGFFDLG
MDSLMAIELR RRLEQGLGKE LPVTLAMDHP HLSDAAEYVL GDVLGLSEHT VAQPDTPSTV
RTDDPIAIVG MACRFPGADD TDAFWSVLAE GADLIREIPD DRFDINEFYD PDPEAAGKIY
SRYGGFLDGV DGFDPEFFGI SPREAVWIDP QQRLVLETAW EGLERAGYSA AALRGSRSGV
YVGVGANEYS HLLSGGSLDG IEAQFITGNA LNVIAGRVSF TLGLEGPAVA VDTACSSSLV
AVHQACQGLQ SGDCDLALAG GVNVLLSPAT TIATSRARML SPDGRCKTFD AAADGYVRGE
GCGILVLKRL SDAVRDGDRI QAVIRGSAVN QDGASGGLTV PNGGAQQRVI AAALTRAGLS
GSDIDYLEAH GTGTSLGDPI EVQAAGAVLG AGRDPDRPLL MGSVKTNVGH LESASGVAGL
IKVVLSLQHE MLPRHLHFKT PSPHIPWDRL AVRVVAEPTP WRADGRPRRA GVSSFGFSGT
NAHVVLEEPP AQSADLRQPP TDGKDGAAGL ESATAPEPAG VLPISARSPE ALTALARRYE
SWLTAHPDAD IADVCYTAGA GRSHFEHRAA LVVDSVDGAR DLLAGLAEDR LGPGAVRGVC
GDPPKTAWLF TGQGSQYPGM ARELFDTEPV FRDTVTRCAE AIGDTLPRPL LEVLFDTDGD
NGQTLRHTSY AQPALFAIEM GLARLWQARG IEPDVVLGHS VGQYAAACVA GVFGLEDGAR
LVADRGRLFG GLPVGGRMVA VFTDAEVVED FADEFPQVSV AAYNGPNTVL AGPASDLEQI
VAGCSGEGIR TTWLDTSHAF HSALLDPVLD EFESCAARLA YAAPTRPLVC NRTGAVVTGP
GVIDAQYWRR HARQPVQFAE SVRTLADLGC SVLMEIGPQP FLIAAALRVW PESAATPRAI
PSLRKGPDAR RQLAEAVAAA YIAGHRLDFT GHDGGPHRRL PLPTYPFQHR SYWPKTAGIR
SDGSSGSGLL GSAQDLASGD IVHTSRLSVK TQPWLSDHVI YGTVVVPGAT YAAMALAAMP
TPARVQEVFF YEPIILGDKD SREVQLMLHR TDDADGWTFE VHSRPYGDRD ADWSRNASGT
VAAGVGDAAP DAAADPVDNA IERLTRTRPQ QLFDAFADND LAWGPAWCSS LRSLWVGTGE
AVGDIAVGAE LGEHLGREPI HPVLLDLCTG IAGAALLAAA AQTDADPELF LPLRYGRVEV
RERMPRRFYC RARWQTGGID SETQVFDLDF LDHDGRNLGG IREFTVKRAP REALLRGLGA
DATRLIYRVG WRETSPPVRD DGALGPSSTW LVAGLDALAD DVPGSIRIGF DRIWDPDRDG
RVVLDPADDE HWRRLFALAD ERRAPVAGVV LGVTGRARPE EPSADFAARL EAQIRDVLGA
VHSLSALGAK PAAGLWIVTE RAVAAEAGEP VDPVQSALWG LGRTLIAEEP GLRCLLVDLD
GSEDAGPALA RLLGAPGDEP ELALRQGRLL VPRLLPWSRS GQLAIPRGTD YVLAPTERGV
LDNLRVSETE VTAPPEGHVQ VRVEAAGLNF RDVLNALGLY PGDPGPIGGE LAGVVTAPGP
DVDGFDVGQR VFGFAVGAFA SRVNVPVQFL APVPEGISAV AAATTPAAVL TAGLAFDWAA
LRAGDRVLIH AASGGVGLAA VQLARQRGAT VFATASGYKR PMLRAMGVDH VYDSRTTDFA
DQILADTDGA GVDVVLNSLT NDGFVAATVR ATARNGRFAE IAKRDIWSPE QMAAARPDID
YEVLALDAVM QQDPDRIQLL LADLADSLAR GETAPLPFEA YPLAEATTAF RRMQQARHTG
KIILQMPKPL QPHGDRSYLI TGGLGALGLH AAAYLAQLGA GDIVLTGRRM PDAATERAIS
EIAERYHCRI HTFAADVGDE SAVRDLLERI RTELPPLAGV AHLAGVLDDA LLPQQSPERF
RTTLAPKAFG AWHLHRLTRD TELEFFIVYS SGSSVLGSPG QANYATANAV LDGLVAHRKA
RGLPATGVNW GPWAQGGMAT SDAARANLGA QGLIPLEPTA ALNALGEIVA HGIASAVVLK
ANWQRAAKLL GATRPPILDH VLPSAVAAGP GDSALLKQLQ DVPEAQRGSF VTEHLQRELQ
QILGLAQPPA ATGRFLELGM DSLMAVELRN RLVGQFGSGF TITATAVFDY PTIGALAEYL
AAQTPESTEE GTEMKTADPD GQPATVGHAD VVLQTS