Gene Hoch_2969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2969 
Symbol 
ID8545357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4086184 
End bp4096467 
Gene Length10284 bp 
Protein Length3427 aa 
Translation table11 
GC content77% 
IMG OID646387646 
ProductAcyl transferase 
Protein accessionYP_003267374 
Protein GI262196165 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00814021 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGATT CAGCGTCCGA CTCCCAACGG CGCGCGCTCC TCGAACGCGC GACCGTCACC 
ATCAAGAAAC TGCGGGCCGA GAACGCCCGC CTGCGCGCCG CCCGCAGCGA GCCCATCGCC
ATCGTCGGCA TGGCCTGCCG CTTCCCGGGC GGGGCCAACG ATCCGGACGG GTACTGGCGG
CTGCTCGCCG AGGGCGTGGA CGCGGTGCGC GAGATTCCGC CCACGCGCTG GCCGGCCGAG
GCGCTCGACC TCGACGCGCT GCCGGCGCTG CGCTGGGCCG GCCTGCTCGA CGACGAGCTC
GCCGGCTTCG ACGCCGAGTT TTTTGGCATC TCGCCGCGCG AGGCGGCGCA GCTCGACCCC
CAGCAGCGGC TGCTGCTCGA GGTGAGCTGG GAGGCGCTGG AGAACGCGCT GCAGCCGGCC
GAGCGGCTGA CCCAGCAGCC CGTGGGCGTG TTCGTCGGCA TCGCGTCGGC CGACTACCAG
CACCGCATCC TGGCGCTCGC TCCGGAGCAG CAGAACGGCT ACTCGGCCAC CGGCAACATG
CCCAGCGTGG CCGCCGGCCG CGTGGCCCAC ACCCTGGGCC TGCAGGGGCC CTGCGCCGCG
GTCGACACCG CGTGCTCGTC GTCACTGGTG GCCCTGCACA TGGCCTGTCA GAGCCTGCGC
GCGCGCGAGT GCGACCTGGC CCTGAGCGGC GGCGTCAACC TGCTGCTGTC GCCGACCTGG
ATGCGCCTGG TCGGGCTCAC CCAGTCGCTG TCTCCGGACG GCCGCTGCCG CACCTTCGAC
GCCCGCGCCA ACGGCTTCGT GCGCGGCGAG GGCTGCGGCG TGGTGGCGCT CAAGCGCCTG
TCCGACGCGC AGCGCGACGG CGACCGGGTG TGGGCGCTGC TGCGCGGCTC GGCTATCAAC
CACGACGGCC GCTCGAGCGG GCTCACCGTG CCCAACGTGC GCGCGCAGGA GGCCACGCTG
ACGCGGGCGC TGGCCAGCGC CGAGGTGGCG GCCGAGGACA TCGACTACGT CGAGGCCCAC
GGCACCGGCA CGCCGCTCGG CGACCCCATC GAGATCGAGG CGCTCAAGGC CGCGCTGGGC
GGCGAGCGCG GCGACGGCAG CCGCTGCGTG CTCGGCTCGG TCAAGACCAA CATCGGCCAC
CTCGAGGCCG CCGCCGGCAT CGCCGGGCTC ATCAAGGTGG TGCTGGCGAT GGGCCGCGAG
ACCGTGCCCG CGCATCTGCA CCTGCGGCAG ATCAACCCGC GCATCTCGCT CGCGCACAGC
GCCCTGCACA TCGCCGCCGA GGCCAGCCCG TGGCCGGCCG GCGAGCGGCC GCGGCGCGCG
GTGGTCAGCT CGTTCGGCAT CAGCGGCACC AACGCCGGCG TGGTGGTCGA GGAGGCGCCG
CCTGCGCCGC GGCCGGCCAC GCCCGCGCGC GCGCCGGCCG CCCTGCTGCT GCCGCTGTCG
GCGCGCGCGC CCGAGGCCCT GCGCGCGCTG GCCCTGGCCC ACGCGCAGCG CCTGGAGGCG
GACGCGGACG CGGGCCCGGG CGCGCTGGCG CGCCACGTCG CGCTCACGGG CACGCGGCGC
AGCCCGCTGC CGCTGCGCCA GGGCTTCGTC GGCGGTGACC GCGGCGAGCT GATCGCCGGC
CTGCGCGCCT TCGCCGGCCA GGACGAGCTG CGCCTGCGCG AGGTCGGCGA CCCGCCCCGG
GTGGCCATGA TCTTCTCCGG GCAGGGCTCG CAGTGGCTGG GCATGGGCGT CGAGCTGTAC
GCGCGCGAGC CGGTGTTCCG CGCCGCCGTC GATGCGTTCG ACGCCGCCAC CCGCGAGGTC
GCCGGCTGGT CGGTGCGCGA CGAGCTGTTC GCCGAGCCCG CGCGCGCCCG GCTCGATCGC
GTCGAGGTCA TCCAGCCGTG CATCGTGGCC GTGCAGCTCG CGCTGGCCGC GCTGTGGCGC
TCGTGGGGCG TGGAGCCGAG CGTGGTCGTC GGCCAGAGCA TGGGCGAGGT GAGCGCCGCG
TGCGTGGCCG GCGCCCTCGA CCTGGCCGAC GCCGCGCGCG TGATCCTCAC CCGCAGCCGC
CTGGTCAAGC AGCTCCGCGG CGGCGCCATG GCCAGCGTCG AGCTGCCCGC GGCCGAGCTC
GCGGACGCGC TCGGCGAGGG CCTGGGCGTG GCCGCCATCA ACGGCCCGCG CTCGAGCGTG
GTCGCCGGGG ACAGCGACGC CGTCGACCGC TTCGTGGCCG AGATGAATCA GCGCGGCGTG
TTCTGCCGCC GGGTCAAGGT CGACTACGCG TCGCACAGCC CCGAGGTCGA GCCCCTGCGC
CAGGCGCTGC TCGACGAGCT GGCGCCGGTG CGCGGCCGGG CCCCGGCGCT GGCCTTCCGC
TCGACCGTCC ACGGCGGCTG GGTCGGCGAC GGCGAGCTCG ACGCCGCGTA CTGGTACCAG
AACCTGCGCC AGCCCGTGCA GCTCTTCCCG GTGCTCGAAC GCCTGCTGGG CGAGGACGGC
GTCGACGTCC TGCTCGAGGT GAGCCCGCAC CCGGTGCTGG GCCCGGTGCT CCAGGCCGCC
GCCGAGCACG CCGGCTGCGA CGCGGCCGTG CTGGCCTCGC TGCGCCGCGA GCAGGCCGAG
CGCCAGACCC TGCTGCTCAC CCTGGCCGGG CTCTACGGCC GCGGCCAGGC CGTGGACTTC
GCGCGCGTGA ACGCGGCCGC CGACGACGCC GACGACGCCG ACGACGCCGA CGCGAACCCC
GACGCCGCGG ACCCCGTCGC CCGCTGGACG CCGCTGCCCA CCTATCCCTG GCAGCGGCGC
CGGCACTGGG TGAACGACGA GGGCGGCCCG CGCCCGCAGG CGGCGGCCGC GCTGCCGGGC
GAGGCGCTGC CGCCGGGGCG CCGGCTGCGC TCGCCGGCGC TGCGCGACGC GGTCTACGAG
CTGGTGCTCG GCGCCGACTC GCTGCGCTGC TTCGACAGCC ACCGGGTGCC CGGCGGCGTC
ATCGCGCCGG CCTCGTGGAT GCTGTCGATG GTGCTGGCCG CGCTGCGCGA TCTCGGACAT
CCGGACGAGA TCGCGCTGCA CCAGCTCAGC TTCGCCCGGC CGCTGGCCAT CCCCGAGGGG
CAGCGCCGGC GGGTGCAGCT CGTGCTGTCG CCCGACGGCC AGCGGCCGGC CCGCTACCAG
ATGCTCGCGG TCGACGCCGA CGCCGACGCC GATGCGCTCG AGGCCTCGGC GTGGACCCTG
CTGTCCGAGG GCGCGATCGC GCTGTCCGAA GACGCAGCGC CCGCGCCGCT CGACGTCGCC
GCCACGAGCG CGCGGCTCGA GCCGGTGGCC GAGGACGCGG TCGCCGGCCT GGTCGGCGAA
CCCGGCCCCA CGCGCTGGGT GGAGGCCGTG CTGCGCGGCC CGCGCGAGGT GCTGTGCCGG
CTGCGCGGGC CGCGCGGCGG CGACCACGGC GACCGCTACC CCGTGCACCC CGAGCCGCTC
AACGAGGCGC TGAGCGCGGC GGTGGCGTGC GCCGGCCTGG GCGGCGGCGG GTTCGCGCCG
GTGGCCGCGC AGGGCCTGCG CTTCACGGGC GGTGAGGCCG GCCCGCCGGC GTGGATCCAC
GGCAGCGTCG AGCAGGTCGA GAGCGGCGGC CGCGCGGCCC TCTCGGCGAC CCTGGCGCTC
TACGACCAGG CCGGGCGCCC GGTCGCCAAG CTGGCGCGCC TGCGCTGCGT GCCGGCCGCG
CTCGAGAGCA GCCTGCAGGC CGAGACCGGG CTGCTCGCGC GCTCCCGCTA CGCGCTCGCC
TGGGAGCCGC TGGCGCCCCC CAGCGCGCCG CTGGCGCCCG GGCGCTGGCT GCTCGTGGCC
GACGTCGGCG GGGTGTGCGA GCTGCTCGCC GCCCGCCTCG AGGCCGAGGG ACACGTCTGC
GTCCGCCTGC CGGCGCCCGC CGCCGACGAC GCCAACCCGG CCGACAGCGC CGATGACGAC
GCGCTCACCG CGGCGTTCGC CGACGCCCTC GCCGACGCCG TCGACAGCGC GGACGGCGAC
GCGCTGCCGC TGCGCGGCCT GATCTTCGGC CCCGGCCTCG ACGCCGACGC CGACGCCGCC
GCCGCCGCCG ACGACGACGA CGACGCCGCG GGCGACGCGC TCGCCCGCTT CGCGGCCACG
CGCGCGCTCC ACACGCTCGC CCGAGCCCTC GCCGGCCGCG CGCTGGCGCC GGTGTGGATC
GCCACCCGCG GGGCCGTGGC CGCGCGCCCG GACGAGACCT CGACGGCGCC GGCCGCGGCC
GCGCTGTGGG GCCTGGGCCG GGTGCTCGGC AGCGAGCACC CCGAGCTGAG CCCGCGCCTG
CTCGACCTCG ACGCCGCCGG CTCGGCGCGG ACCTGCGCCG ACCAGCTCCG GCGCGCGCTG
ACCCTGGCGC TGGGCGGGGA AGACCAGCTC GCGCTGCGCG GCCAGCAGGT GCTCGGCCTG
CGCCTGCGCC GCGTCCGCGC GCGCGACCAG GGCGGCTCGC TGGCGCTGTC GACCGAGGGC
GCGTACCTGA TCACGGGCGG CCTCGGCCGC CTGGGCCTGA GCGTGGCCGA GTGGCTGGTG
GCGCGCGGCG CCCGGCACCT GGTGCTGCTC GCCCGCTCGC TGCCCTCGGC CGCGGCCGAG
GCGCGCATCG CGGCGCTCGA GGCGCAGGGT GCCGAGGTGC TCGCGCTCCA GGCCGACGTG
GCCGACGCGG CCGCCCTGGG CCGGGCGCTG GCCGCCGCCG ACAGCGCCAT GCCCGCGCTG
CGCGGAGTCA TCCACGCCGC CGGGCAGGCG CGGCAGGCGC TGCTGGTCGA CGAGCCGTGG
CGCGACTACG CCCAGGTGCT CGGCGCCAAG GCCGCGGGCG CCTGGAACCT GCACCAGCTC
ACGCGCGAGC GCGCGCTCGA CTTCTTCGTC TGCTTCTCGT CGATCGCGGG CACGCTCGGC
TTCGGCGGCA TGGGCAGCTA CGCGGCCGCC AACGCCTACC TCGACGCCTT CGCCGAGTAC
CGGCGCGGGC GCGGGCTGCC GGCGCTCAGC GTCGCCTGGG GGGTGTGGGA CAGCGATCTC
GACGCGCAGT ACGGCGAACG CGCCCTGCGC GTGGGCCTGG CGCCCTTTGC CGGCGCCGAC
GCCCTGGCCG CGCTCGACAC CCTGGCCGCG GGCGAGGCCG CGCACGCGAT CGTCGCCAAC
ATGGACTGGG CGCGCTACCT CAAGGCCCGC GTCGGCGCCG CGCCGCCGTG GCTGCGCGAG
CTGGCCGCCG TCGGCGACCG GACGCCCGAG GGCTCGGGCG AGGGCGACGC CGCCCTGCTC
GGGCGCCTGC GCGCGCTGCC CGAGCAGGCC GCGGCCGAGC ACATCGCCGA CCACGTCGCC
GGCGCCGTCG CCGAGACCCT GGGCTACCCG CGTCACCACG CGCTGCCGCG CGGCAAGGGC
TTCTTCGACA TCGGCTTCGA CTCGCTGCTG GCCATGGACC TGCGCCGGCG GCTGTCGCGC
GACTTCGCGC ACCCGTTCCC GGTCACCGTG GCCTTCGATC ACCCCACCAT CGAGCGCCTG
GCCGCGTACC TGGCCGCGCA CTGGCAGGAC CACGGCGCGC CCGCGGCGCC GAGCGCGGAC
CAGCCATCGA CAGAGCCATC GACAGAGACA TCGACAGAGA CATCGACAGA GCGCTCGGGC
GCGTCATTGA GCGCACCGGC CGCGGCCGAG GTCGCCGCCG CGGGCGCGCC CGAGCCCATC
GCCCTGGTCG GCATCGGCTG CCGCTTCCCC GGCGGCGTCG TCGGCCCCGA GAGCTACTGG
GAATTGCTGG CCGCCGGCCG CGACGCCACC TCGGAGGCGC CGCGCGGCCG CTGGAACGAC
GAGTCGCTGT TCGACCCCGA CCCGGGCGCG CCCGGCAAGT TCCACGTGCG CCGGGCCGGC
TTCCTCGACG ACATCGAGTC GTTCGATCCC GAGTTCTTCG GCATCTCGCC GCGCGAGGCG
GCGCGCATGG ACCCGCAGCA GCGGCTGCTG CTCGAGGTCA CCTGGGAGGC GCTCGAGCAC
GCGGGCGTGG CCGCCGACGC GCTGGTCGAC TCGAGCACCG GCGTGTTCGT GAGCGGCGCG
CCCAACCAGT ACCTGGAACG CTTCGGCGAC GACCCGATCG AGCTCGACGC CTACGCGCTC
ACCGGCAACC TGCCGTGCAC GCTGTCGGGG CGGGTGTCGT ATGTGCTCGG GCTGCGCGGG
CCCAACCTGT TCCTCGACAC CGGCTGCTCG GGCGCGCTGG TGGCCCTGCA CCTGGCCTGC
CAGAGCCTGC GCGCGGGCGA GTGCGACCTG GCCCTGGTGG CCGGCGTCAA CGTGCTGCTC
TCGGCCGACA TGATGATCGG CCTGAGCAAG ACCGGAGCGC TGTCGCCGGA CGGCCGCTGC
AAGACCTTCG ACGCCGCCGC CAACGGCTTC GGCCGCGGCG AGGGCTGCGG CGTGCTGGTC
GCCAAGCGGC TGCGCGACGC CCGCGCCGAC GGCGATCGCG TCATCGCCGT GGTGCGCGGC
TCGGCGGTCA ACCACGACGG CCGCAGCGGC GGGCTCACGG TGCCCAGCGG CACCGCGCAG
CGCGCCTTGA TGGAGCGCGC CCTGCGCCAG GCCCAGCTCC CGGCCGCCCA GGTCGGCTTC
GTCGAGGCCC ACGGCACCGG CACGCAGCTC GGCGATCCGA TCGAGATCGG CGCGCTGGCC
GCGGTCTACG GCCGCGCCTC GGGGCGCACG GCGCCGTGCT TTCTGGGCGC GGTCAAGAGC
AACCTCGGCC ACCTCGAGGC CGCCGCCGGC GCGGCCGGCG TCATCAAGGC GGCGCTCGCG
CTCGAGCGCG GCGAGATCCC GCCCAACGTG CACCTCGCCG AGCGCAACCC CGACCTGCCC
CTGGCCGACG AGCCCTTCGA GCTGCCGGCG CGCGTCCACC CGTGGCCGAG CGCGAGCCAG
CGCCTGGCCG CGGTGAGCTC GTTCGGCCTC GGCGGCACCA ACGCCCACGC CATCCTCGAG
CGGCTGCCGA CGCCCCCCGA GACCGACACC GGCGCCGACG CGCCCGCGCG CCCGGTGCAC
CTCCTGGCGC TGTCGGCGCG CCACCCCGAG GCCCTCGCCG AGCAGGCGCG GCGCCTGGCC
GAGCACCTCG CCCGCCACCC CGGGCAGCGG CCCGAGGACG TGGCCTTCTC GCTCAACTGC
GGCCGCGCCC ACCTGCCGCA CCGCGCCGCC GTGCGCTTCA CCGGCGGCGA CGATCTGCGC
GAGCGGCTGG GCGCGCTCGC CGCCGACCCC GAGGGCGACG ACGCCATCCG CGGGCTGGTC
ACCGACACCC AGCCGCTGCG GGTCGGCTTC TTGTTCACCG GCCAGGGCTC GCAGTACGCC
GGCATGAGCC GCGCGCTGTA CGCCAGCCAG CCGGTGTTCC GCGAGGCCTT TGACGCCTGC
GCCGAGTTCC TCGAGCGCGA CGCCGAGCGG CCGCTGGCCG CCGTGCTGGC CGACGCCGAG
ACCATCGACC GCACCGGCAA CGCCCAGCCG GCGATCTTCG CCGTGCAGTA CGCGCTCACC
CGGCTGTGGC GCTCGTGGGG GGTCGCGCCC TACGCGGTTT TCGGACACAG CGTCGGAGAA
GTCGCGGCCG CGTGCGCGGC CGGCGCGCTC ACGCTCGAGG ACGCGCTCTT GTTGATCCGC
GAGCGCGCGC GCTGGATGGA GACGGTCCCG GACGGCGGCG TCATGGTCAG CGTGCGGGCG
CCGGCCGAGG TGGTCGCCGA GGCCATCGCG CCGCGCGCCC ACGAGGTCGC GATCGCGGCC
CTCAACGGCC CCGAGAACAC CGTGATCTCG GGCGCCGGCG CGGCCGTGCG GGCGCTCGCG
GAGGAGCTGC GCGGGCGCGG CCTCGAGGCC AAGGAGCTGC GCGTCTCGGT GGCCTTCCAC
TCGCCCGCCC TCGACCCGAT CCTCGAGCCC TTCGAGCGCG CCACCGCCGA GGTGCTCACC
CGGCCGCCGC GGCTGCCGTG GATCGGCGGC CTCACCGGGG CCGCGCTGCG CGGCGACGAG
GTCGACTACT GGCGCCGGCA GATGCGCGAG CCGGTGCAGT TCACGGCCGC CATCGGCGCC
CTGGCCGAGC TCGGCTGCGA TGTGCTGCTC GAGGTCGGCC CGCACCCGAC GCTCACCGGG
CTCGCGGCCG AGAGCCTGCC GCCCGAGCTG GCCTGCCTGC CCTCGCTGCG CCGCGGCCAG
GACGACGACG CCGTGATCGC CGACAGCCTG GGCCGGCTGT ACGCGGCCGG CGCGCCCGTG
GACTGGCGCT CCTGGGACCG GCCGTTCGCG CGCCGGCGCC TGCCGCTGCC CACCTACCCG
TTCCAGCGCC GCCGGCTGTG GTTCGACGCG CCCCCGCGCC AGCACCACAC CAACCCGGTC
ACCGCCTACG AGCGCGAGCA CGAGACCGCC TGGTACTCGC ACTGCGAGTG GCGCGAGCAG
GCGCAGGCGC CGGCCGCCAT CGGCCGCGGC CACTGGGTGC TTTTGGCCGA CCGCGGCGGC
GTCGCGGCCG CCCTGGCCGC CGAGCTCGAG GCCCGCGGCC ATAGCTGCAG CCTGCTGCGG
CCCGGCGACC TCGAGGCCCG CGACCGCGAG GCCTCGGCGG ACGCGGCCGA GCCGCACTGG
ACCGCCACCG CGATGGCGCG CGCGCTCGAC GCCGTGTGCC CGCACGGCCG GCCGCTGCGC
GGCGTGGTCC ACCTGTGGAG TCTCGACCTG CCGGCCACCG CCGCGCTCGC CGACGCCGAC
CTCGAGCACG CCGCCTCGCT CACCCTGGGC AGCGCGCTGG CGCTGGTCCA GGCGCTGGCC
GGGCGCGCGA GCGCGGGCGG CGGTCCGCGG CTGTGGCCGG TGACCCGCGG CGCGGTGTGC
ACCGGCGCGG ACGGCGCCGC GCTGGCGGTG GCGCAGGCGC CGCTGTGGGG CCTGGGCGCG
GTCATCGCCA ACGAGCACCC CGAGCTGTGG GGCGGCGCGC TCGACGCCGA CCCCGCCGAC
CAGCCAGCGG CCGCGCTGGC GGCCGCGCTG GCCGGCGAGC TTCTGGCCGG GCCCGCGGCC
GAGCGCGTGG CCTGGCGCGA GGGCCGGCGC CTGGTGGCCC GGCTGGTGCC CTACCTGCCG
GAGCGCACCG CGCCCCTGCC GGTGCACGCC GAGGGCTGCT ATCTGGTCAC CGGCGGCCAC
GGCGCGCTCG GCCTGGCGGT CGCCGGCTGG CTGGTGCAGC GCGGCGCCAA GCACCTGGTG
CTGATGAGCC GCAGCGGCCC GGACGAGGAC GCGCAGGCGA CCATCGACGC GCTCGGGGCC
GAGGGCGCCG AGGTCATCGA CGTGTGCGCC GACATCGGCG ACCCCGCCCA GGTGCGCGCC
CTGCTGCGCG ACATCGAGGC CCGCGGCGTG CCCCTGCGCG GCGTGGTCCA CGCCGCCGGC
GTGCTCGAGG ACGGGCTGCT GGTCAATCAG TCGTGGGAGG CCTTCGAGCG GGTGCTGCGG
CCCAAGCTGC GCGGCGCCTG GCACCTGCAC CGCAACACCC GCGGGCTCGA CTTCTTCGTG
CACTTCTCGT CGGCCTCGGC GCTGCTCGGC CCCCACGGCC AGGGCAGCTA CGCGGCCGCC
AACGCCTTCC TCGACGCCCT CGCCCACCGC GAGCGCGCCC ACGGGGTGCC GGCGCTCAGC
GTCAACTGGG GGCCGTGGGC GGCCGGCATG GCCGCGCGCC TCGACGCCGA GACCAGCCGG
CGCACCCTGG GCGCGGGCTG GACGCCGCTG GCGGTGGCCG ACGGCTGGCG GGTGCTCGAC
CGCGTGGTCG GCAGCGACGA GGTCCAGGTG GCCGTGCTGC CGGCCAACTG GGCCACGCTC
GCCAGCGAGG GCGCACTCTC GCCGCTCATC GGCGAGCTGG CCGGCGCCGC CGCGCAGGCG
CCCGCGGCCG CGCGCCGCGA CGCCGGCCGC GCGCTCGCCA CCCTGCGGGC CACGGCCCCG
GGCGAGCGCC GGCGGGTGCT CGAGGCCACC GTGCGGCGCG TGGTCGAGCG CACCCTGAGC
TGGAGCGCGG ACGCCGAGCT GGGGCGCAAG CAGCGCTTCG TCGAGGTCGG CCTCGATTCG
CTCATGGCCA TCGAGGTGCG CAACCGGCTG CAGCGCGAGC TGGACCTCAC GCTCGCGGCC
ACCACGCTGT TCAACTACCC CACCGTCGGC GAGCTCAGCG AGCACCTGAG CGAGCTGCTC
ACCACCCACC GGCTGCTCGA CGACGACGCC GGGCCCGCCC AGGACGAAAC CGACGAACCG
TCCGAAATCG CCCACACGTC CGAAATTTCC TCTACGCCCG CGCCGCCCAT CCCCGAAGCC
AGCGCCGACG GCGCGTCCGA CGACGAGCTG TCCGAAGACG AGCTGGTCGC GCTCATCGCC
GCCAAATACG ACTCGCGCAC CTGA
 
Protein sequence
MTDSASDSQR RALLERATVT IKKLRAENAR LRAARSEPIA IVGMACRFPG GANDPDGYWR 
LLAEGVDAVR EIPPTRWPAE ALDLDALPAL RWAGLLDDEL AGFDAEFFGI SPREAAQLDP
QQRLLLEVSW EALENALQPA ERLTQQPVGV FVGIASADYQ HRILALAPEQ QNGYSATGNM
PSVAAGRVAH TLGLQGPCAA VDTACSSSLV ALHMACQSLR ARECDLALSG GVNLLLSPTW
MRLVGLTQSL SPDGRCRTFD ARANGFVRGE GCGVVALKRL SDAQRDGDRV WALLRGSAIN
HDGRSSGLTV PNVRAQEATL TRALASAEVA AEDIDYVEAH GTGTPLGDPI EIEALKAALG
GERGDGSRCV LGSVKTNIGH LEAAAGIAGL IKVVLAMGRE TVPAHLHLRQ INPRISLAHS
ALHIAAEASP WPAGERPRRA VVSSFGISGT NAGVVVEEAP PAPRPATPAR APAALLLPLS
ARAPEALRAL ALAHAQRLEA DADAGPGALA RHVALTGTRR SPLPLRQGFV GGDRGELIAG
LRAFAGQDEL RLREVGDPPR VAMIFSGQGS QWLGMGVELY AREPVFRAAV DAFDAATREV
AGWSVRDELF AEPARARLDR VEVIQPCIVA VQLALAALWR SWGVEPSVVV GQSMGEVSAA
CVAGALDLAD AARVILTRSR LVKQLRGGAM ASVELPAAEL ADALGEGLGV AAINGPRSSV
VAGDSDAVDR FVAEMNQRGV FCRRVKVDYA SHSPEVEPLR QALLDELAPV RGRAPALAFR
STVHGGWVGD GELDAAYWYQ NLRQPVQLFP VLERLLGEDG VDVLLEVSPH PVLGPVLQAA
AEHAGCDAAV LASLRREQAE RQTLLLTLAG LYGRGQAVDF ARVNAAADDA DDADDADANP
DAADPVARWT PLPTYPWQRR RHWVNDEGGP RPQAAAALPG EALPPGRRLR SPALRDAVYE
LVLGADSLRC FDSHRVPGGV IAPASWMLSM VLAALRDLGH PDEIALHQLS FARPLAIPEG
QRRRVQLVLS PDGQRPARYQ MLAVDADADA DALEASAWTL LSEGAIALSE DAAPAPLDVA
ATSARLEPVA EDAVAGLVGE PGPTRWVEAV LRGPREVLCR LRGPRGGDHG DRYPVHPEPL
NEALSAAVAC AGLGGGGFAP VAAQGLRFTG GEAGPPAWIH GSVEQVESGG RAALSATLAL
YDQAGRPVAK LARLRCVPAA LESSLQAETG LLARSRYALA WEPLAPPSAP LAPGRWLLVA
DVGGVCELLA ARLEAEGHVC VRLPAPAADD ANPADSADDD ALTAAFADAL ADAVDSADGD
ALPLRGLIFG PGLDADADAA AAADDDDDAA GDALARFAAT RALHTLARAL AGRALAPVWI
ATRGAVAARP DETSTAPAAA ALWGLGRVLG SEHPELSPRL LDLDAAGSAR TCADQLRRAL
TLALGGEDQL ALRGQQVLGL RLRRVRARDQ GGSLALSTEG AYLITGGLGR LGLSVAEWLV
ARGARHLVLL ARSLPSAAAE ARIAALEAQG AEVLALQADV ADAAALGRAL AAADSAMPAL
RGVIHAAGQA RQALLVDEPW RDYAQVLGAK AAGAWNLHQL TRERALDFFV CFSSIAGTLG
FGGMGSYAAA NAYLDAFAEY RRGRGLPALS VAWGVWDSDL DAQYGERALR VGLAPFAGAD
ALAALDTLAA GEAAHAIVAN MDWARYLKAR VGAAPPWLRE LAAVGDRTPE GSGEGDAALL
GRLRALPEQA AAEHIADHVA GAVAETLGYP RHHALPRGKG FFDIGFDSLL AMDLRRRLSR
DFAHPFPVTV AFDHPTIERL AAYLAAHWQD HGAPAAPSAD QPSTEPSTET STETSTERSG
ASLSAPAAAE VAAAGAPEPI ALVGIGCRFP GGVVGPESYW ELLAAGRDAT SEAPRGRWND
ESLFDPDPGA PGKFHVRRAG FLDDIESFDP EFFGISPREA ARMDPQQRLL LEVTWEALEH
AGVAADALVD SSTGVFVSGA PNQYLERFGD DPIELDAYAL TGNLPCTLSG RVSYVLGLRG
PNLFLDTGCS GALVALHLAC QSLRAGECDL ALVAGVNVLL SADMMIGLSK TGALSPDGRC
KTFDAAANGF GRGEGCGVLV AKRLRDARAD GDRVIAVVRG SAVNHDGRSG GLTVPSGTAQ
RALMERALRQ AQLPAAQVGF VEAHGTGTQL GDPIEIGALA AVYGRASGRT APCFLGAVKS
NLGHLEAAAG AAGVIKAALA LERGEIPPNV HLAERNPDLP LADEPFELPA RVHPWPSASQ
RLAAVSSFGL GGTNAHAILE RLPTPPETDT GADAPARPVH LLALSARHPE ALAEQARRLA
EHLARHPGQR PEDVAFSLNC GRAHLPHRAA VRFTGGDDLR ERLGALAADP EGDDAIRGLV
TDTQPLRVGF LFTGQGSQYA GMSRALYASQ PVFREAFDAC AEFLERDAER PLAAVLADAE
TIDRTGNAQP AIFAVQYALT RLWRSWGVAP YAVFGHSVGE VAAACAAGAL TLEDALLLIR
ERARWMETVP DGGVMVSVRA PAEVVAEAIA PRAHEVAIAA LNGPENTVIS GAGAAVRALA
EELRGRGLEA KELRVSVAFH SPALDPILEP FERATAEVLT RPPRLPWIGG LTGAALRGDE
VDYWRRQMRE PVQFTAAIGA LAELGCDVLL EVGPHPTLTG LAAESLPPEL ACLPSLRRGQ
DDDAVIADSL GRLYAAGAPV DWRSWDRPFA RRRLPLPTYP FQRRRLWFDA PPRQHHTNPV
TAYEREHETA WYSHCEWREQ AQAPAAIGRG HWVLLADRGG VAAALAAELE ARGHSCSLLR
PGDLEARDRE ASADAAEPHW TATAMARALD AVCPHGRPLR GVVHLWSLDL PATAALADAD
LEHAASLTLG SALALVQALA GRASAGGGPR LWPVTRGAVC TGADGAALAV AQAPLWGLGA
VIANEHPELW GGALDADPAD QPAAALAAAL AGELLAGPAA ERVAWREGRR LVARLVPYLP
ERTAPLPVHA EGCYLVTGGH GALGLAVAGW LVQRGAKHLV LMSRSGPDED AQATIDALGA
EGAEVIDVCA DIGDPAQVRA LLRDIEARGV PLRGVVHAAG VLEDGLLVNQ SWEAFERVLR
PKLRGAWHLH RNTRGLDFFV HFSSASALLG PHGQGSYAAA NAFLDALAHR ERAHGVPALS
VNWGPWAAGM AARLDAETSR RTLGAGWTPL AVADGWRVLD RVVGSDEVQV AVLPANWATL
ASEGALSPLI GELAGAAAQA PAAARRDAGR ALATLRATAP GERRRVLEAT VRRVVERTLS
WSADAELGRK QRFVEVGLDS LMAIEVRNRL QRELDLTLAA TTLFNYPTVG ELSEHLSELL
TTHRLLDDDA GPAQDETDEP SEIAHTSEIS STPAPPIPEA SADGASDDEL SEDELVALIA
AKYDSRT