Gene Hoch_2954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2954 
Symbol 
ID8545342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4027855 
End bp4038204 
Gene Length10350 bp 
Protein Length3449 aa 
Translation table11 
GC content68% 
IMG OID646387633 
ProductAcyl transferase 
Protein accessionYP_003267361 
Protein GI262196152 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.342173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG TACAGCAAAC AGCCGTCGAT GCCCTGCGCG CGGCGCTAAC CACCGTAGAA 
CGCCTTCGCA AGCAAAAGCA GGAGCTGACC GCACGAACGC ACGAGCCCAT CGCGATCGTG
GGAATGGGGT GCCGCTTCCC CGGCTCGGTC AGGACCCCGG AGGCCCTGTG GCGGGTACTG
CACGACGGCC AGGACTTGGT GTCGCGGTTT CCTGAAGATC GAGGCTGGAA GGTCGATTCG
CTCTACGACC CGGATCCCGA TAGCTACGGC AAAAGCTACA CGGTCCAGGG CGGTTTTTTA
CACGACGCCG ACCGGTTTGA CGCGGGTTTC TTCGGCATCA GCCCGCGTGA AGCCCTCGCG
CTCGACCCGC AGCAGCGGCT CCTGCTCGAA ACCTCGTGGG AGACCTTTGA GCGGGCCGGC
ATTGCCCCTG ATTCGCTGCA GGGCTCGGCG ACCGGCGTAT TCGTCGGCGT CATGTACAAT
GACTACGGCA CGCGGCTCTG GTCGCTCCAG CGCGACGCGG GCTTCGTGCT CCACGCCCCT
GAAGACCTCG AGGGCTACAT GGGCGTATCC AGCGCGCCCA GCGCCGCCTC GGGTCGGCTG
TCGTACTCGT TTGGACTGCA GGGTCCGGCG GTGACTTTAG ACACGGCCTG CAGCTCGTCG
CTGGTGGCGA TCCATCTGGC CGCTCAGGCT CTGCGTCGAG GCGAGTGCAC GCTCGCGCTC
GCTGGCGGCG TGACCGTCAT GGCCACGCCA GGAGGGTTCG TCTCGTTCAG TCGCCAGCGG
GCGCTATCCC CGGACGGCCG CTGCAAGGCG TTCTCGGCGG ACGCCGACGG CGCGGGCTGG
GGCGAAGGCG CGGGCATGCT GCTGCTGGAG CGCCTGTCGG ACGCCAAGCG CAACGGCCAT
CCGATTTTGG CGGTCCTGCG CGGCTCCGCG GTGAACCAGG ATGGCAAGAG CCAAGGCCTC
ACGGCTCCGA ACGGCCCCGC TCAGCAGCGG GTGATCCTGC AGGCGCTCGA CAACGCCCGG
CTCACGCCCA ATCAGGTGGA CGCCGTCGAA GCCCACGGCA CGGGCACCAA GCTGGGCGAT
CCCATCGAAG CGCAGGCCCT GCTCGCCACC TACGGCCAGG CGCACACGCC AGAGGCCCCC
GTGTGGCTGG GAAGTCTGAA ATCGAACCTG GGCCACACCC AGGCCGCGGC CGGGGTCGCT
GGCGTCATGA AGATGGTCTT GGCGCTGCAG CACCAGATGC TGCCGGCGAC GCTGCACGCC
CAGACGCCCT CGCCACATAT CGACTGGTCG TCCGGCACGC TCCAGCTGGT GCAGAGCGCA
CGCCCCTGGC AAACGAACGG CCAGCCGCGG CGAGCGGGCG TCTCCTCCTT TGGAGTCTCT
GGAACCAACG CTCACCTGAT CTTGGAAGAA GCGCCGCTGG AGCAGGCGGC GACCGAGCAA
CGAGCAGCTG CGACTGCGCC CGTGGCGGCC TTGCCGTTTT TGCTCTCAGG CAAAACCGAA
GAAGCGCTGA AGGCCCAAGC CCAGCGCTTG CACCAGCACC TACAACGCCA CGAGGACGCC
GCGCTGGTGG ACGTGTCCTA TTCGCTCGCC ACTACCCGCG CGCACTTCGA GCAGCGAGCC
GCCCTCGTCG CCTCCACGCG GGAAGAGCTC CTCGCCGCTC TCGCCGCCCT CGCGAACGGA
GAGAGCGCTC CGTCGCTGGT GGTCGCTCCG CGAAGCGCCG ACGGCAAGGT CGTGTTCGTG
TTCCCCGGCC AAGGGTCGCA GTGGCAGGGC ATGGGGCGAG CGCTCTTGCG GAGTTCGGAC
GCGTTCCGAG CCGAGGTCGA GGCCTGTGAG GCCGCCTTCG CTCCGTACAT CGACGGCTCG
CTGCGCGAAG CGCTCGAGGG CGGCAGCTCA GACCGGGTCG ACGTGCTCCA GCCCGTGCTC
TTCACGATGA TGGTCTCGCT CGCTGCCCAT TGGCGCTCGC TCGGCGTGGT GGCAGCCGCC
GTTGTCGGTC ACAGCCAGGG CGAGGTGGCC GCCGCCTACG TGGCGGGGGC GCTCTCCCTC
GACGACGCGG CCCAGATCGT GGCCCTCAGA AGCCGCGCCT TGCGGCGGGT CGCGGGGCGC
GGCGCAATGG CCGCGGTGGA ACTTGGAGCC GAGCAGCTCG CCACCTACCT CGCGCCGTTT
GAAGAACAGC TCGCCATCGG CGCCGTGAAC AGCCCTCGCG CGAGCCTCGT CGCCGGGCAG
CCGGCGGCGC TCGACGCGCT GCTCGAGAAA CTCGCCGAGG ACGGCGTCTA CACCCAGAAG
GCCCGCGGAA ACCACGCCTC CCACTGCCGC CTGGTCGAGC CGCTCGCGCA AGAGCTGACC
GACGCGCTCC AGGGCATCCG CCCCAGCACC TGCGCCATCC CCCTGTATTC GACCGTCACC
GGGACCAGGC TCGAAGGACA CGAACTCGAC GCCGACTACT GGTACCAAAA TCTGCGCGCC
CCGGTGCTCT TTCAATCCGC TACCGAGCGG CTGCTCGCCG ACGGTCACGA CCTCTTCGTC
GAGCTGAGCC CTCACCCCGT CCTCAGCCTC CCGCTCTACG AGACCTTCGA CGCACGTGAG
CACTCCGCCC AGGTCGTCAC CTCCCTGCGT CGCGGCGACG GTACGCACGC TCGCATGCTC
CTGAGCTTGG GCGAGCTGCA CAACCGCGGT CACAAGCTCG ACTGGCACGC CTTCTTCGCC
CCCTGGCACC CCCGCACCGT CCCCCTGCCT ACCTACGCGT TTCAGTACGA GCGCTTCTGG
CTCGAAGGTA CCCGTAACCA GAGCAGGGAT AGCGCCAGCT CCGGTTCCGC GGAGTCCTCG
TTTTGGCGCT CTGTAGAGGG CGGCGACGCA GATGGCCTCG AACGTCTCCT GAAGCTTCAG
GAGGCCCCCC AACGCTCGGC CTTGGACTCT CTCTTGCCGC TGCTCACCAA CTTTCGCGCA
CAGGCCGGCC AGCAGCGTGC CACCGACGCC TGGCGCTACC GTCTGGTGTG GAAGCCCGTT
ACGATCGCGA CGCACGCGGA ACTCTCCGGC GTTTGGTTGC TGATCGTGCC CGCGAGTCAG
GCGGCCGATA TCTGGGCGCA GTCGCTGATG CGAGGACTCG AAGCTGGCGG CGGCACCGTG
GTCACGTGTC ACGTTCACCA CGATCTTGCC GACCGCGCCA AGTTGGAAGC ACGGCTACGC
GAGCTGCTGG CAGGTAAGGC GCTTCCGTCT GCGCAGACTA TCCGCGGCGT GATTTCACTC
GCCGCGCTCG ATGAAGGGAC GTGTCCAGCC CATCCGTCGT TGAGCAACGG AATGGCCCTG
AACTTCAGCC TGATCCAAGC CCTGCTGGAC GCGGGCCTCA AGAGTTCCGT GTGGCTCTTG
ACTCGCGGAG CAGTCTCGAT CGGGCCCGCG GAACGCTTGA CCAACCCCCT CCAGGCCATG
ACCTGGGGCA CCGGCCGCGT GCTGGGCCTC GAGCATCCCG ACTATTGGGG CGGCCTTATC
GACCTTCCAG AGGCCTGCAG TGACGGCGCA GTCGAGCGCT TGGTCGGTCT GCTCGGCGCC
TGCAGCGAGG AGGACCAGCT CGCCCTGCGC GGCCCCGGCT GGTACGCACG CCGGCTGGTA
CGGGCCCCCT TGGGGGCGGC CGAGGCGGCC GATACGTACG TGCCACGCGG CACCGTCCTG
ATCACCGGCG GCACCGGCGG TCTGGGCGCC CACGTAGCCC GCTGGCTGGC GTCGAAAGGT
GCTCAGCACC TGGTGCTGGT CAGCCGTCGT GGCGAACTCT CACCCGGCGC AGAGCAGCTG
CACGACGAGC TGTCCGAACT CGGCGCACGG GTGACGATCG CTGCGTGCGA CGTGGCCGAT
CGGGCTGCGC TGCACAAGCT ACTCGACGCG CTCGATGCCG AGGGCGCCAG CATCCGGAGC
GTGGTGCACG CCGGCGGCGT GGCGCAGCAG ACCCCTCTCA TCGCGATGAC GCTCAGAGAG
TTTGCGGAGG TCGTCTCGGG CAAAGCGCGT GGCGCTCAGT TTTTGCACGA GCGCTTTGAT
GCTCAGCCCC TCGACGCCTT CGTACTGTTC TCCTCCGGCT CGTCCAGCTG GGGCGGAGGG
GGACAGGGTG CCTACGCCGC CGGCAACGCC TTCCTGGACG CCCTGGCCGA GCACCGGCGC
GGCCTCGGCC GGGCAGCCAC CTCCGTGGCC TGGGGGGCCT GGGCGGGCGA CGGTATGGTG
ACGTTGCTCG ACGACTCCGG CGAAAGCGCG CTCCGAACTC GCGGCATCCT GCCGATGTCC
CCCGAGCTCG CGATCGCCGC CCTGGCGCAA GCGCTCGACC ACCGCGAAAC CAAGCTCACC
GTCGCCAACA TGGACTGGGC CCGCTTCGCC CCCGCGTTTG CCTCGGCCCG TTCACGTCCG
CTGCTCCACG ATCTGGCTGA GGCGAAGGGC GCGCTCGAGG GGGCAGAAGG GACGTCCGCC
CTCGAGGGAC GTGAGACGGA GCTGCTCGCT CACCTGCGCA AGCTCACGGA ACCAGACCGC
CGACGCCTCC TGCTCTCGCG CGTGCTCGAG GAGACCGCGG CTGTGCTGGG GCACGCGGAC
GCCTCGCGCG TTGAAGCCAA GAGGGGTTTC TTCGAGATGG GTCTCGACTC GCTCTTGGCG
TTGGAACTCC GAAAGTGCCT CCAGTCGGCT ACCGGTCTCA AGCTACCTGC CACGATTGCC
TTCGATCACC CGTCACCCGA GCACGTTGCC GCCTTTCTAC AGCAATCGCT CGCGCCGATG
CTGGGTGACC CCAAGGTGGT CGTAGACCAC GCCGCTCATG CGTCGCCCGC GGTCGGCGAA
GGAAACGATC CGATCGCTAT CGTCGGTATG GCGCTCCAGT TTCCAGGAGG GGTTGACGAC
CCCGAGGCGT TCTGGAGCTT GCTCGAGCGG GGCGGCGACG CCGTCGCTCC GATCCCCAAG
AATCGCTGGA ACGCCGACGC GTTCTACGAC CCCGATCCCG AGGCCGTGAA CAAGAGCTAC
GTGCGCGAGG CCGCCATGCT GACGCACATA GACCTTTTTG ATGCGTCCTT TTTTGGAATC
AGCCCTCGCG AAGCCAAGTC TATCGACCCG CAGCACCGCC TCCTCCTGGA GGCCTCGTGG
CATGCGCTCG AGGACGCCGG CATCGTCCCG GCCGCCCTCG AGGACTCGCA GACCGGCGTG
TTCGTGGGCA TCCGCACCGG AGACTATGGA GCCGGCGAGA ACAGCATCGA AGAAACTGAG
GTCTACGCCA TCCAGGGGAT GAGCTCTTCG TTCGCGGCAG GCCGCCTGGC GTTCACGCTC
GGCCTGCGGG GACCTGCGTT AGCAGTGGAC ACCGCTTGCT CGTCGTCGCT GGTCACGTTG
CACCTCGCCT GCAAGGCGCT GCGCAACGGC GAGTGTGAGC TGGCTCTTGC CGCTGGAGTC
AACGTGATGA CCTCGCCAAG CAGCTTCAAG CTGCTCTCGC GCACGCGCTC GTTGGCGCCG
GACGGTCGCA CCAAGGCCTT CTCGGCAAAT GCCGACGGCT ACGGCCGCGG TGAAGGCGTG
ATCGTGGTTG TCCTCGAGCG TCTCAGCCGC GCCCGAGCGG AGGGCCACCG CGTGATGGCC
GTGGTACGCG GCAGCGCCAT CAATCACGAC GGCGCCTCGA GCGGCATCAC CGTCCCGAAC
GGGTCCTCGC AGCAGCAGGT GCTGCGCGCC GCCCTCGAAG ACGCCGGCCT CGCGCCCTCT
GACATCGACG TCGTCGAATG TCACGGAACC GGCACCAAGC TTGGAGACCC GATCGAGGTG
CAGGCTGTCG GCGCCGTCTA CCAAGAAGGG CGCGATCCCC ACTCCCCGCT CCTGCTCGGG
GGCGTGAAGA CCAACATCGG ACACCTTGAG ACCGCCGCGG GACTGGCGGG CGTGGCGAAG
ATGGTGTTGT CGCTTCAGCA CGAGGCGCTC CCTCCTACGC TGCACACAAC GCCTCTGAAC
CCCTTGCTGG ACTGGGAGTC CCTCCCGCTG CGGGTTGTCG ACCGACTGGA ACCGTGGCCC
CGCGAAGACG CTCGCCCTCG CCGCGCAGGC GTCTCCGCCT TTGGTCTTTC CGGTACCAAC
GCCCATGTCA TTCTCGAAGA GCCCCCGGCC GACCCCGCCG CACGGACCGA GGCAGCGAAC
GCCGCGGCCT CCGAACCACC TCCCTGGCCC TTTGTGCTCT CCGGCAAGTC CGAGCCAGCC
TTGCGGGCCC AGGTCGAGCA GCTGCGCGCG TACCTCGCGG CCCATCCTGA TCTCTCGCTT
TCGGATCTCG CCTACTCGCT CGCGACCACC CGCTCGCACT TCGACCATCG TGTCGCGATC
GTCGCCAGCG ACCGCACGGC ACTCATACAC CAACTCGCAG AACTCGGAGC AGGACGTGCG
CCAGCCGATA CGCTTCTCGG CCGCCGTGGG GCCGATGGCA AACTCACGTT CGTATTCCCC
GGGCAGGGTT CGCAATGGAT AGGAATGGCA GCGTCGCTGC TCACCTCCTC TGCGGTGTTC
CGAGCGCAGG TTGAAGCCTG CGAACACGCC TTCTCGCCCT ACATCGACTG GTCGCTGCTC
GCACTTCTCC AAGCAGGCCC AGGAGACCCG GACGCGGCCC GGCTCGACCA AATCGACGTG
CTTCAGCCGG CCCTGTTCAC GGTTATGGTG TCGCTGGCCG CCCTGTGGCG CGCGATGGGC
GTCGAGCCGG ATGCCGTCGT AGGCCACAGC CAGGGGGAAG TAGCTGCGGC CCATGTTGCA
GGTATCCTCT CGCTCGACGT GGCCGCCCGA ATCGTCGCGG TACGGAGCCG GGCGCTCGGC
GCCTTCACAG GCCGGGGGAG CATGGCCGCG GTAGAGTTGC CTCGCGGCGA ACTCGAGCAG
CTTCTCGCCA CCTCCGCGCT GGGCGAGCGC CTGTCCGTGG CCGCCGTCAA CAGCCCGTGT
TCCACGGCCC TCGCCGGTGC CGCTGAGGAC ATCGACGAAC TCCTCCAGCT GCTCGCGACC
GAAGGCATCT TCGCGCTCAA GCTCCGGGCC GACGTAGCCT CCCACTGCGA TCAGATCGAG
CCGCTGCGCG ATCAGCTCTT GAGCGAACTG GGGGAGTTCG ATCCGCAGCC GGCGCAGATT
CCCTTCTACT CGACGGTGAC GGGCAAACGT CTCGCGGGGC CGGAACTCAA CGCGGCGTAC
TGGTTCGACA ATCTGCGGCA ACCCGTGCTT TTCGGAGACG CGACGCAACT CCTGCTCGCG
GACGGTCACC GATTCTTCGT GGAAGTCAGC CCCCATCCGG TGCTGGCGTT CTCGATCCAC
GAAACTCTGG ACGCCGAGGA GCAAACGGCA TGCGTCGTGG GCTCACTGTG GCAGGAGGAG
GGCTATCTCG CCCGCTTCCT GCTCTCGATG GTCGAGCTTC ACGGCGGGGG CTTCCCAGTG
GATTGGCGCA CCTTCTTCCA GCCGACCATG GCTCGCCCGG TGCCGCTTCC CACGTACCGT
TTTCAGCACG AGCGCTTCTG GCTCGAAGGC ACCAAAGCCC AGCACGCCGA CGTGGCCTCT
GCTGGCCTCA GCTCTGCCGA GCACCCGCTG CTCGGCGCGG CCGTCGCGCT GGCCGACTCT
GGCGGATACT TAGTCACCGC TCGTCTGGCG CTCGCCGAAC ATCCCTGGCT CGTCGGACAC
CAGGTCTTCG GAACAGTCAT TCTGCCTGGC ACCGCCTACG TCGAGTTCGC GACGATCGCT
GCCCATCGCG TGGGGCTCGA GCGCGTAGAA GAGCTCACGC TAGAAGCGCC CCTTGCGCTC
TCCGCCGAGG GAGCGGTCTT GCTGCAGCTC TCGCTGGGGC CGCTCGACGA GCGCGGCAGG
CGCGCGTTGA CCATCTACGC CCAACCTGAG CAAGCCGTCG AGGACGGATG GACACGGCAC
GCCACGGGCA CGCTCGCGGC GCGCGACGCG GAGAGTCGTG CGCTCGACTT CGAGTTCCGC
ACCTGGCCGC CGGTGGGGGC CGAGTCGCTG GCGCTCGACG GTTTGTACGA CCAGCTGTCC
GCCGCAGGGC TTCAGTACGG CCCCGCTTTC CAGGGACTGC GCGCTGTCTA CCGCCGCGGT
GAGGAGTTTT TCGCCGACGT CGAGTTGGAG CAGGTGTTCG CGCGAGATGC GCGCCGCTTC
GCCCTCCACC CTGCCCTGCT CGACGCGGCG CTTCACGCGC TCACGTTCCA AGCGATCCAC
GCGGCCACGG ACGTCAGTCT GCCCTTTTCC TGGAATGATG TCTCGCTGCG CTCCGTCGGC
GCCTCCGCCC TACGCGTGCG GCTGCGCCGC TCCGCATCGG GCTCGGGGAT CTCGGTGGAT
ATTGCCGATA CCGCCGGCGA ACCAGTCGCA CACGTAGGAG AGCTGGCGAC CCGCCCCGTG
GCCCCCGAAC AGCTGCATCG CGCATCTGAG CGCCAGGACG GCTTGCTGCG TGTCGACTGG
AGCGACCCCA GGGCCTTTTC CTCTGAAGCC AGGGTGCCGG CTCAGGAGTG GGCGCTGGTG
GGGCCGGAAG ATGCCTCCCT CATATCGCAT GCGAATGCGA CCGGCGTGTC GCTCACGCAT
CACCAAGATC TCAATGCACT GCTCGAGAGC ATCGAGCGAG GCGGTGCGTT TCCCGAGGTC
GTCGTCGTTC CCTCCTACGA CGTCGCCCCC CTTCGCGACG TGATCGACGC GGCGCACAGC
ACCGCCGCGC ACACCCTCGG CGTGTTGCAG ACCTGGCTTG CTGAGGAGCG CTTCGCCACC
GCGCGCCTTA TCGTTGTAAC GCGGGGAGCT ATCGCCACCA GACCCGACGA AGACGTGCTC
GGCCTCGCTC AGGCCTCACT CTGGGGCTTG GTCCGCGTGG CGCAGTCCGA GCACCCCGAC
GCGAGCCTCA CCTTGGTTGA TGTGGACAAC CGAGAGGATT CCTTACACGC CTTATTTGGG
CTCTTCGGCT CACGCACGGC GCCGAACGCG TTGGCGGCGG AGCCGCAGCT CGCAGTTCGG
GCTGGAAACG TCTCGGTACC GCGCCTCGCA CGGCTAACGG AAGCGCCCGA CGCGCACGCA
CGGCCCTTCG ACCCGGCCGG AACCCTGCTG ATCACAGGAG GTACCGGAAC CCTCGGTAGA
CTCTTGGCGC GCCACGTAGT CACAAAACAT GGCGCGCGTC ACCTGCTCCT GGCGTCGCGG
CAGGGGCTCG CTGCCCCGGG CGCTTCAGAA CTCGTCAGTG ACCTTGCAGA AGCCGGCGCC
GAGGCCACCG TCGTAGCCTG CGACGTGTCC GACCGCAGCG CGCTGCAACG GCTCGTCGCA
GCCGTGCCAG CAAATCACCC TCTGACAGGC GTGCTCCACT TGGCTGGTGT ACTCGATGAC
GCTGTGATCG AGTCGCTCAC CCCCAAGCAC TTCGATACCG TCCTGCGCGT CAAGCTCGAC
GCCGCCTATC ATCTGCATGA GCTGACCCTC GAGCACGACC TGGCGGCTTT CGTGATGTTC
TCGTCACTTT CTGGCGTGCT CGGCAGCGCG GGTCAGGCCA ACTACGCGGC TGCGAACACC
TTCCTCGATG CCCTCGCCCA CCACCGCAAG GCACAGGGGC GGCACGGCCT CGCGATCGAC
TGGGGTTATT GGGAAGACAG AAGCGCCCTC ACCGCACATC TAACGGCTGC CGATCTGCAA
CGCTTCGCGC GCAGCGGTCT GCGTCCGCTC TCTGCGCAGG AGGGTCTCGC GCTGTTCGAT
GCGGCGTTGA CCCGGCCCGA CGCAGTCCTG GTCGCTGTGC GCCTCGACGC CATGGCGCTC
GCCAAGCACG CGAACGCACT CCCCCCCCTG CTGAGCGGGC TGGTTCCGGC CAAGGTTGCG
CGACCGATGG CATCCACCGG CACTTCGGTA GCGTCGCTGC AACAGCGCTT GGCGTCTCTG
GTCGTGGAAG AGCGCGCGCC AGCCCTGCTC GAGATCGTGT GCTCTGAGGT CGCCACCGTG
CTCGGCTTGG CAAGCCCCAA CGCGCTCGAT CCGGAAAGGC CACTGCAGGA ACTCGGCCTT
GACTCGCTGA TGGCGCTCGA GATCCGCAAC CGGCTCTCGG CCGCTACCGG ACTTCGGCTT
CGGGCCACGC TGTTGTTCGA CCACCCCACG GCGGCGGCGC TCACCCAGGC GCTCTTGGGC
TGGCTCGTGC CTGACGAGGC CCACGATCAT CAGGAGTCGG CACAGCTCGC CGAGTTGAAC
CGAGTCGAAA GCACGCTCGA AGCACTCCGC GCCATCCCGA CCGTGCGGGA TGCTCTCAGA
GAGCGCCTCG AGGCACTCCT GCGCAAGTGG GGCAGTTCGG ACGCGCCTGC CGAGGCCGGT
TTTGGTCAGC GCGTCGCGGA CGCAAACGTC GACGAGTTGC TCGATCTCCT CGACGAGAAG
TTCGGAGCGG ACATCAATGT CGAGTCGTGA
 
Protein sequence
MSDVQQTAVD ALRAALTTVE RLRKQKQELT ARTHEPIAIV GMGCRFPGSV RTPEALWRVL 
HDGQDLVSRF PEDRGWKVDS LYDPDPDSYG KSYTVQGGFL HDADRFDAGF FGISPREALA
LDPQQRLLLE TSWETFERAG IAPDSLQGSA TGVFVGVMYN DYGTRLWSLQ RDAGFVLHAP
EDLEGYMGVS SAPSAASGRL SYSFGLQGPA VTLDTACSSS LVAIHLAAQA LRRGECTLAL
AGGVTVMATP GGFVSFSRQR ALSPDGRCKA FSADADGAGW GEGAGMLLLE RLSDAKRNGH
PILAVLRGSA VNQDGKSQGL TAPNGPAQQR VILQALDNAR LTPNQVDAVE AHGTGTKLGD
PIEAQALLAT YGQAHTPEAP VWLGSLKSNL GHTQAAAGVA GVMKMVLALQ HQMLPATLHA
QTPSPHIDWS SGTLQLVQSA RPWQTNGQPR RAGVSSFGVS GTNAHLILEE APLEQAATEQ
RAAATAPVAA LPFLLSGKTE EALKAQAQRL HQHLQRHEDA ALVDVSYSLA TTRAHFEQRA
ALVASTREEL LAALAALANG ESAPSLVVAP RSADGKVVFV FPGQGSQWQG MGRALLRSSD
AFRAEVEACE AAFAPYIDGS LREALEGGSS DRVDVLQPVL FTMMVSLAAH WRSLGVVAAA
VVGHSQGEVA AAYVAGALSL DDAAQIVALR SRALRRVAGR GAMAAVELGA EQLATYLAPF
EEQLAIGAVN SPRASLVAGQ PAALDALLEK LAEDGVYTQK ARGNHASHCR LVEPLAQELT
DALQGIRPST CAIPLYSTVT GTRLEGHELD ADYWYQNLRA PVLFQSATER LLADGHDLFV
ELSPHPVLSL PLYETFDARE HSAQVVTSLR RGDGTHARML LSLGELHNRG HKLDWHAFFA
PWHPRTVPLP TYAFQYERFW LEGTRNQSRD SASSGSAESS FWRSVEGGDA DGLERLLKLQ
EAPQRSALDS LLPLLTNFRA QAGQQRATDA WRYRLVWKPV TIATHAELSG VWLLIVPASQ
AADIWAQSLM RGLEAGGGTV VTCHVHHDLA DRAKLEARLR ELLAGKALPS AQTIRGVISL
AALDEGTCPA HPSLSNGMAL NFSLIQALLD AGLKSSVWLL TRGAVSIGPA ERLTNPLQAM
TWGTGRVLGL EHPDYWGGLI DLPEACSDGA VERLVGLLGA CSEEDQLALR GPGWYARRLV
RAPLGAAEAA DTYVPRGTVL ITGGTGGLGA HVARWLASKG AQHLVLVSRR GELSPGAEQL
HDELSELGAR VTIAACDVAD RAALHKLLDA LDAEGASIRS VVHAGGVAQQ TPLIAMTLRE
FAEVVSGKAR GAQFLHERFD AQPLDAFVLF SSGSSSWGGG GQGAYAAGNA FLDALAEHRR
GLGRAATSVA WGAWAGDGMV TLLDDSGESA LRTRGILPMS PELAIAALAQ ALDHRETKLT
VANMDWARFA PAFASARSRP LLHDLAEAKG ALEGAEGTSA LEGRETELLA HLRKLTEPDR
RRLLLSRVLE ETAAVLGHAD ASRVEAKRGF FEMGLDSLLA LELRKCLQSA TGLKLPATIA
FDHPSPEHVA AFLQQSLAPM LGDPKVVVDH AAHASPAVGE GNDPIAIVGM ALQFPGGVDD
PEAFWSLLER GGDAVAPIPK NRWNADAFYD PDPEAVNKSY VREAAMLTHI DLFDASFFGI
SPREAKSIDP QHRLLLEASW HALEDAGIVP AALEDSQTGV FVGIRTGDYG AGENSIEETE
VYAIQGMSSS FAAGRLAFTL GLRGPALAVD TACSSSLVTL HLACKALRNG ECELALAAGV
NVMTSPSSFK LLSRTRSLAP DGRTKAFSAN ADGYGRGEGV IVVVLERLSR ARAEGHRVMA
VVRGSAINHD GASSGITVPN GSSQQQVLRA ALEDAGLAPS DIDVVECHGT GTKLGDPIEV
QAVGAVYQEG RDPHSPLLLG GVKTNIGHLE TAAGLAGVAK MVLSLQHEAL PPTLHTTPLN
PLLDWESLPL RVVDRLEPWP REDARPRRAG VSAFGLSGTN AHVILEEPPA DPAARTEAAN
AAASEPPPWP FVLSGKSEPA LRAQVEQLRA YLAAHPDLSL SDLAYSLATT RSHFDHRVAI
VASDRTALIH QLAELGAGRA PADTLLGRRG ADGKLTFVFP GQGSQWIGMA ASLLTSSAVF
RAQVEACEHA FSPYIDWSLL ALLQAGPGDP DAARLDQIDV LQPALFTVMV SLAALWRAMG
VEPDAVVGHS QGEVAAAHVA GILSLDVAAR IVAVRSRALG AFTGRGSMAA VELPRGELEQ
LLATSALGER LSVAAVNSPC STALAGAAED IDELLQLLAT EGIFALKLRA DVASHCDQIE
PLRDQLLSEL GEFDPQPAQI PFYSTVTGKR LAGPELNAAY WFDNLRQPVL FGDATQLLLA
DGHRFFVEVS PHPVLAFSIH ETLDAEEQTA CVVGSLWQEE GYLARFLLSM VELHGGGFPV
DWRTFFQPTM ARPVPLPTYR FQHERFWLEG TKAQHADVAS AGLSSAEHPL LGAAVALADS
GGYLVTARLA LAEHPWLVGH QVFGTVILPG TAYVEFATIA AHRVGLERVE ELTLEAPLAL
SAEGAVLLQL SLGPLDERGR RALTIYAQPE QAVEDGWTRH ATGTLAARDA ESRALDFEFR
TWPPVGAESL ALDGLYDQLS AAGLQYGPAF QGLRAVYRRG EEFFADVELE QVFARDARRF
ALHPALLDAA LHALTFQAIH AATDVSLPFS WNDVSLRSVG ASALRVRLRR SASGSGISVD
IADTAGEPVA HVGELATRPV APEQLHRASE RQDGLLRVDW SDPRAFSSEA RVPAQEWALV
GPEDASLISH ANATGVSLTH HQDLNALLES IERGGAFPEV VVVPSYDVAP LRDVIDAAHS
TAAHTLGVLQ TWLAEERFAT ARLIVVTRGA IATRPDEDVL GLAQASLWGL VRVAQSEHPD
ASLTLVDVDN REDSLHALFG LFGSRTAPNA LAAEPQLAVR AGNVSVPRLA RLTEAPDAHA
RPFDPAGTLL ITGGTGTLGR LLARHVVTKH GARHLLLASR QGLAAPGASE LVSDLAEAGA
EATVVACDVS DRSALQRLVA AVPANHPLTG VLHLAGVLDD AVIESLTPKH FDTVLRVKLD
AAYHLHELTL EHDLAAFVMF SSLSGVLGSA GQANYAAANT FLDALAHHRK AQGRHGLAID
WGYWEDRSAL TAHLTAADLQ RFARSGLRPL SAQEGLALFD AALTRPDAVL VAVRLDAMAL
AKHANALPPL LSGLVPAKVA RPMASTGTSV ASLQQRLASL VVEERAPALL EIVCSEVATV
LGLASPNALD PERPLQELGL DSLMALEIRN RLSAATGLRL RATLLFDHPT AAALTQALLG
WLVPDEAHDH QESAQLAELN RVESTLEALR AIPTVRDALR ERLEALLRKW GSSDAPAEAG
FGQRVADANV DELLDLLDEK FGADINVES