Gene Hoch_4573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4573 
Symbol 
ID8546980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6244272 
End bp6255044 
Gene Length10773 bp 
Protein Length3590 aa 
Translation table11 
GC content67% 
IMG OID646389248 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003268957 
Protein GI262197748 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAACG CTGAAGGCAA GAACATGAAC AGCAGCGAGC AGACGGAAAA CCCGGCGTCC 
GAAACGCCGG TCGAGGGCGG AAACGGCGTC GCCGCTGCCG CCGGTGCCGA TGTCGCCGCT
GCCGAGGGGC AGAACGCCGC GGCCGACGCG GCCGTGGATA CGGGTGCAGA GGGGGAGGTC
GCAGCGGACG CGGCCGAGGG GGAAGGGGAG GGCGCTGCAG CCGAAGCCGC CGCCGCGGCG
GAAGCCGCCG GGGACGAGGC CGCCGCGGCC GACGCCACCG GGGCGGAAAC CACCGCGGCG
GAAACCGCCG GGGCGGAAAC CGCCGGGGCC GAGGAGGAGG AGGCCGAAGA GGTGCTCTCC
GACGAGCTGC TCCAGGCCAT GGAGACGGCG CGCGCGAGCG AGGCCGAGGG CACCGACGCC
GCGGTCGAAG CCTGGCGCGG CGTCGTCGCC AGCGACCCGA GCAAGCAGGC GCCGCGGCGC
GAGCTGGCCC GGGTGCTGCG CGAGGGGGAG AAGTGGCGGC CGCTGGCCGA CGCCCTCAAA
GAGGAAGAGC AGGAGGCCGC GCGCAGCGCC GCGGCCAAGG TCCAGGTGCT CGAAGAGCTG
GTCGAGGTCT ATCGCGACCA GCTCCGCAAC GAGCAGCAGT CGGTCAAGGC GCTCGAGCGC
ATCACCGAGG TCGCGCCCCA GCACCTGGCC GCCTACGATC AGCTCGCCGA GTACTACGAG
GGCAAGAAGC GCTGGCCCGA CCTCATCAAC ACCTTCACCA AGAAGGCCGA GAACCTGCCG
ACCGAGGCCG AGCAGGTGGC GCTCTACCTC GAGATCGCGC GGCTCTACAT CGACCGCTTC
TCGAACCAGG CCGAGGCCAT CAAGGCCTTC GAGCGGGTTC TCGAGCTCGA CCCCGACAAC
GAGTCGGCGG TGATGCACCT GCTCGAGGTG TACGAGAAGC GCCGCGACTG GGAGAAGCTC
ATCGGCCTGC GCGAGCGCGA GATCGAGGGT ATCGAAGACC CGCTCGATCG CGCCGAGAAG
ACCTACGAAG TGGCCAAGCT GGCGGCCACG CGGGTCAAGA AGCCCGAGGT GTGCATCCAC
TGGTGGGAGA AGGTGCTGGC CACCGACCCC GCGCACGAGG AGGCCATCGG CGAGCTGTAC
AAGCTCTACG AGCGCTCCAA GAGCTGGGAG AAGCTGGCCG AGATCTGCGA GAAGCAGGCG
AACATCGCGC CCGACGAGAA GACCCAGGCC GACTCGCTGC AGAAGCTCGG CCTGCTGTAC
ACCGACAAGA TCGAGGACAC CGACAAGGCG ATCCACGCCT GGCGCCGGCT GCTGCTGCTC
GACAACGAGA ATCGCCGCGC CCAGGACGCG CTCAAGAAGC TCTACATCGC CAATAAGGAT
TGGACCGCGC TCGAGGACTT CTACCGCTCG CAGGGCAAGC TCGACGAGTT CGTGCGCGTG
CTCGAGCGCC AGGTCGAGGC CGGTGACGAA GAGGACCGCC TGCCGCTGGC GATGAAGATC
GCGGTCACCT ACCGCGACGA GATCGAGAAG TCCGATCGCG CCATGCGCGC CTTCGAGAAG
GTGCTCTCGC TCGACGAGAA CAACCTCGAG GCCGCCGAGG CCCTGATCCC GATCTACGAG
GAGGGTCGCG ACCCGCGCAA GCTGGTGCGC GTGCTCGAGA TCCAGCTCGA GCAGACCGAG
GAGCCCGAGC TGCGCGTCGC GCGCACCAAG CAGCTCGCCG AGTACAGCGA GGAGAAGCTG
CGCGACAAGG GCGCGGCCTT TGGCTGGTAC CTCAAAGCCC ACGAGGGCGA GCACCGCGCC
GAGTGGGTGC GCACCGAGCT CGAGCGCCTG GCCGCCGAGA TCGGCACCTG GGCCGAGCTG
GTGGCCGCGT ATCAGGACTC GTATCCCAAG TTCGACGACC CGCAAGAGGC GCTGCCGCTG
ATGGCCGTGG TCGCCCGCGT GCAGGAAGAG GAACTGGGCG AGATCGACCG CGCGCTCGAG
ACCAACCGCA ACATCCTCGA GATCGACGAG GCCAACGAGG GCGCGATCAC GGCCCTCGAG
CGCCTGTATC TCGGTAAAGA GCGCTACGAG GACCTGCTCG ACATCTACAA GCGCAAGCTC
GAGCTGACCT TCGACGGCGA CGCCCGCACC GGCATCCAGT TCAAGATCGG CCAACTGTAC
GAAGAAGAGG TCAAAGACGA CGAGAAGGCC GTCGACGCCT ACCGCGAGAT CCTGGCCGCC
ATCGGCGATG ATCCGCGCGC GCTCCAGGCC CTCGATCGCA TCTACCTGCG CAACGAGCGC
TACGAGGACC TGGCCGAGAC CCTCGAGCAG CAGATCGCGG TCGCGGCCGG CGGCGACGAC
GAGAGCGAGC AGCTCGAGTT CAAGTTCCGC CTCGGCCAGG TGCGCGAGCA GCATCTCGAG
GACGTCGGCG GCGCCATCGT CTGTTACAGC GACATCCTGA TGGTCGAGCC CGGCCACGCG
GGCGCGCGCG CCGCGCTCGA GGCGCGCCTC GACGACGAGG AGCACAAGCT CGAGGCCGCC
GGCATTCTCG AGCCGATCTA CGAGCAGCTC GGCGCCTGGG CGGAGCTGGT GCGCGTGCAC
GAGATCCAGC TCGCGGCCGA GAGCGATGTG CTGCGGCGCG TGACGCTGCT GATGCGCATC
GGCGAGCTGC ACGCCAAGAA GCTCGGCGAC CCCGAGAACG CGTTTGGCGC CTACGCCCGC
TGCTTCCGCG AGGACCCGGC GACCGAGGGC GCCAAGCTGG CGCTCGAGGA GCTGTGCGCG
CTGCTCGACG ACGGCTGGGC GCGCCTGGTC GGTCTGTTCG AGGAGGCGCT CGGTCGCGAC
GACATCGACC CGCTGCTGGC GCACGAGCTG GCGACCAAGG TCGCGCGCGC CTACGAGGAG
CGCCTGGAGA ACAGCGAGAA GGCGGTCGAG TTCTACCGCC GCGCGCTGCA GGTCGACCCC
GACGACGAGG CCGCGCTCGA CGCCTTGGAG CGCATCTTCA CGGTCGGCGA GCAGTTCACC
GAGCTGCTCG AGGTGTTCCG CCGCAAGGCC GACATCGCGA CCGAGCCCGA CGCGCGTCTC
GAGATGCTGT TCCGCATCGC GTCGATCCAC GAGGGTGTGC TCGGCAACGC CGAGGACGCG
ATCTCGGCCT ACAACGAGAT TCTCGGACAG GAGCCCGATA ACCTCACCGC GCTGCGCGCG
CTCGACCGCC TGTATCTGCA GGGCGAGCAG TGGCAGGACC TGGGCGACAC CCTGACCCGC
CAGCTAACCC TGGCCGAGGC CGAGGACGAG CAGGTGAGCC TGCTGGTGCG GCTGGCGCAG
CTGCGCGAGA CCCACCTCGA AGAGCTGGCC GCGGCCATCG AAACCTACCG CCAGGTGCTC
GACCTGCAGC CGCAGAATCC CGACGCCGTG GCCGCCCTCG AGCGGCTGAT CGGCAGCGAG
GATCACGAGC TGACCATCGC CCAGATCCTC GAGCCCATCT ACCAGGCGAC GGGCAACTGG
GAGCGCCAGA TCGGCGTGTA CGAGATCATG GCGAAGCACG CCTACGATCC CGAGCGCAAG
ATCGAGCTGC TGCACCAGAT CGCCGAGCTG TACGAGGTCG GCGGCGACCG CCCGCAGGAG
GCCTTTGACA CCTACGCGCG CGCGTTCCGC GAGGAGCCGC GCTCGGAGCG CACCCAGGGG
CAACTCGACC GCCTGGCCCA GAACCTCGAC CGCTGGCCGC AGCTCGTCGA GCTCTACGAC
AGCGTTATCG CCGAGCTGTC CGACGAAGAC CTCAAGGTCC AGCTCCTGAT CAAGCTGGCC
AAGGTCTACG AGCTGGAGAT CGGCGACGAC AGCAGCGCGG TCGCCACCTA CGAGCGCATC
CTCGAGGTCG CGCCCGAGCA CGTCGAGGCC GCCTCGGCGA TTCAGCTCGT CCACGAGCGC
AACGCCAACT ACCCGGCGCT GGTCGCCATC CTCAAGCGCA AGAGCGAGAT CCTGCTCGAT
CTGCCCGAGC GCAAGTCGCT GCTGTACAAG GCCGCGCAGC TCCAGGAAGA GGTCCTCGAG
GACCTCGACG CCGCCATCGC CACCTATCAG TTGGTGCTCG ATCTCGACGA CATCGACATG
CCGGCGATGA ACGCGCTCGA GCGGCTCTAT ATCCGCCTCG AGCGCTGGGA GATGCTCAAG
GACGTCTACG CCAAGAAGGC CGACCTCGCC GAGCATCCCG ACGACAAGAA GCAGATGCTG
CACGTTCTCG GTCAGGTGTA CGACCAAGAG CTGAGCGACG TCGGCAAGGC CATCGAGACC
TACCAAGCCA TCCTCGACAT CGACCCCGAC GAGCTGTCGG CCATCCAGCA ACTCGATCGC
CTGTTCTCGG CGGCTGAGCG CTGGTACGAC CTGCTGCAGA ACCTCGAGCG CCAGGTCGAG
CTGGCCGAGG CCACCGGCGA GATCGTCGGT CTCAAGTACC GTATCGGCGA GCTGTGGCAG
CACAAGCTGC AGGATCTGGC GCGCGCGATC GACAGCTACC GCGAGGCGCT CGAGCTCGAT
CCCGGTCACC ACGAGACCCT GGTCGCGCTC GAGGGTCTGC TGCGCAGCGA GGAGGGCGAG
CCCGGCGAGC CGATCATGGC CGCGCGCGTG CTCGAGCCCA TCTACGAGGG CAGCGGCGAG
TTCGACAAGC TGGTCCACGT GCTCGAGGTG ATGGTCGCCA ACACCGAGGA TCCCGACCAG
CGTATCGATC TTCTGCACCG CGCGGCCGGT CTGCTCGAGT ACCAGCTCGA CCGCGCCCCG
GGCGCCTTCG AGATGTACTG CCGGGCGCTG CGCGAGGACA ACGGCAACGA GATCACGCTC
GAGAACCTGC CCCGCCTGGC GCAGATCACG AGCGCGTGGC CGACCCTGGC GACGCTGTAT
GCCGAAGAGG CCGACAAGAG CCTCGACGTG CCCCGCCAGG TCGAGCTTTT GTCGCGTCTG
GCGCGCATCC AGGAGCAGGA ACTCGGCCAG TCCGAGCAGG CGATCGCGAC CTACAAGCGC
ATCCTCGAGG TCGATTTCGA CAACCGCGAC GCGATCTTCG CGCTCGACCG GCTGTACAGC
GCCGCCGAGC GCTGGGACGA CCTCACCGAG ATCCTGCGCA AAGAGATTCA GCTCGCCATC
TCCGAGGACG AGATCGTCGA TCTGCAGTTC CGTCTCGGCC AGGTGCTTGA GCAGCGCCTG
AGCGACCTCT CGGGCGCCAT CGAGGTGTAC CGCGAGATCC TCACGATGAA CGACAGCCAC
GCGCCGACGC TCAGCGCGCT GGAGATGCTG TTCCTCGAGG GCCATCATCA GATGGAGATC
GCCGGCATCC TCGAGCCACT GTACGAAGTC GCCGGCGAGT ACGAGAAGCT GCACCGCATC
TACGAGGTGC AGCTCGGCAT GCTCACCGAG GTGAGCGAGC GCCAGGGCAT GTTCCAGCGC
CTGGCCGAGC TGGCCGAGGA GCGCTTGAGC GACCAGGGCC GCGCCTTCCA CTGGTGGGGC
GAGGCCGCGT ACGAGGACCC GCGCTGGGAG CAGGCGGTCG AGGAGAGCGA GCGCCTGGCG
CAAGCCACCA CCGGCTGGCC GCACCTGGTC GAGGTCTACC GGCGCATCCT CGAGGCGCGC
CCCGAAGAGC CCGACGTGCG CCGGCAGACG CTTTTGCGTC TCGCCCGGGT GTACGAATAC
GAGCTGGCGC AGCCGGCCGA CGCCATCGAT TGCCACCTCA AGGTGCTCGA GATCGACGCC
CAGGACATCG ACGCCCTGCG CGCCCTCGAC CGGCTGTACG AAAACGCCGG GATGCACGAG
CAGCTCGTCG ATATCATCGG CCGGCGCATC GCGGTCACGC TCGACGGCGA CGAGATCATC
GAGTTCCACT TCCGCCGCGG CCGCATCTAC GCCGACGCGC TCGACGACCT CGACAAGGCG
CTGGCCTGCT ACGAGGCCGT GCTCGAGCAG GAGAGCCGCA ACCGCACCGC CCTGGAGGCG
AGCGAGCGCA TCTTCTTCCG TCGCGAAGAG TGGGATCGCC TGTACGGCGT GTACGAGAAG
CTCGTCGATG TCGCCGAGGA CGACGAGGAG CTGGCCGACG TCTACGCGCA CATGGCGCGC
ATCACCTCGG AGGCCATCGA CCGCGAGGAC GAAAAAGAGG ACGCGGTCGA GCTGTGGGAG
CGGGTGCTCG ACATCCGCGG CGACGAGCCG CAGGCCCTGG GCGCCCTGGC CGAGCTGTAC
GCGCGGCGCG AGAAGTGGCA GGACCTGGTC GAGATCATCG AGCGCCAGGT GCAGGCGGCG
CCCTCGCAGG CCGAGCAGAT CGTCTTCTAC AAGCGCCTCG GCCGCATCTG GGCCGAACAG
CTCGAGAGCG ATCACAACTC GCTCGACGCC TGGCTGCGCG CCGACGAGCT CGACGGTCAG
GACCTCGAGA CCCTGCGCGC GCTGGCCAAG CTGTACGAGA CCCTGCAGTC GTGGGAGGAT
CTGTCGACCA TCCTCGGCCG CATCGTGGTG CTCGGCCAGG TCACCGGCAG CATCAGCGAA
GACGAGATGA TCGCGCTGCA CGCGCGCGTC GGCGAGATCG AGGGCGACAT CCTCGGCCGC
GTCGATGACG CGGTCGCGGC CTGGCGGCAG GTCGTCGGCC TCGACCCCAG CGACTTCCGC
GCGCTCGACG CGCTCGAGAA GCTGTTCACC CGCGAGGCGC GCTGGGAGGA GTGCATCGAC
GTCCTGCAGA AGCGGGCGCT GATGCTCGAC GAGCCGCAGG AGCGCATCGA CACGCTGCTG
CAGGCGGCCG CGATCTGGGA GGAGAAGGTC CAGGATCTCG ACGAGGCGGC CGCGATCTAC
GCCCGCGTCC ACCAGAGCGA CCCGACCAAC GAGCGCGCCT CCGAGCGCCT CGAGGCCATC
TACCGCGCCA AGCACGAGTG GGGCCCGCTC AACGAGGTGC TGCTGGCCCG GGTCGAGCTG
TGCGAGGATT CCGACAGCAA GATCGATATC CTCGGACAGG TGGCCCAGAT CTACGAGACC
CAGCTCGACG ACTCCGAGTC GGCCTTTGTG GTGCTCCAGG CCGCGTTCCG CGAGGACTAC
TCGCACGAGC GTACGGCCAA AGCGCTCGAG CGTCTGGCGC AGAAGACCCA CAAGTGGGAA
GAGCTGCTCA CCGAGTACAC GCAGCTCGTG CAGAACCTCG AGGCCTCGGA GCCCGACTCC
GCCGCCGACC TGTGGGTCAA GATCGGACGC TGGTACGGCG ACCACCTGTC GCACGTCGAC
TACGCCATCC ACTCGATCCA GCGCGCGCTG AGCCTCGACT CCAACCACAC CGGCGCCCTG
GGCGCGCTCG CCGACTTCCA GGAGGCGCGC GAGTCCTGGT CGGAGCTTAT CGAGACCCGG
CGCAAGCACG CCGCGGTCGA GACCGACCCG GCCAAGAAGG TGCAGCTCTA TCTGTCCCTG
GCGCGCCTGC TCGAGGAGCG CATGCAGGTG CCGATGGAGG CCATCGCCGC CTACCGCTCG
GCGCTGGAGG CCGACCCCTC GTGCATGGAC GCGCTGCTGG CCCTCGAGGG CATGTATCGC
CAGCATGAGA TGTGGGAGCA GCTCATCGAC GTGCTCGGCC GCATCGCCGG GCTGTCCGAG
GACGACGACG AGATCATCCG CCTCAAGCTC GAGATCGGTG AGCTGTGGGA CGTGCGCATG
CTCGACGCGG CGCGCGCCAT CGACGCCTAC CGCGACGTGC TCGACATCGA TCACTCCAAT
CTGCCGGCGC TGCGCGCGCT CGAGCAGCTC TACGAGAAGA CCGGCCAGTC CGAGGCCTAC
CTCAGCAACC TCGAGGCGCA GCTCGACATC TCGTCCGACG CCGAGCGCAT CTCGCTCTAC
GAGCGCATGG CCTCGGCCTG GGAGGAGCGC TTCGGCAAGC TCGACCGCGC CGCCGAGTGC
CTCGAGAAGA TCATCCTCAC CGATGGTCGC AACTACCACG CCTACCGCGA GCTGGCCCGG
CTCTACCGTC AGGACCAGAA GTGGGATCCG CTGATCGAGA CCTATCGCAA CCACATCATG
GCGGCGAGCG ATCCGGCGAC GCGGATCGAG CTGTACTGCG CGATGGGCGA GGTCTACGAC
GAGCAGCTCG AGGATCCGGA CCGCGCGATC GAGTCCTACA AGGACGCGCT GACCTTCGAT
CCCGATGAGC CGAGCGCGCT CGATGCTCTC GGACAACTGT ACGAGAAAAT CAGCGACTGG
GATATGGCCA TCGACGCCAT GAGCCAGCTC GTCCGCATCA CCGACTCGCC GTCCAAGCAG
GTGGCGCTCT ACCACCGCAT CGGTCGCGTG TACGCCAGCG AGCTGCACGA CTACGAGCAG
GGTGAGGCGC AGTTCCTGCG CGCGCTGTCC ATCGACACCA CCCACGTGCC GACGATGGAA
GAGCTGGTGC GCCTGTACTC GGAGCGCGGC GATTGGCTCA AGGCCGCCCA GATGATGGTG
CGGGCCGAGA ACCACACCGA CAGCGTGCTC GACAAGATCC GGCTGCTGTA CCAGGCGGCG
CGCATCTACC TCGACGAGCT GCGCGACCGC GAGCAGTCCA AGCAGTACCT GGCGGCCGTG
ATCGCGCTCG ACCCCGAGCA CGTGGGCGCG GCCGAGCCGC TGGCCGACAT CTACTTCCGC
GAGGAGCAGT GGCAGCCGCT GGCGCCGATC CTCGACATGC TGGTGCGCAA GGCCCAGCAG
GAGCAGGCCG ACCCGCAGCG GCTCAACGAG CTGTACTACC GCACCGCGCG CACGGCCGAC
CACCTGGGCG AAAACGAAAA AGCGCTGCAG TTCTACGGCG GCGCCTACGA CATCGACTCG
ACCTACCTGC CGACCTTGGT CGGACGCGCG GATCTGCTGT TCAAGCAGGC CATCTGGGAG
GACGCGGGCA AGATCTACCA GACGATCCTC GTGCAGCACC GCGATGCGCT GAATGAGGAC
GACGTGGTGC GCATCTACTA CCGACTCGGT ATGGTGCGCC AGCATCTCGG CGAGCGCAAA
AAGGCGCTCA ACATGTTCGA GAAGGCGCTG GAGATCGATC CCACGCATCG CGACACGCTG
CTCGCCGTCA TCGCGATCCA GCAGGAGCAG GGCGAGTTCG AGGCGGTGAT CCACGCCAAG
CGCGGTCTGA TGGCCACGGC CGACGACCAG GAGCGCATGT CCACGCTCAA CGAGATCGGC
GATATCTATC GCGAGCGGCT GCAGAACCCG CAGAAGGCGA TCAGCGCCTA TCTCGAGGCG
CTCGACGTGG TCGCCGACGA CCACCAGCTC CTGCAGAAGG TCCTCGACCT CTACACCGAG
ACCAAGCAGT GGCGCCTGGC GGTGGACACC ATCGAGCGCT TCGTCTCGCT CGAGTCCAAT
CCGCTGTACA AGGGCACCTA CTACCACGCC GCGGGCTCGA TCTGCCGGCG TGAGCTCAAG
GACCTCGATG AGGCCGTGCG CTACTACAAT CAGGCGCTCG ACAACTTCTT CGCCGAGGGC
GTGGAGCTGC CCGAGAGCGT GCTCAAGCGC GCGTTCGAGT CCTTCGAGTA CATCGACAAG
ATGCTGACCA GTCAGCGCGA CTGGAAGGAG CAGGAGCGCG CGTATCGCCA TATGATCAAG
CGCCTCCAGG GCAAGCTGCC GGGCGCGTCC ATCCACGCCC AGCTCTGGCA CTCGCTCGGC
GTTATCTACC TGTCGCGCCT CAAGCACTAC CAGAGCGCGA TCGGGGCCTT CGAGGTCGCG
CAGCAGCTCG ACCCGGGCAA CATCGATCGC CGCGAGATCC TCGTCGGCCT GTACCTCGAG
CAGGGGCCCG AGTACGCGGC CAAGGCGGTT GAGCAGCACA TGCTCATCCT GCGCGAGGAT
CCGCTCAACT ACAGCAGCTA CAAGGCGCTG CGGCGCATCT ACATGGAGTC GCAGCAGTTC
GACAAGGCGT GGTGCGTGTG CAACACGCTG GCCTACCTCA AGCAGGCCGA CGCCGAGGAG
CTGCAGTTCT GCGAGACGTA TAAGCCGCGC GGCTTCACCA AGGCCAAGCA GACGCTCACG
GGTGAGGTCT GGCGCAACAT CTACCACCCG AACGAGAACC GCTACATCAG CTCGATCTTC
GGCGCCATCT GGGAGGGGCC GGTGATGCGG TACGCGCGTC CGGCCAAGGC CTTTGGACTC
AAGCGGCGCG ATCGCCGTGA CGTGGCCAAT GACCAGCTCG TGTTCTCGCG CATCTTCTCG
TACGTGGCGC AGGTGCTCAA CGTGATGCCG CCCGATGTCT ACCTGCAGGA GAATCAGCAG
GGCGACATCA TGCTGGCCAA CGTGCTCGAG AAGCAGCGGC TCATCCCCTC GTTCGTGGTC
GGCAAGAATC TACTCGCCGG TCGCCCCGAG AAAGAGGTGG CCTTTGCCGT GGCGCGCAAG
CTGTGCTTGG TGCGTCCCGA CTACTACCTG CGCCTGGCGC TGCAGACCAA CATCGAGCGC
AAGGTCGCGC TGTTTGCCGC CATCGGCCTG GTGATGCCCA ACTTCCCGGT GCCGCAGGAG
CACATCCCGA TGGTGCAGCA GGAGATGGGG CAGATGCAGG CCCGGGTGCC CCCGGGCAAT
ATCGAGCAGC TCGGCCGTCT GGTGCGCGAC TTCGTCAACA ACACCCCGGG CAACATCAAC
CTGCACGCCT GGGAGCACGC GGTGGACTCC ACGACCTATC GTCTGGGCTT CATCCTGTGC
GGCGACCTCG AGGTGGCCGC GCGCATGGTC TCGGCGGAGC CGGTCGTGGT CGGCGGTCCG
CAGAGCAAGG ACCGGCTCAA AGAGCTGCTG CTGTACTCGG TGTCCGAGGA GTACTTCGCG
GTGCGCACGC AGCTCGGTCT CACCATCGGC TGA
 
Protein sequence
MENAEGKNMN SSEQTENPAS ETPVEGGNGV AAAAGADVAA AEGQNAAADA AVDTGAEGEV 
AADAAEGEGE GAAAEAAAAA EAAGDEAAAA DATGAETTAA ETAGAETAGA EEEEAEEVLS
DELLQAMETA RASEAEGTDA AVEAWRGVVA SDPSKQAPRR ELARVLREGE KWRPLADALK
EEEQEAARSA AAKVQVLEEL VEVYRDQLRN EQQSVKALER ITEVAPQHLA AYDQLAEYYE
GKKRWPDLIN TFTKKAENLP TEAEQVALYL EIARLYIDRF SNQAEAIKAF ERVLELDPDN
ESAVMHLLEV YEKRRDWEKL IGLREREIEG IEDPLDRAEK TYEVAKLAAT RVKKPEVCIH
WWEKVLATDP AHEEAIGELY KLYERSKSWE KLAEICEKQA NIAPDEKTQA DSLQKLGLLY
TDKIEDTDKA IHAWRRLLLL DNENRRAQDA LKKLYIANKD WTALEDFYRS QGKLDEFVRV
LERQVEAGDE EDRLPLAMKI AVTYRDEIEK SDRAMRAFEK VLSLDENNLE AAEALIPIYE
EGRDPRKLVR VLEIQLEQTE EPELRVARTK QLAEYSEEKL RDKGAAFGWY LKAHEGEHRA
EWVRTELERL AAEIGTWAEL VAAYQDSYPK FDDPQEALPL MAVVARVQEE ELGEIDRALE
TNRNILEIDE ANEGAITALE RLYLGKERYE DLLDIYKRKL ELTFDGDART GIQFKIGQLY
EEEVKDDEKA VDAYREILAA IGDDPRALQA LDRIYLRNER YEDLAETLEQ QIAVAAGGDD
ESEQLEFKFR LGQVREQHLE DVGGAIVCYS DILMVEPGHA GARAALEARL DDEEHKLEAA
GILEPIYEQL GAWAELVRVH EIQLAAESDV LRRVTLLMRI GELHAKKLGD PENAFGAYAR
CFREDPATEG AKLALEELCA LLDDGWARLV GLFEEALGRD DIDPLLAHEL ATKVARAYEE
RLENSEKAVE FYRRALQVDP DDEAALDALE RIFTVGEQFT ELLEVFRRKA DIATEPDARL
EMLFRIASIH EGVLGNAEDA ISAYNEILGQ EPDNLTALRA LDRLYLQGEQ WQDLGDTLTR
QLTLAEAEDE QVSLLVRLAQ LRETHLEELA AAIETYRQVL DLQPQNPDAV AALERLIGSE
DHELTIAQIL EPIYQATGNW ERQIGVYEIM AKHAYDPERK IELLHQIAEL YEVGGDRPQE
AFDTYARAFR EEPRSERTQG QLDRLAQNLD RWPQLVELYD SVIAELSDED LKVQLLIKLA
KVYELEIGDD SSAVATYERI LEVAPEHVEA ASAIQLVHER NANYPALVAI LKRKSEILLD
LPERKSLLYK AAQLQEEVLE DLDAAIATYQ LVLDLDDIDM PAMNALERLY IRLERWEMLK
DVYAKKADLA EHPDDKKQML HVLGQVYDQE LSDVGKAIET YQAILDIDPD ELSAIQQLDR
LFSAAERWYD LLQNLERQVE LAEATGEIVG LKYRIGELWQ HKLQDLARAI DSYREALELD
PGHHETLVAL EGLLRSEEGE PGEPIMAARV LEPIYEGSGE FDKLVHVLEV MVANTEDPDQ
RIDLLHRAAG LLEYQLDRAP GAFEMYCRAL REDNGNEITL ENLPRLAQIT SAWPTLATLY
AEEADKSLDV PRQVELLSRL ARIQEQELGQ SEQAIATYKR ILEVDFDNRD AIFALDRLYS
AAERWDDLTE ILRKEIQLAI SEDEIVDLQF RLGQVLEQRL SDLSGAIEVY REILTMNDSH
APTLSALEML FLEGHHQMEI AGILEPLYEV AGEYEKLHRI YEVQLGMLTE VSERQGMFQR
LAELAEERLS DQGRAFHWWG EAAYEDPRWE QAVEESERLA QATTGWPHLV EVYRRILEAR
PEEPDVRRQT LLRLARVYEY ELAQPADAID CHLKVLEIDA QDIDALRALD RLYENAGMHE
QLVDIIGRRI AVTLDGDEII EFHFRRGRIY ADALDDLDKA LACYEAVLEQ ESRNRTALEA
SERIFFRREE WDRLYGVYEK LVDVAEDDEE LADVYAHMAR ITSEAIDRED EKEDAVELWE
RVLDIRGDEP QALGALAELY ARREKWQDLV EIIERQVQAA PSQAEQIVFY KRLGRIWAEQ
LESDHNSLDA WLRADELDGQ DLETLRALAK LYETLQSWED LSTILGRIVV LGQVTGSISE
DEMIALHARV GEIEGDILGR VDDAVAAWRQ VVGLDPSDFR ALDALEKLFT REARWEECID
VLQKRALMLD EPQERIDTLL QAAAIWEEKV QDLDEAAAIY ARVHQSDPTN ERASERLEAI
YRAKHEWGPL NEVLLARVEL CEDSDSKIDI LGQVAQIYET QLDDSESAFV VLQAAFREDY
SHERTAKALE RLAQKTHKWE ELLTEYTQLV QNLEASEPDS AADLWVKIGR WYGDHLSHVD
YAIHSIQRAL SLDSNHTGAL GALADFQEAR ESWSELIETR RKHAAVETDP AKKVQLYLSL
ARLLEERMQV PMEAIAAYRS ALEADPSCMD ALLALEGMYR QHEMWEQLID VLGRIAGLSE
DDDEIIRLKL EIGELWDVRM LDAARAIDAY RDVLDIDHSN LPALRALEQL YEKTGQSEAY
LSNLEAQLDI SSDAERISLY ERMASAWEER FGKLDRAAEC LEKIILTDGR NYHAYRELAR
LYRQDQKWDP LIETYRNHIM AASDPATRIE LYCAMGEVYD EQLEDPDRAI ESYKDALTFD
PDEPSALDAL GQLYEKISDW DMAIDAMSQL VRITDSPSKQ VALYHRIGRV YASELHDYEQ
GEAQFLRALS IDTTHVPTME ELVRLYSERG DWLKAAQMMV RAENHTDSVL DKIRLLYQAA
RIYLDELRDR EQSKQYLAAV IALDPEHVGA AEPLADIYFR EEQWQPLAPI LDMLVRKAQQ
EQADPQRLNE LYYRTARTAD HLGENEKALQ FYGGAYDIDS TYLPTLVGRA DLLFKQAIWE
DAGKIYQTIL VQHRDALNED DVVRIYYRLG MVRQHLGERK KALNMFEKAL EIDPTHRDTL
LAVIAIQQEQ GEFEAVIHAK RGLMATADDQ ERMSTLNEIG DIYRERLQNP QKAISAYLEA
LDVVADDHQL LQKVLDLYTE TKQWRLAVDT IERFVSLESN PLYKGTYYHA AGSICRRELK
DLDEAVRYYN QALDNFFAEG VELPESVLKR AFESFEYIDK MLTSQRDWKE QERAYRHMIK
RLQGKLPGAS IHAQLWHSLG VIYLSRLKHY QSAIGAFEVA QQLDPGNIDR REILVGLYLE
QGPEYAAKAV EQHMLILRED PLNYSSYKAL RRIYMESQQF DKAWCVCNTL AYLKQADAEE
LQFCETYKPR GFTKAKQTLT GEVWRNIYHP NENRYISSIF GAIWEGPVMR YARPAKAFGL
KRRDRRDVAN DQLVFSRIFS YVAQVLNVMP PDVYLQENQQ GDIMLANVLE KQRLIPSFVV
GKNLLAGRPE KEVAFAVARK LCLVRPDYYL RLALQTNIER KVALFAAIGL VMPNFPVPQE
HIPMVQQEMG QMQARVPPGN IEQLGRLVRD FVNNTPGNIN LHAWEHAVDS TTYRLGFILC
GDLEVAARMV SAEPVVVGGP QSKDRLKELL LYSVSEEYFA VRTQLGLTIG