Gene Hoch_2925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2925 
Symbol 
ID8545313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3977097 
End bp3987467 
Gene Length10371 bp 
Protein Length3456 aa 
Translation table11 
GC content69% 
IMG OID646387607 
ProductYD repeat protein 
Protein accessionYP_003267335 
Protein GI262196126 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCTGA CGGGCCTGGC GTGCACGAAC GAGGAGGAAG CGCCCAACGC TGGCGACGTG 
GCCGTACACC GACTCACTCC GCGCTCGCTG GAGACCTCGG GCGATGTCTC CGCGGCCTGG
GCGCTCTTCG ATCGCGATAC CACGAGTAAA AACGTATTTT CTATTACCTC GCTCGACGGC
CAAGACATCA TTGCCCACCT CGACGAGGGC AGCGAACTCG AGGCCATCAA GGTATTCGGG
GCGAGCCCGT TTCAGCTCAC GCTCCTCGAC CCGCAGGGCC AGCTCGTCGC CGGCCCGCAC
GCGCTCGACA AACTGCCGAG CGGCTGGACG ACATTCCTGC TGCCGGATGC GCGGCGCGTC
GAACAGCTCA CCCTGCGCTT CGAGCCCACG GGCGACGGCG ACGCTGCCGT CTCGGAGATC
GAGTTCTGGG GCCGAGGCGC GTCCCTGCCG CTGGACTGGG AGCCGAGCGC CACCGATGCG
CCGCCGGCCG GGCTCGCCGA CATCGTGCCT GGCACGCCCG ACAGCCAGCA GCTCAGCCGC
GCGCCGTCTT CGGCGCAACC GGCTTGCGCC TCCTTCGACT TCGAGCTGAG CCGTCATCCC
GGCAGCTATC GCCGCGCCTG GCTGCGCTAC CAGACCGACG GCGTATTTCG TCCGCTCGTC
CTAACCCGCG CCTTCAACGA CGCGCCCGTG ACCCGCGGCT TCTGGGTCCC GCCCATGGCC
GACGAGGCCG GCGCGTTCGT CCATCGCGTC GACACCGAGC ATCTCCGGCT CGGCCACAAC
CAGGTCGAAT TCTGCCTGCC CGGGGAAGCC GCTCGCGCCG TGGCGATTCG CGATATCGAG
CTGGTCGCCG AGCTCGACCA CGGCAGCAAC ATCATCGAGA GCGTCTCGGT GGCCCCCATC
GACGGCGTGC CCACGTACAG CGCCATCGGC CTGCTGCGCG ACGGCCACGC GCCGGTCGCG
GTCACGGCCG GCCAGGAGCT GGTGATCGCG TTCGAGCGCT GGATCGCGCC CGAGGTCGTG
AGCATCGCGG CCGACGCGAC CGCCGACTGG TCCCTGCGCT GCGTGGACGC CGACGGCGCG
GCCCGCGACC TGCCGGCCAC ACTCGCGGAG CAAATCGCCG ACCGCGCCAT CTACACCATC
GACGACGCCA CCGGCGCGCG CTGCGCCGGC CTGCGCATGC GCCCCGCGCT CGCCAGCGGC
GAGGCCGCGG TCACCGACCT GCGCGTCTTC GGCTCGGGCA CCGACCGCCG CGGCGACTTC
CCGCGCATCG TGCTGGCCTC GGCCCGCGAG CACTTCGGCA ACGAAGCCTG GGTGGACGGC
TGGGCGCACG CGCCCGCACA CGTCGGCGGC GGCGTTCGCG TCCGCGTGGA CGACCAAGAC
ACCGACACCA CCACGGGCGT GTTCACGTCC ATGCTCCGCC GCACCAGCGA CCCCAAAGAG
AGCTGGCCCG TGACCATTAC CGCGCGCTTT GGCGACGGCA GCACCTTCAC GCGCCAGTAC
GTGCTCGACC GCGACGGCGG CACCATGCCC GGCGCCGAGG CCCGCGATCC CGTGCTCGAC
GATGGCCTCA CCGAGGCCGA GCGCCGCGCC CGCTTCGGCG ACGAGGGCGA CATCGCCGAG
GCCGAGGTCG CGCCCGGCGA GAGCAAGCGC ATCGAGCTCG GCACCGACGT CACCCTCGAC
ATCCCGGCCG GCGCCATGCA GGGCCGCAAA TCCGTCTCCA TCACCCACCT GAGCAGCGCC
GCCATCCCGC CCATGGATCC CGGCCTGGTC AACGTCACCG CGCCCTTCCG CCGCGGCTAC
GAGTTCCGGC CCCACGGCGA GCTGTTCGAC GACGCCTTGA CTGTGACCCT GCCGTATCAG
CCCTCGCTGC TGCCCAGCGG TTACGTGGCC GAGGACATCC AGACCTTCTA CTACAACGAG
ACCGAAAAGC GCTGGGAGCC GCTGGCGCGC GCCAAGGTCG AGCGCGGCCG CCAGGTGGTC
GAGAGCCTCA CCGACCACTT CACCACCATG ATCAACGCCG TGGTGGTGGC GCCCGAGAGC
CCGCAGATCG CGTCCTTCGA CCCCAACCGC CTCAAGGGCA TCGAGGCCGC CTCGCCCGCC
GCCCGCGTCG GCCTCATCGA GCCGCCGCAG GTCAACGCGC GCGGCGACGC GACCATGCAG
TACCCGCTCG ACATCCCGGC CGGCCGCCGC GGCGTCCAGC CCTCGCTCGG CCTGAGCTAC
AGCTCGGCGC GCGGCAACGG CTGGCTCGGC GTCGGCTGGG ACCTCGGCAC CTCGGCCATC
GAGATCGAGA CCCGCTGGGG CGTGCCCCGC TACCACGCCA CGCTCGAGAC CGAGACCTAT
CTCATCGACG GCACCCAGCT CTCGCCCACC GCCCACCGCG ACCTGCCCAA GCCGCGCGCC
GAGGGCACCA CGCGCATCGC CAACCAGACC GTCAAGGTCT TCCGCCCGCG CACCGAGGGC
GGCTTCGCGC GCATCGTGCG CCACGGCGAC AGTCCGCAGA GCTACTGGTG GGAGGTCACC
AGCACGCAGG GCGTGCGCTC GTTCTACGGC GGCACGCCCG AGTCCGGCAA ACTCGCCGCG
GCCACCCTGT CCGACGACAG CGGCAACGTG TTCCGCTGGG CCCTGCGCGA GATCCGCGAC
ACCCACGGCA ACCGCGTCCG CTTCGACTAC GACGCGGTCA CCTGGAGCGC GCCCGGCGCG
GTGCCGGGCC GCGAGCTGTA CCTGGCCTCG GCGCACTACA CCCTGCGGCA AGGCGAGAGC
ACGGCCCCGT ACCGCGTGGT CCTGGTTCGC GATCCGTGCA CGGCCTCGAG CTGCCGCCCC
GACGTGCTCG TCAGCGGCCG CGGCGGCTTC AAGCAGGTCA CGGCCGAGCG CCTGGGCCGC
ATCGAGGTCT ACTACCGGCA GACGCTCGTG CGCGCCTGGC AGCTCGAGTA CGACCAGGGC
CCCTTCGGCA AGAGCCGCCT GCGCGGCCTG CGGCAGTTCG GCGTCGGCGG CGAGCCCTTC
CCGGGCAACG TCCACACCTT CCACTACTAC GACGAGGTCA CGCAGTCGGC CTCGACCTAC
AGCGGCTTCG CCGCCTCGGC CCCCTGGCAC GCGGGCAACG AGACGCCCGA CAGCGGCCTG
GTGCTGCCCG GCGGTATTTT CGACGCCGTG GGCGAAGGCG AGGTCAGCGC GCTCGGCGGC
TCGCACACAG TGACCGTGGG CGGACACCTG TACGCCGGCC TGTCCTTCGG CCTGCCCGAC
AAGAAATACT CCATCGGCGC CAAGTTCGGC ACCCGCAGCG ACGAGACCGA CGGCCGCGCC
GCGCTCGTGG ACATCGACGG CGACGGCCTG CCCGACCGCG TGTTCCGCGG CTCCGGCGGC
TACTACTTCA ACCGCAACGA GTCCGGTCCG CTGGGCCCGC CGCGCTTTGC TGCCGCGGCC
CAGCCCGTGG CCAACCTGCC GGCGCTGTCG CGCGAATCCT CGCGCATGAG CCTCTCGGTC
GGCGCCGAGG GCTATTTCTT CCCGGCCCAG TTCCACGCCA ACACCTCGTT CTCGTCCGCC
GAGCAAGACA CCTACCTCAG CGACGTCAAC GCCGACGGCC TGGTCGACCT GGTCCACGGC
GGCGCCGTCA CCTTCAGCTA CCTGGACGAA ACCGGCAAGC CGCGCTTCCA CCCCGACAGC
ACCCGCACGC CCGCGCCCAT CGGCGTGAGC CGCGTGTCCT ACGACGCGCT GCCGCCCTCG
TTCGAGGACC CCGCGCTCGA GCACAACCTC GACGACTACA GCCCGCCCGT GGACACCGTG
CGCCGCTGGC AGGCGCCGTA CTCGGGCCAG ATCCGCATCA CGGGCGACGT CGCCCTGGCC
GCGCCGCAGC GCGACGGCGC CGACGGCGTG CGCGCCACCA TCGAGTACGA AGGCGAGCAG
CAGTGGAGCC ACGACTTCGG CCCGGCCGAG ACCCAGCCGC AGCGCCCGAG CCTGATGCTC
GACGTCGCCG CCGGTGAGCG CGTCTACTTC CGCATCCACG GCCGCGACGA CGAGCGCGAC
GACCGCGTGC GCTGGAGCCC CGTGATCGAG TACCTCGACC TCGCGCCCGG CCAGGACGAA
AACGGCCTCG ATCACCACCG CTTCGACGCG GCCGCCGAGT TCACGCTCGC CGGCCGCGCC
GACGCCAACA TGAACATGCC GTTCTCGGGC CGCGTGCGCC TCACCGGCCG CGCCAGCAAG
TCGCGCGAGA CCTCGGACGA CATCGAGATC CGCGTGCTGG TCGACGGCGC CATCGCGTTC
GCCGAGGTGC TCCCGGCCGC CGCCATGACC ACCGTGAACC TCGACCAGGC GCTCGACGTC
GAGGCCGGCA GCGTGGTCGC GCTGCGCATC CACAGCGACT CGCCCGTGGA CCTCACCGCG
TTCGGCTTCG ACCCGGTCGA GGACGACAGC GGCGCCATCA CCCGCGGCCC GCGCCTGCGC
TACGAGGACG CCCGCGACCA GAACGGCGAC CGCGTGCCGG TCGAGGGTCC GGGCGGCGCG
CCGATCTTCG ACGTGCCGGT GGCCTACGAC ATGACCACCT ACGCGCTGCG CATGCCCGAG
ATGCCGGCGC CCGCCTACGT GCCCGCCGTG GGCCGCAGCC TGCGCGTGCA CGGCCGGGTC
GACGCCGGCA ACGCGCCGTT CACGGGCACG GTCACGCTCA CGGCCAAGAG CCGCAACCGC
CTGCTGGCCA AGGCCCAGGT GGTCATCGAA GACGGCGGCG GCCACGATCT GGATTTCGAC
CTCCAGGTCG AACAAGGCGA GCCCGTGTTC TTCTCGTTCT CGGTGAGCGA CCCGGATCTC
TTCGGCGCCC TGCTCACCTC GCTGTCGGCC AGCGGCGCCG GCGCCCTGCC CTACCTCACC
TACGCCGTGG CCCCGGCCGG CCGGCTCTCG CATCCCTACC GCGGCTGGTC CTACGGCGGC
TACAGCGGCG CGGGCGCGCG CGGCGACCTG CCGATTCCGC CCGGCGAGCT CGACAAAGAG
CCGGTGTTCG ACGGCAGCGA GCCGCTCAGC GAGGACAACC TCGAGGAGCT GGCGCGCCGC
TTCGTCGAGG ACGGCCTGCA GGCGTTCCCG TTCCTGCCGG TGTCCGATGC GCCGGCGTGC
GAGGCCGACG CGCCGGCATG CGACGCCACC GCGACCGCGC ACTGGCAAGG CCCGCACGAG
CTGATCTACG TGACCGCGCA CGAGATGTCG ACCATGCGCC TCGGCGGTCC GGTGAAACTC
CTGCCTGCGC CCAGCGAGAT CGCCAACGCC CGCGCGGTGC CGCGCCTGTC GCGCTCGCGC
GGCCAGGCCA TCGGCGGCGG CGCGGTGCCG GTGTCGTTCT CCCAGGGCCA GGGCGAGTCG
CACGGCCTGC TCGACTTCCT CGACATGAAC GGCGACGGCT TCCCCGACGT GGTCACGCCC
GCACGCGTAC AGTACACCTC GCCGCTCGGC GTGCTGTCCG AAAACCGCAA CGTCGGCGCC
CGCGACCTGC GCATCACCAG CAACGAGAAC ACCACCATCG GCATCGGCGG CAACATCGCC
AACGCCATCG CCACCGCGCG CGGCCTGTTC GGCGGCGGCG CGCACAAGGC CGGCGGCGGC
ACCGCCAGCT CGCAGCAGAT GCAGCCCATC GGCTTCACGC TCAGCCTGGG CGCGAGCGAG
GGCGAGGCTT CGTCCGAGGC CGACCTCATG GACGTCAACG GCGACGGCCT GCCCGATCGC
GTGTGGCAGC AAGGCGGGCA GTTGCAGGTC GCGCTCAACC TCGGCTATCG CTTTGCCGCG
CGCGAGCCCT GGGGCCAGGC CGTGATCCAC GAGGGCGAGA CCGAGGAGGA GAACGGCGGC
GCCACGCTCG GCTTCAACGA CGGCCTGTAC GGGTTCAGCG GCGGGCTCAA CGGCAGCCGC
AGCTTCTCGC GCCCGGCCAG CGGCGGCGGC CAGTGCCAGG GCACGCTCAT GGACGTCAAC
GGCGACGGCC TCATCGACTG CGCCCAGCAG GCCGGCAATG GCCTGCGCGT GTGGTTCAAT
CGCGGCCACC GCATCGCCGG CAGCTCGGTG CTGTGGAACG GCGCGGGCGG CGCCGGCATC
GGCGACAACC AGTCCACCAC GCTCGGCGGC GGCGTCTACT TCACCATCGG CTTCGGCTTC
GGCTTCGTCA ACTTCATCAT CAACCCCGGC GGTGACGCCG CCGAGTCGGT CGGCCGCCCC
ATCGCCGCCA TCCGCGACAT CGACGGCGAC GGCTATCCCG ACCACCTGGT CTCGACCACG
CCGCACGAGA TCCGCGTGGC CCGCAACCGC ACCGGCCGCA CCAACCTCCT GCGCCAGGTC
GAGCGGCCGC TCGGCGCCAC CATCACGCTC GACTACACGC GCACGGGCAA CACCCAGGCC
ATGCCGCAGA GCCGCTGGGC GCTGAGCGAG GTCCAGGTGT ACGACGGCCA GGCCGACTCG
GCCGCGCTCG GCGAAGCCAA CGACTACGCG GTCCAGCGCG TCGCCTACGA GAGCGGCCGC
CACGACCGCT ACGAGCGCGA ATTCTACGGC TTCGCCAAGG TTACGGTGGA GACGCTGGAT
ACCCGCGACT GGGACGGCAC CACGGCCACC GACGCGCTGC CCGTGTATCG CCGCCAGGAG
ATGACGTTCT ACAACGGCGG CTATCACGAG CGCGGCTTGC TCGCCAGCAC GCGCATGAGC
GACGGCGATG GCGCGGTGTT CGCGCGCACG CTCAATCAGT ACGAGCTGCG CGACGCCATC
GACGGCCGCG TGCTCACGCC GACCCAGGCC GCGCGCGCGG CGTCGGTGTT CCCGGCGCTG
GTGCAGGAGC ACCGCTATCA CCACGAGGGC GCCGAGGACG CGTTCATCCA CACCTTCACC
ACCCAGGACT ACGACGGCTA CGGCAACGTG ATCGCGCTGT ACGACGCGGG CGGCCCGGGC
GCGAGCGACG ACTATCGCGC CGAGATTCGC TACACCGGAC GACTGTCCGC GTGTCGCGCG
CATCATATCG TCGGCGTGGC CGACCGCATC GAGGTGCGCG ACGCCGCCGG CGCGCTGCTG
CGCCAGCGCG AGTCCGATGT GGCGTGCGGC GACGCCACCG GCAATGTCCG CCAGCTCCGC
GTGTCGCTCG AGGGCGCCGC GGTGGCGCAG ACGGACCTCG CGTATGACGG GGACGGCAAC
CTGCGCGCGG TCACCGCGCC GCCCAACCAC CGCGGCCAGC GCTATCAGCT CGACTACACC
TACGACAGCG ACGTGGCCAC CTACGTGGTC CGCACCGACG ACAGCTTTGG CTACTACGCC
ACCTCCGAGT ACGACCTGCG CTTTGGCGCG CCGCTGCGCG AGACCGACAT CAACGGCAAC
TCGGTGACCT CGAGCTACGA CGCCTTCGGC CGCGCGCATC GCGTGAGCGG GCCGTATGAG
CTCGAGCGCG GGCTGGCGTA CGCCATCGAG TTCGGCTACG CGCCCGATGC CGCGGTGCCG
TTTGCCACCA CGGCGCATAT GGACGTGTTC CGCAACGCGG CCGACCCGAT CGAGACGGTG
ACTTTCGTGG ACGGGCTCGG CCGGGTCACG CAGACCAAAA AGGACGGCAC GCTGCATCGC
GGCGTGGACG CGCCGGCCGA GGACGTCATG ATCGTGTCGG GCCGCACGCT CCACGACCCC
TGGGGCCGCG CGATCGCGCA GTGGTTCCCG ACCGAGGAGC CCAAGAACGA CGCGCTCAAC
CAGGCGTTCA AGGCCGAGGC CGACGGCAGC GCGCCGCCGA CCGAGATGCG CTACGACGTG
CTCGACCGGA CGCTGGAGAC GGTGATTCCC GATGGCACGC TGACCTCGCA GAGCTACGCG
CTGGCGTCGG CGCTGTCGGG CAATGGGCTG TGGCTGCTGA CGACGACGAT CGACGCCGAG
GACAACCGCG GCGATGCGTA TCGCGATGCC CGCGGCAATA TCCGCGCGGT GGTCGAGTAT
CTGGATGGGC GCGGGATCAC GACCCGCTAT GAGTACGATC CGCTGCAGCA AATCCGCCGG
GTGTTCGACG CCGAGGGCAA TCTGACGCGA TCGGAGTATG ACCTGGCGGG GCGGCGGACG
GACGTGGTGC ATCCTGATAG CGGGCTCACG GAGATGGTGT ATGACGCGGC CGGCAATCTG
GTCCGCCGCA TCACGGCCAA CCTGCGCCGC GAAGGCGGCG CCATCGAGTA CGGCTACGAC
TTCACGCATC TGACCGAGAT CCGCTATCCG CGCTACGCGG ACAATGATGT CACGTACACG
TGGGGCACGG CCGGGCTGCG CGGCGCGGGC GGCAACCAGG TGGGCCGCAT CGTCCGCGTG
GACGATAACA GCGGCTTCCA GGAGCAGCGC TACGGCGCGC TCGGTGAGGT GGTGTTCGAG
CGCCGCAGCA TCGATAGCCA CACGATGGGC GAGAGCGACA ACAGCCCCGA GATCTATGTG
ACGCAATACC TCTACGACAC GTGGGGGCGG CTGCAGCAGA TGATCTATCC GGATACCGAG
GTGCTCACCT ACGCGTATGA CTCGGGCGGC CTGGTGCGCG CGGCCGAGGG CGTCAAGCTG
GGCACGCAGT TCCGGTATCT GTCGCGGCTG GAATACGACC GCTTCTCGCA GCGGGTGTTT
CAGGAGACGG GCAACGGTAT CCGCTCGCAT TATGCGTACG ATGCGGAGAA TCGCCGGCTG
CGGACGCTGG AAGCCGGCGA GTTCCAGCGG CTCGAGTACG GCTACGACAA CGTCGGCAAC
GTGCTCAGCC TGGTCAACGA TGTGCCGAAC GCGCGGCCGA ATGAGTACGG CGGGCGCACG
GAGCAGAGCT TCTCGTATGA CGATCTGTAT CGACTGACGG GCGCGAGCGG GACCTGGCAC
CAGCCGCCGA ATAAGCGCAA TCAGTACACG TACACGATGC AGTACGACGA TATCCACAAC
ATCCGCGCCA AGGAGCAGCG GCACTGGATC CGCAACCGCG GCGACGGCAA GGATATTACG
CAGCACAAGA CGACGTACGC CTGGGGCTAC GACTACGGCT CCGAGAAGCC GCACGCGGCC
ACGCACGTGG GCGACCGGAC GTTCTTCTAC GATGACAATG GGAACCAGGT CGGCTGGGAT
CACGATCAGA ACGGTCTGCG CCGGACGATC GTGTGGGATG AGGAAAACCG GGTGCGCTCG
ATTTCGGACA ATGGGCGGAC GACGGACTTC GTGTACGACC ACGGCGGCGA GCGCGTGGTC
AAGAGCGGCG CGCAGGGCGA AACGGTCTAC GTGAATGATA AGTGGACGGT GCGCAATCGG
TCCGTCGGCA CGAAACATGT CTACGTCGGC ACCACGCGCA TCGCGTCGAA GCTGTCACCG
GGCGATGCCC ACGTGCGGCC CGATGAACGC GACCTGGTCT CGGTCATGCT CGGCAAGTGG
TGGGAACACC GCTCCGAGAA CGGGCATGAG CATGGGCGCA ATGTCGAGAT GAACCCGCAT
TATCAGATTC CGAGCGACCT ACCGGATGAT GGAATGCCGG ATACGAACTT CTTGTACTTC
TATCACCCGG ATCACATCGG CAGCACGAGC TTCGTGACCG ACGTGGACGG CGCGCTGTAC
GAACACGTGC AGTACTTCCC GTCCGGCGAG ACGTGGGTGG ACCAGCGCAC GAACACCGAG
CGCACGCCGC ACCTGTTCAG CGGCAAGGAG CTGGACCAGG AGACCGGGCT GTATTACTTC
GGCGCGCGGT ATTACGACCC GCGGGTCGGG CTTTGGGCGA GTGCGGATCC GGCGCAGACG
GAGTATTTGG ACGGCGCGGG AGTGGGCGGC GTATTTATGC CGATCAACCT GGCCACGTAC
ACTTACGCAG CCAATAACCC GATTCGATTT GTCGATCCGG ATGGTCGCTA TTGGCTCGAC
TGGGTGCAGG CGGGCCTCGA TGCTACTTCG CTCGCGCTCG ACGCGACCGG CATTGGGGCG
GCCGTAAGCT GGGCCCCGGA TCTTGTTAAT GCTGGCATAT CCGCCGGCCG CGGCGACTGG
GTAGGGGCGG GCTTGTCGGT GTCTGCCGCA GTCCCTTTCA TCGGAGCGAC GGCCAATGCC
ACTCGCGTGA CCCGCACCGT ACTCAAAAAC TCAGACGACA TTGTCTCTAT CGGAAAGACG
GTTCCGAAGG CTAAGGTTCC CTACAAACGG CCTAGCGGTG CAACGACGCC TGCTCAGCGT
GCTTTCGTTC AGGGGAAACC CTGCGTCGAC TGCGGTCACA TAGCACCGAA ACAGTTTGCA
GATCACAAAA CACCTCTGGT CAAAGAGCAC TATGAAATCG GCAGCATTGA TAAGACTAGA
ATGAAAGAGA TAGATGCCGT GCAGCCACAC TGCCCGACAT GCTCAGCTAG TCAAGGCGGA
AAGCTGAGAC AATACTCACT AGAGCAACGT AAGATTCTGG AGGGCGAGTA A
 
Protein sequence
MLLTGLACTN EEEAPNAGDV AVHRLTPRSL ETSGDVSAAW ALFDRDTTSK NVFSITSLDG 
QDIIAHLDEG SELEAIKVFG ASPFQLTLLD PQGQLVAGPH ALDKLPSGWT TFLLPDARRV
EQLTLRFEPT GDGDAAVSEI EFWGRGASLP LDWEPSATDA PPAGLADIVP GTPDSQQLSR
APSSAQPACA SFDFELSRHP GSYRRAWLRY QTDGVFRPLV LTRAFNDAPV TRGFWVPPMA
DEAGAFVHRV DTEHLRLGHN QVEFCLPGEA ARAVAIRDIE LVAELDHGSN IIESVSVAPI
DGVPTYSAIG LLRDGHAPVA VTAGQELVIA FERWIAPEVV SIAADATADW SLRCVDADGA
ARDLPATLAE QIADRAIYTI DDATGARCAG LRMRPALASG EAAVTDLRVF GSGTDRRGDF
PRIVLASARE HFGNEAWVDG WAHAPAHVGG GVRVRVDDQD TDTTTGVFTS MLRRTSDPKE
SWPVTITARF GDGSTFTRQY VLDRDGGTMP GAEARDPVLD DGLTEAERRA RFGDEGDIAE
AEVAPGESKR IELGTDVTLD IPAGAMQGRK SVSITHLSSA AIPPMDPGLV NVTAPFRRGY
EFRPHGELFD DALTVTLPYQ PSLLPSGYVA EDIQTFYYNE TEKRWEPLAR AKVERGRQVV
ESLTDHFTTM INAVVVAPES PQIASFDPNR LKGIEAASPA ARVGLIEPPQ VNARGDATMQ
YPLDIPAGRR GVQPSLGLSY SSARGNGWLG VGWDLGTSAI EIETRWGVPR YHATLETETY
LIDGTQLSPT AHRDLPKPRA EGTTRIANQT VKVFRPRTEG GFARIVRHGD SPQSYWWEVT
STQGVRSFYG GTPESGKLAA ATLSDDSGNV FRWALREIRD THGNRVRFDY DAVTWSAPGA
VPGRELYLAS AHYTLRQGES TAPYRVVLVR DPCTASSCRP DVLVSGRGGF KQVTAERLGR
IEVYYRQTLV RAWQLEYDQG PFGKSRLRGL RQFGVGGEPF PGNVHTFHYY DEVTQSASTY
SGFAASAPWH AGNETPDSGL VLPGGIFDAV GEGEVSALGG SHTVTVGGHL YAGLSFGLPD
KKYSIGAKFG TRSDETDGRA ALVDIDGDGL PDRVFRGSGG YYFNRNESGP LGPPRFAAAA
QPVANLPALS RESSRMSLSV GAEGYFFPAQ FHANTSFSSA EQDTYLSDVN ADGLVDLVHG
GAVTFSYLDE TGKPRFHPDS TRTPAPIGVS RVSYDALPPS FEDPALEHNL DDYSPPVDTV
RRWQAPYSGQ IRITGDVALA APQRDGADGV RATIEYEGEQ QWSHDFGPAE TQPQRPSLML
DVAAGERVYF RIHGRDDERD DRVRWSPVIE YLDLAPGQDE NGLDHHRFDA AAEFTLAGRA
DANMNMPFSG RVRLTGRASK SRETSDDIEI RVLVDGAIAF AEVLPAAAMT TVNLDQALDV
EAGSVVALRI HSDSPVDLTA FGFDPVEDDS GAITRGPRLR YEDARDQNGD RVPVEGPGGA
PIFDVPVAYD MTTYALRMPE MPAPAYVPAV GRSLRVHGRV DAGNAPFTGT VTLTAKSRNR
LLAKAQVVIE DGGGHDLDFD LQVEQGEPVF FSFSVSDPDL FGALLTSLSA SGAGALPYLT
YAVAPAGRLS HPYRGWSYGG YSGAGARGDL PIPPGELDKE PVFDGSEPLS EDNLEELARR
FVEDGLQAFP FLPVSDAPAC EADAPACDAT ATAHWQGPHE LIYVTAHEMS TMRLGGPVKL
LPAPSEIANA RAVPRLSRSR GQAIGGGAVP VSFSQGQGES HGLLDFLDMN GDGFPDVVTP
ARVQYTSPLG VLSENRNVGA RDLRITSNEN TTIGIGGNIA NAIATARGLF GGGAHKAGGG
TASSQQMQPI GFTLSLGASE GEASSEADLM DVNGDGLPDR VWQQGGQLQV ALNLGYRFAA
REPWGQAVIH EGETEEENGG ATLGFNDGLY GFSGGLNGSR SFSRPASGGG QCQGTLMDVN
GDGLIDCAQQ AGNGLRVWFN RGHRIAGSSV LWNGAGGAGI GDNQSTTLGG GVYFTIGFGF
GFVNFIINPG GDAAESVGRP IAAIRDIDGD GYPDHLVSTT PHEIRVARNR TGRTNLLRQV
ERPLGATITL DYTRTGNTQA MPQSRWALSE VQVYDGQADS AALGEANDYA VQRVAYESGR
HDRYEREFYG FAKVTVETLD TRDWDGTTAT DALPVYRRQE MTFYNGGYHE RGLLASTRMS
DGDGAVFART LNQYELRDAI DGRVLTPTQA ARAASVFPAL VQEHRYHHEG AEDAFIHTFT
TQDYDGYGNV IALYDAGGPG ASDDYRAEIR YTGRLSACRA HHIVGVADRI EVRDAAGALL
RQRESDVACG DATGNVRQLR VSLEGAAVAQ TDLAYDGDGN LRAVTAPPNH RGQRYQLDYT
YDSDVATYVV RTDDSFGYYA TSEYDLRFGA PLRETDINGN SVTSSYDAFG RAHRVSGPYE
LERGLAYAIE FGYAPDAAVP FATTAHMDVF RNAADPIETV TFVDGLGRVT QTKKDGTLHR
GVDAPAEDVM IVSGRTLHDP WGRAIAQWFP TEEPKNDALN QAFKAEADGS APPTEMRYDV
LDRTLETVIP DGTLTSQSYA LASALSGNGL WLLTTTIDAE DNRGDAYRDA RGNIRAVVEY
LDGRGITTRY EYDPLQQIRR VFDAEGNLTR SEYDLAGRRT DVVHPDSGLT EMVYDAAGNL
VRRITANLRR EGGAIEYGYD FTHLTEIRYP RYADNDVTYT WGTAGLRGAG GNQVGRIVRV
DDNSGFQEQR YGALGEVVFE RRSIDSHTMG ESDNSPEIYV TQYLYDTWGR LQQMIYPDTE
VLTYAYDSGG LVRAAEGVKL GTQFRYLSRL EYDRFSQRVF QETGNGIRSH YAYDAENRRL
RTLEAGEFQR LEYGYDNVGN VLSLVNDVPN ARPNEYGGRT EQSFSYDDLY RLTGASGTWH
QPPNKRNQYT YTMQYDDIHN IRAKEQRHWI RNRGDGKDIT QHKTTYAWGY DYGSEKPHAA
THVGDRTFFY DDNGNQVGWD HDQNGLRRTI VWDEENRVRS ISDNGRTTDF VYDHGGERVV
KSGAQGETVY VNDKWTVRNR SVGTKHVYVG TTRIASKLSP GDAHVRPDER DLVSVMLGKW
WEHRSENGHE HGRNVEMNPH YQIPSDLPDD GMPDTNFLYF YHPDHIGSTS FVTDVDGALY
EHVQYFPSGE TWVDQRTNTE RTPHLFSGKE LDQETGLYYF GARYYDPRVG LWASADPAQT
EYLDGAGVGG VFMPINLATY TYAANNPIRF VDPDGRYWLD WVQAGLDATS LALDATGIGA
AVSWAPDLVN AGISAGRGDW VGAGLSVSAA VPFIGATANA TRVTRTVLKN SDDIVSIGKT
VPKAKVPYKR PSGATTPAQR AFVQGKPCVD CGHIAPKQFA DHKTPLVKEH YEIGSIDKTR
MKEIDAVQPH CPTCSASQGG KLRQYSLEQR KILEGE