Gene Mnod_3527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_3527 
Symbol 
ID7301376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp3640616 
End bp3651181 
Gene Length10566 bp 
Protein Length3521 aa 
Translation table11 
GC content68% 
IMG OID643601196 
ProductFibronectin type III domain protein 
Protein accessionYP_002498740 
Protein GI220923438 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCGCA TCGATTTCGA TCCGGCCGCC GCGCCGGAGG ACAGCATGCG CCAGGACGCT 
CCCATCCGGG GGAGGAAGTC CTCTTCCGGC CGCGGCAAGA CCGGCGGCTC GGGCTCGAGC
GCACCCGACA CCCTGTTCTC GAACGCGACG GTCCGTCTCG TGGACCTGCT CGGCGAGGGC
GAGATCACGG GCGTCGTCGG CGGCCTGAAG GGCATCTACT TCAACGACGT CCCGGTCCAG
AACGCGGACG GCACCTTCAA CTTCAAGGGC CTGAGCGCCG ACTTCCGCAC CGGCACGCCG
GATCAGTCCT ACATGCCCGG CTATCCCGAG GTGGAGACGC CGCGGGAGGT CGGGGTCAAG
GTCAGCAAGG CGACGCCGGT CACCGCCGCG ATCAGCGACG GCGAGGCGGA CCGGGCCCGC
GTCATCATCG AGCTGCCGGC CCTGTTCCTG GCCAAGAACG ACGGCTCGGT CCGCCAGAAC
AGCGTCAGCT TCCGGATCGA GGCCCGCTAC AGCGGCGGGC CATGGGTGAA CCAGCTTGGC
GACCTCACCA TCACCGGCAA GAACACCTCG CCTTACTTCG TGTCCTACGA GGTCGCCCTG
CCGCGCAATC CGGCCGGGTC GAGCCCGCCC TGGCAGGTGC GGGTCACGCG CCTGACCGAC
GACACCGACG GCTTCAACAC CAGCCAGGAC AAGTGGACGA GCCAGAGCGA TCTCGTCTTC
TACTCGCTCA CCGCGATCCA GGACGCCAAG TTCAGCTATC CGCATTCGGC CCTGGTCGGC
CTGACCGCGG ACGCGAGCCA GTTCGGCTCC TCGGTGCCGG CCCGGACCTA TCTGGTGGAC
GGCCTGCTCA TCAAGGTGCC GTCGAACTAC GATCCGGTCG CGCGCACCTA CTCCGGCATC
TGGGATGGCA CCTTCAAGGA GGAGTGGTCG GACAATCCGG CCTGGGTCTT CTACGACGTG
CTGTGGAACG ACCGCTATGG GCTCGGCGAG TTCATCAGCG TCGAGAGCAT CGACAAGTGG
ACCCTGTACG AGATCGGCCG CTACTGCGAC GTGCTGGTCT CGGACGGCAA GGGCGGCCAG
GAGCCGCGCT TCCGCTTCAA CGCCCAGATC AGCACTCAGC AGGACGCCTT CGACCTGCTC
CAGCAGATCA GCGCGATCTG GCGCGGCATG GCCTACTGGT CCTCGGGCGC GGTCACGGCC
ACCCAGGACC GGCCCGACGA CGTCCGCCAG CTGGTCACGC CAGCCAACGT CATCGAGGGG
CTGATCACCT ACAGCTCCTC GGGGCGCAAG GCCCGCCACA CGGTGGCGCT GGTCAGCTGG
ACCGACCCGG ACAATCTGTT CAAGCCGCAG ATCGAGGTCG TTGAGCACGG CGAGGGCATT
GCCCGCTACG GCTACAACCC GACCAAGATC GACCTGCTCG GCTGCACCTC GCGTGGGCAG
GCGCATCGCG AGGGCCTGTG GCGCCTCCTG GTCGAGAACT ATGCGACGCA GACCGCCACC
TACCGGGCGG GCCTCGACCA TGCCGTGCGG CGCCCCGGCG ACATCATCGC CATCGCCGAC
CCGCAGATCA GCAACATCGA CGCCGGCGGG CGCCTCAAGG CCGGCTCGAC GGCCTCGACC
CTCCTCCTCG ATCGGCCGGT CACCCTGAAG AGCGGGGTGC CCTACGAGAT CTCCGTCACC
CTGCCGGACG GCAGCGTGGC AGAGCGGCAG ATCACGACGC TGGCCGGCGT CGACCTCACC
GAGGTCAGCA TCTCGCCCGC CCTGCCTTCC GTGCCGGACG CCGCGGCGGT GTGGCAGATC
GCCGGCGAGG TGGTGCCGCA GCTGTTCCGG ATCGTCGGCA TCAAGGAGGT CGAGCCGCAC
ATCTGCGGGA TCCAGGCGCT CCAGCACGAG CCCTCGATCT ATGCGGCGGT GGACGATGGC
GCGGCCTTCG AGCCGCTGAA CATCAGCGAG TTCCCGAACG TCGTTCTCGC CCCGACGAAC
CTCACGGTGA GGGAAAGCAC CTACTTTGAG AACAACCTGC CGCGGCAGAG CCTGCTCCTC
AGCTGGACGG CTGGCCAGCC CTTCAACTCG GTCGCCTACT ACGTCACGGC CATTAAGCCG
AACGGCTCGC TGGTGACCCT ACCGAAGCGC AGCACCACCT CGGCGGACTT TGACGACGCG
GCCACGGGCG AGTGGACCTT CATCGTCCAG GCCGAGGGGT TGAACGGGCG CCTGTCGGAC
GCCGCCCAGA TCACCTACAC GGTCCAGGGC TGGGAAGGGC TTGCCGGGCC CACCGTCACC
GGCCTGCAGG TCAAGGGCGG CGGCAGCGTC TTCACGGGGC GGAGCTGCAC CCTGGAGTGG
GGCCTGACCT GGCCGCCGGA CGTGAGGCCT TACGAGGTCG GTTACGCCTT CCGGGTGTTC
GACGCGGACA CGAATGCGCT CCTCCACACC GAGATCATCA CCGCCGCGCA GGCGACCTAC
GACTACGAGG AGAACCTCAA CGAGGGCGGC CCGCGGCGGC GCTTCCGGGT CTCGGTGGCG
GCGCGCGACG CGATCGGCCG CGAGAGCCAG CCGGCGGTGC TCGTCGTGTC CAATCCGGCG
CCCGCCATGG TCGTCCCGAC GCTGGACTGG ACGACGGAGA GCATCGGCGT CTGGATGACC
CCGCCGGCGG ACTCCGATCT CGCCGGCGTG CTGGTCTGGC TGTCCAAGAC GAGCAACTTC
GACCCGCTCG CCACGTCCCC TGCGGCCGAT GTCGCCCCGA CCGGCTTCCT GCTGCTGCCG
GCCGAGGCCG AGACGACCTA TTACGTGCGG GTCGCCTTCT ACGACAGCTT CGGCAAGAAC
CCGGCCGAGC TGAACATCTC GCCGCAGGTC GAGATCCGCA CCACCAACAA GATCATCGAC
GTCCAGGCGC CCGACATCCC GACCGGGCTC AACCTGACGA CGGATCTGGA AGTCTCGGCG
ACTGGCGTGG CGACGGCCGC GATCACCGCG ACCTGGAACC CGGTCGGCAG CAGCAACCTC
GGGATCTACG AATTCGAGCT GACGGAGGGC GACGGCGTCA CGAACTCGTC CTGGATCCGT
GACCGCGCCG ACAAGGGCCA GCCGACCTTC ACCTGGCGCA ACCTCAAGCC GGGCGTTCTC
TACACGGCCC GCGTCCGCAC CGTGAACGAC AGCGGGGTGG CGGTCTCGGG CTGGTCCCCG
ATCGCTACCA TCACCGCGGC CAAGAACACC GCGAAGCCCG GCGCGATCAC GAGCTTCACC
GTGGACGCCG CCTACCGGAC GGCAAGCCTG TCGTGGGTCA ACCCGAGCGA CCCGGACCTT
GCCGCGATCG AGGTGTGGGT CGGGACGCGC GACGACGGGG CGGACGCCAG CCTGTTCGCG
ACAGTCCCAG CGCCGCTCAA CTTCTTCAGC GACACCACCC TGGAGATCGT GCAGACCCGC
AAATACTGGG TCCGGCCGGT CAACAGCTCG GGCACGGCCG GCGACTTCGT CGGCCCGAAG
ACGGCGACCA CGGCGGCCCT CCCGGCGGCG GCACTGCAGA ACGGTGTCAT CGACCAGACC
AAGCTCGCCG CCTCGGTCGT GGCGCCGACC GCGGTCACCA GCCTGCCGGA TCCCGCAACC
TGGACGGGCC CCGCGCTCGC TTACAACGCG ACGGACGGCA AGCTCTACCG GCTCGTGGGT
GGGCAGTGGA CGACCGCGGT GCAGGCGGTC GACATCACGG GCGCCCTCAC CAGCGCCCAG
ATCCAGTCCA TTGAGGCAGC GAAGGTCGCC GGCCAGCTCA CCGCCTCGCA GATCGCCAGC
ATCAATGCCG CCCAGGTTGT CGGCCAGATC GTCGCCTCGC AGATCGCCAG CATCACGGCG
GCCCAGATCA GCGGGCAACT CACCTCCGAC CAGATCGTCA GCCTTGCGGC CACGAAGGTC
GCCGGCCAGC TCACGGCGGA TCAGATCGCG AGCATCAATG CGGTCGCGAT CCAAGGCCAG
CTCACGGCCG CGCAGCTCGC CGACAACGCC GTCACCCAGG CCAAGCTCGC CTTGGGCCTC
TCGGCGGTCG GGATCGTGTC CAGCCTGCCG GACCCGGCCG GCTACACGGG CCCGTCCGTG
GTGCTCAACG GCGCGGACGG CAAGGTCTAC CGGCTGGTCT CCGGGGCCTG GACCGCCGGC
GTTGCTGCGG GCGACATCAC CGGGCAACTG GGCGAGGCCC AGATCGCGGC CGGCGCCATC
ACGGATGCCA AGATCGCGGC TCTGGCCGCC TCGAAGATCA CCGGCACCCT CACGGACGCC
CAGATCGCCC AGGTTTCGGC CGCGAAGCTC GTCGGGCAGG TGGTTGCGAG CCAGATCGCG
AGCATCGGTG CCGGCCAAAT CTCTGGCCAG ATCACGGCCG CCCAGATCGC GAGCCTGAAC
GCGGCCCAGA TCGCCGGCAC ACTGTCGGAT AGCCAGATCG CCGGCATCAG CACCGCGAAG
CTCATCGGCC AGATCGCCGC GAGTCAGATC GCCGACAACG CCGTTGGCCA GTCGAAGCTC
GCCTCCGGTC TGTCCGCCCT CGGCATCGTC AACGGGCTCC CGAGCCTCGC GGGCTACACC
GGCCCCTCCG TGGTGCTCAA CTCGGACGGC AAGATCTACC GCGCGGTGGG TGGGGCTTGG
ACCGCGGGGG TCGGGGCCTC CGACGTGACG GGCCAGCTCA CGGACGCGCA GCTCGCGGCG
ATCAGCGCGA CCAAGGTCAG CGGCACATTG TCGGATGACC AGCTCGCGGG CATCAGCGCG
GCGAAGCTGA TCGGACAGGT CGTGGCCGCG CAGGTGGCCT CGGTCAATGC CGCCGTGCTC
CAGGGGCAGG TCACGGACGC GCAGATCGCC GGCATTAGCA CGGCGAAGCT CGCAGGCCAG
ATCACTCAGA CCCAGATTAC GGACGGCGCC ATCAGCACGC CGAAGCTGGC GACCGGCGCC
GTCAACGCGG ACAAGATCGA AGCTTGCTCG ATCCTGGCCT CGAAGCTGGC GGTGGCGAGC
CTGAACCTCG CCGCGAACGG CGGCCTCCAG CAGGGCACCT CCGGCTGGTG GGGCGCGGCC
GGCGGCACGG GCATCACTCC GCTCGTGGAG GGCATCCGCA CCGATTGGGC TCCGCAGGGC
ATGCGGGTCT TCTCGGTCCG CTACGACGAT GCCAACAAGC CGACGAGCGG CATCTCCGAG
ATCATCTACG CCAACCCGGA TGCGACGGGG GCCCAGCAGC GGATCCCGGT CACGCCGAAT
GCGCGCTACG AGTTCTCGGC CTACGTCTCG GCGCACCGCT GCACCGCCTA TGTGTCGATC
ATCTGGTGGG ACGCCACCGG CACCTACATC ACCGAGCACG GCGGCAACAA CATCGTCGCC
ACGCAGCTCG GCTCTGGCTC ATCACTGGCG GATTGGGATC GGCTCGCCCG CTCCTGGGTG
ATCGCCACCG CGCCGGCCAA CGCCGCCAGT GCCGATATCC GGGTCCGCTG GTACAACTTC
GGCAGCAACC CCTACTGCAT GGCGGGCGGG TTGTTCTTCG CCCAGGCCGT CGAGGGCCAG
ACCGAGCCGA GCCCGTACTC CGACGCGGGC GTGACCACGA TCGAGGGCGG CAACGTCCGC
ACCAGCTCGA TCTACGGGGA CCGGCTCGTC GCCCGCACCA TCACGGCGGG GCAGATCGCG
GTCGGGGCCA TCACGGCGAC CGAGATTGCC GGTTCGACCA TCACGGGCGA CAAGATCGCG
GGCGCGACCA TCACGGGCGG GCTGATCGCC GGCCGGACGA TCAGCGCCGG GCATATCGTG
GCCGCGACGC TCACCACCAA TGAGATCGCC GCCCGCACCA TCACGGCGAC CAACCTCGCC
GCAGGGACCA TCACCGCCTA CGAGATCGCC GGCTCGGCCA TCACGGGCGA CCGTATCGCG
GGCGGGACGA TCACCGGAGG CAACATCTCC GGCGCCACGA TCACGGGCTT CAACATCTCT
GGCCGCACCA TCGGGGCCGG GCACATCGTC ACCGGCAGCC TCACCGCGAA CGAGATCGCC
GTCAACTCGC TGACGGGTGA CCGGATTGCC GGGTCCACCA TCACGGGTGA CCGCATCGCC
GGCAACACCA TCACGGGGGC GAACATCAGC GCCGGGACGC TGACTGCGCG CGAGCTGGCG
GCCGGATCGG TCACCGCCTC GAAGCTGGTC CTCACCGATC CCAGCAACAT GCTGAAGAAC
GGGGACTTCA CGAACCCGAT CGACGGCTCG GCGAACTCCG AGGGCTGGTC TCTCGGCGGC
GGCGACACGC TCATCGATAC GTCGGTCGCG ACCGACCCCG GCGGTATGCA CCGCCTGCGC
TCGAATGCAC GCGACTGCGC CTATTCCGAC CCGATCGCGG TGCGGCCGGG CGACGTCATC
AACCTGTCGG CGCGCGTCTT CAACGCAAAC GGCGAGCGGG CCAGCCTCAT GGCCGTTCTC
ACCGATGCCG CCGGGGGGAA CGGCCAGTGG CCGACCGCGG CCTACACCGA CCTCAAGAAC
CAATGGGTCG ATCTCTCGGG TCAGATCACC ATCCCCGAAG GCATCCAGCG GATGCAGGTG
CTCCTGCTGT GTGACAAGCC CTACGCCTAT GGCAGCTACG TCTACTGGGG CAAGGTGCAG
GCGCGGAAAG CTGCCAACGC TCAGATGATC GTGGATGGCG CGATCACGGC TGCCAAGATC
GCCGCGAAGT CCATCACCGC AGGTCAGATC GCCACCGGCA CCATCACGGC GACCGAGATT
GCGGGCAGCA CGATCACCGG CGACAAGATC GCCGGCAGCA CCATCACCGG GACCAACATC
GCGGGCGGCA CCATCACGGG CAGCCTCATT GCTGGCGGCA CCATTCAGGC GGGCAACCTC
GCAGCCGGCT CCGTCACGAC CAGTAAGCTG GCGGTTGCGT CGCTGAACCT CGCGCCGAAT
GCCGACTGCA CCAACGTCGA CAGCAGCGGC TTCGTGGCCG GCTGGTCGGT CGGCGGTGGC
AACTCGGGCG CGGTGATCGT CAACGACGGG ACGGACACCA ACCACGCTCC GGCCGGGATG
CGGGCCTTCC GCTGCCGTAC CAACGGGGAG AACATCGCCC AGGGCACGAT CGCAGAAGTC
CACATCGACC GCCAGAAGAC GGACGGCAAT GCCGAGAGGA TCTCCGTCAG GGCCGGCGAG
TATTACGAGA TCAGTGCCTA TCTGGCGACC ATCCGCTGCC AAGCCATTGT CGGCTTGATC
TGGCTCGACA ACAACCAGAG TTACATCGGT GAGACCTGGG GCAATTGGAC GACCCTGGAC
TGGCAGGGTT CGTTCTACAA CTCCGTGGAA GAGTGGGACC AGTACGCCCG CTCGAAGCTG
ATCGTCGTCG CTCCGGCCGG TGCGGTCTAC GCGCAGCCGC GGGTGCGGTT CGGCAACACC
TGGATCAACG GTCAGGGCTG GCACTACGTC ATGGTGTCCG GCCTGATGTT CGCCAAGGCC
ATCGAGGGCC AGACGGAGGT GTCGCCCTAT TCGCCGCCGG GCCTGACCAC CATTGACGGC
GGCACCATCC GCACGGGCTC GCTGCACGCC AACCGCATCA TCGCGGGCTC GATCACCGCG
GATCGGATCG CGGGTGGGAC CATTACGGCC GGTCAGATCG CGGGCGGCAC CATCACCGGA
GACAAGATCG CCGGGCGCAC GATCTCGGCC GGCAATCTGG TCTCGGGCAC GATCACCGCC
AACGAGATCG CCGGGCGGAC CATCACGGCG GACCGGATCG CAGGCGGAAC CATTACGGCC
TACGAGATCG CGTCGCGGAC CATCACGGCC TCGCAGATCG CGACTGGGAC GCTCACCGCC
AATGAGCTGG CGGCCGGATC GGTCACGACC AGCAAGCTTG CCGTCTCCAG CGCCAACTTC
GCCTTCAACG CCGACATGGC CCAGGGCCTC GCCGGCTGGA CGGCTGGCGG CCAGACCTGG
GGCGGGACGC CCGACATCTT CGTGGAGACG GGCTGGGTGC CGTCGGGCTT CAAGGGCTTC
GCCGGCTACG GCACGGGCTG GACTGGGGCC GGCGGACAGT GGTTCGACAC CTGGCACCAG
CGCATCGACA CGAGCAACAA CCTCCAAGCA TTTCCCTGCT ATCCGAACAC GCCCTACGAG
TTCTCGGCCT ACGTCACCTG CCATCGCTGC GATGCGCAGA TGCATGTCCT GTGGCTGGAC
AGCGCCGGCA ACGCCATCCG CTACGACGGA TCGAACGTCA TCGGTCAGTT CGCGAAGGTC
GGCGGAGCCC TGACCGACTA CCCGCGGATG GCGATCCTCG CGACCTCGCC GTCCAACGCC
TACGGGTTCA AGGTGCTCTA CCGCGGTCAA AACATCACCG CGGCGGGCCC CCTCATCTTC
GTAGTCGGGG CCATGTACGC TGCGGCGCGG GCGGGCCAGG CCGAGTGCTC GCCCTACGTG
GACCCCGGGG TCACCACGAT CACCGGCACC AACATCACCA CCGGCTCGAT CTACGGGGAT
CGGCTGGTGG CTCGCACCAT CACGGCGGGC CAGATCGCGG TCGGCGCCAT CACCGCGACC
GAGATTGCCG CCAATACCAT CACGGCGGAC CAGATTGCGG GCTCCACCAT CATCGGCTGG
AATATCCAGG CGCGCACGCT CGGCGCCGGC CATATCGCGG CCGGGGCCAT CACCGCTTAT
GAGATCGCGG CCAACACCAT CACCGCGAAC CAGATCGCCG GCCAGACCAT CATCGGCTGG
AACATCGCCG GCAACACGAT CTCAGCGGAC AAGCTCGTCG CGAACTCGAT CACCGCCGGG
CAGATCTCGG CCGGCGCCAT CGGCGTGGAC CAGCTCGCGG CGGGCGCGAT CACGGCCGAC
AAGATCGGCG TCGGGCTCAA CTCGACCAAC CTGCTCTACA ACTCTGACTT CCGGGCGGGC
ACGCCCGGCG TGAACAACTG GGGCGGCGGC ACCGTCCCCG GCATCTACGG GACGTGGTCG
AACCTGGGGG ATCTGGGCAA CCGCGCCCCC TACATCGGGC TCAACCAATC GGGCCCCGGC
TGGCAGCCGA ACGGGATGGG CTCGTTGCAG GTGAGCTGCG CCGGCACCCC GCCGGCCGGC
TATGTCTGGG ACACCTATGC GTCGACCCCG AAGCAGGACG GCACCTGGAG CCAGCGGTTC
CCGGTGGTCG CCGGCAAGCG ATACGAGGTG TCCGGCTACG TGTCGGCTCA CCGCTGCAAG
GCTCACCTCA TCATCGTCTG GATGGATGCC AACGACGCCT ATTGCGGGGA GGCCTGGACC
AACACCGTCG AGAACGCGAT GAGCAACGGC AACCTGAGCG CCTGGGCGCG CATGGGTTGC
TTCGCCACCG CCCCGTCCAA CGCCGCCACG GCCGCAATCT ACCTGCGGAC GACGTTCAAT
GGTGGTGACG GCCCTTATAC CTTCTGGAGC GGCCTCTACT TCGGGCAGGC CAAGCCCAAT
CAGGCCGAAT ATTCGGACTG GGCGCCGGGC TCCTCGACCG TGATCTGGGG CGACACCATC
GCCACCGGCA CCATGAACGC CAACAAGATC ACCGCCGGCA CCATCACGGC CGACAGGATC
GCTGCGAACG CCATCACCGG GTACCAGATC GCTGGCACGA CCATCTCCGG CTGGCACATC
CAGTCGAACT CGATCTACGC CGACAAGATC CAGGTCGGCG GCGGGCAATC GCTCACGAGC
TGGATGGGCT CGGATACCAC CAAGATCAAT GGCGGCGCGA TCGAAGCTAA CTCGATCCGG
GTCAACGCCC TGACGGTGGC CCTGCGCGGC GTCCGCACGG TCGGCATCGA CTTCTCGGTT
GACAAAAACA CTCGCACCGT CTCGTGGACA GGAGGTCACG TCCTGTGGAT CGACGACGCC
GGCAACAATG TCGCGAGCTG GTGCCCAGGC GGCAGCGGTA ACGCAGGCGC CACCCTCTGC
TACATCTGGT GGGACAAGCG GCGGCCGAAC CAGCTCAACT TCGCCGCCAA CAACTGGCCT
GACATCTTCG CCGATAAGAA CACCGTCCTT ATGTGTTCTT ACGACGGCTA TGCTGGGCTG
AACCCGACAT ACGGCGGGAC GATCATCGAC GGCAGCCGGA TCAACACCGG CACCATCACC
GCCAACCAGA TCGCCGCCAA CGCGATCCAG GCCAGCCACA TCGCGGCCGG ACAGATCACC
GCAGAGAAGA TCGGCGCCGG GCAGGTCACC GCGGACAAGA TCGGGGCCGG CATGATAACG
GCCGGCGTCA TCCAGATCGG CGGCAACAAT TTCGTGCTGG AGGCACCGAA CAACACGGGC
CTTGGTCGCA TGTACTGCCG CGACAGCAAC GGCACGTTGC GTGTGGCGGT TGGCTACATC
AACGATCTCT CCGGTGGCGG CTGGGGTCTC GCGATGTGGG ATGACGCCAG CAACCTCATC
CTCAACGGGA CCGGCGTCAA CGGTGGCGGC CTCTGGGTCA ACTCGGTCAG CGCCAACAAG
CTCTCGGTGA CGCAACTCTC GGCCATCACC GCCAATGTCG GCACCATGAC CGCCGGCCTG
CTTCAGAGCG GCGACGGTCA GATGCAGGTC GACCTTAACA ACAAGCGGAT CATCATCTGG
GGCTGA
 
Protein sequence
MDRIDFDPAA APEDSMRQDA PIRGRKSSSG RGKTGGSGSS APDTLFSNAT VRLVDLLGEG 
EITGVVGGLK GIYFNDVPVQ NADGTFNFKG LSADFRTGTP DQSYMPGYPE VETPREVGVK
VSKATPVTAA ISDGEADRAR VIIELPALFL AKNDGSVRQN SVSFRIEARY SGGPWVNQLG
DLTITGKNTS PYFVSYEVAL PRNPAGSSPP WQVRVTRLTD DTDGFNTSQD KWTSQSDLVF
YSLTAIQDAK FSYPHSALVG LTADASQFGS SVPARTYLVD GLLIKVPSNY DPVARTYSGI
WDGTFKEEWS DNPAWVFYDV LWNDRYGLGE FISVESIDKW TLYEIGRYCD VLVSDGKGGQ
EPRFRFNAQI STQQDAFDLL QQISAIWRGM AYWSSGAVTA TQDRPDDVRQ LVTPANVIEG
LITYSSSGRK ARHTVALVSW TDPDNLFKPQ IEVVEHGEGI ARYGYNPTKI DLLGCTSRGQ
AHREGLWRLL VENYATQTAT YRAGLDHAVR RPGDIIAIAD PQISNIDAGG RLKAGSTAST
LLLDRPVTLK SGVPYEISVT LPDGSVAERQ ITTLAGVDLT EVSISPALPS VPDAAAVWQI
AGEVVPQLFR IVGIKEVEPH ICGIQALQHE PSIYAAVDDG AAFEPLNISE FPNVVLAPTN
LTVRESTYFE NNLPRQSLLL SWTAGQPFNS VAYYVTAIKP NGSLVTLPKR STTSADFDDA
ATGEWTFIVQ AEGLNGRLSD AAQITYTVQG WEGLAGPTVT GLQVKGGGSV FTGRSCTLEW
GLTWPPDVRP YEVGYAFRVF DADTNALLHT EIITAAQATY DYEENLNEGG PRRRFRVSVA
ARDAIGRESQ PAVLVVSNPA PAMVVPTLDW TTESIGVWMT PPADSDLAGV LVWLSKTSNF
DPLATSPAAD VAPTGFLLLP AEAETTYYVR VAFYDSFGKN PAELNISPQV EIRTTNKIID
VQAPDIPTGL NLTTDLEVSA TGVATAAITA TWNPVGSSNL GIYEFELTEG DGVTNSSWIR
DRADKGQPTF TWRNLKPGVL YTARVRTVND SGVAVSGWSP IATITAAKNT AKPGAITSFT
VDAAYRTASL SWVNPSDPDL AAIEVWVGTR DDGADASLFA TVPAPLNFFS DTTLEIVQTR
KYWVRPVNSS GTAGDFVGPK TATTAALPAA ALQNGVIDQT KLAASVVAPT AVTSLPDPAT
WTGPALAYNA TDGKLYRLVG GQWTTAVQAV DITGALTSAQ IQSIEAAKVA GQLTASQIAS
INAAQVVGQI VASQIASITA AQISGQLTSD QIVSLAATKV AGQLTADQIA SINAVAIQGQ
LTAAQLADNA VTQAKLALGL SAVGIVSSLP DPAGYTGPSV VLNGADGKVY RLVSGAWTAG
VAAGDITGQL GEAQIAAGAI TDAKIAALAA SKITGTLTDA QIAQVSAAKL VGQVVASQIA
SIGAGQISGQ ITAAQIASLN AAQIAGTLSD SQIAGISTAK LIGQIAASQI ADNAVGQSKL
ASGLSALGIV NGLPSLAGYT GPSVVLNSDG KIYRAVGGAW TAGVGASDVT GQLTDAQLAA
ISATKVSGTL SDDQLAGISA AKLIGQVVAA QVASVNAAVL QGQVTDAQIA GISTAKLAGQ
ITQTQITDGA ISTPKLATGA VNADKIEACS ILASKLAVAS LNLAANGGLQ QGTSGWWGAA
GGTGITPLVE GIRTDWAPQG MRVFSVRYDD ANKPTSGISE IIYANPDATG AQQRIPVTPN
ARYEFSAYVS AHRCTAYVSI IWWDATGTYI TEHGGNNIVA TQLGSGSSLA DWDRLARSWV
IATAPANAAS ADIRVRWYNF GSNPYCMAGG LFFAQAVEGQ TEPSPYSDAG VTTIEGGNVR
TSSIYGDRLV ARTITAGQIA VGAITATEIA GSTITGDKIA GATITGGLIA GRTISAGHIV
AATLTTNEIA ARTITATNLA AGTITAYEIA GSAITGDRIA GGTITGGNIS GATITGFNIS
GRTIGAGHIV TGSLTANEIA VNSLTGDRIA GSTITGDRIA GNTITGANIS AGTLTARELA
AGSVTASKLV LTDPSNMLKN GDFTNPIDGS ANSEGWSLGG GDTLIDTSVA TDPGGMHRLR
SNARDCAYSD PIAVRPGDVI NLSARVFNAN GERASLMAVL TDAAGGNGQW PTAAYTDLKN
QWVDLSGQIT IPEGIQRMQV LLLCDKPYAY GSYVYWGKVQ ARKAANAQMI VDGAITAAKI
AAKSITAGQI ATGTITATEI AGSTITGDKI AGSTITGTNI AGGTITGSLI AGGTIQAGNL
AAGSVTTSKL AVASLNLAPN ADCTNVDSSG FVAGWSVGGG NSGAVIVNDG TDTNHAPAGM
RAFRCRTNGE NIAQGTIAEV HIDRQKTDGN AERISVRAGE YYEISAYLAT IRCQAIVGLI
WLDNNQSYIG ETWGNWTTLD WQGSFYNSVE EWDQYARSKL IVVAPAGAVY AQPRVRFGNT
WINGQGWHYV MVSGLMFAKA IEGQTEVSPY SPPGLTTIDG GTIRTGSLHA NRIIAGSITA
DRIAGGTITA GQIAGGTITG DKIAGRTISA GNLVSGTITA NEIAGRTITA DRIAGGTITA
YEIASRTITA SQIATGTLTA NELAAGSVTT SKLAVSSANF AFNADMAQGL AGWTAGGQTW
GGTPDIFVET GWVPSGFKGF AGYGTGWTGA GGQWFDTWHQ RIDTSNNLQA FPCYPNTPYE
FSAYVTCHRC DAQMHVLWLD SAGNAIRYDG SNVIGQFAKV GGALTDYPRM AILATSPSNA
YGFKVLYRGQ NITAAGPLIF VVGAMYAAAR AGQAECSPYV DPGVTTITGT NITTGSIYGD
RLVARTITAG QIAVGAITAT EIAANTITAD QIAGSTIIGW NIQARTLGAG HIAAGAITAY
EIAANTITAN QIAGQTIIGW NIAGNTISAD KLVANSITAG QISAGAIGVD QLAAGAITAD
KIGVGLNSTN LLYNSDFRAG TPGVNNWGGG TVPGIYGTWS NLGDLGNRAP YIGLNQSGPG
WQPNGMGSLQ VSCAGTPPAG YVWDTYASTP KQDGTWSQRF PVVAGKRYEV SGYVSAHRCK
AHLIIVWMDA NDAYCGEAWT NTVENAMSNG NLSAWARMGC FATAPSNAAT AAIYLRTTFN
GGDGPYTFWS GLYFGQAKPN QAEYSDWAPG SSTVIWGDTI ATGTMNANKI TAGTITADRI
AANAITGYQI AGTTISGWHI QSNSIYADKI QVGGGQSLTS WMGSDTTKIN GGAIEANSIR
VNALTVALRG VRTVGIDFSV DKNTRTVSWT GGHVLWIDDA GNNVASWCPG GSGNAGATLC
YIWWDKRRPN QLNFAANNWP DIFADKNTVL MCSYDGYAGL NPTYGGTIID GSRINTGTIT
ANQIAANAIQ ASHIAAGQIT AEKIGAGQVT ADKIGAGMIT AGVIQIGGNN FVLEAPNNTG
LGRMYCRDSN GTLRVAVGYI NDLSGGGWGL AMWDDASNLI LNGTGVNGGG LWVNSVSANK
LSVTQLSAIT ANVGTMTAGL LQSGDGQMQV DLNNKRIIIW G