Gene Rsph17029_3403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3403 
Symbol 
ID4898255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp457776 
End bp468680 
Gene Length10905 bp 
Protein Length3634 aa 
Translation table11 
GC content70% 
IMG OID640114000 
Producthypothetical protein 
Protein accessionYP_001045268 
Protein GI126464155 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGGACCA GACATTCGCG TAAGGCGGCC GCTGCCGCGG CCTTCGGTGC AACCCTCGTC 
CCGGCCCTCC TGTGGCTTCT GACCGGCCTC GCTCAGGCGC AGGATGCGGT CGACTACGAC
TGGCGCGTCC AGCTCGACGG CTCGGCCGGC CTCGCCATCG CGGCAGGCGG CGTGGCGCTC
TACAAGGTGG ATGTGCAGAA CACGGGCACC GCGGGCGCCC CGCCGACGCG GATCGAGACC
AACATCCCCG CCAACACGAT CTTCCGCCCG GACCTGTCCT CGGCCAGCTG CGCCGTGGTG
GGCAGCCTCG TCGACTGCGC CATTCCCGCC CTCGCCGAGG ACGCATCCAC CGTCGTGGAC
GTGGCGTTCG AGACGCTGGA CGAAGGCACC TTCGTCCTCT TCGCCCGCGT CCCCGACACC
GATGCGCTGG CCGGGAACAA CTACGAGGAA GTCACCACGA CCGTGGAGCG CGGCGCGGAC
ATCTCGGTCT CGCTGACCGC CGATGCGACC GTGCCCTCGG GCGGCACGGT GCATTATGAA
GTGCGGGTCG AGAACGAGGG ACCCCACACG TCCGAGGCGT TCGAGGTCGA ATTCCCCGTT
CCCACGGGCG TCGTCGACAT GACGGGTCCC GCCGGCTGCA CGCGATCCGG CGGCACCTTC
CTCTGCCGGG TGGCGGGCCC TCTCGCCGTG GGCGACAGCC TCGCCTTCGA CTTCGCCGGT
CAGGTGACGG CCGCATCCAA CTCCACCGTG GCCGCCTCGG TCGCGATCAT GGGCCAGCAG
CCGGCCGATG CCGATTCCAC CAACAACCTT GCCACGGCCA GCACGTCCGT TACCGCAGGC
AGCGACCTGC GCATCGTGAA GTCGCGCACG CCCTCGGGCA CCATTCTCGT GGGCGACGAG
CTGGTCTTCA TCCTGTCGCC CAGCTATCGC GGCGACGCGC CCGGCGGGAC CGTGACGGTG
ACGGATCCAC TGCCCGCGGC CTACAGCTTC GTCTCGGTTC AGGCGGACGC GGGCTGGACC
TGCGGCGAGA GCGGCGGCAC CGTGACCTGC GCCCGCCCTG CCCCCACCAC GGGCGCGGGC
GAGGACGTGT CCATGGGCGA GATCCGCATC ACCGCCCGCG CGGTCGATCC GGGAACGCCC
CTCAACATCG CGACGGTCTC CGCCGCCGAC ACCACCGATC CGTTCCCGGG CAACAACAGC
TCGTCGGTGA GCGTGACCAT CGAGGCGCCC TTCGTCGATC TCGCCCTGTC CAAGTCGGCG
CCCTCTCCGG CCCTCGCCGT GAGCGGCAGC CCCTTCGATT ACAGGATCGG CGTGACGAAC
CGCGGCAATG CGGGCTTCGA CGGCACCGTG CGGGTGACGG ATGCGGTGCC CGCGGGCCTC
ACGATCGAGT CTGTGGCAGG CTCGGGCTGG AGCTGTGCGC CCCTGCCCGT CACGGGCCCC
GCCGACCTCG TCTGCGACCG GCCCTACGAC GGGGCGGCGC CGCTAGCGGC CGGGGGCCGC
GTGCCGGATC TGGTGCTGAC CGCGGTCGCG CTGCAGGACG GGCCGGTGCG CAACACGGCT
GGCGTCGAGA CCGTAGACGG CACGCTCGAG GACACGACGC CGGGCAACAA CTCGGGCGGC
GCCGACGTGA CGGTGAGCCA ACCGCCGTTC GCCGCCGACA TCGGCCTGAC GAAGAGCACC
GCCGCGACCG CCATTGCGGG CGAGCCGCAG ACCTTCGAGG TCGAGATCAC CAACTTCGGC
CCCACCGAGG CCGCGAACGT GCGCTTCAGC GATCCGCTGA CCGGTCTCGC GAACGGCGCG
ACCTCGGGCG CGGGCAACGG CCTCGTGAGC ATCGCGGTCG AACCCTTCGC GGCGACGGGG
GCCAGCTGCG GCGGTGCAAC GACCGGCGCG ACCTCGGTCA CGGCGAGCTG CTCCTTCGAC
ACGCTCCCCG TCTGCACGCC CGGCCTCGAC TGCCCGCGGA TCACCCTCGT GACCACCCCG
GGCGGCAATG CCGGCACGCG GACCAACACA GCGACCGTCG TCTCGCAGAC GACGGCCGAC
CCGGCCTCAG CCAACAACAG CGCGAGCGCG AGCTTCACCG TCGAACCCCG GGCCGACGTC
ACGCTCGAGA AGCTGGCCAA CCCCGATCCC GTCGCCGTGG GCCAGTCGCT GAACTATGTG
CTGACCGCGA AGAACGTGGC GAACGGCCTC AGCCAGGCCG AGAACGTGAC GGTCTCCGAC
ACGCTGCCCT CGGGGCTCGT GTTCCTGTCG GCCAGCCCCT CCACGGGCAG CTGCGCCACG
CAGCCGGCGG CCGGCAGCGC CACCTCGGGC GGCAACAATC AGGTGGTGTG CAACCTCGGC
ACCATCAACA ACGGCTCGCA GCAGACGGTG ACGATCGTCG TGCGGCCGCT TCTGTCGCTG
CTGGAATCGA CCCTCGTCAA CCGGGCCGAG GTCACCACCA GCACGACCGA GACCGATGAG
ACCAACAACG CGGCCGAAGC TTCGGTCGGC GTGACCTTCC CCCGCTTCGA CCTGCTGATC
GCCAAGACCG ATACGGTCGA TCCGGTGACG GTGGGCGACG AGACGGTCTA TCGCATCCTC
GTGACGAACA ACGGCCCCTC GGCGGCCGAG AATGTGATCG TCACCGACAC GCTTCCCGCG
ACGCGGCTGA GCTTCGTGCA GGCCAGCGCC CCGGCGGACG GGTCCTGCGG CACCACGCCC
GCGGCAGGCG CCATCGGCGG TCAGGTGATC TGCAGCGTGC CCTATCTCGC CTCGGGCGAG
AGCCGCACCT TCGAGGTGAC GATGCGGGCC GAGGCCAAGG GCTCGGTCAC CAACACGGCC
TCGGTGACGG CCGACCGCGC CGGCGACTTC GAAAGCCGCA CCGACAACAA CACGGCGACG
CAGAACACGA CGCTGCGGAC CCGGGTGGAC GTGCAGGTGG CGAGCAAGAC GCCCTCGGCG
CCCTCGGTGC CGCTGCGGGA GGATTTCACC TTCGACGTGG TGATCCGCAA CGCCAGCGAC
CCGCGCTATT CCGAGGCCGA CAATGTCGTC TTCACCGACA GCCTGCCCTC GGGCATGGTG
CTGACGGGCA CACCGAGCGT GACGATGGCG TCCGGCACGG CCGGCGCGAC GAGCTGCACC
GGCACGGCCG GCGGCACGGC GGTCAGCTGC TCCTTCGGCA CGCTCTCGCC CGGAGCCGAG
GCGGTGGTCA CCCTGCCCGT CCGGGTCACG GCGATCACGA GCACGCCGCA GAGCTTCACC
AACAGCGCAA GCGTCACGAC CGACTCCGAC GACCGCGTGC CGGCGAACAA CAGCAACAGC
GGCTCGGTCG AGATCACCGG CTCGTCCGTC GCGGGCGCGG TCTGGCGCGA TTTCAACGAG
GACGGCACCC GCGCCGGGAC CGACACCGGC CTTGGGGGCA TCGCCATCCA GCTGAGCGGA
ACCGACCTCA CCGGCGCCCC GGTTACCCGG ACCACCACCA CCGACGCGAG CGGCGGCTAT
AGCTTCGGCC TTCTGGCCGA AGGCACCTAT ACGATCACCC GCACCGGCAG CCTCCCCGCC
CGCCACGAGG ATCTGGCGGC GCTGGTCCCG GCAAGCGGCT CGGGCACCGC GTCGAGCGCC
ACTGTCATCG GCTCGGTGGC GATCGGCCCG GACGAGGATC TGACCGGCTA CGACTTCACC
GTGCGGCCGA TCCCGACCGT GGGCATCGCC AAGCGCGTGT CCTCGCAGCC GGCGCTGCGG
GCCGATGGCG CGTTCCGCGT GAGCTTCGCC TTCTCGGTGC GCAACTTCAG CGTCGAGCCG
GTGCAGGGCC TGACCGTGAC CGACGTGCTT GAGGGCGGTC TGCCGGGCTT CGGCACCTAC
AACCCGGATC TGTCCGCCCT GCAACCGGGG GAATATGGCC TCGTCACGGC GCCCGGCGGC
AGCTGCGGCG GCCTGAACTC GGGCTACACC GGCGCGGGCG CGACCGAACT CGTCTCGGGC
GGCACCCTCG CTGCGGGGGC AAGCTGCACC ATCACCCTCA CGCTGCAGGT GCGGCCAGAG
ATCCCGCTGC CCTATGCGGC GTCGCCGCGC TACCGGAACC AGGCGCAGCT GGATGCCGAG
GGCCAGCTCT CGGGCAAGAC CGTGAACGAT CTGTCCGACA ATGGCTCGAA CCCGGACCCG
AACGGGAACG GCTATCCCAA CGACGAGGGC GAGTTCGATC CGACGCCGGT CAATGTCACC
TATGCCCCCG CCATCGCGGT GGTGAAGACG GCGGATACCT CGGGCTTCTC CGATCCGATC
GCGCCCGGAG ACCCGATCCT CTTCAGCTTC GCGGTGACCA ACACCGGCAA CGTGCCGCTG
GCCGATGTGA CGCTGGCCGA TCCGATGCTG CCCGCCGCCT TCGACGGGCT GACCGTCCCG
GTCCTGCTGC CGGGCGAGAC GGACACGTCC ACCTTCGCGG CCACCTACCT GCTCACCGCC
GCCGACATCG ACGCGGGCCG TGTCGAGAAC CAGGCCACCG CCACCGGCAC CTGGACGCAA
GGCTCGGGCG GCGCTCCGGT GACGGTCAGC GACCTCTCCG GCACGACCAC CGCCAACAAC
ACGCCGACGA CGGTGCAGAT TGGAGCGATC TCGCTCGTGA AGACAGCCGA CGAAAGCGGT
CTCTCCTCGC CGCCCTTGGC GGGCGAGACC ATCCGCTACA GCTTCACCGT GACCAACGGC
GGCGTGGCCC CCCTCACCGA CGTGACGCTC ACCGATGCGG TGCCGGGCGT GCAGGTGACA
GGCGGGCCGA TCTCGCTGGC AGGCGGGGCG TCGGACACGA CGAGCTTCAC CGCGACCTAT
GAGCTGACGC AGGCGGACGT CGATGCGGGC AGCTTCACCA ACGACGCGAG CGTGACGGGC
TTCGTTCAGG TTCAGGGCGG CGGCCGGGTG CCGGTCACGG CCGATGACAG CGTGACCACG
CCGCTCGCGC TCGCGCCCGC CATCACCCTG GTGAAGGAGG TCGACACGAC CGGCCTCTCC
TCGCCGGCGG CGGCGGGCGA CGTGCTCGCC TACAGCTTCA CCGTGACGAA CGCGGGCAAC
GTGACGCTTA CCGACGTGAC GGTGACCGAC GACAGCCTCC CGGGTCTCGT GCTGACCGGC
GGGCCTATCG CAACCCTCGC ACCGGGCGAC AGCGACAGCA CCACCTTCAC CGCCTCCTAC
AGCCTGAAGC AGGAGGATCT CGACCGCGGC TTCGTCGAGA ACACCGCACT GGCGACCGGC
ACCTATGCCG GTCCGGGAGG CACGCCCGCC GAAGTGACCG ACCAGTCCGG GACAGACGCC
TCCAACGACG CGCCCACCGT GGCGACCGTG CCCCCTGCCC CGGCCATCAC GCTGGTCAAG
GCGGTGGATG CGAGCGGCAT CTCCAGCCCC GCGGCGGTCG GCGAGCCGCT CAGCTACAGC
TTCACGGTGA CGAACACCGG CAACGTGACG CTGACGGATG TGACGGTCAC GGACACGAGC
CTGCCGGGCC TCGTCCTGTC GGGCGGGCCG ATCACCCTCG CGCCGGGAGC GAGCGACGCT
GCCACCTTCA CCGCCACCTA TGCGCTGAAG CAGGCCGACA TCGACCGCGG CTACGTCGAG
AACACGGCGC TCGTCACCGG GACCCATGTC GACGGGAACG GCGACAAGAC CGAGGTCGAG
GACGTCTCCG GCACCGACGC CGCGAACGAC CTACCGACCC GCAGCGACGT CGAGGCCGCG
CCCGCCATCG CGCTGGTGAA GACCGTCGAC CTCTCGGCCC TCTCGAGCCC GGTCGCGGCG
GGCGACGTGC TGAGCTACGA CTTTGCCGTG ACCAACACCG GCAACGTCAC GCTCACGAAT
GTGACCGTGA CCGACGACAG CCTCGCGGAT CTCGTGCTGA CGGGCGGCCC CATCGCCTCG
CTCGCGCCGA ACGCCACCGA CAGCACGAGC TACACCGCGA GCTACACGCT GACCCAGGCC
GACATCGACC GCGGCTTCGT CGAGAACACG GCGCTTGCCA CCGGCACCTA CACCGATGGC
GCAGGGGTCG AGACGGAGGT CGAGGATACG TCGGGCACCG ACACGAGCAA CGACCTGCCG
ACGCGCGCCG ATCTCGACGC CCTGCCCTCC ATCGCACTGG TCAAGACGGT CGACGCCTCG
GCGGTCTCCT CTCCGGCGGC GGTGGGCGAT CTGCTCAGCT ACAGCTTCAC GGTGACGAAC
ACCGGCAACG TGACGCTGAC CGATGTGACG GTGACGGACG ACAGCCTTGC CGACCTCGTG
CTCGCGGGGG ATCCGATCCC GACGCTCGCG CCGGGTGCGG CGGACGCCAC CACCTACACC
GCGACCTATG CGCTGAAACA GGCGGACATC GACCGCGGCC ACGTCGAGAA CACCGCGCTT
GTCACCGGCA CGCACACCGA TGGCGCCGGG GTTGAGACGG AGGTCGAGGA TATCTCCGGC
ACCGAGGCCA CCAACGACAC GCCGACCCGC GCCGATCTGG GGACCACTCC CTCGATCGCG
CTGGTGAAGG CCGTCGATCT CTCGGCCGTC TCCTCTCCGG CGGCGGTGGG CGATCTGCTC
ACCTACAGCT TCACGGTGAC GAACACCGGC AACGTGACGC TGACCGACGT GACGGTGACG
GACGACAGCC TCGCGGACCT GATCCTCGCC GGCGACCCGA TCCCGTCGCT CGCGCCGGGT
GCGACCGATG CCACCGCCTA CACGGCGACC TATGCGCTCA AGCAGACCGA CATCGACCGC
GGCTTCGTCG AGAACACCGC GCTCGTCACC GGCATCCACA CCGATGGCGC AGGCGTCGAG
ACCGAGGTCG AGGACATCTC CGGCACCGAG GCCACCAACG ACACGGCGAC GCGCGCCGAT
CTCGAGACCG CGCCCTCCCT CGCGCTGGTG AAGTCGGTCG ATGCTTCGGC CGTCTCCTCC
CCGGCGGCGG TGGGTGAGCT GCTGACCTAC AGCTTTGCCG TGACCAACAC CGGCAACGTG
ACCCTGACCG GCGTCACCGT GACGGACGAC AGCCTCGCGG GTCTCGTGCT CGCGGGAAGC
CCTGTCCCGA CGCTCGCGCC GGGTGCGACC GATGCCACCG CCTACACCGC GACCTATGCG
CTGACGCAGG CGGACATCGA CCGCGGCTTC GTCGAGAACA CGGCGCTGGC CACCGGCACC
TACACCGATG GCGCGGGGGT CGAGACCGAG GTCGAGGATA TCTCCGGCAC CGAGGCCACC
AACGACACGC CGACCCGCGC CGATCTGGAC ACCGCGCCCT CCATCGCGCT GGTGAAGACG
GTCGATGCCT CGGCAGTCTC CTCTCCGGCG GCGGTGGGCG ATCTGCTCAC CTACAGCTTC
GCGGTGACGA ACACCGGCAA CGTGACCCTG ACCGACGTGA CCGTGACCGA CGACAGCCTC
GCGGATCTCG TGCTCACCGG CGGCCCGATC CCGTCGCTGG CCCCGGGCGC GGCGGACGCC
ACCACCTACA CGGCGAGCTA TGCGCTGAAA CAGGCGGACA TCGACCGCGG CTATGTCGAG
AACACGGCGC TCGTTACCGG CACCCATACC GATGGCGCAG GCGTCGAGAC CGAGGTCGAG
GACATCTCCG GCACCGAGGC CACCAACGAC ACGGTGACCC GCGCCGATCT GGGCACCGCG
CCGGCCATCG TTCTGGTCAA GACGGTCGAC GCCTCGGCGG TCTCCTCGCC GGCGGCGGTG
GGGGAACTGC TCAGCTACAG CTTCACGGTG ACCAACACCG GCAACGTGAC CCTGACCGAC
GTCACCGTCA CCGACGACAG CCTCGCCGAC CTCGTCCTGA CCGGCGACCC GATCCCGTCG
CTCGCGCCGG GTGCGACCGA TGCCACCACC TACACGGCGA CCTATGCGCT GAAGCAGGCG
GACATCGACC GCGGCTATGT CGAGAACACC GCGCTCGTCA CCGGCACTCA TACGGATGGC
GCAGGGGTCG AGACCGAGGT CGAGGATATT TCCGGCACGG AGGCCACCAA CGACACGCTG
ACGCGCGCCG ATCTCGACGC CCTGCCCTCC ATCGCGCTGG TCAAGGAGGT CGATGTCTCG
GCCGTCTCCT CTCCGGCAGC GGTGGGCGAC CTGCTCACCT ACAGCTTCAC GGTGACGAAC
ACCGGCAACG TGACGCTGAC GGATGTCACG GTCACGGACA CGAGCCTGCC GGGCCTCACC
CTCACGGGCG GGACCATTGC CAGCCTCGCT CCGAAGGCGA GCGACACCGC CACCTACACG
GCGAGCTATG CGCTCACTCA GGAGGATCTC GACCGGGGCT TCGTCGAGAA TACCGCGCGT
GTGACCGGCA CCTATACCGA CGGCACGGGC GGCGAAACCG AAGTCGAGGA CATCTCCGGC
ACCGACGCGG GCAATGACAC TCCCACCGAA GCCCTGATCG AGCCGGCCCC GGCCCTCGCG
CTGGTGAAGA CGGTGGATCT CTCGGGTCTC GGCACGCCGG CGGAAGTGGG CGAGGCGCTC
ACCTACAGCT TCACGGTGAC CAACACCGGC AATGTGACGC TGACGGATGT GACCGTGACG
GACACGAGCC TGCCGGGCCT CGTCCTTACC GGCAGCCCGG TCGCCCGCCT CGCCCCCGGC
GAGAGCGACA GCACCGCCTA CAGCGCGCGT TATGCCCTGA CGCAGGAGGA TCTCGACCGC
GGCTTCGTCG AGAACACGGC TCTGGCCGCC GGCCTCCATA CGGACGGCAC CGGGCGGGAG
ACGCAAGTCG AGGATGTCTC GGGCACCGAT GTCGGCACCG ACGATCCGAC CGTGGCCCCG
GTGGGGCAGG CGCCGGCCGT GGCGCTGGTC AAGGCGGTCG ACGCTTCGGC CGTCTCCTCG
CCGCCGGCGG TGGGCGACCC GCTGACCTAC AGCTTCACCG TGACGAACAC GGGCAGCGTG
ACGCTGACGG ACGTGACCGT CACCGACGAC AGCCTTCCGG GCCTCGTGCT GGCGGGCAGT
CCCATCCCGC GCCTGGCGCC GGGCGAGAGC GACAGCACGA CCTACAGCGC CCGCTACCTG
CTGACGCAGG AGGATCTCGA TCAGGGTCGG GTGTCGAACA CGGCGCGCGT CACCGGCAGC
TACCGGGCGC CCGACGGCTC GGCCGACACC GTCACCGACA TCTCGGGCAC CGAGATCGAG
AACGACGATC CGACCGACAC CGAGTTCGCG CCCGTTCCCG GCATCGCCCT CGTGAAGACG
GCCGATGTTT CGGGTATCGG CAACCCGGCG GCCATCGGCG AATTGGTCCG CTACAGCTTC
ACCGTGACGA ACACGGGCAA CGTGACGCTC GCGGATGTGA CGGTGAGCGA CACGAGCCTG
CCGGGCCTCG TGCTGAGCGG CAGCCCGATC GGGCGGCTTG CTCCGGGTGA GAGCGACAGC
GTGACCTACA GCGCCGCCTA TGCGCTGACG CAGGCCGACC TCGACCGCGG CGTCATCGAG
AACACGGCCC GGGCTACGGG CGCCTATCGC GGGCCGGATG GCGAACCGGG CACGGTCGAG
GACATCTCGG GCACCGAGGC GGAGAACGAC GAGCCGACGC TGTCGCTCGT GCCGCAGACG
CCCGGGATCG CCCTCGTCAA GGAGGTGGCG GACGAGTCCG TCAGCACGCG CCCGGCCCTG
GGCGACGAGC TTCTCTACCG CTTCACGGTG ACGAACACCG GCAACGTGAC CCTCACCGAC
GTCACCCTGA CCGACGATCT GGCGGGCGCG GTGGTCTCGG GCGGCCCCAT CGCCGCGCTG
GCGCCGGGCG AGACCGACAG CACCACCTTC ACGGCACGCT ATGCGCTGAC GGCGGCGGAT
CTCGAGCGGG GCCAGGTCGC GAACACGGCC CGCGTCAGTG GCACCAATCC GGGCGATCCC
GACACGCCGG TGACGGATGT CTCGGGCACC GAGGTGGGCA ATGACACGCC GACCGTGGTC
GAGCTCGACG TGCCCACGGA TGTGACGGCC ACCAAGACCG CCAGCCCCGA GCGCGTGGTC
ATCGGCGAGA CCGTCTCCTA TGTGCTGGCC TTCACCAACG ACGCGCCGCG CTCGATGCGC
GAGGTGGTGC TGGTCGACCG GATGCCCGAC GGCCTCGTCT ACACGCCCGG CAGCGCCACG
CTCGACGGCA CGCCTCTGGA GCCCGAAGTC AGCGGCCGCT TCCTGCGCTG GCAGGCGGAC
ACCCTGCCCG CGGGCGGCAC GATCACCGTG CGCTTCGCCG CGCGCGTGCT GGGGGCTGCG
CCCTACGGGC CGCTCACCAA CAAGACCTGG CTCCTCGACC GCACGGGCCA GCGCTCCTCG
AACGTGGCGG AGGCCGTGGT GATCCGCGAG CCCGAGCATG TGTTCGAATG CGCCGACATC
ATCGGGAAGG TCTTCGACGA CCGGAACATG AACGGCTATC AGGATCCGAT CGACGGCGCG
GCCCACGGCC GCGGCGCCGA GGCCGAAGAG CCGGGCATCC CCGACGTGCG TCTCGCGACG
CCGAACGGGA CCCTGATCCA GACCGACAAG TTCGGCCGCT TCCACGTCCC CTGCGCCGAG
CTTCCCGGCC AGACCGGCGC GAACTTCACG CTGAAGCTCG ACACCCGCTC GCTGCCCTCG
GGCTACCGGG TGACGACCGA GAACCCGCGC ACGATCCGCG TCACGCCGGG CAAGATGGCC
AAGCTGAACT TCGGTGCGGC TCTGGGCCGG GTGGTGCGGC TCGACCTGAC GGCGGCGGCC
TTCGCCGACG GCCGGCCGAC CGCGGCCTTC GCCCGGGCGC TCGAGCAAAC GGCGGCAGGC
CTCGGCGATG CGCCGGTGGT GGTGCGGATC AGCACCCGGC AGGACGCGGG CGGCGCGGGC
GCGGCAAAAG CGCGGCTCGA TGCGGCCGAG GCGCTGGTGC GCAAGGCCTG GAAGGGTCGG
GCCGGGCCGG TGCTCATCGA ACGCACGATC CAGCGGGACC AGTAA
 
Protein sequence
MWTRHSRKAA AAAAFGATLV PALLWLLTGL AQAQDAVDYD WRVQLDGSAG LAIAAGGVAL 
YKVDVQNTGT AGAPPTRIET NIPANTIFRP DLSSASCAVV GSLVDCAIPA LAEDASTVVD
VAFETLDEGT FVLFARVPDT DALAGNNYEE VTTTVERGAD ISVSLTADAT VPSGGTVHYE
VRVENEGPHT SEAFEVEFPV PTGVVDMTGP AGCTRSGGTF LCRVAGPLAV GDSLAFDFAG
QVTAASNSTV AASVAIMGQQ PADADSTNNL ATASTSVTAG SDLRIVKSRT PSGTILVGDE
LVFILSPSYR GDAPGGTVTV TDPLPAAYSF VSVQADAGWT CGESGGTVTC ARPAPTTGAG
EDVSMGEIRI TARAVDPGTP LNIATVSAAD TTDPFPGNNS SSVSVTIEAP FVDLALSKSA
PSPALAVSGS PFDYRIGVTN RGNAGFDGTV RVTDAVPAGL TIESVAGSGW SCAPLPVTGP
ADLVCDRPYD GAAPLAAGGR VPDLVLTAVA LQDGPVRNTA GVETVDGTLE DTTPGNNSGG
ADVTVSQPPF AADIGLTKST AATAIAGEPQ TFEVEITNFG PTEAANVRFS DPLTGLANGA
TSGAGNGLVS IAVEPFAATG ASCGGATTGA TSVTASCSFD TLPVCTPGLD CPRITLVTTP
GGNAGTRTNT ATVVSQTTAD PASANNSASA SFTVEPRADV TLEKLANPDP VAVGQSLNYV
LTAKNVANGL SQAENVTVSD TLPSGLVFLS ASPSTGSCAT QPAAGSATSG GNNQVVCNLG
TINNGSQQTV TIVVRPLLSL LESTLVNRAE VTTSTTETDE TNNAAEASVG VTFPRFDLLI
AKTDTVDPVT VGDETVYRIL VTNNGPSAAE NVIVTDTLPA TRLSFVQASA PADGSCGTTP
AAGAIGGQVI CSVPYLASGE SRTFEVTMRA EAKGSVTNTA SVTADRAGDF ESRTDNNTAT
QNTTLRTRVD VQVASKTPSA PSVPLREDFT FDVVIRNASD PRYSEADNVV FTDSLPSGMV
LTGTPSVTMA SGTAGATSCT GTAGGTAVSC SFGTLSPGAE AVVTLPVRVT AITSTPQSFT
NSASVTTDSD DRVPANNSNS GSVEITGSSV AGAVWRDFNE DGTRAGTDTG LGGIAIQLSG
TDLTGAPVTR TTTTDASGGY SFGLLAEGTY TITRTGSLPA RHEDLAALVP ASGSGTASSA
TVIGSVAIGP DEDLTGYDFT VRPIPTVGIA KRVSSQPALR ADGAFRVSFA FSVRNFSVEP
VQGLTVTDVL EGGLPGFGTY NPDLSALQPG EYGLVTAPGG SCGGLNSGYT GAGATELVSG
GTLAAGASCT ITLTLQVRPE IPLPYAASPR YRNQAQLDAE GQLSGKTVND LSDNGSNPDP
NGNGYPNDEG EFDPTPVNVT YAPAIAVVKT ADTSGFSDPI APGDPILFSF AVTNTGNVPL
ADVTLADPML PAAFDGLTVP VLLPGETDTS TFAATYLLTA ADIDAGRVEN QATATGTWTQ
GSGGAPVTVS DLSGTTTANN TPTTVQIGAI SLVKTADESG LSSPPLAGET IRYSFTVTNG
GVAPLTDVTL TDAVPGVQVT GGPISLAGGA SDTTSFTATY ELTQADVDAG SFTNDASVTG
FVQVQGGGRV PVTADDSVTT PLALAPAITL VKEVDTTGLS SPAAAGDVLA YSFTVTNAGN
VTLTDVTVTD DSLPGLVLTG GPIATLAPGD SDSTTFTASY SLKQEDLDRG FVENTALATG
TYAGPGGTPA EVTDQSGTDA SNDAPTVATV PPAPAITLVK AVDASGISSP AAVGEPLSYS
FTVTNTGNVT LTDVTVTDTS LPGLVLSGGP ITLAPGASDA ATFTATYALK QADIDRGYVE
NTALVTGTHV DGNGDKTEVE DVSGTDAAND LPTRSDVEAA PAIALVKTVD LSALSSPVAA
GDVLSYDFAV TNTGNVTLTN VTVTDDSLAD LVLTGGPIAS LAPNATDSTS YTASYTLTQA
DIDRGFVENT ALATGTYTDG AGVETEVEDT SGTDTSNDLP TRADLDALPS IALVKTVDAS
AVSSPAAVGD LLSYSFTVTN TGNVTLTDVT VTDDSLADLV LAGDPIPTLA PGAADATTYT
ATYALKQADI DRGHVENTAL VTGTHTDGAG VETEVEDISG TEATNDTPTR ADLGTTPSIA
LVKAVDLSAV SSPAAVGDLL TYSFTVTNTG NVTLTDVTVT DDSLADLILA GDPIPSLAPG
ATDATAYTAT YALKQTDIDR GFVENTALVT GIHTDGAGVE TEVEDISGTE ATNDTATRAD
LETAPSLALV KSVDASAVSS PAAVGELLTY SFAVTNTGNV TLTGVTVTDD SLAGLVLAGS
PVPTLAPGAT DATAYTATYA LTQADIDRGF VENTALATGT YTDGAGVETE VEDISGTEAT
NDTPTRADLD TAPSIALVKT VDASAVSSPA AVGDLLTYSF AVTNTGNVTL TDVTVTDDSL
ADLVLTGGPI PSLAPGAADA TTYTASYALK QADIDRGYVE NTALVTGTHT DGAGVETEVE
DISGTEATND TVTRADLGTA PAIVLVKTVD ASAVSSPAAV GELLSYSFTV TNTGNVTLTD
VTVTDDSLAD LVLTGDPIPS LAPGATDATT YTATYALKQA DIDRGYVENT ALVTGTHTDG
AGVETEVEDI SGTEATNDTL TRADLDALPS IALVKEVDVS AVSSPAAVGD LLTYSFTVTN
TGNVTLTDVT VTDTSLPGLT LTGGTIASLA PKASDTATYT ASYALTQEDL DRGFVENTAR
VTGTYTDGTG GETEVEDISG TDAGNDTPTE ALIEPAPALA LVKTVDLSGL GTPAEVGEAL
TYSFTVTNTG NVTLTDVTVT DTSLPGLVLT GSPVARLAPG ESDSTAYSAR YALTQEDLDR
GFVENTALAA GLHTDGTGRE TQVEDVSGTD VGTDDPTVAP VGQAPAVALV KAVDASAVSS
PPAVGDPLTY SFTVTNTGSV TLTDVTVTDD SLPGLVLAGS PIPRLAPGES DSTTYSARYL
LTQEDLDQGR VSNTARVTGS YRAPDGSADT VTDISGTEIE NDDPTDTEFA PVPGIALVKT
ADVSGIGNPA AIGELVRYSF TVTNTGNVTL ADVTVSDTSL PGLVLSGSPI GRLAPGESDS
VTYSAAYALT QADLDRGVIE NTARATGAYR GPDGEPGTVE DISGTEAEND EPTLSLVPQT
PGIALVKEVA DESVSTRPAL GDELLYRFTV TNTGNVTLTD VTLTDDLAGA VVSGGPIAAL
APGETDSTTF TARYALTAAD LERGQVANTA RVSGTNPGDP DTPVTDVSGT EVGNDTPTVV
ELDVPTDVTA TKTASPERVV IGETVSYVLA FTNDAPRSMR EVVLVDRMPD GLVYTPGSAT
LDGTPLEPEV SGRFLRWQAD TLPAGGTITV RFAARVLGAA PYGPLTNKTW LLDRTGQRSS
NVAEAVVIRE PEHVFECADI IGKVFDDRNM NGYQDPIDGA AHGRGAEAEE PGIPDVRLAT
PNGTLIQTDK FGRFHVPCAE LPGQTGANFT LKLDTRSLPS GYRVTTENPR TIRVTPGKMA
KLNFGAALGR VVRLDLTAAA FADGRPTAAF ARALEQTAAG LGDAPVVVRI STRQDAGGAG
AAKARLDAAE ALVRKAWKGR AGPVLIERTI QRDQ