Gene Rcas_2197 details


Gene Information

Locus tag: Rcas_2197
Symbol:
ID: 5539678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2821552
End bp: 2837163
Gene Length: 15612 bp
Protein Length: 5203 aa
Translation table: 11
GC content: 63%
IMG OID: 640894330
Product: hypothetical protein
Protein accession: YP_001432298
Protein GI: 156742169
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
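
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2821552
end_bp = 2837163

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length (15612 bp) and protein length (5203 aa).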


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTAA CACATTATCA GGTAACAGCC TGTTACGTTC TACACAAATC AACGTCACAA 
AACCAGGAAA GGGATCGGAC CATGCGCCGT CGTCCAACTC TGCGGTTCGC ACCGCTTCTC
ACCTTGCTCG TGGGCGCTCT GACGATTGGT AGTTTTTTCC TTGGAACGAT TGCCCTCGCC
GCCGGCAACC CGCTGCCGGT CGCAACGGTC AACAGCGCGA GCGGTCTGAT CGGCGAACCG
GTGACCCTGA CCGTCACCTT CGATAATGCC AGCACCGGAA CCGGCGATCA GACCGGGTAT
GGTCCGTATA TCGACCTGTT TCTCGACACC ACCGGTCCCG ACGGCGCCCC GTCGTTCGAC
GGGCTAGTGA CGACTGACAT CCGCGCGACC TACCTCAATC AGCCGCTGCC GGGATTAGAG
TTGATCACGA TTAGCGGACC AACCTACGTC CATCCGCTGA CCGGGCAGAC GCTGAGTGTG
CCGGGCTATG GCACGCGCTT CCAGAACGGC GACACGATTG CCGTTCTGAC GTTGCCGTTC
GGCAGTTTCA CCAACACACA GCCACCCGCG CCGATTGATG TGACGCTGCG CATCAGCAAC
CTGGCAGACC TCGGCACGCC GCTGCCGGTC ACGGCAATCG CCGGCTTCCG CTACGGCGCC
GACCCGCTCG ACAACCCCGG CGCCGACCCG CCGATCCGCC AGACGACCCC GTCGGACGCC
GATGTTACGC CGATTCTCTG GCGCCTGACC AAAACCTATC TCGGACCGGA AGACGAAACC
GCGACCGGAC GCAACTATCC GCGGCGCTAT CGCCTGGATG TCGATATTGC CGAAGGGCAG
CCGGTGACGA ATCTGCAAAT CACCGATACT CTGTCAAACA GCATGCGCAT CACCGGCAAC
ACCGCCGCGC AGATGAGCGC GCGGCTCTAC AATACGCCCG GCGTTCCAAC CGAGGTCTTC
AATCCGGCGA ATCTCTCCGG CACTGCCACG CCTGCGGCGC CGGGGGGAAC GCTGGTCTAT
ACGTTCGGCG ACAAGACCGG TGTGCGCGGC GTTGATGCGT CGTTCGAGTT CGAGTTCTAC
ATCCCGCGCG ACACCAGCGC CGCCACACCA ACTGTGCCGC AGGGAACCGA CTCGACGCTC
GCCAACAACA CCGGCAGTTC ATCGGCAACA TGGACCCCAA TCGATACGCG GGATGCGCCG
ACGTCGATTA CCATTACGCT GCCGCCCGAT GCGCATACCC TCCAGCAGCA TTCACTGGCG
ACCCAGAAAT CGGTCACGCC GGTTGATCGC GCGACCCTTG CGCCAACCGG TGGTGTGATC
ATCCCTGGTC AGACGCTGCT GCGCTATGAC ATCGATTTCC AGGTCTCCGA CTATTTTGCG
GTTCAAAACG TCTATCTCGA AGACATCATT TCCGACGGGC AGCGCCTTTT TGTGCGAACA
TCGGGCGGAC CTTCGACTGT GCCAACGCTC CAGGTTGAGA ATGCCTACGT TACCGGTGCG
TCGCCGACGC GCATGAATGT CGCAACGGCG CCCTTCGGCG GTGCGAACAC GATTGTGTAC
GAGCAGCGGT TCACCACTCG CAGCACGCCG CTCTCCGACC CGACTAGCCC GCCATCCGGT
TCGGTGTTTG TCATCAACAC GCCGGCGCCG TCGAGCAGCG GCACAACCTA TGTGCGCTTC
AACATTTCAC AAGAACTGAT TGCGCGCGGC TTCAGCGGGC GGCTCGTCGG CGGCGATATT
CCCAACGCTG GCGGCGCGCC GCAGAACAAC AACCCGCCGC TCTTCGGTCC AACGCAAGGA
CGGATCACGT TCTACGCTGA GGTCAAGGAT GAGTTCAGCG ACGATTTCCC CTCCGGCGAC
CGCTCGGTCG ATCAACTCGA CATTCTGCGC AATACGGTTC CGCAGATCCG CGGTGAGCAG
ATCAACACGA CGACGATCAA CAATCCAACG CCGACGGTGA TCGGGCAGGC GACCGACGAT
ACCGCCGCCA GCGTCGAGTT GCCGATTGGC GACCGCAGCA AGACGCTCTA TGCGCTCAAC
GGACAGACGA CCAACCTCGG CGACCCGGTG ACGGTGCAGC CAGGCGACCT GGTGACGTAT
CGGCTCACCT ATCGCCTGCC GATCAGCAGT TTCGAGAATA TCCGGTTGAT CGATTTCCCA
CCGCTGCCCG TCTTCCCTGT GCCGGATCGC ATGTTCTTCG ACGCGGCGGC GGCGCCTGGC
ACGATTCCCG GCGCAAATAT GGTGACGCGC GGACCGTCTG ACACTTACCT GACGACGATC
AATCCGCCAA CGACGGTGCG CGTGGCGACG ACCGCGAATA TCACCGGCTA CAATCCCGCC
GGCATTGGCT CATTCAGTGG CGCACCCACG ACCATCGACG GCGTCGCGCT GGCGAACGGC
GACCGCGTGC TGGTGAAGGA TCAGACCGAT GCGCGGCAGA ATGGCGTCTA TGTGGTCGTG
GATGCCACAA CCGGCGCCTG GAACCGCGCC GCCGATTTCG ATACCCCGCG CGAACTTACG
AACGGTCCGC TCGTCGGCGT GACAGCCGGC GCTACCAACG CCAATCAGCA CTTCCGTCAG
TCGAATCAGA ACTTCAACAC CTTCAACACC GATCCGATCA CCTGGGCGCC GTTCATCACC
ACCGACGCAG CCGGCAATAG TTTCACCCTC AACTTCGGCT GGCACGACGA TGTGGACGGC
GGACGGCGCT CCTCGCTCAT TGACGTGCTG GTGACCCTGC GCGTCGCCGA TGCGGCGTTC
GCCAACGACC TGTTCCTGAC CAACCAACTG CGGGTGAGTG AGTCGTCCAC CAATGCGGGA
TCGAAGGATT TCGATGAGAT TGTCATGTTC GAGGTCATTC GCCCGAATGT GACAATCAAT
AAGGGAATCG TTGGCTATAA CGCGAGTGGT CTGACCCTTG GCGGCGTGAC GTTCAACCCG
CCTGATGGAC CGACGACCTT CACCGGCGCG CCGGTCTACA CCAACACTCA GGCGCTGGCA
ATCGGCGCTT CCGACTCCCT GGTGCTGACC GACGCCGGGG ATCGAGTGCG CTATGCGATT
GTGCTCCAGA ACGAAGCGCG TGGTGATGCC TACGATGTCA CGGTGACCGA CGCCATCCCT
TCCGCTTATG CGCGACCTGC AACGCTGGCA GCCGCCAATT TCGCCGTCCG CCGCGGCGAC
GGCACGCTGC TCACCGGCGA TGTCGTCAGT GGCACGGTGC GGGTTGCCAC CACCGCACCG
TTGACCGGGG CGACATTTAC CACAACGCCG AATAATGGTC AGTTCACGAA TGCGCCACGC
ACTATCGACG GCGTGACGTT GAACGTCGGC GACCGGGTGC TGGTGAAGGA TCAGACCAGC
GCAACCCAGA ACGGCGTCTA CGTCGTCACT GCGGTCTTCC CCGGCGTCAA CCAGGCGACT
CTGACCCGCG CCGACGATTT TGACGATAAC ACTGAACTGA CCGGCGGCTA CCGCGTCGCC
GTGCTCGGCG GGTTGAGCAG CAACGCCAAT CGCGCCTTCA GCGCAACCGG ACCAATTACT
CTCAACACGA CGCCGGTTAC CTGGAGCGAT GCTGGCATCA GCGATTACTA TGCGACATAC
AACCCCTTGA CCGGCGCCTT CAGCGTTGTG CTGGCAGACA ATTACACGGC GGGCAACACA
ACGGCGCCAA CCCGCGACGA CCGGCGCGGC GGTCTCAGCC GTGGCGCGTC GGGTCCGAGC
AGCAACATCG TCGGCGTGAC GAACGGCTCG AACACCATCA TCATCACCTA TGACGTGACC
CTGGGCGATA ACGTCGAGCC GAATCAGCAG ATCATCAACA CTGCCACACT GACGAACGTC
GCCACGAGCG ACGGCGGCAT CGACGATCTG CCCGATCCGT TCGATACGGC GCTGGTCACT
ATTCGCCGTC CAGAGATTGC GAAAACACTG ACCGCCACGG AGATCGAGAA TGCCACGAAT
GCGCGCCAAC AGGCGGTCAT TGGCGAACTC GTCACCTATA CCCTGATACT GACCGTGCCC
GAAGGGGTTA TGTCAAACGC CGCGCTGACC GATACGCTCG ACGTCGGGCT GGCGTTTGTC
GATGTGACGG GAGTCACGGC GTCACCGTCC CTCAGTTTTG TCGGCGGCGG GCTGCCGACA
GTTGGCGCAA CTCCGGCAAA CACCACGATT GGCGCAGGCG GGCAGACACT TGTGTTCAAT
TTCGGCGCGA TCACCAACAG CGACCGGAAT AATGCCGCTG ACGAAACAAT CGCCATCACC
TACCGCGCCG TTGTGCTGAA TACCGTCGCC AACCAGAGCG GCGCGCAGCG CAACAACCGC
GCCGATTTTT CGTGGGATGT CCCTGGACAG GGTTCATATG CGCTGTTGCC GGTCGAAACC
GAAAATGTCA CGATCATCGA GCCGACCCTG GCGGTGGGCA AGACTTTGCC GCCTGGTCCA
TATGATGCCG GCGATACCAT CGACTACACC ATCACCATCA GTCACACAGC AGGCAGCCAG
ACCGACGCCT ACGATGCGGT GATCACCGAT ACCCTGCCCA TCGATCCGTT TGGCAGCGGC
TCGCTCATTC TCACGCCAGC AGTGCTGAGC GTCACCGACA GCGCCAGTCT GCTGACGACC
GCCGATTTCA CGCTGACCGG CAGTGATGCG ACATCCTGGA CCCTCAGCAA CCCGACGCCG
ATTGATGTTG CCCAGGGACG CACAATCAGC ATCGTTGTGC GCGGCACGCT CTCCAACGCC
GCCATGCCCG GCGCGACGAT CACAAACACT GCCTTTCTGC GCTGGACGTC GCTCGACGGC
AATCCGGGGC AACGCTCGAC GCACGCTGCC GACTCAACGG AGCGCACCGG CGCCGATGGA
CCCGGCGGCG CGCTGAACGA CTACGCCGCC ACCGGTTCGG CCACGTTCGT CATCCCGACG
GTAACGCCGG GCAAGGCGCT GGTGGCCACA TCCGAAGCGC ACACCTCCGA CAGCAACGTG
ACGATTGGCG AAATTGTGCG CTTCCGCATT GCAATTGGCG GACCGGAAGC CTCGGTCTAT
GCCTTCTCAC TGGTCGACCG GTTGCCGCCG GGGTTGACCT TCCTCAATGA CGGGTCGGCG
CGCTTCGCGT TCATTGCCAA CAGCAACATC ACCACGGCAG GCGTCTATGA CGTTGCTCCG
GTTTCCTGCC CGGCAATCAA CCCGACCGGC GTCGCCACGC TTGCCGATGT GTTGAATCCA
ACAATTCTCC CCTCGTCGAG CATTAACTGC ACGTTTGGGG ACAGCAATAT CTCCAGCAAC
GAAACGACGA ACCAGGATGT CTATGGCAGT GGAACCGATG TCTTCTTCCG CTTTGGCAAT
CTGTCGAACA ACGACAATGA CGCCGATGAA GAGTTTATTG TCGTCGAGTT CAACGCGATT
GTCGATAACG ACACGACTGA CCCGAACGAC GCTGGCGATG TGCGCAACAA TACTGTGGTT
GCGCGCATGA ACCCGCCTGG CTTCGGCGCA TTCGAGACCG ATCCCTCGGC GCCGACGTCT
GTCACCGTCG TCGAGCCGAA TATTGCGTTC AATGCCGCGA CGAACAACAA GACCGCCACC
CCGACGAGCG GCGACGCCTT TGACCTCATC ACCTACACCG TCACCTTTGC GAACGCGACG
GGCGCGATGG TGAGCGATGC CTTTGACGCG CGCATTCTCG ATACGCTGCC CGCCGATGTA
ACCCTCATTC CTGCCAGCAT CTCGGTGTCG TCCGCAGGCG GCTGCGCTAC CGGCGTCACC
AACGCCAGCG CAGGCAACAC AGTGGACGTG ACGATCGGGC GTGTGCCGCC GGGATGTAAT
GTGACCATCA CCTACCAGGC AACCCTGAAT GTCTCCGTCG TCCCTGGTCA GGCGATCACG
AACACCGCGA GCCTGACCTA CACCAGCCTG CCGGGGAACA CCGGCACGGG CGGGTTCTTC
GGCTCGACGG CGGGAACGAG CGGCAGCGCC ACCGGCGAGC GCAACGGTTC CGGCGGCATT
AACGACTACG CTGGCAGCGA CACGGCGATG GTCACGATTG TTGCGCCGGC GCTGGCAAAG
CGGATCGTCG CCACGTCGGA AGCGCATACC GCCGATCCGC TCGTCCTGGC AGATTTCAAT
AACGCCGGGT TCGATGGCTT CCTGCCGGGG TCAGTGTTCA CCGGCGGCAA CAACACCTGG
GACGACGCGC CGGGCAATGT CACAGTCTTC TCGACTTTCG TGCGCATTGC CGGAAACTCG
ACAGAGCGCG GCGGCGGTTT CATCACCTTC GCCACGCCGG TCGATCTCTC CAACCACACT
GCGCTGGCGC TCTCCGCGCG CCTGGTCGCC GGGAATGGCG CCAATAACAT CGATATCCAC
CTTCAGGACG CCGACGGCAC CAACTGGCGC TGGCGCTTCC CTGCGAGCAA CTTCACCACC
TCGACGTTTA CGCTTGTGCC GCAAAACCTG CTCGGGCCAA ACAGCACGAC GATCGCAGCA
GGCACAACGC CGGGTCTCGA TCTGCGCAAC ATTGTTCGCC TCGAAATTCG CGGCGACGAT
GGCACAGCGC AGTTCGACAT CGATATCGAC GCAGTGATCG CGTTCGGCAA TATCGCAGCG
CCGGGTGAGA TTGTGCGCTA CCGGCTGGTG ACGACGATCC CTGAAGGCAC GTCGCCAAAC
CTGCAACTGC TGGATCGCAT TCCCACCGGC ATGCGCTTCA TCAACGACAA TACCGCGCGG
GTGGCATTCG TCAGCACCAA CGGCATAACA TCGACGAGCG CCGGCGCGCA GGTTCCCGCA
CTCAGCGGCA CAGGGCTGAA TATTACCGGC AACCAGGATA GTGTTGTCGG ATTGAGCCTG
GCGGTCGGCA GCGGCAATGG GCTGGCAATC GGCGAGGGGA CTGCGTTCGA CGGCAATGTC
TCATCGAGCA ACGATGTCTT AACAGATACC GACACATATA ACAATGGCAG CGATGTCTTC
TTCCGCCTCG GAAATATCGT CAACAACGAC AACGACGACG ATCTGGAATT CATCGTCATC
GAGTTCAACG CTCAGGTGTT GAACGTGTTC AACGTCGGCA ACCAGAGCGG CGTGGGGTTG
AACAACGACT TCCAGTACTT CCGCAATGGG TCGCAGGTTG GATCAACGTC TGCGCAGAAC
GGGATCAACC GCATCACCGT CGTCGAGCCG CAGATCAACA ACCTGAGCAA GACGCTCGCC
AGCGCGCCGC CGGTTGATGC CGGGGATGTC TTCACCTACA CGCTGCGCTT CGCCAACGGC
GTCGCCTGGC CCACCAGCCC GGCGGTTCCG GTGCGCGTCG CCACTACCGG GAACATCGCC
GGTTTCAACG CCACCGGCGG GTTCGGCGGA ACCGGTCAGT TCGCCAATGC GCCCGCCACG
GTCGATGGCG TGACACTGAA TGTCGGCGAC CGCATCCTGG TGCGCAGCCA GACGAACGCT
GCGGAAAACG GCATCTATAC GCTGGTCTTC ATCGATCCAT TCACCGGCGC GCGCGTCTGG
GACCGCGCAA CCGACATGGA CGACACAGCG GAACTGGCGC TCGGCTACCG TGTGTTCGTG
CAAGAAGGCG CCACCCTGGC AGGTAGAACC TACTACCTGG ACGAGCCGGT TCCAACCATC
AACGCCGGTG CGCTCAACTG GCGCGAAGTG GCGCCGGCAA TGACCGTGGC AGTGGCGACA
ACCGGTTCAT TGGGAGGAGC GAATTTCAAT TCTAGTGGCG GACTCCTTGG ACGAGGAACA
CTGACAACAA CCGCAACGAC CATCGACGGT ATTCCGGTCA ACGCAACCAA CTTCCCGGTT
GGTACGCGCA TCCTGGTGAA GAACCAGAGC ACCGCCGCAA ATGGTGTGTA CCAGGTCACC
GGGTTTACCT CCCCAACCCT GACCCTCGAG CGCGTGCCGG AGTTCGATTC GCACCTCGAA
GCGGTGATCG GCGCGCAGGT GTATGTCACC GGCGGCACGA TCAACGCCGG GCGCACCTTC
GCGGTGCAAA CGGCGCCTTC AGAAACGCCT ATCACCACGT CCACTAATTT CGAGATTGTG
GATCAAGTCG CCGCTTTTGA TGTGACGGTC TTTGATCAGC TTCCACCAAC GCTGGAACTC
TTGGGCGTGC AGATCGACGC GCCGACCGGA ACCACGGCGA CCAACGGCTC GACCCTCGGC
GTCGGCGGCG TGATCAGTTA CACGCTGGAT CGCCTCGACT CGGTCCAGGA CATCACCGCC
GGCAGAAACG ATGTCGTCAT CACGGCGACA GTGCGCGTGG TTACCGGAAC GGTCGCCAGC
GCACAGATCA CGAACACAGC GCGGGTGCAG TACACCAGCC TGCCGGGCGC GCGCGGCACG
CTCAGCAACC CGACCGGCTC CAACGTTGAT GCGGGCACGG CAGGTACGCA GAATGGCGAG
CGCACCGGGC AGGGTGTGCT CAACCCGACC AACAACACGC CGCCGTCGAA CAACGGAATC
CGCAATAACT ACAGCGTCGG CGCTATTGCG CTGAACCACC TGGCGGCGCC GATGTTCGAC
AAGCAATTCC AGGGTGGCAG CATCTCCGAT GATGACAGCA GCGTGCCGGG CACTTCCGGC
GCCAGCGTCG CTGTCGGCGA AGCGGTGCTC TACGACCTGC GCGTGACTCT GCCCGAAGGG
ACGACGAACA ACCTGCGCGT CGTCGATGCC GTGCCCGATG GGATGCGTTT CGACACCTCG
TTCAACGGCG GTCTCGGCTA CCAGATTGTA ACGACGGCCG GCGGATCGCT CGCCGAAAAC
TTCAGTGATC CGGCGGCGGT TGCGTCGCCG ACGCTGTCCG TCTCCGGCAC GGGGACGCTC
GGACAGGACG GCGTGGACGC GCTCTTCTCC TTCGGCAACG TCACCATTGC CGATGACAAC
AACCCGAACA ACAACTCCTT CATCATCCGC GTTCGCCTGA TTGCCACCAA TACGGCGGCG
AACCAGAGCG GCGCCAGCCG CGACAACGGC GGCGCGCTGC GCTACACGAA CGGCTACAGC
GGCGGTGACA ACGAACTGCG CGATCCGACC GAACCGCGCG TGACGATCAT CGAGCCGACA
CCATCGATCC TGAAGTCGGT CAGCGGCGCA GCAGCCGACG CTGGCGACCC GATCACCTAT
ACGCTGCGCA TCGAGAATAT CGCGCTACGC AGCGAAATGG ACGCTTACGA CGTGGTGATC
AGCGACACAA TTGCGCCCGA ACTGATCAAC CCGACGATTG TTGCAGTCAA TGTTACGGGC
GCTCCCGACG TAAGCGCCGC CGACTTCGAG ATCGCTACCG TCGGACCGGA TCGCATCCTG
CGCACGACTG CGCCGATTAC CCTGCCGCTC GGCGCTACGG TGACCATCAC ATTCACCGGC
GACCTGACGA GCACAGTGAC GACCGATCAG ATCATCACCA ACCGCGCGTC GATGTTCTGG
AGCAGCACAC CCGGAACCAA CCCGGACGAG CGCACCGGCG CCGATGTGCC CAACCCGCCG
GAGTGCAGCA GCCTGCCCGG CGACTGCAAC CTGGACAGCA CGCAACTCAA CAATTATGGT
CTCATTTCAT CAGTAACGAC GACCGCAATT GCGCCGGTTG TCGTGTCGAA ATCGGTGGTC
GAAGGGATTG CCCCCTCGAC GCCTGGCACA GACGTGACCA TCGGCGAAAT CGTGACCTAT
CGCCTGGCAG TCAGTCTGCC GGAGGGGGTA ACAAGCGGGC TGGTGATCAC CGACACGGTG
CCGAGCGGCA TGGCATACCT GCCGGGAAGC GCGACGCTGA TCACCGGCAC GCTCGCCACC
GGTGACCCGC CGCTCGCCGG CGGTCACCCG ACCGATGCAG GCAGCCTGGC GTTCAACGGC
GTCTTCAGTG ACACGACCGA TCCTTCGGTG ACGCCAATCG GCAGCGGGCA ATTCCTGAAC
GGAACCGGCA TCCGCTTCGA GTTCGACCAG ATCACGCTGC CGGGGGACAA CGACGCGCGC
AACAATACCT TCTTCATTCG CTACCAGGTT GTCGTGCTCG ATGTAGCGGG CAACACCGGC
TTCAGCGGCA GCCAGAAGAC CCTGACGAAT AGTGGGCAGT TCGATGTGCC ATCGACGCCG
CAGCCGCCGA CCGATCTGCT GATTGACCCG AACGGCGCAA CGGTCACGGT CGTCGAGCCG
GAACTGGCGA TTGCCAAATC GGTCACTCCA GCAAGCGGCG ACGCTGGCGA TGCCGTGACA
TACACCATCA CCCTGAGCCA CACGGCGCGC AGTCTGGCAG ACGCCTTCGA TGTGGCGATC
AGCGATGCGC TGCCGGCGAC GGTTGGCTCA TCACTCACTG CGCCGATCAC AATCGCCAAT
GTGAACGCCA CGCACTCGGT TGATGGCGAC ATCACCGGCA ACTTTAGCGT GGCGGGCAAC
ACGTTGACAA CGACTACGCC GTTCACGCTT CCGCGCGGCG AGACTGTCAC CCTGACGATC
ACCGGCGTGC TGCGGCAGAG CGTCCAGCCC GGCGAGGTCA TCACCAACAC CGCCGTGCTG
ACCTCCACCA GCCTGCCCGG TCCAAACCAG GACCTCAGCC CGGATGCCAC CGGCGTCAGC
GATGGAACCG ACCGCGAGCG CAGCCGCAGC AGCAGCAGCA GCGCCACACA CTCCGTCGGG
CAGGGCGCCT TCAGCAAGGC GATCCTGAGC ACCAGCGCCA CCCACACGAG CGGCACAGAC
GTCACCATCG GCGAAGTCAT CAGTTATACC CTGATCGTCG ATCTACCGGA AGGCACGATC
CGCGACCTGA CGCTGACCGA TGACCTGCCG GCAGGACTCG ACTACGAAGG GGTCACGGTC
ATCACCGACG CAGCGCAGAG CGGCGGTCTG TTGACGCAGG ACTTCGCCGG AACCGTGCCG
TCGATCACCG TCACCGGCGG CGCGGGCAGC GGCGATGACG TCACCTTTAC GTTCACCGGC
GACGTCGTGG TGACCGGTGA CAACGACGCG GCAAACAACC GCTTCCTGGT TGTGGCGCGG
GTGCGGGCGC TCAATGAGCC GGGGATGGTC GGGTTGATCC CGCCGGGACA GACCATCCTG
ACGAATACCG CAACCATGCG TTACACCGAC GGTTCGAACA TCACACGCGC GTTCACCGAC
ACTGAAACAG TGCGCGTCGT TGAGCCGCAG TTGACGATCA CTAAAGATAT TGTTCAGATG
GTGGCGAATG CGGGCGATCC GATCACGATC ACACTGACGG TGACGAACAT CGGCGCGTCC
GATGCCTTCG ACGTGGTTAT CACCGATACG CTGCCGCCTG AGTTCGACGC CGACACCACA
TCTTTCGGCA CGGCGGGGAG CGACTACCCG GCGACCTTCA CCCCCTCGCG CACCGGCAAT
CAGGTGCGCT ACGAGGGTGG ACCGATCCCG GTCGGTGCGA CGATGACCTT CACCTTCCGC
GTCAACCTGA CCGGCACGGT GACGCCGGGG ACGACGATCA CCAACACGGC GCGCATGGCG
CAATCGACCA GCCTGCCGGG TGATGATCCA ACCGAGCGGG TGCAGACGCC GGTTGAGTCC
TCTGACACGC TCACCATTCG TAGCAACAGT CTGAGCGGCT TCGTCTACGT GGACAGCGAC
AACGACGGCG TGTTCGACAC GGGTGAAAGC GGCATCGGCG GCGTGACCAT CACGCTGTCG
GGAACCGACC ACCTGGGCAA CAGCGTCCTG CTGACGACCA CCACGACGAT CACCGGCTTC
TACCGCTTCG ACAATCTCTA TCCGGGTGTG TACACCCTGC TGGAAACGCA GCCGTCCGGC
TATCTGGACG GAACCGACGC TATCGGCACA CAGGGCGGCG CGACCGGCAA TGATGTGTTG
AGCAACATCG TGCTGCCGGT TGACACCTCG ACCAACGGCG AGAACAACAA TTTCGGCGAA
ATACCGGCAG CCCGCATCGC CGGGTTCGTG TACGAGGACG ACAACAACGA CGGCGTCTTC
GACACCGGCG AGAACGGCAT CGGCGGCGTG ACCGTCACCC TGACCGGAAC CGACGACCTG
GGCAACCCGG TGAACCTGAC GACCACCACG ACGATCACCG GCTTCTACGC CTTCGACAAC
CTGCGTCCGG GGACCTACAC CGTGAGCGAG ACGCAGCCGT CCGGCTATCT CGACGGACTT
GACACGGCGG GAACGCTCGG CGGCGACACG ACGGTCAACG ACCGGATCGC CAGCATCGCG
CTGCCGCCAG GCGCTGCCAG CCTGAACAAC AACTTTGGTG AACTGCGTCC GGCAAGCCTG
TCGGGGTTGG TCTACCGCGA CGACAACAAC AACGGCAGCC GCGACGGGAG CGAGCCGGGG
ATCAGTGGCG TCACCATCAC CCTGACCGGA ACCGACGACC TGGGCAACCC GGTGAACCTG
ACGACCACGA CCACGATCAC CGGCTTCTAC ACCTTCGACA ACCTGCGTCC GGGAACCTAC
ACGGTGATCG AGACGCAGCC GTCCGGCTAC TTCGACGGCG CCGAAACCGT CGGTACGGCA
GGCGGGAGCA TCCTCAGCAA CGACGTGATC GGCAATATCA CCCTGAATGC AGGCGTTGCT
GCGACGGGGT ATGACTTCGG TGAGGTTCCG GCAGCCCGCA TCGCCGGGTT CGTGTACGAG
GACGACGACA ACGACGGCGT CTTCGACACC GGCGAGAACG GCATCGGCGG CGTGACGATC
ACGCTGACCG GAACCGACGA CCTGGGCAAC CCGGTGAGCC TGACGACCAC CACGACGATC
ACCGGCTTCT ACGCCTTCGA CAACCTGCGT CCGGGGACCT ACACCGTGAG CGAGACGCAG
CCGTCCGGCT ATCTCGACGG ACGCGACACA GCAGGAACGC TCGGCGGCGA CACGATGATC
AACGACCGGA TCGCAGGCAT CACACTGCCG CCAGGCGGCG CGAGCCTGAA CAACAACTTC
GGCGAACTGC GCCCGGCAAG CCTGTCGGGG TTGGTCTACC GCGACGACAA CGATAATGGT
GTGCCTGACG CGGGTGAGCC GGGGATCAGC GGCGTCACCA TCACCCTGAC TGGAACCGAC
GACCTGGGCA ACAGTGTCCT GCTAACAACC ACGACCACGA TCACCGGCTT CTACACCTTC
GACAACCTGC GTCCGGGGAC CTACACCGTC AGCGAGACGC AACCGGCGGC GTACAACGAT
GGACGCGACC GCGTCGGCAC GGAAGGCGGC GACCTGAGCA ACGATCAAGT CAGCACCATC
GTGCTTGGCG CAGGGGTCGA TGCGGCCAAC TACGACTTCG GCGAACTGGG AACCTTCGTC
AGCGGCGTCG TCTGGATCGA CACCAATCGT GATGGAACGC TCGACAGCGG CGAGAGCGGG
CGTCTCGGCG GCGTGACGAT TACGCTGCGC GACAGCCTGG GGAATGTCGT CTCCACAACG
ACCACCCTGG CAGACGGCAG CTACCGCTTC GACAACCTGC CGGCGGGCAA CTACACCATC
GAGCAGATGC AGCCGACCGG CTACGGCAGC TCGACGCCGA ACACGCTGAG CGTCACGGTT
CCACTGACCG GTCTGACCGA CCAGAACTTT GGCGAGACCG TCAGCACCCT GAGCGGCTTC
GTCTACGTGG ACAGCGACAA CGACGGCGTG TTCGACACCG GTGAAAGCGG TATCGGCGGC
GTGACCGTCA CCCTGCTCGA CGGCGTGGGG AATGTCGTGT CTACGACGCT GACCATGGCG
GATGGCAGTT ACCGCTTCGA GAACCTTCTG GCGGGGACCT ACACCATCAG CGAGACACAG
CCGCTGATCT ACAGCGACGG GCAGGACAGC GTCGGCACGA TTGGCGGCGC ACCGGTCGGC
ACGCTCGTCA GCAACGACGT GATCGGCGAC ATCGTTCTGC CCGCCGGAAT CGACGGCATC
ACCTACAACT TTGGCGAATT GGCAAACGCC GGGCTGGGTG ACCGCGTGTG GCTCGACCGC
AACGGCGATG GCGTGCAAGA CGCCGGCGAG CCGGGAATCG GCGGTGTGAC GGTCTACCTC
GACCTGAACA ACAACGGCTC CCTCAACGCC GGCGAACCCA CGGTAACGAC CGATGCGGAT
GGTCGCTACT TCTTCGGCGG TCTGGCGGGC GGAACATACA CCGTGCGCGT GGATGCGACC
ACACTGCCGG CAGGCGTCAG CCAGACCTAC GACCTGGACG GCGCAACCGC AACGCCACAC
ACCGCGATTG CATCGCTGGC GGCGGGTGCG ACCCGCACCG ACGTGGACTT TGGCTACCGG
GGCAGCGCCA GCATCGGCGA CCGCGTGTGG CTCGACCGCA ACGGCGATGG CGCGCAAGAC
GTCGGTGAGC CAGGAATCGG CGGTGTGACG GTCTATCTCG ACCTGAACAA CAATGGCGCG
CGTGATCCGA ACGAACCATT CGATGCCACC GATGCCAGCG GCAACTACCT GATCGATGGG
CTGCATGCCG GAACGTACAC CGTGCGGATC GATGCCTCGA CCCTGCCCGG CGGCATCGAT
GCCACCTACG ATCTGGACGG AATCGGGACG CCTGGCATGG TCACCGGTGT CACGTTGAGC
GACGGACAGG CGCGCATCGA CGTAGACTTC GGCTACCAGG GCAGCGCCAG CATCGGCGAC
CGTGTCTGGA ACGATGCCAA CGCCAATGGC ATCCAGGAAG GCGGAGAAAC GGGGGTCAGC
GGCATTGTCG TCGCGCTCTA CGACTCGACC GGCACGCTGC TGATCACCAC CACGACCGAC
CTGAACGGCA ACTACCTGTT CGACAACCTG CCCGCAGGGA CGTACACGGT GGGAATTGGC
GCAACGCCGG GACGCAGCAT CAGCCCGCGC GGAGCAGGAA GTGATCCGGC GCTCGACTCC
GACGTTGACC GCACCACGCG GCGCAGCAAC CTGATTACCC TGAGCACCGG CGAGGCGCGC
CGCGATCTCG ACATTGGTCT GTATCAACTG GCGTCGGTAG GCAGCCTGGT GTGGCTCGAC
CGCGACCTGG ACGGCATCCA TGAGGCGGAT GAACCGGGGA TCGGCGGCAT CGAAGTACGA
TTGCTGCGCA GTGACGGCAC ACTCGTTGCC ACACGGACCA CGGACGCCAA TGGGTACTAT
ATGTTCACCG ACGTTGAACC GGGTGAGTAT CGGATCGCAT TCAGCGTCCC GCCCGGCTAC
TACGTCAGCC CGCCGCACCG GGGAAGCGAT CGGGGCAACG ACAGCGACGC CGATCCGGCG
ACCGGTCAGA CGCCAATCTT CACCCTGACA CCAGGGCAGA TCGATCCGAC GTGGTACCTT
GGGCTGTCGC CGATCTCGCC GACCGCCATC CAACTGACGC GCTTCAGCGT CGAGCGCAGC
GCGAACGGCA TCGTCATTCG CTGGGAAACG GCGGCTGAAT TCCAGACGCG CGGTTTCCAC
ATCGAACGCA GCGCCAGCGG CAGTCGCAAC GATGCTGCGC GCATCACAGA TCGCCTCATT
CCCGCAAGAG GAAGCATCGG CAGCGGCGCG GCATACGTCT GGAGCGATAC AACAGCGGCG
CCAGGCGTTC GTTACACCTA CTGGCTGGTC GAAGAGACCA CCGACGGCTC AACACACATC
TACGGACCGG CGACGTCCGA AGCGACGACC GGCGGAACGT ACACGGTCGT TCTGCCGCTG
ATTATCCGGT AG
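
The gene sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before computing length or GC content. The following is a minimal sketch of that step, not part of the original record. The function names are hypothetical, and only the first line of the sequence is used as sample input. Running it on the full sequence is how one would compare against the GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Sample input: the first line of the gene sequence above.
first_line = "ATGAAAGTAA CACATTATCA GGTAACAGCC TGTTACGTTC TACACAAATC AACGTCACAA"
bases = clean_sequence(first_line)
print(len(bases), round(gc_fraction(bases), 3))
```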
 
Protein sequence
MKVTHYQVTA CYVLHKSTSQ NQERDRTMRR RPTLRFAPLL TLLVGALTIG SFFLGTIALA 
AGNPLPVATV NSASGLIGEP VTLTVTFDNA STGTGDQTGY GPYIDLFLDT TGPDGAPSFD
GLVTTDIRAT YLNQPLPGLE LITISGPTYV HPLTGQTLSV PGYGTRFQNG DTIAVLTLPF
GSFTNTQPPA PIDVTLRISN LADLGTPLPV TAIAGFRYGA DPLDNPGADP PIRQTTPSDA
DVTPILWRLT KTYLGPEDET ATGRNYPRRY RLDVDIAEGQ PVTNLQITDT LSNSMRITGN
TAAQMSARLY NTPGVPTEVF NPANLSGTAT PAAPGGTLVY TFGDKTGVRG VDASFEFEFY
IPRDTSAATP TVPQGTDSTL ANNTGSSSAT WTPIDTRDAP TSITITLPPD AHTLQQHSLA
TQKSVTPVDR ATLAPTGGVI IPGQTLLRYD IDFQVSDYFA VQNVYLEDII SDGQRLFVRT
SGGPSTVPTL QVENAYVTGA SPTRMNVATA PFGGANTIVY EQRFTTRSTP LSDPTSPPSG
SVFVINTPAP SSSGTTYVRF NISQELIARG FSGRLVGGDI PNAGGAPQNN NPPLFGPTQG
RITFYAEVKD EFSDDFPSGD RSVDQLDILR NTVPQIRGEQ INTTTINNPT PTVIGQATDD
TAASVELPIG DRSKTLYALN GQTTNLGDPV TVQPGDLVTY RLTYRLPISS FENIRLIDFP
PLPVFPVPDR MFFDAAAAPG TIPGANMVTR GPSDTYLTTI NPPTTVRVAT TANITGYNPA
GIGSFSGAPT TIDGVALANG DRVLVKDQTD ARQNGVYVVV DATTGAWNRA ADFDTPRELT
NGPLVGVTAG ATNANQHFRQ SNQNFNTFNT DPITWAPFIT TDAAGNSFTL NFGWHDDVDG
GRRSSLIDVL VTLRVADAAF ANDLFLTNQL RVSESSTNAG SKDFDEIVMF EVIRPNVTIN
KGIVGYNASG LTLGGVTFNP PDGPTTFTGA PVYTNTQALA IGASDSLVLT DAGDRVRYAI
VLQNEARGDA YDVTVTDAIP SAYARPATLA AANFAVRRGD GTLLTGDVVS GTVRVATTAP
LTGATFTTTP NNGQFTNAPR TIDGVTLNVG DRVLVKDQTS ATQNGVYVVT AVFPGVNQAT
LTRADDFDDN TELTGGYRVA VLGGLSSNAN RAFSATGPIT LNTTPVTWSD AGISDYYATY
NPLTGAFSVV LADNYTAGNT TAPTRDDRRG GLSRGASGPS SNIVGVTNGS NTIIITYDVT
LGDNVEPNQQ IINTATLTNV ATSDGGIDDL PDPFDTALVT IRRPEIAKTL TATEIENATN
ARQQAVIGEL VTYTLILTVP EGVMSNAALT DTLDVGLAFV DVTGVTASPS LSFVGGGLPT
VGATPANTTI GAGGQTLVFN FGAITNSDRN NAADETIAIT YRAVVLNTVA NQSGAQRNNR
ADFSWDVPGQ GSYALLPVET ENVTIIEPTL AVGKTLPPGP YDAGDTIDYT ITISHTAGSQ
TDAYDAVITD TLPIDPFGSG SLILTPAVLS VTDSASLLTT ADFTLTGSDA TSWTLSNPTP
IDVAQGRTIS IVVRGTLSNA AMPGATITNT AFLRWTSLDG NPGQRSTHAA DSTERTGADG
PGGALNDYAA TGSATFVIPT VTPGKALVAT SEAHTSDSNV TIGEIVRFRI AIGGPEASVY
AFSLVDRLPP GLTFLNDGSA RFAFIANSNI TTAGVYDVAP VSCPAINPTG VATLADVLNP
TILPSSSINC TFGDSNISSN ETTNQDVYGS GTDVFFRFGN LSNNDNDADE EFIVVEFNAI
VDNDTTDPND AGDVRNNTVV ARMNPPGFGA FETDPSAPTS VTVVEPNIAF NAATNNKTAT
PTSGDAFDLI TYTVTFANAT GAMVSDAFDA RILDTLPADV TLIPASISVS SAGGCATGVT
NASAGNTVDV TIGRVPPGCN VTITYQATLN VSVVPGQAIT NTASLTYTSL PGNTGTGGFF
GSTAGTSGSA TGERNGSGGI NDYAGSDTAM VTIVAPALAK RIVATSEAHT ADPLVLADFN
NAGFDGFLPG SVFTGGNNTW DDAPGNVTVF STFVRIAGNS TERGGGFITF ATPVDLSNHT
ALALSARLVA GNGANNIDIH LQDADGTNWR WRFPASNFTT STFTLVPQNL LGPNSTTIAA
GTTPGLDLRN IVRLEIRGDD GTAQFDIDID AVIAFGNIAA PGEIVRYRLV TTIPEGTSPN
LQLLDRIPTG MRFINDNTAR VAFVSTNGIT STSAGAQVPA LSGTGLNITG NQDSVVGLSL
AVGSGNGLAI GEGTAFDGNV SSSNDVLTDT DTYNNGSDVF FRLGNIVNND NDDDLEFIVI
EFNAQVLNVF NVGNQSGVGL NNDFQYFRNG SQVGSTSAQN GINRITVVEP QINNLSKTLA
SAPPVDAGDV FTYTLRFANG VAWPTSPAVP VRVATTGNIA GFNATGGFGG TGQFANAPAT
VDGVTLNVGD RILVRSQTNA AENGIYTLVF IDPFTGARVW DRATDMDDTA ELALGYRVFV
QEGATLAGRT YYLDEPVPTI NAGALNWREV APAMTVAVAT TGSLGGANFN SSGGLLGRGT
LTTTATTIDG IPVNATNFPV GTRILVKNQS TAANGVYQVT GFTSPTLTLE RVPEFDSHLE
AVIGAQVYVT GGTINAGRTF AVQTAPSETP ITTSTNFEIV DQVAAFDVTV FDQLPPTLEL
LGVQIDAPTG TTATNGSTLG VGGVISYTLD RLDSVQDITA GRNDVVITAT VRVVTGTVAS
AQITNTARVQ YTSLPGARGT LSNPTGSNVD AGTAGTQNGE RTGQGVLNPT NNTPPSNNGI
RNNYSVGAIA LNHLAAPMFD KQFQGGSISD DDSSVPGTSG ASVAVGEAVL YDLRVTLPEG
TTNNLRVVDA VPDGMRFDTS FNGGLGYQIV TTAGGSLAEN FSDPAAVASP TLSVSGTGTL
GQDGVDALFS FGNVTIADDN NPNNNSFIIR VRLIATNTAA NQSGASRDNG GALRYTNGYS
GGDNELRDPT EPRVTIIEPT PSILKSVSGA AADAGDPITY TLRIENIALR SEMDAYDVVI
SDTIAPELIN PTIVAVNVTG APDVSAADFE IATVGPDRIL RTTAPITLPL GATVTITFTG
DLTSTVTTDQ IITNRASMFW SSTPGTNPDE RTGADVPNPP ECSSLPGDCN LDSTQLNNYG
LISSVTTTAI APVVVSKSVV EGIAPSTPGT DVTIGEIVTY RLAVSLPEGV TSGLVITDTV
PSGMAYLPGS ATLITGTLAT GDPPLAGGHP TDAGSLAFNG VFSDTTDPSV TPIGSGQFLN
GTGIRFEFDQ ITLPGDNDAR NNTFFIRYQV VVLDVAGNTG FSGSQKTLTN SGQFDVPSTP
QPPTDLLIDP NGATVTVVEP ELAIAKSVTP ASGDAGDAVT YTITLSHTAR SLADAFDVAI
SDALPATVGS SLTAPITIAN VNATHSVDGD ITGNFSVAGN TLTTTTPFTL PRGETVTLTI
TGVLRQSVQP GEVITNTAVL TSTSLPGPNQ DLSPDATGVS DGTDRERSRS SSSSATHSVG
QGAFSKAILS TSATHTSGTD VTIGEVISYT LIVDLPEGTI RDLTLTDDLP AGLDYEGVTV
ITDAAQSGGL LTQDFAGTVP SITVTGGAGS GDDVTFTFTG DVVVTGDNDA ANNRFLVVAR
VRALNEPGMV GLIPPGQTIL TNTATMRYTD GSNITRAFTD TETVRVVEPQ LTITKDIVQM
VANAGDPITI TLTVTNIGAS DAFDVVITDT LPPEFDADTT SFGTAGSDYP ATFTPSRTGN
QVRYEGGPIP VGATMTFTFR VNLTGTVTPG TTITNTARMA QSTSLPGDDP TERVQTPVES
SDTLTIRSNS LSGFVYVDSD NDGVFDTGES GIGGVTITLS GTDHLGNSVL LTTTTTITGF
YRFDNLYPGV YTLLETQPSG YLDGTDAIGT QGGATGNDVL SNIVLPVDTS TNGENNNFGE
IPAARIAGFV YEDDNNDGVF DTGENGIGGV TVTLTGTDDL GNPVNLTTTT TITGFYAFDN
LRPGTYTVSE TQPSGYLDGL DTAGTLGGDT TVNDRIASIA LPPGAASLNN NFGELRPASL
SGLVYRDDNN NGSRDGSEPG ISGVTITLTG TDDLGNPVNL TTTTTITGFY TFDNLRPGTY
TVIETQPSGY FDGAETVGTA GGSILSNDVI GNITLNAGVA ATGYDFGEVP AARIAGFVYE
DDDNDGVFDT GENGIGGVTI TLTGTDDLGN PVSLTTTTTI TGFYAFDNLR PGTYTVSETQ
PSGYLDGRDT AGTLGGDTMI NDRIAGITLP PGGASLNNNF GELRPASLSG LVYRDDNDNG
VPDAGEPGIS GVTITLTGTD DLGNSVLLTT TTTITGFYTF DNLRPGTYTV SETQPAAYND
GRDRVGTEGG DLSNDQVSTI VLGAGVDAAN YDFGELGTFV SGVVWIDTNR DGTLDSGESG
RLGGVTITLR DSLGNVVSTT TTLADGSYRF DNLPAGNYTI EQMQPTGYGS STPNTLSVTV
PLTGLTDQNF GETVSTLSGF VYVDSDNDGV FDTGESGIGG VTVTLLDGVG NVVSTTLTMA
DGSYRFENLL AGTYTISETQ PLIYSDGQDS VGTIGGAPVG TLVSNDVIGD IVLPAGIDGI
TYNFGELANA GLGDRVWLDR NGDGVQDAGE PGIGGVTVYL DLNNNGSLNA GEPTVTTDAD
GRYFFGGLAG GTYTVRVDAT TLPAGVSQTY DLDGATATPH TAIASLAAGA TRTDVDFGYR
GSASIGDRVW LDRNGDGAQD VGEPGIGGVT VYLDLNNNGA RDPNEPFDAT DASGNYLIDG
LHAGTYTVRI DASTLPGGID ATYDLDGIGT PGMVTGVTLS DGQARIDVDF GYQGSASIGD
RVWNDANANG IQEGGETGVS GIVVALYDST GTLLITTTTD LNGNYLFDNL PAGTYTVGIG
ATPGRSISPR GAGSDPALDS DVDRTTRRSN LITLSTGEAR RDLDIGLYQL ASVGSLVWLD
RDLDGIHEAD EPGIGGIEVR LLRSDGTLVA TRTTDANGYY MFTDVEPGEY RIAFSVPPGY
YVSPPHRGSD RGNDSDADPA TGQTPIFTLT PGQIDPTWYL GLSPISPTAI QLTRFSVERS
ANGIVIRWET AAEFQTRGFH IERSASGSRN DAARITDRLI PARGSIGSGA AYVWSDTTAA
PGVRYTYWLV EETTDGSTHI YGPATSEATT GGTYTVVLPL IIR
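
The protein sequence can be spot-checked against the gene sequence by translating a few codons. The sketch below is an illustration rather than the tool that produced this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs only in its permitted start codons). `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in reading frame 1; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAAAGTAA CACATTATCA GGTAACAGCC"))
print(translate("ATTATCCGGT AG"))
```

The first call yields MKVTHYQVTA and the second yields IIR followed by a stop, matching the start and end of the protein sequence listed above.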