Gene RoseRS_1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1068 
Symbol 
ID5208014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1311535 
End bp1327035 
Gene Length15501 bp 
Protein Length5166 aa 
Translation table11 
GC content63% 
IMG OID640594682 
Producthypothetical protein 
Protein accessionYP_001275427 
Protein GI148655222 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCGCC GTTCACGCCT TGCGTCATTG TTTCTCGTGC TTGCAGTCAC GGCCGCGAGC 
CTGGGTCTGC TCGCATCAAA CGCTGTATTT GCCGGCGGCA ATCCGCTGCC GGTCGCAACG
GTGAACAATG CTACCGGTCT CATCGGCGAG ACGGTGACCC TGACCGTCAC CTTCGATAAT
GCCAGCACCG GCACGGGCGA TCAGACCGGT TACGGTCCCT ACATCGACCT GTTTCTCGAT
ACCACCGGTC CTGATGGCGC TCCGTCCTTC GATGGACTGG TGACGACCGG TATGCGGGCG
ACGTATCTGG GGCAACCGCT GCCGGGAATG GAAGTGATCC CGATTACCGG ACCGACGTAT
GTCCATCCGT TGACCGGACA GACGCTGTCG GTTCCCGGTT ATGGAACGCG CTTCCAGAAC
GGCGATACGA TTGTTGTGTT GACCCTGCTG TTCGGCAGTT TCACCAATAC GCAGCCGCCG
GCGCCGATCG ATGTATCGCT TCGCATCAGC AATCTGGCAG ACCTGGGAAC GCCGCTGCCG
GTGACGGCGG TTGCTGGCTT CCGCTACGGC GCCGACCCGC TTGACAACCC CGGCGCCGAT
CCGCCCATCC GCCAGACGAC GCCGTCCGAT GCTAATGTCA CACCGACACT CTGGACGCTG
ACGAAGACCT ACCTGGGACC TGAGGACGAA ACTGCGACCG GGCGCAACTA TCCGCGGCGC
TACCGCCTCG ATGTCGATAT TGCCGAGGGC CAGCCCGTGA CGAACCTTCA GATCACCGAC
GTGCTGGCGA ACAGCATGCG TATCACCGGC AATACTGCCG CACAGATGAG CGCGATCCGC
TACGATGCTC CAGGCATGCC CACCGAGGTC TTCACACCGG CGAACCTCTC CGGCACAGCC
ACGCCTGCCG CGCCCGGCGG TACGCTGATC TATTCGTTCG GCGACAAAAC CGGCGTCCGG
GGAATTGACG CCTCGTTCGA GTTCGAGTTC TACATCCCGC GCGATACCAG CGCTGCGACG
CCAACCGTGC CGCAGGGAAC CGACTCGACA TTCGCCAACA ATACTGGCAG TTCGTCGGCA
ACGTGGACGC CGATTGATAC CCGCGACACA TCCACGCCGA TCACGATCAC CCTGCCGCCC
AATGCGCATA CGCTCCAGCA GCATTCCCTG GCAACCCAGA AATCGGTGAC ACCCGTCGAC
CGCAACACGC TGAACCCTAC CGGCGGTGCA ATCATCCCCG GCTCGACGCT GCTGCGCTAC
GATATCGATT TCCAGGTGTC GGACTACTTC GCGGTGCAGA ATGTCTACCT CGAAGATATT
ATCTCCGATG GGCAGCGCCT GTTTGTGCGC ACATCAGGCA TAACCTCGAC CGTTCCGACG
CTCCAGGTCG AGAATGCTTA CGTGACCGGT GCATCCCCCA CGCGCACCAC TATCGCTGCT
GCGCCCTTCA GCGGCACGGG GGTGATCACA TACGAGCAAC GCTTCACGAC TCGCGGCACA
CCACTCTCCG ATCCCACCAG CCCGCCGTCG GGTCCAGTGT TCACGATTCT GACGCCGGCG
CCCTCGCTGA GCGGCACTAC GTATGTGCGC TTCGATATTT CGCGTGAACT GATCGCCCGT
GGCGTGAGCG GTCGTCTGGT CGGCGGCGAC ATTCCGAATG CCGGCGGTGC GCCCCAGAAC
AACAATCCGC CGCTCTTCGG TCCAACCAGA GGGCGCATTA CGTTCTACAC CGAAGTCAAG
GACGAGTTCA GCGATGATTT CCCCTCCGGC GACCGCTCGG TCGACCAGCT GGATATTTTG
AGGAATACGG TGCCGCAGAT CCGTGGCGAG CAGATCAATA CGACCACGAT CAACAATGCC
ACGCCGACGG TTATCGGGCA GGCGACCGAT GATACCGCCG CCAGCGTCGA ACTGCCGATC
GGCGACCGAA ACAAGACGAT CTACGCCCTC AACGGGCAGA CGACCAATCT CGGCGATCCG
GTGACCGTTC AGCCCGGCGA TCTGGTGACC TACCGGTTGA CATATCGCCT GCCGATCAGC
AGTTTCGAGA ATGTCCAGTT CATCGACTTC CCGCCGTTGC CGGTCTTCCC TGTGCCGGAT
CGACTCTTCT TCGACGCCTC CGCTGCGCCT GGCACGGTTC CCGGCGCGAA CATGGTGAGC
CGTGGACCTG GCGATACCTA CCTGACGACG GTCAACCCGC CAACCACGGT GCGCGTCGCA
ACCACCGGCA ATATGACCGG TTACAACCCT GCCGGCATTG GATCGTTCAG CGGAGCGCCC
TCCATCATCG ACGGCGTCAC CCTGGCGAAC GGTGACCGGG TGCTGGTGAA GGATCAGACC
GATGCGCGAC AGAATGGCGT CTATGTGGTT GTGAATGCTG CAAGCGGCGT CTGGAACCGC
GCCGCCGACT TCGACACTGC GCGCGAAATC ACGAACAGTC CACTCGTCGG CGTGACTGCC
GGCGCCACCA ATGCCGGTCA GCACTTCCGC CAGTCGAACC CCACGTTCAA CACCTTCAAC
ACCGATGCGA TCACGTGGGC GCCGTTCATC ACGACCGATG CCGCCGGGAA CAGTTTCACC
CTCAACTTTG GCTGGCATGA CGACGTGGAC GGCGGTCGAC GGGAGTCGTT GATCGACCTG
CTGGTGACCC TGCGCGTTGC TGATGCCGCA TTTGCGAATG ACCTCTTCCT GACCAACCAA
CTGCGCGTTA ATGAGTTTTC GACCAACGCC GGTTCGCAGT ACTTCGACGA GATCGTGATG
TTCGAAGTTA TTCGCCCGAA CGTGACGATC AACAAGGGCA TCGTCGGGTA CAACGGTGTC
GGTCTGACTC TCGGCGGGGT GACGTTCAAT CCGCCTGACG GTTCCACCGC CTTCGGCGGG
ACACCGGTCT ATACTGCAAC CCAGGCGAGC GCCATCGGTT CGTCCGATGT GTCACTGCTC
TCCGATGCCG GTGACCGGGT GCGGTATGCG ATTGTGCTGC AAAACGAAGG GCGCGGTGAC
GCATACGATG TGACTGTCAC CGATACTATT CCCGCCGACT ACATCCGCCC GGCGACGCTG
GCTGCCGCGA ATTTCGCCGT GCGCCGCGGC GATGGAACCC TGCTCACCGG CGATGTCGTC
AGCGGCACGG TGCGCGTGGC GACCACCGCA CCACTGACCG GCGCAACGTA TGCCACGTCG
CCGAATAACG GTCAGTTCAC CAATGCGCCG CGCATCATCG ACGGGGTGAC ATTGAACGTG
GGTGACCGCG TGCTGGTGAA AGACCAGAGT GTTGCTTCTC AGAATGGCAT CTATGTCGTG
ACGAGCGTCT TTCCCGGCGT CAATCAGGCG ACCCTCACCC GCGCCGACGA CTTCGACGAT
GATGCCGAAC TGACCGGCGG CTACCGGGTT GCGGTGCTCG GCGGACTGAG CAGCAACGCC
AACCGCGCCT TCAGCACGCC TGGACCGATC ACCCTCAATA CCACACCGAT CACCTGGACT
GATGCCGGCG TCAGCGATTA CTACGCCGTG TACAATCCGC TCACCGGCGT CTTCAGCGTG
ACGCTGGCGG ACAATTACAC GGCGGGCAAC ACCACCGCTC CAACCCGCGA TGATCGCCCC
GGCGGTCTCA GTCGCGGCGC GTCCGGTCCG GCGAGCAATG TCGTCGCAGT AACGAATGGC
TCAAATACGG TTATTATCAC CTATGACGTG ACCCTTGGCG ATAACGTCGA ACCCAACCGG
CAGATTATCA ACACGGCGAC CCTCACCCAT GTCGCCACCA GCGACGGCGG GGTGGATGAT
CAGCCCGATC CGTTCGACAC CGCGCAGGTC ACCATCCGCC GCCCAACCAT TGCCAAGACC
CTGACCGCTA CCGAAATCGA GGATTCGTTC AACAACCGCC AGCAGGCAGT CATCGGCGAA
CTGATCACGT ACACACTGAC GCTCACCGTA CCCGAAGGGG TCATGTCGAA CGCCACCCTC
ACCGACACCC TCGACGCGGG GCTGGCGTTC GTCGATGTGA CGGGAGTCAG CGCGTCGCCG
GCCCTCGCAT TCACCGGCGG CGGGCTGCCG ACGGTCGGCG CAACCCCGGC GAACACGACG
ATCGGCGCAG GCGGACAGAC CATCACCTTC AATTTCGGCG CGATCACCAA CAGCAATCGC
GACAACACCG TCGAAGAAAC GATTGTCATC ACCTACCGCG CCATCGTGCT GAACACGTCC
GCCAACCAGA GCGGCGCGCA GCGCAACAAC CGCGTCGAGT TCTCGTGGGT TGTTTCCGGT
CAGGGTTCAT ATGCGCTGAC GCCGGTCGAA GCCGCCAATG TCACGATCAT CGAGCCGACC
CTTGGCGTCG CCAAGACCCT CCCGCCCGGC GTGTACGATG CCGGTGACCT GATCGACTAC
ACGATCACCT TCAGCCATAC TGCTGGCAGC CAGGGAACCG CATACGACGC GGTGCTCACC
GATGCGTTGC CGATCGATCC GTTCGGCAGC GGCTCGTTGA TCCTCACCCC GACAATCCTG
AGCGTCACCG ATAGCGCCGG TACGTTGACT ACTGCCGACT TCACGCTGAC CGGCAGCAAT
GCGACATCCT GGACGCTCAG CAACCCGACG CCGGTTGACG TTGCGCCCGG TCGCGTTGTG
ACGATCACCG TGCGCGGCAC GCTCTCCAAC GCGGCGATGC CCGGCGCGGC AATCACGAAT
ACCGCCTTCC TGCGCTGGAC ATCGCTCGAC GGGACTCCAG GGCAGCGTTC GACGCACAAC
GCTGCTTCGA CCGAGCGCAC CGGCGATGAT GGACCCGGCG GCGCGTTGAA CGATTATGCC
GCCACCGGCT CGGTTGCCTT CATCGTGCCG ACGGTGACGC CGGGCAAGGC GCTGATCGCC
ACCTCTGAAG CGCACACCTC CGGCAGCAAC GTTGCCATCG GTGAAATTGT GCGCTTCCGC
ATTGCCATTG GCGGACCGGA AGCCTCGGTC TATGCCTTCT CGCTGGTTGA TCGACTCCCG
CCAGGTTTGA CCTTCCTCAA CGATGGCTCG GCGCGGTTCG TCTTCATCGC CGATAACAAC
ATCACGACCG CAGGCGTCTA CGATGTTGCA CCGGTATCGT GCCCGGCAAT CAGCCCGGCT
GGGGTAACGA CGCTGGCGGA TGTGCTGAAT CCAGCAACCC TTCTCTCATC GAGCATTAAC
TGCACCTTTG GAGACAGCAA CATTTCCAGC AACGAAACGG TCAATCAGGA TGTCTACGCC
AGCGGAACGG ACGTGTTCTT CCGCTTCGGT AATCTGTCGA ACAACGACAA TGATCCGGGT
GAAGAGTTTA TCGTTGTTGA GTTCAACGCA ATCGTCGATA ATGACACGTT TGACCCGAAC
GATGCCGGCA ATGTGCGTTC CAACACCGTC GTTGCGCGCA TGAATCCGCC CGGCTTCGGC
GCATTCGAGA CCGCTCCGTC GCCAGAGGTG ACGGTGACGG TGGTTGAACC GAACATTCCC
TTCAACGTCG CGACAAATAA CAAAATTGCG ACGCCGACAT CGGGCGATGC GTTCGATGTG
ATCACCTACA CGGTCACCTA CACGAACGCG ACTGGCGTGA CAGTGACCGA CGCCTTCGAT
GTGCGTCTCC TCGATGTCCT GCCCACAGAT ATGACCCTGA TCACCGGGAG TGTCGCAGTC
ACATCTTCCT GCGCCACCGG CATCACCAAC AGCAGCGCCG GGAACACGGT CGATGTCACC
GTCGGGAGCG TGCCGCCGGG GTGCAGTGTG ACCGTCACCT ACCAGGCGAC GCTGAATGTG
TCGGTCATTC CGGGGCAGAC GATCACCAAC ACTGCAACGC TGACCTACAC CAGTTTGCCG
GGGAGCAGCG GCACCGGCGG CTTCTTCGGC TCAACGGCGG GTTCCGGCGG CAGCGCCACC
GGCGAGCGCA ACGGTTCTGG CGGGATCAAC GACTATGCCG GCAGCGACAC GGCGACGGTC
ACCATCGTTG CACCGACGCT GGCGAAGCGG ATCGTCGCCA CGTCCGAAGC GCACACTGCC
GACCCGCTGG TGCTGGCGGA TTTCACGAAC ACTGGGTTTG ACGGCTTGAC AGGTTCCTGG
AATACAAACG TCTTCACATA CCCGACCTTT GTGCGCATCT CAGGAAGTGC AACCGAAAGT
GGCGGGGGGT ACATCACCTT CGCTTCGCCG GTTGACCTCT CCAATCACAC GGCGCTGGCG
CTTTCGGCAC GCCTGGTCGC CGGGAACGGC GCCGACAACA TCGATGTGCA TCTCCAGGAC
GCCGACGGCA CCAACTGGCG CTGGCGCTTC CCTGCGAGCA GTTTCAACTT CTCGACATTC
ACGCTCGTGC CACAGAGCCT GATCGGACCC AACAGCACAA CCATCGCCGC CGGTACAACA
CCGGGTCTCA ACCTGCGCAA CATCACCCGG CTCGAAATCC GCGGCGACAA TGGCACGGCT
CAGTTCGCCA TCGACATCGA TGCGGTCATC GCGTTCGGCA ATGCGGCCGT GCCGGGCGAG
ATTGTGCGTT ACCGGCTGAT CGCCACCATT CCCGAAGGCA CATCGCCGAA CCTGCAACTG
CACGACCGCA TTCCCTTCGG CATGCGCTTC ATCAACGACG ATACCGTGCG CGTGGCGTTT
GTCAGCAATA GCGGCGTCAT CACTTCGACA AGCGCTGGTG CGCAGGTTCC CGCGCTCAGC
GGCGCAGGAT TGAATATCAC CGGCAACCAG GATAGCGTCA TCGGGTTGAG CCTGGCAGTC
GGAAGCGGCA ACGGGCTGGC AATCGGCGAA GGAACGGCAT TTGACGGCAA TGTTTCCAAC
ACCAGCACCG TTACTACTGA CACCGACACG TTCAACGACG GGAGTGATGT GTACTTCCGC
CTGGGGAACG TCGTGAACAA CGATAACGAC GATGATCTGG AGTTTGTGGT TGTCGAGTTC
AATGCTCAGG TGCTCAATAC ATTCAGCGAT GGTAACCAGA GCGGCAGGCA ACTGAACAAC
GATTTCCGCT ACCTCCGCAG CGGCATCCAG GTTGGCACGA CGTCTGTGAC CAACGATGTC
AATCGCGTCA CCGTCGTCGA GCCGCAGATC AACAATCTGA GCAAGACGAT CAGCGGCGCG
CCTCCGGCAG ACGCTGGTGA CGTGTTCACT TACACGCTGC GCTTCGCCAA CGGCGTCGCA
TGGCCCTCCT CGCCGGCTGT GCCGGTGCGC GTTGCCACGA CCGGCAACAT CGCCGGGTTC
AATCCGACTG GCGGCTTCGG CGGAACCGGT CAGTTCACCA ATGCGCCCGC AACCGTCGAT
GGCGTGACGT TGAATGTCGG TGACCGCATC CTGGTGCGCA GCCAGGCGAA CCCGGCGCAG
AACGGCGTCT ATACGCTGGT CTTCATCGAC CCGTTCACCG GCGCGCGCAC CTGGGATCGC
GCTACCGACC TGGACAGCAA CGCTGAACTG GCGATCGGCT ACCGCGTGAT CGTTCAGGAA
GGCTCGCTGG CAGGCAGAAC CTACTACCTG GATGAGCCGT TGCCGTCGTC GATCAACGCC
GGACCGATCC TCTGGCGCGA CGTCGATCCG GCGCTGACGG TCGCAGTCGC AACGACCGGC
AATCTGGGCG GCGGAACGTT CGACGCGACC GGCGGAACGC TCGGACGCGG CACGCTCAGC
ACAACGGCAA CCGTCATCGA CGGAGTAACG GTCAACGCAA CCAACTTCCC GGTCGGCACA
CGCATTCTCG TCAAAAACCA GACGACCCAA AGCCAGAACG GCATCTACCG CGTGACCGGT
TTCAGCGGAT CGACCATGAA CCTCGAACGG GTGGCTGAAT TCGACTCTCG CCCGGAAGCG
ATTGACGGCA CGCAGGTGTA CGTCACCGGC GGAACGATCA ACGCCGGGCG CACGTTCGCG
GTCTCAGGGG CGCCCTCTGG AACACCGATC ACAACCTCAC TGACATTCGT GCTGGTCGAT
CAGGTGACCG CCTTCGATGT GACTGTCTAC GACCAGTTGC CGCCGACCCT TGAACTCCTC
GGCGTGCAGA TCGACGCGCC GACGGGGACG ACGGCGACCA ATGGTTCAAC TCTGGGTGTC
GGCGGGGTGA TCAGTTACAC GCTCGACCGG CTCGACTCAG TCCAGGACAT TACCTCCGGC
AGAACCGATG TGGTCGTCAC GGCGACGGTG CGCGTGGTTG CGGGAACGGT CGCCGGCGCG
CAGATCACCA ATACGGCGCG GGTGCAGTAC ACCAGTCTCC CCGGCGAGCG CGGCACAATC
GCCAACCCCA CCGGATCGAG CGTCGCAGCC GGAACTGCCG GAACGCAGAA CGGGGAGCGC
ACCGGCGGCG ACGTGCCCAA CCCGATCAAC AACTCGTCGC CATCGATCAA CACCATTCGG
AACAACTACA GCGTTGGCGC AATCGCGCTG AACCGCCTGG CGCAACCGTC CTTCGACAAG
CAGTTCCAGG GTGGTTCGAT CTCCGATGAC GACACCAGCG TGCCGGGCAC GTCCGGCATC
AGCGTCACCG TTGGCGAAGC GGTACTCTAC GACCTGCTCG TCACCATGCC CGAAGGAACC
ACCAACAATC TGCGGGTGGT CGATGCCGTT CCCGATGGGA TGCGCTTCGA CACGTCGTTC
AACGGCGTCG GCTATCAGAT CGTTACTTCT GCTGGCGGTC TGCTCACCGA CAACTTCAGC
AATCCGGCGG CGGTTGCATC GCCGACCCTG ACCGTCTCCG GCACAGGCAC GCTCGGCGAC
GATGGTGTGG ACGCGCTGTT CACGTTCGGG AATGTCACGG TTGATGATGA CAACAACCCG
AACAACAATG CCTTCATCAT CCGCGTGCGC CTGATTGTGA CGAACACAAC GGCGAACCAG
AGCGGCGCAA CCCGCGCCAA CGGCGGTGCG CTGCGCTACA ACGATGGGTA CAGCGGCGCG
GATGTGACAC TGCGCGATCC GACCGAGCCG CAGGTGACGG TTGTCGAGCC GACGCCATCG
ATCCTGAAGT CGGTCAGCGG CGCGGCAGCC GACGCCGGCG ACCCGATCAC TTACACGATC
CGTCTGGAGA ACAGCGCGCC GCGCAGCGAG ATGACCGCCT ACGATGTGGT GATCAGCGAC
ACGATTGCGC CCGAACTGAT CAACCCGACG ATTGTCGCGG TCAGCGCAAC AGGCGCGCCC
GTGAGCGTCG CCGACTTCGA GATCGTAACT GTTGGACCGG ATCGCATCCT GCGCACGACT
GCGCCGATCA CCCTGCCGCT CGGATCGACG GTGATCATCA CCTTCACCGG CGACCTGACG
AGCACAGTGA CGACCGATCA GATCATTACC AACCGTGCGT CGATGTTCTG GAGCAGCACT
CCCGGCGCCA ACCCTGATGA GCGCAATGGG AGCGGTGTTC CCAACCCGCC GGAGTGCAGC
AGTCTGCCCG GCAACTGCAA CCTGGACAAC ACGCAACTCA ACAATTACGG TCTCATCTCA
TCGGTGACAA CCACCGCCAT TGCGCCGGTG GTCGTCTCGA AGTCGGTGAT CGAAGGGGTG
GCGCCTTCGA CGAGTGGCAC GAATGTCACC ATCGGTGAGA TCGTCACCTA CCGCCTGGCG
GTCAGTCTGC CGGAAGGCGT CACGAGCGGG CTGGTCATCA CCGATACGCT TCCAGCAGGC
ATGGCGTACC TGCCGGGCAG CGCCACGCTT GTGACCGGCA CGCTTGCCAC GGGTGACCCG
GCGCTCAGCG GCGGGCATCC GGCGGGCGCT GGCACGCTCG CCTTCAACGG CGTCTTCAGC
GATCCGACCG ATCCGGCGGT CACGCCGATT GGCAGCCTGC AATTCCTCAA CGGAACCGAC
GTGCGCTTCA CGTTTGATCA GATCACGTTG CCGGGCGACA ACGACACCCG CAACAATACG
TTCTTCATCC GCTACCAGGC GGTCGTGCTC GATGTGCCCG GCAACACCGG CTTCACCGGC
AGCCAGACAA CACTGACGAA CAGCGGGCAA TTCGATGTGC CCTCGACGCC GCAACCGCCG
GTCGACCTGT TGATCGATCC CAACGGCGCA ACCGTCACCG TCGTCGAGCC GGAACTGGCG
ATTGCCAAGT CGGTCGCTCC GACCAGCGGC GATGCCGGCG ACCCGGTGAC CTACACGATC
ACCCTGAGCC ATACGGCGCG CAGCCTGGCG GATGCCTTCG ACGTGGCGAT CACTGATGCG
CTGCCAGCAA CGATCGGCTC ATCGCTCACG GCGCCGATCA CGATTGCACA GGTGAATGCC
ACGCACTCGG TTGATGGTGA TATTACCGGC AACTTTGGCG TCACGAGCAA CACGCTGACG
ACCACGACAC CGTTTACGCT GCCGCTCAAC GCAACCGTCA CGGTGACGAT CACCGGCGTG
TTGCGGCAAA GCGTACAGCC CGGCGAGACG ATCACCAATA CCGTCGCTCT GACCTCCACC
AGTCTGCCCG GTCTGAACCA GGACCTCAGC CCGGACACGA CCGGCGTCAG CGATGGAACC
GACCGCGAGC GCAGCCGCAG CGCCAGCAGC AGCGTGACGC TTTCCACCGG GCAGGGCGCG
TTCAGCAAGG AGCTCTTCGC CACCAGCGCA CCACATACGA GCGGCGAGGA CATCACCATC
GGCGAGATCA TCAGTTATAC CCTGATCGTC GATCTGCCGG AAGGCACGAT CCGCAACCTG
GTGTTGACCG ACGATCTGCC TGCGGGTCTC GACTATGAAG GGTTCACGGT CATCACCGAC
GCAGCGCAGA GCGGCGGACT GCTGACGCAG GACTTCGTCG GAACCGTTCC GTCGATCACC
GTGACCGGCG GCGCGGGCAG CGGCGATGAT GTCACCCTGA CGTTCGATAG CGACGTTGTG
GTGACGGGCG ACAACAATGC GGCGAACAAC CGCTTCCTGG TCGTAGTGCG CGTGCGGGCG
CTCGACGAAC CGGGGATGGT CGGGCTGATC CCGCCGGGGC AGACCACCCT GACGAACACC
GCAGCGATGC GCTACACCGA CGGCTCGAAC ATCACGCGAA CGTTCACCGA TACCGAAACG
GTGCGCGTCG TCGAACCGCA GCTGACGATC ACCAAGAATA TCGTTCAGAC TGTTGCAAAT
GCGGGTGATC CGATCACGAT CACGCTGACG GTGACAAACA CTGGCACATC GGATGCCTTC
GATGTTGTCA TTACCGACAC GCTGCCACCC GATTTCGACG CTGCAACCAC GACCTTCGGC
GCAGCGGGAA GCGACTATCC GGCGACGTTC ACCCCATCGC GCACCGGTAG CGAGGTGCGC
TACGAGGGCG GACCGATCCC GGCTGGCGCG ACGGTGACCT TCACCTTCCG CGTCAACCTG
ACCGGTACAG TGACGCCCGG CGCGACGATT GTCAACACGG CGCGCATGGC GCGTTCGACC
AGTCTGCCAG GCGATGATCC GACGGAGCGG GTGCAGACGC CGGTCGACTC GTCCGACACC
ATGACCATCC GCAGCAACAG TCTGAGCGGC TTCGTGTATG TCGATGCCGA CAACGACGGC
ATCTTCGACA CGGGCGAAAG CGGCATCGGC GGCGTGACCA TCACGCTGTC GGGAACCGAT
CATCTGGGCA ACAGCGTGTT GCTGACGACC ACGACTACGA TCACTGGCTT CTACCGCTTT
GACAACCTGC GTCCGGGAAC CTACACGCTG GCGGAAACCC AGCCAGCCGG TTACCTGGAT
GGACGAGACA CCATTGGCAC ACAGGGCGGC ACGACCGGCA ACGATGTGTT GAGCAACATT
GTTCTGCCAG TTGACGCCTC GACCAACGGG GAGAACAACA ACTTTGGCGA ACTGCTGCCC
GCGCGCATCG CCGGGTTCGG GTACGAGGAC GACGACAACG ATGGCGTGTT CGACACGGGC
GAAAGCGGCA TCTCCGGCGT CACCATCACC CTGAGCGGGA CCGACGACCT GGGCAACAGC
GTGCTGCTGA CGACCACGAC CACCGTGACC GGCTTCTACG CCTTCGACAA CCTGCGTCCG
GGAACCTACA CGGTCAGCGA GACTCAGCCG ACCGGCTATC TCGACGGGCG CGACACGGCA
GGAACCCTCG GCGGCGACAC CACGCTCAAC GACCGGATCG CCAGCATCAC CCTGCCTTCC
GGCGTTGCCA GCCTGAACAA CAACTTCGGC GAACTGCGTC CGGCGAGCCT GTCGGGACGG
GTCTACCGCG ACGACAACAA CAATGGCAGC CTCGACGCGG GCGAGCCGGG CATCTCCGGC
GTCACCATCA CGCTGAGCGG GACCGACGAC CTGGGCAACA CGGTCACCCT GACGACGACG
ACCGATGCAA GCGGCGTCTA CACCTTCACC AACCTGCGTC CGGGGACCTA CACGGTCAGC
GAGACCCAGC CGTCCGGCTA TCTCGATGGC GCAGAGAGCG TCGGCAGCGC GGGCGGAAGC
GTCATCACCA ACGATGTCAT CGGCAACGTC ACCCTGAGCG CAGGCGTTGC AGCGACCGGC
TACAACTTCG GCGAACTGCT GGCGGCGCGC ATCGCCGGGT TCGTGTACGA GGACGACGAC
AACGATGGCG TGTTCGACAC GGGCGAAAGC GGCATCTCCG GCGTCACCAT CACCCTGAGC
GGCACCGACG ACCTGGGCAA CGGCGTGCTG CTGACGACCA CGACCACGAT CACCGGCTTC
TACGCCTTCG ACAACCTGCG TCCGGGGACC TACACGGTCA GCGAGACCCA GCCGTCCGGC
TATCTCGACG GACGCGACAC GGCGGGAACC CTCGGCGGCG ACACCACGCT CAACGACCGG
ATCGCCAGCA TCACCCTGCC TTCCGGCGCT GCCAGCCTGA ACAACAACTT CGGCGAACTG
CGTCCGGCGA GCCTGTCGGG GCGGGTCTAC CGCGACGACA ACAATGACGG CATCCCTGAT
GCAGGCGAAC CGGGCATTGG CGGGGTTACC ATCACCCTGA CCGGAACCGA CGACCTGGGC
AACAGCGTGC TGCTGACGAC CACGACCAGC ATCACCGGGT TCTACACCTT CGACACCCTG
CGTCCGGGGA CCTACACGGT CAGCGAGACG CAACCAATCG CGTACAATGA CGGAATTGAT
CGCTCTGGGA CGGCAGGCGG CGACCTGATT AACGATCAGG TCAGCAACAT CGTGCTTGGC
GCAGGGGTCG ATGCCGTGAA TTACGACTTC GGCGAACGGG GAACCTTCGT CAGCGGCATC
GTCTGGATCG ACACCGACCG CGACGGAACG CTCGACGGTG GTGAAAACGG ACGACTCGGC
GGCGTCACCG TCACGCTGTC GGGAACCGAT CTCCTGGGCA ACAGTGTGCT GCTGACGACC
ACAACGACGA TCACCGGCTT CTACATCTTT GACAACCTGC CAGCTGGCAC CTATGTCATT
GAGCAGACGC AGCCAACCGG CTACGGCAGT TCGACCCCCA ATACCCTGAG CGTGACTGTC
CCGCTCACAG GGTTGACCGA TCAGAACTTC GGTGAGACCG TGAGCACCCT GAGCGGCTAC
GTGTATGTGG ACAGCAACAA CAATGGTGTG TTCGATGCGG GTGAAAGCGG CATTGGCGGC
GTGACGGTCA CGCTGACCGG AACTGACGTC AACGGCGTAT CCGTGACTCT TACGACGTTG
ACCCTGGCGG ATGGCAGTTA CCGCTTCGAG AACCTGTTGG CGGGAACCTA CACCATTAGC
GAGACGCAGC CGCTGATCTA TAGCGATGGG CTGGAGAGCA TCGGCACGAT TGACGGAACA
CCCGTCGGCA TGCTCGTCAG CAACGACGTG ATCGGCAACA TCACACTGTC AGCAGGAACC
GACGGCATCA ACTACAACTT TGGGGAACTG GCTGACGCAG GGTTGGGTGA CCGCGTGTGG
CTCGACCGCA ACGGCGACGG CGCACAGGAC CCTGGTGAGT CGGGGATTGG CGGCGTGCAG
GTGTATCTCG ACCTGAACAA CGATGGTGTG CTTGATGCAG GCGAGCCGGT GACCACCACC
AACGCGCTCG GTCAGTACTT CTTCGGCAAT CTGCCGGGCG GAACGTACAC CGTGCGCGTG
GATACGACCA CGCTGCCGGG AGGCGTCGGC CAGACCTACG ACCTGGATGG CGCAACCGTA
ACGCCGCATG CGGCAACTGC ATCGCTGGCG GCAGGCGCGA CCCGCACCGA CGTGGACTTC
GGCTACCGTG GAACCGCCAG CATCGGCGAC CGCGTGTGGC TCGACCGCAA CGGCGATGGC
GTGCAGGACG CCGGTGAGCC AGGTCTGAGC GGCGTTATCG TCTACCTGGA CCTGAATGGC
AACGGCGTGC GTGATGCCGA TGAGCCGTTT GACGCCACCG ATGCGAGCGG CAACTATCTG
ATCGGCGGGC TGCTCGCCGG AACGTACACC GTGCGGGTGG ACGCTTCAAC CCTCCCCGAC
GGGGTCAGCG CCACCTACGA TCTGGACGGC GTCGGGACGC CTGGCGTTGT TACCGGCGTG
ACGCTCAGTA GCGGTCAGGC GCGCACCGAC GTCGACTTCG GCTACCGTGG AACCGCCAGC
ATCGGCGACC GCGTGTGGAA CGACGCCAAC GCCAACGGCA TCCAGGACGC CGGAGAAACG
GGAGTCAGCG GCATTGTTGT CGCACTCTAC GACTCGACCG GCACGCTGCT GATCACCACG
ACGACCGACC TGAACGGCAA TTATCTGTTC GATAACCTGC CCGCCGGGAC GTACACCGTG
GGGGTTGGCG CGACGCCGGG GCGCAGCATC AGCCCGCGTG GAGCAGGAAG CGATTCGGCG
CTCGACTCCG ACATTGATCG GGCGACCCGG CGCAGTAACC CGATCACACT TGCCATCGGC
GAGGATCGCC GCGACATCGA CATCGGTCTG TATCAACTGG CATCGGTTGG AAGCCTGGTC
TGGCTCGACC GCGACCTGGA CGGCATCCGC GAGGCGGATG AACCGGGGAT CGGTGGCATC
GAGGTACGCT TGCTACGCAG CGATGGAACG GTCGTCGCCA CACAGATGAC CGATGCAAAT
GGCTACTTCA TGTTCACCGA TGTCGAACCG GGCGAGTACC GGATCGCGTT CAGCGTCCCA
TCCGGCTACT ACGTCAGCCC CTTCCGGCAG GGGAACGACC GCAGCACCGA CAGTGACGCG
GACCCGGCAA CCGGTCTGAC GCCGATCTTC ACCCTGACGC CAGGGCAGAT CGACCCGACC
TGGTTCATGG GGCTGTCGCC AATCTCACCG ACCGCCATCC AGTTGACGCG CTTCAGCGCC
GAACGCGGCG CACAGGGCGT CGTCGTGCGC TGGGAGACGG CTGCGGAGTA CGGAACGCGC
GGCTTCTACC TGGAACGCAG CGCAACCGGC AGCCGCAGTG ATGCCGTGCG GATCACCGAT
CGCCTGATCC CGGCGAGGGG CAGCGTCAGC AGCGGTGCGG CATACGAGTG GAACGACACG
ACAGCCGCGC CGGGAACACG CTACACCTAC TGGTTGATCG AAGAGACCGT CGATGGCTCG
ACGCACATCT ACGGACCGGC GACTCTCGCA TCAACGACGG GCGGCGGGTA TACTGTGATG
CTGCCGCTCA TCGTGCGGTA G
 
Protein sequence
MHRRSRLASL FLVLAVTAAS LGLLASNAVF AGGNPLPVAT VNNATGLIGE TVTLTVTFDN 
ASTGTGDQTG YGPYIDLFLD TTGPDGAPSF DGLVTTGMRA TYLGQPLPGM EVIPITGPTY
VHPLTGQTLS VPGYGTRFQN GDTIVVLTLL FGSFTNTQPP APIDVSLRIS NLADLGTPLP
VTAVAGFRYG ADPLDNPGAD PPIRQTTPSD ANVTPTLWTL TKTYLGPEDE TATGRNYPRR
YRLDVDIAEG QPVTNLQITD VLANSMRITG NTAAQMSAIR YDAPGMPTEV FTPANLSGTA
TPAAPGGTLI YSFGDKTGVR GIDASFEFEF YIPRDTSAAT PTVPQGTDST FANNTGSSSA
TWTPIDTRDT STPITITLPP NAHTLQQHSL ATQKSVTPVD RNTLNPTGGA IIPGSTLLRY
DIDFQVSDYF AVQNVYLEDI ISDGQRLFVR TSGITSTVPT LQVENAYVTG ASPTRTTIAA
APFSGTGVIT YEQRFTTRGT PLSDPTSPPS GPVFTILTPA PSLSGTTYVR FDISRELIAR
GVSGRLVGGD IPNAGGAPQN NNPPLFGPTR GRITFYTEVK DEFSDDFPSG DRSVDQLDIL
RNTVPQIRGE QINTTTINNA TPTVIGQATD DTAASVELPI GDRNKTIYAL NGQTTNLGDP
VTVQPGDLVT YRLTYRLPIS SFENVQFIDF PPLPVFPVPD RLFFDASAAP GTVPGANMVS
RGPGDTYLTT VNPPTTVRVA TTGNMTGYNP AGIGSFSGAP SIIDGVTLAN GDRVLVKDQT
DARQNGVYVV VNAASGVWNR AADFDTAREI TNSPLVGVTA GATNAGQHFR QSNPTFNTFN
TDAITWAPFI TTDAAGNSFT LNFGWHDDVD GGRRESLIDL LVTLRVADAA FANDLFLTNQ
LRVNEFSTNA GSQYFDEIVM FEVIRPNVTI NKGIVGYNGV GLTLGGVTFN PPDGSTAFGG
TPVYTATQAS AIGSSDVSLL SDAGDRVRYA IVLQNEGRGD AYDVTVTDTI PADYIRPATL
AAANFAVRRG DGTLLTGDVV SGTVRVATTA PLTGATYATS PNNGQFTNAP RIIDGVTLNV
GDRVLVKDQS VASQNGIYVV TSVFPGVNQA TLTRADDFDD DAELTGGYRV AVLGGLSSNA
NRAFSTPGPI TLNTTPITWT DAGVSDYYAV YNPLTGVFSV TLADNYTAGN TTAPTRDDRP
GGLSRGASGP ASNVVAVTNG SNTVIITYDV TLGDNVEPNR QIINTATLTH VATSDGGVDD
QPDPFDTAQV TIRRPTIAKT LTATEIEDSF NNRQQAVIGE LITYTLTLTV PEGVMSNATL
TDTLDAGLAF VDVTGVSASP ALAFTGGGLP TVGATPANTT IGAGGQTITF NFGAITNSNR
DNTVEETIVI TYRAIVLNTS ANQSGAQRNN RVEFSWVVSG QGSYALTPVE AANVTIIEPT
LGVAKTLPPG VYDAGDLIDY TITFSHTAGS QGTAYDAVLT DALPIDPFGS GSLILTPTIL
SVTDSAGTLT TADFTLTGSN ATSWTLSNPT PVDVAPGRVV TITVRGTLSN AAMPGAAITN
TAFLRWTSLD GTPGQRSTHN AASTERTGDD GPGGALNDYA ATGSVAFIVP TVTPGKALIA
TSEAHTSGSN VAIGEIVRFR IAIGGPEASV YAFSLVDRLP PGLTFLNDGS ARFVFIADNN
ITTAGVYDVA PVSCPAISPA GVTTLADVLN PATLLSSSIN CTFGDSNISS NETVNQDVYA
SGTDVFFRFG NLSNNDNDPG EEFIVVEFNA IVDNDTFDPN DAGNVRSNTV VARMNPPGFG
AFETAPSPEV TVTVVEPNIP FNVATNNKIA TPTSGDAFDV ITYTVTYTNA TGVTVTDAFD
VRLLDVLPTD MTLITGSVAV TSSCATGITN SSAGNTVDVT VGSVPPGCSV TVTYQATLNV
SVIPGQTITN TATLTYTSLP GSSGTGGFFG STAGSGGSAT GERNGSGGIN DYAGSDTATV
TIVAPTLAKR IVATSEAHTA DPLVLADFTN TGFDGLTGSW NTNVFTYPTF VRISGSATES
GGGYITFASP VDLSNHTALA LSARLVAGNG ADNIDVHLQD ADGTNWRWRF PASSFNFSTF
TLVPQSLIGP NSTTIAAGTT PGLNLRNITR LEIRGDNGTA QFAIDIDAVI AFGNAAVPGE
IVRYRLIATI PEGTSPNLQL HDRIPFGMRF INDDTVRVAF VSNSGVITST SAGAQVPALS
GAGLNITGNQ DSVIGLSLAV GSGNGLAIGE GTAFDGNVSN TSTVTTDTDT FNDGSDVYFR
LGNVVNNDND DDLEFVVVEF NAQVLNTFSD GNQSGRQLNN DFRYLRSGIQ VGTTSVTNDV
NRVTVVEPQI NNLSKTISGA PPADAGDVFT YTLRFANGVA WPSSPAVPVR VATTGNIAGF
NPTGGFGGTG QFTNAPATVD GVTLNVGDRI LVRSQANPAQ NGVYTLVFID PFTGARTWDR
ATDLDSNAEL AIGYRVIVQE GSLAGRTYYL DEPLPSSINA GPILWRDVDP ALTVAVATTG
NLGGGTFDAT GGTLGRGTLS TTATVIDGVT VNATNFPVGT RILVKNQTTQ SQNGIYRVTG
FSGSTMNLER VAEFDSRPEA IDGTQVYVTG GTINAGRTFA VSGAPSGTPI TTSLTFVLVD
QVTAFDVTVY DQLPPTLELL GVQIDAPTGT TATNGSTLGV GGVISYTLDR LDSVQDITSG
RTDVVVTATV RVVAGTVAGA QITNTARVQY TSLPGERGTI ANPTGSSVAA GTAGTQNGER
TGGDVPNPIN NSSPSINTIR NNYSVGAIAL NRLAQPSFDK QFQGGSISDD DTSVPGTSGI
SVTVGEAVLY DLLVTMPEGT TNNLRVVDAV PDGMRFDTSF NGVGYQIVTS AGGLLTDNFS
NPAAVASPTL TVSGTGTLGD DGVDALFTFG NVTVDDDNNP NNNAFIIRVR LIVTNTTANQ
SGATRANGGA LRYNDGYSGA DVTLRDPTEP QVTVVEPTPS ILKSVSGAAA DAGDPITYTI
RLENSAPRSE MTAYDVVISD TIAPELINPT IVAVSATGAP VSVADFEIVT VGPDRILRTT
APITLPLGST VIITFTGDLT STVTTDQIIT NRASMFWSST PGANPDERNG SGVPNPPECS
SLPGNCNLDN TQLNNYGLIS SVTTTAIAPV VVSKSVIEGV APSTSGTNVT IGEIVTYRLA
VSLPEGVTSG LVITDTLPAG MAYLPGSATL VTGTLATGDP ALSGGHPAGA GTLAFNGVFS
DPTDPAVTPI GSLQFLNGTD VRFTFDQITL PGDNDTRNNT FFIRYQAVVL DVPGNTGFTG
SQTTLTNSGQ FDVPSTPQPP VDLLIDPNGA TVTVVEPELA IAKSVAPTSG DAGDPVTYTI
TLSHTARSLA DAFDVAITDA LPATIGSSLT APITIAQVNA THSVDGDITG NFGVTSNTLT
TTTPFTLPLN ATVTVTITGV LRQSVQPGET ITNTVALTST SLPGLNQDLS PDTTGVSDGT
DRERSRSASS SVTLSTGQGA FSKELFATSA PHTSGEDITI GEIISYTLIV DLPEGTIRNL
VLTDDLPAGL DYEGFTVITD AAQSGGLLTQ DFVGTVPSIT VTGGAGSGDD VTLTFDSDVV
VTGDNNAANN RFLVVVRVRA LDEPGMVGLI PPGQTTLTNT AAMRYTDGSN ITRTFTDTET
VRVVEPQLTI TKNIVQTVAN AGDPITITLT VTNTGTSDAF DVVITDTLPP DFDAATTTFG
AAGSDYPATF TPSRTGSEVR YEGGPIPAGA TVTFTFRVNL TGTVTPGATI VNTARMARST
SLPGDDPTER VQTPVDSSDT MTIRSNSLSG FVYVDADNDG IFDTGESGIG GVTITLSGTD
HLGNSVLLTT TTTITGFYRF DNLRPGTYTL AETQPAGYLD GRDTIGTQGG TTGNDVLSNI
VLPVDASTNG ENNNFGELLP ARIAGFGYED DDNDGVFDTG ESGISGVTIT LSGTDDLGNS
VLLTTTTTVT GFYAFDNLRP GTYTVSETQP TGYLDGRDTA GTLGGDTTLN DRIASITLPS
GVASLNNNFG ELRPASLSGR VYRDDNNNGS LDAGEPGISG VTITLSGTDD LGNTVTLTTT
TDASGVYTFT NLRPGTYTVS ETQPSGYLDG AESVGSAGGS VITNDVIGNV TLSAGVAATG
YNFGELLAAR IAGFVYEDDD NDGVFDTGES GISGVTITLS GTDDLGNGVL LTTTTTITGF
YAFDNLRPGT YTVSETQPSG YLDGRDTAGT LGGDTTLNDR IASITLPSGA ASLNNNFGEL
RPASLSGRVY RDDNNDGIPD AGEPGIGGVT ITLTGTDDLG NSVLLTTTTS ITGFYTFDTL
RPGTYTVSET QPIAYNDGID RSGTAGGDLI NDQVSNIVLG AGVDAVNYDF GERGTFVSGI
VWIDTDRDGT LDGGENGRLG GVTVTLSGTD LLGNSVLLTT TTTITGFYIF DNLPAGTYVI
EQTQPTGYGS STPNTLSVTV PLTGLTDQNF GETVSTLSGY VYVDSNNNGV FDAGESGIGG
VTVTLTGTDV NGVSVTLTTL TLADGSYRFE NLLAGTYTIS ETQPLIYSDG LESIGTIDGT
PVGMLVSNDV IGNITLSAGT DGINYNFGEL ADAGLGDRVW LDRNGDGAQD PGESGIGGVQ
VYLDLNNDGV LDAGEPVTTT NALGQYFFGN LPGGTYTVRV DTTTLPGGVG QTYDLDGATV
TPHAATASLA AGATRTDVDF GYRGTASIGD RVWLDRNGDG VQDAGEPGLS GVIVYLDLNG
NGVRDADEPF DATDASGNYL IGGLLAGTYT VRVDASTLPD GVSATYDLDG VGTPGVVTGV
TLSSGQARTD VDFGYRGTAS IGDRVWNDAN ANGIQDAGET GVSGIVVALY DSTGTLLITT
TTDLNGNYLF DNLPAGTYTV GVGATPGRSI SPRGAGSDSA LDSDIDRATR RSNPITLAIG
EDRRDIDIGL YQLASVGSLV WLDRDLDGIR EADEPGIGGI EVRLLRSDGT VVATQMTDAN
GYFMFTDVEP GEYRIAFSVP SGYYVSPFRQ GNDRSTDSDA DPATGLTPIF TLTPGQIDPT
WFMGLSPISP TAIQLTRFSA ERGAQGVVVR WETAAEYGTR GFYLERSATG SRSDAVRITD
RLIPARGSVS SGAAYEWNDT TAAPGTRYTY WLIEETVDGS THIYGPATLA STTGGGYTVM
LPLIVR