Gene RPB_1638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1638 
Symbol 
ID3909915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1852146 
End bp1867430 
Gene Length15285 bp 
Protein Length5094 aa 
Translation table11 
GC content65% 
IMG OID637883532 
ProductVCBS 
Protein accessionYP_485257 
Protein GI86748761 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.592919 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCA TCAACGGAAC CAACGGCAAC GACAATCCCC TGGTCGGCTC GACCGGGGCG 
GATACCATTC ATGGTCTGGA TGGCGATGAT CTCATCCGAG CCGGCAACGG CAACGATACC
GTGTTCGGTG ATGCCGGTGA CGATATCCTC GACGGCGGCA CCGGCTTCGA TGTCTTGACC
GGAGGCGCCG GCAACGACGC TCTGCTGGGC GGAAGCAATC TGGGCGGGCT GGACATTGCG
GCCTATTCGG GCAAATGGTC CGACTATACG ATCACCGTCA ACGCAGCCGG CAACTACACG
GTCGTTGACA GGCGCGCGGG CAGTCCGGAC GGCACGGATA CGGTCAGCGG CGTCGAAGCG
TTCCGCTTTG CCGCGCCGGA CGGCAACGGC ACCGTCGACT TTCTGCTGGC TGATCTTCTG
AACGCGCCGC CGGTCGCGGG GGCCGACGTC GGCACCGTCG CCGAGAGCGC CGGAACGCCA
ACGAGCGGCA ACAGCGTCTT GGGCAACGTG CTCGGCAACG ACAGCGATCC GAACGCGCCC
TATGACGTGC TCTCGGTGTC GGCGATCGCG GGCGGCACCA TCGGCTCGGA ATTCGCCGGC
AGCTTCGGCA AGATCACCCT CAATGCCGAC GGCAGCTACA CCTATGTGGT GAACGAAGCG
GCGGTCGACG CGGCCGACGC GCCCGCGCCG GGCACCACCC TGACCGACCA GTTCACCTAC
ACACTGAGCG ATGCCGGCGG CCTCACCCAG ACCGCGACGC TGACCATCAC CATCACCGGC
ACCAACGACG CGCCGGTGGC GCAGGCGGAT ACGCTGTCGA GCTTCGTCAC CAACGGCGCG
GCGCGCACCA TCGCGTTCAG CGAATTGACG GCGAACGATT CCGCCGGCCC GGCCAACGAG
GCGGATCAGA CCTTTACGGT CACCGGCGTG AGCAACGCCT CGGGCGGCAC CGTCACGATC
AATAACAACG GCACCGCCGG CGACACCAGC GACGATTTCA TCCAGTTCAC GCCCGACGCC
GGCTTCAACG GCACCGCCTC GTTCGACTAC ACCATCACCG ACAATGGCAC GACCAACGGC
GTCCCGAATC CGCTGACCAG CACCGGCCAC GTGTCGTTCT CGGTGACTTC GCCGAACCAC
GCCCCGCAGG GTGAGGACGC CGTGGTCACC ATTTCGGAAG ATGGCGCGCA CACCTTCCAG
GTCTCCGATT TCGCCTTCAG TGATGCCGAC CTGCCGGCCC AAACCTTGAA CGCCGTGATC
ATCACGTCGT TGCCGGCCGC CGGCAGCCTG ACGCTGAACG GCACGCCCGT GACGGCCGGC
CAGGTGATCG CGGCCACCAG CATCCCGGGT CTGGTGTTTA CCCCGGCGCC CGATGCCAAT
GGCGCCGGCT ACGCCTCGTT CACCTTCCAG GTCCGCGACA GCGGCGGCAC CGCCAATGGC
GGCGTCGACA CGGATACGAC GGCTAACAGT TTCACCTTCA ACGTCACGCC GGTGAACGAC
GGACCGGTGG CCGCGAACCC GATCGCAAAC CAGTCCTCGC CCGAGGACCA GCAGTGGACC
TTCCAGGTTC CGGCCGATGC CTTTAGCGAT ATCGACAGTC CGTCGCTGAA TTATTCGGCG
ACGCTGGGCA ATGGCGACCC ATTGCCTGGG ACACTCGTTT TCGATGCCTC GACGCGGACG
TTCTCGGGCT CCCCGCCCCA GGATTACAAC GGCACGTTCT CGCTGAAGGT GACGGCGTCG
GATGGGACGT TGTCGGTGTC GGACATATTC GAGTTGACGA TCACGCCTAT GAACGACGCT
GCGATCATCA GCGGCGACGT GGCGGGCGCG GTTGACGAGG ATACGGTTGC GACGGCGACC
GGGACGCTGC TGGCGTCGGA CGTCGACAAC GCGACGAACG CCTTCCAGGC GAATGCTGGA
TCGACGACCC ATGGCAGCTA CGTGGTCAAC GCGGCTGGCG TGTGGACGTA CACGCTCAAC
AACGCCGACA CGGCGGTCGA TGCGCTGAAC AATCTGGACA CGCTGAGCGA CAGCTTCACG
GTCCTGTCGG CGGACGGCAC GTCGCAGGTG GTGAGCATCA CCATCACCGG CACCAACGAC
GCCGCGATCA TCAGCGGCAC TGCGACGGGC GCGGTTGACG AGGATACGGT TGCGACGGCG
ACCGGGACGC TGCTGGCGTC GGACGTCGAC AACGCGGCGA ACGCCTTCCA GGCGAATGCT
GGATCGACGA CCCATGGCAG CTACGTGGTC AACGCGGCTG GCGTGTGGAC ATACACGCTC
AACAACGCCG ACACGGCGGT CGATGCGCTG AACAATCTGG ACACGCTGAG CGACAGCTTC
ACGGTCCTGT CGGCGGACGG CACGTCGCAG GTGGTGAGCA TCACCATCAC CGGCACCAAC
GACGCTGCGA TCATCAGCGG CGACGTGGCG GGCGCGGTTG ACGAGGATAC GGTTGCGACG
GCGACCGGGA CGCTGCTGGC GTCGGACGTC GACAACGCGA CGAACGCCTT CCAGGCGAAT
GCTGGATCGA CGACCCATGG CAGCTACGTG GTCAACGCGG CTGGCGTGTG GACCTACACG
CTCGACAACA CCGACCCGGC GGTCAATGCG CTGAACAATC TGGACACGCT GAGCGACAGC
TTCACGGTCC TGTCGGCGGA CGGCACGTCG CAGGTGGTGA GCATCACCAT CACCGGCACC
AACGACGCTG CGATCATCAG CGGCACTGCG ACGGGCGCGG TTGACGAGGA TACGGTTGCG
ACGGCGACCG GGACGCTGCT GGCGTCGGAC GTCGACAACG CGGCGAACGC CTTCCAGGCG
AATGCTGGAT CGACGACCCA TGGCAGCTAC GTGGTCAACG CGGCTGGCGT GTGGACCTAC
ACGCTCGACA ACACCGACCC GGCGGTCAAT GCGCTGAACA ATCTGGACAC GCTGAGCGAC
AGCTTCACGG TCCTGTCGGC GGACGGCACG TCGCAGGTGG TGAGCATCAC CATCACCGGC
ACCAACGACG CTGCGATCAT CAGCGGCGAC GTGGCGGGCG CGGTTGACGA GGATACGGTT
GCGACGGCGA CCGGGACGCT GCTGGCGTCG GACGTCGACA ACGCGACGAA CGCCTTCCAG
GCGAATGCTG GATCGACGAC CCATGGCAGC TACGTGGTCA ACGCGGCTGG CGTGTGGACC
TACACGCTCG ACAACGCCGA TACGGCGGTC GATGCGCTGA ACAATCTGGA CACGCTGAGC
GACAGCTTCA CGGTCCTGTC GGCGGACGGC ACGTCGCAGG TGGTGAGCAT CACCATCACC
GGCACCAACG ACGCTGCGAT CATCAGCGGC ACTGCGACGG GCGCGGTTGA CGAGGATACG
GTTGCGACGG CGACCGGGAC GCTGCTGGCG TCGGACGTCG ACAACGCGGC GAACGCCTTC
CAGGCGAATG CTGGATCGAC GACCCATGGC AGCTACGTGG TCAACGCGGC TGGCGTGTGG
ACATACACGC TCGACAACGC CGACACGGCG GTCGATGCGC TGAACAATCT GGACACGCTG
AGCGACAGCT TCACGGTCCT GTCGGCGGAC GGCACGTCGC AGGTGGTGAG CATCACCATC
ACCGGCACCA ACGACGCTGC GATCATCAGC GGCACTGCGA CGGGCGCGGT TGACGAGGAT
ACGGTTGCGA CGGCGACCGG GACGCTGCTG GCGTCGGACG TCGACAACGC GGCGAACGCC
TTCCAGGCGA ATGCTGGATC GACGACCCAT GGCAGCTACG TGGTCAACGC GGCTGGCGTG
TGGACATACA CGCTCGACAA CGCCGACACG GCGGTCGATG CGCTGAACAA TCTGGACACG
CTGAGCGACA GCTTCACGGT CCTGTCGGCG GACGGCACGT CGCAGGTGGT GAGCATCACC
ATCACCGGCA CCAACGACGC TGCGATCATC AGCGGCACTG CGACGGGCGC GGTTGACGAG
GATACGGTTG CGACGGCGAC CGGGACGCTG CTGGCGTCGG ACGTCGACAA CGCGGCGAAC
GCCTTCCAGG CGAATGCTGG ATCGACGACC CATGGCAGCT ACGTGGTCAA CGCGGCTGGC
GTGTGGACAT ACACGCTCAA CAACGCCGAC ACGGCGGTCG ATGCGCTGAA CAATCTGGAC
ACGCTGAGCG ACAGCTTCAC GGTCCTGTCG GCGGACGGCA CGTCGCAGGT GGTGAGCATC
ACCATCACCG GCACCAACGA CGCTGCGATC ATCAGCGGCG ACGTGGCGGG CGCGGTTGAC
GAGGATACGG TTGCGACGGC GACCGGGACG CTGCTGGCGT CGGACGTCGA CAACGCGACG
AACGCCTTCC AGGCGAATGC TGGATCGACG ACCCATGGCA GCTACGTGGT CGATGGGACG
GGCACCTGGA CCTACACGCT CAACAACGCC GACCCGGCGG TCAATGCGCT GAACGATCTC
GGCACGCTGA GCGACAGCTT CACGGTCCTG TCGGCGGACG GCACGTCGCA GGTGGTGAGC
ATCACCATCA CCGGCACCAA CGACGCTGCG ATCATCAGCG GCACTGCGAC GGGCGCGGTT
GACGAGGATA CGGTTGCGAC GGCGACCGGG ACGCTGCTGG CGTCGGACGT CGACAACGCG
GCGAACGCCT TCCAGGCGAA TGCTGGATCG ACGACCCATG GCAGCTACGT GGTCAACGCG
GCTGGCGTGT GGACCTACAC GCTCGACAAC ACCGACCCGG CGGTCAATGC GCTGAACGAT
CTCGGCACGC TGAGCGACAG CTTCACGGTC CTGTCGGCGG ACGGCACGTC GCAGGTGGTG
AGCATCACCA TCACCGGCAC CAACGACGCT GCGATCATCA GCGGCACTGC GACGGGCGCG
GTTGACGAGG ATACGGTTGC GACGGCGACC GGTACGTTGA CGGCGTCTGA CGTCGACAAC
GCGGCGAACG TCTTCCAGGC GAATGCCGGA TCGACGACCC ATGGCAGCTA CGCCGTCACC
GCGGCTGGCG TGTGGACCTA CACGCTCGAC AACACCGACC CGGCGGTCAA TGCGCTGAAC
GATCTCGGCA CGCTGAGCGA CAGCTTCACG GTCCTGTCGG CGGACGGCAC GTCGCAGGTG
GTGAGCATCA CCATCACCGG CACCAACGAT GCTGCGATCA TCAGCGGCAC TGCGACGGGC
GCGGTTGACG AGGATACGGT TGCGACGGCG ACCGGGACGC TGCTGGCGTC GGACGTCGAC
AACGCGGCGA ACGCCTTCCA GGCGAATGCT GGATCGACGA CCCATGGCAG CTACGTGGTC
GATGGGACGG GCACCTGGAC CTACACGCTC GACAACACCG ACCCGGCGGT CAATGCGCTG
AACGATCTCG GCACGCTGAG CGACAGCTTC ACGGTCCTGT CGGCGGACGG CACGTCGCAG
GTGGTGAGCA TCACCATCAC CGGCACCAAC GACGCTGCGA TCATCAGCGG CACTGCGACG
GGCGCGGTTG ACGAGGATAC GGTTGCGACG GCGACCGGTA CGTTGACGGC GTCGGACGTC
GACAACGCGG CGAACGCCTT CCAGGCGAAT GCTGGATCGA CGACCCATGG CAGCTACGTG
GTCGATGGGA CGGGCGTGTG GACGTACACG CTCAACAACG CCGACACGGC GGTCGATGCG
CTGAACAATC TGGACACGCT GAGCGACAGC TTCACGGTCC TGTCGGCGGA CGGCACGTCG
CAGGTGGTGA GCATCACCAT CACCGGCACC AACGACGCTG CGATCATCAG CGGCGACGTG
GCGGGCGCGG TTGACGAGGA TACGGTTGCG ACGGCGACCG GGACGCTGCT GGCGTCGGAC
GTCGACAACG CGACGAACGC CTTCCAGGCG AATGCTGGAT CGACGACCCA TGGCAGCTAC
GTGGTCGATG GGACGGGCAC CTGGACCTAC ACGCTCGACA ACACCGACCC GGCGGTCAAT
GCGCTGAACG ATCTCGGCAC GCTGAGCGAC AGCTTCACGG TCCTGTCGGC GGACGGCACG
TCACAGGTGG TGAGCATCAC CATCACCGGC ACCAACGACG CGCCGGTGAT CGACGGTGAT
CCGGATGTGA CCGGCGTGCA GCCGATTCCG ACCCAGACCG TGGCCGAGGA CGGCACCGTG
GCGGCGCTGG CCGCGCGCGT GCAGCAGCTG ATCGCCGGCG GCATCAGCGA CGTCGACGGC
GAAGCCGTGA CGCTGACGCT GACGCTGACC TATCCGGCGG GCAGCGGCAT TGCTTCGCAG
CAGATCCTCG TCAATCCGGC GGTCGACTTC ACCTGGACTC CGCCGACGAA CTTCAACGGC
ACGATCCAGG TCGACCTAGC GGCGTTCGAC GGAACGGCCA CCACCCACGC CGGCTTCGAC
CTGGTGGTGA CCCCGGTCAA CGACGCGCCG GTTCTGACCG GTTCGGCGGC GACGCTGGCT
GCGGGTGTCG AGGACACCAC TTACACGGTG TCCGCCGCGG ACTTGCTGGC GGGCTTCACG
GACGTCGATG GCGATACGCT GAGCGTTGCG GATCTGTCGG CGAACCATGG CGTCGTGACC
GACAACGGCG ACGGCACTTA CACGATCGCG CCGACGTCGA ACTACAATGG TCCGGTGACC
CTGAGCTACA ATGTCGTCGA CGGCCAGACC GGTGTGACCG CGGCGACGCA GACCTTTGCG
CTGGCTGCGG CCCCCGACCT CGACACGGTC CTGACCAATG CGGCGCTCGA TCCCGACGTG
CTCGACGCCA ACGGCAATCT GCACTTCGGC AGCGGCAATG CCGGCACCGG CTTCGCCGTG
GCGACCGACG CCGTCGACGC ACCCGGCGTC GAACTCGGCC TCAGCGCGGT GCTGCGCTAC
AGCGGCACCG CTCCGATCGA CCTGACGCTC GACCCGACCG GCCACACCTA TGTGGTCCCG
GCCGGCACGG CCGGCGGTAC GCCGCAGGAC GGTGCGGGGT CCGCCGACGA CAACTGGGCG
CGCTGGAACT TCAGCTTCTC GATCGGCGCC GATGCCGATA TGAGCGGCAA CGAAACCATC
GGCGACCTCG ACTATCGCTT CACGATCAGC AGCATCGGTG AAAGCGGAGC GCTCACCGAG
CTCCTGTCCT ACACCGTTGC CCAGATCGCG GAAGCATATG ACTTGCTGTA TGGACCAGGA
GCAGGGGCGG CCTTCCTGAA CCAAAGCATC TATCAGGACA CGATCAACCT TGAATGGGCG
CACATTCTCG GCTCGGATCC AAACCATCCG TTCGATCCGA ACCTGCCCGG CTACTACCAG
ATCGACCTGG TGGCCTCGAA GGACGGCTCG ACGCTGGTCA GCGACTACAT CAAGGTCCGC
GTCAATTCCG CTCCTGATGC GCAAGATGAC GTCAATGGAC TCGAAACGCT GAAGGAGGCG
GGCGTCGCTG CCGGCGATGC GGCGGCGACC GGCAATGTGC TGACCAACGA CGCCGATCCG
GACACGCTGC CGACGCCGGA CAAGCCCCTG CTGGTCGTCA CCCAGGTCGG CGTCACGGCG
GTGGCCTCGA CCGGCACCAC GATCGACGGC ACCTACGGCA CCCTGACCAT CGCGGCCAAC
GGCGCCTGGA GCTACGCGCT CGACGACGCG CTCCCGACCA CCCAGGCGCT GCAGGTGGGC
CTCAATGGCA CCGAGACCTT CACCTACACC GTCGCCGATC GCTTCGGCGT CACCGACACC
GCGACCCTGA CGCTGTCGAT CGACGGCAGC AACGACGCGC CGGTGGCGTA TGTCGTGCCG
GCGTTCCCGT CGATCGCCGA GGACGCTTCG TTCTCGGGCA CGGTCGGCCT GAACTTGCCG
GTGGCGCTGT TCGCCTCCGA CGTCGACAAC GCCCTGAATC CGTCTTCGCT GACCTTCACC
GGTGCGACGA TCACGATCGG AGCAACGGTC ATCAACGTCA GCGACCTCGC CGCCGCCGGC
ATCGACTACA CGCCGGGCAG CGGTGTGTTC CACTTCAACG GCGCCGTCGC GGCCTATCAG
TCGCTTGGGG TCGGCGATAG CGCGGAGGTG GTGGTGTCGT TCACCGCCAC CGACGGCAGC
GCCGTCAGCA ATGCCGGCTC GGTGAGCTTC ACCGTCACCG GCACCAACGA CGCGCCGACC
ACTTCGGCGG TGACGCTGAC GTCGGTCGCC GAAGACAGCG GCGCGCGTCT AATCACGCAG
GCCGAGCTGC TGGACAATGC CGTCGACATC GACGGCGACA TCCTGACCGC GACCGGCCTC
ACCATCGCGT CCGGCAAGGG GACGCTGGTC AACAACACCG ACGGCACCTG GAGCTATACC
CCGGCCGCCA ACGACGACAC CGTGGTGTCG TTCAGCTACA CCATCGTGGA CGGCCACACC
GGCTCGGTTG CCGGCTCGGC CACCCTCGAC ATCACGCCGG TCAACGACGC CCCCACGCTC
GGATTCCGTC AGGGCTTCGA GGACGATAGC GCGGGCATCA TCGACGGTCT CGCCAGCAAC
GGCACGACTT ACGGACATCT GGCGATCGTC AACACTTTCC AATCCGCAAG CGGAACGGTC
TCCGCGGCCG ACGGCGGCGA CTTCGCCGTG TTCACCCAGG CAGGACCGCT CAATGACGAG
TCCGGCCCGT TCACCCGGTT CGACGGCTAC CGCTCGGAGT TCGTCGACGG TCTCACCACC
AGCGTGAAGG TCTATCTGGA CACCAACCTG GCGTCGGGCG AGGGCTTCGA CTATTCCGTC
GCGGCCAACC GCACCAATGG AAATCACCTT CGCGACTACA TCTTCCATGT CACCAAGGAC
AGCTCGACCG GCCAATTGCT GGTCGGCGCC TCCAACAACA CCAACTTCAA CCCGCGTGAA
GACCTCGACA CGCTGGCCAA CCACGGCACG ATCACCAGTT CGGGCTGGTA CACGCTGCAG
TACGTGTTCC GCGACAATGG CGGCGTGCTG GCGGTCGAAC TCTCCGTGCT CGACAGCGCC
GGTGCGGTCG TGTTCACGCA AACGCTCTCG AGCCCGTCCG ATCTCATCAC GGACGTTGGC
GGCAACCGCT ACGGCTGGTT CACCAATATC GACGTGACCG AGGGCATCGC GGTCGACAGC
TTCACGCTGG GCGATTTCAC CGGCAAGGTG AGCGAGCTGG CGGGCACCAA GGACGACTCC
GCCACCATCC ACAGAGACAG CGGCATCATC CCGCTGCGCG ACGTCGATCT CGGCGACAGC
CATACCGTGA CGTTCCTGTC GCAGGCGGAC GGCTATCTCG GCAGCTTCTC GCTGGCGACC
GCCGACAGCA CCTTCGACGG CGAGGGCAAG GTGACCTGGA CCTTCCAGGT TGCCGACAGC
GCGCTCGACG CGCTCAAGGC CGGCGAGGTG AGGACGCAGT CCTACACCGT CACCGTGGAC
GACGGCAACG GTGGCACCGC CTCGCAGGTG GTCTCGGTGA CCCTGACCGG CGCCAATGAC
GGCGCCGTGG TCGGCGGCAC CGTGGTCGGC AGCGTCACCG AAGACGGCGA CGGTCTCGCC
GCGTTCCAGA CCACCGGCGG CACTCTCACC GTCGACGACA AGGATGCCGG CGAGAGCTTC
TACCAGGAAG CGGCGGCGAA CACCGCCTAC GGCTCGTACA CGCTCGCCGC CAATGGTGAG
TGGGTCTATA CGCTCGCCAA CGGCCACGCG GCCGTGCAGA GCCTGGCGGC CGGCGAAAGC
CTCACCGACA GCTTCACGGC CAGGACGATC GACGGCACGC TGCAGACCGT GACGATCACC
ATCAACGGCG CCAACGACAT CACCAGCACG GTGAGCGGCA GTGTCGATAA CACGGCGACC
AAGGGCGGTA GTGGAAACGA ACTGATCGTC GGTACGGGCA TCCCGGCCAG CGGGTTTGGG
CTGGTCAACC AGACCGATTT CGGCATCGAG CTTGGTTTGC AGGTGATCTA CCGGCAGGGT
CCGACCGTCG CGCCCACCGA CGTCGATGGC TACGGCGACG GCGTCCTGCA CTTCACCGTC
AATGACGGCG CGCAGGAGAC CGCCAACGGT TCGGGCAGCG ACAACGCCGC CCGCGCGGCC
TGGAGCTTCA ACTACTCGAT CGCGACCGGG CTGAATGGCG AAAGCAGCAA TCTCGGCGCC
TTCACCTTCA AGCTGCTGTA CGACGTCGAC CCGACCGAGA GTGCCAACTA CAAGACCTTG
ACGATGGTCT CAAAGGTCGG CGGCGGTTAC GAATGGCTGG ACGAGCAAGG CCACTCCATC
ATTTCGGATG ACGGCGGCAA CGCCAATGTG GCGCAAAACT CCGAGAACTA CGCGTTCTCG
CTTGCGCAAG CCTACCTGGC CAATGTCTAT GGCCCGGCCA ACAACTTCGA CGGCGATGCG
CGGTTCGACA TCCAGTTGCA GGCCTATTCG GGCGCGACGC TGCTGGCCAC CAACCACATC
GCGGTCGACG TGATCGAGTC GAATACGACG CCGGTCGCGG TGGCGGACAG CAACGCGCAT
GATCAGGTGA TCGAAGGCGG CGTCAGCCAG GCCAGCGATA TCACTGCCAC CGGCAACGTG
CTCACCAACG ACGTCGAGCC CGACATCGGC GACACCAAGA CGGCGACGCT GCTGAACGGC
GTGGCTGTGG GAGCAACCGG CACCGTCGCG TACGGCACTT ACGGCAACGT AGTGCTCAAT
CGAGACGGCG GCTGGACCTA TACTCTCGAC CCGAACAAGT CCGACACCCA GGCCCTGGTC
GACGGCGCGA CCGCGACCGA GGTCTTCACC TACACGATGA CCGACCTCGA CGGCGCGACC
TCGACGTCCA CGCTGACCAT CACCATCACC GGGTCGAACG ACGCGCCGGA GGTCGCCGGC
ACGATCTCGG GTGAAAGGGC TGAGGGTACC TCGGCCTTCA CCCTGAACCT GCTGCAGGGT
GCCAGCGATC CGGACAGCGG CGACACGCTG AGCGTGACCA ACGTGCAGTA TGCGGTGGAC
AACGGTACGG CCTCGACCAC GGTGCCGACG GGCCTCGTCC TGACGGACGC GACCTTGTCG
GTCGATCCGA GCGATGCGGT GTTCAATAGC CTGGCGGTCG GCCAATCGAA GGTGATCACC
GTCACCTACG ACGTCACCGA CGGCCACGGC GGAACGGTGG CGCAGACGGC GACTGTGACC
ATCACCGGCA CCAACGATGC GCCGGTGATC GCGGGCGGCC CGCAGAGCGG CAAGGCATTC
GAGGCGGGCG ACCTCCACAA CATGATCGAA GCCGATGTGG CGGCGGATCA CAAGTTCGAG
CCCGTCGTGG TGCTCGACGA CACGATCCAG ACCCTGATCA ACACTCACCC GACCGCGATG
AATGTGGTGC TGCAGGGCGT CTTGACGGCG CTTCAGGTGA CCAATCCGTC CGCGACGCTG
GCTGACGCGA TCGCTCAGGT CTGGGACAAT CTCGACGACC ACTACACGGC GACGAACTAC
TACAACACCA GTGTCAACGA GCAGTTCATC CGGCTCGGTG TCGAGTACGT GAAGTATCTC
GAGGCCGGCG GTCATCCGCT GGTCGACGTC GTCGCCAAAT ACGAGCCGGA CGTCAGTGGA
AACGGCGTCC CGGATCGTGT TCAGTCGATG CACGACAATC TGCTGGGCAA TCTCAATCAC
ACCGACTTCG ACAGCCGGTT CAGTGGTCAG TTGAAGATCG ACCTGGAAAA CCTGATCAAG
ACGATTGATC CAAATCTCGA TCTGCTGACG CGACAGGTCT CGGAAGTGCA CAGCGGCAGA
GAGAGCAATG CCGCGAACAA GCCCCTGGCC GTCGCCTTCG ACCAGGCCCA CGGCCTGCTG
CCGGTGGCGT CCGGCCAGTT CACGGCGACC GACGTCGACA ACGGCGACAC GCTGACCTGG
ACGATCGACA GCGCCAACGG CGTCACAGGC GCCAACGGCA CCTACGGCAC GCTCACGCTC
GATGGCACGG GCAAGTGGTC CTACACACTC GACGACAGCC GGACCGTGAC GCAGGCGCTT
TCCGAAGACG ACACCGCGAC CGAGACGTTC ATCGTCAAGG TGTCGGACAA CCATGGCGGC
TTCGACACCG AGACCGTGAC CATCACGGTG AAGGGTGCCA ACGACGCCGC GGCGATTACC
GGTACCTCGA CGGCGAGCCT GACCGAGACC AACGCGGTGC TGACCGCGGC CGGAAACCTC
GACGCTACCG ACGTCGACGG CGCGGGGAGC TTCACGGAGC AGAGCGACGT CGCCGGCTCG
AACGGCTACG GCTTGTTCTC GATCGACGCC TCGGGCGCCT GGACCTACAC GACAAACTCG
GCGAACGACG CGTTCGTCGC CGGCCAGACC TACACCGACA GCATCACGGT GGCCACGGAC
GACGGCACGA ATCAGCTGAT CATCGTCACC ATCACCGGCA CCAACGACGC GCCGGTGGTG
ACGGGCGGTG TCACCACCAG CGCGACCGAG GATGGGTTGC CCGTCACGGT CAACGCGCTG
GCGAATGCCA GCGACGCGGA CGCGGGTTCG GCGCTGCTGG TGGTGCCACC GGCAGTCCTG
CCGGCCGGCG TCACCTTCAT CGCTCCGGGC GCGGCGCAGA CGATCGATTT CGAGAGCTAT
GCGCTCGGAT CGGTGGTCGG CCAGAACGGC TGGACCGACG CCTCGCCGAA CTCGCCGGCC
AACGCGATCG TCGATGTCGG CGGCACCCAC AACCAAGTGC TCCGGTTGGC CAACGACCCG
TCGTCGGGCG ATTTCGGCGG ACCGTACACA CCCGCGCTCG CTGTTGCGGC GGGCGAGTCG
GCCTCGGGTG CTGCGGTCGA CCAGTTCGTG CTGTCGTTCA ACTTCAAGGC AGTGCAGAAC
ATGGCGGACG GCTCGCGGAT CGAGATCGGT CTGGCGAACA CCGCGAGCAA CGACCGCAAC
AACTTCATGG TGCTGGAATA CACCGGCGAA CCCGGTGTCG GCTTGCGGCT GGCGATCAAC
TCGCCGCTCG CGAACGCTAA TGAATGGAGC AACAACTCGT TCGATTTCGC GACAGGCAAC
GTGACGCTGG CCGCGAATAT CGATCCCAAC GCTTGGCACA CTGTGAAGGT TGTCGCGAAG
TTCAACGACG GCAGCAACAA CGATGTGCTG CAATACTACC TCGATGGTGT CTATATCGGC
TCGGGTGGTT CGTTCGAGAA CTACTTCGAA TACGCGCGTG GCAACAGCCA CGATGCGTCC
GTGTACGCCG TTAACAAGGT GTTGTTCCGG GCGGGAGAGC CGGCAGGCAA CCCGTTCGCT
GCCGACGGTT CTGGCGGCAA CCGCCAGGGC TTCTACATCG ACGATATCTC GACGCAGGCG
GCGAGCAGCA CGGCGGGCTT CCAGCTCGAC CCGACCCATG CGGACTATCA GCATCTCGCC
CAGGACCAGA CGCAGATCGT CACTGTCAAC TATGGTGTCT CCGACGGCAT CACCACCACG
CCGACGACCG CGACCTTCAC GGTCACGGGC GTCAACGACG CGCCGGTCGC GGCCGCCGAT
ACGGCGACTG CAACGGAAGA CGGCGCGGTG GTCATGGCGA CCGTCGCCAG CAACGACACC
GACGCCGACC AGGGTCACGT TTTGACCTAC ACGCTCAATG AGCCGGTCGA CGGACTGACG
CTGAACGCCG ATGGCAGCTA CAGCTTCGAT CCCACGCACG CGACGTATCA GTCTCTCGCG
GCCGGCGCCC AGATCAATGT GGTCGCGAAC TACACTGTGA AAGACCAGTT CAATGCGGAA
TCGCACGCGA CTCTGACCAT CAAGGTGACC GGCACCAACG ACGCCCCCAC CATGGCGGAT
CTGACGCCCA TTTCGATCAA CGAAACGGCG GACTACGACT ACTTCGACGC AATCGAGGGA
GCGATCGTCG CGACGGACGC CGATGCCAAT TCGACTCTGA GCTACGGCAT CAACACCGGC
AGCGGGACCG TCAACTCGCT CATGGGCACC TATGGAACCC TGACCGTCAA TGCGAACGGC
ACGTATTCGT ACGCGCCCAA TGCAATCGCG ATCAATGCCC TGACGGGACC CGCGTCGGAG
ACGTTCGACG TCACGGTGAG TGACGGGATT GCGACGACGA TGAAGCCGTT GACGATCACC
ATCAACGGCG TCAACGACGT GCCGAACGAT ATCGTGTGGT CGGAAACGAT GGGCGGCGTG
AACCCCAACG TGGGCGCACC CGTGGTCGCC GAATTCTCCC GCGTCGGCTA TGCGATCGGC
CAATTGCGGG CGGAAGACCT TGCCGAGCCG TCGCAGACAG TGTCGTTCCA ATTGGTGGGC
GACGCCACCG GCCTCACGGC AAATACGGAT GGCGTGATCA CGGTCGACAG CGCCGGCCTG
GTCAAGCTTC TGGATCCGAC GAAGCTCAGC GTCGCCGCAT CCGGCTACGA TCCGGACAGC
CCGATCACCG AAGTGTTCCC GGGCGTGCTC GGCTACTCGT TCTTCGTCAA GGTCACGGAC
AGCGTCGGGG CGAGCGTGGT GGTGGAGCAA TACGTCACCG TCGCCGAACA GACCAAGTTC
GACGGCTCGA CCGTGCTGGC CCAACTCAAC GGGTTGGACA ACGAGTTCGA TCTGACGGCG
ACCAGTGGAC CGACTTCGTT CGCTGGCGGT GCGTTTGCCG CGGCCGACAA TTTCGTCGAT
GGCGGTGCGC GCAACGACAT ATTGACGACG GGCAGCGGCA ACGACGTGCT GGTCGGCGGC
AGCGGTGACG ACACACTCAA TGCCGGCAAC GGCGTCAATC AGCTCGACGG TGGCATCGGC
AATGACGTTC TCACCGCTGG CACCGGCGAC GACTGGTTCG TCGGCGGGAC TGGCAATGAT
ACGATCACCA CGGGCGACGC TGTCGATATC GTCGATGCAT CGTCGCCGGA GGTCGAACAC
GACCACGACG TGATCCAGTA CAATTGGACC TCGTCCAGCG GCCTGCTGAG GAACGACGTC
TTCACGCAAA CCGCTCCGAC ATCGGGGATC GCCGGAGCCA ATGTCACCAA CTACGCGGAC
ACGATCATCG ACTTCACGGA CGGGTCGGAC AAGATCGCCT TCACGGCCGG CACCAGCGAA
CACTCATTCG GCCTCGGCGG GGTGGGCCAG CTGTCGGCGA GCAAACTGGT GATCTACGCC
GGCAAGGGCG GCGATGGCTT AGCGGGTACC GCGGACGACG TGTTGGCGAA CAACGTGATC
GCCGGTTCTC AAGCCGACGG CGCGCAGCTC AAGTACTTCC AGGATAGTGG CGAGCTCTGG
TACGATCGTG ACGGTGATAC CAATGTTGGC GTGACCGACA TCGCCAAGGT CGCCACGCTG
ACCAATCATG CAGCGTTGAC CCACAGCGAC ATTCTGTTGG TCTGA
 
Protein sequence
MAIINGTNGN DNPLVGSTGA DTIHGLDGDD LIRAGNGNDT VFGDAGDDIL DGGTGFDVLT 
GGAGNDALLG GSNLGGLDIA AYSGKWSDYT ITVNAAGNYT VVDRRAGSPD GTDTVSGVEA
FRFAAPDGNG TVDFLLADLL NAPPVAGADV GTVAESAGTP TSGNSVLGNV LGNDSDPNAP
YDVLSVSAIA GGTIGSEFAG SFGKITLNAD GSYTYVVNEA AVDAADAPAP GTTLTDQFTY
TLSDAGGLTQ TATLTITITG TNDAPVAQAD TLSSFVTNGA ARTIAFSELT ANDSAGPANE
ADQTFTVTGV SNASGGTVTI NNNGTAGDTS DDFIQFTPDA GFNGTASFDY TITDNGTTNG
VPNPLTSTGH VSFSVTSPNH APQGEDAVVT ISEDGAHTFQ VSDFAFSDAD LPAQTLNAVI
ITSLPAAGSL TLNGTPVTAG QVIAATSIPG LVFTPAPDAN GAGYASFTFQ VRDSGGTANG
GVDTDTTANS FTFNVTPVND GPVAANPIAN QSSPEDQQWT FQVPADAFSD IDSPSLNYSA
TLGNGDPLPG TLVFDASTRT FSGSPPQDYN GTFSLKVTAS DGTLSVSDIF ELTITPMNDA
AIISGDVAGA VDEDTVATAT GTLLASDVDN ATNAFQANAG STTHGSYVVN AAGVWTYTLN
NADTAVDALN NLDTLSDSFT VLSADGTSQV VSITITGTND AAIISGTATG AVDEDTVATA
TGTLLASDVD NAANAFQANA GSTTHGSYVV NAAGVWTYTL NNADTAVDAL NNLDTLSDSF
TVLSADGTSQ VVSITITGTN DAAIISGDVA GAVDEDTVAT ATGTLLASDV DNATNAFQAN
AGSTTHGSYV VNAAGVWTYT LDNTDPAVNA LNNLDTLSDS FTVLSADGTS QVVSITITGT
NDAAIISGTA TGAVDEDTVA TATGTLLASD VDNAANAFQA NAGSTTHGSY VVNAAGVWTY
TLDNTDPAVN ALNNLDTLSD SFTVLSADGT SQVVSITITG TNDAAIISGD VAGAVDEDTV
ATATGTLLAS DVDNATNAFQ ANAGSTTHGS YVVNAAGVWT YTLDNADTAV DALNNLDTLS
DSFTVLSADG TSQVVSITIT GTNDAAIISG TATGAVDEDT VATATGTLLA SDVDNAANAF
QANAGSTTHG SYVVNAAGVW TYTLDNADTA VDALNNLDTL SDSFTVLSAD GTSQVVSITI
TGTNDAAIIS GTATGAVDED TVATATGTLL ASDVDNAANA FQANAGSTTH GSYVVNAAGV
WTYTLDNADT AVDALNNLDT LSDSFTVLSA DGTSQVVSIT ITGTNDAAII SGTATGAVDE
DTVATATGTL LASDVDNAAN AFQANAGSTT HGSYVVNAAG VWTYTLNNAD TAVDALNNLD
TLSDSFTVLS ADGTSQVVSI TITGTNDAAI ISGDVAGAVD EDTVATATGT LLASDVDNAT
NAFQANAGST THGSYVVDGT GTWTYTLNNA DPAVNALNDL GTLSDSFTVL SADGTSQVVS
ITITGTNDAA IISGTATGAV DEDTVATATG TLLASDVDNA ANAFQANAGS TTHGSYVVNA
AGVWTYTLDN TDPAVNALND LGTLSDSFTV LSADGTSQVV SITITGTNDA AIISGTATGA
VDEDTVATAT GTLTASDVDN AANVFQANAG STTHGSYAVT AAGVWTYTLD NTDPAVNALN
DLGTLSDSFT VLSADGTSQV VSITITGTND AAIISGTATG AVDEDTVATA TGTLLASDVD
NAANAFQANA GSTTHGSYVV DGTGTWTYTL DNTDPAVNAL NDLGTLSDSF TVLSADGTSQ
VVSITITGTN DAAIISGTAT GAVDEDTVAT ATGTLTASDV DNAANAFQAN AGSTTHGSYV
VDGTGVWTYT LNNADTAVDA LNNLDTLSDS FTVLSADGTS QVVSITITGT NDAAIISGDV
AGAVDEDTVA TATGTLLASD VDNATNAFQA NAGSTTHGSY VVDGTGTWTY TLDNTDPAVN
ALNDLGTLSD SFTVLSADGT SQVVSITITG TNDAPVIDGD PDVTGVQPIP TQTVAEDGTV
AALAARVQQL IAGGISDVDG EAVTLTLTLT YPAGSGIASQ QILVNPAVDF TWTPPTNFNG
TIQVDLAAFD GTATTHAGFD LVVTPVNDAP VLTGSAATLA AGVEDTTYTV SAADLLAGFT
DVDGDTLSVA DLSANHGVVT DNGDGTYTIA PTSNYNGPVT LSYNVVDGQT GVTAATQTFA
LAAAPDLDTV LTNAALDPDV LDANGNLHFG SGNAGTGFAV ATDAVDAPGV ELGLSAVLRY
SGTAPIDLTL DPTGHTYVVP AGTAGGTPQD GAGSADDNWA RWNFSFSIGA DADMSGNETI
GDLDYRFTIS SIGESGALTE LLSYTVAQIA EAYDLLYGPG AGAAFLNQSI YQDTINLEWA
HILGSDPNHP FDPNLPGYYQ IDLVASKDGS TLVSDYIKVR VNSAPDAQDD VNGLETLKEA
GVAAGDAAAT GNVLTNDADP DTLPTPDKPL LVVTQVGVTA VASTGTTIDG TYGTLTIAAN
GAWSYALDDA LPTTQALQVG LNGTETFTYT VADRFGVTDT ATLTLSIDGS NDAPVAYVVP
AFPSIAEDAS FSGTVGLNLP VALFASDVDN ALNPSSLTFT GATITIGATV INVSDLAAAG
IDYTPGSGVF HFNGAVAAYQ SLGVGDSAEV VVSFTATDGS AVSNAGSVSF TVTGTNDAPT
TSAVTLTSVA EDSGARLITQ AELLDNAVDI DGDILTATGL TIASGKGTLV NNTDGTWSYT
PAANDDTVVS FSYTIVDGHT GSVAGSATLD ITPVNDAPTL GFRQGFEDDS AGIIDGLASN
GTTYGHLAIV NTFQSASGTV SAADGGDFAV FTQAGPLNDE SGPFTRFDGY RSEFVDGLTT
SVKVYLDTNL ASGEGFDYSV AANRTNGNHL RDYIFHVTKD SSTGQLLVGA SNNTNFNPRE
DLDTLANHGT ITSSGWYTLQ YVFRDNGGVL AVELSVLDSA GAVVFTQTLS SPSDLITDVG
GNRYGWFTNI DVTEGIAVDS FTLGDFTGKV SELAGTKDDS ATIHRDSGII PLRDVDLGDS
HTVTFLSQAD GYLGSFSLAT ADSTFDGEGK VTWTFQVADS ALDALKAGEV RTQSYTVTVD
DGNGGTASQV VSVTLTGAND GAVVGGTVVG SVTEDGDGLA AFQTTGGTLT VDDKDAGESF
YQEAAANTAY GSYTLAANGE WVYTLANGHA AVQSLAAGES LTDSFTARTI DGTLQTVTIT
INGANDITST VSGSVDNTAT KGGSGNELIV GTGIPASGFG LVNQTDFGIE LGLQVIYRQG
PTVAPTDVDG YGDGVLHFTV NDGAQETANG SGSDNAARAA WSFNYSIATG LNGESSNLGA
FTFKLLYDVD PTESANYKTL TMVSKVGGGY EWLDEQGHSI ISDDGGNANV AQNSENYAFS
LAQAYLANVY GPANNFDGDA RFDIQLQAYS GATLLATNHI AVDVIESNTT PVAVADSNAH
DQVIEGGVSQ ASDITATGNV LTNDVEPDIG DTKTATLLNG VAVGATGTVA YGTYGNVVLN
RDGGWTYTLD PNKSDTQALV DGATATEVFT YTMTDLDGAT STSTLTITIT GSNDAPEVAG
TISGERAEGT SAFTLNLLQG ASDPDSGDTL SVTNVQYAVD NGTASTTVPT GLVLTDATLS
VDPSDAVFNS LAVGQSKVIT VTYDVTDGHG GTVAQTATVT ITGTNDAPVI AGGPQSGKAF
EAGDLHNMIE ADVAADHKFE PVVVLDDTIQ TLINTHPTAM NVVLQGVLTA LQVTNPSATL
ADAIAQVWDN LDDHYTATNY YNTSVNEQFI RLGVEYVKYL EAGGHPLVDV VAKYEPDVSG
NGVPDRVQSM HDNLLGNLNH TDFDSRFSGQ LKIDLENLIK TIDPNLDLLT RQVSEVHSGR
ESNAANKPLA VAFDQAHGLL PVASGQFTAT DVDNGDTLTW TIDSANGVTG ANGTYGTLTL
DGTGKWSYTL DDSRTVTQAL SEDDTATETF IVKVSDNHGG FDTETVTITV KGANDAAAIT
GTSTASLTET NAVLTAAGNL DATDVDGAGS FTEQSDVAGS NGYGLFSIDA SGAWTYTTNS
ANDAFVAGQT YTDSITVATD DGTNQLIIVT ITGTNDAPVV TGGVTTSATE DGLPVTVNAL
ANASDADAGS ALLVVPPAVL PAGVTFIAPG AAQTIDFESY ALGSVVGQNG WTDASPNSPA
NAIVDVGGTH NQVLRLANDP SSGDFGGPYT PALAVAAGES ASGAAVDQFV LSFNFKAVQN
MADGSRIEIG LANTASNDRN NFMVLEYTGE PGVGLRLAIN SPLANANEWS NNSFDFATGN
VTLAANIDPN AWHTVKVVAK FNDGSNNDVL QYYLDGVYIG SGGSFENYFE YARGNSHDAS
VYAVNKVLFR AGEPAGNPFA ADGSGGNRQG FYIDDISTQA ASSTAGFQLD PTHADYQHLA
QDQTQIVTVN YGVSDGITTT PTTATFTVTG VNDAPVAAAD TATATEDGAV VMATVASNDT
DADQGHVLTY TLNEPVDGLT LNADGSYSFD PTHATYQSLA AGAQINVVAN YTVKDQFNAE
SHATLTIKVT GTNDAPTMAD LTPISINETA DYDYFDAIEG AIVATDADAN STLSYGINTG
SGTVNSLMGT YGTLTVNANG TYSYAPNAIA INALTGPASE TFDVTVSDGI ATTMKPLTIT
INGVNDVPND IVWSETMGGV NPNVGAPVVA EFSRVGYAIG QLRAEDLAEP SQTVSFQLVG
DATGLTANTD GVITVDSAGL VKLLDPTKLS VAASGYDPDS PITEVFPGVL GYSFFVKVTD
SVGASVVVEQ YVTVAEQTKF DGSTVLAQLN GLDNEFDLTA TSGPTSFAGG AFAAADNFVD
GGARNDILTT GSGNDVLVGG SGDDTLNAGN GVNQLDGGIG NDVLTAGTGD DWFVGGTGND
TITTGDAVDI VDASSPEVEH DHDVIQYNWT SSSGLLRNDV FTQTAPTSGI AGANVTNYAD
TIIDFTDGSD KIAFTAGTSE HSFGLGGVGQ LSASKLVIYA GKGGDGLAGT ADDVLANNVI
AGSQADGAQL KYFQDSGELW YDRDGDTNVG VTDIAKVATL TNHAALTHSD ILLV