Gene Plut_0676 details


Gene Information

Locus tag: Plut_0676
Symbol:
ID: 3745949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium luteolum DSM 273
Kingdom: Bacteria
Replicon accession: NC_007512
Strand:
Start bp: 781424
End bp: 795409
Gene Length: 13986 bp
Protein Length: 4661 aa
Translation table: 11
GC content: 57%
IMG OID: 637768715
Product: VCBS
Protein accession: YP_374597
Protein GI: 78186554
COG category:
COG ID:
TIGRFAM ID: [TIGR01965] VCBS repeat
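
The start, end, gene length, and protein length listed above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(781424, 795409)
print(length)                     # 13986
print(protein_length_aa(length))  # 4661
```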


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000119465
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCTATCA CTCTCAATGA AGATTACAGC GCGCTCAGCC TTAGTTTTCT TGTAAGTTTG 
CTTGGTGATG ATTCTACCAT ACTTACGCTG TCAGTGTCCG CCGGCCATCT TATTCTGACC
GATGGTGGCG CAAGTCCGGA TGTGACGATT GAAGGTAGCG GTACCGATAG TGTTACTGTT
ACTGCCAGTT CAAAAGCTGA TCTTGCTGAC TGGCTCGCTG ATATGGTGAA TAATAGCTCA
ATCTATATCA CTGCCTTCAA TTCTCTGGAG CCTGTTACGC TTGACTATGT TATTGATAAT
GGGACGAGTG ACGATATCAC GGGCACTGAG CAGCTTGTGT TTACTCCTGT TAATGACCCG
GCATGCCTTG ACATGAACGG TTCCGATGAG GTCGGCTGCA CATATGTCAC GAAGTGGATT
GTTGGCTCAG TAGATCCGAT TTCCATTCTT GATAAAGATT GGGTATCGGG AGATCCTGAT
GGAGTGAGCA TGATTACAAG CGCCATCATT ACCTTGAGTG GGCGGTTGGA TGTGATTGCT
TCTGAGTTTC TGTCGACGTC ATTGGAGGGT GCTACGACGT ATGAGGGTGA GTCAGGAACT
ATCACGATTT ACGGAGACTC GACTAGGAGC ATTATCCTTA GCGGGGATGC CAGTCTTGCG
GATTATATGG CTGTAATCAG CGGCATCGTC TATCAGAATA CGTTACCGAA TACATCAAAG
ACTGGGGATA GGGTCGTTAC GATTTCAATG ACCGATGTGG ACGGGATTGC GGCAAATTCT
TCAGCTATAT CGTTACTTAC TCCCATTGAT GTGACTCTGG TGAATGAGGG TGACCGGATT
TTCATTGATA CTGGTGACGG GATGGTTGAT AGCGGCCTGA CGGTGCTGAT GGTAAAGGAT
GCAACCCATG TCATAGCTTC AGGTGAGTTG CCGATATTTG ATAGCGCGTT TGCTGATGGT
CATTATTTTC TCAATTTTCT TGCGCCGACT GCCGATCCCG ATGCCGAAGA GTGGAGCACT
GATGAGTTAG AAGGTGTACC GACTGTGCTC ACTACGACCA TTGTCGTTGC AAAAGTGCCA
ATTGTTGACC TTAATGGAGC GGCAGTCGGA TCTGATGGCA CGCTTGCGTA TCTCGAAGGC
GACTCATCTG CGTTCATAGC GCCCGTAGGT ACTATTACGA GTCTCGAGGC AACCATGGCG
TCTCTGGTTA TAACATTATC CGGAGTATTG GATGATGCTC ATGAGAGCTT GTCTTTTACC
GGCACTCTTC CTACCGGGGT TACTTCTTCA GTCTCGTCTT CTGATGGTAC GTATGTTCTG
ACTCTCTCCG GGTCAAAGGC CATTGCGGAC TACCAGGCGC TTCTCCGGAC AATTATCTAT
ACCAATACAT CTGAAAGCCC TGATGTAACG GATGTCAGAA CAATTACTGT TTATGCCACC
AATACTGGTG GTGTTGAAGG TATTTCTCAG GAGGTAGCTG TAACTGTTGC TGGTGTCAAT
GATGCTCCAG TCATCACTCC GGTTGTCGTT ACTGGAACGG TGGTTGAGGA TGTGGTTTTT
TCTGCAAGCG GGTCAATGAC CTATGCGGAT GTTGATGTGA ACACTGACAC GATGACTGTT
ACGATTTCTG AGTCGACAGT GACATCGTTC GATCAAGAAT TGACAGCCGA ACAGATAGCT
GCCATCAAGA GTGCTTTTTC AATTGTTCCA GAGAGCGGTA CCACGGGGGT CACTCCAGCC
GGCAGTGTCT CCTGGACCTA TGTTGTAGGT GCTCATGAGC TTGATTTCCT TGCCTATAAT
GTAACAGTGG CCGCTGTGTT TACGATTTCT GTGGACGACG GCCATGGAGG GATCGACGCC
CAGAGTGTTA CAATCTCAAT TACGGGTACC GATGATGCTC CGGTGATTAC TGTGGCTGAG
GGTGATGCAT CAATAGGCAT CGTATCGGAG GACGTTGCTG TAGTTGTCGA TAATCCGAAT
ACGGTTGATG TAGAAAACGG CGGATATATT GTGGCAAGCG GGATGCTTTC ATACAGTGAT
GCTGATGCTG CCGACGAACT GAGCATCGTG TCTTACGCGC AGCAGGGTGG TGCCACCTCC
AGTGAGGGTG TTGTTGTCAC ATCGGCGCTT GCAGTAGCCC TGTCCTCTGC TCTTGTGGTG
GGTTCGATTT CCGGGAATAG CGGTGACTTT GACTGGAGCT TCGCCCTAGA TAATTCCCTC
GTACAGTACC TCGGCGCAGG GGATTCTGTG ACAGTCAACT ATCTGATTAC GATTGCCGAC
AACAGTGGCG CTGCTGATAA CGCAGCTCAG CTCCTTAGCA TTACTGTCAA CGGGACTAAT
GATACGCCGG TGATTAGCTT TGCTTTGGGC AATGATGCCG GAGCTGTTGC CGAGGATGGT
GCTGAAAGCC TTACCGCTGG TGGGACTGTC TCCTTTTCTG AAGTTGATGC GTATGACGAA
TTGTCATCCA GCGTCGACTT GACTTCAATC GAGTGGAGTG CTACAGATGC TGGCGGTGCG
CATATCCCGT TGCCGGATTC TTTCGCCACT GCACTTGAAG GTGCAATGTC GATTGTACAG
AGCGGAATAA ATGATGGATC AATTGGATGG AACTTTGAGT TAGAGAACAA TCTGACGCAG
TTCCTGGCAA AGGATGAGGT CGTAACAGCC GTGTTCACCA TCACGGTGGA TGACGCCAAG
GGTGGCACAG ACACCCAGGA CGTGACCATC ACGCTGACAG GCTCCAACGA CGCACCGGTA
ATCACGGTGG GTGAAGACGG GAGCATAGCT GACTTACTCA CTGAAGCTGA CGGCGCACTG
AAAGCCGATG GTACGCTGAG TGTCGAGGAT CTCGACACCA CCAACTCGGT TGCAGTGAGC
GTAGACTCGG TAGCTGCCTC TCAGCTAGAC GGTTCCGGTG TAGCCATGGC CAGAGACAGC
AGTGAACCGG CCAGTGCAGA CTTGCTTGCA ATGCTGACGG CCACAGCGGA CCCGATTGAC
GGCACAGTAA CGACCGGGGA TATTGCCTGG GCATTCAACA GCGGTAGCGA AGCCTTCAAC
TACCTTGCAG CCGGAGAGAA ACTGGTGTTG ACCTACACGC TGACAGCCAG CGATGGAACA
GCCAGCGACG ACCAGACTGT GACGATAACG ATCACTGGCA CGAACGACAT TCCAAGTGTT
GACGTGACGG ATGTTAAGCC CATTTTGGAG TCCACTGATG CCCATGCACA GGTTCTTGCG
GATAGCGGTA CAGTGACCTT CAATGATATT GATAGCACTG ACCTGATAGA CGTCACATGT
GCCTATAATG AAGATATTGT ATGGAGCGGA GGTACAATCA ATGGAGCTCT TGCAGTTGCA
CTTGTTGACG GGTTCAGTGT CGATACCGGA GACAATCTCG AAGCTCCGGG GAACACAGCT
TGGAATTATA GTGTAGCTGG TGTTGATCTT GACTTCCTTT CCGAGGGGGA GACAATCACT
TTTAGTTATA CAGTGACGGC AACGGATACA CAAAGTGCAA GCAGCACCGA CACCGTAGAA
ATCACGATCA GCGGCACGAA CGATGTGCCG GTGGTAACCA ACGAAGCAGA AGCGCTTGCA
GGAGAAGTCG TTGAAGCCGG TAACCTGGAT GACGGAGAAG TAAGCGCCGG TACGGTAAGT
GCTACAGGAA CGCTGAGCTC GAGTGATGTT GATGCCAGTG CCACCGCCAC CTGGAGCCTG
CTGGGCACCC CGAGTACCAC ATACGGCACA ATGGCCATCG ACAGCGCCAG CGGCTTCTGG
ACCTACAGCC TGGACAACAG CCTTGAGGCA ACGAAGGCAC TGGATGAAGG CGAGAGTGCA
ACCCAGACCT ACACCGCCCG GGTCACGGAC GACAAGGGAG CCTATGTTGA CCAGACCATC
ACGATCACGA TCAGCGGCAC GAACGATGTG CCGGTGGTAA CCAACGAAGC AGAAGCGCTT
GCAGGAGAAG TCGTTGAAGC CGGTAACCTG GATGACGGAG AAGTAAGCGC CGGTACGGTA
AGTGCTACAG GAACGCTGAG CTCGAGTGAT GTTGATGCCA GTGCCACCGC CACCTGGAGC
CTGCTGGGCA CCCCGAGTAC CACATACGGC ACAATGGCCA TCGACAGCGC CAGCGGCTTC
TGGACCTACA GCCTGGACAA CAGCCTTGAG GCAACGAAGG CACTGGATGA AGGCGAGAGT
GCAACCCAGA CCTACACCGC CCGGGTCACG GACGACAAGG GAGCCTATGT TGACCAGACC
ATCACGATCA CGATCAGCGG CACGAACGAT GTGCCGGTGG TAACCAACGA AGCAGAAGCG
CTTGCAGGAG AAGTCGTTGA AGCCGGTAAC CTGGATGACG GAGAAGTAAG CGCCGGTACG
GTAAGTGCTA CAGGAACGCT GAGCTCGAGT GATGTTGATG CCAGTGCCAC CGCCACCTGG
AGCCTGCTGG GCACCCCGAG TACCACATAC GGCACAATGG CCATCGACAG CGCCAGCGGC
TTCTGGACCT ACAGCCTGGA CAACAGCCTT GAGGCAACGA AGGCACTGGA TGAAGGCGAG
AGTGCAACCC AGACCTACAC CGCCCGGGTC ACGGACGACA AGGGAGCCTA TGTTGACCAG
ACCATCACGA TCACGATCAG CGGCACGAAC GATGTGCCGG TGGTAACCAA CGAAGCAGAA
GCGCTTGCAG GAGAAGTCGT TGAAGCCGGT AACCTGGATG ACGGAGAAGT AAGCGCCGGT
ACGGTAAGTG CTACAGGAAC GCTGAGCTCG AGTGATGTTG ATGCCAGTGC CACCGCCACC
TGGAGCCTGC TGGGCACCCC GAGTACCACA TACGGCACAA TGGCCATCGA CAGCGCCAGC
GGCTTCTGGA CCTACAGCCT GGACAACAGC CTTGAGGCAA CGAAGGCACT GGATGAAGGC
GAGAGTGCAA CCCAGACCTA CACCGCCCGG GTCACGGACG ACAAGGGAGC CTATGTTGAC
CAGACCATCA CGATCACGAT CAGCGGCACG AACGATGTGC CGGTGGTAAC CAACGAAGCA
GAAGCGCTTG CAGGAGAAGT CGTTGAAGCC GGTAACCTGG ATGACGGAGA AGTAAGCGCC
GGTACGGTAA GTGCTACAGG AACGCTGAGC TCGAGTGATG TTGATGCCAG TGCCACCGCC
ACCTGGAGCC TGCTGGGCAC CCCGAGTACC ACATACGGCA CAATGGCCAT CGACAGCGCC
AGCGGCTTCT GGACCTACAG CCTGGACAAC AGCCTTGAGG CAACGAAGGC ACTGGATGAA
GGCGAGAGTG CAACCCAGAC CTACACCGCC CGGGTCACGG ACGACAAGGG AGCCTATGTT
GACCAGACCA TCACGATCAC GATCAGCGGC ACGAACGATG TGCCGGTGGT AACCAACGAA
GCAGAAGCGC TTGCAGGAGA AGTCGTTGAA GCCGGTAACC TGGATGACGG AGAAGTAAGC
GCCGGTACGG TAAGTGCTAC AGGAACGCTG AGCTCGAGTG ATGTTGATGC CAGTGCCACC
GCCACCTGGA GCCTGCTGGG CACCCCGAGT ACCACATACG GCACAATGGC CATCGACAGC
GCCAGCGGCT TCTGGACCTA CAGCCTGGAC AACAGCCTTG AGGCAACGAA GGCACTGGAT
GAAGGCGAGA GTGCAACCCA GACCTACACC GCCCGGGTCA CGGACGACAA GGGAGCCTAT
GTTGACCAGA CCATCACGAT CACGATCAGC GGCACGAACG ATGTGCCGGT GGTAACCAAC
GAAGCAGAAG CGCTTGCAGG AGAAGTCGTT GAAGCCGGTA ACCTGGATGA CGGAGAAGTA
AGCGCCGGTA CGGTAAGTGC TACAGGAACG CTGAGCTCGA GTGATGTTGA TGCCAGTGCC
ACCGCCACCT GGAGCCTGCT GGGCACCCCG AGTACCACAT ACGGCACAAT GGCCATCGAC
AGCGCCAGCG GCTTCTGGAC CTACAGCCTG GACAACAGCC TTGAGGCAAC GAAGGCACTG
GATGAAGGCG AGAGTGCAAC CCAGACCTAC ACCGCCCGGG TCACGGACGA CAAGGGAGCC
TATGTTGACC AGACCATCAC GATCACGATC AGCGGCACGA ACGATGTGCC GGTGGTAACC
AACGAAGCAG AAGCGCTTGC AGGAGAAGTC GTTGAAGCCG GTAACCTGGA TGACGGAGAA
GTAAGCGCCG GTACGGTAAG TGCTACAGGA ACGCTGAGCT CGAGTGATGT TGATGCCAGT
GCCACCGCCA CCTGGAGCCT GCTGGGCACC CCGAGTACCA CATACGGCAC AATGGCCATC
GACAGCGCCA GCGGCTTCTG GACCTACAGC CTGGACAACA GCCTTGAGGC AACGAAGGCA
CTGGATGAAG GCGAGAGTGC AACCCAGACC TACACCGCCC GGGTCACGGA CGACAAGGGA
GCCTATGTTG ACCAGACCAT CACGATCACG ATCAGCGGCA CGAACGATGT GCCGGTGGTA
ACCAACGAAG CAGAAGCGCT TGCAGGAGAA GTCGTTGAAG CCGGTAACCT GGATGACGGA
GAAGTAAGCG CCGGTACGGT AAGTGCTACA GGAACGCTGA GCTCGAGTGA TGTTGATGCC
AGTGCCACCG CCACCTGGAG CCTGCTGGGC ACCCCGAGTA CCACATACGG CACAATGGCC
ATCGACAGCG CCAGCGGCTT CTGGACCTAC AGCCTGGACA ACAGCCTTGA GGCAACGAAG
GCACTGGATG AAGGCGAGAG TGCAACCCAG ACCTACACCG CCCGGGTCAC GGACGACAAG
GGAGCCTATG TTGACCAGAC CATCACGATC ACGATCAGCG GCACGAACGA TGTGCCGGTG
GTAACCAACG AAGCAGAAGC GCTTGCAGGA GAAGTCGTTG AAGCCGGTAA CCTGGATGAC
GGAGAAGTAA GCGCCGGTAC GGTAAGTGCT ACAGGAACGC TGAGCTCGAG TGATGTTGAT
GCCAGTGCCA CCGCCACCTG GAGCCTGCTG GGCACCCCGA GTACCACATA CGGCACAATG
GCCATCGACA GCGCCAGCGG CTTCTGGACC TACAGCCTGG ACAACAGCCT TGAGGCAACG
AAGGCACTGG ATGAAGGCGA GAGTGCAACC CAGACCTACA CCGCCCGGGT CACGGACGAC
AAGGGAGCCT ATGTTGACCA GACCATCACG ATCACGATCA GCGGCACGAA CGATGTGCCG
GTGGTAACCA ACGAAGCAGA AGCGCTTGCA GGAGAAGTCG TTGAAGCCGG TAACCTGGAT
GACGGAGAAG TAAGCGCCGG TACGGTAAGT GCTACAGGAA CGCTGAGCTC GAGTGATGTT
GATGCCAGTG CCACCGCCAC CTGGAGCCTG CTGGGCACCC CGAGTACCAC ATACGGCACA
ATGGCCATCG ACAGCGCCAG CGGCTTCTGG ACCTACAGCC TGGACAACAG CCTTGAGGCA
ACGAAGGCAC TGGATGAAGG CGAGAGTGCA ACCCAGACCT ACACCGCCCG GGTCACGGAC
GACAAGGGAG CCTATGTTGA CCAGACCATC ACGATCACGA TCAGCGGCAC GAACGATGTG
CCGGTGGTAA CCAACGAAGC AGAAGCGCTT GCAGGAGAAG TCGTTGAAGC CGGTAACCTG
GATGACGGAG AAGTAAGCGC CGGTACGGTA AGTGCTACAG GAACGCTGAG CTCGAGTGAT
GTTGATGCCA GTGCCACCGC CACCTGGAGC CTGCTGGGCA CCCCGAGTAC CACATACGGC
ACAATGGCCA TCGACAGCGC CAGCGGCTTC TGGACCTACA GCCTGGACAA CAGCCTTGAG
GCAACGAAGG CACTGGATGA AGGCGAGAGT GCAACCCAGA CCTACACCGC CCGGGTCACG
GACGACAAGG GAGCCTATGT TGACCAGACC ATCACGATCA CGATCAGCGG CACGAACGAT
GTGCCGGTGG TAACCAACGA AGCAGAAGCG CTTGCAGGAG AAGTCGTTGA AGCCGGTAAC
CTGGATGACG GAGAAGTAAG CGCCGGTACG GTAAGTGCTA CAGGAACGCT GAGCTCGAGT
GATGTTGATG CCAGTGCCAC CGCCACCTGG AGCCTGCTGG GCACCCCGAG TACCACATAC
GGCACAATGG CCATCGACAG CGCCAGCGGC TTCTGGACCT ACAGCCTGGA CAACAGCCTT
GAGGCAACGA AGGCACTGGA TGAAGGCGAG AGTGCAACCC AGACCTACAC CGCCCGGGTC
ACGGACGACA AGGGAGCCTA TGTTGACCAG ACCATCACGA TCACGATCAG CGGCACGAAC
GATGTGCCGG TGGTAACCAA CGAAGCAGAA GCGCTTGCAG GAGAAGTCGT TGAAGCCGGT
AACCTGGATG ACGGAGAAGT AAGCGCCGGT ACGGTAAGTG CTACAGGAAC GCTGAGCTCG
AGTGATGTTG ATGCCAGTGC CACCGCCACC TGGAGCCTGC TGGGCACCCC GAGTACCACA
TACGGCACAA TGGCCATCGA CAGCGCCAGC GGCTTCTGGA CCTACAGCCT GGACAACAGC
CTTGAGGCAA CGAAGGCACT GGATGAAGGC GAGAGTGCAA CCCAGACCTA CACCGCCCGG
GTCACGGACG ACAAGGGAGC CTATGTTGAC CAGACCATCA CGATCACGAT CAGCGGCACG
AACGATGTGC CGGTGGTAAC CAACGAAGCA GAAGCGCTTG CAGGAGAAGT CGTTGAAGCC
GGTAACCTGG ATGACGGAGA AGTAAGCGCC GGTACGGTAA GTGCTACAGG AACGCTGAGC
TCGAGTGATG TTGATGCCAG TGCCACCGCC ACCTGGAGCC TGCTGGGCAC CCCGAGTACC
ACATACGGCA CAATGGCCAT CGACAGCGCC AGCGGCTTCT GGACCTACAG CCTGGACAAC
AGCCTTGAGG CAACGAAGGC ACTGGATGAA GGCGAGAGTG CAACCCAGAC CTACACCGCC
CGGGTCACGG ACGACAAGGG AGCCTATGTT GACCAGACCA TCACGATCAC GATCAGCGGC
ACGAACGATG TGCCGGTGGT AACCAACGAA GCAGAAGCGC TTGCAGGAGA AGTCGTTGAA
GCCGGTAACC TGGATGACGG AGAAGTAAGC GCCGGTACGG TAAGTGCTAC AGGAACGCTG
AGCTCGAGTG ATGTTGATGC CAGTGCCACC GCCACCTGGA GCCTGCTGGG CACCCCGAGT
ACCACATACG GCACAATGGC CATCGACAGC GCCAGCGGCT TCTGGACCTA CAGCCTGGAC
AACAGCCTTG AGGCAACGAA GGCACTGGAT GAAGGCGAGA GTGCAACCCA GACCTACACC
GCCCGGGTCA CGGACGACAA GGGAGCCTAT GTTGACCAGA CCATCACGAT CACGATCAGC
GGCACGAACG ATGTGCCGGT GGTAACCAAC GAAGCAGAAG CGCTTGCAGG AGAAGTCGTT
GAAGCCGGTA ACCTGGATGA CGGAGAAGTA AGCGCCGGTA CGGTAAGTGC TACAGGAACG
CTGAGCTCGA GTGATGTTGA TGCCAGTGCC ACCGCCACCT GGAGCCTGCT GGGCACCCCG
AGTACCACAT ACGGCACAAT GGCCATCGAC AGCGCCAGCG GCTTCTGGAC CTACAGCCTG
GACAACAGCC TTGAGGCAAC GAAGGCACTG GATGAAGGCG AGAGTGCAAC CCAGACCTAC
ACCGCCCGGG TCACGGACGA CAAGGGAGCC TATGTTGACC AGACCATCAC GATCACGATC
AGCGGCACGA ACGATGTGCC GGTGGTAACC AACGAAGCAG AAGCGCTTGC AGGAGAAGTC
GTTGAAGCCG GTAACCTGGA TGACGGAGAA GTAAGCGCCG GTACGGTAAG TGCTACAGGA
ACGCTGAGCT CGAGTGATGT TGATGCCAGT GCCACCGCCA CCTGGAGCCT GCTGGGCACC
CCGAGTACCA CATACGGCAC AATGGCCATC GACAGCGCCA GCGGCTTCTG GACCTACAGC
CTGGACAACA GCCTTGAGGC AACGAAGGCA CTGGATGAAG GCGAGAGTGC AACCCAGACC
TACACCGCCC GGGTCACGGA CGACAAGGGA GCCTATGTTG ACCAGACCAT CACGATCACG
ATCAGCGGCA CGAACGATGT GCCGGTGGTA ACCAACGAAG CAGAAGCGCT TGCAGGAGAA
GTCGTTGAAG CCGGTAACCT GGATGACGGA GAAGTAAGCG CCGGTACGGT AAGTGCTACA
GGAACGCTGA GCTCGAGTGA TGTTGATGCC AGTGCCACCG CCACCTGGAG CCTGCTGGGC
ACCCCGAGTA CCACATACGG CACAATGGCC ATCGACAGCG CCAGCGGCTT CTGGACCTAC
AGCCTGGACA ACAGCCTTGA GGCAACGAAG GCACTGGATG AAGGCGAGAG TGCAACCCAG
ACCTACACCG CCCGGGTCAC GGACGACAAG GGAGCCTATG TTGACCAGAC CATCACGATC
ACGATCAGCG GCACGAACGA TGTGCCGGTG GTAACCAACG AAGCAGAAGC GCTTGCAGGA
GAAGTCGTTG AAGCCGGTAA CCTGGATGAC GGAGAAGTAA GCGCCGGTAC GGTAAGTGCT
ACAGGAACGC TGAGCTCGAG TGATGTTGAT GCCAGTGCCA CCGCCACCTG GAGCCTGCTG
GGCACCCCGA GTACCACATA CGGCACAATG GCCATCGACA GCGCCAGCGG CTTCTGGACC
TACAGCCTGG ACAACAGCCT TGAGGCAACG AAGGCACTGG ATGAAGGCGA GAGTGCAACC
CAGACCTACA CCGCCCGGGT CACGGACGAC AAGGGAGCCT ATGTTGACCA GACCATCACG
ATCACGATCA GCGGCACGAA CGATGTGCCG GTGGTAACCA ACGAAGCAGA AGCGCTTGCA
GGAGAAGTCG TTGAAGCCGG TAACCTGGAT GACGGAGAAG TAAGCGCCGG TACGGTAAGT
GCTACAGGAA CGCTGAGCTC GAGTGATGTT GATGCCAGTG CCACCGCCAC CTGGAGCCTG
CTGGGCACCC CGAGTACCAC ATACGGCACA ATGGCCATCG ACAGCGCCAG CGGCTTCTGG
ACCTACAGCC TGGACAACAG CCTTGAGGCA ACGAAGGCAC TGGATGAAGG CGAGAGTGCA
ACCCAGACCT ACACCGCCCG GGTCACGGAC GACAAGGGAG CCTATGTTGA CCAGACCATC
ACGATCACGA TCAGCGGCAC GAACGATGTG CCGGTGGTAA CCAACGAAGC AGAAGCGCTT
GCAGGAGAAG TCGTTGAAGC CGGTAACCTG GATGACGGAG AAGTAAGCGC CGGTACGGTA
AGTGCTACAG GAACGCTGAG CTCGAGTGAT GTTGATGCCA GTGCCACCGC CACCTGGAGC
CTGCTGGGCA CCCCGAGTAC CACATACGGC ACAATGGCCA TCGACAGCGC CAGCGGCTTC
TGGACCTACA GCCTGGACAA CAGCCTTGAG GCAACGAAGG CACTGGATGA AGGCGAGAGT
GCAACCCAGA CCTACACCGC CCGGGTCACG GACGACAAGG GAGCCTATGT TGACCAGACC
ATCACGATCA CGATCAGCGG CACGAACGAT GTGCCGGTGG TAACCAACGA AGCAGAAGCG
CTTGCAGGAG AAGTCGTTGA AGCCGGTAAC CTGGATGACG GAGAAGTAAG CGCCGGTACG
GTAAGTGCTA CAGGAACGCT GAGCTCGAGT GATGTTGATG CCAGTGCCAC CGCCACCTGG
AGCCTGCTGG GCACCCCGAG TACCACATAC GGCACAATGG CCATCGACAG CGCCAGCGGC
TTCTGGACCT ACAGCCTGGA CAACAGCCTT GAGGCAACGA AGGCACTGGA TGAAGGCGAG
AGTGCAACCC AGACCTACAC CGCCCGGGTC ACGGACGACA AGGGAGCCTA TGTTGACCAG
ACCATCACGA TCACGATCAG CGGCACGAAC GATGTGCCGG TGGTAACCAA CGAAGCAGAA
GCGCTTGCAG GAGAAGTCGT TGAAGCCGGT AACCTGGATG ACGGAGAAGT AAGCGCCGGT
ACGGTAAGTG CTACAGGAAC GCTGAGCTCG AGTGATGTTG ATGCCAGTGC CACCGCCACC
TGGAGCCTGC TGGGCACCCC GAGTACCACA TACGGCACAA TGGCCATCGA CAGCGCCAGC
GGCTTCTGGA CCTACAGCCT GGACAACAGC CTTGAGGCAA CGAAGGCACT GGATGAAGGC
GAGAGTGCAA CCCAGACCTA CACCGCCCGG GTCACGGACG ACAAGGGAGC CTATGTTGAC
CAGACCATCA CGATCACGAT CAGCGGCACG AACGATGTGC CGGTGGTAAC CAACGAAGCA
GAAGCGCTTG CAGGAGAAGT CGTTGAAGCC GGTAACCTGG ATGACGGAGA AGTAAGCGCC
GGTACGGTAA GTGCTACAGG AACGCTGAGC TCGAGTGATG TTGATGCCAG TGCCACCGCC
ACCTGGAGCC TGCTGGGCAC CCCGAGTACC ACATACGGCA CAATGGCCAT CGACAGCGCC
AGCGGCTTCT GGACCTACAG CCTGGACAAC AGCCTTGAGG CAACGAAGGC ACTGGATGAA
GGCGAGAGTG CAACCCAGAC CTACACCGCC CGGGTCACGG ACGACAAGGG AGCCTATGTT
GACCAGACCA TCACGATCAC GATCAGCGGC ACGAACGATG TGCCGGTGGT AACCAACGAA
GCAGAAGCGC TTGCAGGAGA AGTCGTTGAA GCCGGTAACC TGGATGACGG AGAAGTAAGC
GCCGGTACGG TAAGTGCTAC AGGAACGCTG AGCTCGAGTG ATGTTGATGC CAGTGCCACC
GCCACCTGGA GCCTGCTGGG CACCCCGAGT ACCACATACG GCACAATGGC CATCGACAGC
GCCAGCGGCT TCTGGACCTA CAGCCTGGAC AACAGCCTTG AGGCAACGAA GGCACTGGAT
GAAGGCGAGA GTGCAACCCA GACCTACACC GCCCGGGTCA CGGACGACAA GGGAGCCTAT
GTTGACCAGA CCATCACGAT CACGATCAGC GGCACACTGG ATAGGTATAC GCCGACAACC
CATGTGACAT TCTGGAAGGA TAATACCGCT GATGACGGTT ATAACCCTCT CGCACTTGAG
GGCGTCACGA CCGGTTATGT GCCTGATGTC ACTTCCAGTA CTGAGGACAC TATCGAGTTC
AGGGACATGT CCGTTGCACT CAATGACGCT GGCGCCTATA GGGACTATAC CATGGATGTG
TGGAAGGCAA ACAAGGTTGA GGATGTCAAC ACGTTTACGC TGAAATTCGA TCTCCCCACA
GGTACGACCT ATGAATGGGA CGCAGCGGAT GTGTTTACTA CTGGATGGAA CATCTCTGAA
TACCAGAATG ATCAGACGGT AACCATCGTC GGGTATTCAT TGACTGAAAC AATCGACGAC
GCTGCTGTTC AGTTGGGCAC GCTGACATTC AGTGGCAACC TCAACACCGA TACCCTCAAC
CTTACAGGCG GGGTGCTCAC TGCGGTTAAT CTAACCGACT TTAGTGAAGA CAGTACAGCA
ACAGGCACGA TTGTGCTGGA TCCTGTGATG ACGGATACTG ATGCCACTGG TGCAGACGCG
TTGGAAGCAG CAATTGACCA GGGTGGGTAC AGCTTGTTGT CCTCTGCGAG TATCGCAATT
GCAGATGGTA CGGTATCGAA CATCGACATG GTGATTGCGG CGATGATTGT GGCTGCCGAT
GACAGTGCTG AAGACTCAAA TCTTATTGGC AACTACACTC TCGCGCAGTT GTATGCGGCA
GATGTTGACC ACAGCGGTAC GGTTGATGCA ACAGATGTTG CATTGATCGG GCAGATGTCT
GCTGGCGTGA CGGAGGCTCC GGCCAATGAG TGGATTTTCG TGGCGGCTGA TGTTGCTGAT
GATGCCGCTA CGGGTACAAG CGTTGACTGG ACTAAAACCC TTACCGAGAT TGATCTGCAG
AGTAATGCAA CGGTCGAGCT GGTCGGTATC GTCAAGGGTG ATGTGAACGG CAGCTGGGTA
GGTTAA
 
Protein sequence
MSITLNEDYS ALSLSFLVSL LGDDSTILTL SVSAGHLILT DGGASPDVTI EGSGTDSVTV 
TASSKADLAD WLADMVNNSS IYITAFNSLE PVTLDYVIDN GTSDDITGTE QLVFTPVNDP
ACLDMNGSDE VGCTYVTKWI VGSVDPISIL DKDWVSGDPD GVSMITSAII TLSGRLDVIA
SEFLSTSLEG ATTYEGESGT ITIYGDSTRS IILSGDASLA DYMAVISGIV YQNTLPNTSK
TGDRVVTISM TDVDGIAANS SAISLLTPID VTLVNEGDRI FIDTGDGMVD SGLTVLMVKD
ATHVIASGEL PIFDSAFADG HYFLNFLAPT ADPDAEEWST DELEGVPTVL TTTIVVAKVP
IVDLNGAAVG SDGTLAYLEG DSSAFIAPVG TITSLEATMA SLVITLSGVL DDAHESLSFT
GTLPTGVTSS VSSSDGTYVL TLSGSKAIAD YQALLRTIIY TNTSESPDVT DVRTITVYAT
NTGGVEGISQ EVAVTVAGVN DAPVITPVVV TGTVVEDVVF SASGSMTYAD VDVNTDTMTV
TISESTVTSF DQELTAEQIA AIKSAFSIVP ESGTTGVTPA GSVSWTYVVG AHELDFLAYN
VTVAAVFTIS VDDGHGGIDA QSVTISITGT DDAPVITVAE GDASIGIVSE DVAVVVDNPN
TVDVENGGYI VASGMLSYSD ADAADELSIV SYAQQGGATS SEGVVVTSAL AVALSSALVV
GSISGNSGDF DWSFALDNSL VQYLGAGDSV TVNYLITIAD NSGAADNAAQ LLSITVNGTN
DTPVISFALG NDAGAVAEDG AESLTAGGTV SFSEVDAYDE LSSSVDLTSI EWSATDAGGA
HIPLPDSFAT ALEGAMSIVQ SGINDGSIGW NFELENNLTQ FLAKDEVVTA VFTITVDDAK
GGTDTQDVTI TLTGSNDAPV ITVGEDGSIA DLLTEADGAL KADGTLSVED LDTTNSVAVS
VDSVAASQLD GSGVAMARDS SEPASADLLA MLTATADPID GTVTTGDIAW AFNSGSEAFN
YLAAGEKLVL TYTLTASDGT ASDDQTVTIT ITGTNDIPSV DVTDVKPILE STDAHAQVLA
DSGTVTFNDI DSTDLIDVTC AYNEDIVWSG GTINGALAVA LVDGFSVDTG DNLEAPGNTA
WNYSVAGVDL DFLSEGETIT FSYTVTATDT QSASSTDTVE ITISGTNDVP VVTNEAEALA
GEVVEAGNLD DGEVSAGTVS ATGTLSSSDV DASATATWSL LGTPSTTYGT MAIDSASGFW
TYSLDNSLEA TKALDEGESA TQTYTARVTD DKGAYVDQTI TITISGTNDV PVVTNEAEAL
AGEVVEAGNL DDGEVSAGTV SATGTLSSSD VDASATATWS LLGTPSTTYG TMAIDSASGF
WTYSLDNSLE ATKALDEGES ATQTYTARVT DDKGAYVDQT ITITISGTND VPVVTNEAEA
LAGEVVEAGN LDDGEVSAGT VSATGTLSSS DVDASATATW SLLGTPSTTY GTMAIDSASG
FWTYSLDNSL EATKALDEGE SATQTYTARV TDDKGAYVDQ TITITISGTN DVPVVTNEAE
ALAGEVVEAG NLDDGEVSAG TVSATGTLSS SDVDASATAT WSLLGTPSTT YGTMAIDSAS
GFWTYSLDNS LEATKALDEG ESATQTYTAR VTDDKGAYVD QTITITISGT NDVPVVTNEA
EALAGEVVEA GNLDDGEVSA GTVSATGTLS SSDVDASATA TWSLLGTPST TYGTMAIDSA
SGFWTYSLDN SLEATKALDE GESATQTYTA RVTDDKGAYV DQTITITISG TNDVPVVTNE
AEALAGEVVE AGNLDDGEVS AGTVSATGTL SSSDVDASAT ATWSLLGTPS TTYGTMAIDS
ASGFWTYSLD NSLEATKALD EGESATQTYT ARVTDDKGAY VDQTITITIS GTNDVPVVTN
EAEALAGEVV EAGNLDDGEV SAGTVSATGT LSSSDVDASA TATWSLLGTP STTYGTMAID
SASGFWTYSL DNSLEATKAL DEGESATQTY TARVTDDKGA YVDQTITITI SGTNDVPVVT
NEAEALAGEV VEAGNLDDGE VSAGTVSATG TLSSSDVDAS ATATWSLLGT PSTTYGTMAI
DSASGFWTYS LDNSLEATKA LDEGESATQT YTARVTDDKG AYVDQTITIT ISGTNDVPVV
TNEAEALAGE VVEAGNLDDG EVSAGTVSAT GTLSSSDVDA SATATWSLLG TPSTTYGTMA
IDSASGFWTY SLDNSLEATK ALDEGESATQ TYTARVTDDK GAYVDQTITI TISGTNDVPV
VTNEAEALAG EVVEAGNLDD GEVSAGTVSA TGTLSSSDVD ASATATWSLL GTPSTTYGTM
AIDSASGFWT YSLDNSLEAT KALDEGESAT QTYTARVTDD KGAYVDQTIT ITISGTNDVP
VVTNEAEALA GEVVEAGNLD DGEVSAGTVS ATGTLSSSDV DASATATWSL LGTPSTTYGT
MAIDSASGFW TYSLDNSLEA TKALDEGESA TQTYTARVTD DKGAYVDQTI TITISGTNDV
PVVTNEAEAL AGEVVEAGNL DDGEVSAGTV SATGTLSSSD VDASATATWS LLGTPSTTYG
TMAIDSASGF WTYSLDNSLE ATKALDEGES ATQTYTARVT DDKGAYVDQT ITITISGTND
VPVVTNEAEA LAGEVVEAGN LDDGEVSAGT VSATGTLSSS DVDASATATW SLLGTPSTTY
GTMAIDSASG FWTYSLDNSL EATKALDEGE SATQTYTARV TDDKGAYVDQ TITITISGTN
DVPVVTNEAE ALAGEVVEAG NLDDGEVSAG TVSATGTLSS SDVDASATAT WSLLGTPSTT
YGTMAIDSAS GFWTYSLDNS LEATKALDEG ESATQTYTAR VTDDKGAYVD QTITITISGT
NDVPVVTNEA EALAGEVVEA GNLDDGEVSA GTVSATGTLS SSDVDASATA TWSLLGTPST
TYGTMAIDSA SGFWTYSLDN SLEATKALDE GESATQTYTA RVTDDKGAYV DQTITITISG
TNDVPVVTNE AEALAGEVVE AGNLDDGEVS AGTVSATGTL SSSDVDASAT ATWSLLGTPS
TTYGTMAIDS ASGFWTYSLD NSLEATKALD EGESATQTYT ARVTDDKGAY VDQTITITIS
GTNDVPVVTN EAEALAGEVV EAGNLDDGEV SAGTVSATGT LSSSDVDASA TATWSLLGTP
STTYGTMAID SASGFWTYSL DNSLEATKAL DEGESATQTY TARVTDDKGA YVDQTITITI
SGTNDVPVVT NEAEALAGEV VEAGNLDDGE VSAGTVSATG TLSSSDVDAS ATATWSLLGT
PSTTYGTMAI DSASGFWTYS LDNSLEATKA LDEGESATQT YTARVTDDKG AYVDQTITIT
ISGTNDVPVV TNEAEALAGE VVEAGNLDDG EVSAGTVSAT GTLSSSDVDA SATATWSLLG
TPSTTYGTMA IDSASGFWTY SLDNSLEATK ALDEGESATQ TYTARVTDDK GAYVDQTITI
TISGTNDVPV VTNEAEALAG EVVEAGNLDD GEVSAGTVSA TGTLSSSDVD ASATATWSLL
GTPSTTYGTM AIDSASGFWT YSLDNSLEAT KALDEGESAT QTYTARVTDD KGAYVDQTIT
ITISGTNDVP VVTNEAEALA GEVVEAGNLD DGEVSAGTVS ATGTLSSSDV DASATATWSL
LGTPSTTYGT MAIDSASGFW TYSLDNSLEA TKALDEGESA TQTYTARVTD DKGAYVDQTI
TITISGTNDV PVVTNEAEAL AGEVVEAGNL DDGEVSAGTV SATGTLSSSD VDASATATWS
LLGTPSTTYG TMAIDSASGF WTYSLDNSLE ATKALDEGES ATQTYTARVT DDKGAYVDQT
ITITISGTND VPVVTNEAEA LAGEVVEAGN LDDGEVSAGT VSATGTLSSS DVDASATATW
SLLGTPSTTY GTMAIDSASG FWTYSLDNSL EATKALDEGE SATQTYTARV TDDKGAYVDQ
TITITISGTN DVPVVTNEAE ALAGEVVEAG NLDDGEVSAG TVSATGTLSS SDVDASATAT
WSLLGTPSTT YGTMAIDSAS GFWTYSLDNS LEATKALDEG ESATQTYTAR VTDDKGAYVD
QTITITISGT NDVPVVTNEA EALAGEVVEA GNLDDGEVSA GTVSATGTLS SSDVDASATA
TWSLLGTPST TYGTMAIDSA SGFWTYSLDN SLEATKALDE GESATQTYTA RVTDDKGAYV
DQTITITISG TNDVPVVTNE AEALAGEVVE AGNLDDGEVS AGTVSATGTL SSSDVDASAT
ATWSLLGTPS TTYGTMAIDS ASGFWTYSLD NSLEATKALD EGESATQTYT ARVTDDKGAY
VDQTITITIS GTLDRYTPTT HVTFWKDNTA DDGYNPLALE GVTTGYVPDV TSSTEDTIEF
RDMSVALNDA GAYRDYTMDV WKANKVEDVN TFTLKFDLPT GTTYEWDAAD VFTTGWNISE
YQNDQTVTIV GYSLTETIDD AAVQLGTLTF SGNLNTDTLN LTGGVLTAVN LTDFSEDSTA
TGTIVLDPVM TDTDATGADA LEAAIDQGGY SLLSSASIAI ADGTVSNIDM VIAAMIVAAD
DSAEDSNLIG NYTLAQLYAA DVDHSGTVDA TDVALIGQMS AGVTEAPANE WIFVAADVAD
DAATGTSVDW TKTLTEIDLQ SNATVELVGI VKGDVNGSWV G
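
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the pipeline used to produce this record. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in its permitted start codons, and this gene starts with ATG). The helper names `clean` and `translate` are hypothetical. Whitespace is stripped first because the sequences are printed in blocks of ten characters.

```python
# Codon table built from the NCBI-style ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: not part of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTCTATCA CTCTCAATGA AGATTACAGC"))  # MSITLNEDYS

# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GGCAGCTGGG TAGGTTAA"))  # GSWVG
```

The two outputs match the first ten and last five residues of the protein sequence above.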