Gene Lcho_3739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3739 
Symbol 
ID6163824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4183129 
End bp4195824 
Gene Length12696 bp 
Protein Length4231 aa 
Translation table11 
GC content69% 
IMG OID641666512 
Productouter membrane adhesin like proteiin 
Protein accessionYP_001792758 
Protein GI171060409 
COG category[S] Function unknown 
COG ID[COG5276] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCGA CGCCGAACAA CAAGTCCTGG AAGAAGCACC ACACCGCCCC CGCGCCGCGC 
GCCTGGGCCC TGGAAGCGCG CCTGATGTTC GACGCCGCGG CGGTGGCCGA TGCGGTGCAT
CAGCTGAGCG CCGAGACCGA CACGCATGTG CTCGATCTGC AGGCCAGCAG CGCAGCGCAG
ACGACCACGG CGTCGGCCGT GGAAACCACG CCACACCCGA TCGAGGGCCT GTTCCGCATC
GCGACAGCCC CCGGCGATGT CGCACCGACG CTGCTCGCCA GTCAAGCCGA GGCGCAACGG
CTGCTGCAGG AATTCGCGCA GCGCCCGGAC GCACGCGAGC AGCTGTTCGC CCTGTTCAAC
GGCAACCAGG CCGAACCGTC GGCCGAGTGG ACCCGGGCGG CCGACGCCTA CCTGGCGGCC
CTGCGCAGCG GCGAGGTGTC CATCGAGGTC CAGCTGCGCT CGGCGGCCGA CCTGAAGGGC
AACATGGGCG CCTTTTCGGT CGACGGCGCC GACGGCCAGC CCGTCATCTA CCTGAACGCG
GACTGGGTGG CGAGCGGCGT CGCCACCGAC GCGCTGACCC GCGTGCTGGC GGAAGAATTC
GGCCACGGCA TCGACCACGC CCTCAACGGC AGCACCGATA CCACCGGCGA CGAAGGCGAA
GCGTTTGCCG CCGTGGCCTT GAATCTCGGC CTCGACCCGA CCCAACAGCA GCGCATCACG
GCCGAGGACG ACCACACCAG CCTCGTGCTG GACGGCCACG CCCTGACGGT GGAGCTGGCC
GGCACCGCGG AAGTCTCGGT GCCGTTTTCG GAGGGCTACA TCGGCACCGT CGGCACCAGC
ACGGGCAAGG CCAACAACAT CCTGAATTTC TCGACCCTGG GCATCACCCG GGCGTCGTTC
TTCCAGGACT CGACCACCGG CAGCTTCGGC GGCACCCAGG GCAACGACCT GTCGGGCGGC
ATCCGGCTCA CGCTCGCCTC CGGGCAGGTC ATCACCATCA ACGGCGCCAT CAACTGGCGC
GACACCGCCG GCTCGACGCT CTACGCCTTC GGCTTCATCC CCGATCCGGC CACGCCGAAC
ATCGCCATCA GCTACGGCAG CGGCCAGACC TACACGATCA CCAGCAGCAG CAACTTCGGC
CTCGAGACCA TCGGGGTGAC CTACAGCGTG GCCGACGGCA GCAACGTGTC GGGCAACGCG
GCCACATCCG GCCTGCTGAC CAGCCTGAAC ACCTACCTGG CAGAAGTCCA GGCCAGCGCC
CCCGGCGGCC CGGTCACCGT CACCAGCCTG AGCACCAGCG ACAGCACGCC GACACTCGGC
GGCACCGCCA CGCTCGGCGC CAACGAGACG CTCACCGTCA TCGTCAACGG CACGACCTAT
ACCACCAGCA CCGGCCTGAC GCTGGGCGCC GGCAGCACCT GGTCGCTGAC GATTCCGGAC
GCCAAGCTGC TGGCCAACGC GACCTATGGC GTCACGGCCA CGATCACCAA CGCATCCGGC
TACACGCTGA CCGACACCAC GAGCAGCGAG CTGATCGTCA ACACCGCCTT GCCGAGCAAC
GTCGCGCCGA CCGCCGATGC CGTCAGCACC AGCGGCACGG AAGACGCCGC GTCGATCACC
GTGGCGCTGT CGGCCACCGA CAGCGACGGC ACGGTCGCCA GCTACACCAT CGCCACCCTG
CCCGCCAACG GCACGCTCTA CACCGACGCG GCCCGCACCC AGGCGGTGCT GGCCGGCACG
CCGTTCAGCA CGTCGACGCT GTACTTCGTA CCCACCGCCC ACTGGAACGG CAGCACCAGC
TTCGGCTACG TCGCCACCGA CGACGGGGGC GCGAGTTCCA CCAGCACCAC GGCCAGCATC
ACGGTGAGCG CAGTCAACGA CGCACCCGTG GTTCTGGACG ATGCACAGAC CACGGCGGAG
AACACCGTTC TGCATGCGTC GGTCGTGCCG GCCACCGACG TCGACGCCCC GCCCGAGATC
CAGGACACCG GCACGCTCGA CTTCACCATC GCCAACCGCA CCTTCTCGTT CTTCGGCCCG
GAAACCGGCA GTAACGAGTT CAACGTCAAC GTCGGCAGCG GACCGACCGG CTTCGGCAGT
GCCGCGGCGA TGGCGGCGGC TTTCCAGGCC CATCCCAACT ACGCCCTGCT GCCGTACACG
ATCGGCGTCA ACGCCGCGGG CGACGGCCTG CAGCTGGACT TCAAGGTCAG CGGCAACTAC
GGCGGGCGGG GTCTCGAGAA ATGGGGCGAC GGCCCCTCGT GGTTGACCAC GCTGCGCGAA
GGCCAGGACC TGGTCTACTC CGTCGTCACC GACGTGCCGG CCGGCCAGGG CACGCTGTCG
TTCAACGCCG ACGGCAGCTA TGACTTCGAT CCCGGCACCG CCTTCGACGA CCTGGCGCCG
GGTGCATCAC GCAGCACCAC GTTCACCTAC ACCGCCACCG ACCCCGATGG CAGCGCCGCC
GTTGCCAGGA CCGTGACGAT CACGGTGACC GGTGCCAACG ACGCGCCCAC GGTCGCAGCC
AGTCTGGCCG ATGCCGCCGC CACGCAGGGC ACAGGCTTCT CGCACACCGT GCCGGCCGGC
GCCTTCGCCG ACGTCGACGT GGGCGACACC CGCAGCTACA CCGCCACGCT GGCCGACGGA
TCGGCGCTGC CGGCCTGGCT GAGCTTCGAC GCCGCCACCC GCACGTTCAG CGGCACACCG
GCCAATGCCG ACGTCGGCAC GATCAGCGTC AAGGTCACGG CCTTCGACGG CAGCAGCGCC
ACCGCCGACG ACACCTTCGA CATCGTCGTG ACCGACGTCA ACGATGCGCC CTCCGTCGCC
AATCCGATCG CCGACCAGGC CGCCACCGAG GACTCGCCGT TCTCGTTCAC CGTGCCGGCG
AACGCCTTCG CCGACGTCGA TGTCGGCGAC ACCCGCAGCT ACACCGCCAC GCTGGCCGAC
GGCGCCGCCC TGCCGGCCTG GCTGAGCTTC GATCCGGCCA CGCGCACCTT TTCCGGCACA
CCCGCCAACG CCGACGTGGG CACGATCAGC GTCAAGGTCA CCGCCACCGA CAGTGGCCAG
GCCACCGCCG ACGACACCTT CGACATCGTC GTGGCCAACG TCAACGACAC ACCGGTTCTG
GCCGATACCC CCCTCGCGCT CACCGTGGCC GAAGACGCCG GCACACCCGT CGGCGCGGTG
GGCTCGCTGA TCGGCGCCTT CACCGGCGGC AGCAGCGACG CCGACACCGG CGCGGCCAAG
GGCATTGCCA TCACGGGTGC CGACACCAGC AAAGGCAGCT GGTACTACAC GACGGACGGT
GGCGCCAACT GGCAGGCGCT CGGCGCAGTG AGCGCAACCA GCGCGCGCGT CCTGGCCGAC
GACGGCAACA CCCGGCTGTA TTTCAAGCCG GCAGCCCATG CGAACGGCGA CGTCACTGCA
GGACTCACGT TCAAGGCCTG GGACCAGAAC GGCGGACATG CCAATGGCAC AGCGAATGTC
GACACGTTGG GGGGGGCTGC GCTGATTGGC GGTTACAACA CGCCTGGGAC CTCCTTTGAC
GTGAAGCTTT CTGCCGACGG CACCAAGGCC TTCGTGGCGG ACACTTCTGG TGGACTGCAG
GTCATCGATG TCAGCAACCC GGCTGCCCCC ACCGTGCTGG GAAGTTATGG CAATGCGTCC
ACCTACTTCC TGGCGCTCTC GGCCGACGGC ACCAAGGCCT ATCTGGGCAA CGAAGCAAAC
GATTTCCTCA TTGTCGACAT CTCGAACCCG GCATCGCCGA CCCTCCTGGG CACGCTAGTC
ACAACGGGGT ATGCGTACGA AATCGCACTC TCGACGGATG GCACGAAGGC CTATCTGGCC
GACAGCGCCA GCCTCAAGAT CATTGACATC ACCAATCCTG CGGCCCCTGC GCTCATCGGG
AGCTTTGCGG AAGCGGGGGG GGGCGGGGCC TTCTTCGTCA CGCTGTCGCC CGATGGCACC
AAGGCCTTCG TGGGGAACAC TTCCAGCGGT CTGCAGATCT TGGATGTCTC GACGCCGGCC
GCTCCCACAC TGCTGGGAAC GTACGACACC CCAGGGACAG CCTACACAGT GACGCTCTCG
GCAGATGGCA CCAAAGCCTT CGTGGCTGAC ATGGCCAGCG GCTTGCAGAT CATCGACGTC
TCGAATCCTG CGGCGCCGAC CCTTCTCGGC ACCTACAACA CGACTGGATC GGCCTGGGAC
GTGCGGCTAT CGGCAGACGG CACCAAAGCC TATCTGGCTG ATGCAAGCAG CGGCCTGCTG
ATCATCGACA TCTCGAATCC TTCAGCCCCC ACGCTGCTCG GCACCTACAA CACAGCCGGA
AGCGCCTATG GCCTGACGCT CTCTGCCGAT GAAACCAAAG CCTATGTGGC CGACGGTGCG
AGCGGGTTGC AGATCATCTC GTTGACCACG TCCCCCACCG AATTCTCCAC GGCCACCGAC
ACCATCGCCG TCGCCATCAC CGCCGTCAAC GACGCACCGG TGGCCACGGG CAACGCCACG
CTGGCCGCCA TCGCCGAAGA CACCCCCAAC CCCGCCGGCG CCACCGTGGC CAGCCTGTTC
GGGGCCAACT TCAGCGACAG CACCGACCAG GTCAGCGGCG GCAGCAGCGC CCACACCCTG
GCCGGCATCG CGATCACCGG CTACACGGTC GACGCGGCGC AAGGTGCCTG GCAGTACAGC
ACCGACAGCG GCGCCCACTG GACCAGCGTG CCGGGCATCG GCGCCGAAAC CGGCGCCTTC
ACGCTGCAGG CCGCAACGCT GCTGCGCTTC CTGCCCGCGG CCGACTACAA CGGCCCCGCG
CCGACCCTCA CCACCCGCCT GATCGACAGC AGCACGACCG TCGCCGATGC CGCCACGCTG
GACGCCAGCA CCCACGGTGG CAGCACGGCC TTGTCCGACG CCACCGTCGC CCTGAACAGC
AGCGTCACGG CCGTCAATGA CGCCCCGTTG CTCACCGGCG ATCTGGCCGC CTCGGTCGCG
GTCGGCAACC GCTACACGAT CACCTCCGGC GACCTGGGCT ACACCGATCC GGACGACGGC
AATGCCGACA TCACGTTCAC CGTCAGCGCC CTGGGCAACG GCAGCATCGA AGTCGACGGC
ACGTCGGCCA CCCAGTTCAC CGGCACCCAG CTGGCGGCTG GCCAGGTTCG CTTCGTGCAC
GACGGCAGCA ACACCACCAG CGCCTCGTTC AGCGTCCGTG TGGAAGACGG CAACGAAGAC
AGCTCGACAC CCGCAGACAG CACCTTCAAC CTGATCGTCA CCCCTGTCAA CGTGGCCCCG
GTGATCACCA GCCACGGCGG CGACGCCACC GCCTCGGTGA ACTACGCCGA AAACGGCAGC
ACAGCCGTCA CCACGTTCAC GGCCACCGAC GCCGACAGCG GCGACACCCG CACTTTCAGC
ATCAGCGGGG GTGCCGATGC GGCCCTGTTC GACATCGGTG CCAGCACCGG CGCGCTGACC
TTCAAGGCAA GCCCGGATTT CGAGGGAACC GGGGACAACA GCTACGACGT CACGGTCAAG
GTCGCCGATG CGGCCGGTGC GTTCGATGAG CAGACGCTGA CGGTTCAGGT CACGAACGTG
AACGAGGCGC CCACGCTCGT GAATGCGATC GCCGACCAGG CCGCGACCGA GGACTCGCCG
TTCTCGTTCA CCGTGCCGGC CGATGCGTTT GCCGACGTCG ATGTCGATGT CGGTGACACC
CGCAGCTACG CGGCCACGCT GGCCGACGGC TCGGCGCTGC CCGCGTGGCT GAGCTTCGAT
GCGGCCACGC GCACGTTCAG CGGCACGCCG GCCAATGGCG ACGTGGGCAC GATCAGCGTC
AAGGTCACGG CCACAGACGG CAGCAACGCC TCCGCCGATG ACAGCTTCGA CATCGTCGTC
GCCAATGTCA ACGACGCACC CACCGTCGCA AACCCGATCG CCGACCAGGC CGCTACCGAG
GACTCGGCCT TCAGCTTCAC CGTGCCAGCC GATGCGTTTG CCGATGTCGA CGTCGGGGAC
ACCCGCGCCT ACACCGCCAC GCTGGCCGAC GGCTCAGCGC TGCCCGCCTG GCTCAGCTTC
AATCCGGCCA CGCGCACCTT CAGCGGCACG CCGGCCAACG CCGACGTCGG CACGCTCAGC
GTCAAGGTCA CGGCCACCGA CGGTGCGCTG GCCTCCGCCG ACGACAGCTT CGACATCGTC
GTCGCCAACG TCAACGACGC GCCGACGCTC GCACATGCCA TCGCCGACCA GGCCGCGACC
GAGGACTCGG CCTTCAGCTT CACCGTGCCG GCCGATGCAT TTGCCGATGT CGATGTCGGT
GACTCCCGCA GCTACGCGGC CACGCTGGCC GACGGCTCGG CGCTGCCCGC GTGGCTGAGC
TTCGATGCGG CCACGCGCAC CTTCAGCGGC ACGCCCGCCA ACGCCGACGT CGGCACGATC
AGCGTCAAGG TCACGGCCAC CGACGGCAGC AACGCCTTCG CCGACGACAG CTTCGACATC
GTCGTGGCCG ACGTCAACGA CGCACCCGCT GTCGCAAACC CGATCGCCGA TCAGGCCGCG
ACCGAGGACT CGGCCTTCAG CTTCACCGTG CCGGCCGATG TGTTCGCCGA TGTCGATGTC
GGTGACACCC GCAGCTACGT GGCCACGCTG GCCGACGGCT CGGCGCTGCC GGCTTGGCTG
AGCTTCAATC CGGCCACGCG CACGTTCAGC GGCACGCCGG CCAACGCCGA TGTCGGCACG
ATCAGCGTCA AGGTCACGGC CACCGACGGC AGCAACGCCT CCGCCGATGA CAGCTTCGAC
ATCGTCGTGG CCGACGTCAA CGACGCACCC GCTGTCGCAA ACCCGATCGC CGACCAGGCC
GCCACCGAGG ACTCGCCGTT CTCGTTCACC GTGCCAGCCG ATGCGTTTGC CGATGTCGAC
GTCGGTGACA CCCGCAGCTA CGCGGCCACG CTCGCCGACG GCTCGGCGCT GCCTGCGTGG
CTGAGCTTCA ACGCCGCCAC GCGCACGTTC AGCGGCACGC CGGCCAACGC CGACGTCGGC
ACGATCAGCG TCAAGGTCAC GGCCACCGAC GGTGCGCTGG CCAGCGCCGA CGACAGCTTC
GACATCGTCG TGGCCAACGT CAACGACGCA CCGACGCTCG TGAATGCGAT CGCCGACCAG
GCCGCCACCG AGGACTCGCC GTTCTCGTTG ACCGTGCCAG CCGATGCGTT TGCCGATGTC
GACGTCGGGG ACTCCCGCGC CTACACCGCC ACGCTCGCCG ACGGCTCGGC GCTGCCCGCC
TGGCTGAGCT TCGATGCCGC CACCCGCACC TTCAGCGGCA CGCCGGCCAA TAGCGACGTG
GGCACGATCA GCGTCAAGCT CACAGCCTTC GACGGTGCGC TGGCCAGCGC CGACGACAGC
TTCGACATCG TCGTCGCCGA CGTCAACGAC GCACCGACGC TCGTGAATGC GATCGCCGAC
CAGGCCGCGA CCGAGGACTC GCCGTTCTCG TTCACCGTGC CGGTCGATGC ATTTGCCGAC
GTCGATGTCG ATGTCGGTGA CACCCGCAGC TACGCGGCCA CACTGGCCGA CGGCTCGGCG
CTGCCGGCGT GGCTGAGCTT CGATGCGACA ACCCGCACCT TCAGCGGCAC GCCGGCCAAT
GGCGACGTGG GCACGATCAG CGTCAAGGTC ACGGCCACAG ACGGCAGCAA CGTCTCGGCC
GATGACAGCT TCGACATCGT CGTCGCCAAC GTCAACGACG CACCCACTGT CGCAAACCCG
ATCGCCGACC AGGTCGCAAC CGAGGACAGC CTGTTCTCGT TCACCGTGCC GGCCGATGCG
TTTGCCGATG TCGACGTCGG TGACTCCCGC AGCTACGCGG CCACGCTGGC CGACGGCTCG
GCGCTGCCGG CCTGGCTGAG CTTCGATGCG ACAACTCGCA CCTTCAGCGG CACGCCGGCC
AACGCCGATG TCGGCACGAT CAGCGTCAAG CTCACGGCCT TCGACGGTGC GCTGGTCTCC
GCCGACGACA GCTTCGACAT CGTCGTCGCC AATGTCAACG ACGCGCCGAC GCTCGCACAT
GCCATCGCCG ACCAGGCCGC GACCGAGGAC TCGCCGTTCT CGTTGACCGT GCCGGCCGAT
GCATTTGCCG ACGTCGATGT CGGTGACTCC CGCAGCTACG CGGCCACGCT GGCCGACGGC
TCGGCGCTGC CGGCCTGGCT GAGCTTCGAT GCCGCCACCC GAACGTTCAG CGGCACGCCG
GCCAACGCCG ACGTCGGCAC GCTCAGCGTC AAGTTCACGG CCACCGACGA CAGCAACGCC
TCGGCCGACG ACAGCTTCGA CATCGTCGTG GCCAACGTCA ACGACGCACC GACGCTCATG
AATGAGATCG CCGACCAGGC CGCGACCGAG GACTCGCCGT TCTCGTTGAC CGTGCCGGCC
GATGCATTTG CCGACGTCGA TGTCGGTGAC TCCCGCAGCT ACGCGGCCAC GCTGGCCGAC
GGCTCGGCGC TGCCGGCCTG GCTGAGCTTC GATGCCGCCA CGCGCACGTT CAGCGGCACG
CCGGCCAATG GCGACGTGGG CACGATCAGC GTCAAGGTCA CCGCCACCGA CGGCAGCAAC
GTCTCCGCCG ACGACAGCTT CGACATCGTC GTCGCCAACG TCAACGACGC GCCCACTGTC
GCAAACCCGA TCGCCGACCA GGCCGCGACC GAGGACTCGC CGTTCTCGTT CACCGTGCCG
GCCGATGTGT TTGCCGATGT CGACGTCGGG GACACCCGCG CCTACACCGC CACGCTCGCC
GACGGCTCGG CGCTGCCGGC CTGGCTGAGC TTCGATGCGA CAACTCGCAC CTTCAGCGGC
ACGCCGGCCA ATGGCGACGT GGGCACGCTC AGCGTCAAGG TCACTGCCAC CGACGGCAGC
AACGCCTCCG CTGACGACAG CTTCGACATC GTCGTCGCCA ATGTCAACGA CGCACCGACG
CTCATGAATG AGGTCGCCGA CCAGGCCGCG ACCGAGGACA GCCTGTTCTC GTTCACCGTG
CCGGCCGATG CATTTGCCGA TGTCGACGTC GGGGACACCC GCGCCTACAC CGCCACGCTC
GCCGACGGCT CGGCGCTGCC GGCCTGGCTG AGCTTCGATG CGACAACTCG CACCTTCAGC
GGCACGCCGG CCAATGGCGA CGTGGGCACG CTCAGCGTCA AGGTCACGGC CACCGACGGC
AGCAACGCCT CCGCCGATGA CAGCTTCGAC ATCGTCGTCG CCAACGTCAA CGACGCGCCC
ACTGTCGCAA ACCCGATCGC CGATCAGGCC GCGACCGAGG ACTCGGCCTT CAGCTTCACC
GTGCTGGCCG ATGCATTTGC CGACGTCGAT GTCGGGGACA CCCGCGCCTA CACCGCCACG
CTCGCCGACG GCTCGGCGCT GCCGGCCTGG CTGAGCTTCG ATGCCGCCAC CCGCACCTTC
AGCGGCACGC CGGCCAATGG CGACGTGGGC ACGATCAGCG TCAAGGTCAC GGCCACAGAC
GGCAGCAACG CCTCCGCCGA TGACAGCTTC GACATCGTCG TCGCCAATGT CAACGACGCA
CCCACCGTCG CAAACCCGAT CGCCGACCAG GCCGCTACCG AGGACTCGGC CTTCAGCTTC
ACCGTGCCAG CCGATGCGTT TGCCGATGTC GACGTCGGGG ACACCCGCGC CTACACCGCC
ACGCTGGCCG ACGGCTCAGC GCTGCCCGCC TGGCTCAGCT TCAATCCGGC CACGCGCACC
TTCAGCGGCA CGCCGGCCAA CGCCGACGTC GGCACGCTCA GCGTCAAGGT CACGGCCACC
GACGGTGCGC TGGCCTCCGC CGACGACAGC TTCGACATCG TCGTCGCCAA CGTCAACGAC
GCGCCGACGC TCGCACATGC CATCGCCGAC CAGGCCGCGA CCGAGGACTC GGCCTTCAGC
TTCACCGTGC CGGCCGATGT GTTTGCCGAT GTCGACGTCG GGGACACCCG CGCCTACACC
GCCACGCTCG CCGACGGCTC GGCGCTGCCG GCCTGGCTGA GCTTCGATGC GACAACTCGC
ACCTTCAGCG GCACGCCGGC CAATGGCGAC GTGGGCACGA TCAGCGTCAA GGTCACGGCC
ACCGACGGTG CGCTGGCCTC CGCCGACGAC AGCTTCGACA TCGTCGTCGC CAACGTCAAC
GACGCGCCGA CGCTCGCACA TGCCATCGCC GACCAGGCCG CGACCGAGGA CTCGGCCTTC
AGCTTCACCG TGCCGGCCGA TGCATTTGCC GATGTCGATG TCGGTGACTC CCGCAGCTAC
GCGGCCACGC TGGCCGACGG CTCGGCGCTG CCCGCGTGGC TGAGCTTCGA TGCGGCCACG
CGCACCTTCA GCGGCACGCC GGCCAACGCC GACGTCGGCA CGCTCAGCGT CAAGGTCACG
GCCACCGACG GTGCGCTGGC CTCCGCCGAC GACAGCTTCG ACATCGTCGT CGCCAACGTC
AACGACGCGC CGACGCTCGC ACATGCCATC GCCGACCAGG CCGCGACCGA GGACTCGGCC
TTCAGCTTCA CCGTGCCGGC CGATGCATTT GCCGATGTCG ATGTCGGTGA CTCCCGCAGC
TACGCGGCCA CGCTGGCCGA CGGCTCGGCG CTGCCCGCGT GGCTGAGCTT CGATGCGGCC
ACGCGCACCT TCAGCGGCAC GCCCGCCAAC GCCGACGTCG GCACGATCAG CGTCAAGGTC
ACGGCCACCG ACGGCAGCAA CGCCTTCGCC GACGACAGCT TCGACATCGT CGTGGCCGAC
GTCAACGACG CACCCGCTGT CGCAAACCCG ATCGCCGATC AGGCCGCGAC CGAGGACTCG
GCCTTCAGCT TCACCGTGCC GGCCGATGTG TTCGCCGATG TCGATGTCGG TGACACCCGC
AGCTACGTGG CCACGCTGGC CGACGGCTCG GCGCTGCCGG CTTGGCTGAG CTTCAATCCG
GCCACGCGCA CGTTCAGCGG CACGCCGGCC AACGCCGATG TCGGCACGAT CAGCGTCAAG
GTCACGGCCA CCGACGGCAG CAACGCCTCC GCCGATGACA GCTTCGACAT CGTCGTGGCC
GACGTCAACG ACGCACCCGC TGTCGCAAAC CCGATCGCCG ACCAGGCCGC CACCGAGGAC
TCGCCGTTCT CGTTCACCGT GCCAGCCGAT GCGTTTGCCG ATGTCGACGT CGGTGACACC
CGCAGCTACG CGGCCACGCT CGCCGACGGC TCGGCGCTGC CTGCGTGGCT GAGCTTCAAC
GCCGCCACGC GCACGTTCAG CGGCACGCCG GCCAACGCCG ACGTCGGCAC GATCAGCGTC
AAGTTCACGG CCACCGACGG CAGCAACGCC TCGGCCGACG ACACCTTCGA CATCGTCGTG
GCCGACGTCA ACGACGCGCC CACCTGGTCC GACGTCGACA CGGCCGCAAC GGCAGCGCTC
ACGGCGCAGG ACACGGCCGT TACCGGTGTG CTGCCGGCCG CGGGCGATAC CGAGGGCGAC
ACCCTTTCGT ACGGCAAAGC CGCCGATCCC GCGCATGGCA GCGTCACCGT CAGCGCCGAC
GGGCACTATG TCTACACGCC GTCTGCCGGT TTCCACGGCA CCGATTCGTT CGAGGTCAGC
GTCGACGACG GCCACGGCGG GCGCAGCACG CTGACGGTCC GTGTCACGGT GCTGCCGGCG
CCCACGCTGG GCTTGCCGGC CGGGTCTGAT CTCGGCAGTT CAAGCACCGA CCGCATCACG
TCGGCCGCGG TGATTACGCT CGACGGCGCA GCCGCGGCCG GCCAGACGCT GCGGCTCTAC
GGCCCGCAGG GTCAGCTGAT CGCCACCGTG GCGACCGATG CGCAAGGTCG CTGGTCGGCG
GACCGCATCG ACCTGTCCGG CATGCAGGGG GATGACGCAG GCGCGGTCAA GGGTGCGGCA
GGCCGCTACA GCTTCAGCGT GCGCATGGTG TTGCCGTCGG GCGTGGAAAG CGCGCCGACG
CCGCTGACGG TGACACGCGA GATCCCGCTG GTGATCGAGG CCGCTGCGGC ACCGGCGCCC
GCGCCGATCC CCGAGGTTGC CGCGGCAGAA CCGGCTGCAG CCCCGGCTGC GGCGCCACAA
CCGGCGTTCG ACTCGGCGCT GGTGAGCACA CCGGTCACCG CACCGGTTGC CTCGTCGACC
GAAGCGCCGC GAGCGAGCAC GCCGCCGGTC ACGGGCCGCG ATGAATCGGT TGCGCCGCCA
CAGACCCCGA CGCAGCGTTC GTCCGCCGAC GGCGACATCT ACACCCGCTC GAGCGGCTTC
CAGGTGATGG TCACGCCCTC GAGCGAGCCG AGCCTGAAGC TGTTCAACGG CGTGCAGGAC
CAGGTCGTGC CGATGAACCG GCTGCTGATC GTGCAGGTGC CGGCCGATGC CTTCGTGCAC
ACGGTGCTGG CCGAGACGGT GACGCTGAGC GCCAGCCGCG CCGACGGCAC GCCGCTGCCG
GCGTGGCTGA GCTTCGACAG CAGGTCGGGC AAGTTCGTGG GCGAGCCGCC TGCAGGCCAG
GCGCAGGACC TGGCCATCCG CATCACGGCA CGCGACACCC AGGGCCGCGA GGCGACGACG
ATGTTCCGCG TCAAGGTCAC CGAGGCAGCC GGCAACGGCG TCAGCGGCCG GGCCAGCTTC
AATCAGCAAC TGGCTCGCGG CGAGGCGCTG GTCTTCAAGC CCGGTCAGCG CGCCTGGCAG
GCCCAGCCGC GACCGGCCGT GATGCGCCGG GGCTGA
 
Protein sequence
MSSTPNNKSW KKHHTAPAPR AWALEARLMF DAAAVADAVH QLSAETDTHV LDLQASSAAQ 
TTTASAVETT PHPIEGLFRI ATAPGDVAPT LLASQAEAQR LLQEFAQRPD AREQLFALFN
GNQAEPSAEW TRAADAYLAA LRSGEVSIEV QLRSAADLKG NMGAFSVDGA DGQPVIYLNA
DWVASGVATD ALTRVLAEEF GHGIDHALNG STDTTGDEGE AFAAVALNLG LDPTQQQRIT
AEDDHTSLVL DGHALTVELA GTAEVSVPFS EGYIGTVGTS TGKANNILNF STLGITRASF
FQDSTTGSFG GTQGNDLSGG IRLTLASGQV ITINGAINWR DTAGSTLYAF GFIPDPATPN
IAISYGSGQT YTITSSSNFG LETIGVTYSV ADGSNVSGNA ATSGLLTSLN TYLAEVQASA
PGGPVTVTSL STSDSTPTLG GTATLGANET LTVIVNGTTY TTSTGLTLGA GSTWSLTIPD
AKLLANATYG VTATITNASG YTLTDTTSSE LIVNTALPSN VAPTADAVST SGTEDAASIT
VALSATDSDG TVASYTIATL PANGTLYTDA ARTQAVLAGT PFSTSTLYFV PTAHWNGSTS
FGYVATDDGG ASSTSTTASI TVSAVNDAPV VLDDAQTTAE NTVLHASVVP ATDVDAPPEI
QDTGTLDFTI ANRTFSFFGP ETGSNEFNVN VGSGPTGFGS AAAMAAAFQA HPNYALLPYT
IGVNAAGDGL QLDFKVSGNY GGRGLEKWGD GPSWLTTLRE GQDLVYSVVT DVPAGQGTLS
FNADGSYDFD PGTAFDDLAP GASRSTTFTY TATDPDGSAA VARTVTITVT GANDAPTVAA
SLADAAATQG TGFSHTVPAG AFADVDVGDT RSYTATLADG SALPAWLSFD AATRTFSGTP
ANADVGTISV KVTAFDGSSA TADDTFDIVV TDVNDAPSVA NPIADQAATE DSPFSFTVPA
NAFADVDVGD TRSYTATLAD GAALPAWLSF DPATRTFSGT PANADVGTIS VKVTATDSGQ
ATADDTFDIV VANVNDTPVL ADTPLALTVA EDAGTPVGAV GSLIGAFTGG SSDADTGAAK
GIAITGADTS KGSWYYTTDG GANWQALGAV SATSARVLAD DGNTRLYFKP AAHANGDVTA
GLTFKAWDQN GGHANGTANV DTLGGAALIG GYNTPGTSFD VKLSADGTKA FVADTSGGLQ
VIDVSNPAAP TVLGSYGNAS TYFLALSADG TKAYLGNEAN DFLIVDISNP ASPTLLGTLV
TTGYAYEIAL STDGTKAYLA DSASLKIIDI TNPAAPALIG SFAEAGGGGA FFVTLSPDGT
KAFVGNTSSG LQILDVSTPA APTLLGTYDT PGTAYTVTLS ADGTKAFVAD MASGLQIIDV
SNPAAPTLLG TYNTTGSAWD VRLSADGTKA YLADASSGLL IIDISNPSAP TLLGTYNTAG
SAYGLTLSAD ETKAYVADGA SGLQIISLTT SPTEFSTATD TIAVAITAVN DAPVATGNAT
LAAIAEDTPN PAGATVASLF GANFSDSTDQ VSGGSSAHTL AGIAITGYTV DAAQGAWQYS
TDSGAHWTSV PGIGAETGAF TLQAATLLRF LPAADYNGPA PTLTTRLIDS STTVADAATL
DASTHGGSTA LSDATVALNS SVTAVNDAPL LTGDLAASVA VGNRYTITSG DLGYTDPDDG
NADITFTVSA LGNGSIEVDG TSATQFTGTQ LAAGQVRFVH DGSNTTSASF SVRVEDGNED
SSTPADSTFN LIVTPVNVAP VITSHGGDAT ASVNYAENGS TAVTTFTATD ADSGDTRTFS
ISGGADAALF DIGASTGALT FKASPDFEGT GDNSYDVTVK VADAAGAFDE QTLTVQVTNV
NEAPTLVNAI ADQAATEDSP FSFTVPADAF ADVDVDVGDT RSYAATLADG SALPAWLSFD
AATRTFSGTP ANGDVGTISV KVTATDGSNA SADDSFDIVV ANVNDAPTVA NPIADQAATE
DSAFSFTVPA DAFADVDVGD TRAYTATLAD GSALPAWLSF NPATRTFSGT PANADVGTLS
VKVTATDGAL ASADDSFDIV VANVNDAPTL AHAIADQAAT EDSAFSFTVP ADAFADVDVG
DSRSYAATLA DGSALPAWLS FDAATRTFSG TPANADVGTI SVKVTATDGS NAFADDSFDI
VVADVNDAPA VANPIADQAA TEDSAFSFTV PADVFADVDV GDTRSYVATL ADGSALPAWL
SFNPATRTFS GTPANADVGT ISVKVTATDG SNASADDSFD IVVADVNDAP AVANPIADQA
ATEDSPFSFT VPADAFADVD VGDTRSYAAT LADGSALPAW LSFNAATRTF SGTPANADVG
TISVKVTATD GALASADDSF DIVVANVNDA PTLVNAIADQ AATEDSPFSL TVPADAFADV
DVGDSRAYTA TLADGSALPA WLSFDAATRT FSGTPANSDV GTISVKLTAF DGALASADDS
FDIVVADVND APTLVNAIAD QAATEDSPFS FTVPVDAFAD VDVDVGDTRS YAATLADGSA
LPAWLSFDAT TRTFSGTPAN GDVGTISVKV TATDGSNVSA DDSFDIVVAN VNDAPTVANP
IADQVATEDS LFSFTVPADA FADVDVGDSR SYAATLADGS ALPAWLSFDA TTRTFSGTPA
NADVGTISVK LTAFDGALVS ADDSFDIVVA NVNDAPTLAH AIADQAATED SPFSLTVPAD
AFADVDVGDS RSYAATLADG SALPAWLSFD AATRTFSGTP ANADVGTLSV KFTATDDSNA
SADDSFDIVV ANVNDAPTLM NEIADQAATE DSPFSLTVPA DAFADVDVGD SRSYAATLAD
GSALPAWLSF DAATRTFSGT PANGDVGTIS VKVTATDGSN VSADDSFDIV VANVNDAPTV
ANPIADQAAT EDSPFSFTVP ADVFADVDVG DTRAYTATLA DGSALPAWLS FDATTRTFSG
TPANGDVGTL SVKVTATDGS NASADDSFDI VVANVNDAPT LMNEVADQAA TEDSLFSFTV
PADAFADVDV GDTRAYTATL ADGSALPAWL SFDATTRTFS GTPANGDVGT LSVKVTATDG
SNASADDSFD IVVANVNDAP TVANPIADQA ATEDSAFSFT VLADAFADVD VGDTRAYTAT
LADGSALPAW LSFDAATRTF SGTPANGDVG TISVKVTATD GSNASADDSF DIVVANVNDA
PTVANPIADQ AATEDSAFSF TVPADAFADV DVGDTRAYTA TLADGSALPA WLSFNPATRT
FSGTPANADV GTLSVKVTAT DGALASADDS FDIVVANVND APTLAHAIAD QAATEDSAFS
FTVPADVFAD VDVGDTRAYT ATLADGSALP AWLSFDATTR TFSGTPANGD VGTISVKVTA
TDGALASADD SFDIVVANVN DAPTLAHAIA DQAATEDSAF SFTVPADAFA DVDVGDSRSY
AATLADGSAL PAWLSFDAAT RTFSGTPANA DVGTLSVKVT ATDGALASAD DSFDIVVANV
NDAPTLAHAI ADQAATEDSA FSFTVPADAF ADVDVGDSRS YAATLADGSA LPAWLSFDAA
TRTFSGTPAN ADVGTISVKV TATDGSNAFA DDSFDIVVAD VNDAPAVANP IADQAATEDS
AFSFTVPADV FADVDVGDTR SYVATLADGS ALPAWLSFNP ATRTFSGTPA NADVGTISVK
VTATDGSNAS ADDSFDIVVA DVNDAPAVAN PIADQAATED SPFSFTVPAD AFADVDVGDT
RSYAATLADG SALPAWLSFN AATRTFSGTP ANADVGTISV KFTATDGSNA SADDTFDIVV
ADVNDAPTWS DVDTAATAAL TAQDTAVTGV LPAAGDTEGD TLSYGKAADP AHGSVTVSAD
GHYVYTPSAG FHGTDSFEVS VDDGHGGRST LTVRVTVLPA PTLGLPAGSD LGSSSTDRIT
SAAVITLDGA AAAGQTLRLY GPQGQLIATV ATDAQGRWSA DRIDLSGMQG DDAGAVKGAA
GRYSFSVRMV LPSGVESAPT PLTVTREIPL VIEAAAAPAP APIPEVAAAE PAAAPAAAPQ
PAFDSALVST PVTAPVASST EAPRASTPPV TGRDESVAPP QTPTQRSSAD GDIYTRSSGF
QVMVTPSSEP SLKLFNGVQD QVVPMNRLLI VQVPADAFVH TVLAETVTLS ASRADGTPLP
AWLSFDSRSG KFVGEPPAGQ AQDLAIRITA RDTQGREATT MFRVKVTEAA GNGVSGRASF
NQQLARGEAL VFKPGQRAWQ AQPRPAVMRR G