Gene GSU1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1154 
Symbol 
ID2686838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1246831 
End bp1260972 
Gene Length14142 bp 
Protein Length4713 aa 
Translation table11 
GC content63% 
IMG OID637125828 
Productsurface protein 
Protein accessionNP_952207 
Protein GI39996256 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGAA AACGAGCCAA AAGGACACTT GGCGCCTTTA CCGGCAAACG CAGCACCTTC 
AGAAAACTGG TTTCTCTCAG CGCAGCACTG ACCCTCTCTC TCCCGCCCCA GGCCTTCCCC
GCCCAGATCG TTCCCGATGG CAGAACCGGC ACCTCCCTCA CCATCAGGGA CAACGTCACT
GACGTCACCA CCTCCACCGT CCGTGGCGCC AATGCCTACA ACTCTTTCCA GACCTTCGAC
GTCTACCGGG GTAACGTGGT CAACCTTCAC GTACCGGACA GCGCCGTCAA CCTCCTGAAC
CTGGTCCACG GCCAGGCAAG TACCATCGAC GGCATCCTCA ACGCTTACAA GGACGGCCGC
ATCGGCGGTA ACGTCTTCTT CGCGAACCCC TACGGCTTTC TGGTCGGCGC GTCGGGAAGC
GTCAACGTCG GCGCCCTGTC CGTCATGACG CCCACCACGT CATTCATGGA GAGCTTTTTC
CTGGCCCCCG GTGTCCCCTC TGAAAGTGCC GCCGCCATGA TGCTGAACGG CACAGTCCCC
ATAAACCCCG ACGGGCTCAT TTCGATCAGG GGGCAGATCA ATGCCCTCGG CACTATTCGC
CTCAGCGGCG GCAGCGTCGT CAACAGCGGA ACGATCACTT CCACCGGCAC CTACGGGACA
GCAGCCGACA CGGGTTCGCT CTTCAGAGCC ATGGTCAACG CGGACGGACT TGAAAGCGGC
AACAACATCG TCGCGCGCAA CGGCGGCATT GAAATTGTGG CTGCTCAAAG CGTCGAGAAC
TATGGATCAA TTTACAGCAA GGGCCAGTCG CTCCATCTCC AGGCCGGCAC GGAACTGATC
GTGGCGGATG GCGAAACGAT CTCCACCCGG AAGATCGAGG GCGATCCCTC CGATGCCGCC
GTGCACCTGT TGACCGATAA TTCGAGCGGC AACTCGGGCA ACCTGACCCT TGAGGCGCCA
ACCATCACCC TGGGCTCCGG CGCCCGGCTG CTGACTCATG CCAGCGGCGA CTACCTTGCC
GGCAACATCG AGCTCCTTGC CGGGCAAAAC ATCACTCTCA ATAACGGCGC CCGTCTCCTG
GCCGGCCATG CCAGCGACCC GGCCAAGGGA GGCGATGTGC TGCTCAAGGT ATCCGCTATC
AACGCCATCG GCGCATCACG CACCGCAGAC GCCGGCATCA GGGCAGTGAA CTCCGTCATT
CGCGGCCGGA ACGTGACCCT GTCGTCCATT GCGGACACAT CCCTGATCGT GCATCTGCTG
GAACAGAACC CGACCCTCTC GCTGGATGAG GCACAGGCCT ATCTGAACAG CGAACTGGAC
GACCTGGTGT CCGACGGACC CGGCGGCGAG TACCTGGCAG TCACGACCAG CGCCACGGCC
AAGACCGAGC TGTACGGCAC AACTATCGAA GGGACCGGCG CGGTAACCAT CGAGGCAAAG
GCCGGCGCCC GGGCCGGGTT CAAAAAGAAC GCCGTCGCCG AAGTGATCAT CGACGACCTG
CGGGACGCCG ACCAGGCCAC GGTTCTTGCG AAAAGCTACA TCAGGGGCAA TAAGGTCTCC
ATCACCAGCA CCGCCGACAC CTCGCTTACC TTCAACGTTC TTGGCAGTGT TCTGAAGCTG
ACCGACCAGA GTTGGTTGCC GGATCCGGTC ACGGGCGAGC TCCAGCTTCT CAATGATCAA
CTCTTCGACT TCAGCGAAAT ACCCCTGGTA TCCCTCTCAA CTGCCACGGC GCACACAACT
GTCGGCGGAG CCACCTTCAT CAGTGCCGGG GACACGCTTA CCATCAGCTC CGAGGGCATA
TCAGCGGCAA AACCCACCTT TTCCAGCCCC CTGCTCTTCT CGGCGGCATG GGGCGAATCC
ACCGTTGAAG CAAAAACCCT GGTGAATGGA ACAACGGAAC TGTCCGCCGC CAACAAAGCA
ACAGTCAAGG CGACCACCGA CGTCGAGATT AATGTGACTG CCGACGTAAA CTCTACCAAC
AAACCCGTTG ATGCCGTATT CGTTCATGCG AAGAATACCG CCGTCACCAC ATCCCTGGTG
GGAAACGATA CCACCACCAC AGCCGGGGCG GTTGAGGTGA ACGCAGCAGC CACGGCGGAC
ATCTCGGCCA ATGCCCTGGC CAAAAACGCC GGGGGAAGCG GCGTGGGCAT CGCCGTGGCG
GTCAATGAGT CGACCACGAC GACGACGGCA ACCCTTGGCG GCAACGTTAC CGCCGACGCC
GGCAACGTCA CGGTGAAAGC CACCACGGAC ATCACCAAAA ATAACACCGG CGCCGACGCG
GCGACGCTGG GCAACCCGAA CACCATCTCG GCCAGAATTA CCGACTTCCA GGCCGGGATC
AAGCGAAACG TCACCAAAGG CATCATCGAC GCCACCGGCC TGCTCAAGCC GGAAACCTCC
GAGCGCATTA CCGGCTTCAT CTTCCCGGGC ATCAAGGAAG GCACGTTCAA CCTCTCGGGA
GCGGTTACCT ACAGCAAGTC CGTGAATACC ACCACCGCGG CCATAGCCCC GGACGCCACG
GTACAGGCTC AGGGAAACAT CGACGTCACC GCCCGGATCG ATGATCGTCC CAACGCCAGC
GTCGGCTCCA AGGCCACCTC TACCGGCACT GCCATCGGCG GGGCGGCCGT TATCGCCGAC
TTCACCAACA ATGCCAGCGC CAGCATCGGG ACCGGCGCTT CGGTCGACGC CACGGGAAGC
CTGCTGGTCG ATGCCCAGAC CGTCGTCCCT TACCCCTGGC AGATCGATTG GGACTCGCCA
GTTACGATCC TGAATCATCT CCAGGACGGA ATTCTCGACC TGCTGCTCAC CTCCTACGCA
ATCAACTCGG CTGGCGGGAA GAGCGGCATG GGCCTGGCAG CGGCGGTCAG CGTCTTTAAC
CTGGAAAACA ATGCCAACGC ATGGATCGAC GAAGGGGCTA GGATCAACAC CGTTTTCGAC
AAAGACGCCA TGACCCTGCC GAACCAGATC GTCACCGTGC ACGCGAAAAA CGACATCAGC
ACCGTCAACG CGGTGGGGCT GCTGTCGAAG AAATTTCTCG GCACCAGCGG CGGCAAGGCG
GCAATCGGCG GCTCGGGCAA TATTATCGAC ATCAGGGAAA ACGCGACGGC AACCATCCGC
GGCGATTCCG TGGTGAAATC CGAGTCTACC ATCGATGTCA AGGCGGAGAA CGTGAACCAC
CTGGTCACCG TGACCGAGGC CGGCGGTTCG TCGGACCAGG TGGGAGTTGA AGGGGCGGTT
TCCATAAACA CCATCACCGG CGGGGCAGTG GCGGCCATCG ACGATGATGC CGATGTGGAC
GCCGGAGGCA ATATCAGCGT TGAGGCCAAG GGAACTGCGA AGACCATTTC CGTGGCCGGC
GGCGTGGTGG CCACCAAGGG CCAGGTCGGC ATCGGCTTCG CCGTGTCGCT CAACACCATC
GATACCGATG CTTCGGCTTA CATCGGCAAC TACGACCCGC TACAGCAGGA TGATGTACCT
GCCTTGGGCC AGGTCAGCAC TGACGGCTCT CTGACGGTCA AGGCAACCTC CTCCAATGAG
ATCGGGGCCT ATTCCGTGAC TGGAGCCCTT GCCACCAACA GCACGGCCCA GACCGAAGTC
CCCAAGGATG CTCAGGAAAC GAAGGATGGC GCCGGCAGCG TGGCCGGTTC CAGCGGCAGC
GGCAAAGGCA CGTTCGGCAT TGCCGTGTCG GGCGATGCGT CGGTAAACGA CATCACGTCC
GACACCCTGG CCTACATCAG CGACGGCGCC ACGGTAAGCC AGGCCGGCAA CGCCACCCTG
AGCGCCACCA ACACCCTGGC CGTCAACGCC CTAGCCGGCG CCGTAACGAT CTCCACTCAG
CAGGAAGGCA ACGGTCTGGC AGGTTCCTAC TCGCAGAACA CCCTGGGCGG CACGACCGCC
GCCTACCTGG ATGACGCATC GCTTACCATC AGCGGAGACC TGGACATGGA TGCCCAGGTG
AACGGTGAGA TCAATACCAT CTCGGCCTCG GTCCAGGGCA CAAAGGGGAA GGTGGGGGTT
GCCGGTTCGG TGTCGGTCAA TGAAATTACC AACACAACCC AGACATACCT GACCGGGAGT
ACGGTCCGAG GAGTCGAGGC TGTGGACCTG ACCGCCCGGG ACGACTCGAT CATCAAATCC
ATCGCCGGCG CCGTGTCCTA TGGCGGCAAG GCGGGAATCG GCCTCTCCTT CGCCTGGAAC
AGCCTGGACA ACCTGACCCA GGCGTATGTG GATACATCCG CCCTGACCGC CACTGGGGAC
ATCACCGTCT CGGCCACCAC CAACAACGCC ATCGACACCA TCTCCGCCGC CCTCGGGGCC
AGCACGGGGG ACATGGCCGG CGAGGCCGCG GTGTCGGTCA ACACCCTCAG CAACGAGACC
CATGCCTGGA TCTCCGGCCA GAACAACGGC AGCGGCGTCG AGTCAGTCGG CAGCATTTCC
CTTGCCGCCG ACGACCAGTC GCGGATATTT GCCATTGCCG GCGGGCTCGC CGCCACCTCC
GGCAAGGCGG CCTTCGGCCT CTCCTTTGCC TGGAGCGATG TGAGCAACAT TGTTGATGCC
GGCATCCGGA CCGGCGCAGA CGTGGAATCT ACCTCGGGTA ACGTGGAGGT TGCGGCCGAT
TCCACCACTC GGGTGCAAGC CTTTGCCGTG GGAGGGAGCT TCGCAAGCAA GGTGGGCATC
GGCGGATCAG TATCGGTGGC AGAAGGCACC AACAGCGTGA CCGCCACCAT TGACGGCACG
TCCGGCGTAA CCGCCGACGG CAACGTGCTG GTCACCGCTT CGGACGACGT GGATATCTTC
AGCCTGGCGG GCAACGTGGC TGGCGCGGGC AGCGCGGCCA TCGCCGTCGC CAACTCCACG
CTCGTCACTC ATAACCTCGT GGAAGCGACC CTTGGCGCCG GCGCCACGGT CAGTGCCCGC
GGCAACGGCA CTGCGGGCCG GATCTATACC GGCGACAAGG ACGCCTCGGG GAACCGGACA
AAAGAAGATG TAACCGGGCT GGCGGTTTCA GCCGCCTCCT TCGAAAATCT CCAGACCATC
GCTGCCGGCG GAGCCGGCGG CGGCAAGGTG GGCATTGCGG GTTCGGCCAC CGTCACCGTC
CTGGACGAAA AAACTTACGC CACCGTCGGC CAAGGCGCAC ACGTGAACGA TGCCGATGAC
GGCGATGCCG CCCAAAACGT CCTTATCCGG GCATCGGACA GGACCGGCCT GCTCGGCGTG
GCCGGGGCGG TGGCTTTCGG CGGATCGGGT GGAGTCGGAG CTGGGGCCGA CGTGGGGGTA
ATCACCAAGG ACACCGAGGC AAGCATCGCC TCGTCCGCCC AGACGCCCAC GACGGTCAAG
GCAAAGGGGA ACATCGTCGT CACGGCCGAC AGCAGCGAAG ATATCACCTC GGTGGCGGCC
TCGCTCTCTG CCGGCGGGTC GGCCGGCATC GCAGGGTCGG CATCGGTCTA TGATCTGGGG
CTGAACACCG CCGCCACCAT CGGCAACAGC GCCGTGGTGC GCGCCGATGG CAGTGTCGCA
GTCAGCGCCC ATGACGGCAC CGAGATGGAC ATGATCGCCG GGAACGGGGC CTTTGGGGGG
ACCGCCGGCG TCGGCGCCTC GGCGGCGGTG CAGGTGATCA CCAAGACCGT TATCGCCGCC
ATCGGCGAAC AGGCCGATGT AACGGGACGT GGAACAGGGG ATGGGGTCGT GGTTGCCGAC
GGCGGGTTTG CCGTGTCTTA CGGAGCCGAT TCCGGCGACG AGGGGGAAAT CCGCGCACCC
ACCACCAATG GCTCCGGCAG CGACAGTGGT GCCCTGACCG GCCAGCGCTC CGCACCCCCC
AGCACCAGGA CGGTCAATGG AGTTGCGGTG ACCGCCACGA ATCAGGACGA TATCGAATCG
ATCTCCGCAA CCGGCTCCAT CGCGGGGACA GCGGCAATCA CCCTGGCCGG CAACGTCAAC
GCCATCACAA CCACCACCTC AGCCACCATC GGCACGGGGG CCACGGTCAA CCAGGATACC
GCCGCAGCCG ACGCCGGACA GTCGGTCCTC GTGGCCGCCG GAAACGACTA CTACCATATG
GGAGTGGCCG GATCGGGCTC CGGTGCCGGG GCGGTAGGCA TCGGCGTCGG GGCCGATGTG
ACCGTGGCGA ACCTGACCAC CACGGCGGAC ATCGGCGAAG GCGCGCTGGT CAGTGCCGCA
AAGGACGTGG AGGTAAGCGC CCTGGCAGGA GAAGAGGTGC TCTCCATCTC CGCCAGCCTT
GGCGTGGCGG GTACGGTCGG CGTCTCCGGC TCCGTATCCG TCCTGTCCAT CGACAACACC
ACGAGTGCCG GCACCGGGAC GGACTCGATG GTGGACGCGG GCGGCAACGT GCGTATTGCG
GCCCGGGACG ACACCGAAAC CGACATGATC GCCGGCACCG TTGCCATCGG CATCGGGGGC
GCCGGAGTCG GCGGAGCAGT GGGAGTCACT TCGGTGAGCA AGGAAACGAC GGCGACCGTG
GGGAGCAATG CCACGGTGAA CGCCCGCGGC AACGACACCG GCTCCATGAC CGCCTATACC
GGCGATGACA GTGATACAAC CGGGCAAATC AGGGGACTTT CGGTGGAGGC GGCCTCTTCC
GAGGATATTT TCTCTGTGGC GGCCGCAGGT GCCGGCGGAT TCTACGCCGG CGTATCGGGT
GCGGTTACGG TGCAGACCGT TGATTCCGCT ACCCGCGCCT CCATCGGTAC CAATGCTTCC
GTCAACGAAG GGGTCACGGA TGGTCACGAT GAGCAGGATG TGAACGTGAG CGCCCGCAAC
AGCGCCAGAA CCAACGTCAT AACCGGCGCG CTGGGGGTCG GTGCGCTGGG AGCGGCCGGC
GGCGTCGACG TGGGCGCCAT CAGGAACGAC ACCAGCGCGT CCATCGCCGA TGGTGCCCAG
ATCTACGCCA ACCGGGACGT GGAGGTCAAT GCCCTCGCAA AAACCGAAAT CGATTCGGTC
GTGGTGAGTG CTGCCGGCGG TCTGGGAGCA ATCGCCGGAG GGGTTGCGGT CTATTCGGTG
GGGACCGGCC TGGAACAGGA GGCCAAAGAT CAGCTGAAAT CCGATGGCGG CGACTTCGCC
GACGTCAATA GCTATGCCGA CGACCAGGCC TCGGACAACA GTATCGGCAC ACTGCTCACC
GGCTCCGGAG ACAGCCGCAT CCGGTCCATC GCCGCCGATG CCCAAGCCAA GCGCTCGGAC
GTTGCCGTCA CTGATCAGTT GAATAATCAG ACCCCACGCG GCACTGCCGC CTTCATCGGC
GGGGCTGCAG TGGAGGCCGG ACGCCATGTG GACGTGTCTG CCAGGCGGAC GGTGGATGCC
GACATCCTCG CCGGCGCAGC CGCGGGGGGA GCTCTGGGCC TCGGGGCCGG TATCGGCATA
GTCAATGTCA GCGGTTCTAC CCAGGCGTAC ATCCTCGGTT CCGGCCGGGC CAATGCGGCA
GGCAACATCC TGGTTTCGGC CAACACCGAC GCCACGGGCA CCGTGGACGC CTATGTGGGC
ACGGGCGGCA TCGTCGCGGT CAACGCCGCG CTGGCGATCT ACAACGACAC GGCAAGCACC
TCCGCTTACC TGGGCGACGG CGGGGTAATC GACAGGGCCG ATCAGGTCGA CATCAACGCC
ACCGGTCTGC ATACCGTAAC GGCCCACACC TTCGGCGTCA GTGCCGGCGC CGCTGCCGGC
GGTCTCTCCC TGGCAAAGGC CCGTGTCGGC GGCACAATCG ATGCCTCGGT GGGCGAGGAT
ACCCGGATCG GCCAGGACAG CCGTAACACG GGCGATACCG TTGGCAGCCT GTCCGTGAGC
GCCCAGACCA TCACCAACGG CACGGCCCGT TCCGAAGCCG CTGCGGGCGG CATCTTGTCG
GGCCAGGGTT CCATTGCCAC GGCAGACATC GGCCCCACGG TAACTGCCGA GATCGGCAAC
CGGACCGCCG CCCGCGTCGA CACCGACGTG GCGGCAATGG CAACCGCCAC CGTCACGGGC
ACGGCAACGG CCAACGGGGC CAGCATTGGA GCCCTCGGCG TGGGGGTCTC CGAGGCAACG
GCTCGGACCA TGCCCGTGGT CAGGGCCACC ATCGGCGACG AGACGGTCAT TACTGCGGGA
CAGGATGTAA CGGTAAGCGG CACGGCGACC ACTACCGCCA CGACCCACGC TACTGCGTCT
GCCGGGGCGC TGATCGGCAT AGCCGGCACC ACTTCAACCG CGACCGGCCT GCCGCAGGTG
AACACTGCCA TCGGCAGCGG CAGCACCATT GAGGCGGACA GAGCCGTATC CGTCACCGCA
ACCACCACCA ACAGCGCATC TTCCGATGCC AGCGGCTGGA TCGGCGGATT GGCGGCCTTC
GGCTCCAACA CAGCCACGGC CGTCACCGCA GTGCCGCTCT ACGACGCCCA GGGGCGTTTC
ATCTCCTGGT TTGGCTCAAC GACAGGTGCT CTCATCGGCG GCAATACGGC CATAACCACC
GACAGCGTTA ACGTGGCCGC CACCTCGTCC AATGTCGCCC TGGCCGACTC CCGGGCCGGT
GCCGGCGGCG GCATAGCAGG GGTCACGACC AGGGCCGAAA CTATCCAGGT CAACACCACT
ACCGCTGCCA TCGCCGACAG CAGCGATGAC AATGCCCGAA AGATCCATGC AACGAATAGT
ATCGCCATTA CGGCAGATGC TCTGACTACC CTCAACGCTT TTGCCGACAG CTCAACGGCC
GGCCTCGTGG GGGTGAGCGG CGCCCGGACC GACAATGCCG CAACGTCCGT GGTGGAGGCG
TCGGTGGGAA CCAATAACGC CCTCGAAGCC GGGACGGACC TGGCTGTCCT GGCCCTGAAT
GAGATCGTAA AGTACAGCCC GCAACGCCCG GCCAACCTCA CATCAGGGGC GGGCGGGGTA
TTCGGCGGCG CCGCCGGGCA GAGTTCCACC TCCCTGAGCA CCTTCACCAC TGCTACCCTG
GCAGGCAATA CCATCAGCGG GTCGGACCAG ACGATCAGCG CCGGCGGCGA CCTGACGGTG
GCCGCTGAAA ACGCCGTGCT GGCCACAGAC ATGGCACAAC TTTCCGCTGG AGGGCTCATC
GCCGTCGCCG ACGTGCGATC GGCCATCACC AGCGACAACA CCGCCACGGC CACAATCGGA
GCCAATGCGA ACGTGCACGC CGGCAACGAC CTGAACGTTC TGGCCAAGAC CAACGCCAAC
GTCCAAACCA CCACCAACAC CAGTACTTGG GGATTCGCCG CCGGCGGCGA TGGGACCGCA
CTCAACACCG TTGTGGCGGA CAACGACGTG GTGGTCGGGA CGAATGCCGC GCTCAGTGCC
GACAACGACA TCGACCTCTT TGCCGGCCAG GGGCTCTCGG ATCTCCAGAA CAGTCTCATC
TCCCGGGCAG ACGCCCGTTC CTGGGTGTCG GGGGCCATCC CGGTCAGCGA CGTGACGGGA
TGGGCCTACC TCTATGACTT CAACGACATC CTTATCGACA CCGGCTCGAA CCTGAAGGCG
GGCCGAGACA TCAATCTGGG CGCCTTCAGC GGCCTTGCCA CCGTCGAGGG ATATGCCAAG
GCCAAAAAGA AATCATACCT CCTCTTCGGC ATACCCATCA CCATCTACAG CAACGGAAGC
CGGCGGAGCT GGTTCTTCAA CCACGAGGGT ACCGACGTCA GCGGCCCGTC CGTCACCGTG
AACGGCACCC TGGAAAGCGG ACTGAACCGC CACAAGACCC TGGTGATCGG ACCGGACGGG
ACGGTGGTCG GAGGTACCCT TACCTCGGCT GACTACGAGC AGACCACCAT TAACCTGCGG
GACAAGGTGC TGGACAAGAA GGCCAAACTC GACGCCAAGA TTGCCGAGAT CGACCCTTCC
GGCACGTATC CCAACCTGCC GGAAGGTGAC AAGATCTTGT ACGACGCCCT CAAGACCGAG
GTGCAGATCC TGGAGCAGAA ACTGGCAGAG TGGGAGGGGA AGAGCAACGC GGAACTTACG
GTTCCCCTGA TCGCGGTCAA GGACCTCATG ACCGGGTCGG GCGACATCAA TATCACCACA
ACGACCCTCA AGGGGACCGG GACCCTGAAG GTGCCGGGGA CCGACTTCAT GATCAGGATC
GACAACAACT CTCTGGCGCA TCTGGAGTTG AACGACCTGG AAATCCCCAA ATCTGCCAGC
GGCAATGTCA ATCTAAACGG CAAGGCCATC ACGTCCCACA GTTATGGCGG AGAGACGCTG
CAGATCGTTG CCGGACAGAA TCTGGGCCGG CGGATCGAGA TCTTCAACAA CGCCTATCTG
GATGACTTCC CCGGGGCCCT CACCCCGTCC GATATCGTGT TGAAGGGCGA CATCATCAAC
TACGGCGGCC GGGTTTCCAT CCGGAACAAC AGCGGCAGCG TCGCCGTCGG CGGCAACATT
ATCGCCGACG ACCTGGACAT GGTCATGCAG GGCGGCTTTG TCCGCGAGTG GCAGCCGGGC
CTCTACCAGC CGGGCCACCT GATTGCCGGC AACAACATCT ATATCAGCGG CGAGATACTG
GACATCAACG ACACCATCCA GAGCGGCATT CCTTACCGCT ACATCACTAT TCCCGAGTTC
GATCCCGAAA CACTCGGTCC GGACATGGTC ATCCCGACCA TCGGCGACGA GTTGAGCGTG
GCCAAGTGGG ACCCGGTAAA CCAGCGGATC GTCATCTACC GGGTGGACTT CGGCGGCGGA
AAGGTGGAAC TGTTCGGCAA TATCGTCAGC ACGACCGGCA CGGGCGCCCT GAAGGTCATG
GACGGCTACG GCGAAATTGT CATTGAGAAC CTCTCGTCCA GGGACGTGGT GATCAACACC
CTGGACGTGG GACCGCGGGT CGACGGCCAG ATCAAGATCG TCGACACCGG CCGCAAGTAC
ACTGACAACG GCATCTACGT GGGGGACAAC AACCAGCTCC TCACCCTGAT CACCGGCAAC
GGAGACGCCC TCAACGTGTC CCAGGGATAT CAGCTCTGGG ACGCGGCGAC CAGAAAATTT
ATCTATACCG AACTGGACAG CCACAAGGCT GACAGCCGGT CTTCATCCTA TGCACCCCAT
GCCGGCGCGC GCGTCGCTAC GTTCAGCGAC TGGACCATCA CCGAAAAGGA TGTGGCAGCA
GCTTGGGACA ACTGGTGGAC TACCCGCTTC ACCGCCGGTA ACTTCTGGCT CCAGTTCGCA
CTGATGGTGG ATGCGGGCAT GAAGCAGGCC ATTCTCGACC AGTTCGGCAA GAAAGCGGAC
AATCCCATCG GCATCGAATT CCTGGGCAAT CTGGACGAAG GGAGGATCAG CATCTTCAAC
AGCGGCGCCG CCGGACCCTC CGACATCTAC CTGAACGGCT CCATCAGGAA CACAGTCGGC
AACGTGAGCA TCAGGAACGA CCGGGGCGGA ATCTACTCTC TCAACGATAC CTATCTGGTC
ACCGGCAGGA ATATCGCCCT CACAGCCACG GAGGGGAGCA TCGGCACCTT GGACCAGGGG
ATCAGGACCG ACACGGTTGG CGGGAGTCTC AGGGCCACTG CCGGCGGCCT GATCAATGTG
GAAGAGGTGG AGGGAGACCT GGTGATCGAC ACCGTCACCA CCACGGGGGA TGTGCGTCTC
GTATCGGCAG GCTCACTCAG GGACGGTTCC GGAACCTCAC CGTCCATCAC CGGCACCAAC
ATCTCTCTCA CTGCCACGGC GGGCGGCATC GGAACCGGGG ACAACGCCCT CGTGGTCAAC
GCCGACGGCA TCCTGACCGC GGAGTCGCTC CACAGCATCT ACCTGACGGA AAAAGAGGGG
AATGCGCACA TCAACCGGAT TGCGTCCCGG GAAGGAGACG TGGTGCTTAC CGTGGACGGC
GGCCTTGAGG ATTACAACTT CAACGAGGGC CTCGATGACG ACACGAAGGA CAAGCTCCTG
ACAACTTGGG ACGACCTGAA ACTGACCGAC GACACGAAAG TCCAGCAATC CATCGATCAG
TACAAGGAGC AGAAGAAAAG CCAGTATCAG GCGGCACACC GCCTCTCCGA CAACGGTACC
CCCTTTGATC CGAGCGACGA CCAGTATGAC GCCACCTACG ATCCGTCATG GCAATACACC
CTGACCGCCA CGGAACAGAG TGAATTCAAC GAGGGCGTCT GGACCGCCGA TGAACTGCTC
AACGCCAAGA ACCTGACCAC CATCCCCGAA CTGGGCAAGA CCGAGGTTCT CATCGAAGAG
GCGAACGTCT CGGGCCGCAA CGTCACCATC GTGACAGGGG CCGGTGTCGG CAGCGTCCTT
GCCGACGAGG TGATTTCCGC CGACGCCATC AGCAACGGCA CGGTGACTCC CGACCAGCGG
ATCATGGTCG CCCGTGCCGA GAAGGACGAT ATCACCATCG ACAATGGCAA CCTGATCGTG
CAACTGAAGA ACGACGTGGA TGTGCGGGCC TCCCAGAGCG TGACCATCCA GTCCCGCGAC
CATGTCTACC TGGGAGCTGA AACGGATGTG AACATCGACC GGGTCGACGC AGGAAACGGG
GATATCCGCC TGAAGATCGT AGGCGGAATA ATCAACGGCC GCACCGATGA CGAGGCAAAC
CTGATCGGCA GGGATCTCAT CCTCGAAGCC TCTGCGGGTG GGGTCGGCTC CGCAGCCAGA
CCGCTGGTCA CCGACCTGTC ACTGGGAGGT GTGCTGACCG CCCGGGCGAG GGACGGTATC
TTCATCAGGG AAGCGGGCGG CGACATCTCG GCCGACAGCA TCATCAGCCA GAACGGCGCA
GTGGAACTGA CCGTCGCCAA CGGGTCGGCC GCCATCGGCC AAATCTCCGC ACCCGGACAT
GTCCTGCTCG AAGTATCCGG CAACATCGTT AACGGTCGGG ATGACAACGG GGTCAACATC
ATCGGCGACG ACCTGATCAT CGAGTCCTCG GCAGGGGGCG CCGGCACATC GGCGAACGCG
CTGGTCACTG ACCTGTCCGG CAACGGAGTC CTGACCGCCT GGGTCCGCGA CGATCTCTTC
CTGGAAGAAC GGAACGGCGA CCTCACCATC GATACTATCG CAAGCACAAA CGGCGCAGTG
GAACTGTCGG TCGCCGCCGG CTCGGCCATT GTCGGCGGGA TCACCGCCCC GCGCCGAATC
CGGATGACCG CATCCGCTAA CATCGTCAAC GGCCGTGACG ACGGCCGGGA GAACCTGATC
ACCGACGACC TGAGCCTTGA AGCGGCCGGC GGCAGTGTCG GCTCCGCGGA GAAGTTCATT
GTCTCCCGCC TGCGGCCGGC GGGCATACTG ACCGGGCTGT CACAGGACAG CTTCTTCCTT
GAAGAACAAG GCGGCGGACT CACCGTCGAC AGCGTTGTCA GCCAGACCGG GTCGGTGCAC
CTTACGGTGC CCGACGGGTC CGTCGACGCC GATCATATCT CCGCGCCCGG CACGGTTTCG
ATCCGGGCGA ACGGTCCGCT CCTCACGGTC CACCGTGTTG ACCCGACAGT ACTCGACGTG
CGCAACACCT TCTCCGGCGG GACCATCGTG GTGGGCCAAG CCGACGTTGC CGAATCGGTC
ATGGCCCGGG GCGACACGGT GCTCTTGGGA GAAATCCATC ACACGGGGAG CGGGACGCTT
CACTTCGACG TGGACGGCGG CAGCAAGACA ATGGCGGACA TGGTCCGAAT CGGGACCGAT
TCGAACACCG CCATCGACTT TGATCATCTC TCGTCCGACA CCGCCGTAAT AACCGCCGAT
GTGGACAATC TGAGCCTCTT CGATACCCGT ATCGGCAATC GGGGCGATTT CAGCAACAGC
CTGTATCACG TTATCGTCGG CAACCGGGAC AAAAGGGTTC GGCCGTGCCA CCTGCAACTC
TATGCCACGG AGCCTTTCTC CCTGACGATG ACTGCCGACA AACGCTTCAC GACCACCGCA
TTCGCTGTCA ACTATGATCC TCACTTCGTG GTCAACGGGT TCAGCACCGA GAACAGCGTG
GTCGGCACGA CCGAAAAGAT GATCTGGACC GGCAAGCGGC AGAACCGCCT GTACTACGAC
CCCATGGAAC CGGGATCCCG CCCATGGCAG CGGCACATGG CGCCCTCCGG ACATGACGCG
GTGGATATTC AGCCCGGAGC GGTAGGCATC GACGCAAGCG ATTCCCTGCT GGAAGCCGAT
ACGGTGAAGG TACTCACCGG CAATACTGGT GCAAACCGTT AG
 
Protein sequence
MNGKRAKRTL GAFTGKRSTF RKLVSLSAAL TLSLPPQAFP AQIVPDGRTG TSLTIRDNVT 
DVTTSTVRGA NAYNSFQTFD VYRGNVVNLH VPDSAVNLLN LVHGQASTID GILNAYKDGR
IGGNVFFANP YGFLVGASGS VNVGALSVMT PTTSFMESFF LAPGVPSESA AAMMLNGTVP
INPDGLISIR GQINALGTIR LSGGSVVNSG TITSTGTYGT AADTGSLFRA MVNADGLESG
NNIVARNGGI EIVAAQSVEN YGSIYSKGQS LHLQAGTELI VADGETISTR KIEGDPSDAA
VHLLTDNSSG NSGNLTLEAP TITLGSGARL LTHASGDYLA GNIELLAGQN ITLNNGARLL
AGHASDPAKG GDVLLKVSAI NAIGASRTAD AGIRAVNSVI RGRNVTLSSI ADTSLIVHLL
EQNPTLSLDE AQAYLNSELD DLVSDGPGGE YLAVTTSATA KTELYGTTIE GTGAVTIEAK
AGARAGFKKN AVAEVIIDDL RDADQATVLA KSYIRGNKVS ITSTADTSLT FNVLGSVLKL
TDQSWLPDPV TGELQLLNDQ LFDFSEIPLV SLSTATAHTT VGGATFISAG DTLTISSEGI
SAAKPTFSSP LLFSAAWGES TVEAKTLVNG TTELSAANKA TVKATTDVEI NVTADVNSTN
KPVDAVFVHA KNTAVTTSLV GNDTTTTAGA VEVNAAATAD ISANALAKNA GGSGVGIAVA
VNESTTTTTA TLGGNVTADA GNVTVKATTD ITKNNTGADA ATLGNPNTIS ARITDFQAGI
KRNVTKGIID ATGLLKPETS ERITGFIFPG IKEGTFNLSG AVTYSKSVNT TTAAIAPDAT
VQAQGNIDVT ARIDDRPNAS VGSKATSTGT AIGGAAVIAD FTNNASASIG TGASVDATGS
LLVDAQTVVP YPWQIDWDSP VTILNHLQDG ILDLLLTSYA INSAGGKSGM GLAAAVSVFN
LENNANAWID EGARINTVFD KDAMTLPNQI VTVHAKNDIS TVNAVGLLSK KFLGTSGGKA
AIGGSGNIID IRENATATIR GDSVVKSEST IDVKAENVNH LVTVTEAGGS SDQVGVEGAV
SINTITGGAV AAIDDDADVD AGGNISVEAK GTAKTISVAG GVVATKGQVG IGFAVSLNTI
DTDASAYIGN YDPLQQDDVP ALGQVSTDGS LTVKATSSNE IGAYSVTGAL ATNSTAQTEV
PKDAQETKDG AGSVAGSSGS GKGTFGIAVS GDASVNDITS DTLAYISDGA TVSQAGNATL
SATNTLAVNA LAGAVTISTQ QEGNGLAGSY SQNTLGGTTA AYLDDASLTI SGDLDMDAQV
NGEINTISAS VQGTKGKVGV AGSVSVNEIT NTTQTYLTGS TVRGVEAVDL TARDDSIIKS
IAGAVSYGGK AGIGLSFAWN SLDNLTQAYV DTSALTATGD ITVSATTNNA IDTISAALGA
STGDMAGEAA VSVNTLSNET HAWISGQNNG SGVESVGSIS LAADDQSRIF AIAGGLAATS
GKAAFGLSFA WSDVSNIVDA GIRTGADVES TSGNVEVAAD STTRVQAFAV GGSFASKVGI
GGSVSVAEGT NSVTATIDGT SGVTADGNVL VTASDDVDIF SLAGNVAGAG SAAIAVANST
LVTHNLVEAT LGAGATVSAR GNGTAGRIYT GDKDASGNRT KEDVTGLAVS AASFENLQTI
AAGGAGGGKV GIAGSATVTV LDEKTYATVG QGAHVNDADD GDAAQNVLIR ASDRTGLLGV
AGAVAFGGSG GVGAGADVGV ITKDTEASIA SSAQTPTTVK AKGNIVVTAD SSEDITSVAA
SLSAGGSAGI AGSASVYDLG LNTAATIGNS AVVRADGSVA VSAHDGTEMD MIAGNGAFGG
TAGVGASAAV QVITKTVIAA IGEQADVTGR GTGDGVVVAD GGFAVSYGAD SGDEGEIRAP
TTNGSGSDSG ALTGQRSAPP STRTVNGVAV TATNQDDIES ISATGSIAGT AAITLAGNVN
AITTTTSATI GTGATVNQDT AAADAGQSVL VAAGNDYYHM GVAGSGSGAG AVGIGVGADV
TVANLTTTAD IGEGALVSAA KDVEVSALAG EEVLSISASL GVAGTVGVSG SVSVLSIDNT
TSAGTGTDSM VDAGGNVRIA ARDDTETDMI AGTVAIGIGG AGVGGAVGVT SVSKETTATV
GSNATVNARG NDTGSMTAYT GDDSDTTGQI RGLSVEAASS EDIFSVAAAG AGGFYAGVSG
AVTVQTVDSA TRASIGTNAS VNEGVTDGHD EQDVNVSARN SARTNVITGA LGVGALGAAG
GVDVGAIRND TSASIADGAQ IYANRDVEVN ALAKTEIDSV VVSAAGGLGA IAGGVAVYSV
GTGLEQEAKD QLKSDGGDFA DVNSYADDQA SDNSIGTLLT GSGDSRIRSI AADAQAKRSD
VAVTDQLNNQ TPRGTAAFIG GAAVEAGRHV DVSARRTVDA DILAGAAAGG ALGLGAGIGI
VNVSGSTQAY ILGSGRANAA GNILVSANTD ATGTVDAYVG TGGIVAVNAA LAIYNDTAST
SAYLGDGGVI DRADQVDINA TGLHTVTAHT FGVSAGAAAG GLSLAKARVG GTIDASVGED
TRIGQDSRNT GDTVGSLSVS AQTITNGTAR SEAAAGGILS GQGSIATADI GPTVTAEIGN
RTAARVDTDV AAMATATVTG TATANGASIG ALGVGVSEAT ARTMPVVRAT IGDETVITAG
QDVTVSGTAT TTATTHATAS AGALIGIAGT TSTATGLPQV NTAIGSGSTI EADRAVSVTA
TTTNSASSDA SGWIGGLAAF GSNTATAVTA VPLYDAQGRF ISWFGSTTGA LIGGNTAITT
DSVNVAATSS NVALADSRAG AGGGIAGVTT RAETIQVNTT TAAIADSSDD NARKIHATNS
IAITADALTT LNAFADSSTA GLVGVSGART DNAATSVVEA SVGTNNALEA GTDLAVLALN
EIVKYSPQRP ANLTSGAGGV FGGAAGQSST SLSTFTTATL AGNTISGSDQ TISAGGDLTV
AAENAVLATD MAQLSAGGLI AVADVRSAIT SDNTATATIG ANANVHAGND LNVLAKTNAN
VQTTTNTSTW GFAAGGDGTA LNTVVADNDV VVGTNAALSA DNDIDLFAGQ GLSDLQNSLI
SRADARSWVS GAIPVSDVTG WAYLYDFNDI LIDTGSNLKA GRDINLGAFS GLATVEGYAK
AKKKSYLLFG IPITIYSNGS RRSWFFNHEG TDVSGPSVTV NGTLESGLNR HKTLVIGPDG
TVVGGTLTSA DYEQTTINLR DKVLDKKAKL DAKIAEIDPS GTYPNLPEGD KILYDALKTE
VQILEQKLAE WEGKSNAELT VPLIAVKDLM TGSGDINITT TTLKGTGTLK VPGTDFMIRI
DNNSLAHLEL NDLEIPKSAS GNVNLNGKAI TSHSYGGETL QIVAGQNLGR RIEIFNNAYL
DDFPGALTPS DIVLKGDIIN YGGRVSIRNN SGSVAVGGNI IADDLDMVMQ GGFVREWQPG
LYQPGHLIAG NNIYISGEIL DINDTIQSGI PYRYITIPEF DPETLGPDMV IPTIGDELSV
AKWDPVNQRI VIYRVDFGGG KVELFGNIVS TTGTGALKVM DGYGEIVIEN LSSRDVVINT
LDVGPRVDGQ IKIVDTGRKY TDNGIYVGDN NQLLTLITGN GDALNVSQGY QLWDAATRKF
IYTELDSHKA DSRSSSYAPH AGARVATFSD WTITEKDVAA AWDNWWTTRF TAGNFWLQFA
LMVDAGMKQA ILDQFGKKAD NPIGIEFLGN LDEGRISIFN SGAAGPSDIY LNGSIRNTVG
NVSIRNDRGG IYSLNDTYLV TGRNIALTAT EGSIGTLDQG IRTDTVGGSL RATAGGLINV
EEVEGDLVID TVTTTGDVRL VSAGSLRDGS GTSPSITGTN ISLTATAGGI GTGDNALVVN
ADGILTAESL HSIYLTEKEG NAHINRIASR EGDVVLTVDG GLEDYNFNEG LDDDTKDKLL
TTWDDLKLTD DTKVQQSIDQ YKEQKKSQYQ AAHRLSDNGT PFDPSDDQYD ATYDPSWQYT
LTATEQSEFN EGVWTADELL NAKNLTTIPE LGKTEVLIEE ANVSGRNVTI VTGAGVGSVL
ADEVISADAI SNGTVTPDQR IMVARAEKDD ITIDNGNLIV QLKNDVDVRA SQSVTIQSRD
HVYLGAETDV NIDRVDAGNG DIRLKIVGGI INGRTDDEAN LIGRDLILEA SAGGVGSAAR
PLVTDLSLGG VLTARARDGI FIREAGGDIS ADSIISQNGA VELTVANGSA AIGQISAPGH
VLLEVSGNIV NGRDDNGVNI IGDDLIIESS AGGAGTSANA LVTDLSGNGV LTAWVRDDLF
LEERNGDLTI DTIASTNGAV ELSVAAGSAI VGGITAPRRI RMTASANIVN GRDDGRENLI
TDDLSLEAAG GSVGSAEKFI VSRLRPAGIL TGLSQDSFFL EEQGGGLTVD SVVSQTGSVH
LTVPDGSVDA DHISAPGTVS IRANGPLLTV HRVDPTVLDV RNTFSGGTIV VGQADVAESV
MARGDTVLLG EIHHTGSGTL HFDVDGGSKT MADMVRIGTD SNTAIDFDHL SSDTAVITAD
VDNLSLFDTR IGNRGDFSNS LYHVIVGNRD KRVRPCHLQL YATEPFSLTM TADKRFTTTA
FAVNYDPHFV VNGFSTENSV VGTTEKMIWT GKRQNRLYYD PMEPGSRPWQ RHMAPSGHDA
VDIQPGAVGI DASDSLLEAD TVKVLTGNTG ANR