Gene Spro_4077 details

Gene Information

Locus tag: Spro_4077
Symbol:
ID: 5607034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4521585
End bp: 4532393
Gene Length: 10809 bp
Protein Length: 3602 aa
Translation table: 11
GC content: 61%
IMG OID: 640939638
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_001480300
Protein GI: 157372311
COG category:
COG ID:
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
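The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4521585, 4532393)
print(length, protein_length(length))  # 10809 3602
```

Both values agree with the Gene Length and Protein Length fields recorded for Spro_4077.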


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAC ATTGTTATCG CCTTATCTTC AGCCGCACCC ACGGGGAGCT GCGGGTGGTG 
TCCGAACTGG CCCGCAGCTG CAGCAGCGAG CCGGGGCAAC GCATCGGCTC AGGGATAACA
GGTGGAAGTC GTCTGTGGGT CACCGTGCGT CGGTCAGTAT GGCTGCTGGG GCTGTTGATG
TTCGCCGGAC CCGTCATGGC AGACGGCATT GTCGCGGACG GTGCTGCCAA TCCGGCCCAG
CGGCCGGAGG TCATCAATAC TCAGAACGGC TTGCCGCAGG TCAACATCAC CACCCCCAAC
CAGGCCGGCG TGTCGCACAA CCAGTATCAG CAGTTCGATG TTGACGCCAA AGGTGCCATC
CTCAACAACT CGGCGGTGAT GACCTCGACC CAGATGGCCG GGATGATCCA GGGTAACCCC
AACCTCAATC CCAACGCTGC ACCGGCTCGC GTCATCCTCA ACGAAGTCAA CAGCAATAAT
CCCAGCCAGC TGCGCGGGTT TATGGAAGTG GCCGGCGGCA AGGCACAGGT GATTGTGGCC
AACCCCGCCG GCATTGTCTG CAACGGCTGC GGCACCATCA ACGCCGGGCG CATGACGTTG
ACCACCGGCA AGGCGCAGTT GAATGCTGAC GGCAGTGTGG CGGGCTATCA GGTTGAGCGC
GGCGTGGTGC GCATCGAGGG TGGCGGACTT AACGGGGATG CCCGTCATGA TACCGAGTAC
GTCGACATTC TGGCGCGTGC GGTGGAGGTC AATGCCGGCG TCTGGGCCAA AGAAGGCGTG
TCGGTGGTGG CCGGGCGCAA CCGCGTGAGT ACGGACGGTA AAACCGCGAC ACCGCTTTCG
GACGACGGCA GTGCCAGACC TGAGCTTGCC ATCGACATGG GCCAGATGGG CGGCATGTAC
AGCGGCAGTA TCCGCATGAT TGGCACCGAA GCCGGCGTCG GTGTGCGTAA TCAGAGCGGA
CAGGTCCGGG CAGGCAAAAC GCTCACCGTC AGCAGTGAAG GTAAGCTCAG CTGGCGGTCT
GATGCACCGG ATGCCGCCAC ACAGGCCGGT GGCGATATCA GCCTGGCGGC GAAAGGTGAC
ATCGAGACGC ACGGCAAAGT GCACAGCGGC GGCCAGCTTG CCGTGCAGAG TCGTGAGGGA
ATGTTAACGC AATCCGGTAC GCTGGCGGCG GCTGGGAATG TGCACCTGCA TGCCGCACGC
GGCATCCAAA GCAGTGGCCA CCTGCTGGCG GGCAGTGATG CCAACAGCCA GATTGTGCGT
GATGCCAGCC TGCAGTTGGA CAGTCAGGGC GATATCCGCG CCAGCGGCAG TTTGCTGAGT
AAAAAGAACG TCAACGCGTC CGGACGCCGG GTGGATGTCA GTGGTGCGAA GGTGGCTGCA
GGGCGCACCG CGCTGACGGC CCGCGAAGGG GGGGTGGCAT TGCGCCAGTC CACCGTCGAC
AGCGGCGAGC TTGTCGTGAA CACGGCGGGT GATGTGGACG CGCAACAGGC CAGGGTCAAG
GCCGGGCGCT GGACGGTGGA TGCCGACAAT CTGTTCAACC AACAGGCCAC CTGGTCACAG
ACCAATGAGG GTGAGAGTCG CTTCACGCTG GCGGGCACGC TGGATAACAG TGACGGTGCC
ATTGAGACGC AGCGCCTTCT GTTTTCCGCC GGCCAACTGA CCAATCAACG TGGGCGCCTG
GTGGCGCTGG GCGGCGCGGC GCAGCACTGG CAGGTCGGTG GCCTGCTCGA TAATGCCGCC
GGGACGGTGG GCAGCAACGG CGACCTGCGC CTGGATGCCG GGCGCCTCGA GAATCAGAGC
GGTACGGTAA AAACCCAGTC GGGTCTCACG CTCCATGCTG ACGGCGCAGT GAACAATGCC
GGGGGTAACC TGCTGGCCGG GAGCGGCCTG ACGCTCGAGG CGGGCGGCGA CCTGAACAAC
CTGTCCGGTA CCCTGAGTGG CGGCGAGGTT CGGCTGACGG CGCAACAGGT GGATAACGCG
CAGGGGCAGC TCATTGCTCA GGGTAATCTC AACCTGACCG CCAGCCACCT GGACAACCAG
GACGGCCTGA TGGGGGCCGG CAAGGCGCTG GATGTGCATG CCGGCGACTG GGACAACCGC
GGCGGTACGG CGCAAGGCGA AACGGCCGTC ACGGCAACGG CTGGCAACCT CAACAACGAC
GGCGGTAAGC TGCTGTCAGG CCAGGCATCA ACCCTGACGA CCTCGGGCAA CGCCACCAAC
CGCGGCGGCG AAATCAGCGC CGCGGTGCTG ACGGTGAAGT CTGACCGTCT CGACAACACG
CAGGGCAAGG TGATTGGCCA GCAGAGCCTG GAGCTGAATG CTCGCCAGGG ACTGGACAAT
ACGCTGGGGC TGTTGGGCGC CGGTGACGCA CTGACTGTGC GCACCGACGG TGAATTGAAT
AACCGTCGCG GCACGGTGCA AGGGAATGGG CAAACCACCG TGGCGGCACG GGACATCCGC
AACGAGGCCG GCAAGCTGCT GGGCGGGCAA CGGCTCACCC TCACCACCTC GGGGATACTG
GGTAACCACG AGGGGGAAAT CAGCGGCGAA TCGCTCACAC TGGCGGCGCA GCGCCTCGAT
AACACTCAGG GTAAGGTGGT TGCCAAGCAG GATATGAGCC TGACGGCACA GCAAGGGCTG
AGTAACGCCG CCGGCTGGCT TGAAGCCGGC CATGCACTCT CCGTAAAGAC CGGTGGTGAC
TGGGACAACC GTGGCGGCAC CACACAAGGC GGTCATCAGG TGACCGCCAC CGCGCAGTCT
CTCGACAATA CCGGTGGTCG CCTGCAATCT GGCGGTGACC TTCGCTTTGA TACTGCAGGT
GATATTCTCA ACCGAACCGG CAAGCTGACC GCGCAACGCA CACTCGACGT CCGTAGCGGT
GATGCCGCTC TCTTTGACAA CGATGGCGGG TCACTGCAAA GCGGCGGCGA TCTGTCTCTG
CAGGGAGGTC AGTTGACCAA CCGCACCGCC GGTGTGGTCC TCGGCGGTCA GGCGCTTTCC
CTGAGCCTGA CCGGCGGCTG GGACAATCAG GGCGGCACGT TTACCGGTAA GGGACGCGCA
GGGGTGCGCG CCGCTAATCT GTTGAATGCC CGGGGCGTTA TCAACGCGCT GGGCAGCCTG
GACATGCAGT TCACCGGCAA GCTGGATAAC GGCCAGGGGC GGATTTTTAG TCAATCGTCT
CAGGTACTAC AGGCGCAGGA CATTTTCAAC GCCCAGGGCT GGATGGGCAG CCAGGGCGGC
TGGCAGGCCA TCAGCGGCGG CTTTGACAAT ACAGCCGGCA GTGTGCAGAG CCTGCTGGGA
GCACAACTTG CGGCGGACTG GCTGGGTAAC GCTAAAGGCG TGGTGCAGTC GGCGGCGGAC
CTGGTACTGC GCGTCGCACA GGATATCGAT AATCGCGACG GCAAGGTCTC GGCGCAGGGG
CAGTTTGCTG TTACGGGTGC CAAAGACGGC GAACACGCCG GCGCCATCAA TAACGCCGGC
GGTCAGTGGC TGGCCGGAGA GGGGCTGAGT ATCGCCGCCC GCGCACTCGA CAATACGCAG
GGCGGACTGC TCTACAGCCA GAAACAACAG CGCCTTACGC TGAGTGACGC GCTGAACAAC
CGCGATGGGA AAGTGCAAAG TGGTGAGGCG CTTCAACTTG ACGCACAAAC GCTGAACAAT
GCCGGCGGGA CGATAGACGG CCAGCAACAG GTGGCGCTGC GGATTCTCGG CTTACTGGAA
AATACTGGCG GTGCGGTGCG CAGTAATGGC GACCAGCAGG TGTCGGCGGC CGGTATCAAT
AATACGCGCG GCGTATTCAG CAGCCGCGGC GGCATCACGG TGACATCAAA GCTGCTGGAC
AACGCCGGTG GCACGCTCAT CAGTCAGGGC TCGGGCATCT ACCGTATCGA CCAGCTTAAC
AACCAGCACG GCAAGGTGCA CAGCGGTGAG GCGCTCACGC TCGAGGGCGA GTTGGTGAAC
AACCAGGGCG GGCAACTGGT GTCGACCCAG GGGCTGACGC TCAAGACCGG TGTGCTCGAC
AACAGTGGTC AGGGCTCGAT AAGCAGTCAG GCAGCGCTTG ATGTGCGGGC CGATCGCCTG
AACAACCGCG ACGGCGGCCT GATACTGGGC ACTACGCGTA CCGACATCAC CGCCCGTGAT
ATCGACAATA CCGCAGGCCG CCTGCAGAGC AGCGGGCAGA TGACCCTCTT GGGGGTAACG
CAGCTGGACA ACCGCCAGGG GCGTCTCCTG GCCAACGGCA ATCTCGACAT CAATGCCGAC
CGGTCATCGA CCGACTCGCC GCTGGCGCTG CTCAACCAGG GCGGGCGCGT GGAGAGCGCC
GAGCAACTCA CTGTGCATAC GCGTACCCTG GATAACCAGA ACGGTACCCT GCTGGGGCTG
CAGGCGCTGA CACTGTCCGC GCAGCAGGAC TACACCCGCC AGGCCGGTGA CACCGTCAGC
AGTAACGGTA CGGTGACGTT CTCACTCACT GGTGCCTTTA CCAACCTGGC TGACTGGTTG
CTGCCGGGCA ACCTGGTGCT CACTGCGGCC AGCATCACCA ACCCGGCTAC CCTGGTCGGC
AAAACGCTAC AGCTGACAAC CGGGGCCTTG CAAAACACCG GGCGCATTGA GGCTGACAGC
ATGACGCTGA ACGTCGATAC CCTGGACAAC GCCGCGGCGC TGATGGGGGA CGCTATCACC
GTGCGCGGGC GCGTTATCGA CAACCACGGC GCGCCTGCAG TGATGGCGGC GACCCAAAGC
CTGACATTGC ACGCCAGTGA GCGCCTGACC AACCGGGAGG GTGCCCTGCT CTACAGCGGT
GACCGTCTGC ATATGCACAG TGATGACCTG ATTGAAAACC GAGCCAGCTT TATCGAAGCG
GACGGTGACG CCACGATTGA GGCCCGCCGC CTGAACAACC TGCGTGAAGG GCTGGTGATT
GAACGCGCTG CGGAAAAGCG CGACTACAAA TGGCACCGCT ACAACTATTA CTGGCGTTCT
TACGGTGAAG ATGTCAATCC CGATGTCAGT ACCATGGCCC CGACCACTCA GCAGTTGACC
TTCCAGAATG ACGCGGCGGC ACAAACCAAC CGCTATGGTA CCCTGCTGGC CATTGATGCG
GCGGGGAAAC GCGCACAGGT GCGGGTCAAA GACAACACCG GCCAACTGAC CGACCTGTGG
GTCAACTACC TGGCGCTCAA GCCAAATGCC GACGGCAGCT ATGCCATGAC GTTCTATGAA
ACGCATGGGG GCAATCAGTT GGCGACCATC CCGACGCCTT ACCAAAACGG CTTCCACTGG
GAGCACGACT GGACGCAGGT GATGACCTGG GACCCTGAGA AGCACATCGA TATCGCTACC
GCGCCGTTTG TCACCGATTA TAACAACCTG CGCGAACGCA CCGCGACCGG TACAGTGACG
CGCGACAAAC TGGTGAGTGA AGGTATCGGC GCGCGCATTC TGGCCGGCGG CAATATGGTG
CTGCGCATTA CCGGTGCGCT GCTCAATGAC GCCAGCGTCA TTACGGCCAA CGGTAACCTG
ACGCAGGACG GCGGCGGCAG CGTGGACAAC CGCGGTTACT CGGTCAATGA ACGACGCCAG
GAACATATTG TCGACCACTA CGACAGGGCC GAATCGCACT GGTATCCGAC GTTCAATCGG
GACGAAACCA CGGCGCTGGC GACCGTTGAT GGTGTGATCA CCGGCAATGG CATGGTCACC
ATCAACGGGG CCCGCATCAC CAATACCACG GTCAATCAGG CGCAAATCAG CCAGCTTCAG
GCGGCATTAA ACGCCGTGGA TGCCGAACGT GCCGAGCTCG AGCGCAACCC GCTGGCCTTC
ACGGTAGAGG GCTCTACACG TCCTAGCGGT GACACGCAAC TTGCTCCAGG TGAGGCAGTG
ACCCGTCCGG AGGCCACGCC GTCTTCACCG CTGGGACGCC CGCTGCTGCC GTCTGAGCTG
GCGCTGACGC AGTTACAACA CCTGGCCAAT GTGGCCACCG CTATCCCCAA TAACGGTTTG
TTTAGTCAAC ATTCGGCGAC CGGCAGCCCG TTCCTGGTGG TGACCGATGA ACGCTTTACC
CGCCGCGACA ACTTTATCAG CAGTGACTAC ATGCTCGAGC GCGTGGGGTA TGACCCTGCG
CAGGCCCATA AACGCCTGGG GGACGGTTTT TACGAACAAC GCCTGGTGCG CGAGCAGGTG
CTTGCGCTGA CCGGCAAACC GTCCGTCAAG GGCTGGGATG CAATGACGCA GTACCAGCAA
CTCATGAATA ACGGGACCAA AGTCGCCCAG GACTTCCATC TGGTGCCGGG CGTGGCGTTG
ACGCCGGAGC AGATTGCGGC GCTGCAACAG GATATTGTCT GGCTGGTCAG TGAGACGGTA
CAAACGGCCG ACGGCCCGCA AACGGTGTGG GCGCCGAAGG TGTACCTGGC ACAGACCACG
CTGCGCCTGA CCGGCGATGG CGCGGTGATT GGCGGCGACA ACCTGCAACT CTCGGCAAAC
AGCATCACCA ATGCCGGCAA TTTGTTTGCC GACAAGGCCC TGACGGTCGA CGCCGGGCAG
TTCCTGCATC AGGGCGGCGA TATCAAGGCT GGCAGCATCG ATGTGCAGGC CGACAGCCTG
ACTCTGAGCA CCAACCTGCA GGACGCGCTG CGCCAGGCAA CCATGAGCGC AGGCGACATC
CACCTCAGTG GCACCGATAT CACGCTCTCA GGCGCGAAGC TCGACGCCAC CCATGCGCTG
AGCCTGAGTG CGCGTAACGA CCTGGCCATC ACCGCCGCGA AAAGCAGTCA TACTGCCGAC
CTCGAGTTCA TCTCGGGTTC AATGGGCAAC CGCACCCGTG GCGGCACGGA AGCGGCAGGT
TCCCGGATGG CGCACGTCAG CGGCGAATGG CAGCAGGCAC AGGGGAGTGA GCTTAACGCC
GGCGGCAACC TGACGCTCAA TGCCGGGCAC GACGTTCTGC TTACGGGTAG CCAGGCAAAA
GCAGGCGGTC AGCTTGGCGT GCAGGCCGGA GGCAACATCA ATCTCCTTGC AGATAAGACC
ACCAATACCA CTCACCTGGA CGCTAACAGC CGGACGTCAT CAGTCAGCAA CGACCGCCAG
GAAGAGCGCC TGGCGCTCAG CAGCCTGGGC GGTGACCAGA GTGTAACGCT GATAGCAGGC
AATCACCTGC TGGCCGAAGG CGCGCAGATT GACAGCAAAG CGGGTCGCAT TGGACTCAGT
GCGCAGGACG TGACCATTAA AGACGCACGT ACCCGTACGC AAGATCTGGA CAGCGAAAAC
AAACGCGGGG GCAAAACCAA AAGCCATCGC ATCGAGCAAA CCGAACGTGA AATCAGCACG
GGTAGCACCT TCAGCGGACG CGACGGGGTG ACCGTGATCG GTCGCGAAGG TGACGTCACC
GTCACCGGCA GCACCCTGCA CAGTGACCAG GGTGCCATCG CCCTGCAGGC GAAAAAGGAT
GTGACCCTCA ATACGGCCAC CGAGCGAGAG TCGCGTTACA GTGAGGAACG TTCCGAGAAA
AAAGGCTTCC TGAACAAGAG CAGCAGCCAC ACCGTGACTG ACGACCGAAC CACGCGCGAG
AAAGGCACGC TGCTGAGTGG CAACAGCGTC AGCATCAGCG CCGGCAATGA CCTGACGGTC
ACCGGTTCTG CGATTGCCGC CGACCGGGAC GTGGACTTGC AGGCTGGCCA TAACGTTGAC
ATCGGTGCGG CCACAGAGAC CGAGTCCCAC TATCTGCTGG AAGAGAAGAA AAAAAGCGGC
CTGTTGGGCA GCGGTGGCAT CGGCTTCACG ATGGGCAAAC AGTCGAGCAA ACATGAAATC
GACGAAAAGG GCACCACCCA GAGCCAGAGC GTCAGCACGG TAGGCAGCAG CCAGGGCAGC
GTCAACGTCA CAGCGGGCAA CCAACTGCAC ATTGGTGGCG CCGACCTGGT GGCAGGCCAA
GACCTGAATC TTACGGGCGA CAGCGTGACT ATTGACCCGG GCTTTGATGT ACGTACGCGC
AAAGAAACGT TCGAGCAGAA GCAGAGCGGC CTGAGTGTAG CACTGTCGGG CACCGTGGGC
AGCGCACTCA ATACCGCCGT CAGCTCGGCG CAGCAGGCAA GGAAAGAAGG CGATGGGCGC
CTCAGCGCGC TGCAGAATAC CAAGGCGGCG CTGTCCGGTG TGCAGGCCGC GCAAGCCTAT
TCACGTGACA ATGCGCTGAC CGCGTCGGCG GAGGCGAAAA ACGCCGCTGC CGGCCTGAGT
GCAGATGACC CAAAAGCCGC GCAAGGGGCC ACCAACACCG TCGGCGTCAG CGCCTCCTAT
GGCAGCCAGT CGTCGAAGAG CGAAACCCGT ACCGACAGCC GCCAGTCGCA GGGCAGCACG
CTGACTGCCG GACAAAACCT GTCGATAACG GCGACCGGCA AGAACCACAC CGCACAGAGT
GGCGATATCG CCATCACCGG CAGCCAGTTG AAAGCGGGCA AAGACCTGTC ACTCGATGCG
GCGCGGGATA TTAGCCTGCA ATCGGCACAG AACACTGAAA GCACGGTCGG CAAGAACGAG
AGCCGCGGCG GTAACGTCGG TGTGGGTATC GGGGTTGGCT CGGGCGGGTA CGGCATCACC
GTCTCGGCCG GCGTCAATGC GGGTAAAGGG CATGAGAATG GCAACGGCCT GACCCATACT
GAAACCACCC TGGATGCGGG AAGCACCCTC AAGGTGACCA GCGGCCGCGA CACCACATTG
AAAGGTGCGC AGGCGAGCGG AGAAAAAGTC ACCGTTGATG TGGGCCGTGA CCTGACATTG
CAAAGCGAAC AGGACAGCGA CCGTTATGAT GCCAAACAGC AAAACGTCAG CGCGGGTGGC
AGCTTCACCT TCGGTTCGAT GACCGGCTCA GCCAACGTGA GTGCCAGCCA GGACAAGCTC
AAGAGCAATT TCGACAGCGT CAAGGAACAG ACCGGACTGT TTGCCGGCAA AGGTGGGTAT
GACGTCACCG TCAAAAACCA CACCCAGCTC GACGGGGCCG TCATCGCCAG CACCGCGGAC
AAGGAGAAAA ACCGTCTCGA CACCGGCACG CTGGGCTGGA CGGACATCCA CAACCAGGCG
GACTACAGCG CGACGCACAG CGGCGGGTCA TTCAGTACCG GCGGCCCGGT GGGTAAAGAC
CTGCTGACCA ACATGGCCGG TGGCATGCTG TCGGGGGCCA ACAACAGCGG GCATGCCGAG
GGCACGACAA AGGCCGGCGT CAGCGAAGGT ACGCTGATAG TCCGGGATGC CGACAAACAG
CAACAGGATG TTGCACAGCT TAACCGGGAT ACCGAACATG CCAACGACGG CAGTATCAGC
CCGATATTTA ACAAGGAGAA GGAGCAAAAC CGACTCAAAC AAGCCCAGCT GATAGGGGAG
ATTGGCGGCC AGGCGATGGA CGTCATCCGT ACGCAGGGCG ATATCGCCGG GCTGAAGGCG
CAGAAAGACC CGTCCGCGCT GGCGCAGGCC CGGGAACAAC TGGAGAAAAG CGGCAAGCCG
GCAAACGACG CGGCGGTGAT GCAGCGGGCG TATGACAACG CGATGCGGCA ATACGGCACC
GGCAGCGACC TGCAGAAGGC GGCGCAGGCG GTTACGGGGG CGCTGACGGC ACTGGCGGGC
AATAATCTGG CCGGAGCACT GGCGAGTGGC GCATCGCCGT ACCTGGCGAC GGAAATCAAG
AAACGGGTAG GTGAAGACAA CATCGCCGCC AACGCAATGG CGCATGCCGT GCTGGGCGCG
GTGACAGCGC AGTTGAATAA CCAGTCGGCC ACCGCTGGCG GACTGGGCGC CGGAGGCGGG
GAACTGGCGG CGCGCTACAT TGCCGGCCAG CTGTTCCCGG GCAAGATGAC GGAACAACTG
AGCGAAAGCG AGAAACAGCA GGTCAGCGCG CTGAGTCAGT TGGCCGCAGG GCTTGCAGGC
GGTCTGGCAA CGGGGGATAC TGCGGGAGCG GTGACCGGCG GGCAGGCCAG CAAGAATGCG
GTGGATAATA ACTCGCTGAG CGGAGATCAA GCCCGCGAAT CTGTTAAGCA GGTTGCCGGA
AATATGAAGG ATCAGGTCAG GGATAAACTT GGCGAAGGTA CACTCTCTGC TATTGTTAAC
AGCATTATCG GTGCGGCGGC GGATACCGGC GATGCGGTAT TAGGTGGGGC GGATTACGGT
GCTGATGGAG CTATGGCGCT GACTGCCTGC GCTCTGGGAG ACAGCTACTG CGACAAGGCA
TTAAGCGATC TGGCGGGTAA AAATCAGGCT GCGGCAGATA CGCTGAAAGC CCTGATGAAG
AGTGAAACCT GGAAAGCGGT TGCCGGGCAG GTTAAAGAAG CCGCACAAGG TAACCAGCTT
GCTCTGGAAG CCACTGGCGG AATGCTGGCG GGTATGTTCC TGCCAGGTAA GAAACTACCT
GATATTGAGG TTGCTAATAA GTTACCAAGT GGACCGAGTA GCTCAATAGT CCCTGGTGGG
GGATTAGCTG CTCATGAAGC GGCAGGTGGG CACCTGATCG ATAGGCATGT GGGAAAAACA
GAAGCGGAGT TATTGAATAG AGTGTCAACG GGTAATGTTA AATCAGCGTC TTCATTTACA
GATAGGGCCA CTGCTGAAGC AGTCACAAGT AAGGCAATTG ATAGCAATCA GGCTAAGATC
AATAGTTACC TTTCAGGTAG CCAGAAAGGG TATTTAGAGA TTGATTATCA ATCCAATGTA
CCTATTGGTA TTAGTGTCTC TCGCGGTTCT ACAAATGTTT CCTCTGTGAC GAATGCTAGA
ATAATCATTG CAAGAGATCC TTCAATGCCA GCGGGGTATA AAATCATTAC TGGATATCCA
ACGCCATGA
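
The gene sequence is printed in space-separated blocks of 10 bases, six blocks per line, so it has to be joined before any analysis. The sketch below is one possible way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (trailing space included).
first_line = "ATGAACAAAC ATTGTTATCG CCTTATCTTC AGCCGCACCC ACGGGGAGCT GCGGGTGGTG "
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.1%}")   # GC fraction of this one line only
```

Running `gc_fraction` on the full cleaned sequence, rather than a single line, is what would be compared against the recorded GC content.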
 
Protein sequence
MNKHCYRLIF SRTHGELRVV SELARSCSSE PGQRIGSGIT GGSRLWVTVR RSVWLLGLLM 
FAGPVMADGI VADGAANPAQ RPEVINTQNG LPQVNITTPN QAGVSHNQYQ QFDVDAKGAI
LNNSAVMTST QMAGMIQGNP NLNPNAAPAR VILNEVNSNN PSQLRGFMEV AGGKAQVIVA
NPAGIVCNGC GTINAGRMTL TTGKAQLNAD GSVAGYQVER GVVRIEGGGL NGDARHDTEY
VDILARAVEV NAGVWAKEGV SVVAGRNRVS TDGKTATPLS DDGSARPELA IDMGQMGGMY
SGSIRMIGTE AGVGVRNQSG QVRAGKTLTV SSEGKLSWRS DAPDAATQAG GDISLAAKGD
IETHGKVHSG GQLAVQSREG MLTQSGTLAA AGNVHLHAAR GIQSSGHLLA GSDANSQIVR
DASLQLDSQG DIRASGSLLS KKNVNASGRR VDVSGAKVAA GRTALTAREG GVALRQSTVD
SGELVVNTAG DVDAQQARVK AGRWTVDADN LFNQQATWSQ TNEGESRFTL AGTLDNSDGA
IETQRLLFSA GQLTNQRGRL VALGGAAQHW QVGGLLDNAA GTVGSNGDLR LDAGRLENQS
GTVKTQSGLT LHADGAVNNA GGNLLAGSGL TLEAGGDLNN LSGTLSGGEV RLTAQQVDNA
QGQLIAQGNL NLTASHLDNQ DGLMGAGKAL DVHAGDWDNR GGTAQGETAV TATAGNLNND
GGKLLSGQAS TLTTSGNATN RGGEISAAVL TVKSDRLDNT QGKVIGQQSL ELNARQGLDN
TLGLLGAGDA LTVRTDGELN NRRGTVQGNG QTTVAARDIR NEAGKLLGGQ RLTLTTSGIL
GNHEGEISGE SLTLAAQRLD NTQGKVVAKQ DMSLTAQQGL SNAAGWLEAG HALSVKTGGD
WDNRGGTTQG GHQVTATAQS LDNTGGRLQS GGDLRFDTAG DILNRTGKLT AQRTLDVRSG
DAALFDNDGG SLQSGGDLSL QGGQLTNRTA GVVLGGQALS LSLTGGWDNQ GGTFTGKGRA
GVRAANLLNA RGVINALGSL DMQFTGKLDN GQGRIFSQSS QVLQAQDIFN AQGWMGSQGG
WQAISGGFDN TAGSVQSLLG AQLAADWLGN AKGVVQSAAD LVLRVAQDID NRDGKVSAQG
QFAVTGAKDG EHAGAINNAG GQWLAGEGLS IAARALDNTQ GGLLYSQKQQ RLTLSDALNN
RDGKVQSGEA LQLDAQTLNN AGGTIDGQQQ VALRILGLLE NTGGAVRSNG DQQVSAAGIN
NTRGVFSSRG GITVTSKLLD NAGGTLISQG SGIYRIDQLN NQHGKVHSGE ALTLEGELVN
NQGGQLVSTQ GLTLKTGVLD NSGQGSISSQ AALDVRADRL NNRDGGLILG TTRTDITARD
IDNTAGRLQS SGQMTLLGVT QLDNRQGRLL ANGNLDINAD RSSTDSPLAL LNQGGRVESA
EQLTVHTRTL DNQNGTLLGL QALTLSAQQD YTRQAGDTVS SNGTVTFSLT GAFTNLADWL
LPGNLVLTAA SITNPATLVG KTLQLTTGAL QNTGRIEADS MTLNVDTLDN AAALMGDAIT
VRGRVIDNHG APAVMAATQS LTLHASERLT NREGALLYSG DRLHMHSDDL IENRASFIEA
DGDATIEARR LNNLREGLVI ERAAEKRDYK WHRYNYYWRS YGEDVNPDVS TMAPTTQQLT
FQNDAAAQTN RYGTLLAIDA AGKRAQVRVK DNTGQLTDLW VNYLALKPNA DGSYAMTFYE
THGGNQLATI PTPYQNGFHW EHDWTQVMTW DPEKHIDIAT APFVTDYNNL RERTATGTVT
RDKLVSEGIG ARILAGGNMV LRITGALLND ASVITANGNL TQDGGGSVDN RGYSVNERRQ
EHIVDHYDRA ESHWYPTFNR DETTALATVD GVITGNGMVT INGARITNTT VNQAQISQLQ
AALNAVDAER AELERNPLAF TVEGSTRPSG DTQLAPGEAV TRPEATPSSP LGRPLLPSEL
ALTQLQHLAN VATAIPNNGL FSQHSATGSP FLVVTDERFT RRDNFISSDY MLERVGYDPA
QAHKRLGDGF YEQRLVREQV LALTGKPSVK GWDAMTQYQQ LMNNGTKVAQ DFHLVPGVAL
TPEQIAALQQ DIVWLVSETV QTADGPQTVW APKVYLAQTT LRLTGDGAVI GGDNLQLSAN
SITNAGNLFA DKALTVDAGQ FLHQGGDIKA GSIDVQADSL TLSTNLQDAL RQATMSAGDI
HLSGTDITLS GAKLDATHAL SLSARNDLAI TAAKSSHTAD LEFISGSMGN RTRGGTEAAG
SRMAHVSGEW QQAQGSELNA GGNLTLNAGH DVLLTGSQAK AGGQLGVQAG GNINLLADKT
TNTTHLDANS RTSSVSNDRQ EERLALSSLG GDQSVTLIAG NHLLAEGAQI DSKAGRIGLS
AQDVTIKDAR TRTQDLDSEN KRGGKTKSHR IEQTEREIST GSTFSGRDGV TVIGREGDVT
VTGSTLHSDQ GAIALQAKKD VTLNTATERE SRYSEERSEK KGFLNKSSSH TVTDDRTTRE
KGTLLSGNSV SISAGNDLTV TGSAIAADRD VDLQAGHNVD IGAATETESH YLLEEKKKSG
LLGSGGIGFT MGKQSSKHEI DEKGTTQSQS VSTVGSSQGS VNVTAGNQLH IGGADLVAGQ
DLNLTGDSVT IDPGFDVRTR KETFEQKQSG LSVALSGTVG SALNTAVSSA QQARKEGDGR
LSALQNTKAA LSGVQAAQAY SRDNALTASA EAKNAAAGLS ADDPKAAQGA TNTVGVSASY
GSQSSKSETR TDSRQSQGST LTAGQNLSIT ATGKNHTAQS GDIAITGSQL KAGKDLSLDA
ARDISLQSAQ NTESTVGKNE SRGGNVGVGI GVGSGGYGIT VSAGVNAGKG HENGNGLTHT
ETTLDAGSTL KVTSGRDTTL KGAQASGEKV TVDVGRDLTL QSEQDSDRYD AKQQNVSAGG
SFTFGSMTGS ANVSASQDKL KSNFDSVKEQ TGLFAGKGGY DVTVKNHTQL DGAVIASTAD
KEKNRLDTGT LGWTDIHNQA DYSATHSGGS FSTGGPVGKD LLTNMAGGML SGANNSGHAE
GTTKAGVSEG TLIVRDADKQ QQDVAQLNRD TEHANDGSIS PIFNKEKEQN RLKQAQLIGE
IGGQAMDVIR TQGDIAGLKA QKDPSALAQA REQLEKSGKP ANDAAVMQRA YDNAMRQYGT
GSDLQKAAQA VTGALTALAG NNLAGALASG ASPYLATEIK KRVGEDNIAA NAMAHAVLGA
VTAQLNNQSA TAGGLGAGGG ELAARYIAGQ LFPGKMTEQL SESEKQQVSA LSQLAAGLAG
GLATGDTAGA VTGGQASKNA VDNNSLSGDQ ARESVKQVAG NMKDQVRDKL GEGTLSAIVN
SIIGAAADTG DAVLGGADYG ADGAMALTAC ALGDSYCDKA LSDLAGKNQA AADTLKALMK
SETWKAVAGQ VKEAAQGNQL ALEATGGMLA GMFLPGKKLP DIEVANKLPS GPSSSIVPGG
GLAAHEAAGG HLIDRHVGKT EAELLNRVST GNVKSASSFT DRATAEAVTS KAIDSNQAKI
NSYLSGSQKG YLEIDYQSNV PIGISVSRGS TNVSSVTNAR IIIARDPSMP AGYKIITGYP
TP
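
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in which codons may act as starts. Since this gene begins with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as N are not supported.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TTT, TTC, TTA, TTG, TCT, ... GGG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAACAAAC ATTGTTATCG CCTTATCTTC"))  # MNKHCYRLIF
print(translate("ACGCCATGA"))                         # TP
```

The two sample inputs are the first 30 bases and the last 9 bases of the gene sequence; their translations match the first block and the final two residues of the protein sequence.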