Gene Haur_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1999 
Symbol 
ID5733888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2467541 
End bp2479264 
Gene Length11724 bp 
Protein Length3907 aa 
Translation table11 
GC content51% 
IMG OID641279143 
ProductLamG domain-containing protein 
Protein accessionYP_001544770 
Protein GI159898523 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTGGT TGAACACGCT TACGCCATGG CGAATCGTCG GTTTATTAAC GCTGCTTTGT 
CTCGGGTTGG CTGGGCTGGC TCGCTCGCCA CAAGCCTTGC CAACCAGCGC TGATCCCATT
CAGCGAGCCT GGCAATTGGC GCAAATCAGC GGCAGCTATC GCTATAGCAT TGATCTGGCC
CAAACGACCA CCGCCGCACC CAGCCTTGCC AATGCTGGAG CCAATCCGCA ACGCAGCGAG
CGTTTAAGCA TCAATGGCAC GATCGATCAG GCCGCTGATC GCATGGAAAT GACGATTGCT
AACAACGCCG AGCAAAATGT CAGCAATAGT TTCGCCCTGA AAATTGAGCA AGGCCGCAGC
TTTGGGCGTT ATGGCACTGG CAATTGGCAG CCAATCGAGC AATCGAGCGA TCTTTTTGCA
CCAGCTGGCG ATCCCTTGGG CTTTTTAGCA GGGCTTGAGC AGGTAACCAG CCTTGGCAAA
GAAACGCGCA ACGTTGGTAA TTTGCGCTTG GAGTTTCAAC ACTATCAATT TAATTTCAAT
GGCGAGCAAT TCGCCACCTA TTTGCAACCC AAACTCGAAG CCCAATTGCG CCAACGCGGC
GAGTTACCAG TCGGCATGAG CCTCGATACC AGCACCAGCT ACGCCCAAAT GAATGGTACA
GGCGAAGTTT GGCTTGATCA GCATGGCTTG CCCAGCCGTC TGAACCTACA TTTGAATTTG
CCAGCCCAAG CTGATGGCTC AAACGTCAGC GCCACCATTC GCTCCGATTT CTTCGATTTT
GATCAGCGTC GTTTGGCTTT AGCAAGTAGT TCGTTTTGGC AACAGCCTGA TCGTTGGTTG
GGCTATCGCC TTGATCAAAC TCAACGCCAA TTGCAGCCAC TCGCCCAAAT CGCCTTGATT
CCATGTTTAG AAATTTTGCT TTTGGCCTTG GCTGTAGTGT TTTGGCGCAA ACGTCGCACC
CAAACACTGA TGGCTGCCAG TGTGATTAGC GCCATGTTGA TCACGCCCAT GCTGCAGGCC
GAAAATGTTG CTGCCACCCG CAGCAAGCAA CTCGCCAAAC AGGCCGAACA AACTGAACAA
CAAGCCCAAG CCGAGTTACA GCAACAGGCA ATGACCAAAA CTCAAAACCT TTGGCAACCC
AACACCAAAC CAGAAGCCCA AGCGCTTAGT CCTAAACAAC TTGCCAGCAA CGGCAGCGAT
AGCGATGGCG ATGGCTTAAG CGATGCCGAT GAAGACGATT GGTATAGCTG CGCCACGCTT
GGCTCAAGCA CTGGCTATTG TGCTGGCGTG AGCAACGCCA AAGATTACGA TGGCGATGGC
TTGAGCGATG GCGCGGAAGT CAATCAACTT GGCACATTGC CACGCAACAG CGATAGCGAT
GGCGATTCGA TTCCCGACGG TTTGGAAGTG AAGGGGTTTC GCTACCAAGG CCAACAATGG
TATCTCGACC CACTGAGCAA CGATAGTAAT CAAGATGGCT TGCTCGATAG CGTCGATTGT
GGCATTTGGT CGCAAACCTC AAACGCCTAC AATCCCAACG CGATCTGCCC TGATACTGAT
GGCGACGGCC AGCCCGATCT GTTTGATCGC GATAACGATA ACGATGGGAT TAATGATGAT
GTTGATCTTT CGCCCAATTT GCTGAGCAGC GCAGTTTTTG ATGATCAAAA TCCGCTCAAG
TTGCAAATTG ATAACCTCAA AACCAATCGC CCAGTTTTTG TTGATTTTCA GCTGCGGCCA
ACCGATCTTG AGCATTTGAG CTATTCTTCG CGCATTTTGG ATTGGCCTGC CAACGACAAT
CAAGGCCAAA TTCAGCGAGT GCTCGACACG ACCTTTGCCA CCAGCACCAA CAGCGATATT
CGGATCAACG CGATCAACGC CGATTATGGT GATGTGATGA TGGTGCCGAT GCTGGAAATT
ACCATGCCCT ACAGCACTGG TCATTATGCT AACTTGCCAA TTACCACGAC CTACCAAAAT
AATCAGCGCC AACTTGGCGT GGGCGTTGAG CAATGGCTCG ATTCAAGCGA ACTTGATCCA
TATGGCATTA CCGTGCGTGA TCTCGATACG AATTCAGGGA ATTTGCTAGC CTATGTGCCA
GTTGTGCGGG TTAGCAATAG CATGGGTGGT AATCCAGCGG CCTTTGCTGC TCGCATGCTC
TATTATCCCG AACAAGGCAG CAACGGCACG GCCAATTGGG GCGCGGCGCA ACAAGTGCGT
TTGGTTTGGC TGGTGCAGAT GATCACCGAT GAATGTGTTG ATGCTGAAGC CGACCCCAGC
ACCTGCGCCC GCCAAGAAAA CCTGAGTATC GTGCAGATTT ACGACGAAGC GTGGTCGCTG
GCAGGCATGA GCGTCAGCGA AGATCATGGC GCAAAAACCG CAATCATGTA CGAGCAACCA
AGCAGCGACA GCAATTTACA ACTTGACGAC GAGTTGTGGA TGGTTGCCTG GAATCTTAAT
AACACGTTTA TTCGTGGTCG CGATTGCGAT ACACTCAACG GCACCACCTG CCAAGGCAAC
GGCCAACGCG ATGTGCGCAT CAACAATCTG GCCAGCCAAA TCGCAAGCTG GTCGAGCAAC
AGCAGCAACA TAGCAGTGCA AACCAGCAAT TTCTTGCACG AAGGCTATCT CTCGCAAATA
TCGATGCAGC AAATTAGCGA CCTGCTTGAA ACGCAATTTA ATAGTTATAG CAGCCAAACC
AACCCAAGCT TGCTGATTGC CCAAGAAAAA AGCAATCGTA GCCTAAATTT ACAAGATGGC
GCTGGTTTAA GTAATGGCCT AGCTCGCTTT GATCTCAACC CCAGCACGAT TAAATTAACC
ACGATTGCGG GAATGAGTTG GGCTACCTAT CAATTTATTA ATGGTGCTTG GCAAAATTAT
GCAATTGATG ATTATCTGAC CTTGTTGGAA ACCACGCTGC GCACCAACCA AACCTTTATT
CAACATGAAG TTGGCGAAGA ATTATCAGGT AAATTAATTT GGGCACAGCT GTTTTATGCC
AGCATTAATC AAGGCTTTTT GGGGCTGGCT GAAATCGATA GCGTGCCCGT TTGGAAACCA
AGCGCCGATG CCCTAACCGA ACAGGATTAT CAAGCAACCT GGTTTGTGAA CCCTAGCCCA
AGCGGTGGGC ATGGCTTTGC GGCAACGGCG AATCTCTACG CTGAACAATT GCAAAAAGTC
TATAAAAATC TGCGCAAAGC CCAATCAACT TCGAATAAAT GGGCCAAATT CGACCGCACG
CTCAGCCACG ACAAGATCGA CCCGACAAGC TTGCAAGGCC TCAAAACAGT TCGTGCCGCC
ACCAATTATG CGATTGGGGT TTCGATTGGC TTGACTTTGC TGGGCGGAAC CTTGCTGTTA
ATTGGCTTGA CCACCAATAA CGCCGCACTA ATCAAAACCA GCCAAATTAT CCTGACCGCA
GCATCAATGT TGCTGGTGGT TACCCGCATC ACCATGATCG TCAGCCAAAT GACGGCGGTT
GTGCGGGCTG GTTCAACATT AATTTCCACG TTGAGCATGC TGCAATCAGT GGCCAAGGCC
AATCGTTCGC TGGGCACAGT TAATTTGGTG ATTGCCCTGA CCATGATTTG GGGGGTGTTT
ATCTTCCAAC TGGCCAGCGG CCAATTTGGC TGGGGTTCGC ATAGCGCCAA TTTGGCCTTA
GCGCAGGCAA TCGCTGCCAC AATTATGATT TTTGTGATGC TGGTAATTGC CTTCATTCCG
ATTATCGGGC CGTTGATCGT GGCAATTATT TCGTTGGTCG ATTTGATCTT GCGGCTGTTC
AATATTCGTG GCTTTAGCGA TTGGATGAGC CAATTTATCG CCGACCAACT CTACGAAGCC
AATAACCTGC TCGATAACCT GAGCGATCCC AATCGGCTCA ACATCACCAG CAAAACGATC
GATTTGCTGT ATCCTGAATT GGGTTTTGTA CGCTCGAATG CTTTGAGCAT GACCTTCGCC
GTCACTAATA CACTCAAATA TAACGACCAT TTCGACGTAG GCGATGCTCG CCGTTCGACC
TTCCGTTATT GGTTGCAACG CAGCGCCACC GATCAACACG CAGCGCTTGG GCAAAATCAA
ATGCGCAACG AATGGGATGC GATCGGCAAT CGCCAGATTC AAACCAGCGC CGAGATTAGC
CTGCCAGAAG CCTTGCCCTT CAGCGATGTT GGCACAGGCA TCAATCGCTC ACTTGGTCAG
CAGTTATTTA TTACCGAAGC TTCGGTCGCG CCCTATGAAG GCTGTTGGTT GGTCGCAACG
ATTGAAGTCG ATTGCACATG GTATGACACC AAGGGTTCGA GCCATTCCAA CATTGGCGAA
TTTCAATATT TCGATATTTT GCCTGATACG ATCGAGCAAT TTGCCAACCT CGATTGGAAC
ATGAACCCCG ATCTGCGCTT TAACCAAATT AAAGATGCTG ATGGCGATGG CTTGATCAGC
CAAGCCTTGG GTGGCGTTGA CCCCGATGAT ACGCGCTTCG ATAGCGACAA CGATGGCTTA
TCAGATTATG TTGAGCTGAG CAACGGTACA AATCCCCAGA ATGGCGACAG CGATCAAGAC
CAATTGAGCG ATTTTGAGGA GCATGCCTAC CACACCAACC CGCTCAACCG CGATAGCGAT
AGCGATGGCC TGGTTGATTA TGTTGAGTTC AAACAGGGTT GGCTAGTCGC CTACAGCGAT
CTTAGTAGCA CTCAGACCAA ACTCGCGCGG ATATGGTCGG GCAATAGCAG CGATGCCGAT
AACGATTCGC TCAGCGATTT AGAAGAATAT ACCTTTGGCT TCAACCCGTG GGTCGCCACC
GATCCTTCGT TAATTGATAA TTTAGTCGAG ATCGAACGAC TCAGCGTCAA CGAGCAAAAC
GCCCCGCGTT TGTTGGCGCA ATTTGGCGAG CGCGAACAAA GCCAAGCCTT CGCCGATAGC
TCAGGCGCAA ATCACACGCT GCGTTGTAGC CCAAGTCAAT GCCCAAGTTT GGCGATTGGG
CGCTATGCCA ATGGAGTTAA ATTCGATGGC ACCAACGATC AACTCAATAG CAATGTTGGT
ATGCTCGAAT TAGCCCAAGC CAGCTTTACT TTGGCTGCAT GGGTCAAAAC GACCAGCACC
AAACAGGCGA TCATCACTAA GAACGATGGC GATAGCACGT GGGAACGCGG CGAAAAATCG
TTCTACATTA ATGCCAACGG CCAACCAACC TTCGTCGGTT TTGGCAACAA CTACATCACC
AGCAACCAAA GTGTCAACGA CGATCGCTGG CATCATGTGG CAGTCGAATG GGATTACCCC
AATTTGACGG GCACGATCTA CATCGATGGC ATTGACCGCA CCAATCGAAC CAGCACTAAT
TACCGCACTA ACAACCGCGA TAACGCCAAC GATACATTAA AAGTTGGCCG CAACAACTCC
AACAGCAACG AAGCTGCCAA TAATTTCAAA GGCCTGCTCG ATGAAGTTGC GGTGTTTGAT
CGCACGCTTG GCACGGCCCA AATCAATGAT CTGATGCAAG GGCGCTACAA TCCAAATGAT
CGCTTATTGG CTCCAGGCGC AGCGCTGAGT TATCAAGCCA CGATTTCCAA CACCTTGCCC
GCCCAAAGCG TCAATGGCCA ACTGAGCGCC AGCAACCAAA CCAGCGATCC TGATCTGGCG
AACCCAAATA TTGCTTTGCG CATGGAAACG CTTGATCGCC AACAAAGCTA TCGCAATAGT
GCTGGCTCGA ACGAAACTGC CTATTGTCTT GGTCAGCGCT GCCCCACCAG CGAATTGCTC
GGCGGTCGCG ATTATGCCGT GCGCTTCGAT GGGATCGACG ACCAATTGAG CATTCCGATG
CAGCTTGATG CTGACGTGGC AACTGGCGAG ATGCTGCGTT CGCTCAAATT CTCAATCAAA
CTTGAGCAAT TGCCTGCCTC TGGCCAAACT GCCACAATTT ATAGCAGCGT TGTCAGCGCA
ACCAGCGATT TACAAATTGT GATTAATAGT GCTGGCAATT TGGTTATCAG CGTTGGTGGC
AGTACCAGCA GCAGTTATTA CAACGGCAGC ACCACCGATA GCACATGGCG CAAACCGCAC
TATAGCGTTT ATTCGTTCGC TAATAATCTT AATATGTGGG TCGAAGTGCA GCTTGATTAT
CGCAATTTCA GCGATACTAC AGGCCGCACA ACGCTTTCGA TCAACGGAAC CCAAGATAGT
CGGCTAGATT ATAGTTATTG GCCACGCTTG AATATTGGCA ATGCAAGGTT GGGGGCGAAC
GCCAACAGCA AGGCATTCCA AGGCAGTTTA GGCTATCTCT GGGCAACCAA CGAAAAGAAT
GTGCGCTTGC TCGACCTCAA TTTCAACGAA GATTATGGCT ATACTGGCAA TTACTATCAC
AACACGGCGG GCAACCAACA AGCGCTCAGT TGTGGCACGA GCAGCACTTG TCCGAGCTAT
CAACTGCAAG GCTACAACGG CCAAGCGATT CGGCTTGATG GCAACGACGA TTATCTCAAT
TTGCCCAATA GTGCCAGCTT TACGCCTGGC GAGCGCACAT TGCGTATGGC GGTCAAACTT
GAGCAACTAC CAGCCAGCGG CCAAGTTGTC AGCTTGCTCG ATACGGTTTG TGCCAACAGC
AACAGCTATT GTTTGGACCT GACGATTAAT TCCAGCGGCC AGTTGATTAT GCACATTTCG
CCAGGCACGA CCTTGACTTC AAGCACGATT TTCAGCCAAG CCAACCTTAA CACTTGGGTG
CAACTTGAGT TCGATTTTGT GATCGCCTTC AACTCCAGTA GCCGTTCTAG TGTGATGGTC
TATGCCAACG GCAGCTTAGT GCTGATCGAC AGTCAATTTA GCATTGGCCC ATTGGAGCTA
GCGCCCAGCC GCATCGGACG TTCGGTCGCT AATACCAATC CGCTCAAAGC TTCGATCGAT
GATCTGTTTA TTCAAGGCAG CTATAACCTA AGTTTTGATG CGCCGCCGTT TGATAGCGAA
ATGCGCAATC ATGTCAATCA AGCTCGGGTT GCTAGCTGTG AGTTTGTGTT AACTTGCCCA
GCATTTGAGC AAAATGGCCG CTACGATCAG GCGCTTAGTT TCGATGGCCT CAACGATAGC
TTGCTGCTTG ATCGCAGCAT CAGCGAAGAT TTCAGCATTA GTTTCTGGCT GCGTAGCAAC
CAAACCAGTG GCGCGGCCAG CAATTGGTGG CAAGGCACAG GCTTAATCGA TGCCAATGTG
CTGAGTCCAA GCAACGACTT CGGCATCAGC CTTGGCAACA ATGGTCAAGT GATGTTTGGC
CTCGGCAATA GCAACGGCAC AAGTACCACG ATCAAAACAA GAGCAATCAA CGATAATCAA
TGGCATCATG TGGTTGCCAC GCGTCACAAA CAAACTGGCG CAATCAAGCT TTATCTCGAT
GGAACCTTGG CAATCAGCGG CACTGGGCAT CTCAATCGGC TCGATATTCC CCAACTGCGG
ATTGGTGCAG CCCTGAATAA TAGCAACTTC TACCAAGGCC AGCTTGATGA ATTAGTGATT
ATTCCAGCGG TGATCGAGCG GGATGCGGTG CAATTCTTGA TGCAAAGCAC CTATCCAGCG
ATTAAGATCG ATGCAGATTT TGCGCCGTTC CACTTGCCAG CTCAAACCAG CATGGTTGTC
AGCAGCACTG CTCAGGTTAA TCCCAACATT ATCAGCAGCC AACAGCGCTT TGAGCAAGAA
GCCGAAGCCG CAATTGCCTT ACAACAGAAT ATTGGCTATC CCGTTACCGA CCTCAACGCC
AACAAACTGC CAATCTTCTT GCCCTTTGAA GATGTGCCAG GCTCGCAAAA CTTCAAAAAT
GTTGGCAATG TCAGCTATTG GTCGAATTTG GGCTACGATA AAACCAACCT CGAATGTAGC
AATCCCGATT GTCCAAGCGC TGGCTTGCGT GGGTATGTTG AGCGAGCCGC CTATTTCGAT
GGCAACGGCG ATCATTTGCG TTTCAGCGAC CAATGTTGTA GCCCTGAAGC CACAACCATC
GCGGCATGGG TCAATGGCAA CAGCGGCACA ATTGCCGATT TGCGCAACCC CAACAACGCC
ACTGGTCTAC GTCTAGCCTA TGATAACTTC TTGATCACCA TCGATCTTGA TGGCTCAACC
GCTAGCGGTG GCAATGCGAG CTTTGTCGTG CCATTCGAGT TGCCAGAAGG CCAGTGGAGC
CATGTCGTCG CCAATTACGA TAAAACCAAC CAACTGGCAA CGGTCTATAT CAACGGTCAG
TTGGCCGCCA GCACCGCCTT GCCTGGAGCC AACGGAACCC ACGCTAGCCT GAGTTCGCAG
TTTCCAACGA TTGGCGCAAA TCAAGGCGAT GGTGGCGATT TCTATCGGGG CTACTTGGAT
GATCTGCGGA TTTATCAGGT AGTCTTGAGC GCGAGCCAAA TTCAAAGCTT ATACAAAACG
TCAGCACCAA TCTTGAAATT TGAATTCGAC GAGGAGCAAA ACACCAGCGA GTTTCACGAT
CGTTCGCAAG CTGGCTATTT GGGCAAACCA ATCACGACCC AATGTGCCAC GCTCGATCTT
GAGAGCATTC GCGCGAATCA ATTGGCTACC AGTCCGAGCA CCCTGTACGT CGCCAAAGCC
AACGATCGGC TAGCCAGCAT TCGCTTAAAT AGCCAAGTTA CCCAACCGCT CTCTGCCAGC
AGCGTGCTCT GTGCAGGCGA TCAACTGAGC GTTGGTTTGA TCAACAGCAA TGGCAGCACC
AGCCTGATCG CCAATAAAAC GATTGATGTA GCCGCTATTG GCAACGGTAG CGCCATTTTC
CAACAGGGCA GCAATCGCAT CACCCTGCGC TGGGCGATCG ATGATCAACT CTTGTATCGC
CACAATCCAG CACCTGGCAC CGATGGCAAA ATTGGCCGCA CCGTGTTGTT CGATGGCAAA
GGCGCAATCC AAGTTGATCA AGCCAATGCA ATCAACAATT TACAAAATCA ATTCACAATT
TTGGCATGGG TCAAACCCGA TACTATCACC GACACCACTT CGTTGCAACG CTTTATTGCC
GCAGGTCGCG ATAACTCAGT CAATGGTTTT GGCTTTGGCT TGAGCGGCTC AGCCTTGAAT
TTTGGCATCT TTGGCGGCTT CAAATATAGC AGCAGCGCCA CGGTTGCTCC ACGGGTTTGG
CAGCAAGTCG CGGTCGTATT CGATGCCAGC AACGATGCCA AGTTCTACCT CGATGGCGAA
TATATCGACA CCATCGCAGG CAGCAGCGCC GTACCCGCCA ACAACGACGA CCCGCTGTTT
ATCGGCGCAA GCACCGACCA ACAAGGCTTG TTCAACGATC AATTCCGCGG CCAACTCGAT
GAACTTACAA TCTATCAACG CGAATTGAGC ACTGCCGAAA TCTACAACCT CTATTTGCGC
GATTTACGCT GGTATCGAGC GCGATCAACC AGCTATCTGA CGATTGATAA CGATCAGCCG
AGCGTTAGTT TGTTGGCAAG CAATCATTAT CGAGCCAATG CGCCATTCCA ACTCGCGGTT
TCGGCTACCG ACCCCAGCTC AAGCATCCGC TTGGTCGATA TGGGCGTGCG TGGGCCACAA
GCCAGCAGCT ACGAATGGAC AAGCGTCGCC GTATGTGCCG AGGCCCAAGC CAACAACGCT
GCTTGGTGTC CAGTGATTGA TCCTAGCAAA TTTGGCGGAG CAGGAAGCTA TCAAGTAATC
TTCCGCGCTG TTGATGCAGT TGGCCATGAA ACCACCAGCG CAGCAGCAAC TCTGTATGTC
GATGGCGCTG GGCCAATCGC AACCCTCGAG CACAACCAAA ACTGGCGTGC ATTACAGCCT
GTTGCCAACC ATGATCTGCA ATGGACGCTG ACATTATCGG GAACGATCGG CGATCCACAG
CTTGGCAATG GCATCGCAGG CAGCGGCCTT GAGCAAACCA AGGTGTTGAT TGGCCTGTTC
GATCAACAAA ATCGTTTGAT TGGCAGCGAG AGTTGGCAAC AAGCGCAGGT TAATGGCGAA
CGCTGGTCGA TCAATTATCA AATTAGTGGC AGTCGGCCAA GCGGCAGCTA TCGGGTCGAA
ATGAGTGCGG TTGATCAGGT AGGCAACCAA TTAACCACCG CCTTGCGAGC AAACCAAGCG
CAGCAACAAA GCTTACTCCT CGATGGGCGA CCACCAAGCG TCGATCTGTT CCAAGCGATC
GCAGCAACAC CGTTAATTAG CGAAACATGG CAGATTAGCG GTACACTCAA CGAGTTGCCA
AGCTGGCATG GAGCCGTCGC CAGCTATCAC TTTGAAACAG CCGCAGCACG CAACGATAGC
AGCGGCAATA ACTATCATGC AAGCTGTAAT GCCTGCCCCA GCAGTGCAAC TGGCCAATTT
GGCATGGCCG CCAGTTTCAA CCCAGCCAAC CAACAACGCC TGACCGTGGC GACAACCGCA
AGTTTGAACC TTAGTCAAGC CAGCTTCAGC GCCTGGATCA AGCCGAATTG GAATAGCAGC
ACTGGCTCGG CTTATAGCAT TTTGGCCTTG GCTGATAGCA GCAACACCCG CTACAACTGG
CAAATTGCCA GCGATTATCG TAGCATTCGT CTGTTCAATG GCAGCACGAC TAACAGCATC
AGCGCTACGA TCACGCCAAA TCAATGGCAG CATGTCGCCT TGGTACAAAC TGGCAGCGAA
TGGACGGCCT ATTTGAATGG CACAAATTTA GGCAGCGTTG AACAAAGCTT TGGCACTACC
AGCAATCTGC CGTTACAGAT TGGCGCGGCC AAGCCAAACA GCGGCTTCTT CAATGGTCAA
CTTGACGAAG TGCAAATCTA TGAGCGAGCC TTGTCGGCGC GTGAAATCTA TGCTTTGGCT
CAAAGCGAGC ATGCAGGCGT GAACCAAGCC CAAATCTGGC TTGAAGCCTA TCATTTTGAT
GGCAGCCCTA GCAGCGAAAT TTGGCAATCA GCGCCAATTC AAGCGCAGGG CACGGATCTG
GCAACCTGGA GCTACAACCC ACCAGCCGCT AGCGAAGGCT TTTATCAATT GCATGTTCGC
GGCAACGATG CTTTTGCCAA CACCAACGAC GAGCGAATCA TCTGGCGTGG CAGCATCGAC
ACCCAAGCCC CACGGGTCAG CATCAATGCA ACCCAAGGCG GCAGCGGTGC AAATAGCTAC
ACCGATTATG TCATCAGCGC CGAGGATTTA TTTATTGATG AAAACTCGTT ACTCTCGCCA
TGCGCTGGCA GCCCAATCAG CTATGGCTAC TACCCAAATC CAGCGCGAAT CAATCGGCTC
AGCGTCAGTT GTCGAGTCAA TGGGCATAGC CAAAGCATAG TGATCGTCAA AGCCTGTGAT
TATGCGGGCC ACTGTAGCAC CAGCAGCGTT GATCCCTTGC CAACACCCAC GCCAACCGCG
ACCGCAATTC CATCGGCCAC ACCGATTGCT AGCGCAACAC CAACCGCCAG TGCAACGATC
ATTCCATCGG CTACGCCAAG CGCGACGATT GTCCCTTCGG TAACTGCCAC TGGCGTTCCA
TCGGTCACAC CAACGGCAAT TGCGACTGCC AGCACCACAG CCATTGCTTC GCTAACCGCA
ACCGCAACGG CAACGCTGAC GGCTACACCT TCAGCAAGCC CAACCGCGAC GGCGACGGCA
ACCTCAACAA CTACACCAAC CGCGACGGCG ACGCTGACGA TTACACCAAC CAAAACCGCG
ACGGCAACGG CAGTTCCTTC GATCACACCA TCGCCGACCC CAACCATCGT GCGCTGGTAT
TTCCCTTGGG TGACGGTTCA ATAA
 
Protein sequence
MRWLNTLTPW RIVGLLTLLC LGLAGLARSP QALPTSADPI QRAWQLAQIS GSYRYSIDLA 
QTTTAAPSLA NAGANPQRSE RLSINGTIDQ AADRMEMTIA NNAEQNVSNS FALKIEQGRS
FGRYGTGNWQ PIEQSSDLFA PAGDPLGFLA GLEQVTSLGK ETRNVGNLRL EFQHYQFNFN
GEQFATYLQP KLEAQLRQRG ELPVGMSLDT STSYAQMNGT GEVWLDQHGL PSRLNLHLNL
PAQADGSNVS ATIRSDFFDF DQRRLALASS SFWQQPDRWL GYRLDQTQRQ LQPLAQIALI
PCLEILLLAL AVVFWRKRRT QTLMAASVIS AMLITPMLQA ENVAATRSKQ LAKQAEQTEQ
QAQAELQQQA MTKTQNLWQP NTKPEAQALS PKQLASNGSD SDGDGLSDAD EDDWYSCATL
GSSTGYCAGV SNAKDYDGDG LSDGAEVNQL GTLPRNSDSD GDSIPDGLEV KGFRYQGQQW
YLDPLSNDSN QDGLLDSVDC GIWSQTSNAY NPNAICPDTD GDGQPDLFDR DNDNDGINDD
VDLSPNLLSS AVFDDQNPLK LQIDNLKTNR PVFVDFQLRP TDLEHLSYSS RILDWPANDN
QGQIQRVLDT TFATSTNSDI RINAINADYG DVMMVPMLEI TMPYSTGHYA NLPITTTYQN
NQRQLGVGVE QWLDSSELDP YGITVRDLDT NSGNLLAYVP VVRVSNSMGG NPAAFAARML
YYPEQGSNGT ANWGAAQQVR LVWLVQMITD ECVDAEADPS TCARQENLSI VQIYDEAWSL
AGMSVSEDHG AKTAIMYEQP SSDSNLQLDD ELWMVAWNLN NTFIRGRDCD TLNGTTCQGN
GQRDVRINNL ASQIASWSSN SSNIAVQTSN FLHEGYLSQI SMQQISDLLE TQFNSYSSQT
NPSLLIAQEK SNRSLNLQDG AGLSNGLARF DLNPSTIKLT TIAGMSWATY QFINGAWQNY
AIDDYLTLLE TTLRTNQTFI QHEVGEELSG KLIWAQLFYA SINQGFLGLA EIDSVPVWKP
SADALTEQDY QATWFVNPSP SGGHGFAATA NLYAEQLQKV YKNLRKAQST SNKWAKFDRT
LSHDKIDPTS LQGLKTVRAA TNYAIGVSIG LTLLGGTLLL IGLTTNNAAL IKTSQIILTA
ASMLLVVTRI TMIVSQMTAV VRAGSTLIST LSMLQSVAKA NRSLGTVNLV IALTMIWGVF
IFQLASGQFG WGSHSANLAL AQAIAATIMI FVMLVIAFIP IIGPLIVAII SLVDLILRLF
NIRGFSDWMS QFIADQLYEA NNLLDNLSDP NRLNITSKTI DLLYPELGFV RSNALSMTFA
VTNTLKYNDH FDVGDARRST FRYWLQRSAT DQHAALGQNQ MRNEWDAIGN RQIQTSAEIS
LPEALPFSDV GTGINRSLGQ QLFITEASVA PYEGCWLVAT IEVDCTWYDT KGSSHSNIGE
FQYFDILPDT IEQFANLDWN MNPDLRFNQI KDADGDGLIS QALGGVDPDD TRFDSDNDGL
SDYVELSNGT NPQNGDSDQD QLSDFEEHAY HTNPLNRDSD SDGLVDYVEF KQGWLVAYSD
LSSTQTKLAR IWSGNSSDAD NDSLSDLEEY TFGFNPWVAT DPSLIDNLVE IERLSVNEQN
APRLLAQFGE REQSQAFADS SGANHTLRCS PSQCPSLAIG RYANGVKFDG TNDQLNSNVG
MLELAQASFT LAAWVKTTST KQAIITKNDG DSTWERGEKS FYINANGQPT FVGFGNNYIT
SNQSVNDDRW HHVAVEWDYP NLTGTIYIDG IDRTNRTSTN YRTNNRDNAN DTLKVGRNNS
NSNEAANNFK GLLDEVAVFD RTLGTAQIND LMQGRYNPND RLLAPGAALS YQATISNTLP
AQSVNGQLSA SNQTSDPDLA NPNIALRMET LDRQQSYRNS AGSNETAYCL GQRCPTSELL
GGRDYAVRFD GIDDQLSIPM QLDADVATGE MLRSLKFSIK LEQLPASGQT ATIYSSVVSA
TSDLQIVINS AGNLVISVGG STSSSYYNGS TTDSTWRKPH YSVYSFANNL NMWVEVQLDY
RNFSDTTGRT TLSINGTQDS RLDYSYWPRL NIGNARLGAN ANSKAFQGSL GYLWATNEKN
VRLLDLNFNE DYGYTGNYYH NTAGNQQALS CGTSSTCPSY QLQGYNGQAI RLDGNDDYLN
LPNSASFTPG ERTLRMAVKL EQLPASGQVV SLLDTVCANS NSYCLDLTIN SSGQLIMHIS
PGTTLTSSTI FSQANLNTWV QLEFDFVIAF NSSSRSSVMV YANGSLVLID SQFSIGPLEL
APSRIGRSVA NTNPLKASID DLFIQGSYNL SFDAPPFDSE MRNHVNQARV ASCEFVLTCP
AFEQNGRYDQ ALSFDGLNDS LLLDRSISED FSISFWLRSN QTSGAASNWW QGTGLIDANV
LSPSNDFGIS LGNNGQVMFG LGNSNGTSTT IKTRAINDNQ WHHVVATRHK QTGAIKLYLD
GTLAISGTGH LNRLDIPQLR IGAALNNSNF YQGQLDELVI IPAVIERDAV QFLMQSTYPA
IKIDADFAPF HLPAQTSMVV SSTAQVNPNI ISSQQRFEQE AEAAIALQQN IGYPVTDLNA
NKLPIFLPFE DVPGSQNFKN VGNVSYWSNL GYDKTNLECS NPDCPSAGLR GYVERAAYFD
GNGDHLRFSD QCCSPEATTI AAWVNGNSGT IADLRNPNNA TGLRLAYDNF LITIDLDGST
ASGGNASFVV PFELPEGQWS HVVANYDKTN QLATVYINGQ LAASTALPGA NGTHASLSSQ
FPTIGANQGD GGDFYRGYLD DLRIYQVVLS ASQIQSLYKT SAPILKFEFD EEQNTSEFHD
RSQAGYLGKP ITTQCATLDL ESIRANQLAT SPSTLYVAKA NDRLASIRLN SQVTQPLSAS
SVLCAGDQLS VGLINSNGST SLIANKTIDV AAIGNGSAIF QQGSNRITLR WAIDDQLLYR
HNPAPGTDGK IGRTVLFDGK GAIQVDQANA INNLQNQFTI LAWVKPDTIT DTTSLQRFIA
AGRDNSVNGF GFGLSGSALN FGIFGGFKYS SSATVAPRVW QQVAVVFDAS NDAKFYLDGE
YIDTIAGSSA VPANNDDPLF IGASTDQQGL FNDQFRGQLD ELTIYQRELS TAEIYNLYLR
DLRWYRARST SYLTIDNDQP SVSLLASNHY RANAPFQLAV SATDPSSSIR LVDMGVRGPQ
ASSYEWTSVA VCAEAQANNA AWCPVIDPSK FGGAGSYQVI FRAVDAVGHE TTSAAATLYV
DGAGPIATLE HNQNWRALQP VANHDLQWTL TLSGTIGDPQ LGNGIAGSGL EQTKVLIGLF
DQQNRLIGSE SWQQAQVNGE RWSINYQISG SRPSGSYRVE MSAVDQVGNQ LTTALRANQA
QQQSLLLDGR PPSVDLFQAI AATPLISETW QISGTLNELP SWHGAVASYH FETAAARNDS
SGNNYHASCN ACPSSATGQF GMAASFNPAN QQRLTVATTA SLNLSQASFS AWIKPNWNSS
TGSAYSILAL ADSSNTRYNW QIASDYRSIR LFNGSTTNSI SATITPNQWQ HVALVQTGSE
WTAYLNGTNL GSVEQSFGTT SNLPLQIGAA KPNSGFFNGQ LDEVQIYERA LSAREIYALA
QSEHAGVNQA QIWLEAYHFD GSPSSEIWQS APIQAQGTDL ATWSYNPPAA SEGFYQLHVR
GNDAFANTND ERIIWRGSID TQAPRVSINA TQGGSGANSY TDYVISAEDL FIDENSLLSP
CAGSPISYGY YPNPARINRL SVSCRVNGHS QSIVIVKACD YAGHCSTSSV DPLPTPTPTA
TAIPSATPIA SATPTASATI IPSATPSATI VPSVTATGVP SVTPTAIATA STTAIASLTA
TATATLTATP SASPTATATA TSTTTPTATA TLTITPTKTA TATAVPSITP SPTPTIVRWY
FPWVTVQ