Gene Ent638_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_0053 
Symbol 
ID5113156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp57040 
End bp68943 
Gene Length11904 bp 
Protein Length3967 aa 
Translation table11 
GC content59% 
IMG OID640490209 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001174794 
Protein GI146309720 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.927925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACCC GTCACCCACC CGTTCGTTTC TCTCAGCGTC TGATTAGCTG GGTCGTCTGC 
GGCTTAATGA TCTGGCAGCC GGTTGCGCCT GCCTTTGCGG CGGTGATGAC GCCGACCGGC
AACACCACTA TGGATAAGGC CGGAAACGGC GTGCCGGTGG TGAATATCGC CACGCCGAAC
GGGGCGGGGA TTTCCCACAA CCAGTTCGAC AGCTACAACG TCGGAAAAGA GGGCGTCATC
CTCAATAACG CCACCGATCG CCTGACCCAA ACCCAGCTCG GTGGGCTGAT CCAGAACAAC
CCCAATTTGC AGGCGGGACG CGAAGCGAAA GGGATCATTA ACGAAGTGAC GGGCGCAAAT
CGCTCGCAAC TTCAGGGGTA TACCGAAGTC GGCGGAAAAG CGGCGAACGT GATGGTCGCC
AACCCCTATG GAATTACGTG TAACGGCTGT GGATTCATCA ATACGCCGAA TGTGACCTTA
ACCACCGGTA AGCCGCAGTT CGATGCGAGC GGCAATCTGA TGGCGCTGGA CGTCACCAAA
GGGGCGATCA CAGTTGAAGG CCAGGGACTG GATGCCAGCA AAAGCGACGC GCTGTCGATT
ATTTCCCGTG CAACCGAGGT CAATGCGGCG ATTCACGCCA AAGATCTGAC CGTCATTGCC
GGGGCCAATC GCGTAGGCGC GGATGGCAGC GTTAAAGCGA TTGCCGGACA GAACGCGACC
CCGACCGTGG CGGTCGATAC CGGTGCGCTG GGCGGGATGT ACGCGAACCG AATTCGTCTG
GTATCCAGCG ATACAGGCGT GGGCGTCAAC CTCGGGAATC TTAACGCGCG GCAGGGCGAT
ATTCAGCTCG ATGCCAGCGG TAAACTGACG GTCACCAACA GCCTTGCCAG CGGATCGATT
ACCGCGAAAG GCGCAGGCGT GACGCTGAAC GGCAGTCATC AGGCAGGCGG ATCGCTGAAT
GTCGCCAGTA CGCAGGATAT GACCCTGAAT AACAGCTTGC TCACCAGCGG CGGCGAGATG
CGCCTTGCCA GTGACGGCAA ACTTCAGGCA ACCGGCGGGG GAGCGAACAG CAAAGGCGCG
TTGACGGTAA ACAGCGGTCA GGATATGACG CAGACCAATA CAAAGCTGGT GGGCCAGGGA
AATACCACGC TCAACAGCAA CGGTAAGCTC ACTATCAACG GCGGGGGCGT GTCGGGCAAC
GGCGATCTCA ATGTTGTCAG CAGCAAGGAT ATGACCATCG CTAACGCGGC AGTGGGCAGC
AATGCCAACG CCACCTTAAC CAGCGGCGGC ACGCTCACGG CGACCGCAGG GGCGATCTCG
GCGGGTAAAA CATTGACGCT CAAAGGGCAA CAGCTCGCGC TGAACGACCG CAGCCGTGCG
GATGCCACCG GAAACATTCG CCTTGAGGGT GAACGTCTGA CAAACCAGGG GCAAATCAAC
GCCGCTGGCA ATCTCACGAT CGCCGCCAAT CAGGCGGCCA ACAGCGGACA GATGGCGGCA
AAAGGTCGCG TTGATGCGCA AACGGGCGAG CTGATTAACA GCGGTTCATT GCAGGGCAAC
GGCGTCACGC TGAAAAGCCA AACACTCGCC AACAGCGGGA CGTTGCAAAG CGGTGGTCAA
TTGGCGCTCA GCGCCGGCAC GTTAAACCAG CAAGGGACGC TCAGCGCGAA AGGGGATGCG
GATCTCGACG TCAGCGAATC GCTGCAAAAC AGCGGCGATC TGCTCGTCGA CGGCGGGCTG
AATGTCAAAA CCGGCGCGTT GGTGCAAAAC GGTGTGCTCT CGGGCGCGAA AGCGCTCAGC
GTCAACGCGG ATAGCATCAC CAGCGGCAAA GCGTCGCGCA CCACCAGTCA GGGCAATATT
CAGCTTACCG CCACGCAACT CGCGGATCTT AACGGCCAGA CGGACGCTGC GGGTGCGCTG
AACGTCAGCA CTAAAAATCT GACCACCGGT GCGGATGCGC ACCTCCAGAG CGGTCAGGAT
CTGACATTAC AGGCGGAAAA TGCCACTCTC AACGGCACTC ACGCAGCGAA AGGGGCGCTC
AACGTTACCA CGCAGACGCT CGACCACGGC GGTAAATCGA CGGCGAAAAC GCTAGCGTTT
AATGCTCGCG AGGGTATCAC CAGCAGCGGT GAACTGACGG CGGACGATAT TTCGCTCAGC
GGGCAGCACA TTACCCACAG CGGAACGGCG AAAGCGAAAA ACATCACGTT CACCGCCCCA
CAGTTCATCA ATAACAGCGG GACGCTGGTG GCCGACGCCC TGACCCTCGA CGGGAAGCGC
ATCACCAATA GCGGCCTGCT TCAGGCGAAT ACCCAGTTCA ATCTGCATGC CGACGCGCTG
GAGAACCTTG CCAGCGGCGC GCTGTACAGC GTCCGCGATC TGACGCTCGA TCTGCCGGAA
CTCACCAACC AGGGGCTTAT CACCACCGAT GGAAATCTGT GGCTGAAAGG CGACGTCCTG
GCCAACGCCG GGGAAATCAA CGGCGTCAAT CTGCGTCTCG ACAACGCCAG CCTGGCAAAT
CAGGCGGCGG GTCGTTTGCT GGCAGATGAC CAGCTGAGTT TTACGGGTTC GGCGCTGGAC
AACGACGGCC AGATGGCGGC GAACAACGTC CAACTGACGG CAGATAGCCT GCAAAACCAC
GGCGTGATGC AGGGCAATGA CGCGCTGACG CTGAACGCTA CAGAGATAAG CAATAGCGGC
GCGTTACGTA CCGACGGCAC GCTGGATCTG CACGGCGTAT CGTTTGAGAA CACCGGCGAA
TTGAGCGCCA CCGATCTGCT TTTCACGCTG ACGCGTCAGA TGAATAATCA TGCTGACGGC
AAAATTATCG CCAAAAATGA CCTGACGCTG ACCGCGCCAG AGCTTCTCAA TAGCGGGCTG
CTGGCGGGAG CAAACACCCA GCTCAACGCA GGAACCCTTC TCAACAGCGG TACGCTGCAA
GGGACAAACT CCCTGACCGC GACGGGCAAA AAGCTCGATA ACCAACTGGC GGGCAAGCTG
CTTTCCGGCG GTGAACTGCA TCTGCAAAAC GACAAGCTGA CCAACGCCGG TTTGTTGCAG
GGTAAAACGC TGAACCTGGC GACCGGCGAG TGGATCAACA GCGGCAACGC GCTGGGCGAA
GCGGGCGTGA CGGCGACCGT CGGCGGCGCT TTCAGCAACC AGGGCAACAT GCTGAGCCAG
CAACAGCTCG ATCTGAACGC CGGCAACATC ACAAACCAGG GGCAATTGCT GGCGAAAGTA
CTGACGCTGC ACGGCGATCT GCTCAACAGC GGGCTGTTGC AGGGCAGCGA TGCGTTGGCA
TGGACGGGTG ACCGTTTTAC CAACCAGGCA CAGGGCCAGG CGACGGGCGG CGAAACGCTG
ACGCTTTCCG GAACCAGCCT GAACAATCAG GGGCAAATTC AATCGCGAGA CGCGGCGTTA
ACCGCCGATA CGCTGACTAA TAGCGGTTCG GTGCAGGCGC TGGACCGCCT GAAGTTGAAC
GTCAACGGGC GGCTCGATAA TCAGGGCGCG ATGCTCAGCC AAAATCTGTT TGAGCTGACG
GCGGCGCAGC TGTTTAACGA CGGTCAACTG GCGGCAAAAT CGCTGACGCT CAACACCCCG
CAGATCACCA ACACTGGCAC CGTGCAGGGT AACGACAGCC TGACGCTGGT CACCCGCAAC
CTGACAAACG CGCAGAGCGG ACAGCTGGTC AGCGGCGGTT CGCTGGATCT CGATCTCGAT
AAGCTCGACA ACGCCGGGCT GATGCAGGTG AATCAGCGCT TCACGGTGAA GGGAAACGAT
CTGCTTAACC GCGGCGATAT ACAGGCCGAT GCGCTGGATT TTGCGCTCAG CAAAACGCTG
AATAACCAGG GCGGCATTGT TGCCAAAAAT GGCGCAGCAC TGAACGCACC GACGCTGACC
AACAGCGGGA CGCTGGCAGG TAAAACCCTG ACGCTGAGCG GCACGGATAT TCGCAACAGC
GGGCTGATTC AGGGCAACGA TAACGCCGAC GCGACGGCCA GCCGTATTAC CAACGACGCG
GTGGGCAAGT GGATTTCTGG TGGGGCGCTG ACGTTTAACG GCGGTCAGCT CACCAACGCA
GGCGCTGTGC AGGGCGCGAC GATTGGCCTC ACGGCAGCGT CGCTGGACAA CAGCGGAACA
CTCAACGGAC TCAACGGCTT CACCGGGACG TTCACCGGCA AAGTGAACAA CGCTGGACAA
ATTCAGAGCG GCGGCGCGCT GAGTTTCAGC GCCGACAGCA TTCTCAACCC CGGCCGCATG
ACCGGTAAAA CCCTGACGCT GAACGCTCGC GACCTCAATA ACAGCGGGCT GTGGCAAGGC
ACTGATGGCC TGTCGCTCAC CGGCGATACG CTTGTGACCA CGGCGGCATC TCGCACGCTG
ACCGGCGGCG CGTTAACGCT GGATGCCGGA CAGCTAACCA CCCAAGGCAC GCTTCAGGGC
AACAACGTCG ATGTGACTTC TGACGGTTGG ACGCACGGCG GATCGCTTCT GAGCCTGGGA
GAATTGACCG CGAATGTCGG CGGGACGCTC ACTTCAACCG GTTCGCTGAT GAGCAAAGGC
GCGGCAGACG TGACGGCGCA AACGCTGGAT AACCGTGGGC AGTTGCTGGG TGAAGGCGAT
GTGACGCTCG GCGGCGGTAC GCTGAAAAAC AGCGGCACGG TGCAGGGGAA AAATCTCGAC
CTGCATCAGA GCAGTATCAA CAACCAGGGG ACGCTAACCG GGCTGGACAG CCTGACGATC
GAAGCCCGTC AGCAGTTGAT GGCGCGTATG GCAATGGCCG CGCCGCAGCA GGAGCTGATC
AACGGGACGA CGGGCGCGCT GCTCACGCAG GGCACGCTGA ATATCACGTC CGGGACCGTG
ACGAACGCTG GAAGCTGGCA GGGACAAAAC ATTCTGCTTA ACGCCCAGTC GCTGACGAAC
AGCGGCGCGG TGCAGAGCGC CGATGCGCTG CAAATGACGC TGGCGAATAC GCTGACCAGC
AGTGCGGGCA GCAAAATTAC TGCGATGGGC ACGGCGACGC TGCAGGCGCT GTCGCTCATC
AATCAGGGGC AGTGGGCGGC CAAAAATCTG ACGCTGAAAG GCGCGACGCT CAACAACAGC
GGTGCGATCA GCGGCGTTAA CGGACTCACG CTGGCGCAAA CGGGCGCGGT GACTCAGGAG
CAGAGCGGGA CGATGCTCTC CGGCGGGGCG CTGAACGTGA ACGCTGCGTC CGTGAGTAAC
GACGGCAAAA TTCAGGGTGC GACGCTCGGC GTAACCACCG GTGTGTTGAC CAATAATGGA
CGTTTGCAGG GCGATAACGG CGTCACGCTG GGCCTTAGCG GTAATCTGAC CAATAACGCG
AGCGGGGAAA TTGTCAGCCG TCAGGCGCTG ACGGTGACCA CCCCTTCCCT GTTCAATTAC
GGGCTGATGC AGGGCGGCGG CGAAACCAGC GTCACTGCCA CCAGCCAGGC GCGGAACGAC
GGCAAACTGT TGTCTGGCGC GCGTCTTACG CTGAACACAC CGCAGTACAC CGGCGCGGGC
TGGCTGCAGG CGACGAATCT CATTCTGAAC GCTGCGACGG CGACCAACAG CGGTACCTGG
GTTGCCGATC AGGCGACGCT GACGGGGAAC ACTTTTACCA ACCAGGGCAC CACACAGGCG
GGAACGCTGG CCGTTAATTA CAGCCAGTTG ACCAACAACG GCACGCTGCT GGGCAACACG
CAGTTGACGG TGGGCGCTAA TCAGGTCAAT CAGAGCGCGG CGGGTAAACT GTTCAGCGGC
GGCGATCTGT GGCTCGAAAG TAAAGGGCTG GATGTTGTCG GACAGGTCGT CTCGCTCGGC
AATCTGACCT TAAAACTCAC CAACGCGTTT ACCAGTAAAA CGGCGCTGGC CGCGGGTAAA
ACGCTCGCGA TCAGCAGCAA TGGGGCGATC GATAACCGCA GCGTGATGCA GGGGCAGGCG
GTCAATTTGA CGGCGGGTGG GCAGTTAAGC AATAACGGAC AGATCACCAC CGGAACCGGG
GCGAGCACAC TTTCCGGCAG TACTGTTGCG CTCAATGGTG CGGGGTCAAT TCAGGGCGGC
GGCGATATCA ACATTGCCAG CCGCGGCAAT ATGACCGTCG ACGGTTTTAC CGGCACGCGC
GGCTCCCTGA CGCTCAGCGC GCCGGGATCC ATCATCAACA CCGCGCTGCT GTACGCGGCC
AATAATATGG CGCTGTACGC CAACAGCATC ACCAACCAGC GCGGCGATAT CCTCGCCGGA
AATAGCCTGT GGATGCAGAA AGATGCGGCG GGAAATGCCA ACAGCCAAGT GGTGAACACG
TCGGGGAATA TTGAGACCAC GCGCGGTGAT ATCACGATCA GAACAGGAAG TCTGCTGAAT
GAGCGCGAGG GTATCAGTGA AACCCGCAGT TATCAGGCCG CGACGGACAG CCCTGCGGCA
AGCGGTGCGA CTTCCATTAG TGTAAAAGTC ACGGACCTGC CCCCTGATGA GTGGGGATAT
ATTTATACAG CTTACAGCGG TGCGGGTGGC GGTAATATCT TTTCAATTGT CGCGCCGATG
CCAAATGGGG CCGTACAACG TTATCTGGTC GGTTCGACCG TGGTCAATGT CACCGCAACA
GGAGGCGTAG CGCGCATCGC CGCTAACCGC GATCTGACTA TCAACGCCGC CACGCTGAAT
AACCGTGCTG GTTATTTACT TGCCGGGAAT GGGATGAATC TTTCCGGCAA CAGTCTGAAT
AACCAGTCCT GGTTTGGCTA TTCAGAGGAT GAATATAAGG TCTATCGTTA CAATGGGAAA
AAAGGCAAGG TATCCAGCCT GGAAGGAAGC CCCGCCTCTG GCAACGATAA AAACCGTCGT
GTGACCTATA CGCTGGATGG CGCACCTCAG TACGAAACCC ATACCACCGA TCAGGCCCTC
CGCGCGGTGA TTCAGGCCGG CGGTCAGGTT GCGGCGAATT TTACCAGTAA TATTAGCAAC
ACGGCGACGA CCTCGAATGG CGGAGGAATA AGCCATGCGA TTCCAGCACC TTCCCTGAAC
ACGCTCAGCA ATAAAGCGAT CGGCGGCGGG GCGCAAAAAC AGAGCCTGAC AAATACCGCA
GCGGTGGCGG TGAACTCGCC GGAGTGGAAC GATCAGCTCC AGGGTGCGTT GCAGCAAATC
AACGGCGGCG GCGCGCTGGA CGGCAACGGC GCAAGCAATA CCGCGCTGAC CAACATTTCT
ACGACCCAAA AAGGCAACGC CAACCTTGGC AAACTGGACA GTCTGGCAAA TGCAGGCGTC
ACAACAGCGG CGCTGAATAA CGCCACCGGC GGTGCGACCG GGCAGTATCA GGGTAAAACC
GTGGATACCA GCGCCTATCC GCTGCCGTCA GGCAAAAACG GCTATTTTGT GATTTCCGAT
AACCCGAAAA GCCCGTACCT GATTAACGTC AACCCAAAAC TCAACGACCT CGGAAAACTG
GATCCGGCGC TGTTTGCCGA CCTGAATGCG CTGCTCGGCA TTAAACCGTC GACAGCCGCG
CCACAAGAGA CGCGCACTGC GTTTACCGAT GAAAAACAGG TGCTCGGATC GTCCTATATG
CTGGGGCGTC TGAACCTCAA TCCGGACTAC GACTATCGCT TCCTCGGTGA CGCGGCGTTT
GATACCCGCT ACGTCAGCAA CGTTGTGCTC AATCAAACCG GCAACCGCTA TCTGAACGGT
ATTGGCTCCG ATCTCGATCA GATGCGCTAT CTGATGGACA ACGCTGCCGC GCAACAGCAG
TCGCTCGGCC TGCAGTTTGG CGTCTCGCTG AACGCTGACC AGATCGCTGC GCTCGACCAC
AGCATCATCT GGTGGGAAAA AGCGACCATC AACGGCGAAA CGGTGATGGT GCCGAAGGTT
TACCTGTCGC CGAAAGATGT CACCGTCAAC AACGGCAGCG TGATTGCGGG CAACAACGTC
ACCCTGAAAA GCGGCAATAT CACCAACAGC GGCAGTTCAC TGCTGGCCAA TAACAGCCTG
ACGATCGACA GCCAGAACAG CATCAGCAAC CTCAACGACG GCCTGATGAA AGCGGGCGGA
GCGCTGAACC TCAGCGCTAT CGGCGATATC AATAATATCA GCTCAACCAT CAGCGGTAAG
ACGGTGGCGC TGGAGAGCCT GGACGGCAGC ATTAATAACC TGACGCTGGC GGACCAGATT
GATATCAATG CGAAGGGCAA ACGCAGCAAT GTGACGATCA AAGACACCGT GCTGGGCACC
ACGGCGTCGA TTACCGCGCA GGATTCGCTG TCGCTGGAGG CGGGTAAAAA CATCACCGTC
ACTGGCGCGA ACCTGGCGTC CGGCGGCGAT ATGCTGCTGA ATGCGTGGGG CGATATCGCC
GTCAACGCCA ATCAGGTGAA TGACGCCTAC AGCTCCAGCC AGGCGAAAAC CAGCCGTTCA
TCCGTGACGT ATCAGGGAAG CACCGTCAGT GCGGGCGGCG ATCTCATCGT CAATGCCGGG
CACAATATCG ATCTCACCGC CAGCGACCTT AAAGCAGGCG GCAGCGCGGG ACTGAGTGCG
GGCAACGATC TGAACCTGAA CGCGGCGCAA ACCAGCGAAA GCAGCCGTAA AGGCAAGAGC
GAATCGCACA GCACCGACCT CGATCGCACT ACCGTTTCTG CAGGTGAAAA CCTGGTGCTG
AAAGCCGGAC AGGACATCAA CGCACAGGCG GCAGCGCTGG CGGCGGAGAA AAACGTCGGG
CTACAGGCCG GACGCGATGT GAATCTCGCG TCCCAGGAGA CCCGCGAGGG CGACAGCTAC
AAGTCGAGCA AGAAAACGGT CATCAACGAG TCCGTACGCC AGCAGGGAAC CGAAATCGCC
AGCGGCGGTA ACACGGTGAT TATCGCTGGA CGCGATGTCA ATTCGCAGGC CACGCAGGTT
ACCGCGCAGG GCGATATCGG TGTGGCGGCG GGTCACGACG TCAATCTGAC GACGGCTACG
GAAAGTGATT ACTACTACAA AGAGCAAACC AAAACCAAGA GTGGTTTCCT CAGCAAAAAA
ACCACCCATA CGATCCAGGA AAGTAGCGCG ACGCGTGAAG CCGGAACGCT GCTGAGCGGG
GATAACGTGA AGGTCACGGC GGGAAATAAC CTGCTGGTGC AAGGCTCTTC GGTGGTCGGC
GACGATGAAG TGGGGCTTAA AGCCGGTAAC AACGTCGATA TTGTGGCGGC GACCAACAGC
AACACCGACT GGCGCTTCAA GGAGACCAAA AAGAGCGGCC TGATGGGAAC CGGCGGAATT
GGTTTCACCA TCGGCAGCAG TAAATCCACG CACGATCTGC GCGAAAGCGG CACGACCCAA
AGCCAGAGCT TCAGTACCGT GGGCTCAACG GGCGGCAACG TTGCTATTAC TGCCGGTAAT
CAGCTCCATG TCGGTGGTGC GGATCTGGTG GCGGGCAAAG ATCTTGCGCT GAAGGGCGAT
AGCGTCATCA TCGAACCGGG GCACGATAAA CGCACGCGCG ATGAGACGTT TGAACAAAAA
GCGAGCGGCC TGACAGTCGC CCTTTCAGGC GCAGTGGGGA GTGCGATTAA CAGCGCGGTG
CAGTCGGCGC AAGCCGCTAA AGAGGAAAGT GATGGCCGAC TGGCGGCGTT GCAGGCTACG
AAAGCGGTGC TCTCCGGTGT GCAGGCGGGA CAGGGCGCGG TCGTGGCGCA GCAGACGGGC
GATCCGGCTA ACGGCTTTGG CGTCAGCATT TCTCTGACCA CGCAAAAATC AAAATCGCAG
AATCATGCCG AAAGCGACGT GGCGGCGGGT AGCACGCTGA ATGCGGGCGG AAACCTGGCG
ATCACCGCGA CGGGCAAAGG GAAAAGCGAA CACAGCGGCG ACGTTGTGAT CGCGGGCAGC
CAGCTGAAAG CGGGTGGTGA CACGTCACTG AATGCAGAAA ATGACATTTT GCTGACGGGC
GCGGCCAACA CGCAAAAATC CACCGGCAAA AACAGCAGCA GCGGCGGTGG CGTGGGGGTG
AGCATCGGTG GTGGCAGCGG CGGGGCCGGG ATCAGCGTCT TCGCCAACGT TAATGCTGCA
AAAGGCAATG AAAAAGGTAA CGGCACCTCC TGGACTGAAA CCACGCTGGA CAGCGGCGGC
ACGGTATCGA TGACCAGCGG GCGCGACGCC ATTCTGAACG GTGCGCAGGT CAGCGGCGAC
AAAGTTGTCG CGGATATTGG TCGCGATCTA TGGATGAGCA GTCAGCAGGA CAGCAACGAT
TTCAAATCGA AGCAAACCAG CGTGGCGGCA GGCGGTAGCT TCACCTTCGG CAGTATGACC
GGCTCCGGCT ACATCAGCGT CAGCCAGGAC AAAATGAAGA GCACCTATGA TTCGGTGCAG
GAGCAGACCG GGCTGTTTGC AGGCAACGGC GGCTTTGATG TGACTGTTGG GCGCCACACT
CAACTTGATG GTGCGGTGAT CGCCTCGACG GCCTCGGCGG ATAAAAACAG TCTGGATACC
GGCACGCTCG GCTTTAGCGA TCTGCACAAC GAAGCGGATT ACAAAGTCAG CCATACGGGG
ATCAGCCTGA GCAGCAGTAA GCCATCGGGC GGCGATTTTT CGATGGGCGG GATGATTTCA
GCGGCAAGTA ACAGCGGGCA CGCAGAAGGC ACCACGCAGG CGGCGGTGGC GAACGGTACC
ATCACCGTGC GTGACAAGGC TAACCAGAAG CAGGACGTAG CCAACCTGAG TCGCGATACC
GAAAACGCCA ACGACAGCAT CAGCCCGATC TTCGACAAAG AGAAAGAGCA AAATCGCCTG
AAAACGGTGG GACTGATTAG CGATATTGGT AGCCAGGCTG TAAACATTGC GCAGACGCAG
GGAGAAATTG CGAAGCGGAA CGCGATGAGC GATCCGGTGT CACTGAATGC GGCAAAAGCG
AAACTGAAAG CGGAGGGCAA TGCTAACCCA ACGGATAAGC AAATTGCGGA TCAGGCAGGG
CAAACGGCGA TGCAGCAGTA CGGCACGGGC AGTCCGCTGC AACGCGGTAT TCAGTCGGTA
ACAGCAGCGC TTCAAGGGCT GGCGGGCGGC AATATAGCTG GCGCGCTGGC GGGAGCGTCT
GCACCGGAGC TGGCTAACAT TATCGGTCAT CATATGGGCA TTGATGATGA CCCTGCGACC
AAAGCCGTTG CACACGCTAT TCTGGGCGGT GCTGTAGCGG CGTTGCAAGG TAACAATGCA
ATGGCAGGGG CAGCGGGGGC TGCCGTGGGT GAAGTCATTG CCAGTCAGCT TTATCCGAAT
GTACCGAAAG AAAAACTGAC GGAAGAGCAA AAGCAGACGA TCAGTACGCT AGCATCCATC
TCCGCGGGAA TAGCCGGAGG GTTAGCAGGC GATAGCACAT TGTCAGCAGC GACGGGATCG
CAGGCTGGCA AGAATGCGGT TGAAAATAAT TCAATGTCTG AGCTTGTGCC GCCTCGCGTA
CAGCAGGATG CTTCTCTGGC GTTTGATCCG AGTCAGCAAG GAAAGAGCGC AGAAGAGATC
AACGCTGCAA TTGATGCGTC ACATTTGGGG CCTTCCTGGG GTACAGAGTA TAAAGTTAAA
CCATATGTAA AAGGTGAAGT AGCCGCAGGC ACAGGGCCCG GTTATTATTA TGATACGAGT
ATAGACCCAT ACCAGATTTC AGCTAATCGT GGAGAAACAT TAGCTGTCGG TGGACGCGTC
TCCGGGCAGA TAGGGATTCA GTTTGGACCA TATTTCCCTG GGGCTATTGA TTCCGAAAGA
AATAGTTCCA TAGGCTTAGG GCTTGGGGTC ATTTCTAGCG AAATCTCTTA TGGAAAAGAT
GGTTTTAGTT TTAGTTTCGG AGTGGGACCT GCATGGGGAT GGAGCGGCGT GTCAACCAAT
GCTGCCGGTG AAAAAGTGGA TATCAATGGT TCCTCTGGAA CTGAATTCTA TCATCATGAC
TTCGGCCAGG ATAAAGCAAA ATGA
 
Protein sequence
MDTRHPPVRF SQRLISWVVC GLMIWQPVAP AFAAVMTPTG NTTMDKAGNG VPVVNIATPN 
GAGISHNQFD SYNVGKEGVI LNNATDRLTQ TQLGGLIQNN PNLQAGREAK GIINEVTGAN
RSQLQGYTEV GGKAANVMVA NPYGITCNGC GFINTPNVTL TTGKPQFDAS GNLMALDVTK
GAITVEGQGL DASKSDALSI ISRATEVNAA IHAKDLTVIA GANRVGADGS VKAIAGQNAT
PTVAVDTGAL GGMYANRIRL VSSDTGVGVN LGNLNARQGD IQLDASGKLT VTNSLASGSI
TAKGAGVTLN GSHQAGGSLN VASTQDMTLN NSLLTSGGEM RLASDGKLQA TGGGANSKGA
LTVNSGQDMT QTNTKLVGQG NTTLNSNGKL TINGGGVSGN GDLNVVSSKD MTIANAAVGS
NANATLTSGG TLTATAGAIS AGKTLTLKGQ QLALNDRSRA DATGNIRLEG ERLTNQGQIN
AAGNLTIAAN QAANSGQMAA KGRVDAQTGE LINSGSLQGN GVTLKSQTLA NSGTLQSGGQ
LALSAGTLNQ QGTLSAKGDA DLDVSESLQN SGDLLVDGGL NVKTGALVQN GVLSGAKALS
VNADSITSGK ASRTTSQGNI QLTATQLADL NGQTDAAGAL NVSTKNLTTG ADAHLQSGQD
LTLQAENATL NGTHAAKGAL NVTTQTLDHG GKSTAKTLAF NAREGITSSG ELTADDISLS
GQHITHSGTA KAKNITFTAP QFINNSGTLV ADALTLDGKR ITNSGLLQAN TQFNLHADAL
ENLASGALYS VRDLTLDLPE LTNQGLITTD GNLWLKGDVL ANAGEINGVN LRLDNASLAN
QAAGRLLADD QLSFTGSALD NDGQMAANNV QLTADSLQNH GVMQGNDALT LNATEISNSG
ALRTDGTLDL HGVSFENTGE LSATDLLFTL TRQMNNHADG KIIAKNDLTL TAPELLNSGL
LAGANTQLNA GTLLNSGTLQ GTNSLTATGK KLDNQLAGKL LSGGELHLQN DKLTNAGLLQ
GKTLNLATGE WINSGNALGE AGVTATVGGA FSNQGNMLSQ QQLDLNAGNI TNQGQLLAKV
LTLHGDLLNS GLLQGSDALA WTGDRFTNQA QGQATGGETL TLSGTSLNNQ GQIQSRDAAL
TADTLTNSGS VQALDRLKLN VNGRLDNQGA MLSQNLFELT AAQLFNDGQL AAKSLTLNTP
QITNTGTVQG NDSLTLVTRN LTNAQSGQLV SGGSLDLDLD KLDNAGLMQV NQRFTVKGND
LLNRGDIQAD ALDFALSKTL NNQGGIVAKN GAALNAPTLT NSGTLAGKTL TLSGTDIRNS
GLIQGNDNAD ATASRITNDA VGKWISGGAL TFNGGQLTNA GAVQGATIGL TAASLDNSGT
LNGLNGFTGT FTGKVNNAGQ IQSGGALSFS ADSILNPGRM TGKTLTLNAR DLNNSGLWQG
TDGLSLTGDT LVTTAASRTL TGGALTLDAG QLTTQGTLQG NNVDVTSDGW THGGSLLSLG
ELTANVGGTL TSTGSLMSKG AADVTAQTLD NRGQLLGEGD VTLGGGTLKN SGTVQGKNLD
LHQSSINNQG TLTGLDSLTI EARQQLMARM AMAAPQQELI NGTTGALLTQ GTLNITSGTV
TNAGSWQGQN ILLNAQSLTN SGAVQSADAL QMTLANTLTS SAGSKITAMG TATLQALSLI
NQGQWAAKNL TLKGATLNNS GAISGVNGLT LAQTGAVTQE QSGTMLSGGA LNVNAASVSN
DGKIQGATLG VTTGVLTNNG RLQGDNGVTL GLSGNLTNNA SGEIVSRQAL TVTTPSLFNY
GLMQGGGETS VTATSQARND GKLLSGARLT LNTPQYTGAG WLQATNLILN AATATNSGTW
VADQATLTGN TFTNQGTTQA GTLAVNYSQL TNNGTLLGNT QLTVGANQVN QSAAGKLFSG
GDLWLESKGL DVVGQVVSLG NLTLKLTNAF TSKTALAAGK TLAISSNGAI DNRSVMQGQA
VNLTAGGQLS NNGQITTGTG ASTLSGSTVA LNGAGSIQGG GDINIASRGN MTVDGFTGTR
GSLTLSAPGS IINTALLYAA NNMALYANSI TNQRGDILAG NSLWMQKDAA GNANSQVVNT
SGNIETTRGD ITIRTGSLLN EREGISETRS YQAATDSPAA SGATSISVKV TDLPPDEWGY
IYTAYSGAGG GNIFSIVAPM PNGAVQRYLV GSTVVNVTAT GGVARIAANR DLTINAATLN
NRAGYLLAGN GMNLSGNSLN NQSWFGYSED EYKVYRYNGK KGKVSSLEGS PASGNDKNRR
VTYTLDGAPQ YETHTTDQAL RAVIQAGGQV AANFTSNISN TATTSNGGGI SHAIPAPSLN
TLSNKAIGGG AQKQSLTNTA AVAVNSPEWN DQLQGALQQI NGGGALDGNG ASNTALTNIS
TTQKGNANLG KLDSLANAGV TTAALNNATG GATGQYQGKT VDTSAYPLPS GKNGYFVISD
NPKSPYLINV NPKLNDLGKL DPALFADLNA LLGIKPSTAA PQETRTAFTD EKQVLGSSYM
LGRLNLNPDY DYRFLGDAAF DTRYVSNVVL NQTGNRYLNG IGSDLDQMRY LMDNAAAQQQ
SLGLQFGVSL NADQIAALDH SIIWWEKATI NGETVMVPKV YLSPKDVTVN NGSVIAGNNV
TLKSGNITNS GSSLLANNSL TIDSQNSISN LNDGLMKAGG ALNLSAIGDI NNISSTISGK
TVALESLDGS INNLTLADQI DINAKGKRSN VTIKDTVLGT TASITAQDSL SLEAGKNITV
TGANLASGGD MLLNAWGDIA VNANQVNDAY SSSQAKTSRS SVTYQGSTVS AGGDLIVNAG
HNIDLTASDL KAGGSAGLSA GNDLNLNAAQ TSESSRKGKS ESHSTDLDRT TVSAGENLVL
KAGQDINAQA AALAAEKNVG LQAGRDVNLA SQETREGDSY KSSKKTVINE SVRQQGTEIA
SGGNTVIIAG RDVNSQATQV TAQGDIGVAA GHDVNLTTAT ESDYYYKEQT KTKSGFLSKK
TTHTIQESSA TREAGTLLSG DNVKVTAGNN LLVQGSSVVG DDEVGLKAGN NVDIVAATNS
NTDWRFKETK KSGLMGTGGI GFTIGSSKST HDLRESGTTQ SQSFSTVGST GGNVAITAGN
QLHVGGADLV AGKDLALKGD SVIIEPGHDK RTRDETFEQK ASGLTVALSG AVGSAINSAV
QSAQAAKEES DGRLAALQAT KAVLSGVQAG QGAVVAQQTG DPANGFGVSI SLTTQKSKSQ
NHAESDVAAG STLNAGGNLA ITATGKGKSE HSGDVVIAGS QLKAGGDTSL NAENDILLTG
AANTQKSTGK NSSSGGGVGV SIGGGSGGAG ISVFANVNAA KGNEKGNGTS WTETTLDSGG
TVSMTSGRDA ILNGAQVSGD KVVADIGRDL WMSSQQDSND FKSKQTSVAA GGSFTFGSMT
GSGYISVSQD KMKSTYDSVQ EQTGLFAGNG GFDVTVGRHT QLDGAVIAST ASADKNSLDT
GTLGFSDLHN EADYKVSHTG ISLSSSKPSG GDFSMGGMIS AASNSGHAEG TTQAAVANGT
ITVRDKANQK QDVANLSRDT ENANDSISPI FDKEKEQNRL KTVGLISDIG SQAVNIAQTQ
GEIAKRNAMS DPVSLNAAKA KLKAEGNANP TDKQIADQAG QTAMQQYGTG SPLQRGIQSV
TAALQGLAGG NIAGALAGAS APELANIIGH HMGIDDDPAT KAVAHAILGG AVAALQGNNA
MAGAAGAAVG EVIASQLYPN VPKEKLTEEQ KQTISTLASI SAGIAGGLAG DSTLSAATGS
QAGKNAVENN SMSELVPPRV QQDASLAFDP SQQGKSAEEI NAAIDASHLG PSWGTEYKVK
PYVKGEVAAG TGPGYYYDTS IDPYQISANR GETLAVGGRV SGQIGIQFGP YFPGAIDSER
NSSIGLGLGV ISSEISYGKD GFSFSFGVGP AWGWSGVSTN AAGEKVDING SSGTEFYHHD
FGQDKAK