Gene Ping_2848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPing_2848 
Symbol 
ID4625873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePsychromonas ingrahamii 37 
KingdomBacteria 
Replicon accessionNC_008709 
Strand
Start bp3508568 
End bp3522010 
Gene Length13443 bp 
Protein Length4480 aa 
Translation table11 
GC content50% 
IMG OID639797970 
Producthemagglutinin/hemolysin-related protein 
Protein accessionYP_944153 
Protein GI119946473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT TGACTTTAAT TGTTGAGTTA CCAGACGGCA CACAACAGAC GCTTGTTTTA 
TCTAAATTAC AAATTCTTGC TGCATTAAAT GGCGTTGTTT ATACCTTGGT CGAACAGGAT
ACTCAGCGGG TGCCTGAAGA GTTGGTATTA AAGCGTAAAG GTGATGATTT ATACATTGAA
GTTGATGGGG TGCCTATTGC GCAGATTGAC GGTTTTTATA GCGCCGAGAT GAATGCAATT
TTCTCTGCGG ATGGCACATT AACACCTGGC TATGGTATGG CAGTGACCAG CTCAGATGTG
TTAGAGGGCA GTTTGGTGAA TGATAATGGT GAAGCAACCG TTGTCTGGGC AGCACAAGAA
AGTGGTTTGT CACCTTTAGT CTGGACTGGT GGCATTCTTG CTGGTGGTAT TGTTGCGGCA
GTCGCGCTGT CTGGTGGTGG AGGTGGTACG GAAACGATTG CACCCAAAGA TTCCAGCGCA
GATGCGGTAG TAGTAAATGC GATCGCCAAC GACAATGTAG TGAATAGTGG CGAAGCAACG
GAAGGTTTTA ATATTACGGG GTCAGGTGAA ACGGGCGCAA CGGTTAACTT AACCTTTGAA
AGTGCGATCA TTTTGGCGGC CGGTAATACC GCGATTGTGG ACGGGGATGG TAACTGGACG
GTTGCGGTGA CCAGTACTGA TATCGCTGCA ATGGGCGAAG GCGCAGAGTT GATCTCTGTC
ACTCAAACCG ATATCGCAGG TAACATGAGT GCCCCAAGTG TTAAAACGAT TGACATTGAC
ACGACCGCAC CGACCGCCCG TTCGATGGCG GCGAGCCCTA TGGTTGCCCG CGATGATCAG
GGTTCAGTGC AAGGCGATTT AACTTCGGGC GATAGCACCG ACGACACCAC GCTAATACTC
AGTGGCAGCA ATGAGGCAGG CTCGAGGGTT AATGTTTATA ACGGCAGCAC CTTACTGGGT
GCGGCAACTA TTAGCGGAAC GACCTGGCGC TACAGCGCGA CTATTGTCGA TGGCACTAGC
TACCAGTTCA ATGCTCAGGA AACCGATGCT GCGGGCAACG AAAGTGCTGT CACCAGTAAC
TTTGCAGTAA CCGGTGATAC GACAGCACCG ACCGCCAGTT CAATGGCGGC CAGCCCAATG
GTGGTCAGCG ATGATCAGGG TTCAGTGCAA GGCGATTTCA CCGATGGCGA TAGCACCGAC
GACACCGCGC TAATACTCAG TGGCAGCAAT GAGGCAGGCT CAAGGGTTAA TGTTTATAAC
GGCAGCACCT TACTGGGTGC GGCAACTATT AGCGGAACGA CCTGGAGTTA CAGCGCTACT
GTTACCGATG GCACTAGCTA CCAGTTCAAT ACTAAAGAGA CCGATGCTGC GGGCAACGAA
AGCGTTGCCA CCAGTAATTT TGCAGTAACC GGTGATACGA CAGCACCAAC CGCCAGTTCG
ATGGTGGCCA GCGATGATCA GGGTTCAGTG CAAGGCGATT TAAGTGATGG TGACAGCACC
GACGATACTA TGCTGGTGCT TAGTGGCAGC AATGAGGCGG GCTCAAGGGT TAATGTTTAT
AGTGGCAGCA CCCTATTGGG TGCGGCAACT ATTAGTGGAA CGACCTGGAG CTACAGCGCT
ACTGTTACCG ATGGCACTAG CTACCAGTTC AATGCTCAGG AAACCGATGC TGCGGGCAAC
GAAAGTGTTG TCACCAGTAA CTTTGCAGTA ACCGGTGATA CGACAGCACC AACAGCTAGT
TCAATGGCGG AGAGCCCGAT GGTGGTAAGC GATAATCAGG GTTCAGTGAC AGGTGATTTA
ACCTCGGGCG ATAGCACCGA CGATACTGCG CTAGTGCTTA GTGGCAGCAA TGAGGCTGGC
TCAAGCATTA ATGTTTATAA TGGCGGTGTC TTACTGGGTG CGGCAACTAT TAGCGGAACG
ACCTGGCGCT ACAGCGCTAC TGTTACCGAT GGTACTGCCT ACCAGTTCAA TACTCAGGAA
ACCGATGCCG CGGGCAACGA AAGTCCTGCC ACCAGTAACT TTTCAGTGAC CGGGGATATG
ACCGCACCGA CCGCCAGTTC AATGGCGGCG AGCCCGATGG TGGTCAGCGA TGATCAGGGT
TCAGTGCAAG GCGATTTAAC CTCGGGTGAT AGCACCGACG ACACCGCGCT AATACTCAGT
GGCAGCAATG AGGCAGGCTC AAGTGTTAAT GTTTATAACG GCAGCACCTT ACTGGGTGCG
GCAACTATTA GCGGAACGAC CTGGAGTTAC AGCGCTACTG TTACCGATGG CACTAGCTAC
CAGTTCAATA CTAAAGAGAC CGATGCTGCG GGTAACGAAA GCGTTGCCAC CAGTAATTTT
GCAGTAACCG GTGATACGAC GGCACCGACC GCCAGTTCGA TGGTGGCCAG CGATGATCAG
GGTTCAGTGC AAGGCGATTT AAGCGATGGT GATAGCACCG ACGATACCAC GCTAATACTC
AGTGGCAGCA ATGAGGCAGG TTCAAGGGTT AATGTTTATA ACGGCAGCAC CTTACTGGGG
GCGGCAACTG TTACTGGAAC AGGCTGGATC TACAACGCAC ATGTTGCTGA TGGCACTACC
TACCAATTCA ATACTCAGGA AACCGATGTC GCGGGCAACG AAAGCCCTGC CACCAGTGAC
TTTGCAGTAA CCGGTGATAC GACAGCACCG ACCGCCAGTT CAATGGCGGC CAGCCCAATG
GTGGTCAGCG ATGATCAGGG TTCAGTGCAA GGCGATTTAA CCTCGGGTGA TAGCACCGAC
GATACCACGC TAATACTCAG TGGCAGCAAT GAGGCTGGCT CAAAGGTTAA TGTTTATAGT
GGCAGCACCC TATTGGGGGC CGCAACTATT AGCGGAACAA CTTGGCGCTA CAGCGCGACT
ATTGTCGATG GCACTAGCTA CCAGTTCAAT GCTCAGGAAA CCGATGCTGC GGGCAACGAA
AGTGTTGTCA CCAGTAACTT TGCAGTAACC GGTGATACGA CAGCACCAAC AGCTAGTTCA
ATGGCGGAGA GCCCGATGGT GGTAAGCGAT AATCAGGGTT CAGTGACAGG TGATTTAACC
TCGGGCGATA GCACCGACGA TACTGCGCTA GTGCTTAGTG GCAGCAATGA GGCTGGCTCA
AGCATTAATG TTTATAATGG CGGTGTCTTA CTGGGTGCGG CAACTATTAG CGGAACGACC
TGGCGCTACA GCGCTACTGT TACCGATGGT ACTGCCTACC AGTTCAATAC TCAGGAAACC
GATGCCGCGG GCAACGAAAG TCCTGCCACC AGTAACTTTT CAGTGACCGG GGATATGACC
GCACCGACCG CCAGTTCAAT GGCGGCGAGC CCGATGGTGG TCAGCGATGA TCAGGGTTCA
GTGCAAGGCG ATTTAACCTC GGGTGATAGC ACCGACGACA CCGCGCTAAT ACTCAGTGGC
AGCAATGAGG CAGGCTCAAG TGTTAATGTT TATAACGGCA GCACCTTACT GGGTGCGGCA
ACTATTAGCG GAACGACCTG GAGTTACAGC GCTACTGTTA CCGATGGCAC TAGCTACCAG
TTCAATACTA AAGAGACCGA TGCTGCGGGT AACGAAAGCG TTGCCACCAG TAATTTTGCA
GTAACCGGTG ATACGACGGC ACCGACCGCC AGTTCGATGG TGGCCAGCGA TGATCAGGGT
TCAGTGCAAG GCGATTTAAG CGATGGTGAT AGCACCGACG ATACCACGCT AATACTCAGT
GGCAGCAATG AGGCAGGTTC AAGGGTTAAT GTTTATAACG GCAGCACCTT ACTGGGGGCG
GCAACTGTTA CTGGAACAGG CTGGATCTAC AACGCACATG TTGCTGATGG CACTACCTAC
CAATTCAATA CTCAGGAAAC CGATGTCGCG GGCAACGAAA GCCCTGCCAC CAGTGACTTT
GCAGTAACCG GTGATACGAC AGCACCGACC GCCAGTTCAA TGGCGGCCAG CCCAATGGTG
GTCAGCGATG ATCAGGGTTC AGTGCAAGGC GATTTAACCT CGGGTGATAG CACCGACGAT
ACCACGCTAA TACTCAGTGG CAGCAATGAG GCTGGCTCAA AGGTTAATGT TTATAGTGGC
AGCACCCTAT TGGGGGCCGC AACTATTAGC GGAACGACCT GGCGCTACAG CGCTACTGTT
ACCGATGGTA CTGCCTACCA GTTCAATACT CAGGAAACCG ATGCCGCGGG TAACGAAAGT
CCAGCCACCA GTAACTTTTC AGTGACCGGG GATACGACCG CACCAACCGC TAGTTCAATG
GCGGCGAGCC CGATGGTGGT CAGCGATAAT CAGGGTTCAG TGACAGGTGA TTTAGCCTCG
GGCGATAGCA CCGACGATAC TGCGCTAGTG CTTAGTGGCA GCAATGAGGC TGGCTCAAAG
GTTAATGTTT ATAGTGGCAA CACCCTATTG GGGGCCGCAA CTATTAGCGG AACAACCTGG
CGCTACAGCG CTACTGTTGC CGATGGCACT GCCTACCAGT TCAATACTCA GGAAACCGAT
GCCGCGGGCA ACGAAAGTCC TGCCACCAGT AACTTTTCAG TGACTGGGGA TACGACCGCA
CCAACAGCTA GTTCAATGGC GGCGAGCCCG ATGGTGATCA GCGATAATCA GGGTTCAGTG
ACAGGTGATT TAACCTCGGG CGATAGCACC GACGATACTG CGCTAGTGCT TAGTGGCAGC
AATGAGGCTG GCTCAAGCAT TAATGTTTAT AATGGCGGTG TCTTACTGGG TGCGGCAACT
ATTAGCGGAA CGACCTGGCG CTACAGCGCT ACTGTTACCG ATGGTACTGC CTACCAGTTC
AATACTCAGG AAACCGATGC CGCGGGCAAC GAAAGTCCTG CCACCAGTAA CTTTTCAGTG
ACCGGTGATA CGACCGCACC GACTGTTGCG ATTACGGATA ACACCGTTGA TACGGCCACC
GGCGAAGTGA TCTATACCTT CACCTTCCCG GAAGCCGTCA ACGACTTTAC CGTGGCTGAT
GTTACTGTCA CCGGCGGCAG CAAAGTTGGC AGCTTTGACA GCGGTGTCGA TGGAGACAGT
GTCTACACTC TGGTAGTGAC GCCTGACGCG AACAGCATTA CCGACATGAC CGTTAATGTA
GCCGCTGATA TTGCTAAGGA TACCGCCGGC AATAACAATC TGGTTGCGAC CGAATCCGTG
CAAGCAGTGG ATACGGTTAT TCCGACGGTG GCCATCAGTG ATAATACTAC CGGCACCGCA
ATTGGCGAGG TAATTTACAC CTTTAACTTT AGCGAAGCGA TGAGTGGTTT TACAATTGAT
GATGTGATCC TAACGGGGGG CGCAAAAGGC ACGTTTACTG AGGTTTCAGC CAGCCAATAT
ACCTTAGTAG TGACGCCTGA CGCGAACAGC ATTACCGACA TGACCGTTAA TGTAGCCGCT
GATATTGCTA AGGATCCCGC CGGCAATAAT AATCTGGTTG CGACTGAATC CGTGCAAGCA
GTGGATACGG TTATTCCGAC GGTGGCCATC AGCGATAATA CTACCGGCAC CGCAATTGGC
GAGGTAATTT ACACCTTTAA CTTTAGCGAA GCGATGAGTG GTTTTACAAT TGATGATGTG
ATCCTAACGG GGGGCGCAAA AGGCACCTTT ACTGAGGTTT CAGCCAGCGA ATATACCCTA
GTCGTGACGC CTGATGCGAA CAGCATTACC GACATGACCG TTAATGTAGC CGCTGATATT
GCTAAGGATA CCGCCGGCAA TAATAATCTG GTTGCGACTG AATCCGTGCA GGCAGTGGAT
ACGGTTATTC CGACGGTGAC CATCAGCGAT AATACTACCG GCACCGCAAT TGGCGAGGTA
ATTTACACCT TTAACTTTAG CGAAGCGATG AGTGGTTTTA CTGCCAATGA TGTGACCTTG
ACAGGGGGCA CAAAAGGCAC CTTTACTGAG GTTTCAGTCA GCCAATATAC CCTAGTAGTA
ACGCCTGACG CGAACAGCAT TACCGACATG ACCGTTAATG TGGCCGCGGA TATTGCTAAG
GATCCCGCCG GCAATAATAA TCTGGTTGCG ACTGAATCCG TGCAGGCAGT GGATACGGTT
ATTCCGACGG TGACCATCAG CGATAATACT ACTGGCACCG CAATCGGCGA GGTAATTTAC
ACCTTTAACT TTAGCGAAGC GATGAGTGGT TTTACTGCCA ATGATGTGAC CTTGACAGGG
GGCACAAAAG GCACCTTTAC TGAGGTTTCA GCCAGCCAAT ATACCCTAGT AGTAACGCCT
GACGCGAACA GCATTACCGA CATGACCGTT AATGTGGCCG CTGATATTGC TAAGGATCCC
GCCGGCAATA ATAATCTGGT TGCGACTGAA TCCGTGCAGG TACTGGATAC AGTTATTCCG
ACGGTGACCA TCAGCGATAA TACTACGGGC ACCGCAATTG GTGAGGTAAT TTACACCTTT
AACTTTAGCG AAGCGATGAG TGGCTTTACT GCCAATGATG TGACCCTAAC GGGGGGCACA
AAAGGCACCT TTACTGAGGT TTCAGCCAGC CAATATACCC TAGTAGTAAC GCCTGACGCG
AACAGCATTA CCGACATGAC CGTTAATGTG GCCGCTGATA TTGCTAAGGA TCCCGCCGGC
AATAATAATC TGGTTGCGAC TGAATCCGTG CAGGTACTGG ATACAGTTAT TCCGACGGTG
ACCATCAGCG ATAATACTAC GGGCACCGCA ATTGGTGAGG TAATTTACAC CTTTAACTTT
AGCGAAGCGA TGAGTGGCTT TACTGCCAAT GATGTGACCC TAACGGGGGG CACAAAAGGC
ACCTTTACTG AGGTTTCAGC CAGCCAATAT ACCTTAGTAG TGACGCCTGA CGCGAACAGC
ATTACCGACA TGACCGTTAA TGTGGCCGCT GATATTGCTA AGGATCCCGC CGGCAATAAT
AATCTGGTTG CGACTGAATC CGTGCAGGCA GTGGATACGG TTATTCCGAC GGTGGCCATC
AGCGATAATA CTACCGGCAC CGCAATTGGT GAGGTAATTT ACACCTTTAA CTTTAGCGAA
GCGATGAGTG GTTTTACAAT TGATGATGTG ATCCTAACGG GGGGCACAAA AGGCACCTTT
ACTGAGGTTT CAGCCAGCCA ATATACTCTA GTAGTAACGC CGAGCGCGAA CAGCATTACC
GACATGACCG TTAATGTAGC CGCTGATATT GCTAAGGATA CCGCCGGCAA TAATAATCTG
GTTGCGACTG AATCTGTGCA GGCAGTGGAT ACGGTTATTC CGACGGTGAC CATCAGCGAT
AATACTACCG GCACCGCAAT TGGGGAGGTA ATTTACACCT TTAACTTTAG CGAAGCGATG
AGTGGTTTTA CAATTGATGA TGTGACCCTA ACGGGGGGCA CAAAAGGCAC CTTTACTGAG
GTTTCAGCCA GCCAATATAC TCTAGTAGTG ACGCCTGACG CGAACAGCAT TACCGACATG
ACCGTTAATG TAGCCGCTGA TATTGCTAAG GATCCCGCCG GCAATAATAA TCTGGTTGCG
ACCGAATCCG TACAGGCAGT GGATACGGTT ATTCCGACGG TGACCATCAG CGATAATACT
ACCGGCACCG CAATTGGCGA GGTAATTTAC AGCTTTAACT TTAGTGAAGC GATGAGTGGT
TTTACTGCCA ATGATGTGAT CCTAATGGGG GGCACAAAAG GCACCTTTAC GGAGGTTTCA
GCCAGCCAAT ATACCTTAGT CGTGACGCCT GACGCGAACA GTACGACCAA TATGACCGTT
AATGTGGCCG CGGATATTGC TAAGGATACC GCGGGCAATA ACAATCTGGT TGCGACTGAA
TCCGTGCAAG CAGTGGATAC AGTTATTCCG ACGGTGACCA TCAGCGATAA TACTACCGGC
ACCGCAATTG GCGAGGTAAT TTACACCTTT AACTTTAGCG AAGCGATGAG TGGTTTTACA
ATTGATGATG TGACCCTAAC GGGGGGCGCA AAAGGTACCT TTACTGAGGT TTCAGCCAGC
CAATATACCC TGGTAGTAAC GCCTGACGCG AACAGCATTA CCGACATGAC CGTTAATGTA
GCCGCTGATA TTGCTAAGGA TCCCGCCGGC AATAACAATC TGGTTGCGAC CGAATCCGTG
CAGGCAGTGG ATACGGTTAT TCCGACGGTG GCCATCAGCG ATAATACTAC CGGCACCGCA
ATTGGGGAGG TAATTTACAC CTTTAACTTT AGTGAAGCGA TGAGTGGTTT TACAATTGAT
GATGTGATCC TAACGGGGGG CACAAAAGGT ACCTTTACTG AGGTTTCAGC CAGCCAATAT
ACCCTAGTTG TGACGCCTGA CGCGAACAGC ATTACCGAGA TGACCGTCAA TGTGGCCGCT
GATATTGCTA AGGATACCGC CGGCAATAAT AGTCTGGTTG CGACTGAATC CGTGCAGGCA
GTGGATACGG TTATTCCGAC GGTGACCATC AGCGATAATA CTACCGGCAC CGCAATTGGG
GAGGTAATTT ACACCTTTAA CTTTAGTGAA GCGATGAGTG GTTTTACAAT TGATGATGTG
ATCCTAACGG GGGGCACAAA AGGCACCTTT ACTGAGGTTT CAGCCAGCCA ATATACCTTA
GTCGTGACGC CTGACGCGAA CAGCATTACC GACATGACTG TTAATGTAGC CGCTGATATT
GCTAAGGATC CCGCCGGCAA TAATAATCTG GTTGCGACTG AATCCGTGCA GGTACTGGAT
ACAGTTATTC CGACGGTGAC CATCAGCGAT AATACTACCG GCACCGCAAT TGGGGAGGTA
ATTTACACCT TTAACTTTAG CGAAGCGATG AGTGGTTTTA CTGCCAATGA TGTGACCTTG
ACCGGGGGCA CAAAAGGCAC CTTTACGGAG GTTTCAGCCA GCCAATATAC CTTAGTCGTG
ACGCCTGACG CGAACAGTAC GACCAATATG ACCGTTAATG TGGCCGCTGA TATTGCTAAG
GATACCGCCG GCAATAACAA TCTGGTTGCG ACTGAATCCG TGCAGGCAGT GGATACGGTT
ATTCCGACGG TGACCATCAG CGATAATACT ACCGGCACCG CAATTGGTGA GGTAACTTAC
ACCTTTAACT TTAGCGAAGC GATGAGTGGT TTTACTGCCA ATGATGTGAC CTTGACCGGG
GGCACAAAAG GCACCTTTAC GGAGGTTTCA GCCAGCCAAT ATACCTTAGT CGTGACGCCT
GACGCGAACA GTACGACCAA TATGACCGTT AATGTGGCCG CTGATATTGC TAAGGATACC
GCGGGCAATA ACAATCTGGT TGCGACCGAA TCCGTGCAGG CAGTGGATAC GGTTATTCCG
ACGGTGACCA TCACTGATAA TACTACCGGC ACCGCAATTG GGGAGGTAAT TTACACCTTT
AACTTTAGCG AAGCGATGAG TGGTTTTACT GCCAATGATG TGACCTTGAC CGGGGGCACA
AAAGGCACCT TTACGGAGGT TTCAGCCAGC CAATATACCT TAGTCGTGAC GCCTGACGCG
AACAGTACGA CCAATATGAC CGTTAATGTG GCCGCTGATA TTGCTAAGGA TACCGCGGGC
AATAACAATC TGGTTGCGAC CGAATCCGTG CAGGCAGTGG ATACGGTTAT TCCGACGGTG
ACCATCACTG ATAATACTAC CGGCACCGCA ATTGGCGAGG TAATTTACAG CTTTAACTTT
AGTGAAGCGA TGAGTGGTTT TACTGCCAAT GATGTGATCC TAATGGGGGG CACAAAAGGC
ACCTTTACTG AGGTTTCAGC CAGCCAATAT ACCCTAGTCG TGACGCCTGA CGCGAACAGC
ATTACCGACA TGACCGTTAA TGTGGCCGCT GATATTGCTA AGGATCCCGC CGGTAATAAT
AATCTGGCTG CGACTGAATC TGTGCAGGCA GTGGATACAG TTATTCCGAC GGTGACCATC
AGCGATAATA CTACCGGCAC CGCAATTGGG GAGGTAATTT ACAGCTTTAA CTTTAGTGAA
GCGATGAGTG GCTTTGCAAT TGATGATGTG ATCCTAACGG GGGGCACAAA AGGCACCTTT
ACTGAGATTT CAGCCAGCCA ATATACCTTA GTCGTGACGC CTGACGCGAA CAGCATTACC
GACATGACCG TTAATGTAGC CACTGATATT GCTAAGGATC CCGCCGGTAA TAATAATCTG
GCTGCGACTG AATCTGTGCA GGCAGTGGAT ACAGTTATTC CGACGGTGAC CATCAGCGAT
AATACTACCG GCACCGCAAT TGGGGAGGTA ATTTACAGCT TTAACTTTAG TGAAGCGATG
AGTGGCTTTG CAATTGATGA TGTGATCCTA ACGGGGGGCA CAAAAGGCAC CTTTACTGAG
ATTTCAGCCA GCCAATATAC CTTAGTCGTG ACGCCTGACG CGAACAGCAT TACCGACATG
ACTGTTAATG TAGCCGCTGA TATTGCTAAG GATACCGCGG GTAATAATAA TCTGGTTGCG
ACCGAATCCG TGCAGGCAGT GGATACGGTT ATTCCGACGG TGGCCATCAG CGATAATACT
ACCGGCACCG CAATTGGGGA GGTAATTTAC ACCTTTAACT TTAGCGAAGC GATGAGTGGT
TTTACTGCCA ATGATGTGAC CTTGACCGGG GGCACAAAAG GCACCTTTAC GGAGGTTTCA
GCCAGCCAAT ATACCTTAGT CGTGACGCCT GACGCGAACA GTACGACCAA TATGACCGTT
AATGTGGCCG CTGATATTGC TAAGGATACC GCCGGCAATA ACAATCTGGT TGCGACTGAA
TCCGTGCAGG CAGTGGATAC GGTTATTCCG ACGGTGACCA TCAGCGATAA TACTACCGGC
ACCGCAATTG GTGAGGTAAC TTACACCTTT AACTTTAGCG AAGCGATGAG TGGTTTTACT
GCCAATGATG TGACCTTGAC CGGGGGCACA AAAGGCACCT TTACGGAGGT TTCAGCCAGC
CAATATACCT TAGTCGTGAC GCCTGACGCG AACAGTACGA CCAATATGAC CGTTAATGTG
GCCGCTGATA TTGCTAAGGA TACCGCGGGC AATAACAATC TGGTTGCGAC CGAATCCGTG
CAGGCAGTGG ATACGGTTAT TCCGACGGTG ACCATCACTG ATAATACTAC CGGCACCGCA
ATTGGCGAGG TAATTTACAG CTTTAACTTT AGTGAAGCGA TGAGTGGTTT TACTGCCAAT
GATGTGATCC TAACGGGGGG CACAAAAGGC ACCTTTACTG AGATTTCAGC CAGCCAATAT
ACCTTAGTCG TGACGCCTGA CGCGAACAGT ACGACCAATA TGACCGTCAA TGTGGCCGCT
GATATTGCTA AGGATACCGC GGGTAATAAT AATCTGGTTG CGACCGAATC CGTGCAGGCA
GTGGATACGG TTATTCCGAC GGTGGCCATC AGCGATAATA CTACCGGCAC CGCAATTGGC
GAGGTAATTT ACACCTTTAA CTTTAGCGAA GCGATGAGTG GTTTTACTGC CAATGATGTG
ATCCTAACGG GGGGCACAAA AGGCACCTTT ACTGAGATTT CAGCCAGCCA ATATACCTTA
GTCGTGACGC CTGACGCGAA CAGTACGACC AATATGACCG TCAATGTGGC CGCTGATATT
GCTAAGGATA CCGCGGGTAA TAATAATCTG GTTGCGACCG AATCCGTGCA GGCAGTGGAT
ACGGTTATTC CGACGGTGGC CATCAGCGAT AATACTACCG GCACCGCAAT TGGCGAGGTA
ATTTACACCT TTAACTTTAG CGAAGCGATG AGTGGTTTTA CTGCCAATGA TGTGATCCTA
ATGGGGGGCA CAAAAGGCAC CTTTACTGAG GTTTCAGCCA GCCAATATAC CCTGGTAGTA
ACGCCGAGCG CGAACAGCAT TACCGACATG ACCGTTAATG TGGCCGCTGA TATTGCTAAG
GATACCGCCG GCAATAACAA TCTGGTTGCG ACCGAATCCG TGCAGGCAGT GGATACAGTT
ATTCCGACGG TGGCCATCAG CGATAATACT ACCGGCACCG CAATTGGCGA GGTAATTTAC
ACCTTTAACT TTAGCGAAGC GATGAGTGGT TTTACAATTG ATGATGTGAC CCTAACGGGG
GGCGCAAAAG GTACCTTTAC TGAGGTTTCA GCCAGCCAAT ATACCCTGGT AGTAACGCCG
AGCGCGAACA GCATTACCGA CATGACCGTT AATGTGGCCG CTGATATTGC TAAGGATACC
GCCGGCAATA ACAATCTGGT TGCGACCGAA TCCGTGCAGG CAGTGGATAC GGTTATTCCG
ACGGTGGCCA TCAGCGATAA TACTACCGGC ACCGCAATTG GCGAGGTAAT TTACACCTTT
AACTTTAGCG AAGCGATGAG TGGTTTTACT GCCAATGATG TGACCTTGAT AGGGGGCACA
AAAGGCACCT TTACTGAGGT TTCAGCCAGC CAATATACCT TAGTCGTGAC GCCTGACGCG
AACAGCATTA CCGACATGAC CGTTAATGTG GCTGCTGATA TTGCTAAGGA TACCGCCGGC
AATAATAATC TGGCTGCGAC TGAATCCGTG CAAGCTGTGG ATACCGAAGT CCCAACCACC
ACCATTACCG GTGCGGTTTA TAATGAAGCG AACAATACAC TGGTATTGAC TGGTACGAAT
ATCACTACGT TATTAAGCGG TGCTGAAGAT GTTAATACCG ACCTCAAGGC CAATTTAAAT
TGGAGTAAAC TGCACTGGGA TATGGATAAC GATAATAATG ATACGAATAA TGTCCCTTTT
ACCGTAGATG ATATCAAAAG TGCCGTGGCA ACAAATAATT CGACCTTTAC TATTACGCTA
AATGATAGCA AAGCAGCAGC ATTAGAAGGT GATGTAAATT TATTAGCGAA CGGTGGGAGT
GATAACCTCG ATATAGCGAC AGGTTTTGTT GGAGATACCG CTGGTAATAT TGCTACCACA
GATGATTATA ATGGTGAAAT AAACGATGCC TCTGTGGTTG TGTTTGATCT AGTGAATGGT
GTGTCATCGA GTCATAGCGG GCGGATTTTC GACGCGAATA TTGATTATAC CGTGTATATA
TTGGTCGATT CTGCTAACTT GGAATTTAAG ACTGTCAGCG ATGAAAGTTG GGGGAGATGG
ACCGCAGCCA ATAATTTGGA TGCGAGCGAT CAAGTGATTA TCGTTGGCAA TGCCGGGTTG
ATCGACTTAA ACGGAGCTGA TGCAGGTAAT CTGGCCGATA GTATGGTGGC GGGGTCAGCG
AATAAGTTGG AGATTAGAAA TACGAAAATT ACGACTGGAT ATAAGCAGGG AGTCAGTATC
ATGAGAACAG GATTCTTTAA GGAGACTACT AGTTTCAGCA GTGTGCAACA TATGGTGCAG
GGTAAGCTGT GGAAGGGAAC AGTGGCAGAT ATGAGAGGTA CTACCGTGGG TTCCACCCCG
GTGGATCACT ATCTGCAATC CATGCCAATC CACGTGTTAA CGTCGCAGGG GCTGGCTCCT
TAA
 
Protein sequence
MNKLTLIVEL PDGTQQTLVL SKLQILAALN GVVYTLVEQD TQRVPEELVL KRKGDDLYIE 
VDGVPIAQID GFYSAEMNAI FSADGTLTPG YGMAVTSSDV LEGSLVNDNG EATVVWAAQE
SGLSPLVWTG GILAGGIVAA VALSGGGGGT ETIAPKDSSA DAVVVNAIAN DNVVNSGEAT
EGFNITGSGE TGATVNLTFE SAIILAAGNT AIVDGDGNWT VAVTSTDIAA MGEGAELISV
TQTDIAGNMS APSVKTIDID TTAPTARSMA ASPMVARDDQ GSVQGDLTSG DSTDDTTLIL
SGSNEAGSRV NVYNGSTLLG AATISGTTWR YSATIVDGTS YQFNAQETDA AGNESAVTSN
FAVTGDTTAP TASSMAASPM VVSDDQGSVQ GDFTDGDSTD DTALILSGSN EAGSRVNVYN
GSTLLGAATI SGTTWSYSAT VTDGTSYQFN TKETDAAGNE SVATSNFAVT GDTTAPTASS
MVASDDQGSV QGDLSDGDST DDTMLVLSGS NEAGSRVNVY SGSTLLGAAT ISGTTWSYSA
TVTDGTSYQF NAQETDAAGN ESVVTSNFAV TGDTTAPTAS SMAESPMVVS DNQGSVTGDL
TSGDSTDDTA LVLSGSNEAG SSINVYNGGV LLGAATISGT TWRYSATVTD GTAYQFNTQE
TDAAGNESPA TSNFSVTGDM TAPTASSMAA SPMVVSDDQG SVQGDLTSGD STDDTALILS
GSNEAGSSVN VYNGSTLLGA ATISGTTWSY SATVTDGTSY QFNTKETDAA GNESVATSNF
AVTGDTTAPT ASSMVASDDQ GSVQGDLSDG DSTDDTTLIL SGSNEAGSRV NVYNGSTLLG
AATVTGTGWI YNAHVADGTT YQFNTQETDV AGNESPATSD FAVTGDTTAP TASSMAASPM
VVSDDQGSVQ GDLTSGDSTD DTTLILSGSN EAGSKVNVYS GSTLLGAATI SGTTWRYSAT
IVDGTSYQFN AQETDAAGNE SVVTSNFAVT GDTTAPTASS MAESPMVVSD NQGSVTGDLT
SGDSTDDTAL VLSGSNEAGS SINVYNGGVL LGAATISGTT WRYSATVTDG TAYQFNTQET
DAAGNESPAT SNFSVTGDMT APTASSMAAS PMVVSDDQGS VQGDLTSGDS TDDTALILSG
SNEAGSSVNV YNGSTLLGAA TISGTTWSYS ATVTDGTSYQ FNTKETDAAG NESVATSNFA
VTGDTTAPTA SSMVASDDQG SVQGDLSDGD STDDTTLILS GSNEAGSRVN VYNGSTLLGA
ATVTGTGWIY NAHVADGTTY QFNTQETDVA GNESPATSDF AVTGDTTAPT ASSMAASPMV
VSDDQGSVQG DLTSGDSTDD TTLILSGSNE AGSKVNVYSG STLLGAATIS GTTWRYSATV
TDGTAYQFNT QETDAAGNES PATSNFSVTG DTTAPTASSM AASPMVVSDN QGSVTGDLAS
GDSTDDTALV LSGSNEAGSK VNVYSGNTLL GAATISGTTW RYSATVADGT AYQFNTQETD
AAGNESPATS NFSVTGDTTA PTASSMAASP MVISDNQGSV TGDLTSGDST DDTALVLSGS
NEAGSSINVY NGGVLLGAAT ISGTTWRYSA TVTDGTAYQF NTQETDAAGN ESPATSNFSV
TGDTTAPTVA ITDNTVDTAT GEVIYTFTFP EAVNDFTVAD VTVTGGSKVG SFDSGVDGDS
VYTLVVTPDA NSITDMTVNV AADIAKDTAG NNNLVATESV QAVDTVIPTV AISDNTTGTA
IGEVIYTFNF SEAMSGFTID DVILTGGAKG TFTEVSASQY TLVVTPDANS ITDMTVNVAA
DIAKDPAGNN NLVATESVQA VDTVIPTVAI SDNTTGTAIG EVIYTFNFSE AMSGFTIDDV
ILTGGAKGTF TEVSASEYTL VVTPDANSIT DMTVNVAADI AKDTAGNNNL VATESVQAVD
TVIPTVTISD NTTGTAIGEV IYTFNFSEAM SGFTANDVTL TGGTKGTFTE VSVSQYTLVV
TPDANSITDM TVNVAADIAK DPAGNNNLVA TESVQAVDTV IPTVTISDNT TGTAIGEVIY
TFNFSEAMSG FTANDVTLTG GTKGTFTEVS ASQYTLVVTP DANSITDMTV NVAADIAKDP
AGNNNLVATE SVQVLDTVIP TVTISDNTTG TAIGEVIYTF NFSEAMSGFT ANDVTLTGGT
KGTFTEVSAS QYTLVVTPDA NSITDMTVNV AADIAKDPAG NNNLVATESV QVLDTVIPTV
TISDNTTGTA IGEVIYTFNF SEAMSGFTAN DVTLTGGTKG TFTEVSASQY TLVVTPDANS
ITDMTVNVAA DIAKDPAGNN NLVATESVQA VDTVIPTVAI SDNTTGTAIG EVIYTFNFSE
AMSGFTIDDV ILTGGTKGTF TEVSASQYTL VVTPSANSIT DMTVNVAADI AKDTAGNNNL
VATESVQAVD TVIPTVTISD NTTGTAIGEV IYTFNFSEAM SGFTIDDVTL TGGTKGTFTE
VSASQYTLVV TPDANSITDM TVNVAADIAK DPAGNNNLVA TESVQAVDTV IPTVTISDNT
TGTAIGEVIY SFNFSEAMSG FTANDVILMG GTKGTFTEVS ASQYTLVVTP DANSTTNMTV
NVAADIAKDT AGNNNLVATE SVQAVDTVIP TVTISDNTTG TAIGEVIYTF NFSEAMSGFT
IDDVTLTGGA KGTFTEVSAS QYTLVVTPDA NSITDMTVNV AADIAKDPAG NNNLVATESV
QAVDTVIPTV AISDNTTGTA IGEVIYTFNF SEAMSGFTID DVILTGGTKG TFTEVSASQY
TLVVTPDANS ITEMTVNVAA DIAKDTAGNN SLVATESVQA VDTVIPTVTI SDNTTGTAIG
EVIYTFNFSE AMSGFTIDDV ILTGGTKGTF TEVSASQYTL VVTPDANSIT DMTVNVAADI
AKDPAGNNNL VATESVQVLD TVIPTVTISD NTTGTAIGEV IYTFNFSEAM SGFTANDVTL
TGGTKGTFTE VSASQYTLVV TPDANSTTNM TVNVAADIAK DTAGNNNLVA TESVQAVDTV
IPTVTISDNT TGTAIGEVTY TFNFSEAMSG FTANDVTLTG GTKGTFTEVS ASQYTLVVTP
DANSTTNMTV NVAADIAKDT AGNNNLVATE SVQAVDTVIP TVTITDNTTG TAIGEVIYTF
NFSEAMSGFT ANDVTLTGGT KGTFTEVSAS QYTLVVTPDA NSTTNMTVNV AADIAKDTAG
NNNLVATESV QAVDTVIPTV TITDNTTGTA IGEVIYSFNF SEAMSGFTAN DVILMGGTKG
TFTEVSASQY TLVVTPDANS ITDMTVNVAA DIAKDPAGNN NLAATESVQA VDTVIPTVTI
SDNTTGTAIG EVIYSFNFSE AMSGFAIDDV ILTGGTKGTF TEISASQYTL VVTPDANSIT
DMTVNVATDI AKDPAGNNNL AATESVQAVD TVIPTVTISD NTTGTAIGEV IYSFNFSEAM
SGFAIDDVIL TGGTKGTFTE ISASQYTLVV TPDANSITDM TVNVAADIAK DTAGNNNLVA
TESVQAVDTV IPTVAISDNT TGTAIGEVIY TFNFSEAMSG FTANDVTLTG GTKGTFTEVS
ASQYTLVVTP DANSTTNMTV NVAADIAKDT AGNNNLVATE SVQAVDTVIP TVTISDNTTG
TAIGEVTYTF NFSEAMSGFT ANDVTLTGGT KGTFTEVSAS QYTLVVTPDA NSTTNMTVNV
AADIAKDTAG NNNLVATESV QAVDTVIPTV TITDNTTGTA IGEVIYSFNF SEAMSGFTAN
DVILTGGTKG TFTEISASQY TLVVTPDANS TTNMTVNVAA DIAKDTAGNN NLVATESVQA
VDTVIPTVAI SDNTTGTAIG EVIYTFNFSE AMSGFTANDV ILTGGTKGTF TEISASQYTL
VVTPDANSTT NMTVNVAADI AKDTAGNNNL VATESVQAVD TVIPTVAISD NTTGTAIGEV
IYTFNFSEAM SGFTANDVIL MGGTKGTFTE VSASQYTLVV TPSANSITDM TVNVAADIAK
DTAGNNNLVA TESVQAVDTV IPTVAISDNT TGTAIGEVIY TFNFSEAMSG FTIDDVTLTG
GAKGTFTEVS ASQYTLVVTP SANSITDMTV NVAADIAKDT AGNNNLVATE SVQAVDTVIP
TVAISDNTTG TAIGEVIYTF NFSEAMSGFT ANDVTLIGGT KGTFTEVSAS QYTLVVTPDA
NSITDMTVNV AADIAKDTAG NNNLAATESV QAVDTEVPTT TITGAVYNEA NNTLVLTGTN
ITTLLSGAED VNTDLKANLN WSKLHWDMDN DNNDTNNVPF TVDDIKSAVA TNNSTFTITL
NDSKAAALEG DVNLLANGGS DNLDIATGFV GDTAGNIATT DDYNGEINDA SVVVFDLVNG
VSSSHSGRIF DANIDYTVYI LVDSANLEFK TVSDESWGRW TAANNLDASD QVIIVGNAGL
IDLNGADAGN LADSMVAGSA NKLEIRNTKI TTGYKQGVSI MRTGFFKETT SFSSVQHMVQ
GKLWKGTVAD MRGTTVGSTP VDHYLQSMPI HVLTSQGLAP