Gene SNSL254_A2903 details

Gene Information

Locus tag: SNSL254_A2903
Symbol:
ID: 6484751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2830044
End bp: 2841209
Gene Length: 11166 bp
Protein Length: 3721 aa
Translation table: 11
GC content: 59%
IMG OID: 642738223
Product: large repetitive protein
Protein accession: YP_002041952
Protein GI: 194444016
COG category:
COG ID:
TIGRFAM ID: [TIGR01965] VCBS repeat
            [TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass)
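
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either point explicitly, although both are consistent with the listed values. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 2830044, 2841209

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 11166, matches "Gene Length"
print(protein_length_aa(length))  # 3721, matches "Protein Length"
```

Both results agree with the record, which suggests the listed coordinates and lengths are internally consistent.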


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTAC TCGCCGTGGT TTCGAAATTG ACTGGCGTCT CCACCACTGT GGAATCCTCA 
GCGGTCACTC TTAACGCCCC GTCAATTGTT AAATTATCAG TGGCCCGGGA GGAAATTAGT
CAACTTACGC GCATTAATCA GGATCTGGTG GTGAGGCTCC ATTCCGGCGA AACGATCACG
ATTAAAAACT TTTACGTTAC CAACGATCTG GGCGCAAGCC AGTTAGTACT GGCGGAAAAC
GATGGCACGT TATGGTGGGT AGAAAATCCG CAAGCCGGGC TACATTTTGA ACAAATCGCT
GATATTAATG AGCTGCTGGT CACTTCTGGC GCTTCCCATG AAGCAGGCGG CGCCGTTTGG
CCGTGGGTAC TGGCTGGCGC GGTGGCGGCT GGCGGCATTG CCGCTATCGC GTCTTCCGGC
GGCGGCGATT CCCACCATCA TTCGGATGGC GATAATCCGC CCCCCGATAA CACCAATCCT
GACGGTAATC CCCCTGATAA CAGCAATCCC GGCGGCAGTA CCCCCAACGG CAATACTCCA
GGTAGCAGTA ATCCTGTAGA TACTACCCCG CCTCTCGCTC CCGGCGAATT ATTGATTTCA
GCGGACGGAA AAACGGTAAG CGGGCAAGCC GAAGCGGGCA GTACCATTAC CATCAAAGAT
CCCTCAGGCA ACGTCGTTGG CGAGGGCAAA GCGGATAGCG ACGGTAAATT TAGTATTGAT
CTGACAGCGC CACAGATTAG CGGCGAACAA CTTACCGTGA CCGCGACTGA CGATGCCGGC
AATACCGGCC CATCCGCAAC CATTGATGCG CCCAACATTC CTCTCCCCGA TACACCGGTT
ATCACCGCCG CTATCGATGA TGCCGCTCCC CTCACCGGCA CGCTGAGCAA TAATCAGTTT
ACGAACGACA GTACCCCCAC TCTGGAGGGC ACCGGCAGCG CAGGCACAGT CATCCATATT
TACGCCAATG GTCAGGAAAT AGGCTCCACA ACGGTTGATA CCAGCGGAAA CTGGCGTTTT
GCCATTACCA GCGCGCTAGC GGATGGGGAA AATCATTTCA CCGCCATTGC GACTAACGTT
AAAGGCGAAA GTAGCGAATC AGCCCGCTTT ACGCTGACTA TCGACACACT CAGCCCCGAT
GCCCCACGCG TTGAACTGAT TGCCGATAAC ACCGGTTTGC TCACCGGGCC GCTACAGAAT
AATGACCGGA CTGACGAGGC AAAACCGCTA TTTTCCGGGC AGGGAGAGGC AGGCAATACC
ATCACGATTA AAGAAGGTTC AACCGTTATC GGCAGCGCTA CCGTAGACGA AAATGGACGC
TGGACCTTTA CGCCGACTAC GCCGTTAAGC GATGGCGAAC ATACCTTTAC CGTCGAACAA
AGCGACAAAG CCGGAAACAC GAGCCGCGTG ACGACAACGC CTACTATCAT TGTGGACACC
ACACCGCCTG ACGCCGCTAT CATTGATAAT GTTGCGAAAG ACGGCACAAC CGTTAGCGGC
ACCGCTGAAG CTGGCAGTAC CGTGTCGATC TATGACCCGG CGGGAAATTA CCTGGGCTCC
ACGATTACCG GAGAAAATAA CCACTTCAGC ATCACGCTGA ATCCGGCTCA GACCCACGGC
GAGCGTCTGG AAGCGCGTAT TCAGGACGCC GTCGGTAACA TCGGCCCCGC CACGGAGTTT
ACCGCTTCTG ATTCACAGTA TCCTGCCCAG CCGACTATCC TTACCGTGAC GGATGACGCT
GGCGCCGTTA CCGGGCTGCT GAAAAATGGC GATGCCACAG ATGATAACCG CCCAACCCTC
AGCGGTACTG CTGAACCAGG CAGTACGATA TCGATTAACG ATAATGGCTT TCCTGTACCG
ACCTTTCCGC CCATTGTCGC TGACGCTGAC GGCAAATGGA GCTTTACCCC CTCGCTGGCA
CTTGCCGATG GCGACCATGT CTTTACCGCT ACCGCGACCA ACGATCGCGG CACCAGCGGG
CAGTCCGTCT CCTTTACCAT TGATATCGAC ACGCAGCCGC CGGTGCTGGA AGGCCTGGCG
GTTAGCGACG TCGGCGACAG ACTTACCGGC ACTACGGAAG CTGGCAGTAC TGTGGTTATC
AAAGACAGCC TGGGAAATAC GCTCGGTAGT GGAACGGCAG GCGACGACGG TACCTTCTCA
ATAGGTATTA GCCCGGCGAA AATTAACGGC GAAACATTGA GCATTAGCGT TACCGATAAA
GCCGCGAATA GCGGTCCGGT AGAAACGCTG AACGCGCCGG ATAAAACTGC GCCTGCGGCA
CCGGACGGTC TTATCGTGGC GACCGACGGT CTGTCCGTAA GCGGTCAGGC GGAAGCCGGG
GCAACGGTCA CTATCCGCGA CAGTAGCAAC ACCGTACTTG GCAGCGCCGT CGCTAACGGC
AACGGACAAT TTATCGTTCC GCTGAATACG GCGCAGACTA ACGGCCAAGC GCTTATCGCT
ACCGCCACCG ATGTCGCGAA AAACGAAAGC GCCGCCGCGA CGGTTATCGC GCCGGACAGT
ACCGCGCCGG AAATGCCGAA AAACGTGGTA ATTAGTGAGG ATGGCACCAG TATCAGCGGC
ACCGCCGAAC CGGGTAGCGC CATCACGATC GCCACGCCGG ACGGCAAGCC GCTTAGCAGC
GGCAAAGCAG ATGGCGAAGG TCATTTTACC CTTCCCCTCG TCCCCGCACA GACCAACGGC
GAACAGGTTA CCGTCACCGC CACCGACAAC GCCAACAACG TCAGCCCGCC AACTACAGCG
CAAGCGCCCG ATATCACCGC CCCGGATAAG CCCATTATCA CTCAGGTGCT GGACGATGTT
GAAAGCTTCA CCGGGCCGCT GGTTAACGGA CAAACCACCA ATGACAACCG CCCCACCCTT
AGCGGTACGG CGGAGGCTGG CGCGCGTGTC GAAATTTTTG ATAACGGTGT TTCGCTGGGA
CTCGCCACGC TACAGCCCAA CGGCACCTGG ACGTTTACGC CGTCGCAAAA TTTAGGTGAA
GGCGCGCATC GACTGACCGT AATCGCAACC GACGCTAAAG GCAATGCCAG TCCGGCCGCG
TCATTCGACC TGGTGGTCGA TACGCAATCG CCGCAGCAAC CGGTAATCAC CTTCATTACA
GATGATGCGC CGGGTATTCT CGGTAGCGTC GCGCATCTGG GGCTCACTAA CGACAGCACG
CCAACGATTA ACGGTACAGG TGAACCGGGT TCCACAGTAC ACCTGTATCA GAATGGCGCC
CGGATAGCGG ATATTATCGT CGGTAATTCC GGCGTCTGGA GCTACGCCTA CACCACGGCC
TCGCCACTGG CGGACGACAC CTACACCTTT ACCGTGACGG CCAGCGACAG TAACGGCAAC
ACCACGCCTT TTTCGACCGA TTTTACGATT ACCATTGATA CCCAGGCCCC TGCCGCTCCC
GGCGTTATCG GCGTAGCTGA CGGCGACGGA AATACGATTG ATACCAATCA GATTACCCAG
GAATCCCAGC CCCGGTTGAG CGGTAGCGGC ACCGCAGGCG ATACAATCAT CCTTTACGAT
AACGGCAATG CCATAGGTCA GGCGCTGGTC GGTACGGACG GGCGCTGGCA GTTTACGCCG
CCTGCCGCGC TGGGCGACGG CGACCACCTT CTGACCGCTC GCGCCAACGA TCCGGCGGGG
AACGAAAGTC CCGAATCCAT CAGCTTTACC CTACGCATCG ATACCCAGGC GCCGGATGCG
CCGCAGATCG TGTCAGCCGC CATCACCGGC GGAGAAGGCG AGGTGCTACT GGCAAACGGC
AGTATTACCA ATCAGCGTAT GCCGACCCTC AGCGGCACCG GCGAACCCGG CACCATCATC
ACCCTGTACA ATAACGGCGT AGAACTGGCT ACCGTCCAGG TCAATCCACA GGGTAGCTGG
ACCTATCCGC TAACCCGTAA TCTGAGCGAA GGGTTAAACA TCCTGACGGC CACCGCCACG
GATGCCGCAG GCAATAGTAG CCCGACCTCC GGCGTTTTCT CCGTTACCCT TGATACCCAG
CCTCCAGCGC AGCCTGACGC GCCGCTAATC AGCGATAACG TCGGCGAAGT CCAGGATACT
ATTGTCAGCG GCGCAACCAC TGACGACAAT ACACCGGTCA TTCACGGCAC TGGCGACATC
GGCAGCATTA TTACGCTCTA TAACGGCAGC AGCGTTTTAG GCGTAGTCAC CGTCGATGAG
ACCGGCACCT GGACGCTGCC GGTGACCAGC GCGTTGCCGG ATGGCGTCTA CACCCTGACC
GCCATTGCCG CCGATGCCGC CGGAAACAGC AGCGGCGTAT CGAACAGCTT TACCTTCACC
GTCGACACCG TTCCGTTGCA GCCGCCCGTC GTCAATGAGA TCCTTGACGA TGTTGTACCA
GTGACCGGGC CATTAACCGA TGGCGCCTTT ACTAACGATC GGACGCTGAC TATCAACGGC
AGCGGCGAAA ACGGCAGCAC CGTCACGATT TACGACAATG GCGTGGCAAT CGGGACGGCG
CTCGTCACCG ACGGGACCTG GACATTCAAT ACGCCCGAAT TGTCAGAAGC CAGCCATGCG
CTAACCTTCA GCGCGACTGA CAATGCTGGA AATACCACGG CGCAAACCCA GCCGATCACC
ATTACCGTGG ATATCACCGC CCCGCCCGCG CCAACAGTCC AGACGGTGGA CGACGATGGC
ACGCGCGTGG CCGGACTTGC CGATCCTTAC GCTACCGTTG AAATTCACCA TGCCGATGGC
ACCCTGGTCG GCAGCGCTGT CGCTAATGGC ACCGGTGAAT TCGTCGTTAC GCTCAGTCCG
GCGCAAACCG ATGGCGGTAC GCTGACGGCA ATTGCTATCG ATCGCGCGGG GAATAACGGC
CCGGCTACGA ATTTTCCCGC TTCCGACAGC GGTCTGCCCG CCGTCCCGGC CATCACGGCG
ATTGAAGATG ATGTCGGGAG CGTACAGGGG AATATTGCGG CGGGCGGCGC CACGGACGAC
ACCACGCCGA CGCTGCGCGG CACTACGGAT ATCGGCTCTA CCGTTGAAGT TTTCATTAAT
GGCGATTCGG CAGGCTTTGC CACCGTTGAC GCCAGCGGGA ACTGGATCTT TGAGATCACG
ACGCCATTAA GCGAAAGCAC ACATTACTTC ACCGTCCAGG CAACCAATGC GAATGGCCCG
GGCGGCCTGT CCGCACCGGT CGGGATCACT GTCGATCTTA GCGCGCCGGC GCAACCGGTT
ATTACCAGCG CAACGGATGA TGTCCCCGGC ATGACCGGTA CGCTGGATAA CGGCGCGCTC
ACCAATGATT CACGCCCGAC GCTCAACGGA ACGGGAGAAG CAGGCGCCAC CATCCGCATT
CTGGATAACG GCGTAGAAAT CGGTTCCGCC ACGGTAGATC AAAGCGGCAA CTGGCGCTTC
ACCCCGAACG CGCCGCTGGA GAGCAACGCG CACATCTTTA CCGCCGTGGC GACCGATCCC
GCTGGCAATA GCGGCCAGCC TTCGGACGGC TTTACGCTGA ACATTGACGC GCAGGCGCCA
GATGTGCCGG TTATCACGTC CGTGATTGAC GATAACAATC AACCGACCGT TCCGGTGTTA
CCGGGGCAAT CCACCGACGA TCGGCAGCCA ATACTGAACG GAACTGGCGA ACCTGGCGCG
ACAATCACCA TTTTTGATAA CGGTACGCCG CTTGGCACGG CTCAGGTAGG CGAAAACGGT
AGCTGGACAT TCCCGGTGCC CCGCAATTTG TCAGAGGGAA GCCATAATCT GACGGTTAGC
GCTACCGATC CGGCGGGCAA TACCAGCGCG GTCTCCGCGC CGTGGACGAT CATAGTCGAT
ATTACGCCTC CGGCGATCCC GGTTCTCACC TCCGTCGTGG ATGACCAGCC CGGTATTACC
GGCAACCTGG TTAGCGGGCA GCTAACGAAC GATGCGACGC CCACCCTGAA CGGGCGCGGA
GAGGCAGGCG CGACGATTAA TGTCTATCTT GACGGTAATC CCGCGTCCAT CGGTACCACG
ACGGTGAATA GCGACGGCAC ATGGAGTTTC ACGCCGCAGA CGCCGCTTGC AAACGGTAGC
CACACGTTCA CCCTTAGCGC CACCGATCCG GCGGGTAATA GCAGCGCGGT GTCCAGCGGA
TTTGTGCTGA CGATTGACGC CACACCACCC GCCGCGCCGG TTATCGCCAG CGTGGCAGAC
AATACGGCGC CGGTGACGGG CATCGTCCCC AACGGCGGCT CGACGAACGA AACCCGACCA
ACACTCTCGG GTACCGGTGA GGCGGGTACA ACCATCTCGA TTTATAATGG CAGCGCGCTG
GTCGGCACGG CGCAAGTTCA GGCCAACGGT AGCTGGAGCT TTACGCCGTC TACCTCGCTG
GGCGCGGGCG TCTGGAACCT GACGGCGACA GCAACCGATG CGGCAGGCAA TACCAGCGCC
GCGTCCGAAA TACGCTCGTT TACTATTGAT ACCACGGCTC CCGCCGCGCC TGTTATTGAT
ACGGTCTACG ACGGTACGGG CCCCATTACC GGCAATCTGA GTTCAGGACA GATCACAGAC
GAGGCGCGCC CTGTCATTAG CGGCACCCGT GAAGCCAACA CAACTATTCG TCTCTACGAT
AACGGCACAC TGCTGGCTGA AATTCCCGCC GACAATAGCA GTAGCTGGCG CTACACGCCC
GATGCCTCTC TGGCGACGGG CAACCATGTA ATTACCGTCA TTGCCGTTGA TGCCGCAGGC
AACGCCAGTC CCGTTTCGGA CAGCGTTAAT TTCGTCGTCG ATACCACGCC GCCGCTGACG
CCGGTAATCA CATCAGTCAG TGACGATCAG GCGCCAGGCC TCGGCACGAT CGCGAACGGC
CAAAATACCA ACGATCCTAC GCCAACCTTC AGCGGCACCG CAGAAGCCGG CGCCACGATT
ACGCTCTATG AAAATGGTAC GGTCATTGGC ACGACAACGG CTCAGCCTGA CGGCGCGTGG
AGCGTCGCCA CCTCAACGCT GGCAAGCGGA ACGCACGTCA TCACCGCCGT CGCCACCGAT
GCCGCAGGAA ACAGCAGCCC GAACAGTACA GCTTTCACCC TGACGGTCGA TACCACCGCG
CCGCAAACGC CAATCCTGAC GTCCGTGGTG GATGACGTCG CGGGCGGGGT CACAGGAAAT
CTCGCTAATG GTCAGATAAC CAATGATAAC CGCCCCACGC TGAACGGCAC TGCCGAAGCG
GGCAGCGTGG TCAGTATCTA TGATGGCAAC ACTCTGCTTG GCGTCACCTC GGCTAACGCG
GGCGGCGCGT GGAGCTTCAC GCCGACGACA GGGTTAAACG ACGGCACGCG CATATTAACA
GTGACCGCCA CCGACCCGGC AGGCAACGTT AGCCCGGCCA CCAGCGGTTT TACTATCGTG
GTCGATACCC TTGCGCCAAC GGTTCCGCTT ATAACCAGCA TCGTTGATGA TGTCCCGAAC
AACACCGGCG CCATTGGCAA TGGACAATCG ACCAACGACA CACAGCCGAC GCTCAATGGT
ACCGCGGAAG CCAACAGCGC GGTAAGCATC TTCGATAATG GTGCGCTGGT CGCGACCGTG
AACGCCAATG CCAGCGGCAA CTGGAGCTGG ACGCCAACCG CCGCGCTCGG CCAGGGAAGT
CACGCCTATA GCGTTAGCGC CGCCGATGCA GCTGGTAACG TTAGCGCCGC TTCGCCATCG
ATAACGATTA TCGTGGATAC CATTGCGCCC GGCGCGCCCG GCAACCTGGT CATCAATGCT
ACCGGTAATC GGGTGACGGG CACCGCGGAA GCAGGCAGTA CAGTGACGAT TACCTCTGAT
ACTGGTGTGG TACTGGGAAC CGCCACCGCC GACGGTACAG GCAGCTTCAC CGCCACACTC
ACGCCCGCGC AGACCAATGG TCAGCCGCTA CTGGCATTTG CCCAGGATAA AGCAGGCAAC
ACTGGCATTG CCGCCGGATT TACCGCGCCC GATACGCGCG TGCCGGAAGC GCCGATCATC
ACCAACGTAG TGGATGATGT GGGTATTTAT ACCGGCGCTA TCGCCAACGG TCAGGTCACT
AATGACGCCC AACCCACATT GAATGGTACC GCTCAGGCGG GCGCCACGGT GAGCATTTAT
AACAACGGGG CGCTGCTCGG CACCACCACG GCGAACGCCA GCGGAAACTG GAGCTTTACC
CCGACAGGCA ATTTGACCGA AGGCAGCCAC GCCTTCACCG CCACCGCGAC TAACGCCAAC
GGAACAGGCA GCGTCTCCAC CGCCGCGACG GTGATTGTCG ATACGCTGGC GCCCGGTACG
CCGTCAGGTA CGCTCAGCGC CGATGGCGGT TCACTTTCCG GGCAGGCAGA GGCAAACAGC
ACCGTAACCG TCACGCTGAC GGGGGGCGTG ACGCTCACCA CCACCGCTGG CAGCAACGGC
GCATGGTCTC TCACCTTGCC GACAAAACAA ATTGAAGGTC AACTCATTAA CGTGACGGCG
ACTGACGCTG CGGGTAACGC CTCTGGCACG TTAGGCATTA CCGCGCCGGT TCTGCCGCTG
GCGGCAAGGG ATAACATCAC CAGCCTTGAT CTGACCTCTA CCGCCGTCAC CAGCACGCAA
AACTATTCGG ATTACGGCCT GCTGCTGGTT GGCGCGCTTG GCAATGTCGC CTCGGTTTTG
GGTAACGATA CCGCTCAGGT TGAGTTCACC ATTGCTGAAG GTGGTACGGG CGACGTCACC
ATCGATGCCG CCGCAACGGG AATCGTGCTT TCGCTGCTCA GCACTCAGGA GATAGTGGTA
CAGCGCTATG ACACCAGCCT CGGCGCCTGG ACGACGATCG TCAACACCGC CGTTGGCGAC
TTCGCGAATT TGCTTACCCT GACCGGGAGC GGCGTTACCC TGAACCTGAA CGGACTGGGC
GAAGGCCAGT ACCGGGTACT CACTTATAAC ACCAGTCTGC TCGCCACCGG GTCATATACC
AGCCTGGATG TCGATGTACA CCAGACCAGC GCAGGTATTA TTAGCGGGCC AACCATCAGT
ACCGGCAACG TCATGGCTGA TGATACCGCG CCGACGGGCA CCACGGTCAC CGCCATCACC
AACGCCAACG GCGTCAGTAC GCCGGTCGGC GCGGGCGGCG TGGATATCCT GGGGCAATAC
GGCACGCTGC ACATTAATCA GGATGGCAGT TACACCTACA CGCTGACTAA GCCCACGGCG
GGATACGGAC ATAAAGAGAG CTTCACCTAC ACCATCACCC AGAATGGCGT CGGTAGCAGC
GCCGCGCAAC TGGTTATTAA TTTGGGTCCC GCGCCTGTAC CGGGCAGCGT GATAGCGACA
GACAATAACG CCTCGCTGGT CTTTGATACT CACGTTAGCT ACGTCAACAA CGGTCCCTCG
ACACAAAGCG GCGTCACGGT ATTAAGCGTC GGACTTGGTA ATGTACTGAA CGCGAATCTG
CTTGATGATA TGACTAATCC GATCATCTTT AACGTTGAAG AAGGCGCTAC GCGAACCATG
ACGTTACAGG GAACCGTCGG CGGCGTCTCA CTGGTTTCCA CGTTCGATCT GTACGTTTAT
CGCTTCAACG ATGCCATTCA GCAATATGAG CAGTTCCGGG TGGAAAAGGG CTGGATTAAC
ACCCTGCTGT TAGCCGGACA GTCCCAGCCG CTGACCCTGA CGTTGCCTGG CGGCGAATAC
CTGTTCGTGC TGAATACCGC CAGCGGCATT AGCGTCCTCA CTGGCTATAC GTTGGCGATT
TCCCAGGACC ACACCTATGC CGTTGACAGT ATCACCGCCA ACACCACCGG CAACGTACTG
ACCAATGATG TCGTCCCTAC GGACGCCCTC CTCACTGAAG TAAACGGCGT GGCGATTGCG
GCGACCGGCA CAACGGAGGT AAACGGGCTG TATGGCTCGC TCATCATTGA CGCAAGAGGC
AACTATACCT ACACGCTGAA GAACGGCGTC GGCGCCGACA GCATTAAAAC GCCGGACAGC
TTTATCTATA CGGTCAAAGC GCCAAACGGC GATACCGATA CGGCCTCGCT CAATATCACG
CCAACCGCCA GGGCGCTGGA TGCGATTAAT GATGTCAGCG ATACCCTCAG CGTCGCCACG
CTTCAGGATA CCGCTGCCTG GCTGGACTCC AGCGTCGGCA GCGCCAGTTG GGGGCTACTC
GGCAAATCGG GCAGCGGGAG CGGCACCTTT GACGTTGCAA CGGGCACCGT ACTTAAAGGC
GCGTCACTGG TCTTTGATGT CTCCACGCTC ATTACGCTGG GCAATCTGAA TATTAGCTGG
GCCATTCAGG AGAACGGGAC CGTCATACGC AACGGAACCG TTCCGGTGGC GAATATCACG
CTGGGCAGCG CGACGGTGAC CGTCAACCTG AGCGGCCTGG AGCTGGATGC CGGAACGTAC
ACGCTTAACT TTACCGGCAC CAATACCCTG GCCGGGGCGG CGACGATCAC GCCACGCGTC
ATCGGCACCA CCGTCGATCT GGATAATTTT GAAACGTCCG GAACGCATAC CGTTCTCGGC
AATATTTTTG ACGGCAGCGA CGCGGCGGGG GCGATGGATC AGCTTAATAC GGTGAATACC
CGCCTGAGCA TTAGCGGGTA TAACGGCAGC GCCGCCACGC TGGACGCCGC GGCGAATACC
ACCAGCGCCA CGATTCAGGG ACATTACGGC ACATTGCAAA TTAACCTTGA TGGCGCTTAC
ACCTACACCC TGAATAATGG CGTCGCGATG TCGTCCATCA CCAGTAAAGA GGTCTTTACC
TATCAACTGG ATGACAAGAT GGGTCATACG GATAGCGCCA CATTGACCAT TGATATGGCG
CCGCAAATCG TCAGTACCAA CCAAAACGAT GTTCTCATCG GCTCCGCCTA TGGCGATACG
CTGATTTACC ACCTGTTAAA CGGCGCGGAC GCGACCGGCG GCAACGGCGC CGATCGCTGG
CAAAACTTCT CCACCGCGCA GGGCGACAAG ATCGATATCC ACGAACTGCT GACCGGCTGG
GATCACCAGG CGGCGACGCT GGGTAACTTT GTTCAGGTTC ATACCAGCGA CGCCAATACG
GTGATATCCG TCGATCGCGA CGGCGCCGGC AGCGCGTTTA AATCAACTGA CCTTGTCACT
CTGGAGAATG TGCAGCTCAC GCTAAATGAT CTGTTGCAGA ACAACCACCT GATAACCGGC
GGTTGA
 
Protein sequence
MRLLAVVSKL TGVSTTVESS AVTLNAPSIV KLSVAREEIS QLTRINQDLV VRLHSGETIT 
IKNFYVTNDL GASQLVLAEN DGTLWWVENP QAGLHFEQIA DINELLVTSG ASHEAGGAVW
PWVLAGAVAA GGIAAIASSG GGDSHHHSDG DNPPPDNTNP DGNPPDNSNP GGSTPNGNTP
GSSNPVDTTP PLAPGELLIS ADGKTVSGQA EAGSTITIKD PSGNVVGEGK ADSDGKFSID
LTAPQISGEQ LTVTATDDAG NTGPSATIDA PNIPLPDTPV ITAAIDDAAP LTGTLSNNQF
TNDSTPTLEG TGSAGTVIHI YANGQEIGST TVDTSGNWRF AITSALADGE NHFTAIATNV
KGESSESARF TLTIDTLSPD APRVELIADN TGLLTGPLQN NDRTDEAKPL FSGQGEAGNT
ITIKEGSTVI GSATVDENGR WTFTPTTPLS DGEHTFTVEQ SDKAGNTSRV TTTPTIIVDT
TPPDAAIIDN VAKDGTTVSG TAEAGSTVSI YDPAGNYLGS TITGENNHFS ITLNPAQTHG
ERLEARIQDA VGNIGPATEF TASDSQYPAQ PTILTVTDDA GAVTGLLKNG DATDDNRPTL
SGTAEPGSTI SINDNGFPVP TFPPIVADAD GKWSFTPSLA LADGDHVFTA TATNDRGTSG
QSVSFTIDID TQPPVLEGLA VSDVGDRLTG TTEAGSTVVI KDSLGNTLGS GTAGDDGTFS
IGISPAKING ETLSISVTDK AANSGPVETL NAPDKTAPAA PDGLIVATDG LSVSGQAEAG
ATVTIRDSSN TVLGSAVANG NGQFIVPLNT AQTNGQALIA TATDVAKNES AAATVIAPDS
TAPEMPKNVV ISEDGTSISG TAEPGSAITI ATPDGKPLSS GKADGEGHFT LPLVPAQTNG
EQVTVTATDN ANNVSPPTTA QAPDITAPDK PIITQVLDDV ESFTGPLVNG QTTNDNRPTL
SGTAEAGARV EIFDNGVSLG LATLQPNGTW TFTPSQNLGE GAHRLTVIAT DAKGNASPAA
SFDLVVDTQS PQQPVITFIT DDAPGILGSV AHLGLTNDST PTINGTGEPG STVHLYQNGA
RIADIIVGNS GVWSYAYTTA SPLADDTYTF TVTASDSNGN TTPFSTDFTI TIDTQAPAAP
GVIGVADGDG NTIDTNQITQ ESQPRLSGSG TAGDTIILYD NGNAIGQALV GTDGRWQFTP
PAALGDGDHL LTARANDPAG NESPESISFT LRIDTQAPDA PQIVSAAITG GEGEVLLANG
SITNQRMPTL SGTGEPGTII TLYNNGVELA TVQVNPQGSW TYPLTRNLSE GLNILTATAT
DAAGNSSPTS GVFSVTLDTQ PPAQPDAPLI SDNVGEVQDT IVSGATTDDN TPVIHGTGDI
GSIITLYNGS SVLGVVTVDE TGTWTLPVTS ALPDGVYTLT AIAADAAGNS SGVSNSFTFT
VDTVPLQPPV VNEILDDVVP VTGPLTDGAF TNDRTLTING SGENGSTVTI YDNGVAIGTA
LVTDGTWTFN TPELSEASHA LTFSATDNAG NTTAQTQPIT ITVDITAPPA PTVQTVDDDG
TRVAGLADPY ATVEIHHADG TLVGSAVANG TGEFVVTLSP AQTDGGTLTA IAIDRAGNNG
PATNFPASDS GLPAVPAITA IEDDVGSVQG NIAAGGATDD TTPTLRGTTD IGSTVEVFIN
GDSAGFATVD ASGNWIFEIT TPLSESTHYF TVQATNANGP GGLSAPVGIT VDLSAPAQPV
ITSATDDVPG MTGTLDNGAL TNDSRPTLNG TGEAGATIRI LDNGVEIGSA TVDQSGNWRF
TPNAPLESNA HIFTAVATDP AGNSGQPSDG FTLNIDAQAP DVPVITSVID DNNQPTVPVL
PGQSTDDRQP ILNGTGEPGA TITIFDNGTP LGTAQVGENG SWTFPVPRNL SEGSHNLTVS
ATDPAGNTSA VSAPWTIIVD ITPPAIPVLT SVVDDQPGIT GNLVSGQLTN DATPTLNGRG
EAGATINVYL DGNPASIGTT TVNSDGTWSF TPQTPLANGS HTFTLSATDP AGNSSAVSSG
FVLTIDATPP AAPVIASVAD NTAPVTGIVP NGGSTNETRP TLSGTGEAGT TISIYNGSAL
VGTAQVQANG SWSFTPSTSL GAGVWNLTAT ATDAAGNTSA ASEIRSFTID TTAPAAPVID
TVYDGTGPIT GNLSSGQITD EARPVISGTR EANTTIRLYD NGTLLAEIPA DNSSSWRYTP
DASLATGNHV ITVIAVDAAG NASPVSDSVN FVVDTTPPLT PVITSVSDDQ APGLGTIANG
QNTNDPTPTF SGTAEAGATI TLYENGTVIG TTTAQPDGAW SVATSTLASG THVITAVATD
AAGNSSPNST AFTLTVDTTA PQTPILTSVV DDVAGGVTGN LANGQITNDN RPTLNGTAEA
GSVVSIYDGN TLLGVTSANA GGAWSFTPTT GLNDGTRILT VTATDPAGNV SPATSGFTIV
VDTLAPTVPL ITSIVDDVPN NTGAIGNGQS TNDTQPTLNG TAEANSAVSI FDNGALVATV
NANASGNWSW TPTAALGQGS HAYSVSAADA AGNVSAASPS ITIIVDTIAP GAPGNLVINA
TGNRVTGTAE AGSTVTITSD TGVVLGTATA DGTGSFTATL TPAQTNGQPL LAFAQDKAGN
TGIAAGFTAP DTRVPEAPII TNVVDDVGIY TGAIANGQVT NDAQPTLNGT AQAGATVSIY
NNGALLGTTT ANASGNWSFT PTGNLTEGSH AFTATATNAN GTGSVSTAAT VIVDTLAPGT
PSGTLSADGG SLSGQAEANS TVTVTLTGGV TLTTTAGSNG AWSLTLPTKQ IEGQLINVTA
TDAAGNASGT LGITAPVLPL AARDNITSLD LTSTAVTSTQ NYSDYGLLLV GALGNVASVL
GNDTAQVEFT IAEGGTGDVT IDAAATGIVL SLLSTQEIVV QRYDTSLGAW TTIVNTAVGD
FANLLTLTGS GVTLNLNGLG EGQYRVLTYN TSLLATGSYT SLDVDVHQTS AGIISGPTIS
TGNVMADDTA PTGTTVTAIT NANGVSTPVG AGGVDILGQY GTLHINQDGS YTYTLTKPTA
GYGHKESFTY TITQNGVGSS AAQLVINLGP APVPGSVIAT DNNASLVFDT HVSYVNNGPS
TQSGVTVLSV GLGNVLNANL LDDMTNPIIF NVEEGATRTM TLQGTVGGVS LVSTFDLYVY
RFNDAIQQYE QFRVEKGWIN TLLLAGQSQP LTLTLPGGEY LFVLNTASGI SVLTGYTLAI
SQDHTYAVDS ITANTTGNVL TNDVVPTDAL LTEVNGVAIA ATGTTEVNGL YGSLIIDARG
NYTYTLKNGV GADSIKTPDS FIYTVKAPNG DTDTASLNIT PTARALDAIN DVSDTLSVAT
LQDTAAWLDS SVGSASWGLL GKSGSGSGTF DVATGTVLKG ASLVFDVSTL ITLGNLNISW
AIQENGTVIR NGTVPVANIT LGSATVTVNL SGLELDAGTY TLNFTGTNTL AGAATITPRV
IGTTVDLDNF ETSGTHTVLG NIFDGSDAAG AMDQLNTVNT RLSISGYNGS AATLDAAANT
TSATIQGHYG TLQINLDGAY TYTLNNGVAM SSITSKEVFT YQLDDKMGHT DSATLTIDMA
PQIVSTNQND VLIGSAYGDT LIYHLLNGAD ATGGNGADRW QNFSTAQGDK IDIHELLTGW
DHQAATLGNF VQVHTSDANT VISVDRDGAG SAFKSTDLVT LENVQLTLND LLQNNHLITG
G
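
The gene and protein sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a block-formatted nucleotide sequence could be cleaned, translated, and checked for GC content follows. It uses the amino-acid assignments shared by NCBI translation tables 1 and 11; the special handling of alternative start codons in table 11 is not implemented, which does not matter here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The amino-acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on any codon containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGCGTCTAC TCGCCGTGGT TTCGAAATTG ACTGGCGTCT CCACCACTGT GGAATCCTCA"
print(translate(first_line))  # MRLLAVVSKLTGVSTTVESS

# Last line of the gene sequence: one glycine codon followed by a TGA stop.
print(translate("GGTTGA"))  # G
```

The first 20 residues and the final residue produced this way match the start and end of the protein sequence listed in the record. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.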