Gene Gura_3011 details


Gene Information

Locus tag: Gura_3011
Symbol:
ID: 5165677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 3480740
End bp: 3495025
Gene Length: 14286 bp
Protein Length: 4761 aa
Translation table: 11
GC content: 58%
IMG OID: 640550506
Product: LamG domain-containing protein
Protein accession: YP_001231756
Protein GI: 148265050
COG category:
COG ID:
TIGRFAM ID:
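
The length fields in this record agree with its coordinates. The following is a minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Illustrative consistency check for the record's length fields.
# Assumptions (not stated in the record itself): Start bp / End bp are
# 1-based inclusive coordinates, and the CDS ends with exactly one stop
# codon that is excluded from the protein length.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3480740, 3495025)
print(length)                  # 14286
print(protein_length(length))  # 4761
```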


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAC TGATCGGAAT TGTGTGGGCG GTGTTGATTA CTCTGGGGAT GGGCGGTGTC 
GCGACGGCGC AGGTGGCTGA TCCTCTGGCC AATGTCCTCT CCAGCTTCGA CTTCAACATA
GTCGGCGTGG GGCTCAAGGC TGACCCCGAA TACCAGGCAG TGCCGAAGGG GATCGCCAGC
AAGGTGAACA CCCTGTTCGA TACCGGCACG TTCAACATTG ACGAGATCGC TGCCCAGCTT
CCCGCCGACT ACACGGTACG GGCCGAGCTT TCCGGCCCTT CGTTCCAGAC CCCGCTTCCT
CTCGTGACCA AGCCGGGAAA ACCGTTCGAC CTCCCGACCC TGGCTATCAT CGGCAAATAC
ACCCTCAATA ACATCCGCCT CGTGGACGGA AGCGGCAAGA CCCTGTTCGG CGCCGTGCCC
CAGGCGGTGG CCATCGAGTC GATCCCGGAT CCGCTCGTGA CCTCGGTCAC TACCCGGCCG
CTCTCTACCC AGGAGCTGCA GGATCGTGGC GTGACCTTCG ACAGCTCCAA CTTTACCGCA
TACGAGTTTA CCGCCGCCAT TGCCACAGAG AGTGGCCAGG TGCCGCTGAG ACTGCCGGTG
CTTATTCCCA ATGCATCAAC TCTATATGAA CCCGAGAAAC TGCCGCCTAT GCCTGGCATC
GGACTCGGGA TGCCGACGGA AATAACTGCT TCCCCCCAGA CCCCTGTCCC CGAAAATATC
AGCATCTCCG GGTTTATGCT TGAAGCGTCC AAAGTGGGTG AACCGGGAGC GCCGCCGATT
CAACTTCCCC CCATCCCCGG TATCGTCGTT ATCCCGGGGA ATATAGGCTT TCTCCATCAG
TATTTCAGCG CCATGGCCAT CGTCACCAAC GGTGCTCCCG GTCAGTCCAA CCTGGTGGTA
AAGGACATCC AGGTTAAAAT CATCTTCCCG TCCGGTGCGG ATTTAAACCC CGGTACCGAT
GACATTTCCG GCGATGACCC TTTACGCATG GCCAAAGGGA CTGGCGGTTT CTTTCCACGG
ATCATGCCGG CGATGCATGC CGGTCCAGAC GGCAAATACG GCACTGCCGA CGATATCAGC
CTTCTCCATC CGGCTGAATC GGGACAGGCG GATTTTACCA TCGAGGGGCT GAAGGAAGGG
ACCCATAAGC TCGATTTTGA AATAACTGCC ACCCTCGAAG GGCTCCCCAT CGGCCCGGTT
ACCCTCAAGG GCAAGGCGAC CGGCGCCGTG CTGGTGCGTA ACCCCGATTT TTCCATAACC
CTCGGCCATC CGGCAACGGT GCGGAGCGGC GAGGCTTACG ACCTGTTCGT CACGGTTACC
AATACCTCGC AGGCGGTCGC CAACCTGGTT TCCATCCACC TTGACTCCCG CGCCCTGTCG
GGTGCCGTCT TTGCCAACGG CGAGGACCCG GACAAGCAGA TCGAAACCAT ACTGTCGGGC
TCATCGGCGA CGATAAAGTA TCGCGTCATC TCCCAGCGCA CCGGCAAGGT CACGGCCACC
GCCTTTGCCT CGGAAGAGGT CAAGGGGCGC TTTATCCTCC GTATGGGAGT CGGGGAGCTG
GGGATTCCGC TCTCCCCCGA CAGCCTCATC CTTCCCTATA CCGACGGACT ACCTGCTGAT
CTGATCAATG CCGCAGTCGG ACTTCTCGGC CAGGCATGGA GTGTTGCCAC GGCGCCCACC
GGCGCGCTCC CAGCCGATGT CCTTCCAATT CCCAAGCAGA TCATCACCGC TCGCGCCAAC
GACCTTTCCG AGGCGGGGCT GCGCATCCTG CTGAACGATA CGACCGTGAA GGCGGTGGAA
GACCTGGCCT TTGACTTCAT CGGCTCGGAC AATGCCAATC GTCCCTTCGA CAGTTTGCGG
CGGCGTTCCA CCCAAGGGCT GAACCTGAAC AACGCCATTG CCGCTGTCTT CCAGAATGAA
ATCCAGGCGA CCGGCGCGTT TCCCTTCCAG GCCGGCTTGG CGAGACAATC CTCCTACCGT
CCCGGCCATA TTTCCGTTAT CACCACCAGT GCGCCGGTGA GGGTCCGTCT TTCCGACTCT
GTCGGCAACC GCAGCGGCGG GCTCTCTGAC GGTGAGATGT TCCGGGAGAT CCCCTACGGT
GATCAGTTGT CCCTCTCCCG GAACGATACG GGTCGCAGCA CCCTGACCCT TGTCACCAAG
CTGGATGCCG GTAGTTATCG GCTTGACCTG GAGGCTTACG CCGATGCCGG CTTCGATCTC
GGTATTGTCA TCCCCGCTGC CGACGGCGTC CTTAGCCAGG TAACGTTTAC CGGCATTACC
ATGCCGGCCG GCGCCAAGGC GCGGCTCGGC CTCACTCCCG GCGGCAGCGA TTTTTCCCTG
CAAATCGATA CCAATGGCGA CGGCGTCCCT GAGACTATTA TTGCCCCCGC TGCCACCCTT
GTCATTCCTG ACCAGGCCCC CGAGGTGGTG GCTGTCACCC AGATTGTCCC CGGTTTCGGC
CCCGGTGGCG ACAAGCATGG CCGCACCGTG GCGATCCTTT TTTCCGAAAA GGTCACCAAG
GTGAGCGCCC AAAATCTGGC CAATTACAGC GTCGATGAGA ACGCGGCACG CATTTCTTAC
CTCCAGCCTA GCGGCAGAAT GGCCTTTATC CTCCTTCGGG ACGGCATCGG TCCGTTTGTC
GAGCGGAAAA TCACTGTTTC CGGCATCACG GACCTGAAAC AGAACCAGCT TACTCCGGTC
ACCATGCCGA TCCGCACCAC CGCCAAGGGA CCTGCCGCTC TGGTAAACGG CTCCGTGCGG
AAAGCGAGCG GTGAGACGAT ACCCGGCGCT GTTGTCCGGC TTATGCAGCT GCAGTGGGTT
GAACAAGACT ATGAGCGCCT CCAGAAATAC TTCATCATCA GTGAAAAGCC GGCCGATGCC
AACGGTCGTT ACCAGTTCGA CTATGTTTTG CAAAATGACG ATCCGTCCGG GCCGTTCCAA
GTCGAGGCGG TTGATCCGCA GACGTCTGAT GGGGCCAATA TCATCACCTC CGTAGTTTTC
AATGAACAGA AGCTCAATCT TGATCTTTTC ATGAAGGCGC GCGGCAGCGT CTCCGGGGTG
GTGCGCGATG CCGCAGGCCA GGCGGTGGCA AGTGCCCAGG TGCTGATAAC TACCTTGAGC
GGTGACCGGT CGTATTTAGC CGTTTCCGAT GCTGTCGGCG CTTTTTCTTT CGCCAACGTG
CGGGTCGGCG CCTTCACTTT GAAGGCGGTC AGCCAATCCC TCTTTGCCGA GGGGAGCATC
ATGGGGACCC TCCCCGATAA CGGCGGCAGC ATTACACAGG ACATCACCAT CTATCGGCTT
TCGGATGCAA AACGGGGCAA TATTGCCGGC AAGGTGCTCG GTTCAGACGG TACCCCCCGG
GCGGGTGTCA TAGTTGTCGC CCAGATTTTT GAAGGTAGCG GAGTCAGATA CAGCAACTGG
ATGAGGAGCG GCTCAGACGG CAGCTATGCC TTCAACGGCA TTTTTGCCGG CAAACTGCGG
ATCGACGTGA AGGACGACGC CAGCGGTGAA AGAACCTCCG CCAGTGGTAC GGTGCTGGAA
GGGGGCACAT CTGTTTTCAA CATTATCCTG AAAGGGACCG GTACGGTTAC CGGGAAAGTC
GAACGGGAGG ACGGCCAGTC GGTGGCCGGT CTCCATGTAA TAGCAGACGT GGATGGTACA
ACCAGGATCG TGCAGACCGA CACGCAGGGT AATTTCACCA TCAGCGGGGT CCCGCTCGGT
ACGGTCTCCC TGAGGGTAAC CAATCCGCGG GACTTCAATC AGACACTTGT TTCACTCAGC
GTAAGCCTGC TGACCGCCGG CGATACTGCC AATGCCTACC TGTTTGTCCC GGCAAAATCC
TTTTTTACCG GGACCATCCA GGGGACCGTC TATCGCCTGG ACGGCAGCGT CTTTCCCCAT
GCCACGGTGC GGCTGGTGAA TTTGTTCAGC AACACATATT TTGCATACAA GGCGGACGGA
GAAGGGAAAT ACGTCATTCC CGCTTTGCCG ATGCAAACCC ATTATCTTAC AGTGGTCAAC
GGCCGGGAGA TTGCCAATGC CAGCACCACC CTCTGGTACG ACACCCAGAC CAGAACCGTC
GATCTCCATC CGGTCGGCAT GGGGAGCGTT ACCGGCACCA TTTACGATGA AGGCTCCGGC
ATGATCCCGG TGGGGGCGGA TGTGGCGCTC TCCTCCATGC GGCCCGATGG TCTCGGCTGG
CTCCGCTATG ATCGCTCGGC AAATGTGAAG AGCGATCCCC AGAGCGGCAG GTACACCTTT
ACCAATGTCT ATGTGGGTGA TTTTACGGTC AGTGCGTTCA ATGTTCTCCG CCCCACCCTT
GTTTCCAAAA GAGGCACTTT AACCGCCAAC AATGAGACAG CAACTGCGGA CCTGGTACTG
AAAGACACCT TCGGCTCCAT ATCCGGCCAG GTGCTCTTGC CCAATGGCAC GCCGGCGGGG
GCGGATATTA CGGTTGCGGT AACTTTCGGC GGCGCTGATG TGGTCGTCAC CACGAACGCG
GAAGGTAAAT TCCAGTTCAG GCCGGTTATT CCGGCGGGGA ATTACAGTGT CCTGGTCGAA
GATCTGGTTA CCACTCTCAA GGGGAAGGGG TATGTTTCCG TCCCTGCGGG ACGTGACGTT
GCTGTCACCA TACGTTTACT CGGGCGTGGT TCCATGACTG TGCGTGCGCT CAATGCAGAC
AATACCATTG CGCCCAATAC CGCTATCAGT TTAAAGGGGA GTGATTTTCC CAATGACCTT
GCCAACGGGG TCACCGGTCC GAATGGCACG ATAACTTTCG AGAATCTCAC AGAAGGGCGC
TATGCCATTT CTGCCATGGG AAGCTCAAAC CGCGGCGGGA GGGCCGAAGG AAACATCCCG
CTTGACCATG CCGCAGTCAA TGTCGACGTC AGGCTGGCCC CCTCCGGCAG CTTTAGCGGC
ACCTTTTTCA AGTCGGACGG GGTTACGCCG ATTCAGGGGG GGCAGATAAA ACTTCTCAAC
AGTGGAAAGC AAGTGGTCGC CTATGCGTTT ACCTCCACCG AGGCAGCCAC TGCCGGACAG
TTCCGTTTGG ATTTTGTGCC GCTCGGGGAC TTCACCCTGG AGGGTTTTGA TCCGATCAGC
GAACGGCGAG GGGTCGGCGG CGGTAAATTG ACCTCAGATG GAGAAACCGT GATTGCCAAC
GTGGCTGTCA CCCCCATCGG CGTGGTCAAG GGAAGGGTAT TGAACTATTC AGGGACCGCA
CCGGTCTCCA ATGCAAATGT GAGAATATGG GTAAACGGCG TCAGCAGCTA TTCCAATGAA
ACATCAACCA GTCCGGATGG GAGCTTCCTT TTTGCCGGGG TGCCGGCCGG TCGGTTCAAT
CTGGATGCAA CCGAACCGTT AACCCGTCTG CACGGCCAGG CCACCGGCGC CATCAGCTAT
GAAAGTGAAA TTGCCCAGAC CGAACTGCAC ATAGCTCCGA CCGGCTCCAT CGAAGGAAGT
GTGCTGATGC CGGACCGCAC CACTCCGGCA GGGAGCGCCA CCGTGACCTT GGAGGAGAGC
GGCGTTACCA CCCAGGTCGA TCCGGCAACC GGTGCTTTCC GCTTCCTGAA CCTTGCAGCC
GGCAAATCCT ATTCGATCCG TGCCAATGAA AACGGCGCCA ATCGGGTCGG AAAGACCATA
ACTACCATCA CCGGTGACGG TGAGATCGCT CGGGCGGATA TTACCTTGCG CGGCATAGGC
GTTGTCGAAG GGATTGTCTT CGATACGAAC ACCACTGCTC CTCTTGAGGG GGCGAGGGTG
ACCATTCAGA CCAACACGAC TTCCGCCGAC GCTTATACCG ATTCAACCGG CAGCTACCGC
TTTGCCGACG TTCCGGCTTC CTCATTTACT CTTCGCGCCA GCCATCCGCA ACGGCTTACC
GCTGCTTCAG CTTCCGGGAC CCTGGACAAC GAAGGCCAGA TAGTCGACAT CAATCTCACC
TTTGGTTCCG TCGGTTCCGT CACCGGCACG GTTGTCATGG CTGACGGCAT TACTCCAGCC
CGTGGCGGAG TGGTGAAATT CACCGGCGGC GGCAGAACAT TTATCGCGGT CATCGATACG
AACGGCCAGT TCGGTTTCAA TAATATTCCG CTCTGCTCTT TCAAGCTCTC CATAGAAGAT
GCATCCGGTC TTGCGATCGG TTACGCCTTG GGGAATATCG TTTCTAATGG AGAAGTCGTA
GCGGTTGGTA CCATCATCCT CGATGACAAG CCGATTACCG TCATTGCGGT GGATCCTGTC
AGCGGAGCTG TTGACATTCC GGTCAGCCAG GCCATTAAAA TCCTTTTTTC CGAGCCGGCC
AATCCGCTCA CTGTCAATTC TTCTACAGTT TACATCCAGC AAGGAACGAG CCGCATCACC
GGCTCACTGG TGCTTGATCA GGACAATAAA GGGGTCACCT TCACCCCGTC GGCGCCATTG
ACCGGTTTTA CTCTCTATAC CGTTGTCGTT ACCACCGGAA TCCAGGACCG GGTCGAGAGA
CCCTTGCCGC AGACCTTTAC CAGCACATTT TCCACTGTTG ACAATACCCC GCCGAGTGTG
AAATCGGTCT CCCCCTCCAA CGGGACGATC GAGGTGGCAA CCGACGGGGT GGTACGGGTC
ACCTTCAGTG AAACCATCGA TCCCGCCAAT GTCTCCGGGA TCAAGCTCCT GCAGGGGAAT
ATCCCGGTTG CCGCGCAGCT TGACCTGATC CAGGGGGGAA CCGTGGCGGT CCTTACACCG
CTCAATCCGC TTGTCGCAAA CGGGAATTAT ACAGTTTCCG TATCCAATGC CCGCGACATG
GTCGGAAATA TCCAGCAGGG GACCTTCCTC TCCTCATTCA ACACCATCGA CACCATCGCG
CCGACCATCA GCTCCCTCAC CGTCCCGGCC AACGCCGACC TGATCCGGGG CAACACGGTG
GCTGTCACTG CCGTTTCCCC GGGCACGGAC GTGGCATTCG TCGATTTCTA CGTGGACGGC
ATGTTGACGG CCACCGACAC CACGGCTCCT TACAGCATGA ACCTCTTGCT TTCAAAGGAA
GGGGCTGTGC AGGTGAAGGC CGTGGCCCAG GACCGGGTGG GGAACCGTTG GCTTCCGGTA
TCTCTAGATC TGACCGTTGC GGCGGACCAG CCGCCGACGG TGGCGTTTAC CGCTCCAGCT
GAAGGAAGCA GTGTCAATAC CGGCGCTGGC TTCAGTGTTA CCGTCCAGGG TAGTGACGAT
CTGACCGTGA AAGAGATTGC CCTGACAGTT GCCGGAGAGG TCATGGCTAC CCAGACAAAA
ACCAATCTCT CGGGCAAAAA CGTTTCTACT ACATTCAATT TTACGGCGCC CACCACCATC
ACCCAAGGGG GAAATATCGT TCTGACCGCC ATTGCCAAGG ACTCGGCGGG AAATTCGAGC
CAGGCTGCCC TGCGGACTTT AACTGTGCAT GACGGCATCG CCCCGGTGGC GGTTTCTCTC
GTAAGCACCG GCCAGACGGT CAAGTACCGG CCGGGCGATA CCGGCACGGC CACCTTCGAT
GCTACGGACA ATGTCGGCGT CACCCGTATC GCTTGCAGCG CCAGCGGTGC GGTCACGGAA
AACCGGGACT TTGCCCTGCC CTCGCCGCAG AGCAGTGTCA GCCAGGAGTT CAGCTTCACC
GTTCCGGTCA ACGCGGCCTC CAACGCGACG ACTACTATCT CCTGCACTGC CTTTGATGCA
GCCGGCTACC TGGCAACAAG ATCCATAACC CTTGCGGTGG CTGACGTGAT TCCGCCGCAG
GTTACGGGTG CGTCCGTTGC CAACAACGCT ACCGACGTGC CGGTAGGCTC GTCGATCTCG
GTCTCGTTCA GCGAGGCGCT TGCGACGTCT TCTGTTACCG CTTCCTCAGT TGTTCTGACC
GACACGGCGG GGCAGCCCGT GCCGGGAAGC GTCACCATTG CCGCCGACCG GAAGGGGATC
ACCTTCAAAC CGACAAGCGC ACTTGCCCGC GGCGCAGCCT ACACCCTGAC CGTTACTGCG
GCCATTACCG ACGCCGCCGG CAATTCCCTG GCCGCACCGT ACGATGTCTT CTTCACCACC
GACAATACCG CGCCGTCCCT CAAGACCATC AGCCCGGCAA GCGGTTCGCA GAACGTCCCG
GTCGGCTCTG CCATCGCCTT CACCTTCAGC GAGGCCATCG ATCCGGCTTC CGTGAAGAGC
GACAGTATCT CCCTCTCCTC TGCATTCGGC CCGGTGGCGG GCACTGTGGG GCTCTCTGCG
GACAATTCGA GCGCCATCTT CAAACCTCTG GGGCAGCTCA GCTTCAGCCG AGACTACACG
ATCACCTTCA AGGCGAGCAT TGCGGATATT TCCGGCAATC TCACTGCAGT AAATTACACG
GGGGCCTTCC TGACCCAGAG TCCCAGCTCG GACCTGGTCG GCCTCTGGAC CATGGATGGC
GACTGGAGCG ATTCCTCCGG CAACGGGAAT CATGGGACGG CCAGCGGTGG TGCCGCATTT
GCCTCCGACC ATGCCGGTGG CGCCATGGCG GGGAGCTTTG ACGGTGCGAA CGATTATATC
AAGGTGAATA ATTCTTCCAG CCTGAATCCG ACTGCGATTA CGGTGGAGGC ATGGGCGAAG
AGCGCAACGC CCATGTGGGG CTCAGGCTCA ATTGCGAGTA AATACGGGGC ATACACTTTG
CGTCCGGTGG AAGGGTCGAA GGAGCTTAGG TTCTACGCAG GATCGCAGTA TGCCGCAGTC
AATGATCCCG ATCTCGATCC GACCCAGTGG CACCACTATG CGGGAACTTA CGACGGTTTC
AGCATAAAGC TCTATGTTGA CGGCGTAGTG AGAAGTACGG TCGCCTATAA CGGCAATCTA
TACACAGCCG GCACAAATCC GTTGTATATC GGCTGGGACG ATTGGCAGAC ATGGAGATAC
TTCAAAGGGT TGATCGATGA AGTGGCCGTT TACAAACAGG CCCTTCCTGC CGAAGACATA
TTCGAACATT ACCATGCTGC ACTGACAAGC GACCGTCTGC CTCCTGCACC GCCGACAGTC
AATGCTGTAG AGTCACCCAC ATTCAACAAC AACATCATCC TTACTGGAAC CAAGGAAGCG
GACGCGTCCG TCAGGGTAAA CGGCAGGGAG GTGGTCGGCC ACGATGCATC AACCACCTGG
CAAACGATCT ACTCCCTGCA ACCAGGGCAG AACATCCTCG ACATCACCAG CCGGGACATG
GCGGGGAATG CCAGCGATCC GGTCACCATT TCGGTGGACC TCCTCCCGGC CAACCAGCGC
GATCCGGACA TCGTGGGACT CTGGCATCTG GACGGAAACT GGCTGGATTA TTCCGGGAAC
GGGAACCATG CTACTGGTAA TGGCGCCGTG TTCTCTGCCG AGTTGATCGA GGGAGCGGCA
GCGGCGATAT TTGACGGATC TAACGACTAC GTCTCCATTT CCGACAGCGC CAGCCTGGAT
GTAACCACTG CGATGACGCT GGAAGCATGG ATAAAACCCA ATAACGTCAG TACGTACCAG
CAGATCATTG ATAAGTTTGG AACCTATGGC GACTCTACGT ATCGAATCGG CTTGGTGCCA
TCAGGGCAAA TCAGCTTCGA TATCAGCGGA AACGGAGGTG CAATCGATTA CACAGTTTCG
ACAAACGCGC CAATTACCCA GGGCAAATGG CATCATGTGG CAGCGACGTT CGATGCCGGG
GCGGTGAAGC TTTACGTGAA CGGCGTTGAA GTCGCTTCGA AAGTTTCATC GATAACCGTG
CTGAAGGCCG GCACCTCTCC TCTCAATTTG GGCTTTGAGC CCACGACAGG TCGATATTTC
AACGGCCTTA TGGACGAGGT CGCCATCTAC AAGCGCGCCT TGTCTCCAGA GGAAATACGG
TCTCACTATA ACGCGTTACC AACGGTCACT CTTGCCAGTC CTGGCCAGAC AACGAAATAC
AGACCGGGTG ACGCAGGGAA CGCAACCGTT ACCGTCAACC ACGATCCGGG TGTGAGCAGC
TTGATCTGCA CCGCTTCCGG CGCGGCTTCG GGAATGATGA CCCTGCCGTT CGGCTCGCTA
CAGACGGCGG TATCGCAGGA TTTTGCCTTC AATGTGACAG CGAATGCCGC ATCCTATGCA
ACCGTCACCC TTACCTGCAG TGCTGTTGAT GCTGCCGGAC ATATCGGTTC ATCCAGCATC
AACCTGACAG TGTCGGATAT CGTTGCGCCG ACGGTGGCCG GTGCATCCAT CGCGGACCAC
GCCGTGAACG TGGCGGCAAC GGAAAGCTTT ACGGTATCGT TCTATGAAGT GCTTGCACAG
TCAACGGTGA ATGCCACTTC GGTTTCCTTG ATGACAGACA ATGGGACGAA CCTGATGGTT
GGGGGGACGG TAACGCTCTC TCCAGATCGG AAAAGCATTA GCTTTACTCC GGCAACGGCA
TTGGACGGGG ATACGCCGTA CCGGCTGACC GTATCCACAT CTGTGACCGA CATGGCAGGA
AATCCGCTTA CCGCGGATTA TGTGCTGCAC TTCACGACCC AGTCGGTAAC GGCGGTTTCA
GTGGCCGGTC AGGGGACAAG CACTGCTCCA TACGTGGTGG CTGCAGGGCG GTACAGCACG
ATCTCGATTA CAGGCAGTTA CGTCGTCTTC GACGGGCCGG TGGCTGCGGA TTCGCTGTCA
CTCACTGGTG GCAGCGTGTT GACGCATGAG CAGACGGGGC TCACAGGGGC AGAGCAATTG
GATATTGCTG CATCCAGTAT TACCATCGAT GCCGCGTCCA AAATCGATGT AACCGGCAAG
GGGTATCTTG GTGCCTGGCA AGGATTGAAT GGCGGCTTGC CGCGGACCCA TGGGAACATG
ACGAGCGGAA CGAGCAGCGA CTATTACAAC GGCGGCAGTT ACGGCGGTCT GGGCGGAATT
TATAGCGGCA GTGTCAACGG AGCATACGGC GATTTGACCA ATCTCAACGA GCCGGGAAGC
GGCGGAAGCG GCTATCCTAC CAACTCCAGC TATGCGGGCG GCAATGGTGG CGGACTTGTC
CGGATCAAGG CGGGTACCTT AAGCCTGAGC GGGAGTATCC TTGCTGACGG GGCTTCAGCA
ACCTATGGCA GCGGCAGCGG CGGCGGCATA CTGATTGACG TTTCAACGCT GACCGGCAGT
GGTGCCATCT ATGCCCGCGG TGGAAGCAGT TCTTATGGTG CTGGCGGTGG CGGGCGGATC
GCCGTCTATT ATGACACGAT CAGTCTCGCT GTTGCCAACA TCATCGCCTC CGGCGGACAG
TCCGGCAATG GAGGCAACAG TGCCCGTAAC GGCGGGGCCG GGACCATATA CCTGAAGAAT
AACGGCAAGG ATAAGGCAGA CCTGATTATC TGCAATAATG CTATTGTCAG CAGCGTAGCT
ACGCCGGTCC CCGGCGGCGA TTACGGAACT GTTGATGTAA AGGGTGGAAC ACTCATCGCC
ATGAACGGCA GCTTCACTAC CGAGAGCGAC ATAGTCCTGA CCAACACCCA GATGACCATC
AACGGTTCTA TTGCCATACC GAATTTGTTG ATGGACAACA GCACCCTCAT CATTAACGGT
TCGTTGAAGG CCGCCGGCAA CATCGTAATG CAGAATAAGA GCATGCTCAC CCATTCCGGG
GCAACGACCG CTGTTGTGCA GATGCTAGAC ATCACCGCAA CAAATGTCAC CATTGACTCC
ACGTCGAAGA TCGACGTGAG CGGCAAGGGG TATCTGGGGG GCTGGCAGGG GGGGAATAAC
ACCAATACCG GCCGGACACT AGGGAACACG ACCGACGGTG GGAGTGTATA TAACAACGGC
GGCAGTTTCG GTGGTCTGGG CGGGATATCT GCGTGGAGCG GCAATGTGAA CGGCAGTTAC
GGGAATCCGC AGTTGCCGGA CGAGTTGGGA AGTGGTGGCG GCGGAAATGG AAGCAGCAAC
GCCGGTGGCA ATGGTGGCGG TCTGGTGAAA ATAAGCGCCG GGACGATGAG CCTCTTAGGG
AGTATCCTGG TCGATGGCGG TACCACGTAT GTCAGCGGTG GCGGAAGTGG CGGTAGTATC
CAGATCAACG TGGGGACGCT GACAGGCAGC GGGACGATCA GCGCCCGAGG AGGTGCAGCC
ACAGCCAATA CGAACTACGG CGCCGGCGGC GGTGGTCGGA TCGCGATTTA TTATGGGGTT
AATACCTTCC CGACGGCCAA CATTACCGCC TCTGGCGGAA AAGGTGGTGA CGGCAGCAAC
CCCGCGCGGA ACGGCGGAGC AGGGACGATC TACCTGAAGG ATAATGTAAA ATCGCTTGGA
GACCTGATAG TTGATAACCG GGGGATACTG ACGTCCAACA CGACAACGGT GCCGGGCGGT
GACTATGGCC TGGTGGACGT CAGGGGTGGT GCCGCTATAT CTATGAACGG AGACCTGAAC
CCTGGTATGG ACATGGTGAT TGCCGGCAGC CAGTTGACGG TTTCGGGTGG GATTAAGGCG
CCGGGTAACC TAACGATTGA CAACAGTGTC GTTACAGTAG CTGGAGCGGT AAACGTTTCG
GGCGTGCTTG AACTTAAGAA CCAGAGCGTT CTTTCCCATT ATGTTGCCAC AACGACAAGT
CAATGGAAAC TGGAAGTAAC GGCCGGTTCA GTTACCGTTG ACGCCACCTC GAAGATCGAC
GTAAGCGGCA AGGGTTATCT TGGTGGCTGG CGGGGAGGGA ACAACACGAA TACCGGCCGG
ACATTGGGGA ACACGACCGA CGGTGGGAGT GTATATACCA ACGGCGGCAG TTACGGCGGT
CTGGGCGGCA TATCTGCATG GGGCGGCAAT GTGAACGGCA GTTACGGGAA TCCGCAGTTG
CCGAATGAAT TGGGAAGCGG TGGTGGCGGA AATGGAAGCA GCAGCAATGC TGGTGGCAAT
GGTGGAGGTC TGGTGAAGAT AAGCGCCGGG ACGATGAGCC TGTTGGGGAG TATCCTGGCC
GATGGCGGTA CCACGTATGT CAGCGGTGGC GGAAGTGGCG GTAGTATCCA GATCAACGTG
GGGACGCTGA CAGGCAGCGG GACGATCAGC GCCCGTGGCG GTGCAGCCAC AGCCAATACG
AACTATGGCG CCGGCGGTGG TGGTCGGATC GCGATTTATT ACGGGGTTAA TGCCTTCCCG
ACGGTCAACA TTACCGCCTC TGGCGGAAAA GGCGGTGACG GCAGCAACAC CGCGCGGAAC
GGCGGAGCAG GGACGATCTA CCTGAAGGAT GATGTGAAAT CGCTTGGAGA CCTGATCGTT
GATAACCGGG GGATAGATGC GCGTGACGAC TCTACTCTGA TGAAATTGAT CGGGCGTGGA
GCCATTTCAA TCATTACCTC CGACAGTCTG ACCATGTCCG GATCGAATTG GACCGCCGGT
GCGTTTGCAG GATTGAGAAT CAATCCGAAT GTGAACCAGA GTGTCTACTT CACGATTAAG
GACAACACTG CCGATACCAT CTATATTAAC TCTGCAGATG GTAACCTGAA CCAGATGGCG
GCCGTGGGGG ATACGTTCAG CGGAGTGTTT GCCCTGAATC AACTGCAGAT ACTGGGTAAG
GCGCGAGTTT ATACTACTGA GCAGTATAAT GTGGCTACCG ACGTTATTGT GGACAACTCT
GTTCTTACCG CAAGCGAGAT TTATGCGGAT CAACTAAGTG TTACGAACGC AGGTATGGTA
ACTCAACCGT CGACAACAAC TACTACTGCG TACCGGTTAA AGATCGACGC AGTTACCGAC
TTTACCGTTG ACGCCACTTC AAAGATCGAC GTAAGCGGGA AGGGATATCT GGGCGGCTGG
CAGGGGGAGA ACAACACGAA TACCGGGCGG ACATTGGGGA ACACGACCGA CGGTGGGAGT
GTATATACCA ACGGCGGCAG TTACGGCGGT CTGGGCGGCA TATCTGCGTG GGGCGGCAAT
GTGAACGGCA GTTACGGGAA TCCGCAGTTG CCGAACGAGT TGGGAAGCGG TGGTGGCGGA
AATGGAAGCA GCAGCAATGC TGGTGGCAAT GGTGGAGGTC TGGTGAAGAT AAGCGCCGGG
ACGATGAGCC TGTTGGGGAG TATCCTGGTC GATGGCGGTA CCACGTATGT CAGCGGTGGC
GGAAGTGGCG GTAGTATCCA GATCAACGTG GGGACGCTGA CAGGCAGCGG GACGATCAGC
GCCCGTGGCG GTGCAGCCAC AGCCAATACG AACTATGGCG CCGGCGGCGG CGGTCGGATC
GCGATTTATT ACGGAATGAG CAGTTTTGCT CCAGAGAATA TCAAAGTTTC GGGTGGCATA
AGCGGAAACG GCGGAACGGC TGCAAGAAAC GGAAGTATTG GAACGATTTA CACCTTGCAG
CGTTAG
 
Protein sequence
MKRLIGIVWA VLITLGMGGV ATAQVADPLA NVLSSFDFNI VGVGLKADPE YQAVPKGIAS 
KVNTLFDTGT FNIDEIAAQL PADYTVRAEL SGPSFQTPLP LVTKPGKPFD LPTLAIIGKY
TLNNIRLVDG SGKTLFGAVP QAVAIESIPD PLVTSVTTRP LSTQELQDRG VTFDSSNFTA
YEFTAAIATE SGQVPLRLPV LIPNASTLYE PEKLPPMPGI GLGMPTEITA SPQTPVPENI
SISGFMLEAS KVGEPGAPPI QLPPIPGIVV IPGNIGFLHQ YFSAMAIVTN GAPGQSNLVV
KDIQVKIIFP SGADLNPGTD DISGDDPLRM AKGTGGFFPR IMPAMHAGPD GKYGTADDIS
LLHPAESGQA DFTIEGLKEG THKLDFEITA TLEGLPIGPV TLKGKATGAV LVRNPDFSIT
LGHPATVRSG EAYDLFVTVT NTSQAVANLV SIHLDSRALS GAVFANGEDP DKQIETILSG
SSATIKYRVI SQRTGKVTAT AFASEEVKGR FILRMGVGEL GIPLSPDSLI LPYTDGLPAD
LINAAVGLLG QAWSVATAPT GALPADVLPI PKQIITARAN DLSEAGLRIL LNDTTVKAVE
DLAFDFIGSD NANRPFDSLR RRSTQGLNLN NAIAAVFQNE IQATGAFPFQ AGLARQSSYR
PGHISVITTS APVRVRLSDS VGNRSGGLSD GEMFREIPYG DQLSLSRNDT GRSTLTLVTK
LDAGSYRLDL EAYADAGFDL GIVIPAADGV LSQVTFTGIT MPAGAKARLG LTPGGSDFSL
QIDTNGDGVP ETIIAPAATL VIPDQAPEVV AVTQIVPGFG PGGDKHGRTV AILFSEKVTK
VSAQNLANYS VDENAARISY LQPSGRMAFI LLRDGIGPFV ERKITVSGIT DLKQNQLTPV
TMPIRTTAKG PAALVNGSVR KASGETIPGA VVRLMQLQWV EQDYERLQKY FIISEKPADA
NGRYQFDYVL QNDDPSGPFQ VEAVDPQTSD GANIITSVVF NEQKLNLDLF MKARGSVSGV
VRDAAGQAVA SAQVLITTLS GDRSYLAVSD AVGAFSFANV RVGAFTLKAV SQSLFAEGSI
MGTLPDNGGS ITQDITIYRL SDAKRGNIAG KVLGSDGTPR AGVIVVAQIF EGSGVRYSNW
MRSGSDGSYA FNGIFAGKLR IDVKDDASGE RTSASGTVLE GGTSVFNIIL KGTGTVTGKV
EREDGQSVAG LHVIADVDGT TRIVQTDTQG NFTISGVPLG TVSLRVTNPR DFNQTLVSLS
VSLLTAGDTA NAYLFVPAKS FFTGTIQGTV YRLDGSVFPH ATVRLVNLFS NTYFAYKADG
EGKYVIPALP MQTHYLTVVN GREIANASTT LWYDTQTRTV DLHPVGMGSV TGTIYDEGSG
MIPVGADVAL SSMRPDGLGW LRYDRSANVK SDPQSGRYTF TNVYVGDFTV SAFNVLRPTL
VSKRGTLTAN NETATADLVL KDTFGSISGQ VLLPNGTPAG ADITVAVTFG GADVVVTTNA
EGKFQFRPVI PAGNYSVLVE DLVTTLKGKG YVSVPAGRDV AVTIRLLGRG SMTVRALNAD
NTIAPNTAIS LKGSDFPNDL ANGVTGPNGT ITFENLTEGR YAISAMGSSN RGGRAEGNIP
LDHAAVNVDV RLAPSGSFSG TFFKSDGVTP IQGGQIKLLN SGKQVVAYAF TSTEAATAGQ
FRLDFVPLGD FTLEGFDPIS ERRGVGGGKL TSDGETVIAN VAVTPIGVVK GRVLNYSGTA
PVSNANVRIW VNGVSSYSNE TSTSPDGSFL FAGVPAGRFN LDATEPLTRL HGQATGAISY
ESEIAQTELH IAPTGSIEGS VLMPDRTTPA GSATVTLEES GVTTQVDPAT GAFRFLNLAA
GKSYSIRANE NGANRVGKTI TTITGDGEIA RADITLRGIG VVEGIVFDTN TTAPLEGARV
TIQTNTTSAD AYTDSTGSYR FADVPASSFT LRASHPQRLT AASASGTLDN EGQIVDINLT
FGSVGSVTGT VVMADGITPA RGGVVKFTGG GRTFIAVIDT NGQFGFNNIP LCSFKLSIED
ASGLAIGYAL GNIVSNGEVV AVGTIILDDK PITVIAVDPV SGAVDIPVSQ AIKILFSEPA
NPLTVNSSTV YIQQGTSRIT GSLVLDQDNK GVTFTPSAPL TGFTLYTVVV TTGIQDRVER
PLPQTFTSTF STVDNTPPSV KSVSPSNGTI EVATDGVVRV TFSETIDPAN VSGIKLLQGN
IPVAAQLDLI QGGTVAVLTP LNPLVANGNY TVSVSNARDM VGNIQQGTFL SSFNTIDTIA
PTISSLTVPA NADLIRGNTV AVTAVSPGTD VAFVDFYVDG MLTATDTTAP YSMNLLLSKE
GAVQVKAVAQ DRVGNRWLPV SLDLTVAADQ PPTVAFTAPA EGSSVNTGAG FSVTVQGSDD
LTVKEIALTV AGEVMATQTK TNLSGKNVST TFNFTAPTTI TQGGNIVLTA IAKDSAGNSS
QAALRTLTVH DGIAPVAVSL VSTGQTVKYR PGDTGTATFD ATDNVGVTRI ACSASGAVTE
NRDFALPSPQ SSVSQEFSFT VPVNAASNAT TTISCTAFDA AGYLATRSIT LAVADVIPPQ
VTGASVANNA TDVPVGSSIS VSFSEALATS SVTASSVVLT DTAGQPVPGS VTIAADRKGI
TFKPTSALAR GAAYTLTVTA AITDAAGNSL AAPYDVFFTT DNTAPSLKTI SPASGSQNVP
VGSAIAFTFS EAIDPASVKS DSISLSSAFG PVAGTVGLSA DNSSAIFKPL GQLSFSRDYT
ITFKASIADI SGNLTAVNYT GAFLTQSPSS DLVGLWTMDG DWSDSSGNGN HGTASGGAAF
ASDHAGGAMA GSFDGANDYI KVNNSSSLNP TAITVEAWAK SATPMWGSGS IASKYGAYTL
RPVEGSKELR FYAGSQYAAV NDPDLDPTQW HHYAGTYDGF SIKLYVDGVV RSTVAYNGNL
YTAGTNPLYI GWDDWQTWRY FKGLIDEVAV YKQALPAEDI FEHYHAALTS DRLPPAPPTV
NAVESPTFNN NIILTGTKEA DASVRVNGRE VVGHDASTTW QTIYSLQPGQ NILDITSRDM
AGNASDPVTI SVDLLPANQR DPDIVGLWHL DGNWLDYSGN GNHATGNGAV FSAELIEGAA
AAIFDGSNDY VSISDSASLD VTTAMTLEAW IKPNNVSTYQ QIIDKFGTYG DSTYRIGLVP
SGQISFDISG NGGAIDYTVS TNAPITQGKW HHVAATFDAG AVKLYVNGVE VASKVSSITV
LKAGTSPLNL GFEPTTGRYF NGLMDEVAIY KRALSPEEIR SHYNALPTVT LASPGQTTKY
RPGDAGNATV TVNHDPGVSS LICTASGAAS GMMTLPFGSL QTAVSQDFAF NVTANAASYA
TVTLTCSAVD AAGHIGSSSI NLTVSDIVAP TVAGASIADH AVNVAATESF TVSFYEVLAQ
STVNATSVSL MTDNGTNLMV GGTVTLSPDR KSISFTPATA LDGDTPYRLT VSTSVTDMAG
NPLTADYVLH FTTQSVTAVS VAGQGTSTAP YVVAAGRYST ISITGSYVVF DGPVAADSLS
LTGGSVLTHE QTGLTGAEQL DIAASSITID AASKIDVTGK GYLGAWQGLN GGLPRTHGNM
TSGTSSDYYN GGSYGGLGGI YSGSVNGAYG DLTNLNEPGS GGSGYPTNSS YAGGNGGGLV
RIKAGTLSLS GSILADGASA TYGSGSGGGI LIDVSTLTGS GAIYARGGSS SYGAGGGGRI
AVYYDTISLA VANIIASGGQ SGNGGNSARN GGAGTIYLKN NGKDKADLII CNNAIVSSVA
TPVPGGDYGT VDVKGGTLIA MNGSFTTESD IVLTNTQMTI NGSIAIPNLL MDNSTLIING
SLKAAGNIVM QNKSMLTHSG ATTAVVQMLD ITATNVTIDS TSKIDVSGKG YLGGWQGGNN
TNTGRTLGNT TDGGSVYNNG GSFGGLGGIS AWSGNVNGSY GNPQLPDELG SGGGGNGSSN
AGGNGGGLVK ISAGTMSLLG SILVDGGTTY VSGGGSGGSI QINVGTLTGS GTISARGGAA
TANTNYGAGG GGRIAIYYGV NTFPTANITA SGGKGGDGSN PARNGGAGTI YLKDNVKSLG
DLIVDNRGIL TSNTTTVPGG DYGLVDVRGG AAISMNGDLN PGMDMVIAGS QLTVSGGIKA
PGNLTIDNSV VTVAGAVNVS GVLELKNQSV LSHYVATTTS QWKLEVTAGS VTVDATSKID
VSGKGYLGGW RGGNNTNTGR TLGNTTDGGS VYTNGGSYGG LGGISAWGGN VNGSYGNPQL
PNELGSGGGG NGSSSNAGGN GGGLVKISAG TMSLLGSILA DGGTTYVSGG GSGGSIQINV
GTLTGSGTIS ARGGAATANT NYGAGGGGRI AIYYGVNAFP TVNITASGGK GGDGSNTARN
GGAGTIYLKD DVKSLGDLIV DNRGIDARDD STLMKLIGRG AISIITSDSL TMSGSNWTAG
AFAGLRINPN VNQSVYFTIK DNTADTIYIN SADGNLNQMA AVGDTFSGVF ALNQLQILGK
ARVYTTEQYN VATDVIVDNS VLTASEIYAD QLSVTNAGMV TQPSTTTTTA YRLKIDAVTD
FTVDATSKID VSGKGYLGGW QGENNTNTGR TLGNTTDGGS VYTNGGSYGG LGGISAWGGN
VNGSYGNPQL PNELGSGGGG NGSSSNAGGN GGGLVKISAG TMSLLGSILV DGGTTYVSGG
GSGGSIQINV GTLTGSGTIS ARGGAATANT NYGAGGGGRI AIYYGMSSFA PENIKVSGGI
SGNGGTAARN GSIGTIYTLQ R
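
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is a hedged illustration rather than the database's own procedure: it builds the codon-to-amino-acid mapping used by translation table 11, whose amino-acid assignments match the standard code (the tables differ only in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Assumption: table 11 shares these assignments with the standard code;
# alternative start codons are NOT handled by this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence, copied with the record's spacing.
print(translate("ATGAAACGAC TGATCGGAAT T"))  # MKRLIGI
# Last five codons of the gene sequence, ending in the TAG stop codon.
print(translate("ACCTTGCAG CGTTAG"))         # TLQR
```

Both results agree with the start (MKRLIGI...) and end (...TLQR) of the protein sequence listed above.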