Gene Gura_3022 details


Gene Information

Locus tag: Gura_3022
Symbol:
ID: 5165691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 3505465
End bp: 3516225
Gene Length: 10761 bp
Protein Length: 3586 aa
Translation table: 11
GC content: 55%
IMG OID: 640550517
Product: hypothetical protein
Protein accession: YP_001231767
Protein GI: 148265061
COG category: [S] Function unknown
COG ID: [COG5276] Uncharacterized conserved protein
TIGRFAM ID:
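
The coordinate and length fields in the Gene Information table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information table of this record.
START_BP = 3505465
END_BP = 3516225
GENE_LENGTH_BP = 10761
PROTEIN_LENGTH_AA = 3586


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 3516225 - 3505465 + 1 gives 10761 bp, and 10761 / 3 - 1 gives 3586 aa, matching the listed values.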


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGGTA GTCCCTTTCA AGTTCTGAAG AATTCATTGC GCAATGCGCG GCCAGCGGCT 
GCTTTCGGTG TGGCTCTGCT GATGCTGCTC GTATCCGTTT CCTACTCTTT TGCCGGAGCG
GACATGACGG TCTTCGGTCC GAAGCGCTAT GACCGGCTCA AGGGGAAGCC CACCGTTTAT
ACGGACACTT TCGAGCGCTG CAATCCCTCC GACCTTGCGC TCGTCCGGGT GCAAAACGGC
GATAGTAAAA ATACCCGGAT CAAATCGGCA AGGATTTACA TAAACGGCAG CAAGATTGCC
GGAGAAAGCG AGTTCAAACA CAAGATTGCG ACCTTTGAAA AACTCGTAGC CGTACAGCAG
CTAAATGAGC TGAAGGTGAT ACTCAAAAGC GGTCACCACG GCTATTTTGA TCAGCTGGCA
AAGTATCAGG CCAGGCAGTC GGATCTGGAA AAGGATCTGG CCAAACTGCA ATTGCTGGAA
AATGAAATTG CCGGCCTGAA ACCACCCTAC GACATCAAAA CGCTTGAAAG AATCCTCCTG
GAATTCCAGC GCATTAAAAA AGGTTACGGC GATCGGGATG AGCACCTGGC CGGGGTACGT
AAGAGCCTTG ATGATGAAGA CGACGATGCA GATGACAAAG AAGATGGTTG GGATAGGGAG
AAAAACGGGC ACGAAATCGG AAAGGATTGG GTCGAACATG ACCGGACGGG CATAGAAAAA
TCGAAAAGGG ATACAGAGGA TGCCATCCGA TCCTGTAAGG CCGCCCTCGA CACATTTGAT
CGGTCAGCGA GCAAGAGAGA AAAAGATCAC GAAAAGAAGA GGGAAAAACT CTCGGAACTG
CTTAAGGAGC TTGAAGGGAT AAATCATGCC ATCGATGAAA ACGTCAGGCA TCTGGACGAT
CTGGCGCGGA AAATCGACGA GATCCGGAAA AGGGGACCCT CCTTTCTCAT CATAGAGATC
ATCGGCAAGG GGTGCGACAG CACACCGCCG GTCATCTCCG CCATTTCTCC GGCAGACGGG
GCTCTCCTCA ACAACCCGCT GCCGGTCATT TCCGCCTCCT ATTCGGACGA GACCAACGGC
TCCGGCATCA ACACGGCCAG TGTGCGGCTT TTGGTGGATG GAAACGATGT CTCCGCTGCT
GCGTCCATAA CTGCCGCCGC CATCTCCTAT GCCCCCGGCT CAAAGTTGCC CGATGGCAGC
CATGCTGTCA GCCTCAGCGT AGCCGACCGC GCCGCCAACG TCGCCAGCCG TATCTGGAGC
TTCGTTACCG ATACCGTGCC CCCGGTTGTC ACAATCAGCG CACCGGCATC GGGACTCCTG
ACCCGAAACA GCAAGGTGAC CGTGACCGGA ACGGTCAGTG AAGATGTAAC GGCGGTTACC
GTCAACAATA TTCCCGCCCA GGTCAATGGA AAAGATTGGA CCCTGGTTGA TTACACCCTG
AGCGAAGGTG CAAACAGCAT CACCGTTCAG GCGACCGACC GGACCGGAAA CCCGGGCAGC
GCCACTGTCA ATGTTACCCT CGATTCCACA CCACCGGCCG CGCCGCTGCT CAACCCACTC
ACCACACCAA CCAACCTGGC TGCGGTTACG ATCAGCGGCA CTGCCGAAGC GGATGCCAGT
GTAAAGGTGG TCGCTGGCGG GGTGATTTTC GGCACAGTGC CTGCCGATGC TGAAGGGAAG
TTCAGTCTGG GGGGCGCAAC ACTTGTTGAA GGGATTAACA CATACACTGC TGCAGCCATC
GACGCGGCAG GTAACGAAAG TTCCGCATCG TCACCGGTCA GCGCTGTTCT CGACATTACG
CCGCCGGTCA TAATAATAAC CGCGCCGACA GCAAATGCAT TCCTCAACAC GCCGCAGATC
ACCGTCACCG GTATGGTCAA CGAACCGGTC ACTTCGGTTA CGGTGAATAA TACGTCAGCC
GTTGTCGATG GGCTTTCGTT CAGTCTCACA ATAACTCTTG CAGAGGGAGC GAACACGCTT
GTCGTCAAGG CAAAGGATCT GGCCGGAAAT GAGGGAACCA CCTCTGTTCC CGTTTACCTG
GACACCACCC CCCCGCAGGT GACCATCGTC ACACCGGCAG CCGGACTTCT CACCAGGGAT
AAGCAGGTAA TCGTTTCCGG AACAGTCAGT GAAGAATTCA CCACGGTTAC CGTTAACACC
ATTACTGCCA CCGTCTCGGG CAAAACCTGG TCAGCCGCAT ATACATTAAG CGAGGAAGCA
AATTTCATTA CTGTAAAAGC AACCGACCGG GCCGGCAACA GCGGTTCGGC AACGGTCAAT
GTCACGCTAG ACTCCACGCC GCCGCAAGCT CCGGTACTCA ATTCGCTCAC CAGTCCGACC
AATGTGGCCG GGGTCACCCT TGGCGGCACT GCCGAAGCAG GATCAAGCGT GAAAATATCT
GCCGCAGGTT TGCTCATCGG CACCGTCACC GCAGACGCCC AGGGTAGCTT CACCCTGGCA
GGCGTTACCC TGGCGGAGGG GACAAACTCC TACACTGCCA TCGCCACCGA CGCAGCAGGC
AACGAGAGTA TTCCCTCAGT TCCCCTCGGC ATCGTTCTCG ATACCAGGGC GCCGGTCATA
ACCGTCACCG TTCCGGCAGA AAACGCCTTC TTCAGTGCGC CGCAGGTTAC GGTCAAAGGT
AACATCGACG AACCGGTTGC CGGCCTTACC ATCAACGGTC AGGCTGCAAC CTTGAGCAGT
CTCGATTTCG ATCAGCCGTT GACCCTGACA CCGGGGCTCA ATACCATCAC CCTTGTTGCC
ACGGACCTTG CAGGAAATCA GGCCACGAAA ACCGTTCTCG TCACCCTCGA CAATACACCC
CCCGTGGTTA CCATCACTGC GCCCCTTTCC GGCACCATTA CCAAAACAGC GCAGGTAACC
GTTTCCGGCA TCATCAGCAA GCCCCATACT ACCGCCACTA TCAACGGCAC TGCCATCACC
GTCACCAATC AGACCTTCAG TGTTGTCTAT ACACTTGCCG AGGGCGACAA CATCCTCAAC
ATAGAGGCTA CTGACCGGGC CGGCAACAAG GGGAGCGCCA GTGTCGGCGT CTCGCTGGAC
AGCCAGTCGC CCGTCCTGTC TCTGCAGAGC CCTGCCGAGG CTGCGGCCGG TGCCAATGTG
GCCATAGTGG TGAATGTTTC AGACAACAGA CAGCTCACCC TTGTGGAGTT GAAGGCCGAC
GGCGTACCCA TCTGGTCGGG AGGGAATATT CCTTCCATCG TCGAATCCGT TTCCTACAAG
CTTTCACCAT CTTTGAACAC TGGAAGCGAG GTGGTATTCC AGGCCAGAGG CATCGATGTC
GCCGGCAACG AAGGGAGCGC AACCACACGC GTCAAAATAA GCCAGGCGGC CATGGGGCCG
GGATATGTGC AGGGAAAAGT ACTTGAGGAT GAAAGAGGGC TCCTCCTTGC CGACGCCCAG
GTGACGGTAA CCGATGAGAA TGGTGAGGTG AAGACCCTGA CAACGGCAGC CGATGGCGGT
TACTTCACCG AAAACCAATC GGGAAACGCA GTCGTCGGCG TCATTAAGCC CGGTTTCACC
AGTGTTGAAC GAATCGTCCC GGTCATGCCG GAAAAGAAGG CCACCGCCCT CGACGCCAGG
CTGACGAAAA TCAATGGGAC AAAGAATGTT ATTGATGCCA TTGGCGGAAC CATACGGGTA
GATGTAGGGG CGGGGTTACC CCGCCCTGCA ATTGAATTAA CAATACCGAA CGGCGCTCTC
TCCAATCAGG CGGACATCCG TCTGACACCG GTCAGCAACC AGGGCCTGGC GGGCATGTTG
CCCCCCGGCT GGTCCCCCCT GGCCGCCGTC GATGTCCGTC TCCTTGATCC AACGGCGGGG
ACGGCTCTCT ATACGGGCTT TTCCTCCGCT GCTGCGCTGA AGATCCCGGT GCAGGTTCCT
CTCGCCCAGG CCTCGGGTAC CGTGTTGACG CTGGCCGCCT ACGACAGCAC CAGCCACCAG
TGGCGATCAA AAGGAAATGC AACCATTGCC GCCGACGGCC TGAGCGCAAG CGCAGACATA
ACCGCCGCCG GGCAGTATGC GCTCCTCATT GCCGACCCGG CTCCCAACGC CCCTGCCGCG
GCGGAAGCAG GCAACCCCCT TGCCCCGGCA TCCACGGCAT CTCTCGATTA TGCCGCAATC
AACACCGCCG GAAAGGTCGT TCCCCAGGCA TCGCCACCGT CCACAGGGCT CAAGGCAGCC
GGTGAGGTGC TGCTGACGGC TAAAGACGGG GTTACCCCAG CGCCGCAATT CATTTCCGGT
TTCATTATCA ACAACAGGGT AACGGAAAAA TTCGATCTCA AATCTGGAGA CAAGGTGGAG
CCTTCCGCCT ACACCCAGGA TATCGTTCTC TATCGCCAGC CATGCATAAC CAACATTGCC
GCCGGCGCAT TAACCCAGCC TGCAGTGAGC GGAGTCGGTG GGGGAGTGAG CGGAGTCGAA
CTCCGCACCA CCTTCCCCGT TTCGCCGTCC AGGGATTTCA CACTGGTCGA TCTTTTACTG
GGAAAAGTGG GCATAGAGAT AGTAAAACCC GATGCAACCG ACAGCGGCAT CATGGTGGGC
GCCGATGGCG GCCGGCTGGT GGATGCCGAC GGCAACATTC TCGTTATCCC GCAGGGGGCG
CTCGCCCGGA CAACACCGGT GGCGACGAAA AACGGGGCAG CCGCAACCGG AGCCGTGGGG
AACGATTTCA CTCTGCTCAA GGTGGTGGAG GTGAACCTGA CGCGGCAGAC CCTGGCGCAA
TCGGCGACCA TTTCCATCCC GGCGCCAGAC GGATTCAATC CGTCTCTACC GCTGCTCGTG
GCGAAGCAGA TCGATGTCAA GGGGGTTCAG AAGCTGAAGC TCGTGGCCCT GGCCAGACAG
AACGGCTCCT TCATCACCAC AGAGCCGTTA ACTTCCCAAC TCCAGAACTC ATTGAACTCA
TCAAACTCAA TAAACTCCTC CGGAGTTTAT TACTTCCTCC AGGCCAAGGG GCAGATCGGC
TTTGCCACCG GCACGGTCAC CGATTCCAGC AACGCCCCCT TCAACGGTGC TTTGGTGAAG
ACGGACAACG GTTCGCTCAT CGATCTTTCC GCAGCCACCG GGAAATATCT TGTGGCAGCG
CCTGTTGCCG CAGTCACCGC GACTGCAACG GATCTATACA AGAACGACGA AGGGAGCGCT
ACCGGCGCAA TAGCGGCCAA CCAGGCAATA ATCATTGACC TCAAAATCCT GATGATCTTG
CCGACGGTTG TTTCAGTGTC GCCGACGGGC ATCAATATTC AGCCAAACGT GCCGGTGGTG
ATTACCTTCT CCAAAAGCAT GGATCAAAGC TCCATAAACT CCCAAACTCT CAAACTCCTC
GACTCAGCCG GAACGGCTGT CCCCGGTGTA TTCACCTTCA GCGTTGACGG CAAGGTCGTC
ACCTTCACCC CGGCAGATCT GCTCAAGTCG CAGCAGAGTT ACACCATCAC CATAGCCGGG
ACGATCAAGG ATCTCCAGGG ATATCCCCTT GGCCAGGATG TCACCTCCGG CTTTACCATC
CGCAACACCA CTCCGCCTCC CATGCCGCCG GCAGGGAGCA TCACCGCGAG CTTTCCCGAT
GCTGATGGGT TCATCAGCAT CACCGCCACC CAAGGGAGCT CCCCTGCCGA CTGTACGGTG
CTCATCATCA ACGACACCAG CGGCGAGATA GTTTCCGTGG TTCCGGCCGG CAACGGCTCC
TTTACCGGCA AGGTCAGAGG CCAACTGGGC GACGAGATAA AGATCGTCAT CATGGATAAC
TCTGGGAATC AAACGCTCGT CTCGTACCTC ACCTTCAAAA GCGATGACGG CAACTACCTG
GTCACCGCCA AGGGTGGCAA GGTCGAAGGA GAGGGGGGGA GCATCCTCGA TATCCCCGAA
GGCGCTTTGT CGGGGCCGAC GATTGTCAAA CTGACCTTTG TGCCGGAAGC CAGCCTGGCA
AGTCCGGTAC CGGCACCGGG CACGTATCTC TCCGCATTCA ATATCGACAC CGGCGGGATC
GACTTCCAGA AGGAGGTCCA TCTCTCCATG CCGCTGCCGG ACGGCTTCGA TCCCACAACC
CCTGTCTTTG TCAACCGCCC GAGCGAGATC TATAACGCCG ACGGCACCAT AGAAAAGGTC
TACGAAATCA TCGACAGCAC CAAGGTAATC AACAACCGTA TCACCACCGC CAGCCCCCCA
TTCACCGGCA TCGTCCAGAT CGGCTCCTTT GCCTTCATCT CCTTCGGCGG AATGAAAAAA
GGCATCGTCT CCGGCTATAC CTACCAGAAG ATGAACGACC AGACCGGCTA TCAGCCGCCA
CCGTATGGAG TTATCGAGAT ACCAACCTTC GATGCCTCCG GCAACCCGGT CTACAAATAC
GACCGTCCCA TCAAGGGGGC CGTCATCCGG ACCCCGGATG CCTGGGATTA TGTCAGCTAC
TCAAACAGCA GCGGTTTTTA TGCCGGGTTC GCTACCCTGT ATGCCTACGT CGTGGCAATG
CAGCTGGATT ACCGGATAAC CGCCATCCAC CCCCTGACAA TGCGCCGCAC AAGCCACACC
CTCTTCATGG ATGTCGATCA CGACCTGATC ACCAACCTGA ACTTCATGCT GGCGGACAAA
AACTCCATCC CGCCGGACAA AACCGCGCCG GTCATCGACC TGACGATGCA GGTGGCACCC
GGACAAGCCC CGACCAACCG GATCAGCTCC GGCACCATGC CGCTGGGGAC GGAAGTGCAG
CTGCCGCTGT TCATCAGCGA CCAATCGAGT GTCAACCCCA CTCTGACGGT GGAATACAAG
AGCCCGGAGG CGACCACCTG GCAGTCATAC TCCGCACCCC TTATCCGGCT GGGCGCGGTA
CTTTCATCCC CACCCACCGC CGACAAGCCG GCCATCTGGC GCTATACCTT CAAAGCGGTC
TTTCCCGCCG AACTGCAGGG GAGCCAGGAC ACGAACTGCA AGCCGGGCCT GGCCGGCAAC
TACCGCTTCA CGGTCGAAGC CACCGACACC AGCACCAACA AGAGCACCAA GACCCTCCAA
CTGCGGGTCG TGGCGGTCGG CGCCATACCT GGCGGCGTTG ACGGCGCCCC CACCGTGGAC
AGCATCGCCC CAGCCGATGG CGCCAAGGAC CTCCCAGTCA CCACCCAGAT CACCGCCTGG
TTCAGCGAAC CGGTGGAAAA CGTCACCCCG GCCACCTTCA AACTGATCGA TACCAGCAAC
GGACGGGCGA TGCCGGCCAT CATCACCACC TCATATGAAG GAGGCCGGAT GCAGGCCGTG
CTCACACCAC GGGGCAACCT CTCCTACGAC CGGCAGTACC AGGTACAGGT GCTGGTACTG
CCCGCCGGAG GCATCGTAGA CATCAACCCC AACCCCTCGG CCGACAATGC CTTCTTGCCC
CTGGCCCAGG GATACCAGAG CACTTTCACC ACCAAACGCC CCGCCGCCTA TGACCTGGCC
GAAGCCGAAC AGTTCAAGGG GGGGCGCGAC ATCGCCCTGT ACAACCACTA TGACGGCAAC
AGCTACGCCT ACATCGCCGC CGACGACCAG GGGTGGCATG TCGCCGACGT CAGCGATCCG
ACCAACCCCT CCGTGGTCTA CAGTAAAAGC ATGAGCGCCC CCGCCGTCAG CTGGAGCTAC
CGCGGAGTCG CCGTCCATCC CGCACCAACG GCCGGGATAA TGGCCATGAC CGAGAACATC
GTCTTCGGCG ACGGCAACCA GTACGGTTAT ATCCGCTTCT ACGACATCAA AACCGACCCG
GTAAACCCGG CCATCATCGG TAAACAGCGG CTGGCCGAAG CCTATTCCGG GATACCCGGC
CGATTGGCCC TTGCCGGCGA ATATGCCTAC ATTGCCACGG CTGCGGCGGC GCTGCAGGTG
GTCAGCATCA GTAAGGCCAA GGACAGCTAT GACGGCTTCT CGGCAGACAA CCCAAGTATT
GTGGGTGTCT TCGACAGTAT CGGCCAGGGA TACGGTTCGC CCAACGACAT TGCGGTCTAC
GGAACGGGTA AGGCTCTCCT CACCACCACC GCTGGTTATC TGATAACCCT GGATATCAAC
AACCCCACGC CGGTCCAGAT GGGAGTGATC GAGCCCAAAA AGCTGAACGC CTTGCGTGTG
GCGGGTGTGT CCGACTACAG CTATGCCGAT GCCGACGGCA ACCCGCAGAG CATGGACCTG
GCCCTCACCG GCGGTGGCGG CCGACTCAAG ACCGTTGACC TTACCGACCC CTACAACCCC
CAACCGCTGG CAACGGTCAA AGATGCCCTG GATAAAGAGG TTGTCTCCTA TCCCTACGAT
ATAACCTTGA ACAAAGAGAC GGGGCTGGCA ATCGTCAGTA CCTTGAGCGC CATTCAGGTA
GTGGATGTGA AAGACCCAAA AAATCCCCGG CTGATCAACA CCATCACCCA ACTGCCCAAC
TCATCCGGCG CTACAACACC GGAAGGATCC CCGGCGATGA TTCCCATAGG CACCATCCCG
GCCATGGCGG AAAAAGACGG CTGGCTGTAC ATGGCGGACC AGACCAAAGG TATGCGCACA
ATGGGTATTA ACACCAACGA CATAAGAGTA TACAATGAGA GCAATATTCA GGTTCCCGAG
ATAGGATACA GTAAGGGCGG AGAGGATGAC CGTCGCTACT ACGTCGAGGT AAAGTACGAG
GATCCGGATA TCAACTGTAA TGATGACGAC CTGGTAGGTA GCATGGTAAT CAAAACGCGC
AAGGGTGCTG TTGTCAACCC TCTTCCCGGT ATAGACCACC CGACGCAGTA TCTGCTGTCT
TTCCGCAAAG GAAGTGACGA GCACTGCTAT GCCGACTTGA AAAAAACAAT AACGTCTCCG
ACAAACAAAC GCTTTATAGC CACCAACTTC CCGGCTGCCG ATTTGAAATT GACCACACCA
GACGGGATGA CGATAGCGCC AGTCTTCGGC AGTATCGGTA CAAAGGCGAT ATTCGAGTTC
AGCAGTCGCA AAGAGGGGAG ACTCATGAGG CGCAAGAGTA TACCGCTTGA GAAGCTGATC
AGGATTGCCT TTGATGGAAA TAGGGATGGG GTGATTGATT TTAAAAACCC CGAGGACAGG
AAATACACCT TCTGGGTGAA TGATGATAAT GATGTCAATG GTTATGAAAC CGAATATACT
GACATCAACC AAGGCTCAAA AATTACTTAT CCAGTAGAAG ATGATAATAT TAGCGGAACA
GTTAAGGATT GTGTTCGTGA AAACGGTGAC AAGATAAAAA CGCTGAGAGA TCTTGAAGAT
TTTGCCCGTG TCCAAATGCA ACTTCATCCT CACGCAAGGA AGGTAATGGA GATATTATCA
AATCAGACGA CACAACCTAA GAGCTTCGGT TATTATCTCA AGTTCAATAA TACCAACGGT
ACATCCCCTG AGATAAATAT TTTCCAGGCT GTTGGTGAAA CTGAAAAATA TTTGTGGAAT
GAAAATTCTG CACGAAAGCA GTTGGATAAA CCAAAGCTTC TTACTATCTC ATCTACTGAA
TCAGAACTCG CCCCCTCCTA TATATCGACG GATAAAACGT CGCCGTATAT TTTTGAAGGG
CGAAACTTTG GCAAAGGAGA ACTCACCTTT ATCCTCAAGG CGGATGGCGA TACTTTAGAA
GAAAATTCAG CGCTTCTGGA GTTAAAGCCA ATTCAGACAT TCTATGACAA ATTCAGGGTC
ACATATGACG AGGGAAGCAA AACAGTCGGA TCGCAGGTCA GCGAGATCAA AAAAAGCGAA
TATCCAGCGA AAGACAAAGA TTACCTCCTG TTCGTTCACG GCTGGAACAT GCAAGGTGAA
ATCGAGAAAG ATCGGTGGGC CGAAACAATC TATAAACGGC TTTGGTGGCA GGGGTATACA
GGGCGTGTAG GATCGTTCAG TTGGCCTACC CTTGAAAAAC CTTATGACAT TCCAAGAGGT
ACTACCTTTG ACAGAAGCGA ATTGAGGGCA TGGAATTCTT CTGAAGCACT AAAAGATGTA
CTTGAAATAA AGCTCAGGGA TTATGCTGGA AACATCAGGT TGCTTGGGCA TAGCATGGGA
GGAATAGTTT CAGGTGAGGC CATTCAGAAG CTGTCGGGTC CCGGCATCGT TAAAACGTAC
ATTCCCACAC AGGCAGCGCT TTCGTCACAT TTCTACGACA ATAGTGTGAC GGAGAGTAGC
CGGCGTCTGC CTCTGTTGCC GGTAACTCCG AATGTATTTG GGTACTACTA CTCAGGCAAT
AGTGGCTCAC AAGATGCCAC CTATTTGGCA GATTCAACCG GCAAGGCAAA GTTCATCAAT
TATTTTAACG AAGTGGACTA TGCTTTGACA GCCGGAGAAC GTGGTTTCAT GGCGTGGGAA
TTCAATACCA AGTTCAGACC CGATCCTGGT TACCTGTATT GCAACGGACT AGCAGTAGCT
TGTTTTACGT TGCTTGATCC GCTTCCGACA ATTCCAGGCA TAATAAGCTC GTTGCTGAAT
TCAATCGATG GCGGAAGCAA CTATTATGTG AGATTGGGGA TTTCGAACGA CGGATTAAGC
ATGCCAAGTC TGCTTAAATT TCCCGCTGAC CGTTATGAGA TATTCAGCAG AATAATTCAG
TCTCGGGTCA AGGCATTGGG GGCGACGAGC AAAACGCCAC TTAAGGGATT TGAAAAGAGC
CGAAACCTTA AAAAGTTTGG TTATGACGAC AAACATTATT CTCACAGTAA GGAGTTTCGT
TCAAATATTG TTGACGAGCA GGGATACTGG AAAGCAATCT TTAGTGATTT TGAACTAAAA
AGCTCAATAG AAAAGGAGTA G
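
The gene sequence above is printed in space-separated blocks of 10 bases, 60 bases per line. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is shown here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input, so the printed GC value is for that line alone and not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted listing into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs and newlines.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing in this record.
first_line = "ATGGCTGGTA GTCCCTTTCA AGTTCTGAAG AATTCATTGC GCAATGCGCG GCCAGCGGCT"
seq = clean_sequence(first_line)

print(len(seq))    # 60
print(seq[:3])     # ATG
print(round(gc_fraction(seq), 2))
```

Pasting the full listing into `clean_sequence` in place of `first_line` would allow the same check against the Gene Length and GC content fields.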
 
Protein sequence
MAGSPFQVLK NSLRNARPAA AFGVALLMLL VSVSYSFAGA DMTVFGPKRY DRLKGKPTVY 
TDTFERCNPS DLALVRVQNG DSKNTRIKSA RIYINGSKIA GESEFKHKIA TFEKLVAVQQ
LNELKVILKS GHHGYFDQLA KYQARQSDLE KDLAKLQLLE NEIAGLKPPY DIKTLERILL
EFQRIKKGYG DRDEHLAGVR KSLDDEDDDA DDKEDGWDRE KNGHEIGKDW VEHDRTGIEK
SKRDTEDAIR SCKAALDTFD RSASKREKDH EKKREKLSEL LKELEGINHA IDENVRHLDD
LARKIDEIRK RGPSFLIIEI IGKGCDSTPP VISAISPADG ALLNNPLPVI SASYSDETNG
SGINTASVRL LVDGNDVSAA ASITAAAISY APGSKLPDGS HAVSLSVADR AANVASRIWS
FVTDTVPPVV TISAPASGLL TRNSKVTVTG TVSEDVTAVT VNNIPAQVNG KDWTLVDYTL
SEGANSITVQ ATDRTGNPGS ATVNVTLDST PPAAPLLNPL TTPTNLAAVT ISGTAEADAS
VKVVAGGVIF GTVPADAEGK FSLGGATLVE GINTYTAAAI DAAGNESSAS SPVSAVLDIT
PPVIIITAPT ANAFLNTPQI TVTGMVNEPV TSVTVNNTSA VVDGLSFSLT ITLAEGANTL
VVKAKDLAGN EGTTSVPVYL DTTPPQVTIV TPAAGLLTRD KQVIVSGTVS EEFTTVTVNT
ITATVSGKTW SAAYTLSEEA NFITVKATDR AGNSGSATVN VTLDSTPPQA PVLNSLTSPT
NVAGVTLGGT AEAGSSVKIS AAGLLIGTVT ADAQGSFTLA GVTLAEGTNS YTAIATDAAG
NESIPSVPLG IVLDTRAPVI TVTVPAENAF FSAPQVTVKG NIDEPVAGLT INGQAATLSS
LDFDQPLTLT PGLNTITLVA TDLAGNQATK TVLVTLDNTP PVVTITAPLS GTITKTAQVT
VSGIISKPHT TATINGTAIT VTNQTFSVVY TLAEGDNILN IEATDRAGNK GSASVGVSLD
SQSPVLSLQS PAEAAAGANV AIVVNVSDNR QLTLVELKAD GVPIWSGGNI PSIVESVSYK
LSPSLNTGSE VVFQARGIDV AGNEGSATTR VKISQAAMGP GYVQGKVLED ERGLLLADAQ
VTVTDENGEV KTLTTAADGG YFTENQSGNA VVGVIKPGFT SVERIVPVMP EKKATALDAR
LTKINGTKNV IDAIGGTIRV DVGAGLPRPA IELTIPNGAL SNQADIRLTP VSNQGLAGML
PPGWSPLAAV DVRLLDPTAG TALYTGFSSA AALKIPVQVP LAQASGTVLT LAAYDSTSHQ
WRSKGNATIA ADGLSASADI TAAGQYALLI ADPAPNAPAA AEAGNPLAPA STASLDYAAI
NTAGKVVPQA SPPSTGLKAA GEVLLTAKDG VTPAPQFISG FIINNRVTEK FDLKSGDKVE
PSAYTQDIVL YRQPCITNIA AGALTQPAVS GVGGGVSGVE LRTTFPVSPS RDFTLVDLLL
GKVGIEIVKP DATDSGIMVG ADGGRLVDAD GNILVIPQGA LARTTPVATK NGAAATGAVG
NDFTLLKVVE VNLTRQTLAQ SATISIPAPD GFNPSLPLLV AKQIDVKGVQ KLKLVALARQ
NGSFITTEPL TSQLQNSLNS SNSINSSGVY YFLQAKGQIG FATGTVTDSS NAPFNGALVK
TDNGSLIDLS AATGKYLVAA PVAAVTATAT DLYKNDEGSA TGAIAANQAI IIDLKILMIL
PTVVSVSPTG INIQPNVPVV ITFSKSMDQS SINSQTLKLL DSAGTAVPGV FTFSVDGKVV
TFTPADLLKS QQSYTITIAG TIKDLQGYPL GQDVTSGFTI RNTTPPPMPP AGSITASFPD
ADGFISITAT QGSSPADCTV LIINDTSGEI VSVVPAGNGS FTGKVRGQLG DEIKIVIMDN
SGNQTLVSYL TFKSDDGNYL VTAKGGKVEG EGGSILDIPE GALSGPTIVK LTFVPEASLA
SPVPAPGTYL SAFNIDTGGI DFQKEVHLSM PLPDGFDPTT PVFVNRPSEI YNADGTIEKV
YEIIDSTKVI NNRITTASPP FTGIVQIGSF AFISFGGMKK GIVSGYTYQK MNDQTGYQPP
PYGVIEIPTF DASGNPVYKY DRPIKGAVIR TPDAWDYVSY SNSSGFYAGF ATLYAYVVAM
QLDYRITAIH PLTMRRTSHT LFMDVDHDLI TNLNFMLADK NSIPPDKTAP VIDLTMQVAP
GQAPTNRISS GTMPLGTEVQ LPLFISDQSS VNPTLTVEYK SPEATTWQSY SAPLIRLGAV
LSSPPTADKP AIWRYTFKAV FPAELQGSQD TNCKPGLAGN YRFTVEATDT STNKSTKTLQ
LRVVAVGAIP GGVDGAPTVD SIAPADGAKD LPVTTQITAW FSEPVENVTP ATFKLIDTSN
GRAMPAIITT SYEGGRMQAV LTPRGNLSYD RQYQVQVLVL PAGGIVDINP NPSADNAFLP
LAQGYQSTFT TKRPAAYDLA EAEQFKGGRD IALYNHYDGN SYAYIAADDQ GWHVADVSDP
TNPSVVYSKS MSAPAVSWSY RGVAVHPAPT AGIMAMTENI VFGDGNQYGY IRFYDIKTDP
VNPAIIGKQR LAEAYSGIPG RLALAGEYAY IATAAAALQV VSISKAKDSY DGFSADNPSI
VGVFDSIGQG YGSPNDIAVY GTGKALLTTT AGYLITLDIN NPTPVQMGVI EPKKLNALRV
AGVSDYSYAD ADGNPQSMDL ALTGGGGRLK TVDLTDPYNP QPLATVKDAL DKEVVSYPYD
ITLNKETGLA IVSTLSAIQV VDVKDPKNPR LINTITQLPN SSGATTPEGS PAMIPIGTIP
AMAEKDGWLY MADQTKGMRT MGINTNDIRV YNESNIQVPE IGYSKGGEDD RRYYVEVKYE
DPDINCNDDD LVGSMVIKTR KGAVVNPLPG IDHPTQYLLS FRKGSDEHCY ADLKKTITSP
TNKRFIATNF PAADLKLTTP DGMTIAPVFG SIGTKAIFEF SSRKEGRLMR RKSIPLEKLI
RIAFDGNRDG VIDFKNPEDR KYTFWVNDDN DVNGYETEYT DINQGSKITY PVEDDNISGT
VKDCVRENGD KIKTLRDLED FARVQMQLHP HARKVMEILS NQTTQPKSFG YYLKFNNTNG
TSPEINIFQA VGETEKYLWN ENSARKQLDK PKLLTISSTE SELAPSYIST DKTSPYIFEG
RNFGKGELTF ILKADGDTLE ENSALLELKP IQTFYDKFRV TYDEGSKTVG SQVSEIKKSE
YPAKDKDYLL FVHGWNMQGE IEKDRWAETI YKRLWWQGYT GRVGSFSWPT LEKPYDIPRG
TTFDRSELRA WNSSEALKDV LEIKLRDYAG NIRLLGHSMG GIVSGEAIQK LSGPGIVKTY
IPTQAALSSH FYDNSVTESS RRLPLLPVTP NVFGYYYSGN SGSQDATYLA DSTGKAKFIN
YFNEVDYALT AGERGFMAWE FNTKFRPDPG YLYCNGLAVA CFTLLDPLPT IPGIISSLLN
SIDGGSNYYV RLGISNDGLS MPSLLKFPAD RYEIFSRIIQ SRVKALGATS KTPLKGFEKS
RNLKKFGYDD KHYSHSKEFR SNIVDEQGYW KAIFSDFELK SSIEKE