Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3902 |
Symbol | |
ID | 5167070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 4553088 |
End bp | 4555766 |
Gene Length | 2679 bp |
Protein Length | 892 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640551384 |
Product | DNA polymerase I |
Protein accession | YP_001232624 |
Protein GI | 148265918 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000292646 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGTCGG AAAAACCAAC CATCTATCTC ATCGACGGCT CATCATACAT CTACCGGGCT TACTATGCGA TTCGCCATCT TTCTTCGCCG AAAGGGTTCC CCACCAATGC GCTCTATGGC TTCATCCAGA TGCTTTTGAA GGTGATCAAG GAGAAAAAAC CGGACCATCT CGCCGTGATT TTCGACGCCG GCCGGCAGAC CTTCCGCAAC GAAATCTATG CCGACTACAA GGCAAACCGC GCTGCCATGC CTGACGACCT GCGGCAGCAG ATTGAGCCGA TCAAGGAGGC GGTGCGGGCT TTCAACATCC CGGCGCTCGA GTTGGCCGGT TTCGAGGCCG ACGACATTAT CGGCACCATC GCCCGCGATT GTGAAGAGAA GGGGATGGCC GCGGTGGTAG TGACCGGCGA CAAGGATCTG ATGCAGATCG TCAGCGACAA CGTCACGCTT CTGGATACCA TGAAGGACAA GGTTTCGGGG CCTGCCGAGG TGGTTGAGCG GTTCGGAGTT GGTTCTGAGC GGGTTATCGA CATCCTCGGC CTGGCCGGCG ACTCTTCCGA CAATATTCCC GGTGTTCCCG GTATCGGCGA AAAGACGGCG ATCAAACTGG TAAACGATTT CGGTTCGCTG GATGAGCTCC TGGCGCGGGC CAATGAGGTG AAGGGAAAGA CCGGCGAGCG TCTGCAGGAA TTTGCTGAGC AGGCTCGTCT CTCGCGGAGG CTTGCAACCA TCGATTGTCA TGTCCCTTTG GCCTGGTCTT ACGACGACTT TGCCGCGTCT CCGCAGGACA ACCGGCGGCT GGCGGAATTG TTCAAGGAAT ACGGATTCAC GACGCTGATG AAGGAGTTGA CCAGTGAGGC GACCCTCTCG GCGGAAGATT ACCGGACCGT TCTCTCCTCC GACGATTTCA CGGCCCTTGT CCGGCAATTG ACCGCCGTCC GGGCCTTTGC CGTTGACCTG GAAACCACGA GTCTCAACCC GCTCGAAGCG GAGATCGTCG GCATCTCCTT CTCTTTCCGC GAGCATGAGG CGTACTATAT CCCGGTCGGC CACCGCTATC CGGGGGCGCC GCACCAGCTC TCCCGCGACG ATGTACTCGG AGCACTCAAG CCGCTACTGC TCGATCCGGA GAAGCACAAG ATCGGTCAGA ACATCAAGTA CGATTACCAG GTGTTGCGCC GGGCCGGGAT CGATATGCAG GGGATCTGGT GCGACACCAT GCTCGCCTCC TATCTGCTCA ACCCGACCCG CACCAGCCAG GGGCTTGATT CGCTGGCAGT GGAATTTCTC GATCACAGGA TGATTTCCTA TGCCGAGGTG GCCGGCAAGG GGAAGGAACA GAAGAACTTC GCCCAGGTCG AAGTGGAGAA AGCGTCGGTC TATTCCTGCG AAGACGCCGA TGCCACCTAT CTCCTGCACA AGCTCTTCTT GCCGAGGCTT GCCGAGTCAG GGATGGAACG GCTCTTTTTC GAGCTGGAGA TGCCGCTGGT CAAGATCCTG GCCGAGATGG AGCTGTACGG GGTAAAGCTC GACCTGCCGC TTTTGCAGCG CCTCTCAGAC GGATTCGGCG GCCAGCTTGC CGACTTGGAG CGAGAGATCT TCGCCCTGGC CGGGGGAGAG TTCAATGTCA ACTCGCCGAA GCAGCTGGGG GAGGTCCTTT TTGAGCGCTT GCAACTGCAG GTCGGCAAAA AGACCAAGAC CAAAACCGGC TGGTCGACCA ATGTAGACGA ACTGGAACGG CTGGCCGGGG AGCACGAGAT TGCCCGGCTG ATCCTCCAGT ACCGGAGCCT GTCCAAGCTG AAGTCCACCT ATACCGATGC CCTGCCCAAG CTCGTCGATG CTGCATCGGG GCGGGTCCAT ACCTCTTACA ACCAGGCGGT GACCAATACA GGTCGACTCT CTTCGTCGGA GCCGAATCTG CAGAACATCC CGATCCGCTC GGAAGAAGGT CGCAGCATCC GCCATGCCTT CATTGCCGCG GAGGGGTGCC TCCTCCTTTC CGCCGACTAT TCACAGATCG AACTGCGGGT TCTGGCCCAC CTGTCGGCGG ACCGGGTCTT CTGTGACGCT TTTGCCAGGG ACGAGGACAT CCATACCCGG ACCGCCGCCG AAGTGTTCGG CCTCTTCCCC CAGATGGTTA CCCAGGAGAT GCGTCGCCAG GCCAAGACCA TCAATTTTGG CGTAATCTAC GGCCAGGGAG CCTTCAGCCT GGCCAGGGAG CTCGGCGTAT CGACGAAGGT GGCAAAGGAA TTCATCGACA ACTACTTTGC CCGTCACGCA GGCGCCCGGA CCTTTCTTGA CGGCTGTGTC CGGGAGGCGG AGGATAATGG GTACGTTACC ACTATTCTTG GCCGGCGCCT TCCCATCCCG GATATCAAGA GTTCAAACGG CAACATTCGG GCCTTTGCCC AGCGGAACGC CGTCAACTAT CCGATCCAGG GGTCGGCGGC CGATATCATC AAGCAGGCGA TGGTGCGCGT GGTTGACCGG ATGGAGCGGG AGGGGATGAA AAGCCGTCTC ATCATGCAGG TCCACGACGA ATTGGTGTTC GAGGTGCCGG AAGAGGAAAA GCTGGCGATG GAGATGCTCG TGAAACACGA AATGGAGCAC GCTGTTACTC TGCGCGTACC GCTCAGGGTG GATATGAACT TCGGCAGGAA CTGGAGCGAG GCCCACTGA
|
Protein sequence | MMSEKPTIYL IDGSSYIYRA YYAIRHLSSP KGFPTNALYG FIQMLLKVIK EKKPDHLAVI FDAGRQTFRN EIYADYKANR AAMPDDLRQQ IEPIKEAVRA FNIPALELAG FEADDIIGTI ARDCEEKGMA AVVVTGDKDL MQIVSDNVTL LDTMKDKVSG PAEVVERFGV GSERVIDILG LAGDSSDNIP GVPGIGEKTA IKLVNDFGSL DELLARANEV KGKTGERLQE FAEQARLSRR LATIDCHVPL AWSYDDFAAS PQDNRRLAEL FKEYGFTTLM KELTSEATLS AEDYRTVLSS DDFTALVRQL TAVRAFAVDL ETTSLNPLEA EIVGISFSFR EHEAYYIPVG HRYPGAPHQL SRDDVLGALK PLLLDPEKHK IGQNIKYDYQ VLRRAGIDMQ GIWCDTMLAS YLLNPTRTSQ GLDSLAVEFL DHRMISYAEV AGKGKEQKNF AQVEVEKASV YSCEDADATY LLHKLFLPRL AESGMERLFF ELEMPLVKIL AEMELYGVKL DLPLLQRLSD GFGGQLADLE REIFALAGGE FNVNSPKQLG EVLFERLQLQ VGKKTKTKTG WSTNVDELER LAGEHEIARL ILQYRSLSKL KSTYTDALPK LVDAASGRVH TSYNQAVTNT GRLSSSEPNL QNIPIRSEEG RSIRHAFIAA EGCLLLSADY SQIELRVLAH LSADRVFCDA FARDEDIHTR TAAEVFGLFP QMVTQEMRRQ AKTINFGVIY GQGAFSLARE LGVSTKVAKE FIDNYFARHA GARTFLDGCV REAEDNGYVT TILGRRLPIP DIKSSNGNIR AFAQRNAVNY PIQGSAADII KQAMVRVVDR MEREGMKSRL IMQVHDELVF EVPEEEKLAM EMLVKHEMEH AVTLRVPLRV DMNFGRNWSE AH
|
| |