Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0541 |
Symbol | polA |
ID | 2685802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 572574 |
End bp | 575249 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637125207 |
Product | DNA polymerase I |
Protein accession | NP_951599 |
Protein GI | 39995648 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCG ACACGCTGTT TCTCATCGAC GGCTCCTCGT ACATCTACCG GGCCTACTTC GCCATCAGGC ATCTGTCGTC GCCCGCGGGT TTCCCCACCA ATGCGCTCTA CGGCTTCACC CAGATGCTCC TCAAGGTCAT CAAGGATCAC CATCCCGGCC GCTTGGCCGT GGTCTTCGAC AAGGGGCGCA CCACCTTCCG CACCGAGATC TACCCCGATT ACAAGGCGAA CCGGGCCGCC ATGCCCGATG ATCTGGTTCC CCAGATCGGA CCCATCAAGG AGATGGTCCG GGCCTTCAGC ATCCCGGTTC TGGAGTTGGA GGGCTACGAG GCCGACGACA TCATCGGCAC CATTGCCCGG CGGTGCGAGG AGCAGGGTTT GGAGGCCGTG GTCGTCACCG GCGACAAGGA TCTGATGCAG ATCGTGAGCG ACCGCATCCG GCTGCTCGAC ACCATGAAGG ACCGGGTGTC CGGCATCCCC GAGGTGGTGG AGCGCTTCGG CGTCGGCCCC GGGCAGGTCA TCGACATACT GGGCCTGGCG GGCGACACCT CCGACAACAT CCCCGGCGTC CCCGGCATCG GCGAAAAGAC CGCCACCAAG CTGATCCAGG AGTTTGGCTC TCTGGACGCC CTGCTGGAGC GGGCCGGCGA GGTTAAGGGA AAGACCGGCG AGCGCCTGCG CGAGTTCGCG GAGCAGGCCC GGCTTTCCCG GCGTCTCGCC ACCATTGTCC GCGACGTGCC CCTGGACTTC GACCTCGACG CCTTTGCCGT GGCCCCCGCC GACAATCGCC GATTGGCCGA ACTCTTCAAG GAATATGGGT TCACGACCCT CATGAAGGAG CTGACCAGCG AGGCCACCCT GACCACCGAG CACTACCGCA CCATTTTCGC GGAGGCCGAT TTCCGGGCCC TGCTCGCCGA CCTGGCCGGG GCCGGGGTCT TTTCGGTGGA CCTGGAAACC ACGAGCCTCA ATCCTCTTGA GGCGGACATC GTGGGCATCT CCCTGTCGTT CCGGGATCAC GAAGCCTGCT ACATCCCGGT GGGGCACGCC TATGACGGCG CCCCGGCCCA GCTCGACCGC ACCCTGGTGC TGGAGGGGCT GCGGCCGCTC CTCACCGACC CCGCGGTGCG CAAGGTGGGG CAGAACCTCA AGTACGACTA CCAGGTGCTG CGCCGTCACG GCATCGTCAT GGCCGGGGTC TGGTGCGATA CCATGGTCGC CTCCTATCTG ATCAACCCGG TCCGGTCGGG CCACGGCCTG GACTCCCTGG CGGTGGAGCA CCTGGACCAC CGGATGATCT CCTACGAGGA GGTGGCCGGG AAGGGAAAGG ACCAGCTGAA CTTTGCCCGG GTGCCGGTGG AAAAGGCGTC CACCTATTCC TGCGAGGATG CCGACGCCGC CTGGCTCCTG CACCGGCTGT TCCTGCCCCG GGTGGCCGAG GCCGGCATGG AGCGCCTGTT CTTCGATCTG GAGATGCCGC TCGTACCGAT CCTGGCCGAG ATGGAAACGG CCGGGGTCAA GCTGGACCTG GCGCTGCTGG GGGAACTCTC CGCCGGGCTC GGCAGCCAGT TGACGGCTCT GGAGGAGCAG GTCATGGCGC TCGCGCCCGA GCCGTTCAAC CTCAACTCGC CCAAGCAGCT GGGAGAGGTG CTGTTCGAGA AAATGAAGCT CCCAGCCGGG AAAAAGACCA AGACCAAAAC CGGCTGGTCC ACCAACGTGG AGGAGCTGGA ACGGCTGGCC GAGGCCGGGC ACGAGATCGC CGCGGCCATC CTCCGGTACC GGGGGCTCGC CAAGCTCAAG TCCACCTACA CCGATGCCCT GCCCAGGCTG GTGCACCCGG CCAGCGGGAG GGTGCACACC TCCTACAACC AGACCGTCAC CAACACCGGG CGGCTCTCGT CCTCGGAGCC CAACCTGCAG AACATCCCGG TCCGCACCGA CGAGGGGCGG AAGATCCGGC GCGCCTTCAT CGCGGAGGAG GGCCACCTGA TCCTCTCGGC CGACTACTCC CAGATCGAGC TGCGGGTCCT GGCCCATCTC TCGGAGGACC GGGTCTTCTG CGACGCCTTT GCCCGGGACG AGGACATCCA CACCCGGACC GCGGCCGAGG TGTTCGGCCT CTTCCCGGAG ATGGTGACGC CCGAGATGAG GCGCCAGGCC AAGGCCATCA ACTTCGGGGT CATCTACGGA CAGGGGGCCT TCAGTCTGGC CAAGCAACTG GGGATCACCA CCAAGGTGGC CAAGGAGTTC ATCGACAACT ACTTCGCCCG CCATCCTGGC GCCCGGGCCT TCCTGGACGG CTGCGTGGCC GAGGCCGAGG CCAGGGGCTA CGTGACCACC CTCATGGGGC GGCGGCTCCC CATTCCGGAC ATCGCCAGCA GCAACGGCAA CATCCGGGCC TTTGCCCAGC GCAACGCCGT CAACTACCCC ATTCAGGGCT CGGCCGCGGA CATCATCAAG GCGGCCATGG TGCGGGTCAC GGAGCGGATG CGGCGAGAGG GGCTCACCAG TCGCCTGATC ATGCAGGTCC ACGACGAACT GGTCTTCGAG GTGCCGGAGG GCGAACGGAC CGCCATGGAG CGGCTCGTGC AACACGAGAT GGAGCATGCT GCCCCCCTCC GGGTGCCGCT GCGGGCGGAC GTGAACGTGG GGCGTAACTG GAGCGAGGCC CACTGA
|
Protein sequence | MATDTLFLID GSSYIYRAYF AIRHLSSPAG FPTNALYGFT QMLLKVIKDH HPGRLAVVFD KGRTTFRTEI YPDYKANRAA MPDDLVPQIG PIKEMVRAFS IPVLELEGYE ADDIIGTIAR RCEEQGLEAV VVTGDKDLMQ IVSDRIRLLD TMKDRVSGIP EVVERFGVGP GQVIDILGLA GDTSDNIPGV PGIGEKTATK LIQEFGSLDA LLERAGEVKG KTGERLREFA EQARLSRRLA TIVRDVPLDF DLDAFAVAPA DNRRLAELFK EYGFTTLMKE LTSEATLTTE HYRTIFAEAD FRALLADLAG AGVFSVDLET TSLNPLEADI VGISLSFRDH EACYIPVGHA YDGAPAQLDR TLVLEGLRPL LTDPAVRKVG QNLKYDYQVL RRHGIVMAGV WCDTMVASYL INPVRSGHGL DSLAVEHLDH RMISYEEVAG KGKDQLNFAR VPVEKASTYS CEDADAAWLL HRLFLPRVAE AGMERLFFDL EMPLVPILAE METAGVKLDL ALLGELSAGL GSQLTALEEQ VMALAPEPFN LNSPKQLGEV LFEKMKLPAG KKTKTKTGWS TNVEELERLA EAGHEIAAAI LRYRGLAKLK STYTDALPRL VHPASGRVHT SYNQTVTNTG RLSSSEPNLQ NIPVRTDEGR KIRRAFIAEE GHLILSADYS QIELRVLAHL SEDRVFCDAF ARDEDIHTRT AAEVFGLFPE MVTPEMRRQA KAINFGVIYG QGAFSLAKQL GITTKVAKEF IDNYFARHPG ARAFLDGCVA EAEARGYVTT LMGRRLPIPD IASSNGNIRA FAQRNAVNYP IQGSAADIIK AAMVRVTERM RREGLTSRLI MQVHDELVFE VPEGERTAME RLVQHEMEHA APLRVPLRAD VNVGRNWSEA H
|
| |