Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1169 |
Symbol | |
ID | 8136491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1358337 |
End bp | 1361804 |
Gene Length | 3468 bp |
Protein Length | 1155 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644868780 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_003020988 |
Protein GI | 253699799 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 4.34494e-16 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATTCAG CAAATTTCGT TCACCTTCAT CTCCACTCCC AGTATTCGCT CCTCGACGGG GCGATCCGCA TCGGTGACCT CCTGAAGAAG GTGAAGGAGT GCCACATGCC GGCCGTGGCC ATGACCGACC ACGGCAACAT GTTCGGCACC CTTGAGTTCT ATTTGAAGTG CAAGGACAAG GGGATCAAAC CGATCATCGG CAGCGAGGTG TACATCGCGC CGCAGTCGCG CCTTCTTAAG CAGGCGCCGG CCGCAGGCGA GGCGAACAGC TACCACCTGA TCCTCCTGTG CGAAAACATG ACCGGCTTCA AGAACCTCTC CTACCTGGTT TCCGCCGGCT ACAAGGAAGG GTTCTACCGC CGCCCCCGCA TCGACAAGGA ACTGCTGCTA AAGCACAAGG AAGGTCTCAT CGTCCTCTCC GCCTGCCTGC AGGGCGAGGT CGCCTACCTT GCGGGCAGGA ACAAGATGGA CGAGGCGCGC GCCGCGGCCT CCTGGTACGC CGAGAACTTC CCCGGGAGCT ACTACATAGA GCTGCAGGAA AACAAGCTCC CCGAGCAGGA CGTCGCCAAT CGCAGGCTCA TGGAGATCGC GCGGGAAATG GACCTGCCGC TCGTCGCCAC CAACGACTGC CATTACCTGA ACCGCGAGGA CGCGCGCGCC CACGAGATCC TCCTCTGCAT CCAGACCGGC AAGACCATGA GCGATCCCAA CCGCATGGCG TTCTCCGTTG ACGAGTTCTA CGTGAAGACT CCGGAGGAGA TGGCGGCGGC CTTTCATTAC GCCCCGGAGG CGCTCAGCAA CACCGTCAAG ATCGCCGAGC GCTGCCACCT GGAATTCGAC TTCAACACCT ACTACTTTCC CGCCTACGAG GCGCCGGAAG GGGAGACGCT GGACCAGCAA CTTGAGCGGG AGTCCAAGGC GGGCCTCATC GAGCGGCTCA AGAAGATCCG CATCAAGTAC AACCTCACCG AGGAGCAGGA ACAGGGCTAC CACGCCCGGC TGCGCATCGA GCTGGACTGC ATCAAGCAGA TGGGGTTCCC GGGCTACTTC CTGATCGTCG CCGACTTCAT CAACTGGGCC AAGGACCACG GCATCCCCGT CGGCCCCGGC AGGGGTTCCG CGGCGGGTTC TCTCGTCGCC TTCTGCATCA GGATCACCGA CATCGATCCC ATGCCGTACA ACCTCCTCTT CGAGCGATTC CTGAACCCGG AACGTATCTC CATGCCGGAT ATCGACGTCG ACTTCTGCCA GGATCGCCGC GAAGAGGTGA TCCAGTACAT GGTCGAGAAG TACGGCCGGG AGAAGGTCTG CCAGATCATC ACCTTCGGCA CCATGGCGGC GCGCGGCGTC ATCCGCGACG TGGGGCGCGC GCTCGATCTC ACCTTCGGCG AGGTGGACCG GATCGCGAAG CTGGTCCCGG AGGTGCTCGG GATAACCCTT GAGAAGGCGC TGGAGCAGGA GCCGAAGCTG AAGGAGCTGA TGGCGGCCGA CCCGAAGGTG AAGGAGGTCA TGACCGTCGC GCTCAGGCTG GAGGGGCTCG CCCGCCACGC CTCGACGCAT GCGGCCGGCC TCGTGGTCGC ACCACGCCAG ATGGAGGAAT TCTGCCCCGT CTACAAGGAC CAGAAGACCG GCTCCTTGAC CACGCAGTAC TCCATGAAGT ACGTGGAGAA GATCGGCCTG GTGAAGTTCG ACTTCCTGGG GCTCAAGAAC CTCACCGTCA TAGACAACGC CTGCAAGCAC ATCAGAAACG GCAAGGACCC CAACTTCGAC ATCACCCTTT TGCGCGACGA CGACGCGGAG TCCTACAAGC TCTTGCAGGC CGGCAACACG ACAGGCGTCT TCCAACTCGA GTCCAGCGGC ATGAAGGAGC TCCTGGTCAA GCTGAAGCCC TCCTGCTTCG AGGACATCAT CGCGGTCTGC GCCCTCTACC GTCCGGGTCC GCTCGGCAGC GGCATGGTCG ACGACTTCAT CGAAAGAAAG CACGGCCGCA AGCAGACCGT TTACGACCTG CCGCAGCTTG AGCCGGTCCT GAAAGACACC TACGGCGTCA TCGTCTACCA GGAACAGGTC ATGCAGATCT CCCGATCGCT CGCCGGCTAC TCGCTTGGGG GCGCGGACCT TCTGCGCCGC GCCATGGGCA AGAAGGACGC CGAGCAGATG GCGAAAGAGC GCGACAAGTT CCTGGAGGGG AGCGAGAAGC TGGGCCTCGA CGGGAAGAAG TGCGCTGCCA TATTCGACCT GATGGCGAAA TTCGCCGAGT ACGGCTTCAA CAAGTCGCAC TCGGCAGCCT ACGCGCTGGT CGCCTATCAG ACCGCATTCC TCAAGGCACA CTACCCGGTC GAGTTCATGG CTGCCCTCTT GACCGAGGAC ATGGGTAACA CCGACAAGGT CATCAAGAGC ATCGGCGACT GCCGCGAGAT GGGGATCGAG GTGCTCCCCC CCGACATCAA CGAGTCGGAC CGCTCCTTCC GCGTGCTGGA CAAGGCGATG CGCTTTGGTC TGGGCGCGGT CAAGAACGTG GGCGAAGGTG CCATCGAGGC GATCATCGAG GCGCGCGGCG ACGAGCCGTT CAAGGACCTC TTCGATTTCT GCGAGCGCGT CGACCTGCGC CGGGTCAATA AAAGGGTGAT TGAGGCGCTC ATCAAGTGCG GCGCCTTCGA CTGCACCGGC GCCAAGCGCT CGCAGCTCAT GGCGGGGCTG GAGGATGCCG CGGCGACGGG GCAGAGGGTG CAGCAGGAGC GCGAGAGCGC GCAGGCGTCG CTTTTCGGCG CCGCCGAGAT CGTGCGTGGC GGCAACGGCG GCGGCAACCG GCTTCCCGAC ATCCCCGAGT GGGACGAGAA GTACCGGCTC GGCTGCGAGA AGGAGGCTAT CGGCTTCTTC ATCACCGGGC ATCCGCTGGA CCGCTACGTC GCCGACATGA GGCGCTTTTC CACCGTGGAC TGCTCCACCA TCCTGGACGC CAAGGAGAAG GGCGAGGTGA GGATCTGCGG CGTACCTAGC ACAGTGAAGG AGCTGATCAC CAAGAAAGGG GACCGGATGG CCTTCCTCGC GCTGGAGGAC CTGGTGGGCT CGGTCGAGGT GGTGGTCTTT CCGGAAACCT ACGCCAAGTG CTCCGAGGTG TTAAGGGGGG ACGATCCCAT CCACGTCACC GGCACCGTCG AGCTGAGCGA GAAGGGGGCC AAGGTGATGG CGAGCGACAT CATACTCTTG CGCGACCTGG TGGAGCGTGA AACGAGAAAG GTCAATTTCA CCATAGACGC CAAAGAGGCC GACGAAGGGA AGCTCAAAAC GCTCAAGGAC ATCATCTCCC GCTATCAGGG AATCTGTCGC AGCTTCCTGC ACCTTGACAT AGAGAACAGC TCCAGAGTCA CCATCAAGCT TCCCGATGTA TACAAAGTCT CGGCAAGTGA AGAATTAACA GTGGAAGTGA GCAACCTCTT CGGCTATAAT GCCGTGTCCT TCGAGTGA
|
Protein sequence | MDSANFVHLH LHSQYSLLDG AIRIGDLLKK VKECHMPAVA MTDHGNMFGT LEFYLKCKDK GIKPIIGSEV YIAPQSRLLK QAPAAGEANS YHLILLCENM TGFKNLSYLV SAGYKEGFYR RPRIDKELLL KHKEGLIVLS ACLQGEVAYL AGRNKMDEAR AAASWYAENF PGSYYIELQE NKLPEQDVAN RRLMEIAREM DLPLVATNDC HYLNREDARA HEILLCIQTG KTMSDPNRMA FSVDEFYVKT PEEMAAAFHY APEALSNTVK IAERCHLEFD FNTYYFPAYE APEGETLDQQ LERESKAGLI ERLKKIRIKY NLTEEQEQGY HARLRIELDC IKQMGFPGYF LIVADFINWA KDHGIPVGPG RGSAAGSLVA FCIRITDIDP MPYNLLFERF LNPERISMPD IDVDFCQDRR EEVIQYMVEK YGREKVCQII TFGTMAARGV IRDVGRALDL TFGEVDRIAK LVPEVLGITL EKALEQEPKL KELMAADPKV KEVMTVALRL EGLARHASTH AAGLVVAPRQ MEEFCPVYKD QKTGSLTTQY SMKYVEKIGL VKFDFLGLKN LTVIDNACKH IRNGKDPNFD ITLLRDDDAE SYKLLQAGNT TGVFQLESSG MKELLVKLKP SCFEDIIAVC ALYRPGPLGS GMVDDFIERK HGRKQTVYDL PQLEPVLKDT YGVIVYQEQV MQISRSLAGY SLGGADLLRR AMGKKDAEQM AKERDKFLEG SEKLGLDGKK CAAIFDLMAK FAEYGFNKSH SAAYALVAYQ TAFLKAHYPV EFMAALLTED MGNTDKVIKS IGDCREMGIE VLPPDINESD RSFRVLDKAM RFGLGAVKNV GEGAIEAIIE ARGDEPFKDL FDFCERVDLR RVNKRVIEAL IKCGAFDCTG AKRSQLMAGL EDAAATGQRV QQERESAQAS LFGAAEIVRG GNGGGNRLPD IPEWDEKYRL GCEKEAIGFF ITGHPLDRYV ADMRRFSTVD CSTILDAKEK GEVRICGVPS TVKELITKKG DRMAFLALED LVGSVEVVVF PETYAKCSEV LRGDDPIHVT GTVELSEKGA KVMASDIILL RDLVERETRK VNFTIDAKEA DEGKLKTLKD IISRYQGICR SFLHLDIENS SRVTIKLPDV YKVSASEELT VEVSNLFGYN AVSFE
|
| |