Gene Information |
Locus tag | GM21_0516 |
Symbol | |
ID | 8135827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 635712 |
End bp | 638951 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868135 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_003020354 |
Protein GI | 253699165 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
| |
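The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes a single terminal stop codon. The function names are illustrative only and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(635712, 638951)
print(length_bp)                  # 3240
print(protein_length(length_bp))  # 1079
```

Under these assumptions the computed values agree with the Gene Length (3240 bp) and Protein Length (1079 aa) fields.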
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCATCC ATCTTCACGC ACATTCTTCG CTGTCGCCCA ACTGGGGGGT GCACTCGCCG GAGACGCTCT GCTCTCACGC GGCTCTGCTG GGCTTCAATA CGCTCGCCAT CACCGACCGC AACGGGCTCT ACGGGGTGCC GCGCTTCCTC GATGCTGCGC GGGAGACGGG CATCTCGCCC ATCATCGGGA CGGAGGCGGT CACGCAAAAC AACCGGGCGG TCCTCCTTGC CTGTAACGAG GAGGGGTACG CCAACATCTC CCGTCTCATC TCGGACTTGC ACTGCCAAAA GAACTTCGAC CTCACCCAGG CGCTTTCGGA GTACCGGCGC GGCATCATCG TGTTGAGCGA CGACCGAAAG CTTCTGTCGG CTCTCAAACG GAAGTCGGAG GAGGGGCTGT TCGTGGAGCT CTCGCCGGGA CATGCGATGC ACAGCGCCCT GACGCTGGCA AGGGATCTCA GGCTGCCGCC GGTGGCGACA TCTCGGGCCC TCCTCCGCGC TACGCGCTAC GGACGGCAGG CTCCTCCTCC GGCTACGCGC TACGGACGGC AGGTTCCTCC TCCGGCTACG CCCGGTTCCT TCATAGCTGC GTCCGACGCC CTCACCCCGA CCCTCTCCCA GAGGGAGAGG GGGAATGGAC TCAACACTCT GGAGGAGAGG GGGAATGGAC TCAACACTCC GGTGGAGAGC CAACTGACGG GGAACGATGG TCTGGCGGAA TACCTTGATA ACGCAGGGAC TTCTCCGCAG CTCGACGATT TCCACCTGCA TCGGGTGCTG CGGGCTATTC ATTTGAACAC GAAGCTCTCG CGACTCACGC CGGAGATGAC GGCGACGGAG TCTGACGCGC TCTACCCTGC GCAGAAGATG GCGGAGTTCT TTCCGCATTG CCCGGAGGCT CTGCAAAATT CCATTCGCAT CGCTTCTCTT TGCAAAACGG ATTGGGATTT CTCCAGCACC ATCTTCCCCG CTTTCCGTGA GTTGGGAACG GAGGCTGCGT TCGAAACGCT GCAGGAACGC GCTCGCCAGG GGGCAATCTG GCGCTACGGC AGCATCGACG ACCGGGTGCA GGCCCGCCTC GACAAGGAAC TCTCCATCAT CCGCGACAAG GGGTTCTCTC ATTACTTCCT GGTGGTGGAG GAACTGACCA AGCAGTCGGA GAGAACCTGC GGCAGGGGGA GTGCTGCGGC CTCTCTCGTC GCCTACTGCC TCGCGATCAC GCACGTCGAT CCCATCCGGC ACAACCTCTT CTTCGAGCGT TTTCTGAACG AGGGGAGAAG CGACCCTCCC GACATCGACG TGGACTTTCC CTGGGACGAG CGCGACGCCA TCCTCGACTT CGCCTTCGCG CGCTACGGCG CCCATCGGGC GGCCATGGTG GCGAACCAAG TGGGCTTCAA GGGGAGGTCG GCGCTGCGCG AGGTGGCCAA GGTCTACGGC CTGCCGGACT ACGAGATCAA GGAGATGACG GAACGAATCT CGGGATTTTG GCGCGCGGAG CAAAGCGCCG CGGCTATGAG CGGCCACCCG CTCTTCAAGG GGGAGTCGCT TTCCTCCGAC TGGCAGGAGA TCATGTCGAC GGCCAGGCGC CTGAACGGGC AGTTGCGCCA CCTGTCGCTC CATTGCGGGG GGCTCGTCAT CGTGCCGGAC GAGATCCGCA AGTACGTCCC GGTGGAGATC TCGCACAAGG GGCTGCCGCT GATCCAGTGG GAAAAGGACC AGACAGAGGA CGCGGGGCTG GTGAAGATCG ACATCCTGGG AAACCGCTCG CTCGCCGTCA TCCGCGACGC GATGGCGGCG 
GTGAAGGAGC AGAAGGGGGT CGAGATCGAC TACGCGACCT GGCGCCCGCT GGATGATGAA CGGACGCAGA GCCTTCTGCG CCGCGGCCTC ACCATCGGCT GTTTCTACCT GGAATCACCT TCGGTGCGCC TTTTGCTGCG CAAGATCTGG AGCAGCACGG CGCCGCCGGA GACCTTCCGG CACGACCTCT TCGAGGTGCT GGTGCAGGCC TCCTCGATCA TCAGGCCGGC TGCGAACAGC TTCATCCAGG AGTACGTGGC GAGGCTCCAG GGAAAACCCT GGTCGCACCT GCATCCCCTT TTGGAGAGCG TGCTGGGGGA GACGCTGGGG ATAGCCATCT ACCAGGAGCA GATCACCCAG ATCGCCATGG AACTTGCGGG TTTCTCCGCC GGCGAGGGGG ATCAGCTCAG GAAGGTGATC ACCAAGAAGC ACCGCGAGAA GCGGCTCGCG GATTTCCGCG CCAAGTTCAT GGCGGGGGGC ATGGAGCGCG GGGTCCCGGA AAAGGTGCTG CAGGGGATCT GGGACCAGAT ACTCTCCTTC GCCGGCTACT CGTTTTGCAA GCCGCACTCG GCGAGCTACG CGCTTTTGAG CGGGAAGGCG GCCTACATGA AGGCGAACCA CCCGGCGCAG TTCATCGCGG CGGTGATCTC CAACCAGGGG GGGTACTACT CGCCGTTCGC CTACATCTCG GAGGGGCGCA GGCTGGGGCT TGCCATCCTA CCCCCGGACA TCAACGAGAG CGAGTACCAC TACACCGGCA AAGAGCAGAC GCTTAGGGTC GGCCTGATGC AGATCGACGG CCTGACAAGA GACGGCGCCG ACCGGCTCCT CAAGGAGCGC CGCGAGCACG GCGCTTTCGC CTCCTTCAAG GAGTTCCTGC GCCGGGCGAG GCTGCAGCGC GCCGACGCCG AGCGCCTGGT GAAGGCCGGT TGCTTCGACG CGCTGGAGGG GGAGGAGAAG CGCCCGACGC TTCTTTGGGA GCTGCTCCAC TTCCAGCAGC AGGCGACGGC GCTTCTCTTC GAGCAAAAGA CGGAGCTGCC GCACCCCCCT CCCTACGACG CGCAGATGGT GCTCAGGCAG GAGGTGGAGA CGCTCGGCTT CCTGGTGTCG CGCCATCCGC TCGCGCTCTA CAAGGCGCAG TGGCAGCGGC ACCGGCCCAT CAAGGCTTCG GAATTGATCA GGCACACGGG GAAATGGGTG ACCATGGTGG GGTGGTGGAT CACGACGAAG ACGGTTGAGG ACAAGCACGG CAGGCCGATG GAGTTCATCT CTTTCGAGGA CGTGACCGCG ATCTTCGACG CCACCTTCTT TCCGGACGTC TACGCCAGGT TCTGCCGTAA GCTCTCGCAG CGGCGCCCCT ACCTGCTCAA AGGGATGGTG GAAGAGGAGT TCGGGGTAGC AACACTCAGG GTGAAGTGGG TCGGCTTTCT GGACGGGTAG
|
Protein sequence | MFIHLHAHSS LSPNWGVHSP ETLCSHAALL GFNTLAITDR NGLYGVPRFL DAARETGISP IIGTEAVTQN NRAVLLACNE EGYANISRLI SDLHCQKNFD LTQALSEYRR GIIVLSDDRK LLSALKRKSE EGLFVELSPG HAMHSALTLA RDLRLPPVAT SRALLRATRY GRQAPPPATR YGRQVPPPAT PGSFIAASDA LTPTLSQRER GNGLNTLEER GNGLNTPVES QLTGNDGLAE YLDNAGTSPQ LDDFHLHRVL RAIHLNTKLS RLTPEMTATE SDALYPAQKM AEFFPHCPEA LQNSIRIASL CKTDWDFSST IFPAFRELGT EAAFETLQER ARQGAIWRYG SIDDRVQARL DKELSIIRDK GFSHYFLVVE ELTKQSERTC GRGSAAASLV AYCLAITHVD PIRHNLFFER FLNEGRSDPP DIDVDFPWDE RDAILDFAFA RYGAHRAAMV ANQVGFKGRS ALREVAKVYG LPDYEIKEMT ERISGFWRAE QSAAAMSGHP LFKGESLSSD WQEIMSTARR LNGQLRHLSL HCGGLVIVPD EIRKYVPVEI SHKGLPLIQW EKDQTEDAGL VKIDILGNRS LAVIRDAMAA VKEQKGVEID YATWRPLDDE RTQSLLRRGL TIGCFYLESP SVRLLLRKIW SSTAPPETFR HDLFEVLVQA SSIIRPAANS FIQEYVARLQ GKPWSHLHPL LESVLGETLG IAIYQEQITQ IAMELAGFSA GEGDQLRKVI TKKHREKRLA DFRAKFMAGG MERGVPEKVL QGIWDQILSF AGYSFCKPHS ASYALLSGKA AYMKANHPAQ FIAAVISNQG GYYSPFAYIS EGRRLGLAIL PPDINESEYH YTGKEQTLRV GLMQIDGLTR DGADRLLKER REHGAFASFK EFLRRARLQR ADAERLVKAG CFDALEGEEK RPTLLWELLH FQQQATALLF EQKTELPHPP PYDAQMVLRQ EVETLGFLVS RHPLALYKAQ WQRHRPIKAS ELIRHTGKWV TMVGWWITTK TVEDKHGRPM EFISFEDVTA IFDATFFPDV YARFCRKLSQ RRPYLLKGMV EEEFGVATLR VKWVGFLDG
|
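The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal, dependency-free translator. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not model the alternative start codons that table 11 permits. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The gene sequence is assumed to be given in coding orientation, as its leading ATG suggests, even though the gene lies on the minus strand.

```python
from itertools import product

# Standard codon-to-amino-acid assignments (shared by translation table 11),
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, and its last nine bases.
print(translate("ATGTTCATCC ATCTTCACGC ACATTCTTCG"))  # MFIHLHAHSS
print(translate("GACGGGTAG"))                         # DG
```

The two outputs match the first ten and last two residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the figure to compare against the reported GC content.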