Gene GM21_0516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0516 
Symbol 
ID8135827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp635712 
End bp638951 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content64% 
IMG OID644868135 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_003020354 
Protein GI253699165 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATCC ATCTTCACGC ACATTCTTCG CTGTCGCCCA ACTGGGGGGT GCACTCGCCG 
GAGACGCTCT GCTCTCACGC GGCTCTGCTG GGCTTCAATA CGCTCGCCAT CACCGACCGC
AACGGGCTCT ACGGGGTGCC GCGCTTCCTC GATGCTGCGC GGGAGACGGG CATCTCGCCC
ATCATCGGGA CGGAGGCGGT CACGCAAAAC AACCGGGCGG TCCTCCTTGC CTGTAACGAG
GAGGGGTACG CCAACATCTC CCGTCTCATC TCGGACTTGC ACTGCCAAAA GAACTTCGAC
CTCACCCAGG CGCTTTCGGA GTACCGGCGC GGCATCATCG TGTTGAGCGA CGACCGAAAG
CTTCTGTCGG CTCTCAAACG GAAGTCGGAG GAGGGGCTGT TCGTGGAGCT CTCGCCGGGA
CATGCGATGC ACAGCGCCCT GACGCTGGCA AGGGATCTCA GGCTGCCGCC GGTGGCGACA
TCTCGGGCCC TCCTCCGCGC TACGCGCTAC GGACGGCAGG CTCCTCCTCC GGCTACGCGC
TACGGACGGC AGGTTCCTCC TCCGGCTACG CCCGGTTCCT TCATAGCTGC GTCCGACGCC
CTCACCCCGA CCCTCTCCCA GAGGGAGAGG GGGAATGGAC TCAACACTCT GGAGGAGAGG
GGGAATGGAC TCAACACTCC GGTGGAGAGC CAACTGACGG GGAACGATGG TCTGGCGGAA
TACCTTGATA ACGCAGGGAC TTCTCCGCAG CTCGACGATT TCCACCTGCA TCGGGTGCTG
CGGGCTATTC ATTTGAACAC GAAGCTCTCG CGACTCACGC CGGAGATGAC GGCGACGGAG
TCTGACGCGC TCTACCCTGC GCAGAAGATG GCGGAGTTCT TTCCGCATTG CCCGGAGGCT
CTGCAAAATT CCATTCGCAT CGCTTCTCTT TGCAAAACGG ATTGGGATTT CTCCAGCACC
ATCTTCCCCG CTTTCCGTGA GTTGGGAACG GAGGCTGCGT TCGAAACGCT GCAGGAACGC
GCTCGCCAGG GGGCAATCTG GCGCTACGGC AGCATCGACG ACCGGGTGCA GGCCCGCCTC
GACAAGGAAC TCTCCATCAT CCGCGACAAG GGGTTCTCTC ATTACTTCCT GGTGGTGGAG
GAACTGACCA AGCAGTCGGA GAGAACCTGC GGCAGGGGGA GTGCTGCGGC CTCTCTCGTC
GCCTACTGCC TCGCGATCAC GCACGTCGAT CCCATCCGGC ACAACCTCTT CTTCGAGCGT
TTTCTGAACG AGGGGAGAAG CGACCCTCCC GACATCGACG TGGACTTTCC CTGGGACGAG
CGCGACGCCA TCCTCGACTT CGCCTTCGCG CGCTACGGCG CCCATCGGGC GGCCATGGTG
GCGAACCAAG TGGGCTTCAA GGGGAGGTCG GCGCTGCGCG AGGTGGCCAA GGTCTACGGC
CTGCCGGACT ACGAGATCAA GGAGATGACG GAACGAATCT CGGGATTTTG GCGCGCGGAG
CAAAGCGCCG CGGCTATGAG CGGCCACCCG CTCTTCAAGG GGGAGTCGCT TTCCTCCGAC
TGGCAGGAGA TCATGTCGAC GGCCAGGCGC CTGAACGGGC AGTTGCGCCA CCTGTCGCTC
CATTGCGGGG GGCTCGTCAT CGTGCCGGAC GAGATCCGCA AGTACGTCCC GGTGGAGATC
TCGCACAAGG GGCTGCCGCT GATCCAGTGG GAAAAGGACC AGACAGAGGA CGCGGGGCTG
GTGAAGATCG ACATCCTGGG AAACCGCTCG CTCGCCGTCA TCCGCGACGC GATGGCGGCG
GTGAAGGAGC AGAAGGGGGT CGAGATCGAC TACGCGACCT GGCGCCCGCT GGATGATGAA
CGGACGCAGA GCCTTCTGCG CCGCGGCCTC ACCATCGGCT GTTTCTACCT GGAATCACCT
TCGGTGCGCC TTTTGCTGCG CAAGATCTGG AGCAGCACGG CGCCGCCGGA GACCTTCCGG
CACGACCTCT TCGAGGTGCT GGTGCAGGCC TCCTCGATCA TCAGGCCGGC TGCGAACAGC
TTCATCCAGG AGTACGTGGC GAGGCTCCAG GGAAAACCCT GGTCGCACCT GCATCCCCTT
TTGGAGAGCG TGCTGGGGGA GACGCTGGGG ATAGCCATCT ACCAGGAGCA GATCACCCAG
ATCGCCATGG AACTTGCGGG TTTCTCCGCC GGCGAGGGGG ATCAGCTCAG GAAGGTGATC
ACCAAGAAGC ACCGCGAGAA GCGGCTCGCG GATTTCCGCG CCAAGTTCAT GGCGGGGGGC
ATGGAGCGCG GGGTCCCGGA AAAGGTGCTG CAGGGGATCT GGGACCAGAT ACTCTCCTTC
GCCGGCTACT CGTTTTGCAA GCCGCACTCG GCGAGCTACG CGCTTTTGAG CGGGAAGGCG
GCCTACATGA AGGCGAACCA CCCGGCGCAG TTCATCGCGG CGGTGATCTC CAACCAGGGG
GGGTACTACT CGCCGTTCGC CTACATCTCG GAGGGGCGCA GGCTGGGGCT TGCCATCCTA
CCCCCGGACA TCAACGAGAG CGAGTACCAC TACACCGGCA AAGAGCAGAC GCTTAGGGTC
GGCCTGATGC AGATCGACGG CCTGACAAGA GACGGCGCCG ACCGGCTCCT CAAGGAGCGC
CGCGAGCACG GCGCTTTCGC CTCCTTCAAG GAGTTCCTGC GCCGGGCGAG GCTGCAGCGC
GCCGACGCCG AGCGCCTGGT GAAGGCCGGT TGCTTCGACG CGCTGGAGGG GGAGGAGAAG
CGCCCGACGC TTCTTTGGGA GCTGCTCCAC TTCCAGCAGC AGGCGACGGC GCTTCTCTTC
GAGCAAAAGA CGGAGCTGCC GCACCCCCCT CCCTACGACG CGCAGATGGT GCTCAGGCAG
GAGGTGGAGA CGCTCGGCTT CCTGGTGTCG CGCCATCCGC TCGCGCTCTA CAAGGCGCAG
TGGCAGCGGC ACCGGCCCAT CAAGGCTTCG GAATTGATCA GGCACACGGG GAAATGGGTG
ACCATGGTGG GGTGGTGGAT CACGACGAAG ACGGTTGAGG ACAAGCACGG CAGGCCGATG
GAGTTCATCT CTTTCGAGGA CGTGACCGCG ATCTTCGACG CCACCTTCTT TCCGGACGTC
TACGCCAGGT TCTGCCGTAA GCTCTCGCAG CGGCGCCCCT ACCTGCTCAA AGGGATGGTG
GAAGAGGAGT TCGGGGTAGC AACACTCAGG GTGAAGTGGG TCGGCTTTCT GGACGGGTAG
 
Protein sequence
MFIHLHAHSS LSPNWGVHSP ETLCSHAALL GFNTLAITDR NGLYGVPRFL DAARETGISP 
IIGTEAVTQN NRAVLLACNE EGYANISRLI SDLHCQKNFD LTQALSEYRR GIIVLSDDRK
LLSALKRKSE EGLFVELSPG HAMHSALTLA RDLRLPPVAT SRALLRATRY GRQAPPPATR
YGRQVPPPAT PGSFIAASDA LTPTLSQRER GNGLNTLEER GNGLNTPVES QLTGNDGLAE
YLDNAGTSPQ LDDFHLHRVL RAIHLNTKLS RLTPEMTATE SDALYPAQKM AEFFPHCPEA
LQNSIRIASL CKTDWDFSST IFPAFRELGT EAAFETLQER ARQGAIWRYG SIDDRVQARL
DKELSIIRDK GFSHYFLVVE ELTKQSERTC GRGSAAASLV AYCLAITHVD PIRHNLFFER
FLNEGRSDPP DIDVDFPWDE RDAILDFAFA RYGAHRAAMV ANQVGFKGRS ALREVAKVYG
LPDYEIKEMT ERISGFWRAE QSAAAMSGHP LFKGESLSSD WQEIMSTARR LNGQLRHLSL
HCGGLVIVPD EIRKYVPVEI SHKGLPLIQW EKDQTEDAGL VKIDILGNRS LAVIRDAMAA
VKEQKGVEID YATWRPLDDE RTQSLLRRGL TIGCFYLESP SVRLLLRKIW SSTAPPETFR
HDLFEVLVQA SSIIRPAANS FIQEYVARLQ GKPWSHLHPL LESVLGETLG IAIYQEQITQ
IAMELAGFSA GEGDQLRKVI TKKHREKRLA DFRAKFMAGG MERGVPEKVL QGIWDQILSF
AGYSFCKPHS ASYALLSGKA AYMKANHPAQ FIAAVISNQG GYYSPFAYIS EGRRLGLAIL
PPDINESEYH YTGKEQTLRV GLMQIDGLTR DGADRLLKER REHGAFASFK EFLRRARLQR
ADAERLVKAG CFDALEGEEK RPTLLWELLH FQQQATALLF EQKTELPHPP PYDAQMVLRQ
EVETLGFLVS RHPLALYKAQ WQRHRPIKAS ELIRHTGKWV TMVGWWITTK TVEDKHGRPM
EFISFEDVTA IFDATFFPDV YARFCRKLSQ RRPYLLKGMV EEEFGVATLR VKWVGFLDG