Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0671 |
Symbol | |
ID | 8135986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 804440 |
End bp | 806077 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868288 |
Product | protein of unknown function DUF342 |
Protein accession | YP_003020503 |
Protein GI | 253699314 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 6.38243e-32 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAGCA ATCGTCTCGA AGGGACAGGA ATCGCGTTCA AATTGAGCCC CGACCAGAGG GTGCTCACCG CTTCTTTCAC CCCGGTGGTG AACAAGCGGG GGATCAGCCT GGAGCAGCTG CAGGAGGCGA TCGCCGAGGA AGGTTACGGC GACCTCTTCA TCTCCCAGGA CGCGCTCGGG CAGTTCCTGA AAAAATGGGA AGTCTCCCCC ATGGCGTTCT CCCTGCAGAT AGGGGAGCGG CGCGACGCCA CCCTCGCCAT CCAGATCCCC GACGACATGA TGCAGGCCGT CATGACCATC CAGCCCGCCT ACGGGGGGCG CCGCATCACG GTCGAGGAGG TGGAGCAGGA GCTCGCGGCC CGGGGAGTCG TCTGCGGCAT CCTCTACGAC GAGATCAGGT GCGCCGTCGA AGCGGGCGAG GCGTCGAAGC TCGTCATCGC CGCGGGCACC CCGCCGGTAC CGGGCGAGGA CACCCAGTTC ATCTCGCTCA TCCCGGAAAT AACCGCCCAG GCGCCGCGGG TCTCCGACGA CGACACGGCC GACTACCGCA ACCTCGGGGA CATCGTCAGC GTAAGCCTTG GGGATCCGCT TTTGCGCCGC ACCCACCCCA CCACGGGGAT CGCAGGGATG AACCTTTTGG GGGTCGAACT CCCCACCACC GACGGGGTCG AGCTCTCCTT CGCCGAGAAC CTCACCGGGA TCGCCTGCGA TCTCACCGAC TGCGACCTCC TGATCGCCGC CATCTCGGGG CACCCGGTCA TCGTCCCCCG GGGGGTGATC GTCGACCCGG TCTTCAAGGT GAAACGGGTC GACCTCTCCA CCGGCAACCT CCACTTCAAG GGGTCCCTCG ATATCGAGGG GGACGTGCTG GAGGGGATGG AGGTGAGCGC CACCCAGGAC ATCAACGTCG GGGGGATCGT GGAGGCGGCG AGAGTCAAGG CCGGCGGGAA CATCGTGGTC AGCGGCGGTG TGATCGGCCA CGGCAAAGGG GCGCAGGGAA AAGTGGAGAA CCGCAAGGAG ATGGCGCAGC TGGAGGCGGG AGGCTCGGTC TGGGTGCAGT TCGCCGAGAA CGCCCTCATC AACGCCGGGG GGGAGATCGT GGTCAAGGAA CTCTCCATGC AGAGCGAACT CACCTCCGGC TCCAGCATCA CGGTCGGCGA AAAGGGGGCG CGCAAAGGGC ACCTCATCGG CGGGGTCTGC CGCGCCGTCT CGCTGGTGCA TGCCGTGGTG GTCGGCTCCC ACGCCGGGGT CCCCACCGCC ATCGAGGTGG GGGTCGACCC GGCGCTGAAC AAGAAGCTCG AGATCGTACT GGACGCTTTA GCCGAGAAGG GGCGCCTCAT CGAGGAGCTG GCCAAGACCC TCGCCTACGT GCGTGAGAAC CCGGGGAAAA TGGAGCCCGG CCTTCTGGCC CTCAAAGAGC GGATCTACGC CAAGTACCAG GCGGAGATCG CCGAGCTCAA CAATGAGAAG AAGCGGCTGG AGAAAAGAAT GGAGATCAAC GCGCAGGCGA GGGTGGAGGT GGAACGCGAC GCCTTCCTGG GGACGCAGAT CAAGATCGGC TCGACCGCCT TGCAGATCGA GGAGGACCTG ACCAACCCCA CCTTCGCCCT GGGCGAGAAC GGGATCATCT ACTCATGA
|
Protein sequence | MQSNRLEGTG IAFKLSPDQR VLTASFTPVV NKRGISLEQL QEAIAEEGYG DLFISQDALG QFLKKWEVSP MAFSLQIGER RDATLAIQIP DDMMQAVMTI QPAYGGRRIT VEEVEQELAA RGVVCGILYD EIRCAVEAGE ASKLVIAAGT PPVPGEDTQF ISLIPEITAQ APRVSDDDTA DYRNLGDIVS VSLGDPLLRR THPTTGIAGM NLLGVELPTT DGVELSFAEN LTGIACDLTD CDLLIAAISG HPVIVPRGVI VDPVFKVKRV DLSTGNLHFK GSLDIEGDVL EGMEVSATQD INVGGIVEAA RVKAGGNIVV SGGVIGHGKG AQGKVENRKE MAQLEAGGSV WVQFAENALI NAGGEIVVKE LSMQSELTSG SSITVGEKGA RKGHLIGGVC RAVSLVHAVV VGSHAGVPTA IEVGVDPALN KKLEIVLDAL AEKGRLIEEL AKTLAYVREN PGKMEPGLLA LKERIYAKYQ AEIAELNNEK KRLEKRMEIN AQARVEVERD AFLGTQIKIG STALQIEEDL TNPTFALGEN GIIYS
|
| |