Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0250 |
Symbol | |
ID | 8135557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 298050 |
End bp | 299786 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644867871 |
Product | PHP domain protein |
Protein accession | YP_003020093 |
Protein GI | 253698904 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.00055482 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAACG GGGAAATCGC ACGCATCTTT TCCGAGATCG CCGACATCCT GGAGATCAAG GAAGGGAACG TTTTCAAGAT AAGGGCCTAC CGGCGCGCGG CGCTCAACCT GGACGGCTTC AGCCGGGACC TGGCGCAGCT CACCCACAAG GAACTCCTGG AGATTCCGGG GGTGGGGGCG GATCTGGCGG CGAGGATCGA GGAATACCTC CAGACCGGGA CCATGGCGGC CTACGAGGAG CTGAAACAGG AGGTTCCCGC CGGCGTCTTC GCGCTGTTGG CCATCCCGGA CCTGGGGCCG AAGACGGCGA AGGCGATCTA CGACGCCCTG CAGATCGCGA GCATCGAGGA GCTGGAAAAG GCCGCCCTTG AGCACAGGTT GATCGGCATC AAGGGGATCA AGCAGAAGAC GGAGGAGAAC ATCCTCAAGG GGATAGCGGC GGTGAAGCGC GGACGCGAGC GCCAGCCCCT GGGGCGCATG CTCCCTGCGG CGCTCGAACT GGTGCAGGTG CTCAAAGAGC GGGCGCCGCT GGAGCGGGTC GAGGTGGCGG GAAGCATCCG TAGGCGCAGG GACAGCATCA AGGACATAGA CATCGTCGCC ACCTCCCCCG ATCCGGCCGC CGTCATGGCG GCTTTTGTCG ATCTGCCCCA GGTGCACGAC GTCATCATGC GCGGCCCGAC CCGTGCCAGC GTCACCATCC GCGAGGGGGT GCAGGTGGAC CTCCGGGTGG TGGATCCCAT CTCCTACGGC GCCGCCCTTG CCTACCTTAC CGGCAGCCAG GCCCACAACG TGCGCTTGCG CGAGATGGCC CAGAAACGGG GGCTGAAGAT CAACGAATAC GGCATCTTTC GGGAAGAGGA CAACCAGCGC CTGGGTGGCG TGGACGAGGA AGACATCTAC CGCCTGCTGG ACCTGGCCTT TGTCCCCCCG GTGCTGCGCG AGGACCGGGG AGAGATCGAA GCGGCGCTCC TGGGGAAGCT GCCGCGGCTG GTGACCCAAG CCGACATCAG GGGAGACCTG CACGTCCACT CCAGGTGGAG CGACGGCGCC CATGCCGTCT CGGAGCTGGT GGAGGCGGCA AGGGAGCGCG GCCTTTCCTA CCTCGCCGTC ACCGACCACT CGCAAGGGCT CGGCGTCGCG CGCGGCCTCT CCGTGGAGCG GCTTCGGGAA CAGGCCGTCG AGATAAAGGA ACTGAATAGG GAGCTCAAGG GGTTCCGGGT CCTGCACGGC ACCGAGATGG ACATCCTGGG GGACGGGACC CTCGATTTTC CCGACGAGGT GCTGAAGGAT CTCGACATAG TGGTCGCCTC CATCCATTCC GGGTTCAACA ACTCGAAGGA AGTCATGACC TCGCGCATCG TGGCCGCCAT GCGCAACCCC TACGTCTCGA TCATCGGGCA TCCGACCGGG CGCATCCTCG GAGAGAGGGA AGGGTACCAG GTAGACATGG ACGAGGTGCT GCGGGTAGCC AAGGAGACCG GGACGGCCCT GGAGATCAAC GCCTACCCGA TGCGGCTGGA TCTGGAGGAC AAGCACGTGC GCCGCGCGAA GGAACTGGGC GTCATGATCG CCGTCAACAC GGACACCCAC GCCAAGCTGC AATTCGATTT TCTCCCCTAC GGCATCTCGG TGGCGCAGCG CGGCTGGCTT GAGCCGGCGG ACGTGCTGAA TACGCTGGAA CCCGACCAGT TGCTGAAGAA GCTCAGAGAG AAGAGGAAGA AGATGGGTAT TAAATAA
|
Protein sequence | MKNGEIARIF SEIADILEIK EGNVFKIRAY RRAALNLDGF SRDLAQLTHK ELLEIPGVGA DLAARIEEYL QTGTMAAYEE LKQEVPAGVF ALLAIPDLGP KTAKAIYDAL QIASIEELEK AALEHRLIGI KGIKQKTEEN ILKGIAAVKR GRERQPLGRM LPAALELVQV LKERAPLERV EVAGSIRRRR DSIKDIDIVA TSPDPAAVMA AFVDLPQVHD VIMRGPTRAS VTIREGVQVD LRVVDPISYG AALAYLTGSQ AHNVRLREMA QKRGLKINEY GIFREEDNQR LGGVDEEDIY RLLDLAFVPP VLREDRGEIE AALLGKLPRL VTQADIRGDL HVHSRWSDGA HAVSELVEAA RERGLSYLAV TDHSQGLGVA RGLSVERLRE QAVEIKELNR ELKGFRVLHG TEMDILGDGT LDFPDEVLKD LDIVVASIHS GFNNSKEVMT SRIVAAMRNP YVSIIGHPTG RILGEREGYQ VDMDEVLRVA KETGTALEIN AYPMRLDLED KHVRRAKELG VMIAVNTDTH AKLQFDFLPY GISVAQRGWL EPADVLNTLE PDQLLKKLRE KRKKMGIK
|
| |