Gene GM21_0250 details


Gene Information

Locus tag: GM21_0250
Symbol:
ID: 8135557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 298050
End bp: 299786
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 64%
IMG OID: 644867871
Product: PHP domain protein
Protein accession: YP_003020093
Protein GI: 253698904
COG category: [L] Replication, recombination and repair
COG ID: [COG1796] DNA polymerase IV (family X)
TIGRFAM ID:
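
The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(298050, 299786)
print(length)                  # 1737
print(protein_length(length))  # 578
```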


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.00055482
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAACG GGGAAATCGC ACGCATCTTT TCCGAGATCG CCGACATCCT GGAGATCAAG 
GAAGGGAACG TTTTCAAGAT AAGGGCCTAC CGGCGCGCGG CGCTCAACCT GGACGGCTTC
AGCCGGGACC TGGCGCAGCT CACCCACAAG GAACTCCTGG AGATTCCGGG GGTGGGGGCG
GATCTGGCGG CGAGGATCGA GGAATACCTC CAGACCGGGA CCATGGCGGC CTACGAGGAG
CTGAAACAGG AGGTTCCCGC CGGCGTCTTC GCGCTGTTGG CCATCCCGGA CCTGGGGCCG
AAGACGGCGA AGGCGATCTA CGACGCCCTG CAGATCGCGA GCATCGAGGA GCTGGAAAAG
GCCGCCCTTG AGCACAGGTT GATCGGCATC AAGGGGATCA AGCAGAAGAC GGAGGAGAAC
ATCCTCAAGG GGATAGCGGC GGTGAAGCGC GGACGCGAGC GCCAGCCCCT GGGGCGCATG
CTCCCTGCGG CGCTCGAACT GGTGCAGGTG CTCAAAGAGC GGGCGCCGCT GGAGCGGGTC
GAGGTGGCGG GAAGCATCCG TAGGCGCAGG GACAGCATCA AGGACATAGA CATCGTCGCC
ACCTCCCCCG ATCCGGCCGC CGTCATGGCG GCTTTTGTCG ATCTGCCCCA GGTGCACGAC
GTCATCATGC GCGGCCCGAC CCGTGCCAGC GTCACCATCC GCGAGGGGGT GCAGGTGGAC
CTCCGGGTGG TGGATCCCAT CTCCTACGGC GCCGCCCTTG CCTACCTTAC CGGCAGCCAG
GCCCACAACG TGCGCTTGCG CGAGATGGCC CAGAAACGGG GGCTGAAGAT CAACGAATAC
GGCATCTTTC GGGAAGAGGA CAACCAGCGC CTGGGTGGCG TGGACGAGGA AGACATCTAC
CGCCTGCTGG ACCTGGCCTT TGTCCCCCCG GTGCTGCGCG AGGACCGGGG AGAGATCGAA
GCGGCGCTCC TGGGGAAGCT GCCGCGGCTG GTGACCCAAG CCGACATCAG GGGAGACCTG
CACGTCCACT CCAGGTGGAG CGACGGCGCC CATGCCGTCT CGGAGCTGGT GGAGGCGGCA
AGGGAGCGCG GCCTTTCCTA CCTCGCCGTC ACCGACCACT CGCAAGGGCT CGGCGTCGCG
CGCGGCCTCT CCGTGGAGCG GCTTCGGGAA CAGGCCGTCG AGATAAAGGA ACTGAATAGG
GAGCTCAAGG GGTTCCGGGT CCTGCACGGC ACCGAGATGG ACATCCTGGG GGACGGGACC
CTCGATTTTC CCGACGAGGT GCTGAAGGAT CTCGACATAG TGGTCGCCTC CATCCATTCC
GGGTTCAACA ACTCGAAGGA AGTCATGACC TCGCGCATCG TGGCCGCCAT GCGCAACCCC
TACGTCTCGA TCATCGGGCA TCCGACCGGG CGCATCCTCG GAGAGAGGGA AGGGTACCAG
GTAGACATGG ACGAGGTGCT GCGGGTAGCC AAGGAGACCG GGACGGCCCT GGAGATCAAC
GCCTACCCGA TGCGGCTGGA TCTGGAGGAC AAGCACGTGC GCCGCGCGAA GGAACTGGGC
GTCATGATCG CCGTCAACAC GGACACCCAC GCCAAGCTGC AATTCGATTT TCTCCCCTAC
GGCATCTCGG TGGCGCAGCG CGGCTGGCTT GAGCCGGCGG ACGTGCTGAA TACGCTGGAA
CCCGACCAGT TGCTGAAGAA GCTCAGAGAG AAGAGGAAGA AGATGGGTAT TAAATAA
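
The GC content listed under Gene Information can be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted as shown here, in space-separated blocks of ten bases; `gc_content` is a hypothetical helper, and the rounding used by the source database is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example on the first three blocks of the gene sequence only.
first_blocks = "ATGAAAAACG GGGAAATCGC ACGCATCTTT"
print(round(gc_content(first_blocks), 1))
```

Passing the full 1737 bp sequence instead of the three-block sample gives the gene-wide value to compare with the reported figure.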
 
Protein sequence
MKNGEIARIF SEIADILEIK EGNVFKIRAY RRAALNLDGF SRDLAQLTHK ELLEIPGVGA 
DLAARIEEYL QTGTMAAYEE LKQEVPAGVF ALLAIPDLGP KTAKAIYDAL QIASIEELEK
AALEHRLIGI KGIKQKTEEN ILKGIAAVKR GRERQPLGRM LPAALELVQV LKERAPLERV
EVAGSIRRRR DSIKDIDIVA TSPDPAAVMA AFVDLPQVHD VIMRGPTRAS VTIREGVQVD
LRVVDPISYG AALAYLTGSQ AHNVRLREMA QKRGLKINEY GIFREEDNQR LGGVDEEDIY
RLLDLAFVPP VLREDRGEIE AALLGKLPRL VTQADIRGDL HVHSRWSDGA HAVSELVEAA
RERGLSYLAV TDHSQGLGVA RGLSVERLRE QAVEIKELNR ELKGFRVLHG TEMDILGDGT
LDFPDEVLKD LDIVVASIHS GFNNSKEVMT SRIVAAMRNP YVSIIGHPTG RILGEREGYQ
VDMDEVLRVA KETGTALEIN AYPMRLDLED KHVRRAKELG VMIAVNTDTH AKLQFDFLPY
GISVAQRGWL EPADVLNTLE PDQLLKKLRE KRKKMGIK
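
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below uses only the Python standard library and assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAAAACG GGGAAATCGC ACGCATCTTT"))  # MKNGEIARIF
```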