Gene GM21_1921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1921 
Symbol 
ID8137255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2227961 
End bp2229226 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content63% 
IMG OID644869535 
Productmetallophosphoesterase 
Protein accessionYP_003021732 
Protein GI253700543 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value0.666866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTCA GATTCATCCA CACCTCAGAC ATCCACCTGG GAAAAACCTA CCGCTGCCTG 
GGCGGCGACG CCGAGCGCTA TCAGGACTTT TTCACCACCT TCGCAGCCAT CATCGCCGAC
GCCGTCGAGG AGCGGGTCGA TTTCGTCCTG ATCGGCGGCG ACCTCTTCCA TACCGGCCAA
ATCCTCCCCA AGACTTTCGC CAAAACCATC GAAATCCTGC AGCCGTTGAA GGACGCGGGC
ATCCCCTGCC TTGCGGTCGA GGGAAACCAC GACTGGATAC ACCGTCGCGA CAGCGTCTCC
TGGATGGAGG CGCTTTCCCA ACTGGGGTAC ATCCGCCTGC TGCGCCCCTC CCGTACCGGC
GACGGCGATT ACCTTTTCGC GCCCTTCGAT CTGGAGCAGG GAGCGGGGGG GCACCTCGAA
ATCGGCGGGG TGAATATCTA CGGGCTCGGT TATATCGGCT CCCAGGCGGC CAACCACGTG
GCGCGCATCT GCGAGGCGGT CGATACCCGC CGAAACATAT TGCTCTTTCA CGTCGGCGTC
TGGAGCTACT CTCCCGTGGA GATCGGCAAC ATCCGTCCTG AGGAGGCGCT CCCCTTGTCG
GAGTGCTTCG ACTACGTGGC GCTCGGGCAC GGCCACAAGC CTTACGTCGT CAGCACCCCC
GACGGCCGCC CCTATGCCTT CAACCCCGGA TCACCCGACT GCGTCAACTT CGGCGAGGAG
CGCTACGACA AGGGGTACTA CCTTGTCTCG TTGGAGGAGG GTGGGGAGAC CCTTCATGAA
TTCCGGCGCT GTTCCCCCCG CCCTATGCTG GTTCTCACGG TGAACCTGGA AGGCGCCAAG
AATGCCGACG AGGCGCTGCA GCGCTTCGCC TCCGGGGTCG CCGAGAAGCT TGGCGGCAGC
TCCGATCCGC GTTCTCCGCT GATAGAGGTG CGGCTTTGCG GCAAGGTAGG CTTCCACCCC
TTCGAGCTCA GCCGCGACCG TTTGCGGCTG GCCCTCTTCG AGGTCTGCCA ACCGCTGCAC
CTGGAGATAA AGAACCACCT CTCCCAGGTC TCCGGCGGGG GAGGGGAGGA GAAGGTCAAG
AAGAGCCTCG CCGAGATCGA GCGGGATGTA TTGGCCGAGC TGGTAGGGGC GAACAGCCAG
TACCAGGGTA GGGAAGAGGA GCTGGTGCGT CTTTCCCTGG CTCTTCGCGA CCTGGTGCTC
AAGGGGGAGG TCGAGGGAGA GGAACTGCTG GCCCTGCTCC CGTCGGGAGG TGCCGAATGC
GCATAA
 
Protein sequence
MPVRFIHTSD IHLGKTYRCL GGDAERYQDF FTTFAAIIAD AVEERVDFVL IGGDLFHTGQ 
ILPKTFAKTI EILQPLKDAG IPCLAVEGNH DWIHRRDSVS WMEALSQLGY IRLLRPSRTG
DGDYLFAPFD LEQGAGGHLE IGGVNIYGLG YIGSQAANHV ARICEAVDTR RNILLFHVGV
WSYSPVEIGN IRPEEALPLS ECFDYVALGH GHKPYVVSTP DGRPYAFNPG SPDCVNFGEE
RYDKGYYLVS LEEGGETLHE FRRCSPRPML VLTVNLEGAK NADEALQRFA SGVAEKLGGS
SDPRSPLIEV RLCGKVGFHP FELSRDRLRL ALFEVCQPLH LEIKNHLSQV SGGGGEEKVK
KSLAEIERDV LAELVGANSQ YQGREEELVR LSLALRDLVL KGEVEGEELL ALLPSGGAEC
A