Gene GM21_2548 details

Gene Information

Locus tag: GM21_2548
Symbol:
ID: 8137890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2976989
End bp: 2978149
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 65%
IMG OID: 644870157
Product: metal dependent phosphohydrolase
Protein accession: YP_003022347
Protein GI: 253701158
COG category: [R] General function prediction only
COG ID: [COG3481] Predicted HD-superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
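
The start, end, gene length, and protein length fields are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2976989, 2978149)
print(length, protein_length(length))  # 1161 386
```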


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 1.98451e-17
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAAAAGA AATGCGTTGC CGAGATAAAG GACCGCGACC TGGTGGACGC GGTCTTCCTG 
GTCAAGGAGA AGATCGTGGC CATGGCCAAG AACGGCAAGC CGTACCTCAC CCTGAAGCTT
ATGGACAGAA GCGGCGAGGT CGACGCCAAG GTCTGGGACA ACGCGGACCA GGTGGGGGCG
CTCTTCGACC GCAACGACTT CCTCGCGGTG CGCGCGAAGG CGAGCGTCTA CCTGGGAAAG
ATGCAGCTGA TCGTCTCGGA GCTTAAGAAG GTCCCCGACG ACTCGGTGGA TCTGGCGGAC
TTCCTTCCCG AAACCGACCG GGACGTCAAG GCGATGGTCG AGGAGCTGCA CGCCCTCGTC
GCCGGCGTGA AGGACCCGGA CCTCGCGCGG CTTTTGTCCT CCTTCTTCCA CGACCCAGAG
CTTTTGGCCC AGTATCGCGT CGCCCCCGCG GCCAAGGGGA TGCACCACGT CTATCTCGGG
GGGCTTTTGG AGCACTCGCT CGCCGTGGCG AAGCTGGTGG ACGCCATGGT CCCGCTCTAC
CCGGGGCTGA ACCGGGACCT CCTCGTCGCC GGGGCACTTT TGCACGACGT GGGGAAGGTG
CGCGAGATGA CCTACCTGCG CTCCTTCGAC TACTCCGACG AGGGGAAGCT GATCGGCCAC
ATCACCATCG GCGCCGAGAT GCTGCACGAG CGGATCACGG CGCTGCCGGG TTTTCCGGCC
GAGCTCGCCA TGCTCTTGAA GCACATGATC CTGTCGCATC ACGGCCAGTA CGAGTACGGC
TCCCCCAAGC GCCCGAAGAC GCTGGAGGCG ACCATCCTCA ACTACCTGGA CGACCTCGAC
TCCAAGATCA ACGGCATCAG GACCCACATC CGCAAGGAGC CGGACAATCC CTCGCGCTGG
ACCGCGTACC ACCGCCTCTA CGACCGCTAC TTCTTCAAGG AGAACTGCCT GCCTGAGGAG
GAGCTGGAAA TCTCCCCCGC GGATTGCCTG GAGCCGTCCG AGCTGATGCC GCAGACGGTG
GAGGCGCCGA GCCCTCTCCC GGCGAGCGTG CCGGAGCAGG AAGCGCCGCG CCGGGAGCGC
CCTGAGGCAC CCCGCGGCGA CCAGGGGCGC AAGAGCTTCA GCAACAACCC TTTCGCCGCG
CTTAAAAACG GCAAGGGTTA A
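
The record reports a GC content of 65%. A figure like that could be recomputed from the gene sequence with a helper along these lines. This is a minimal sketch: `gc_percent` is a hypothetical name, and how the database rounds the value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used only as a short input.
print(gc_percent("GTGAAAAAGA"))  # 30.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.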
 
Protein sequence
MKKKCVAEIK DRDLVDAVFL VKEKIVAMAK NGKPYLTLKL MDRSGEVDAK VWDNADQVGA 
LFDRNDFLAV RAKASVYLGK MQLIVSELKK VPDDSVDLAD FLPETDRDVK AMVEELHALV
AGVKDPDLAR LLSSFFHDPE LLAQYRVAPA AKGMHHVYLG GLLEHSLAVA KLVDAMVPLY
PGLNRDLLVA GALLHDVGKV REMTYLRSFD YSDEGKLIGH ITIGAEMLHE RITALPGFPA
ELAMLLKHMI LSHHGQYEYG SPKRPKTLEA TILNYLDDLD SKINGIRTHI RKEPDNPSRW
TAYHRLYDRY FFKENCLPEE ELEISPADCL EPSELMPQTV EAPSPLPASV PEQEAPRRER
PEAPRGDQGR KSFSNNPFAA LKNGKG
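
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11, GTG is one of the alternative initiation codons and is read as methionine when it starts the CDS. The sketch below illustrates this with the standard library only. `translate_cds` is a hypothetical helper, not the tool used to produce this record, and it assumes an unambiguous A/C/G/T sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# codon assignments as the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAAAAGA AATGCGTTGC CGAGATAAAG"))  # MKKKCVAEIK
```

The output matches the first ten residues of the protein sequence listed above.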