Gene GM21_4093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4093 
Symbol 
ID8139467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4672478 
End bp4673788 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content64% 
IMG OID644871708 
Productmetal dependent phosphohydrolase 
Protein accessionYP_003023866 
Protein GI253702677 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones147 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACG GGCTGCAGCA TGCACAACGC CTGGCGACGC TCCTGTCGGG CGCGATCAAG 
GGGGCCGGGC TCTATCCGCC TGGGCACCCC GCCTCGTTGC AGCCCTTCCG GGAAATGGAA
GCGTTGATGC TGACGCTGCA GCGAAACGGC GGAGACCTGC GCCTGGCTGT GGTGGATGGA
GTTCTCTCTG TCGGCGAGCA TCTCTTCTTC GCGCCCCCCG CTCCGCTGCA GGAGCTGATC
AACCGTCTCG AGGAGAAGGG GATAGCGGGC CTGGTCCTCA AACCGGGGGT GCTGGCGCCG
GATCTGACCG TGTTGGCGCG CCTGATGGCG GAGGGAAGCG GCGAGGCCTG CGACCTCACG
CGCGGACTCA AGGAGGCCGG GGTAAAACTG ATCGAGGTGA TGGAGGAGAA TTCCCTCTCC
CATACCTACA ACGAGGCGGT CAGCGCGGTG CGCGACATCT TCGAGGAGAT CGGCAAGGGG
CGCATACCCA ACTCCCGGCG CATGCTTACC GTGGTGAGCA GCCTCGCCTC GGCGGCCATC
AAGGAGCCGG CGGCGCTCTT GGGCCTGGCC CTGATCAAGG ATTACGACAA CTACACCTTC
CAGCACAGCG TCAACGTCGG CGTACTCTCC ATGGCGCTCT CAGCGTCCAT GGGACAAGAG
GAGGTCAAGG TGGAGGAGTG CGGCCTGGCC GGTTTTCTCC ACGACATCGG CAAGACCCGG
GTGGACAAGG ATATCCTCAA CAAGCCGGGG AAGCTTAGCA GCGACGAGTT TGTGGAGATG
AGGAAGCATC CGGAATTCGG CGCCGCCATC GTCCGGGAGA TGGAAGGGGT TTCGGAAGGG
GTGGCCGAGG CGGTCCTGGG ACATCACATC CGTTACGACC GGGCAGGATA CCCCGATTGG
GCCAGGGAGA AGGAGTTCGG GACCACCAGC AAGATCGTCG CCGTCGCCGA CTTCTACGAC
GCCACCACCA CGCTGAGAAG CTACCAGCGC CCCATGCTCC CCGACCAGGC GATGAAGGAA
ATCAGGAAAG CGGTGGGGGG AAGCCTCGAC GGCACCATCG TGGAGCGGTT CATGGAGTTG
ACCGGGAAGT ACCCCACAGG GAGCCTGGTT CGGCTCGACA GCAACGAGAT CGCGGTCGTT
TTCTCCCCCA GCAGCCAGCC CTGCGGCGCG GCGGTGGTGA AGGTGGTCAT GGACCGGCAC
GGGAGCCTGC TCGGCGACCC CGAACTGAGA AGCCTCATCA CGAGCGGCGA CAACATCGTC
GACCTGGTGG ATCCTCTGGT CAAGGGGATC GACGTGGCGC AGTACTTTTA G
 
Protein sequence
MTDGLQHAQR LATLLSGAIK GAGLYPPGHP ASLQPFREME ALMLTLQRNG GDLRLAVVDG 
VLSVGEHLFF APPAPLQELI NRLEEKGIAG LVLKPGVLAP DLTVLARLMA EGSGEACDLT
RGLKEAGVKL IEVMEENSLS HTYNEAVSAV RDIFEEIGKG RIPNSRRMLT VVSSLASAAI
KEPAALLGLA LIKDYDNYTF QHSVNVGVLS MALSASMGQE EVKVEECGLA GFLHDIGKTR
VDKDILNKPG KLSSDEFVEM RKHPEFGAAI VREMEGVSEG VAEAVLGHHI RYDRAGYPDW
AREKEFGTTS KIVAVADFYD ATTTLRSYQR PMLPDQAMKE IRKAVGGSLD GTIVERFMEL
TGKYPTGSLV RLDSNEIAVV FSPSSQPCGA AVVKVVMDRH GSLLGDPELR SLITSGDNIV
DLVDPLVKGI DVAQYF