Gene Gmet_3083 details

Gene Information

Locus tag: Gmet_3083
Symbol:
ID: 3740781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 3479819
End bp: 3480910
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 62%
IMG OID: 637780371
Product: HAD family hydrolase
Protein accession: YP_386022
Protein GI: 78224275
COG category: [R] General function prediction only; [S] Function unknown
COG ID: [COG1011] Predicted hydrolase (HAD superfamily); [COG1633] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
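
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions about how this record was built, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3479819, 3480910)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.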


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGGA TTCGCGCCCT GGTTTTCGAC CTGGACGGCA CCCTCTACGA CAGTGAAGGG 
GTCGGTCGCC AGATCGACGC CTCCGCCATG GGGCATGTGG CCGCGGTACG GGGGATTTCC
CCCGAGGAGG CGAGACTCCT CATCCGGGAG ACCCGGGAAA AGATAGCCGT CCGCACCGGC
CGGACCGCCT CCCTGAGCCA TGTCTGTCTG GAACTGGGGA TTGACCTGCG GGAGCTCCAT
CGCCGTTTTG AGGCCGAGAT CGAACCGGAG CCCTTCCTCA CCCGGGACGA ACGGGTGGTG
GAGTTGCTGG AGAGGCTGGG CGAGCGGTTC GATCTCCATA TCTACACCAA CAACAACCGC
CTCCTCTCTT CGCGGATCAT GACGGCCCTC GGCGTTGACG GGTGCTTCCG GCGCATCTTT
ACCATCGAGG ATTCCTGGCG CCCCAAACCG GACCGTCAGG TTCTCGAAGA GATTTTCCGG
GAGATCGGGC AAGAACCCTC CCACTGCCTC TTCGTGGGGG ACCGTTACGA CATCGACCTG
CGGCTGCCGC GGGAGCTCGG CTGCCGGGTA TTTCACTCCA GGACCGTTGA CGAACTTTTG
ACCATTGAAA CCACTTTACC GTCAGGAGCG CCCATGAACG ACCAGACCAA AGAGACCCTC
GACGCCATCA TGCGGGCCAT TGAGATCGAA AAGGAGACCT TTGACTTTTA CACCAGGGCC
GAGCGGAAAA CCTTCAATCC CGAGGGGAAG CGGATCTTCC GCTGGCTTGC CAAGACCGAG
GAGCAGCACT ACCTGAAGCT GAACGAGCTC TACCAGTCAC TCCACGAGGG GGGGCGCTGG
GTCTTCTACG GCGGCTCCAC CATTACCCTC GACCCGGCGG GCTCCGGGGA GAAGCAGGTG
GGGTTCGACA CCGACGACCT TCAGGCCCTG GAGATCGCTA TGGAGATCGA GAAGAAGGGA
ATCGCCTACT TCGACGACCT CATGGCGAAG ACCGCCGACG CCGACGGCAA GGGGATGCTC
AAGGCCCTGC GGGACGAGGA GACCGAGCAC TTGCGGGTGA TTACCGAGAA GTACAACGCC
ATCAAGGGGT AA
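
A GC percentage can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch that assumes GC content is simply (G + C) divided by total length; how the source database derives its figure is not stated here, and the function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence (all rows) in place of this first row.
first_row = "GTGAGCGGGA TTCGCGCCCT GGTTTTCGAC CTGGACGGCA CCCTCTACGA CAGTGAAGGG"
print(round(gc_percent(first_row), 1))
```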
 
Protein sequence
MSGIRALVFD LDGTLYDSEG VGRQIDASAM GHVAAVRGIS PEEARLLIRE TREKIAVRTG 
RTASLSHVCL ELGIDLRELH RRFEAEIEPE PFLTRDERVV ELLERLGERF DLHIYTNNNR
LLSSRIMTAL GVDGCFRRIF TIEDSWRPKP DRQVLEEIFR EIGQEPSHCL FVGDRYDIDL
RLPRELGCRV FHSRTVDELL TIETTLPSGA PMNDQTKETL DAIMRAIEIE KETFDFYTRA
ERKTFNPEGK RIFRWLAKTE EQHYLKLNEL YQSLHEGGRW VFYGGSTITL DPAGSGEKQV
GFDTDDLQAL EIAMEIEKKG IAYFDDLMAK TADADGKGML KALRDEETEH LRVITEKYNA
IKG
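
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below builds the codon table from the amino-acid string commonly used for the standard bacterial code (translation table 11) and treats a GTG, TTG, or ATG initiator codon as methionine. The set of start codons is a deliberately reduced assumption for illustration, and `translate_cds` is a hypothetical helper, not the database's own tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only these initiator codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("GTGAGCGGGA TTCGCGCCCT GGTTTTCGAC"))  # MSGIRALVFD
```

The GTG start codon explains why the protein begins with M even though GTG normally encodes valine.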