Gene GM21_0842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0842 
Symbol 
ID8136158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp997878 
End bp999527 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID644868454 
ProductAcetyl-CoA hydrolase 
Protein accessionYP_003020668 
Protein GI253699479 
COG category[C] Energy production and conversion 
COG ID[COG0427] Acetyl-CoA hydrolase 
TIGRFAM ID[TIGR03458] succinate CoA transferases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones116 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAATT ATGGAACCCT GCAGGACCGC GTGCGCTGCA AGTCGCTTCT GAACAAAGTG 
ATGTCCCCCG AACAGACCAT CGGCTTCTTC AAGGACGGGA TGAACCTGGG CTGGTCCGGT
TTCACCCCGG CCGGCTACCC AAAAGCGGTG CCCATCGCCC TGGCGGACCA CGTCGAGAAG
AACGGGCTGC AAGGCAAACT CAGGTTCAAC CTCTTCATCG GCGCCTCGGT CGGAGCGGAA
ACCGAAGACC GTTGGGCGAC CCTCGACATG ATCGACCGCC GCTGGCCCTA CCAGACCGGC
AAGAACATCG CCGCGGGGAT CAACGCCGGC CGCATCCGCA TGGGGGACAA GCACCTCTCC
CTGTTCGCCC AGGATCTCGG CTACGGCTTC TACACCAAGG ACACCCCGAG CGGCAAGCTC
GACCTCGCCA TCATCGAGGT CTCGGCCATC ACCGAAGACG GTGGGCTGGT GCTGACCTCT
TCCTGCGGCG TCGTCCCCGA AATCCTGATG ATCTGCGACA AGATCATCCT CGAGGTGAAC
ACCGGACAGC CCTCCTTCGA GGGGATGCAC GACGTCGTGG TCTGCAATCA CCCCCCCAAG
CGCCAGATCC TGGGGATCAC CAGCGCCGGC GAGCGCATCG GCAGCACCTA CGTCCCGTGC
GACCCCAGCA AGGTGATCGC CGTGGTCGAG TCCAAGCACC GCGACAAGGG GCGCGCCTTC
TCCGAGCAGG ACGACACCTC CGAGGCGATC GCCAATAACA TCATCGACTT CTTCAGCCAC
GAGGTGAAGG CGGGGCGCCT GCCCAAGAAC CTCCTCCCGC TGCAGTCCGG CGTAGGTTCC
ATCGCCAACG CCGTCATCGG CGGCCTGGCC AAGGGTCCCT TCTCGAACCT CACCGTCTAC
ACCGAGGTGC TGCAGGACAC CATGCTCGAC CTCTTTGACT CGGGCAAGCT GGACATGGCG
TCTTCCTGCT CCCTGTCGCT CTCAGAGACC CCGGGCTTCC CGCGTTTCTT CGACAACATG
GAGAAGTACT TCGACAAGAT CGTGCTGCGC CCGCTCTCCA TCTCCAACGC CCCCGAGCCG
ATCCGTCGCC TTGGGTGCAT CGCGATGAAC ACCCCGGTCG AGATCGACAT CTACGCGCAC
GCCAACTCGA CGCTTGTCGG CGGCACCCGC ATGATCAACG GCCTGGGCGG CTCGGGCGAC
TTCCTGAGGA ACGGGTTCCT GAAGATCATG CACACCCCGT CCTCCCGCCC CTCGAAGACC
GATCCCAACG GCATCTCCTG CGTGGTGCCG CACTGCTCGC ACATCGACCA CACCGAGCAC
GACCTCGACT GCGTGGTTAC CGAGCAGGGG CTTGCCGACC TGCGCGGCAT GGCTCCCAAG
GAGCGCGCCC GCCGCATCAT CGAGAAGTGC GCGCACCCCG ACTACAAGCC GATCCTCACC
GAGTACCTCA ACATCGCCGA GAAGCAGTGC CTCGCGAAGA ATGTCGGCCA CGAGCCGCAG
CTTTGGGACC GCGCCTTCAA GATGCACCTG AACCTCGCCG CGAACGGTAC CATGAAGATC
AAGAACTGGG ACATGAAGGT CGACCTCTGC GACGAGGTAG CCGAGCGCCC GGTGCGCCAG
CCGAGCGTAG GCGACTCCGC CGCGGTTTAG
 
Protein sequence
MSNYGTLQDR VRCKSLLNKV MSPEQTIGFF KDGMNLGWSG FTPAGYPKAV PIALADHVEK 
NGLQGKLRFN LFIGASVGAE TEDRWATLDM IDRRWPYQTG KNIAAGINAG RIRMGDKHLS
LFAQDLGYGF YTKDTPSGKL DLAIIEVSAI TEDGGLVLTS SCGVVPEILM ICDKIILEVN
TGQPSFEGMH DVVVCNHPPK RQILGITSAG ERIGSTYVPC DPSKVIAVVE SKHRDKGRAF
SEQDDTSEAI ANNIIDFFSH EVKAGRLPKN LLPLQSGVGS IANAVIGGLA KGPFSNLTVY
TEVLQDTMLD LFDSGKLDMA SSCSLSLSET PGFPRFFDNM EKYFDKIVLR PLSISNAPEP
IRRLGCIAMN TPVEIDIYAH ANSTLVGGTR MINGLGGSGD FLRNGFLKIM HTPSSRPSKT
DPNGISCVVP HCSHIDHTEH DLDCVVTEQG LADLRGMAPK ERARRIIEKC AHPDYKPILT
EYLNIAEKQC LAKNVGHEPQ LWDRAFKMHL NLAANGTMKI KNWDMKVDLC DEVAERPVRQ
PSVGDSAAV