Gene GM21_1579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1579 
Symbol 
ID8136910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1842180 
End bp1844021 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content65% 
IMG OID644869192 
ProductUbiD family decarboxylase 
Protein accessionYP_003021392 
Protein GI253700203 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGATATA AGAATCTGGC AGCGTGCGTG ACGGACCTGG AGCGGACCGG CGCACTGGTA 
AGGATAAACG AGGAACTCTC TTCTGACCTG GAGATCGGGT CCATCCAGCG CAGGGTGTAC
CAGGCGGGTG GCCCGGCGCT TCTCTTTACC CGGGTGAAGG GATGCTCTTT CCCGATGCTC
GGGAACCTGT TCGGGACCCT GGAGCGGACC AGGTACATCT TCAGGGATAC CCTGAGGGCG
GTCGAGCGTC TGGTGCAGTT GAAGATCGAT CCGAAATCGG CACTGAAAGA CCCCGCTTCC
TTTTTGGCGG CGGTCCCGGC CGCATGGCAC CTCATTCCCA AGGAGGTGGG GGACGGCCCC
ATCCTCGCCA ACCGGACCAC CATCGATAAA CTGCCGCAGC TAAAATCCTG GCCGGACGAC
GGCGGCGCCT TCATCACGCT GCCGCAGGTC TACTCGGAGA GCGTGGCCCA ACCGGGATTG
CGCCACTCCA ACCTCGGCAT GTACCGGGTG CAGATCTCCG GCGGCGAGTA CCGGCAAAAC
GCTGAGGTGG GGGTGCACTA CCAGATCCAT CGCGGCATCG GCTTTCACCA TGCGGAGGCG
ATCGAGCGGG GCGAGCCGTT GCGGGTGAAC ATCTTCGTGG GGGGCGCACC CTCCATGACT
GTCGCCGCCG TGATGCCCTT GCCCGAGGGG ATGCCGGAAC TCTCTTTCGC CGGGCTTCTG
GCCGGGCACC GGATCGAGAT GGTGCAGCGC CCCGGGCAGC TCCCGATTCC CGCGCAGGCC
GACTTCTGCA TCACCGGCGT CATCGACCCC AACAAGACCC TTCCCGAAGG CCCCTTCGGC
GACCATTTCG GCTACTACAG CCTGGCGCAC CACTTCCCCG TGCTCCAGGT CGAGGAGGTC
TTCCATCGCG ACGGCGCCAT CTGGCCCTTC ACCACTGTGG GGCGCCCTCC GCAGGAGGAT
ACCTCCTTCG GCGCCTTCAT TCATGAACTG ACCGGTCCCT TGATCCCCAC GGTGATACCG
GGCGTCAAGG CGGTGCACGC GGTGGACGCG GCCGGAGTGC ACCCGCTGCT TTTGGCCTTG
GGTAGCGAGC GCTACGTCCC CTACGGCGAG CGCCGGACTC CGCAGGAACT CCTCACCATC
GCGAACGCGG TGCTCGGACA GGGGCAGCTC TCCCTGGCCA AGTACCTGAT GATCGCCTCC
CACGAGGACG CGCCGCAGCT CGACATCCAC GACATCCCCG CCTTCCTGCG CCATGTGCTG
GAGCGGATCG ACCTGAAGCG CGATCTGCAT TTCCAGACCG CCACCACCAT CGACACGCTC
GATTACTCCG GCTCAGGGCT GAACAGCGGC TCCAAGGTGG TGTTCGCCGC CGTCGGCGAA
AAGCGCCGCA CCCTCGGGGT CGAACTCCCC TCCTCGTTGA GCCTGGCCGA CGGCTTCAAT
GATCCCTGTA TTTGCCTCCC CGGCGTCATC GCGGTCAAGG GGCCTGCCTG CACCGTCCGG
AAGGGGGAGG CGGACCCGCA GATGGAGGCG CTTTGCGCTG CGCTCGAGGG AGTGGAGGGG
CTGGAGAGTT TCCCGCTGAT CGTCGTCTGC GACGACAGCA GGTTCGCCGC AAAAGATCTG
GACAACTTCC TCTGGGTCAC CTTCACCCGT TCCGATCCCG CCGCCGACAT CTACGGTGTC
GGGGCCGGCA TGGTTTGCAA GCAGTGGGGG TGCACAGGTC CCCTGGTGAT AGACGCCCGG
GTCAAGCCGC ACCACGCGCC GCCGCTCATC GAGGATCCGG CCGTCGAGCG GAAGCTGGAC
CAGTTGGCCG CCCCAGGAGG GCCGCTGCAC GGGTTGTATT AG
 
Protein sequence
MGYKNLAACV TDLERTGALV RINEELSSDL EIGSIQRRVY QAGGPALLFT RVKGCSFPML 
GNLFGTLERT RYIFRDTLRA VERLVQLKID PKSALKDPAS FLAAVPAAWH LIPKEVGDGP
ILANRTTIDK LPQLKSWPDD GGAFITLPQV YSESVAQPGL RHSNLGMYRV QISGGEYRQN
AEVGVHYQIH RGIGFHHAEA IERGEPLRVN IFVGGAPSMT VAAVMPLPEG MPELSFAGLL
AGHRIEMVQR PGQLPIPAQA DFCITGVIDP NKTLPEGPFG DHFGYYSLAH HFPVLQVEEV
FHRDGAIWPF TTVGRPPQED TSFGAFIHEL TGPLIPTVIP GVKAVHAVDA AGVHPLLLAL
GSERYVPYGE RRTPQELLTI ANAVLGQGQL SLAKYLMIAS HEDAPQLDIH DIPAFLRHVL
ERIDLKRDLH FQTATTIDTL DYSGSGLNSG SKVVFAAVGE KRRTLGVELP SSLSLADGFN
DPCICLPGVI AVKGPACTVR KGEADPQMEA LCAALEGVEG LESFPLIVVC DDSRFAAKDL
DNFLWVTFTR SDPAADIYGV GAGMVCKQWG CTGPLVIDAR VKPHHAPPLI EDPAVERKLD
QLAAPGGPLH GLY