Gene GM21_0215 details


Gene Information

Locus tag: GM21_0215
Symbol:
ID: 8135521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 258662
End bp: 260008
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 66%
IMG OID: 644867836
Product: UbiD family decarboxylase
Protein accession: YP_003020058
Protein GI: 253698869
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
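
The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(258662, 260008)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the sketch gives 1347 bp and 448 aa, which agrees with the Gene Length and Protein Length fields.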


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 78
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTTA AGGACTTGCG AGGTTTCATC GCCGGACTGG AAGCTGCCGG GGAACTCCAG 
AGGGTGGCTG CGGAGGTCGA TCCCGACTTG GAGATCGCCT GCATCACCGA CCGGCAAAGC
AAACTTCCCG GCGGCGGTAA GGCGCTGCTC TTCGAGAACG TGAAAGGGAG CCCCTTTCCA
GTTGCCACCA ATCTCTTCGG TTCGCCGGCT CGCATGGCGC TCGCTCTGGG CGTCGCAAAG
CTGGACCGGC TCTCGGAGGC GATGGAAGCA TTGCTCCTTT GCCCCGGGCG GGCCCCCCTC
CCGCTGCTGG TCGATCAAGC CCCCTGCCGG GAGGTCGTGG AGCTGGCCCC GGATCTGCTG
TGCTACCCCT TCCTGAAGAG CTGGCCCGGC GACGGCGGGC GCTTTATCAC GCTTCCGCTG
GTGTTCACCA GGGACCCTGA GACCGGCGCG GACAACTGCG GCATGTACCG GGTCCGGATC
TTCGACGACC GTAGCGCCGG GGTGCGATGG AAAAACGGCA GCGGCGGTTG GGAGCATTAC
CAAAAGCACC TCGCGTCCGG GAAGCGGATG CCGGTCGCCA TCGCCATAGG CGCCGACCCG
GCGCTGACGC TCGCCGCGTC GCTTCCGCTC CCCGCCGGCC TGATCGAGGT TTCCCTTGCC
GGGTACCTGC GCGGCGAGCC GGTGCCGATG CTCCGCTGCC TCGATTCGGA CCTCCTGGTT
CCGGCGGACG CCGAGTTGGT GATCGAGGGG TTCGTCGAGC CTGGGGTGAC CCGCAACGAA
GGAGATTTCG GCAACCATAC CGGAAGCTAC GATCAGGGCG AAGAGGTGCC GCTGCTGACG
GTCACCTGCA TCACGCGCAG GCGCGACCCC ATCTGCCAGG CGACGGTGGT GGGCCCTCCT
CCCATGGAGG ACTGCTGGAT GGCCAAGGCT GCCGAGCGGC TGCTGCTGCC GCTTATCCGC
AGGCAGTGCC CGGAGATCGT CGACCTGCTT TTGCCGCTGG AGGGAATCTT CCACGGCTGC
GCCTTGATCG GCATAAAGAA GAGCCTTCCC GGGCAGGGGC GGCGCGTGCT GGAGACGCTG
CGCTTGGAGG GATGGTTGAA GCGGGGAAAA CTTTTGGTGG CGATCGACGC CACAGATAAC
CCCCTCACGC TGTCGGAGGG TTTCTGGCGG GCGCTGAACG CGGTCAGTTT TCCGCGCGAC
CTGGCGGTCA CCCCCGATGG GTGCCTCGGG GTCGACGCGA CGAGAAAGCT CCCGGAAGAG
GGGGGCGGAC AGTACCGGGA GTTGAAACAG GATGCATCGG TTAGTGCCCA GGTTGCGAGG
AGATGGCGGG AGTACGGCTT TCTCTAG
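
A GC content figure like the one listed under Gene Information can be recomputed from a sequence printed in this layout. The sketch below is one possible way to do it, assuming the sequence is pasted as a string with the 10-base blocks and line breaks still present. The helper name `gc_content` is hypothetical, and the database may round or compute its own value differently.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, used as a small example.
print(gc_content("ATGGCGTTTA AGGACTTGCG"))  # 10 of 20 bases are G or C -> 50.0
```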
 
Protein sequence
MAFKDLRGFI AGLEAAGELQ RVAAEVDPDL EIACITDRQS KLPGGGKALL FENVKGSPFP 
VATNLFGSPA RMALALGVAK LDRLSEAMEA LLLCPGRAPL PLLVDQAPCR EVVELAPDLL
CYPFLKSWPG DGGRFITLPL VFTRDPETGA DNCGMYRVRI FDDRSAGVRW KNGSGGWEHY
QKHLASGKRM PVAIAIGADP ALTLAASLPL PAGLIEVSLA GYLRGEPVPM LRCLDSDLLV
PADAELVIEG FVEPGVTRNE GDFGNHTGSY DQGEEVPLLT VTCITRRRDP ICQATVVGPP
PMEDCWMAKA AERLLLPLIR RQCPEIVDLL LPLEGIFHGC ALIGIKKSLP GQGRRVLETL
RLEGWLKRGK LLVAIDATDN PLTLSEGFWR ALNAVSFPRD LAVTPDGCLG VDATRKLPEE
GGGQYRELKQ DASVSAQVAR RWREYGFL
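
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough illustration, the sketch below builds a codon table in the usual NCBI ordering (bases in the order T, C, A, G) and translates up to the first stop codon. It relies on the amino acid assignments of table 11 being the same as those of the standard code. It does not handle the alternative start codons that table 11 allows, which is sufficient here only because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGGCGTTTA AGGACTTGCG AGGTTTCATC"))  # MAFKDLRGFI
```

The output for the first 30 bases matches the first ten residues of the protein sequence. The final three codons of the gene (TTT CTC TAG) translate to F, L and a stop, consistent with the protein ending in "FL".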