Gene GM21_3803 details

Gene Information

Locus tag: GM21_3803
Symbol:
ID: 8139177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4374718
End bp: 4375575
Gene Length: 858 bp
Protein Length: 285 aa
Translation table: 11
GC content: 66%
IMG OID: 644871422
Product: modification methylase, HemK family
Protein accession: YP_003023580
Protein GI: 253702391
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2890] Methylase of polypeptide chain release factors
TIGRFAM ID: [TIGR00536] HemK family putative methylases
            [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific
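
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the gene length includes the stop codon; the record states neither convention, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4374718
end_bp = 4375575

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result is 858 bp and 285 aa, which matches the gene and protein lengths listed in the record.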


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 115
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCA ACGCCGAAAA ATGGGACGTA CTCAAAGTCC TGAATTGGAC CAAGGGTTAC 
CTCGCCGAAA AGGGTGTGGA GAACCCCCGC CTGGAAGCGG AGTGGATGCT CTGCGAGGCG
CTCTCGCTGG ACCGGGTGGG GCTCTACCTC AACTTCGACA AGCCGCTCTC CGACGCCGAG
CTCGCCCTCT ACCGCGGCAT GGTCGCCCGG CGCGGCAGGC GCGAACCGCT GCAGTACATC
CTGGGTAGCC AGGAGTTCAT GGGGCTCGAA TTCCGGGTCA CCCCCGCCGT CCTGATCCCG
CGCCACGACA CCGAGGTGCT GGTGACCGAG GCGGTGAAGA GGGGAGGCGC GTGCCGCAGC
ATCCTCGACA TCGGCACCGG GAGCGGCTGC GTCGCCATCG CCGTCGCCAA GGCGCTCCCC
GAGGCCGAAG TCTGCACCGT GGACGTTTCC GGCGAGGCAA TCGAGGTGGC CCGGGGGAAC
GCGGAGCGAA ACGGGGTCTC CGTGCAGTTT TTCCAGGGCT CGCTGTTCGA GCCGTTTGCC
GGGAAGCGTT TCGATATGCT AGTATCCAAC CCGCCCTACA TCACTTCGGC TGATCTAGCT
TCCCTCCAGC AGGAGGTGCG CGACTTCGAG CCGGCGGGCG CCCTGGACGG GGGAGGCGAC
GGGCTCGATT TCTACCGGCG CATCACGGCC GGCGCCCCGG CGCACCTCAA TCCGGGCGGC
TGGCTCTTGT TCGAAGTGGG GGCCGGGCAG GCAGGGGAGG TGCTGGAGCT CTTGAACTCC
GGCGGTTTCA CCAACGAAAG GTTCAGCCAG ACCGACCCCG CAGGTATTGA GCGGGTGGTA
GGCGCAAGGC TTCAGTAA
 
Protein sequence
MTANAEKWDV LKVLNWTKGY LAEKGVENPR LEAEWMLCEA LSLDRVGLYL NFDKPLSDAE 
LALYRGMVAR RGRREPLQYI LGSQEFMGLE FRVTPAVLIP RHDTEVLVTE AVKRGGACRS
ILDIGTGSGC VAIAVAKALP EAEVCTVDVS GEAIEVARGN AERNGVSVQF FQGSLFEPFA
GKRFDMLVSN PPYITSADLA SLQQEVRDFE PAGALDGGGD GLDFYRRITA GAPAHLNPGG
WLLFEVGAGQ AGEVLELLNS GGFTNERFSQ TDPAGIERVV GARLQ
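
The sequences above are printed in 10-character blocks, so they need cleaning before any length or composition check. The following is a minimal sketch, not part of the original record: the helper names `clean_sequence`, `gc_fraction`, and `lengths_consistent` are hypothetical, and the length check assumes an unspliced gene that ends in a single stop codon.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """Check that the DNA holds one codon per residue plus one stop codon.

    Assumption: the gene is unspliced and ends in exactly one stop codon.
    """
    return len(dna) == 3 * (len(protein) + 1)


# First two blocks of the gene sequence, used here as a small sample.
sample = clean_sequence("ATGACCGCCA ACGCCGAAAA")
print(len(sample), gc_fraction(sample))
```

Passing the full gene and protein sequences through `clean_sequence` should give 858 and 285 characters respectively, the lengths listed under Gene Information.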