Gene GM21_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0218 
Symbol 
ID8135524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp261551 
End bp262627 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content65% 
IMG OID644867839 
ProductRadical SAM domain protein 
Protein accessionYP_003020061 
Protein GI253698872 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTTG CCACCATAGC GGAGAGGGTG CGGGCGGGCG CGCGCATCAC GGAGCCGGAG 
GCGCTCGTGC TTTTCGAGCA CCCGGACCTC CTGGCGCTCG GGGAACTGGC CCAGTCCGTC
AACGAGCGGC GTAACGGGAA GAGGGTTTTC TTCAACGTCA ACCGCCACAT CAACCACACC
AACATCTGCG TGAACCGATG CCGCTTCTGC GCCTTCTTCC GCAAAGCCGG CGACCCCGGC
GCCTACCTCA TGACTCTGGA CGAGGTTCGC GGCCGCGCCG AGGAGGCGGT GAAGGAAGGG
GCCACCGAGA TCCACGTGGT CGGAGGCCTC CACCCCGAGC TCCCTTTCGA GTTCTACCTG
GAGCTTCTTT CCACCGTCAA GGCGGTCTCG CCGGCGCTGC ACGTGAAAGC CTTCACCGCG
GTCGAGATCG CTTACCTGGC CGAGCTCTCC GGCCTCGGCA TCCCGGCGAC CCTGGAGAAG
CTGAAGGAGG CGGGGCTCGG CTCACTTCCC GGCGGCGGCG CCGAGATCTT CGCGCCGGAG
ATCCGCAACC AGCTCTGCCC GGAGAAGATC AGCGGCGCCG CCTGGCTCTC CATCATGGAG
CAGGTGCACC AGGCGGGGCT CAAGTCCAAT GCCACCATGC TCTACGGGCA CCTGGAGAGC
GTGGCCGACC GGGTGGACCA CATGCGGCAG CTACGCGAGA TGCAGGATCG TACCGGCGGC
TTCCAGGTTT TCATCCCGCT CGCCTTCCAA CCGGAGCATT CGCAGTTGAA GATCGCAGGC
TCCGGCACAA GCGGCGTGGA TGACCTGCGC ACCCTGGCCG TCGCCCGCAT CTACCTGGAC
AACTTCGCCA ACGTCAAGGC CTACTGGGTG ATGCTGGGGG AGAAGATCGC CCAGGTCTCC
CTTTCCTTCG GCGTCAACGA CCTTGACGGT ACCGTGGTCG AGGAGCGGAT CGGGCACGAG
GCGGGGGCCG ATACCCCGCA GACCATGAGC CGCGACAACA TCGTCACCAT GATCAGGAAG
GCCGGCCGCA TACCGGTGGA GCGGGACACG CTCTACCAGG AATTGCGCGT GTATTGA
 
Protein sequence
MTFATIAERV RAGARITEPE ALVLFEHPDL LALGELAQSV NERRNGKRVF FNVNRHINHT 
NICVNRCRFC AFFRKAGDPG AYLMTLDEVR GRAEEAVKEG ATEIHVVGGL HPELPFEFYL
ELLSTVKAVS PALHVKAFTA VEIAYLAELS GLGIPATLEK LKEAGLGSLP GGGAEIFAPE
IRNQLCPEKI SGAAWLSIME QVHQAGLKSN ATMLYGHLES VADRVDHMRQ LREMQDRTGG
FQVFIPLAFQ PEHSQLKIAG SGTSGVDDLR TLAVARIYLD NFANVKAYWV MLGEKIAQVS
LSFGVNDLDG TVVEERIGHE AGADTPQTMS RDNIVTMIRK AGRIPVERDT LYQELRVY