Gene Information |
Locus tag | GM21_0218 |
Symbol | |
ID | 8135524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 261551 |
End bp | 262627 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644867839 |
Product | Radical SAM domain protein |
Protein accession | YP_003020061 |
Protein GI | 253698872 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
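
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 261551
end_bp = 262627
gene_length_bp = 1077
protein_length_aa = 358


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, ASSUMING one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Report whether the record's stated lengths agree with its coordinates.
print("gene length consistent:", computed_length == gene_length_bp)
print("protein length consistent:", computed_protein == protein_length_aa)
```

Under these assumptions the record is internally consistent: the coordinate span gives 1077 bp, which is 359 codons, or 358 residues once the stop codon is excluded.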
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTTG CCACCATAGC GGAGAGGGTG CGGGCGGGCG CGCGCATCAC GGAGCCGGAG GCGCTCGTGC TTTTCGAGCA CCCGGACCTC CTGGCGCTCG GGGAACTGGC CCAGTCCGTC AACGAGCGGC GTAACGGGAA GAGGGTTTTC TTCAACGTCA ACCGCCACAT CAACCACACC AACATCTGCG TGAACCGATG CCGCTTCTGC GCCTTCTTCC GCAAAGCCGG CGACCCCGGC GCCTACCTCA TGACTCTGGA CGAGGTTCGC GGCCGCGCCG AGGAGGCGGT GAAGGAAGGG GCCACCGAGA TCCACGTGGT CGGAGGCCTC CACCCCGAGC TCCCTTTCGA GTTCTACCTG GAGCTTCTTT CCACCGTCAA GGCGGTCTCG CCGGCGCTGC ACGTGAAAGC CTTCACCGCG GTCGAGATCG CTTACCTGGC CGAGCTCTCC GGCCTCGGCA TCCCGGCGAC CCTGGAGAAG CTGAAGGAGG CGGGGCTCGG CTCACTTCCC GGCGGCGGCG CCGAGATCTT CGCGCCGGAG ATCCGCAACC AGCTCTGCCC GGAGAAGATC AGCGGCGCCG CCTGGCTCTC CATCATGGAG CAGGTGCACC AGGCGGGGCT CAAGTCCAAT GCCACCATGC TCTACGGGCA CCTGGAGAGC GTGGCCGACC GGGTGGACCA CATGCGGCAG CTACGCGAGA TGCAGGATCG TACCGGCGGC TTCCAGGTTT TCATCCCGCT CGCCTTCCAA CCGGAGCATT CGCAGTTGAA GATCGCAGGC TCCGGCACAA GCGGCGTGGA TGACCTGCGC ACCCTGGCCG TCGCCCGCAT CTACCTGGAC AACTTCGCCA ACGTCAAGGC CTACTGGGTG ATGCTGGGGG AGAAGATCGC CCAGGTCTCC CTTTCCTTCG GCGTCAACGA CCTTGACGGT ACCGTGGTCG AGGAGCGGAT CGGGCACGAG GCGGGGGCCG ATACCCCGCA GACCATGAGC CGCGACAACA TCGTCACCAT GATCAGGAAG GCCGGCCGCA TACCGGTGGA GCGGGACACG CTCTACCAGG AATTGCGCGT GTATTGA
|
Protein sequence | MTFATIAERV RAGARITEPE ALVLFEHPDL LALGELAQSV NERRNGKRVF FNVNRHINHT NICVNRCRFC AFFRKAGDPG AYLMTLDEVR GRAEEAVKEG ATEIHVVGGL HPELPFEFYL ELLSTVKAVS PALHVKAFTA VEIAYLAELS GLGIPATLEK LKEAGLGSLP GGGAEIFAPE IRNQLCPEKI SGAAWLSIME QVHQAGLKSN ATMLYGHLES VADRVDHMRQ LREMQDRTGG FQVFIPLAFQ PEHSQLKIAG SGTSGVDDLR TLAVARIYLD NFANVKAYWV MLGEKIAQVS LSFGVNDLDG TVVEERIGHE AGADTPQTMS RDNIVTMIRK AGRIPVERDT LYQELRVY