Gene Information |
Locus tag | GM21_0219 |
Symbol | |
ID | 8135525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 262685 |
End bp | 263758 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644867840 |
Product | Radical SAM domain protein |
Protein accession | YP_003020062 |
Protein GI | 253698873 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
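The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 262685
end_bp = 263758
gene_length_bp = 1074
protein_length_aa = 357

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS ends in one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, is a multiple of three, and gives the listed protein length.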
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 86 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAA TGCTCAACGT CATATCTGAA AAAGTCGCCG CAGGTGCGGC CATAACCTCG GAAGAAGCGC TCTGGTGCCT CACCGAGGCG GAACTCCTCG CCGTGGGGCG CATAGCCGAT TCGATCCGCC GCGCCATGCA CCCGGACGGC TGCGTCAGCT TCGTCGTCGA CCGCAACGTC AACTACACCA ACGTCTGCGA GTCCAGGTGC AAGTTCTGCG CGTTCTACCG CGATGCCGAC GCGGCGGACG CCTACCTCCT GGACACCGAA ACCATCATGG CGAAGATCGG GGAACTGGTG GACCAGGGAG GAACCCAGCT CCTGATGCAG GGGGGGCTCC ACCCCTCGCT CGACATCGCC TGGTTCGAGG AGCTCTTCAG GGAGATCAAG CGCCGCTTTC CCGGCGTGCA GAACCATTCG CTCTCCCCGG CGGAGGTCAC CCAGGTGGCG AAGCTCTCCG GCCTTGGCAT CGCCCAGACG CTGGTACGGC TGCAGCAGGC CGGGCTCGAT TCTATCCCCG GAGGGGGGGC CGAAATCCTG GTCGACAGCG TCCGTGCCGA GATCTCACCT AAGAAGATCG GTTGGCAGGG GTGGGCGCAG GTCATGCGCG AGGCTGCCCG GTTAGGGATG CCCACCACCG CTACCATGAT GTTCGGCAGC CGCGAGCGCG CCGAGGATAT CGTCGAGCAC CTGTTCCGGG TGCGCGCGTT GCAGGACGAG GGAGGGAGCT TCACCGCCTT CATCCCCTGG ACCTATCAGC CGGGGAACAC CGAGCTCGGG GGGGAGGGTG CCAGCGGGGT CGAGTACCTG AAGGTGCTGG CCCTGTCGCG CATCGTGCTC AGGAACGTGC CGAACGTGCA GGCGAGCTGG GTGACCCAGG GGGCCAAGAT GGCGCAGGTC GCGCTCTTCT TCGGCGCCAA CGACCTGGGG GGAACCATGC TCGAGGAGAA CGTCGTGGCG GCCGCCGGCT GCCGCTTCCG CATGACGCGC GAGGAGATGA TAGCGCTCAT CCGCGGCGCC GGTTTCACCC CGGTCCGGCG CACCACCCTG TACCGGGAGC TTGAGCGTTA CTGA
|
Protein sequence | MSKMLNVISE KVAAGAAITS EEALWCLTEA ELLAVGRIAD SIRRAMHPDG CVSFVVDRNV NYTNVCESRC KFCAFYRDAD AADAYLLDTE TIMAKIGELV DQGGTQLLMQ GGLHPSLDIA WFEELFREIK RRFPGVQNHS LSPAEVTQVA KLSGLGIAQT LVRLQQAGLD SIPGGGAEIL VDSVRAEISP KKIGWQGWAQ VMREAARLGM PTTATMMFGS RERAEDIVEH LFRVRALQDE GGSFTAFIPW TYQPGNTELG GEGASGVEYL KVLALSRIVL RNVPNVQASW VTQGAKMAQV ALFFGANDLG GTMLEENVVA AAGCRFRMTR EEMIALIRGA GFTPVRRTTL YRELERY
|
| |
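The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch, not the database's own method: it assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may act as starts, which this sketch does not model), and the `translate` function and `CODON_TABLE` name are hypothetical helpers introduced here. Only the first and last few 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators in the record) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence in the record.
head = "ATGTCGAAAA TGCTCAACGT CATATCTGAA"
tail = "TACCGGGAGC TTGAGCGTTA CTGA"

print(translate(head))  # MSKMLNVISE
print(translate(tail))  # YRELERY
```

The two outputs match the first ten and last seven residues of the protein sequence listed above; passing the full gene sequence to `translate` would be the natural next check.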