Gene GM21_0219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0219 
Symbol 
ID8135525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp262685 
End bp263758 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content66% 
IMG OID644867840 
ProductRadical SAM domain protein 
Protein accessionYP_003020062 
Protein GI253698873 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAA TGCTCAACGT CATATCTGAA AAAGTCGCCG CAGGTGCGGC CATAACCTCG 
GAAGAAGCGC TCTGGTGCCT CACCGAGGCG GAACTCCTCG CCGTGGGGCG CATAGCCGAT
TCGATCCGCC GCGCCATGCA CCCGGACGGC TGCGTCAGCT TCGTCGTCGA CCGCAACGTC
AACTACACCA ACGTCTGCGA GTCCAGGTGC AAGTTCTGCG CGTTCTACCG CGATGCCGAC
GCGGCGGACG CCTACCTCCT GGACACCGAA ACCATCATGG CGAAGATCGG GGAACTGGTG
GACCAGGGAG GAACCCAGCT CCTGATGCAG GGGGGGCTCC ACCCCTCGCT CGACATCGCC
TGGTTCGAGG AGCTCTTCAG GGAGATCAAG CGCCGCTTTC CCGGCGTGCA GAACCATTCG
CTCTCCCCGG CGGAGGTCAC CCAGGTGGCG AAGCTCTCCG GCCTTGGCAT CGCCCAGACG
CTGGTACGGC TGCAGCAGGC CGGGCTCGAT TCTATCCCCG GAGGGGGGGC CGAAATCCTG
GTCGACAGCG TCCGTGCCGA GATCTCACCT AAGAAGATCG GTTGGCAGGG GTGGGCGCAG
GTCATGCGCG AGGCTGCCCG GTTAGGGATG CCCACCACCG CTACCATGAT GTTCGGCAGC
CGCGAGCGCG CCGAGGATAT CGTCGAGCAC CTGTTCCGGG TGCGCGCGTT GCAGGACGAG
GGAGGGAGCT TCACCGCCTT CATCCCCTGG ACCTATCAGC CGGGGAACAC CGAGCTCGGG
GGGGAGGGTG CCAGCGGGGT CGAGTACCTG AAGGTGCTGG CCCTGTCGCG CATCGTGCTC
AGGAACGTGC CGAACGTGCA GGCGAGCTGG GTGACCCAGG GGGCCAAGAT GGCGCAGGTC
GCGCTCTTCT TCGGCGCCAA CGACCTGGGG GGAACCATGC TCGAGGAGAA CGTCGTGGCG
GCCGCCGGCT GCCGCTTCCG CATGACGCGC GAGGAGATGA TAGCGCTCAT CCGCGGCGCC
GGTTTCACCC CGGTCCGGCG CACCACCCTG TACCGGGAGC TTGAGCGTTA CTGA
 
Protein sequence
MSKMLNVISE KVAAGAAITS EEALWCLTEA ELLAVGRIAD SIRRAMHPDG CVSFVVDRNV 
NYTNVCESRC KFCAFYRDAD AADAYLLDTE TIMAKIGELV DQGGTQLLMQ GGLHPSLDIA
WFEELFREIK RRFPGVQNHS LSPAEVTQVA KLSGLGIAQT LVRLQQAGLD SIPGGGAEIL
VDSVRAEISP KKIGWQGWAQ VMREAARLGM PTTATMMFGS RERAEDIVEH LFRVRALQDE
GGSFTAFIPW TYQPGNTELG GEGASGVEYL KVLALSRIVL RNVPNVQASW VTQGAKMAQV
ALFFGANDLG GTMLEENVVA AAGCRFRMTR EEMIALIRGA GFTPVRRTTL YRELERY