Gene Anae109_4207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4207 
Symbol 
ID5375582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4932761 
End bp4934638 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content72% 
IMG OID640845734 
Productradical SAM domain-containing protein 
Protein accessionYP_001381369 
Protein GI153007044 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
[COG1427] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.428604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGC TCCGCGCCGC AGCCGTCTCC TTCCTGAACG CGCACCCGCT CACCGTCGGG 
CTCGAGGGCT CCGACCGGAT CGAGCTCGTC CCCGCCGAGC CGTCGCGGTG CGCCGCGATG
CTGGAGGACG GCGAGGTGGA TCTCGCGCTC GTCTCGGTCG CGGCCCTCAC CAAGGGCGAG
TACGAGATCG TCCCGGGCAT CGCCATCGGC GCGGACGGGC CGGTGCAGAC GGTGGTCCTC
GCCGGCGAGC AGTCGCCCGC GATCTGGGAC GAGGTGTTCC TCGACACGGC CTCGCGCACC
TCGCACGTGC TCGCCAAGCT CGTGCTCGAC GCGATGGGCG TTCACCCCAA GTTCACGCCG
ATGCACGCGG ACGAGGGGCT CGCGCGCGCC AAGGGGACGA AGGGCGCGCT CGTCATCGGG
GACCGCGCCT TCGGCGTGCG GGCGAACCAC GTGCTCGACC TCGGGCGCGA GTGGACGCAC
CTCACCGGCC TGCCCATGAT CTTCGCCCTG TGGGCCGCGC GCCCGGGGCG CGTCTCGCCG
GAGGACGTGC AGGAGCTGAC GCGCGCCGCG CAGCACGGCC TCGGCGTCCG CACCGAGCTC
GCCCAGCGCT TCGCCGCGCA GAAGGGCGGC GATCCGGAGC GCTTCCGGCG CTACCTCACG
CAGCGCATCC GCTACGGGCT CGGGCCGCAC GAGCTGGACG GCCTGGAGGC CTTCCTCGGC
AGGGCGGCCG AGAAGGGGTT CCTGCCGCCC ATGAAGCTGC GCTTCGTCGA CGACGTGGTC
CGCACGACGC GCACGCGGCG GCTGGTGTCG CTCGACACCG CGCTGCAGAA GGGCGCCGAC
GGCGAGCGCC TCGACGCGGA CGAGGCGGAG CTCCTCGACG AGAAGGCGCC GCTCCTCGAG
CTCGGCCTCG CCGCCGACGC CCGCCGCCGC GCGCTCCACC CGGACGGAGC GGTCACGTAC
ATCGTATCGC GGAACGTGAA CTACACGAAC GTCTGCACCA CGGCGTGCCA CTTCTGCGCG
TTCTACCGGC CGCGCGGCCA CAAGGAGGCC TACGTCCTCG ACCGCGACGA GCTGACCCGC
AAGATCGACG AGACCGTCGC CCTCGGCGGC ATCGAGATCC TGCTCCAGGG CGGCCTGCAC
CCCGACCTCG GCGTCGAGTG GTACGAGGAC CTCTTCCGCT GGGTGAAGGC GAGGTGGCCG
GCGATCAACC TGCACGCCCT CTCCCCCGAG GAGATCTGGC ACATCGCCCG CACGAGCGAG
CTGCCGCTCG ACGACACCAT CGCGCGGCTC ATCGCGGCCG GCCTCGACTC GATCCCCGGC
GGCGGCGCGG AGATCCTCGA CGACGAGGTC CGCCGCCGGA TCGCCCCGCT CAAGTGCTCC
AGCGACGAGT GGCTGTCGGT CATGCGGGCG GCCCATTTGA AGGGGCTGCG CAGCACGGCC
ACCATGATGT TCGGGGTGGG CGAGGAGCCC CGCCACCGGG TCGCCCACCT CGTGCGCCTG
CGCGAGCTGC AGGACGAGAC GCGCGGCTTC ACCGCCTTCA TCTGCTGGCC GTTCCAGTCG
GCCAACACCC GCCTCACCGC GTCCGACACG AGCGCGCAGG CCTACCTCAG GGTCAACGCG
GTCTCGCGGC TCGTCCTCGA CAACGTGCCG AACCTGCAGG CCTCCTGGGT GACCATGGGC
GGCGGGGTCG CGCAGGCCTC CCTGCACATG GGCTGCAACG ACTTCGGCTC GGTGATGATC
GAGGAGAACG TGGTCTCCGC GGCCGGCACG ACGTTCCAGA TGGACGCGGA GGAGGTCGAG
CGGCACATCC GCGACGCGGG CTTCCGGCCG GCGCGGCGGA ACATGAGGTA CGAGCGCGTG
GGGGACGCCG CGGCGTGA
 
Protein sequence
MPKLRAAAVS FLNAHPLTVG LEGSDRIELV PAEPSRCAAM LEDGEVDLAL VSVAALTKGE 
YEIVPGIAIG ADGPVQTVVL AGEQSPAIWD EVFLDTASRT SHVLAKLVLD AMGVHPKFTP
MHADEGLARA KGTKGALVIG DRAFGVRANH VLDLGREWTH LTGLPMIFAL WAARPGRVSP
EDVQELTRAA QHGLGVRTEL AQRFAAQKGG DPERFRRYLT QRIRYGLGPH ELDGLEAFLG
RAAEKGFLPP MKLRFVDDVV RTTRTRRLVS LDTALQKGAD GERLDADEAE LLDEKAPLLE
LGLAADARRR ALHPDGAVTY IVSRNVNYTN VCTTACHFCA FYRPRGHKEA YVLDRDELTR
KIDETVALGG IEILLQGGLH PDLGVEWYED LFRWVKARWP AINLHALSPE EIWHIARTSE
LPLDDTIARL IAAGLDSIPG GGAEILDDEV RRRIAPLKCS SDEWLSVMRA AHLKGLRSTA
TMMFGVGEEP RHRVAHLVRL RELQDETRGF TAFICWPFQS ANTRLTASDT SAQAYLRVNA
VSRLVLDNVP NLQASWVTMG GGVAQASLHM GCNDFGSVMI EENVVSAAGT TFQMDAEEVE
RHIRDAGFRP ARRNMRYERV GDAAA