Gene Anae109_4206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4206 
Symbol 
ID5375581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4931583 
End bp4932746 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content71% 
IMG OID640845733 
Productradical SAM domain-containing protein 
Protein accessionYP_001381368 
Protein GI153007043 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCGA CCCTCGCGCA CCGCGCGCTC GAGCGCGACG GCCTCGGGCC GATCCGCGAC 
AAGGTCCTCG CCGGCGAGCG GCTCGGCGAC GCGGACGCGC TCCGGCTCCT GGAGGCGGCG
GACCTCGCCG CGGTGGGCGC CCTGGCGAAC CACGTCCGCG AGGCGCGCCA CGGCGATCTC
ACCTTCTACA ACCGCAACGT CCACCTGAAC CCGACGAACG TCTGCGTCGC GACCTGCAAG
TTCTGCTCGT TCGCGCGCAA GGACGATCAG GCCGCCTCCG AGGGCTACAC GATGTCCCTC
GACGAGGCCG TGCAGAAGGT CCTCTCGCGC CGCGGGCTCG GGATCACCGA GGTGCACATC
GTCTCCGGGC TGCACCCGGA CCTGCCCTGG GAGTACTACC CGGAGCTGCT GCGCCGCATC
CGCCAGGCCT GGCCCGAGCT CGCCATCAAG GCCTTCACCG CCATCGAGAT CCACTTCTTC
GCGGAGAAGT TCGGGAAGAG CTACGAGCAG GTGCTGCGCG AGCTGCACGA GGCCGGCATG
GACACCATGC CCGGCGGGGG CGCCGAGATC TTCGCGACGC GCGTGCGCCG CAAGATCTGC
GACGACAAGG CCACCGCCGA GCAGTGGCTC GAGATCCACC GCACCGCCCA CCGGCTGGGC
CTCAAGACCA ACGCCACCAT GCTCTACGGC CACATCGAGC GGCTCGACGA GCGCGTGGAC
CACATGCGGC TCCTGCGCGA GCTGCAGGAC GAGACCGGCG GCTTCCAGGT CTTCATCCCG
CTCGCCTTCC ACCCCGAGCA CAACATGATC GGCAAGGCCT TCCCCAAGCC GACCGGGTAC
GACGCGCTGC GCACCCTCGC GGTGGCGCGG CTGTACCTCG ACAACTTCGA CCACGTGAAG
GCGTACTGGG TCTCGCTCGG CGAGCGGCTC GCGCAGACGT CGCTCGCGTT CGGCGTGGAC
GACGTCGACG GCACCGTGCT CGAGGAGCGC ATCTACCACA TGGCCGGCTC GACCGTCCCG
CAGGCGCTCT CGGAGCGGAC GCTGCACGAG CTCATCCGCG CCGCGGGCCG CGTGCCCGCC
GAGCGCGACA GCCTGTACCG CGTCCTGAAG GTGCATGAGC AGCCGCCGAG CGACGCGCCC
GGACGCCTGC AGGTCACCGC CTGA
 
Protein sequence
MISTLAHRAL ERDGLGPIRD KVLAGERLGD ADALRLLEAA DLAAVGALAN HVREARHGDL 
TFYNRNVHLN PTNVCVATCK FCSFARKDDQ AASEGYTMSL DEAVQKVLSR RGLGITEVHI
VSGLHPDLPW EYYPELLRRI RQAWPELAIK AFTAIEIHFF AEKFGKSYEQ VLRELHEAGM
DTMPGGGAEI FATRVRRKIC DDKATAEQWL EIHRTAHRLG LKTNATMLYG HIERLDERVD
HMRLLRELQD ETGGFQVFIP LAFHPEHNMI GKAFPKPTGY DALRTLAVAR LYLDNFDHVK
AYWVSLGERL AQTSLAFGVD DVDGTVLEER IYHMAGSTVP QALSERTLHE LIRAAGRVPA
ERDSLYRVLK VHEQPPSDAP GRLQVTA