Gene Dred_2193 details


Gene Information

Locus tag: Dred_2193
Symbol:
ID: 4956811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum reducens MI-1
Kingdom: Bacteria
Replicon accession: NC_009253
Strand:
Start bp: 2398305
End bp: 2399372
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 44%
IMG OID: 640181369
Product: radical SAM domain-containing protein
Protein accession: YP_001113533
Protein GI: 134300037
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
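
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2398305, 2399372)
print(length, protein_length_aa(length))  # prints: 1068 355
```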


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGCAGG TTAAAGAGAT ACTCAATAAA GCTGTGGCAG GTGAGAGGTT AAGTGTAGAA 
GAGGGCGTTG CCCTCTTAAA TTCTTCTGAT TTAATTCAAC TGGGAGCAGC AGCAAACACC
ATTCGCAAAA GGTTGCATCC AAAAAATCGT ACCACCTTTA TTATTGATCG CAATATTAAT
TATACAAACA TATGCACCTG TAAGTGTAAA TTCTGTGCTT TCTGGCGGGA ACCCAGTGAC
CCGGATGCCT ATATCTTGCC TCAGGAAGAG TTATATCAAA AAATTGAAGA GACCATTGCT
GTAGGTGGCA CAGAGATTTT AGTTCAGGGT GGGTTACATC CAGAATTAGG ACTGGATTAC
TATGTGGATT TGCTAAAATC CATTAAGGAA CGTTACAATA TTCACATTCA TTCTTTTTCG
CCACCGGAAA TTTGGCATAT CGCCAGAAAA GAAGGTCTGC CATTACTGCG GGTTATTGAG
GCCTTAAAGA ATGCCGGGCT TGATTCAATC CCCGGAGGTG GGGCAGAGAT TCTGGATAAC
CGTGTACGAA GGATCATTAG CCCGGATAAA GTTACTTGGG AAGAATGGAT GGAGGTCATG
ACCATTGCCC ATTGTCTGGA CATGAAAACC ACAGCAACCA TGATGTTCGG GCATGTTGAA
ACTCCTGAAG AAAGGATTCT GCACATGGTC AGGGTAAGAG ATACCCAGGA CAAGACCGGA
GGTTTTACTG CCTTTATCCC TTGGAGTTTT CAACCTAAAA ACACCAAATT AGGAGGAGAG
ACAACCTCTG GGATTGATTA TTTAAAAACC TTGGCAGTGG CTAGATTAAT GCTCGATAAT
GTAAAGAACA TTCAGGCCTC GTGGGTGACA CAGGGTGCCA AGATGGCTCA AGTATCGCTA
AATTTTGGTG CCAATGATTT TGGCAGCACC ATGCTGGAGG AGAATGTTGT GAGGGCAGCC
GGCGTCACCT ACCGAGTGGC CCTCCAGGAA ATCCTTCGCT GCATTCGTGA AGCAGGTTTC
CGTCCGGCCC AGCGTACCAC TGATTATCGT ATTATTAAAG AAATGTAA
 
Protein sequence
MLQVKEILNK AVAGERLSVE EGVALLNSSD LIQLGAAANT IRKRLHPKNR TTFIIDRNIN 
YTNICTCKCK FCAFWREPSD PDAYILPQEE LYQKIEETIA VGGTEILVQG GLHPELGLDY
YVDLLKSIKE RYNIHIHSFS PPEIWHIARK EGLPLLRVIE ALKNAGLDSI PGGGAEILDN
RVRRIISPDK VTWEEWMEVM TIAHCLDMKT TATMMFGHVE TPEERILHMV RVRDTQDKTG
GFTAFIPWSF QPKNTKLGGE TTSGIDYLKT LAVARLMLDN VKNIQASWVT QGAKMAQVSL
NFGANDFGST MLEENVVRAA GVTYRVALQE ILRCIREAGF RPAQRTTDYR IIKEM
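
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is not the database's own pipeline. It assumes the sequences are pasted as shown here, in space-separated blocks of ten, and it uses the codon assignments of translation table 11, which match the standard genetic code for amino acids. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (for example GTG or TTG read as Met) are not
    handled; this gene begins with ATG, so that case does not arise here.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
first_block = clean("ATGTTGCAGG TTAAAGAGAT ACTCAATAAA")
print(translate(first_block))  # prints: MLQVKEILNK
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` applied to the cleaned gene sequence can be compared with the GC content reported in the Gene Information section.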