Gene Dred_3089 details


Gene Information

Locus tag: Dred_3089
Symbol:
ID: 4956901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum reducens MI-1
Kingdom: Bacteria
Replicon accession: NC_009253
Strand:
Start bp: 3353232
End bp: 3354332
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 45%
IMG OID: 640182277
Product: metal dependent phosphohydrolase
Protein accession: YP_001114416
Protein GI: 134300920
COG category: [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain; [COG2206] HD-GYP domain
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
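
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3353232
END_BP = 3354332


def gene_length(start_bp, end_bp):
    """Length in bp of a range, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1101 366
```

Both results agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields in the record.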


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATA AGGTAAAAAA GAAAAGATGG TATCATTTAG AAGCCCTCTG GGAAATTACA 
AGAATCCTTC ATACATCTCT TGATCTGGAA GAAGTACTGG ACATGGCCCT TACTGAAGCG
ATGAAAGCAG TCCATGCAGA GGCAGGTACC CTCTGGTTAA ACGATAACCA GACCAATGAA
TTTATCCAAC CGGTTTTAGC AAGGGGTCCC AAGGCAGATG GGCTGAAAGG CTTAAAACTA
AAGATAGGCG AAGGGATGGC CGGCTGGGTA ACCGCCAATG GTCAGTCCCA AATGGTCAGT
GATGTTCTTA AAGATTCCCG CTGGTCCCAA CGATTTGACC AGTCCACCGG TTTTATTACC
CGCTCTTTGC TTTGTGTACC ACTAATAACC CAAACTTCCT GTATTGGGTG TCTGCAACTG
GTTAATAAGC TCGATGGTCA ACTATTCGAT GAGGATGATT TAAGCTTATG CGAAGCCCTG
GCTGGAGTTA TTGGTATGGC TGTGGAAAAC AGTCGTCTTT ATACAGACTT AAAGACCATG
TTTAAGAGTT TTCTGGTGGC CTTAGCCTCG GCCATTGATG CCCGGGACCC CTATACTCGA
GGTCATTCAG AGCGAGTTAG CCAGTATAGC CTGATGATGG GAAAAGCCCT GGGACTTCCT
GAACAGGATT TAGAATTATT AGAAAGAGCT GCTTTTCTGC ATGATATTGG GAAGATTGGT
ATTAGAGACC ATATACTGCT AAAAGAATCG CCACTGGATA ATGAGGAATT TATAATTATG
AAGACCCATA CCACCATTGG GCAAAATATT CTACAACAGA TTGAGCCTAA CTATTTGGTT
CAGGAGATAT CCCAGGGAGC CGCCTGTCAT CACGAACGAT ATGACGGCAA GGGATACCCT
CAGGGATTGC AAAGAGAAGA AATCCCCCTT GCTGCACGTA TTATGGCCAT TGCTGATACC
TTTGACGCCA TGGTAACAGA CAGACCATAT CGCAAGGGGT TACCGGTGAA ATTAGCGTTA
CAGGAAATAA AACGCTGTGC CGGCAGCCAG TTTGATCCCC AACTGGCAGA AATATTTTTA
ACAGAAATGA AAAAGGAGTA A
 
Protein sequence
MSDKVKKKRW YHLEALWEIT RILHTSLDLE EVLDMALTEA MKAVHAEAGT LWLNDNQTNE 
FIQPVLARGP KADGLKGLKL KIGEGMAGWV TANGQSQMVS DVLKDSRWSQ RFDQSTGFIT
RSLLCVPLIT QTSCIGCLQL VNKLDGQLFD EDDLSLCEAL AGVIGMAVEN SRLYTDLKTM
FKSFLVALAS AIDARDPYTR GHSERVSQYS LMMGKALGLP EQDLELLERA AFLHDIGKIG
IRDHILLKES PLDNEEFIIM KTHTTIGQNI LQQIEPNYLV QEISQGAACH HERYDGKGYP
QGLQREEIPL AARIMAIADT FDAMVTDRPY RKGLPVKLAL QEIKRCAGSQ FDPQLAEIFL
TEMKKE
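
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The sketch below shows one way to recompute both from the text as displayed (blocks of ten characters separated by spaces). It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, the codon assignments are those of the standard code (which table 11 shares), and the alternative start codons that table 11 allows are not handled, which does not matter here because this gene starts with ATG.

```python
# NCBI codon ordering: bases in the order T, C, A, G.
BASES = "TCAG"
# Amino acids for the 64 codons TTT, TTC, TTA, ..., GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS codon by codon, dropping one trailing stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 18 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGAGTGATA AGGTAAAA"))       # prints: MSDKVK
print(translate("ACAGAAATGA AAAAGGAGTA A"))   # prints: TEMKKE
```

The two fragments translate to the first and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.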