Gene Dred_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDred_1939 
Symbol 
ID4956292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum reducens MI-1 
KingdomBacteria 
Replicon accessionNC_009253 
Strand
Start bp2130524 
End bp2131414 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content44% 
IMG OID640181107 
Productdihydrodipicolinate synthase 
Protein accessionYP_001113283 
Protein GI134299787 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID[TIGR00674] dihydrodipicolinate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACAG TGGATTTTGG CAGGGTTATT ACGGCCATGG TGACACCTTT TCATCCAGAT 
ATGTCCGTGA ATTATACCCA GGCTAAAAAA CTAAGCAGAT ATCTGGTGGA AAATGGTTCA
GATGGTTTGG TTGTTAGTGG AACCACTGGA GAATCTCCTA CCCTGAACAA GGATGAAAAG
ATTCAATTAT TTAAAGCAGT GGTAGAAGAA GTTGGAGGGC AGGCAACGGT TATTGCTGGG
ACCGGAAGTT ATGATACCGC CAGCAGTATT ATCCTAACCA AAGAAGCTGA AAAGGTTGGT
TGTGATGGAG TTATGCTGGT GGCTCCCTAT TACAACAAAC CATCCCAGGA AGGACTCTAT
CAGCATTTTC GCACTATCGC TGAATGTACC AGCTTGCCAG TAATGCTTTA CAATATTCCG
GGGCGGACTG GAATTAATGT TTTGCCCGCC ACAGTGGAAA GACTGGCCAA GGATGTACCA
AACATTGTGG CCATTAAAGA AGCGGCTGGG GATATCAATC AGGTATCTGA ATTACGTCGC
ATCCTACCTG AGGATTTTAT CATCTTTAGC GGAGATGACT CTTTGACACT ACCAATGCTG
TCCTTAGGAT GCAAGGGGAT TGTTAGTGTG GCGGCTCATA TTGCCGGTAA GCAAATACAG
GAGATGATTG ATGCCTTTAC TTCTGGGAAT ACAACCCTTG CGGCTAATCT CCATAAAGAA
CTTTTCCCGA TCTTTAAGGG GTTGTTTATA ACATCCAATC CAGTTCCGGT CAAAGCAGCC
TTAAACCTAA AGGGGCTTGC GGTAGGTGGT GTGAGATTAC CCTTAGTTGA GGCGACGGCC
AAGGAAATCG AAACTGTAAA AAATATTATG AATAACCTCC ATTTATTGTA A
 
Protein sequence
MSTVDFGRVI TAMVTPFHPD MSVNYTQAKK LSRYLVENGS DGLVVSGTTG ESPTLNKDEK 
IQLFKAVVEE VGGQATVIAG TGSYDTASSI ILTKEAEKVG CDGVMLVAPY YNKPSQEGLY
QHFRTIAECT SLPVMLYNIP GRTGINVLPA TVERLAKDVP NIVAIKEAAG DINQVSELRR
ILPEDFIIFS GDDSLTLPML SLGCKGIVSV AAHIAGKQIQ EMIDAFTSGN TTLAANLHKE
LFPIFKGLFI TSNPVPVKAA LNLKGLAVGG VRLPLVEATA KEIETVKNIM NNLHLL