Gene TM1040_3386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3386 
Symbol 
ID4075285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp402659 
End bp403693 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content62% 
IMG OID638004894 
Productmalate/L-lactate dehydrogenase 
Protein accessionYP_611620 
Protein GI99078362 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.188787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACA GACAAGAAAG AACAGAGATG ACCGCCACCG AAACACTCAC GCTCTCGGAA 
ATCGAAAGCC TTGCCTTTGA TGCACTGGTT GCAGCGGGCA CCTCACCCGC AAACGCCCGC
CCTCTGGCAG TTGCAACCGC GATGACCGAA GCCGATGGGG TCGCCTCGCA CGGGCTGGCC
TATATCCCGA TCTATGCCCA GCATGTTGAA TGCGGCAAAG TCGACGGACA GGCCAACCCC
AAGGTCGCAC ACCCTCGACC CGCAGTGATC ACCGTGGACG CAGCCACCGG ATTTGCACAT
CGTGCGATCG ACCTCGGCTT TGAGCAGTTG ATCCCTCTGG CCAAGGAAAT GGGTGTCGCG
GTGCTGGCCG TGAACAACTC CTACAACTGT GGTGTTCTGG GGGTTCACAC GCAAAGGCTG
GCGCAGGCTG GGCTGATGGG GTTTGGCTTT ACCAATGCCC CTGCCTCGAT TGCGCCCTCG
GGTGGCGCAA AGCCTGTGGT GGGCACCAAT CCATTTTCAA TCTCGGCGCC GGGTTCAGAT
GGCACTGCGG CTCTGCTCAT TGATCAATCC GCCAGCACGA TTGCAAAGAG CGAAGTGATG
AAACACGCCC GTGAAGGCAA GCCGGTCCCA CAAGGCTGGG TGCTGGACGC CGATGGCCAG
CCCACCATCG ATCCCGATGC AGGCCTCAAA GGGTCAATGG TGCCGTCCGG CGGATACAAA
GGCGTGGGCA TTGCCCTGAC CGTCGAGCTT CTGGCAGCGG CCATGACCGG CGCAACCCTG
GGCGCGGTGG CGAGCCCGTT TTCCGGCACA GCGGGCGGTC CGCCCAAAAC CGGCCAGTTC
TTTATCGCCA TAGACCCGGA CGCTACATCC GGGGGGCTCT TTCAGGAAAA GCTCGCGGAT
TTGATTTCGG CATTTCGTGA TCAAGATGGC GCACGTCTGC CAGGAGATGG TCGCCAATCC
GCCCGTCTCC GGGCCGCCAC CGAGGGCGTG AGGGTGAACG CCGCCCTACT GGAGCGCGTG
CGCGCCCTCA TCTAA
 
Protein sequence
MSDRQERTEM TATETLTLSE IESLAFDALV AAGTSPANAR PLAVATAMTE ADGVASHGLA 
YIPIYAQHVE CGKVDGQANP KVAHPRPAVI TVDAATGFAH RAIDLGFEQL IPLAKEMGVA
VLAVNNSYNC GVLGVHTQRL AQAGLMGFGF TNAPASIAPS GGAKPVVGTN PFSISAPGSD
GTAALLIDQS ASTIAKSEVM KHAREGKPVP QGWVLDADGQ PTIDPDAGLK GSMVPSGGYK
GVGIALTVEL LAAAMTGATL GAVASPFSGT AGGPPKTGQF FIAIDPDATS GGLFQEKLAD
LISAFRDQDG ARLPGDGRQS ARLRAATEGV RVNAALLERV RALI