Gene TM1040_3167 details

Gene Information

Locus tag: TM1040_3167
Symbol:
ID: 4075337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 147745
End bp: 148710
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 64%
IMG OID: 638004670
Product: mandelate racemase/muconate lactonizing-like protein
Protein accession: YP_611403
Protein GI: 99078145
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
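
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 147745
end_bp = 148710
gene_length_bp = 966
protein_length_aa = 321


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acid count of a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed here: 148710 - 147745 + 1 = 966 bp, and 966 / 3 - 1 = 321 aa.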


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA CCGTCACCCC CGATGTGTTC AAACTGGCGC AAGTGTTCAC CATCTCGCGC 
GGGTCGCGCA CCGAGGCCAA GGTGCTGACG GTTCGGATTG AGGATGGAGA TCATGTGGGC
TGGGGCGAAT GTGTGCCCTA TGCGCGCTAT GACGAGACAC TCGAGTCGGT GACGGCTCAG
ATCGAAGCGC TGCCCGCTAC GTTCACCCGC GCGGAACTGC AGTCGCTGCT GCCCGCTGGG
GCGGCGCGCA ACGCGGTGGA TTGTGCCCTG TGGGATCTGG AGGCCAAAAA GGCCGGCAAG
CCGGTCTGGG AATTGGCCGG TCTGGACCAG CCGGGACCCG AGATCACCGC CTATACGCTG
TCGCTGGCCT CTCCGGAGGA GATGCAGAAA CAGGCCGCAG AGAACGCCCA TCGTCCGCTG
TTGAAGATCA AGCTCGGCAC GCCAGAGGAT ATGCCCCGCC TTGAGGCGGT GCGCGCAGGC
GCGCCCGATG CGCGGATCAT CATTGACGCC AACGAGGGCT GGTCGGCCGA GGTCTACGCC
GAGCTTGCGC CGCATCTGCT GCGCCTTGGG GTGGAGCTGG TGGAGCAACC CCTGCCCGCA
GGCGAGGATG AGGCCCTGAT CGGGATGGAA CGTCCGGTGC CGGTCTGCGC CGATGAGAGC
GCGCATGACT GCGCCAGCCT GCCAAAACTC AAGGGCAAAT ATGATGTTGT GAACATCAAA
CTGGATAAGA CCGGCGGCCT GACAGAAGCG TTGAAATTGC GCGATGCAGC GCTGGCCGAG
GGCTATCAGG TGATGGTCGG CTGCATGGTC GGATCGTCGC TGGCCATGGC CCCCGCGACA
CTGGTGGCGC AGGGTGCGTT GGTGACAGAT CTTGACGGGC CGCTCCTTCT GGCCGAAGAC
CGTCCCGAAC CGCTGACTTT TGACGCCGAG GGGGTCCACC CCCCACGGCC CGCGCTCTGG
GGCTAA
 
Protein sequence
MKITVTPDVF KLAQVFTISR GSRTEAKVLT VRIEDGDHVG WGECVPYARY DETLESVTAQ 
IEALPATFTR AELQSLLPAG AARNAVDCAL WDLEAKKAGK PVWELAGLDQ PGPEITAYTL
SLASPEEMQK QAAENAHRPL LKIKLGTPED MPRLEAVRAG APDARIIIDA NEGWSAEVYA
ELAPHLLRLG VELVEQPLPA GEDEALIGME RPVPVCADES AHDCASLPKL KGKYDVVNIK
LDKTGGLTEA LKLRDAALAE GYQVMVGCMV GSSLAMAPAT LVAQGALVTD LDGPLLLAED
RPEPLTFDAE GVHPPRPALW G