Gene TM1040_3395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3395 
Symbol 
ID4075569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp414441 
End bp415550 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content61% 
IMG OID638004904 
Productmandelate racemase/muconate lactonizing-like protein 
Protein accessionYP_611629 
Protein GI99078371 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.446671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGGA TCACTCGCGT TGAATTGCGC ATGATCGACC TCAAGCCCAA GGTGGAGCGG 
GTCGACGCCA TCCAGAGCTT TGTCAGTCAG GAGACCCCGA TTGTCACGAT TCATGACAGC
GACGGGGCAT CGGGGACGGG CTACAGCTAC ACCATCGGGA CCGGTGGCCC CTCTGTCATG
GCGCTGTTGG AGCAGACGCT CGTGCCTGCT CTGATCGGGA AAGATGCGGA TCGGATCGAT
GGCATCTGGC GGGATCTGTT GTTCCTGACC CATGCCACCG CAGTTGGTCC GATCACTTCG
CTGGCGCTGG CAGCCATCGA CATCGCGCTC TGGGATCTGC GCTGCAAAAA GATGGGGCTG
CCACTCTGGA AGGCCGCGGG CGGGAGCCGC GACCGTATTC CGCTCTATTC GACCGAGGGT
GGCTGGCTGC ATCTCGAGAC CGCCGCGTTG GTCGAAGACG CCTTGGCGAT GAAAGAGCAG
GGATTTCGCG GCTCCAAGAT CAAGATTGGC CGCGCGCATC TGAGTGAGGA CCGTGCCCGT
CTTGGGGCGA TACGCGCAGC AGTGGGCGAC AGCTACGAGA TCATGACGGA CGCCAATCAG
GGCTTCACGC TTTCCGAGGC GATCCGCCGT GCCAAAGTGC TGGAGGAGTT CGGCATCGGC
TGGTTCGAAG AGCCACTTCC TGCAGATGAT GTGCTCTCGC ATCAAGAGCT CTCGCGGCGC
ACGCATGTGC CGATTGCCGT GGGCGAGAGC ATGTATTCGC TGAGCCAGTT CAAGGACTAT
CTGCAGGTGG GGGCGGCGCA GATCGTGCAG GTGGATGTCG CCCGGATTGG CGGCATCACA
CCCTGGCTCA AGGTCGCGCA TCTGGCCGAG GCGCATAGTG TTATGGTGTG CCCGCATTTC
CTGATGGAAT TGCACCTGCC GCTTGTCTGC GCGGTGCCCA ATGCAAAGTG GCTCGAATAC
ATCCCGCAGC TCGATGGAGT TACCCGCCAG AACATGCAGA TCAGTAATGG CGATGCCGTG
CCTTCGGATG AGCCCGGTCT GGGCATCGAC TGGGATTGGG ACGCCATCAC AGCGCAAGAA
ATCGGCGCGC GCTCCATTGG AGGTGCATGA
 
Protein sequence
MARITRVELR MIDLKPKVER VDAIQSFVSQ ETPIVTIHDS DGASGTGYSY TIGTGGPSVM 
ALLEQTLVPA LIGKDADRID GIWRDLLFLT HATAVGPITS LALAAIDIAL WDLRCKKMGL
PLWKAAGGSR DRIPLYSTEG GWLHLETAAL VEDALAMKEQ GFRGSKIKIG RAHLSEDRAR
LGAIRAAVGD SYEIMTDANQ GFTLSEAIRR AKVLEEFGIG WFEEPLPADD VLSHQELSRR
THVPIAVGES MYSLSQFKDY LQVGAAQIVQ VDVARIGGIT PWLKVAHLAE AHSVMVCPHF
LMELHLPLVC AVPNAKWLEY IPQLDGVTRQ NMQISNGDAV PSDEPGLGID WDWDAITAQE
IGARSIGGA