Gene Caci_5041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5041 
Symbol 
ID8336395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5781965 
End bp5783074 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content71% 
IMG OID644958140 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_003115742 
Protein GI256394178 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01928] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCG AGCGCGTCGA ACTGCGCCGG ATCGGCATCC CCCTGGCCAC GCCCTTCCGG 
ACCTCGCTGG GCCTTGAGCT GGACCGCGAC ATCCTGATCC TGCGTGCCGA CACCTCCGAG
GGACCGGGCT GGGGCGAGTG CGTCGCCATG CCGGAGCCCG GATACTCCGA GGAGTACCTC
GACGGCGCCG CGCACGTCAT CCAGCGCTTT CTCCTGCCGG CGGTGGCAGC ACTCGACGAC
CTCACGCCGG CCCGCGCGGC TGCCGCGATG GCGGCGTTCC CCGGCAACCC CATGGCCAAG
GCGACGGTCG AGATGGCGGT CATGGACGCC TGGCTCCGCG CGCGCCGAAG CTCCTACGCC
GACCATCTCG GCGCGGTGCG CTCAACCGTG GAAAGCGGGG TGTCGGTCGG CATCGCCGAT
ACGATCGATC AACTCCTCGC CGAGGTCTCA GGCTACGTGG ATCAGGGGTA CCGCCGCATC
AAGCTGAAGA TCGAGCCGGG CTGGGACCTC GAACCGGTCC GAGCGATCCG CGAGCGCTTC
CCCGACATCG CGCTCCAAGC CGACGCCAAC GCCGCCTACA CCTTCGCCGA TGCCCGCCAC
CTGGCCGCCC TGGACGCGTT CGACCTGGTG ATGCTCGAGC AGCCGCTGGG CACCGCGGAC
GTGCGCGACC ACGCCGCGCT GGCGCGCATG CTACGCACCC CGATCTGCCT GGACGAGTCC
ATCACGTCCG CGCGCAGCGC CGCCGACGCC ATCGCGCTGG GCGCGTGCGC GATCGTCAAC
ATCAAGGCGG GCCGGGTCGG CGGCTACCTG GAAGCGCGGC GGATCCACGA CGTCTGCGCC
GCGCACAGCG TGCCGGTGTG GTGCGGCGGG ATGCTGGAAA CCGGGCTCGG GCAGGCTGCC
AACCTGGCGC TCGCGGCGTT GCCGGGCTTC ACGATGCCCG CCGACATCGC GCCGTCCGCG
CGCTACTTCG CCACCGACGT CACGGCGCCG ATCACGATGA GCGAGGGCCG GATCGCGGTT
CCGGACGGAC CGGGTCTGGG GCTGGACCCG ATCCCGGAGA TCCTCGAGGG CTACACGACC
GACGTCGTCA CCATCACCCG GTTCGGCTGA
 
Protein sequence
MKLERVELRR IGIPLATPFR TSLGLELDRD ILILRADTSE GPGWGECVAM PEPGYSEEYL 
DGAAHVIQRF LLPAVAALDD LTPARAAAAM AAFPGNPMAK ATVEMAVMDA WLRARRSSYA
DHLGAVRSTV ESGVSVGIAD TIDQLLAEVS GYVDQGYRRI KLKIEPGWDL EPVRAIRERF
PDIALQADAN AAYTFADARH LAALDAFDLV MLEQPLGTAD VRDHAALARM LRTPICLDES
ITSARSAADA IALGACAIVN IKAGRVGGYL EARRIHDVCA AHSVPVWCGG MLETGLGQAA
NLALAALPGF TMPADIAPSA RYFATDVTAP ITMSEGRIAV PDGPGLGLDP IPEILEGYTT
DVVTITRFG