Gene Smed_3872 details


Gene Information

Locus tag: Smed_3872
Symbol:
ID: 5318871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 329189
End bp: 330229
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 61%
IMG OID: 640775684
Product: peptidase C45 acyl-coenzyme A:6-aminopenicillanic acid acyl-transferase
Protein accession: YP_001312617
Protein GI: 150376021
COG category: [R] General function prediction only
COG ID: [COG4927] Predicted choloylglycine hydrolase
TIGRFAM ID:
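
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(329189, 330229)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1041 bp) and Protein Length (346 aa).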


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGCGT TGCCACGCAA GCGGTTCCGT TCTTATACTC CTCGTGTCAT GTACAAGACA 
TTCGTCGCAG CGCGAGAGGA CCGGCCCGGA GAAGCTTGGC TCTCCAGGTT TGCGGCCGGG
CGAGCCGAGG CGGAGAGGTG GTATTTCGGG CAGGCGCCAA TGGCGGCGAG CCCAAGTGCC
AAGGAGTGTC GTGCCGCTTT GATGCAGCAC ATGCCTGAGC TCGTCCCTCA CTACGAAAGC
GCCTGTGATC TCGTCGGAGA TGATGAGATC GCTCATCGGC TGCTCAGCCA CTACCGTCCC
GCGCCGGAGC GCTATGGCTG CAGTCAGTCT GTCTGGCTCG GAAAGGAAGG CCCGGCGCTG
ATCCGCAACT TCGACTACCC ACCGGATATC GTCTCTGACC GCTTCGAGAT GACCGATTGG
TCTGGCGTGA AGGTGATCGC GAAGATGCAG CGGCCCTGGG GAGGTTGCGT GGACGGGCTG
AATGAGGAGG GACTGGCCGC AAGCGTGACT CTCGGGGGTG GCCGCTCTCA GGGTCTCGGC
TTCTCGATCA TTCTTGTGAT GCGCTATTTG CTTGAAAATT TTCGTGAGGT CGGCGAGGCG
GTGAAGGCGC TTTGCCGAAT ACCCGTGGCG CTCGCACAGA ATGTCACGGT GCTGGATCGT
GCTGGCAGCT ACGCAACGCT GTTTCTTGGT CCGGGGCAGC GGCCGGTCAT CACGCGCCTG
AAGGCATGCA CGAACCATCA GCGGGGCGGA AGACCCTCAT CGTCTTCTTT GGCGCGACAG
CAATTTGTTC TGCAAGCACT GGAAGACCCA TCGATGTCGC TCGAGAAGCT GACCGACCGC
TTTCTCCAGC CGCCGCTCTA TTCCATGCGT CTTCCCCAAC CGACCCTGTA CACGGCTGTC
TACCGACCTG CGGAAGGGCG GGTGGATTAC ATCTGGCCAG GGAACCACTG GTCGCAAGGT
TTCGACGGCT TTGAGACAGG CGAGTACACC CATCGCTATG GATCATCGGG CGGCCCGCTG
GCCGAAAGTC CGGCATCTTA G
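
The GC content field is the fraction of G and C bases in a sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied in the space-separated blocks of ten used above; the function name `gc_fraction` is hypothetical. Only the first row of the gene sequence is used here, so the result describes that 60-base fragment and is not expected to match the 61% reported for the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above (60 bases only, not the full gene).
first_row = "GTGTCGGCGT TGCCACGCAA GCGGTTCCGT TCTTATACTC CTCGTGTCAT GTACAAGACA"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the full 1041-base sequence to the same function would give the gene-wide figure.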
 
Protein sequence
MSALPRKRFR SYTPRVMYKT FVAAREDRPG EAWLSRFAAG RAEAERWYFG QAPMAASPSA 
KECRAALMQH MPELVPHYES ACDLVGDDEI AHRLLSHYRP APERYGCSQS VWLGKEGPAL
IRNFDYPPDI VSDRFEMTDW SGVKVIAKMQ RPWGGCVDGL NEEGLAASVT LGGGRSQGLG
FSIILVMRYL LENFREVGEA VKALCRIPVA LAQNVTVLDR AGSYATLFLG PGQRPVITRL
KACTNHQRGG RPSSSSLARQ QFVLQALEDP SMSLEKLTDR FLQPPLYSMR LPQPTLYTAV
YRPAEGRVDY IWPGNHWSQG FDGFETGEYT HRYGSSGGPL AESPAS