Gene Smed_5571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5571 
Symbol 
ID5319873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp537229 
End bp538455 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content61% 
IMG OID640777319 
ProductOsmC family protein 
Protein accessionYP_001314251 
Protein GI150377656 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
[COG1765] Predicted redox protein, regulator of disulfide bond formation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0439292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTCA AAACGCAACG GCTCCAATTT TCCGGTCATT CAGGCGCGAC GCTCAGCGCC 
CGCCTTGATC TGCCAAATGG GCCTTTGCGC GCCTACGCGC TTTTTGCCCA TTGCTTCACC
TGTTCCAAGG ATCTGGCGGC AGCACGCCGG ATTGCAGTGG AGCTTGCGCG TGAAGGCATC
GCTGTCCTGC GTTTTGATTT CACGGGGCTG GGATCGAGCG AAGGCGAATT CGCTTCAACG
AATTTCTCCT CCAACGTTGC CGACCTTCTT TCAGCCGCGG ACTATTTACG CCAACACTAT
GAGGCGCCAG CGGTGCTGAT CGGCCACTCG CTCGGTGGTG CAGCGGTCCT CACCGTCGCC
GGGGACATTC CGGAAGTGCG CGCCGTAGCC ACCATAGGCG CGCCGGCTGA TGTCGGCCAC
GTATTGAAGA ACTTCGGAGC GAGCCTCGAC GAGATCGAGA AGAACGGCGA GGCCGACGTC
GATCTCGCCG GGCGCACGTT TCTCGTCAAA AGACAGTTTG TCGAGGACAC GCGTGCACAC
CGCATCAAGG ATGCTGTTGC GGGGCTGAAA AGACCGCTCC TCGTCCTTCA CGCGCCGCTG
GACCATACGG TCGGGATCGA GAACGCCACC GAGATCTTCG TCGCGGCGAG GCATCCGAAA
AGCTTCATTT CGCTGGACAA GGCTGACCAC CTGCTCACCG ACCGTGAGGA TGCGGCCTTT
GCCGGACGGA TCATTTCGGA ATGGCTGACA CGCTATCTTG CCGCCGACAC GCCGCAAGCC
ACCGGGCCGA TCGAATATGT CCGCGTGAGG GAAACGGGCG AAGGAAAGTT TCAGAACGCT
GTTCAGGCTG GCGGGCATCG GCTGTTCGCC GATGAACCCG AAAGCGTGGG CGGGCTTGAT
TCCGGACCAT CGCCCTACGA CTTCCTGGCG GTCGCACTTG GCGCCTGCAC CTCGATGACG
CTGCGCCTCT ATGCCGGCCA CAAGCAGCTG AAGCTCGGAC GCATCGCCGT CGACGTCTCG
CATGCCAAGA CTCATGCGAA GGATTGCGAG GAGTGCACCG AGCTGGAACG CAGTGGCAGC
GGCAGGATCG ATCGTTTCGA GCGCGTCATT TCCATCGATG GCGAGGTCTC GGAGGAGCTT
CGCGAGAAGA TCGGCGAAAT CGCCGGCAAG TGCCCGGTCC ATCGCACGCT CGAAGCAGTG
ACGAAGATAA AAACGGTCGT GAAGTAA
 
Protein sequence
MAFKTQRLQF SGHSGATLSA RLDLPNGPLR AYALFAHCFT CSKDLAAARR IAVELAREGI 
AVLRFDFTGL GSSEGEFAST NFSSNVADLL SAADYLRQHY EAPAVLIGHS LGGAAVLTVA
GDIPEVRAVA TIGAPADVGH VLKNFGASLD EIEKNGEADV DLAGRTFLVK RQFVEDTRAH
RIKDAVAGLK RPLLVLHAPL DHTVGIENAT EIFVAARHPK SFISLDKADH LLTDREDAAF
AGRIISEWLT RYLAADTPQA TGPIEYVRVR ETGEGKFQNA VQAGGHRLFA DEPESVGGLD
SGPSPYDFLA VALGACTSMT LRLYAGHKQL KLGRIAVDVS HAKTHAKDCE ECTELERSGS
GRIDRFERVI SIDGEVSEEL REKIGEIAGK CPVHRTLEAV TKIKTVVK