Gene Smed_1171 details


Gene Information

Locus tag: Smed_1171
Symbol:
ID: 5322017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1247627
End bp: 1249156
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 64%
IMG OID: 640790112
Product: peptidase M23B
Protein accession: YP_001326857
Protein GI: 150396390
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
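
The start, end, and length fields in this record agree with each other, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1247627
end_bp = 1249156

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1530
print(protein_length)  # 509
```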


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.816034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.264871
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAAT TTGAATCTCC CCAGGCAGGC AAGTCTATGA TCCGTCTCTG CGCGGCTATA 
TTGCTTGCGG GTGTAGCGAC CGGTTGCAGT TCTGACGCCA GCCGCTTCGG CGGGCTCTTC
TCGCGGTCTG ACGACATAAT GACAGGCTCG ATCCCGCAGG GTTCGAGTAC GATGCCGAGA
GGCGATGTCG CAAGTGGCGA TGCAGCGCCC TCCTATGGAA ACAGCGCGTC GGTCGGTCAA
TCCTATCCCG CTGGCGGCGG CTACAATGCG GCAGCGGCGC CGGTCTCCAG TGCTCGTGTC
GCATCCACGC CTATGGCCGT CCAGCGTACG AGCCTCGACG AGCCGTCCGC CACATCCGGG
CAGCAGCAGG TTCGGACCGC TTCTCTCGAC TCGCAGGCCG CCGCCCTTCC GAGGTCGCAG
CCGCAGTCGG GGGGCGCCAG GGATATTCCG AGCAAGGGGG GGTGGAGTGC CTCCAACGCG
CCGACGATCT TGGTCCGTCA GGGCGACACG GTGACCGTTC TCGCCAGACG GTTCGGCGTC
CCCGAGAAGG AGATCCTGAG GGCGAACGGA TTGAAATCGT CAAGCCAGGT AGAGCCGGGC
CAGCGCCTGG TCATTCCGAC GTTTGGCGCC GCGGGCAGCG CCGCGAAGGC GGCTGCATCG
GGGTCGATCG CCGACGTGGA AGGTGGTAAG AGGCGCCCGT CGCCGCTGCC GACTGATCAG
CGCGAGGTCG CAATCCTCCC CGGTCAGTCC CAATCTCGCG AAAAGAACGA AAGCCGCAGC
GATGTGGCCG CAGGCAAGCT CAACAGCGCT GGCGAGGGCG GCGGCACTGG CGGTTATGCG
GTCAAGCCGG GCGACTCGCT GAACCGGATC GCCAAGGCGA ACGGTGTCTC GGTCGCTGCT
CTGAAGCAGG CAAACGGGCT TACGACGGAA GCCATCCGCA TCGGACAAAA ACTCAATATC
CCCAGCGCTT CGGCCAATAC GCCAGCGACC GACGCTGTCG TCACGGCTTC TGTCTCGCCC
AAGAAGAACG AAGCCAAGGT TGCCGCAACG GAGCAGTCCA AGCCGGCCGA AGCCAAGGCG
GCTGCCGCCA AGGAGAGCGT GTCCGAAGTC GCCATCAAGT CGGATGTCAA CGAAGATCTC
CCGAAGTCGA CCGGCATCGG GAAATACCGC TGGCCGGTCC GTGGCGCGGT CGTCGCTGCC
TATGGCGCCA ATGTCGATGG CAACCGGAAT GACGGTATCA ACATATCGGT TCCGGAGGGT
ACGCCTATCA AGGCGGCTGA GAACGGCGTC GTGATCTACT CGGGCAGCAG CCTCAAGGAG
CTCGGCAATG CCGTTCTGGT GCGCCACGAT GACGGTACCG TAACGGTCTA CGGCAATGCT
GCCGAACTGA AGGTGCAACG CGGTCAGAAG GTCCAGCGCG GGCAGACGCT CGCATCATCA
GGCATGACGG GCAGGGCATC GCGGCCGCAG GTGCACTTCG AGGTGCGCAA GAATGCGACC
CCGGTCAACC CGGTGACCTA TCTCGAATAG
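
A GC percentage like the one reported for this gene could be computed from the sequence text along the following lines. This is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it is demonstrated on the first sequence line only, so its output is not the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGCGTAAAT TTGAATCTCC CCAGGCAGGC AAGTCTATGA TCCGTCTCTG CGCGGCTATA"
print(round(gc_percent(first_line), 1))  # 50.0
```

Passing the full 1530 bp sequence, with or without the spaces and line breaks, gives the value for the whole gene.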
 
Protein sequence
MRKFESPQAG KSMIRLCAAI LLAGVATGCS SDASRFGGLF SRSDDIMTGS IPQGSSTMPR 
GDVASGDAAP SYGNSASVGQ SYPAGGGYNA AAAPVSSARV ASTPMAVQRT SLDEPSATSG
QQQVRTASLD SQAAALPRSQ PQSGGARDIP SKGGWSASNA PTILVRQGDT VTVLARRFGV
PEKEILRANG LKSSSQVEPG QRLVIPTFGA AGSAAKAAAS GSIADVEGGK RRPSPLPTDQ
REVAILPGQS QSREKNESRS DVAAGKLNSA GEGGGTGGYA VKPGDSLNRI AKANGVSVAA
LKQANGLTTE AIRIGQKLNI PSASANTPAT DAVVTASVSP KKNEAKVAAT EQSKPAEAKA
AAAKESVSEV AIKSDVNEDL PKSTGIGKYR WPVRGAVVAA YGANVDGNRN DGINISVPEG
TPIKAAENGV VIYSGSSLKE LGNAVLVRHD DGTVTVYGNA AELKVQRGQK VQRGQTLASS
GMTGRASRPQ VHFEVRKNAT PVNPVTYLE