Gene Smed_3665 details


Gene Information

Locus tag: Smed_3665
Symbol:
ID: 5318062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 104426
End bp: 105604
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 64%
IMG OID: 640775478
Product: peptidase M42 family protein
Protein accession: YP_001312411
Protein GI: 150375815
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID: [TIGR03106] hydrolase, peptidase M42 family
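
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 104426
end_bp = 105604
gene_length_bp = 1179
protein_length_aa = 392


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```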


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00161837
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTGCGA CAACGCACTC CATGCAGACA CCGATCAATA TAGACCCGGA CTACCTCACC 
AGCCGGCTGA AAGCCCTGCT CGAAATTGCG AGCCCGACCG GTTTCACCGA TGAGGCGGTG
CGCTACACCG CGCGTGAGCT CGAGCGTCTC GGGCTGGAAG TGAAGCTGAC CCGCCGGGGC
GCAATTCGCG CTATGCGGCC GGGTGCGGCC GAACGGCCAG CCCGCGGGAT CGTCTCGCAC
CTGGACACGC TTGGTGCGCA GGTGAAGGCG CTGAAGGAGA ACGGCCGCCT CGAACTGGTC
TCGATCGGCC ACTGGTCGGC GCGGTTCGCG GAGGGGGCTC GCGGCTCGAT CTTTTCCGGC
AAGGGAACCT ATCGCGGAAC CATCCTTCCC CTGAAGGCTT CTGGACACAC TTTCAATGAC
GAGATCGACA CGCAACCGAC CGGCTGGCGG CACATCGAGC TGCGCGTCGA TGCGCTTGCC
CGCGACCGGA GCGATCTGGT GCAGCTCGGA ATTGACGTTG GCGACATTGT CGCCATCGAT
CCCCAGCCCG AGTTCCTCGA CAACGGGTTC ATCGTTTCGC GGCATCTCGA CGACAAGGCC
GGGGTGGCCA TCATGCTTGC GGCGCTCGAG GCCATGCAGC GCCAGAAGGT GGAAACGCCG
GTCGACACCT ATTGGCTCTT CACCATCGGC GAGGAGGTGG GCGTTGGCGC CTCGGCTGCA
ATCGTTCCGG AAATCGCCTC TCTGGTGGCG ATCGACAACG GTACGACCGC GCCGGGCCAG
AATTCGGACG AGTTCGGCGT TACGCTCGCC ATGGCGGACC AGACAGGGCC CTTCGACTAT
CATCTCTCGA GAAAGCTCTA CGAACTTTGC GGCGAGCATG GCATCCGCGT TCAGAAGGAC
GTCTTCCGCT ACTATCGCTC CGACGCCGCA TCCGCGCTCG AAGCGGGGCA CGACGTCCGC
ACGGCGCTTC TTACCTTCGG CGTCGACGCG TCGCACGGCT ATGAGCGCAT CCATCTTCAC
GCGCTGATGT CGGTTGCGAA GCTTGCGGTG TATCACGCGG CAAGCGAGGT CCAGATCGAG
CGTGACGCGG AGGAAGTCTC CGGGCTTCGG GGCTTTACCC GCCAAAAGGT TCGGCAGGCC
GAGCAGGATC TGAAGGCCGA CGAGCCCGAG GGACCTTAG
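
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs light cleaning before any computation. The following is a minimal sketch of how the sequence could be joined and its GC percentage computed for comparison with the GC content field; `clean_sequence` and `gc_percent` are hypothetical helper names, and the rounding used by the source database is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATTGCGA CAACGCACTC CATGCAGACA CCGATCAATA TAGACCCGGA CTACCTCACC"
print(len(clean_sequence(first_line)))  # 60 bases: six blocks of ten
print(round(gc_percent(first_line), 1))
```

Passing the full sequence text instead of `first_line` gives the value to compare against the reported GC content.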
 
Protein sequence
MIATTHSMQT PINIDPDYLT SRLKALLEIA SPTGFTDEAV RYTARELERL GLEVKLTRRG 
AIRAMRPGAA ERPARGIVSH LDTLGAQVKA LKENGRLELV SIGHWSARFA EGARGSIFSG
KGTYRGTILP LKASGHTFND EIDTQPTGWR HIELRVDALA RDRSDLVQLG IDVGDIVAID
PQPEFLDNGF IVSRHLDDKA GVAIMLAALE AMQRQKVETP VDTYWLFTIG EEVGVGASAA
IVPEIASLVA IDNGTTAPGQ NSDEFGVTLA MADQTGPFDY HLSRKLYELC GEHGIRVQKD
VFRYYRSDAA SALEAGHDVR TALLTFGVDA SHGYERIHLH ALMSVAKLAV YHAASEVQIE
RDAEEVSGLR GFTRQKVRQA EQDLKADEPE GP
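
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled translator, not the pipeline that generated this record. It relies on the fact that translation table 11 (bacterial, archaeal and plant plastid) uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with
# the amino acids of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGATTGCGA CAACGCACTC CATGCAGACA"))  # MIATTHSMQT
```

The ten residues printed match the first block of the protein sequence above; translating the full gene sequence the same way allows the remaining residues to be checked.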