Gene Smed_0058 details

Gene Information

Locus tag: Smed_0058
Symbol:
ID: 5320885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 61664
End bp: 65260
Gene Length: 3597 bp
Protein Length: 1198 aa
Translation table: 11
GC content: 63%
IMG OID: 640788989
Product: TonB-dependent receptor
Protein accession: YP_001325753
Protein GI: 150395286
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
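
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon. The record does not state either convention.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 61664
end_bp = 65260
gene_length_bp = 3597
protein_length_aa = 1198

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codons = gene_length_bp // 3
residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("residues match protein length:", residues == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values, which suggests the record is internally consistent.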


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.872531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.103238
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGTAA TAAGAACGGG GGCATTGCTT GCCGCGGCGA CTCTATTTGT GCCTTGGTCT 
GCGTTTGCCG AGCCCGTTCC ACGCCCCTCA CCCGCAGCCG GTTCCGTCAT TGCCCGGAAA
TCCGGCGAAG AGGTGCGCTT CGTCGATGTT TCCAGCTGGC GGATCGTGGA TCTCGCTCAG
GACCTGCTTC CCGGCGATGT GCTGCGCACC AACGCGACCG GTGCGCTCGC CGTCCTCTTC
AAAGATCACA CGCAGATAAG GCTGGGGCGC AATACCGCGC TGCGCGTGAA GCAGATCGGC
GCGGGCGACA CGAAGCTCGG CCTGGAATCG GGTACGATCT GGGCACGTGC CGAACGTGGC
GGCGACGGCC TTGTCATCGA TACGCCGGCA GCAGCCGCGG CCATCCGCGG CACTGACTGG
ACGCTCTCTG TCGGCACAGA CGGCAAGACG TCACTTGTCG TCCTCGAAGG GGTGGTCGAA
CTCAGCAATG CCCATGGCAG CGTCACGGTG AATCAGGGTG AAGGCGCGGT TGCCGCGATC
GGCAGCGCGC CCACAAAAGT CGTGATCGTT ACGCCGAAAG ACCGTGAGCA GATGCTCTTC
CATCTGTCGC TCCGCAACGC CTTCGTCTGG ATGCCGGCCA CCCCTTTCAG CGTTCCCGAG
ATGCGCCGCG AACGGGCGCG TATCGAGACC GAGCCGGCAG CGTCCCGCAC GGCGGAGGAA
TGGCTGACAC TTTCCGAGAT CTATCTGTCG CTCGATGGCC GCCGCAAGGC ACTGGCCGCG
CTCGCGCAGG CAACGAGCCG GGGCCTTACC GGTGCGCAGG CGGCGCGTGC CGACCTCATC
AGGGCATTGA TCGCCGGTTC CTCCAATCAG TACCGCGAGG CGGCGCGCCT GTTCGCGAAG
TCCGAACGCG GCCTCGACCC GAGCCGACGC ACGATCGCGG CCTATGGCGG CTATTTTGCG
CGCGGGCTGG CTAATCCCGA CCGCGTGGAA AATCCGCCGC GCGATGCAAG CGGCCGCTAT
GCGGGCATGG CCCAGGCGTG GACGGCAGGT TTTCGGGAAG ACATCCCCGC AGCGATCGCG
GTGATCAAAA GGGCTGAGCG GCAATATCCT GATGATCCCA CTTTGCCCGC TGCCCGCGCG
CAGCTCGCGA TGTTGCTGGA CGACCGCGAC GAGCTTCGCG ACGGCGTCGA GCGGGCGCTT
TCGATCGATC CCGAAGATCC GACTGCGCTC GAGGCGCGTG CCCATTACCG CTATCACATC
GACAACGATC TCGAGGGCGC CCTTTCGGAT CTGAACCGGG CGCTGCAGAC CGCGCCCGGT
TCCTCGTCCA TCTGGAATGC GCTCGGTCTG GTCCAGGGCG CGCGCGGCGA CAATCGGGCC
GCGGAGGCCG CATTCAAGCA GGCGATCGCG CTCGATCCGC TGGATCCTGT CTCTCACGCA
AACCTCGCAA TACAATATCT GGATGAAATG CGAATGGCCG AGGCGAAGCG CGAGATCGAT
ACCGCGCTTT CGGTCGATCC GTCCTTCGAC GTCGCGCTTG TGGCGCGCGG CCGCTATCAA
ATGCAGAACG GCGATGTTGA CAGGGCGGTC GAAGACCTTC TCGCCGGCTC GACGGCCAAC
CCGGCCTATT CGAACGCACA GCTCCTGCTT GCCGCCGCCC ATTACGAAAA AGGCGACCGC
ATCCCGGCAG CTCAGGCGCT CGACAATGCC GACCGGCTCG ACCCGAACGA CCCCGTCGTA
GCGACGGTCA GAACCGCGAT AGCAATCGAC GCCTATGATG CCGACGCGGC CATTCTCAAC
GCTCAGGAAG CCCTGCGCCG CACCCGCGCC AAGGGCGGCG ACACGGCGGC ACTCGGGGCC
AATCAGGAGC AGGGCTCGAC ACTCAACGAC GCCTTCCGCC TGCAGGGGCT CGATGCCTGG
GGCCAGTATT ACGGCGATGC CGTCTTCGAT CCTTTCACCG GCGCGAGCTA TGTAGACCAG
GCGGTCCGCG GCAGCGTCAG TCCCTTCTTC AACAGTTATG ATTTCGCCGC CAACGCCATC
ACCAACACGG TCAATACGAC GAGCTTTTCC GCTCTCATCC AGGGACTTCT GATCGAGCCG
CACATGCTCG CCAGCCGCGA GCGAACGGTC AACCTGCTGC GTTCGCCCTT TTTCGAAACA
GAGATCGGGG GTGGATTCAT TGCCAATGAG GACCATACCG GTTGGGTCGG CGAAGCCGCT
GTTCGCGGTT TCACGGTTTC GCCGTTCCCG ATCAGCGTCT ATGGCACGTT TCAGTGGGAG
GAGCCGCGGG ATACGTTCGA GTCCGACGGC TTGCGCGTAG AACGCGAACT GCGGATTATA
GGAGGCAATG GTTATGTTAC GGCCAGCCCG ACGCCCGACG ACCGCGTCGT TGCTTTCACC
AATTATTCGG ACGTCGACGA TGCCCGGGAG TTACTGCCGG TCCCGCTGCA AGTCGAGATT
GGGGACGACT CTTCCGGTGT GACTTCGGGG CTCGCCTGGA GTCATACATT CGGCTACCGC
AGCATCGGCA ACGCCGCCTT GTTTTTCAAA GAGCTCCGGA CCGGAGACAG TGAAATTCAG
CGCGCTAGTG GGGCTGAGAT CAGCGCTGAT GTCGACGCAA AAGAACGAAC CTACATTGCT
GCCTTGAACC ACATGTATGG AGAGGGCGAC CTGACGTGGC GCTACGGCGC CGAAGCCGGG
AAGATAAGGT CCGACATCAG GACGATCTTA AATATTAACG TACCGCCGCT CCCTTCCGAA
ACCGGAACAG ATTACAGATC ATCCACGCAA GCGGTGGCAA AGGCCTATGT CGACGGCCTT
TACGAAATCA CCCCCGACCT CAAGATCGAA GGCGCGCTGT TCGCCCGTTA CATAGAGGAC
GCTAACGACA ACAATATCCG ACTTGAGCCG AAGCTCGGTA TCGCGTGGGC ACCGGCCGAG
CGGCATTGGC TGCGCGCCGC CATCCAGCGC GAGGGCTACA ATTTCGGCTC TGCTACGCTT
GCGCCGATCG GCATCGTAGG GCTGCAGCCC AACCAGTTTT TGATCGGCAC CGACGGCTAT
GCCGACACGC TGGCGCTGCG CTGGGACGCA GAATGGAACG ACTGGCTCTT TACGGCCGTG
GACTACCAGC ATCAGGAGAT CCTCAACGGT TCCATCGATC TCCCCTTCTC CTTGGCTGAC
TTCGACTTCG AAAAAGCAAG GGTCAACCGC GTTGCACTGA CCGCAAACCT CGCGCTCGGC
CACGGCTTCG GCCTTTCCGC CACAGTGGCC CGCACGGAAA GCGACGATCT GTCGACGGGC
CGCAGCGGCG ATCTGCCCTT CCTGCCGGAA AACTCGGGTC AGATAGCGCT GACCTATGTC
AGCACCGCCA GTATCAAGAC GACTGTCGCC GCCAACTATA TCGGCAAACG CAATGACAGT
TCCACGACGC TCGATGATTT CTGGACGCTC GACGCCGCCC TCCAATGGGA ACCCTTCGAC
AAGCGGTTCG AGGTGGAACT CGCCGGCTTC AATCTGCTCG ATGAGGAATT CGAGCTGCGG
GACGGGCTGC CCGGCTGGGG TCCCACGGTC AAGGGCACAG TCAAAGTGCG GTTCTGA
 
Protein sequence
MQVIRTGALL AAATLFVPWS AFAEPVPRPS PAAGSVIARK SGEEVRFVDV SSWRIVDLAQ 
DLLPGDVLRT NATGALAVLF KDHTQIRLGR NTALRVKQIG AGDTKLGLES GTIWARAERG
GDGLVIDTPA AAAAIRGTDW TLSVGTDGKT SLVVLEGVVE LSNAHGSVTV NQGEGAVAAI
GSAPTKVVIV TPKDREQMLF HLSLRNAFVW MPATPFSVPE MRRERARIET EPAASRTAEE
WLTLSEIYLS LDGRRKALAA LAQATSRGLT GAQAARADLI RALIAGSSNQ YREAARLFAK
SERGLDPSRR TIAAYGGYFA RGLANPDRVE NPPRDASGRY AGMAQAWTAG FREDIPAAIA
VIKRAERQYP DDPTLPAARA QLAMLLDDRD ELRDGVERAL SIDPEDPTAL EARAHYRYHI
DNDLEGALSD LNRALQTAPG SSSIWNALGL VQGARGDNRA AEAAFKQAIA LDPLDPVSHA
NLAIQYLDEM RMAEAKREID TALSVDPSFD VALVARGRYQ MQNGDVDRAV EDLLAGSTAN
PAYSNAQLLL AAAHYEKGDR IPAAQALDNA DRLDPNDPVV ATVRTAIAID AYDADAAILN
AQEALRRTRA KGGDTAALGA NQEQGSTLND AFRLQGLDAW GQYYGDAVFD PFTGASYVDQ
AVRGSVSPFF NSYDFAANAI TNTVNTTSFS ALIQGLLIEP HMLASRERTV NLLRSPFFET
EIGGGFIANE DHTGWVGEAA VRGFTVSPFP ISVYGTFQWE EPRDTFESDG LRVERELRII
GGNGYVTASP TPDDRVVAFT NYSDVDDARE LLPVPLQVEI GDDSSGVTSG LAWSHTFGYR
SIGNAALFFK ELRTGDSEIQ RASGAEISAD VDAKERTYIA ALNHMYGEGD LTWRYGAEAG
KIRSDIRTIL NINVPPLPSE TGTDYRSSTQ AVAKAYVDGL YEITPDLKIE GALFARYIED
ANDNNIRLEP KLGIAWAPAE RHWLRAAIQR EGYNFGSATL APIGIVGLQP NQFLIGTDGY
ADTLALRWDA EWNDWLFTAV DYQHQEILNG SIDLPFSLAD FDFEKARVNR VALTANLALG
HGFGLSATVA RTESDDLSTG RSGDLPFLPE NSGQIALTYV STASIKTTVA ANYIGKRNDS
STTLDDFWTL DAALQWEPFD KRFEVELAGF NLLDEEFELR DGLPGWGPTV KGTVKVRF