Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0058 |
Symbol | |
ID | 5320885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 61664 |
End bp | 65260 |
Gene Length | 3597 bp |
Protein Length | 1198 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640788989 |
Product | TonB-dependent receptor |
Protein accession | YP_001325753 |
Protein GI | 150395286 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.872531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.103238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTAA TAAGAACGGG GGCATTGCTT GCCGCGGCGA CTCTATTTGT GCCTTGGTCT GCGTTTGCCG AGCCCGTTCC ACGCCCCTCA CCCGCAGCCG GTTCCGTCAT TGCCCGGAAA TCCGGCGAAG AGGTGCGCTT CGTCGATGTT TCCAGCTGGC GGATCGTGGA TCTCGCTCAG GACCTGCTTC CCGGCGATGT GCTGCGCACC AACGCGACCG GTGCGCTCGC CGTCCTCTTC AAAGATCACA CGCAGATAAG GCTGGGGCGC AATACCGCGC TGCGCGTGAA GCAGATCGGC GCGGGCGACA CGAAGCTCGG CCTGGAATCG GGTACGATCT GGGCACGTGC CGAACGTGGC GGCGACGGCC TTGTCATCGA TACGCCGGCA GCAGCCGCGG CCATCCGCGG CACTGACTGG ACGCTCTCTG TCGGCACAGA CGGCAAGACG TCACTTGTCG TCCTCGAAGG GGTGGTCGAA CTCAGCAATG CCCATGGCAG CGTCACGGTG AATCAGGGTG AAGGCGCGGT TGCCGCGATC GGCAGCGCGC CCACAAAAGT CGTGATCGTT ACGCCGAAAG ACCGTGAGCA GATGCTCTTC CATCTGTCGC TCCGCAACGC CTTCGTCTGG ATGCCGGCCA CCCCTTTCAG CGTTCCCGAG ATGCGCCGCG AACGGGCGCG TATCGAGACC GAGCCGGCAG CGTCCCGCAC GGCGGAGGAA TGGCTGACAC TTTCCGAGAT CTATCTGTCG CTCGATGGCC GCCGCAAGGC ACTGGCCGCG CTCGCGCAGG CAACGAGCCG GGGCCTTACC GGTGCGCAGG CGGCGCGTGC CGACCTCATC AGGGCATTGA TCGCCGGTTC CTCCAATCAG TACCGCGAGG CGGCGCGCCT GTTCGCGAAG TCCGAACGCG GCCTCGACCC GAGCCGACGC ACGATCGCGG CCTATGGCGG CTATTTTGCG CGCGGGCTGG CTAATCCCGA CCGCGTGGAA AATCCGCCGC GCGATGCAAG CGGCCGCTAT GCGGGCATGG CCCAGGCGTG GACGGCAGGT TTTCGGGAAG ACATCCCCGC AGCGATCGCG GTGATCAAAA GGGCTGAGCG GCAATATCCT GATGATCCCA CTTTGCCCGC TGCCCGCGCG CAGCTCGCGA TGTTGCTGGA CGACCGCGAC GAGCTTCGCG ACGGCGTCGA GCGGGCGCTT TCGATCGATC CCGAAGATCC GACTGCGCTC GAGGCGCGTG CCCATTACCG CTATCACATC GACAACGATC TCGAGGGCGC CCTTTCGGAT CTGAACCGGG CGCTGCAGAC CGCGCCCGGT TCCTCGTCCA TCTGGAATGC GCTCGGTCTG GTCCAGGGCG CGCGCGGCGA CAATCGGGCC GCGGAGGCCG CATTCAAGCA GGCGATCGCG CTCGATCCGC TGGATCCTGT CTCTCACGCA AACCTCGCAA TACAATATCT GGATGAAATG CGAATGGCCG AGGCGAAGCG CGAGATCGAT ACCGCGCTTT CGGTCGATCC GTCCTTCGAC GTCGCGCTTG TGGCGCGCGG CCGCTATCAA ATGCAGAACG GCGATGTTGA CAGGGCGGTC GAAGACCTTC TCGCCGGCTC GACGGCCAAC CCGGCCTATT CGAACGCACA GCTCCTGCTT GCCGCCGCCC ATTACGAAAA AGGCGACCGC ATCCCGGCAG CTCAGGCGCT CGACAATGCC GACCGGCTCG ACCCGAACGA CCCCGTCGTA GCGACGGTCA GAACCGCGAT AGCAATCGAC GCCTATGATG CCGACGCGGC CATTCTCAAC GCTCAGGAAG CCCTGCGCCG CACCCGCGCC AAGGGCGGCG ACACGGCGGC ACTCGGGGCC AATCAGGAGC AGGGCTCGAC ACTCAACGAC GCCTTCCGCC TGCAGGGGCT CGATGCCTGG GGCCAGTATT ACGGCGATGC CGTCTTCGAT CCTTTCACCG GCGCGAGCTA TGTAGACCAG GCGGTCCGCG GCAGCGTCAG TCCCTTCTTC AACAGTTATG ATTTCGCCGC CAACGCCATC ACCAACACGG TCAATACGAC GAGCTTTTCC GCTCTCATCC AGGGACTTCT GATCGAGCCG CACATGCTCG CCAGCCGCGA GCGAACGGTC AACCTGCTGC GTTCGCCCTT TTTCGAAACA GAGATCGGGG GTGGATTCAT TGCCAATGAG GACCATACCG GTTGGGTCGG CGAAGCCGCT GTTCGCGGTT TCACGGTTTC GCCGTTCCCG ATCAGCGTCT ATGGCACGTT TCAGTGGGAG GAGCCGCGGG ATACGTTCGA GTCCGACGGC TTGCGCGTAG AACGCGAACT GCGGATTATA GGAGGCAATG GTTATGTTAC GGCCAGCCCG ACGCCCGACG ACCGCGTCGT TGCTTTCACC AATTATTCGG ACGTCGACGA TGCCCGGGAG TTACTGCCGG TCCCGCTGCA AGTCGAGATT GGGGACGACT CTTCCGGTGT GACTTCGGGG CTCGCCTGGA GTCATACATT CGGCTACCGC AGCATCGGCA ACGCCGCCTT GTTTTTCAAA GAGCTCCGGA CCGGAGACAG TGAAATTCAG CGCGCTAGTG GGGCTGAGAT CAGCGCTGAT GTCGACGCAA AAGAACGAAC CTACATTGCT GCCTTGAACC ACATGTATGG AGAGGGCGAC CTGACGTGGC GCTACGGCGC CGAAGCCGGG AAGATAAGGT CCGACATCAG GACGATCTTA AATATTAACG TACCGCCGCT CCCTTCCGAA ACCGGAACAG ATTACAGATC ATCCACGCAA GCGGTGGCAA AGGCCTATGT CGACGGCCTT TACGAAATCA CCCCCGACCT CAAGATCGAA GGCGCGCTGT TCGCCCGTTA CATAGAGGAC GCTAACGACA ACAATATCCG ACTTGAGCCG AAGCTCGGTA TCGCGTGGGC ACCGGCCGAG CGGCATTGGC TGCGCGCCGC CATCCAGCGC GAGGGCTACA ATTTCGGCTC TGCTACGCTT GCGCCGATCG GCATCGTAGG GCTGCAGCCC AACCAGTTTT TGATCGGCAC CGACGGCTAT GCCGACACGC TGGCGCTGCG CTGGGACGCA GAATGGAACG ACTGGCTCTT TACGGCCGTG GACTACCAGC ATCAGGAGAT CCTCAACGGT TCCATCGATC TCCCCTTCTC CTTGGCTGAC TTCGACTTCG AAAAAGCAAG GGTCAACCGC GTTGCACTGA CCGCAAACCT CGCGCTCGGC CACGGCTTCG GCCTTTCCGC CACAGTGGCC CGCACGGAAA GCGACGATCT GTCGACGGGC CGCAGCGGCG ATCTGCCCTT CCTGCCGGAA AACTCGGGTC AGATAGCGCT GACCTATGTC AGCACCGCCA GTATCAAGAC GACTGTCGCC GCCAACTATA TCGGCAAACG CAATGACAGT TCCACGACGC TCGATGATTT CTGGACGCTC GACGCCGCCC TCCAATGGGA ACCCTTCGAC AAGCGGTTCG AGGTGGAACT CGCCGGCTTC AATCTGCTCG ATGAGGAATT CGAGCTGCGG GACGGGCTGC CCGGCTGGGG TCCCACGGTC AAGGGCACAG TCAAAGTGCG GTTCTGA
|
Protein sequence | MQVIRTGALL AAATLFVPWS AFAEPVPRPS PAAGSVIARK SGEEVRFVDV SSWRIVDLAQ DLLPGDVLRT NATGALAVLF KDHTQIRLGR NTALRVKQIG AGDTKLGLES GTIWARAERG GDGLVIDTPA AAAAIRGTDW TLSVGTDGKT SLVVLEGVVE LSNAHGSVTV NQGEGAVAAI GSAPTKVVIV TPKDREQMLF HLSLRNAFVW MPATPFSVPE MRRERARIET EPAASRTAEE WLTLSEIYLS LDGRRKALAA LAQATSRGLT GAQAARADLI RALIAGSSNQ YREAARLFAK SERGLDPSRR TIAAYGGYFA RGLANPDRVE NPPRDASGRY AGMAQAWTAG FREDIPAAIA VIKRAERQYP DDPTLPAARA QLAMLLDDRD ELRDGVERAL SIDPEDPTAL EARAHYRYHI DNDLEGALSD LNRALQTAPG SSSIWNALGL VQGARGDNRA AEAAFKQAIA LDPLDPVSHA NLAIQYLDEM RMAEAKREID TALSVDPSFD VALVARGRYQ MQNGDVDRAV EDLLAGSTAN PAYSNAQLLL AAAHYEKGDR IPAAQALDNA DRLDPNDPVV ATVRTAIAID AYDADAAILN AQEALRRTRA KGGDTAALGA NQEQGSTLND AFRLQGLDAW GQYYGDAVFD PFTGASYVDQ AVRGSVSPFF NSYDFAANAI TNTVNTTSFS ALIQGLLIEP HMLASRERTV NLLRSPFFET EIGGGFIANE DHTGWVGEAA VRGFTVSPFP ISVYGTFQWE EPRDTFESDG LRVERELRII GGNGYVTASP TPDDRVVAFT NYSDVDDARE LLPVPLQVEI GDDSSGVTSG LAWSHTFGYR SIGNAALFFK ELRTGDSEIQ RASGAEISAD VDAKERTYIA ALNHMYGEGD LTWRYGAEAG KIRSDIRTIL NINVPPLPSE TGTDYRSSTQ AVAKAYVDGL YEITPDLKIE GALFARYIED ANDNNIRLEP KLGIAWAPAE RHWLRAAIQR EGYNFGSATL APIGIVGLQP NQFLIGTDGY ADTLALRWDA EWNDWLFTAV DYQHQEILNG SIDLPFSLAD FDFEKARVNR VALTANLALG HGFGLSATVA RTESDDLSTG RSGDLPFLPE NSGQIALTYV STASIKTTVA ANYIGKRNDS STTLDDFWTL DAALQWEPFD KRFEVELAGF NLLDEEFELR DGLPGWGPTV KGTVKVRF
|
| |