Gene Information |
Locus tag | Smed_3665 |
Symbol | |
ID | 5318062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 104426 |
End bp | 105604 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640775478 |
Product | peptidase M42 family protein |
Protein accession | YP_001312411 |
Protein GI | 150375815 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | [TIGR03106] hydrolase, peptidase M42 family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00161837 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTGCGA CAACGCACTC CATGCAGACA CCGATCAATA TAGACCCGGA CTACCTCACC AGCCGGCTGA AAGCCCTGCT CGAAATTGCG AGCCCGACCG GTTTCACCGA TGAGGCGGTG CGCTACACCG CGCGTGAGCT CGAGCGTCTC GGGCTGGAAG TGAAGCTGAC CCGCCGGGGC GCAATTCGCG CTATGCGGCC GGGTGCGGCC GAACGGCCAG CCCGCGGGAT CGTCTCGCAC CTGGACACGC TTGGTGCGCA GGTGAAGGCG CTGAAGGAGA ACGGCCGCCT CGAACTGGTC TCGATCGGCC ACTGGTCGGC GCGGTTCGCG GAGGGGGCTC GCGGCTCGAT CTTTTCCGGC AAGGGAACCT ATCGCGGAAC CATCCTTCCC CTGAAGGCTT CTGGACACAC TTTCAATGAC GAGATCGACA CGCAACCGAC CGGCTGGCGG CACATCGAGC TGCGCGTCGA TGCGCTTGCC CGCGACCGGA GCGATCTGGT GCAGCTCGGA ATTGACGTTG GCGACATTGT CGCCATCGAT CCCCAGCCCG AGTTCCTCGA CAACGGGTTC ATCGTTTCGC GGCATCTCGA CGACAAGGCC GGGGTGGCCA TCATGCTTGC GGCGCTCGAG GCCATGCAGC GCCAGAAGGT GGAAACGCCG GTCGACACCT ATTGGCTCTT CACCATCGGC GAGGAGGTGG GCGTTGGCGC CTCGGCTGCA ATCGTTCCGG AAATCGCCTC TCTGGTGGCG ATCGACAACG GTACGACCGC GCCGGGCCAG AATTCGGACG AGTTCGGCGT TACGCTCGCC ATGGCGGACC AGACAGGGCC CTTCGACTAT CATCTCTCGA GAAAGCTCTA CGAACTTTGC GGCGAGCATG GCATCCGCGT TCAGAAGGAC GTCTTCCGCT ACTATCGCTC CGACGCCGCA TCCGCGCTCG AAGCGGGGCA CGACGTCCGC ACGGCGCTTC TTACCTTCGG CGTCGACGCG TCGCACGGCT ATGAGCGCAT CCATCTTCAC GCGCTGATGT CGGTTGCGAA GCTTGCGGTG TATCACGCGG CAAGCGAGGT CCAGATCGAG CGTGACGCGG AGGAAGTCTC CGGGCTTCGG GGCTTTACCC GCCAAAAGGT TCGGCAGGCC GAGCAGGATC TGAAGGCCGA CGAGCCCGAG GGACCTTAG
Protein sequence | MIATTHSMQT PINIDPDYLT SRLKALLEIA SPTGFTDEAV RYTARELERL GLEVKLTRRG AIRAMRPGAA ERPARGIVSH LDTLGAQVKA LKENGRLELV SIGHWSARFA EGARGSIFSG KGTYRGTILP LKASGHTFND EIDTQPTGWR HIELRVDALA RDRSDLVQLG IDVGDIVAID PQPEFLDNGF IVSRHLDDKA GVAIMLAALE AMQRQKVETP VDTYWLFTIG EEVGVGASAA IVPEIASLVA IDNGTTAPGQ NSDEFGVTLA MADQTGPFDY HLSRKLYELC GEHGIRVQKD VFRYYRSDAA SALEAGHDVR TALLTFGVDA SHGYERIHLH ALMSVAKLAV YHAASEVQIE RDAEEVSGLR GFTRQKVRQA EQDLKADEPE GP
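The length fields in this record can be cross-checked against its coordinates. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions, that the CDS is unspliced, and that the final codon is a stop codon that is not counted in Protein Length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information table above.
length_bp = gene_length(104426, 105604)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1179 bp) and Protein Length (392 aa). `gc_percent` can be applied to the Gene sequence field as-is, since the spaces between the 10-base groups are skipped.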