Gene Information |
Locus tag | Smed_4519 |
Symbol | |
ID | 5318494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1004915 |
End bp | 1006273 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640776320 |
Product | hypothetical protein |
Protein accession | YP_001313252 |
Protein GI | 150376656 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
|
|
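The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1004915
END_BP = 1006273


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1359 452
```

Both results agree with the Gene Length (1359 bp) and Protein Length (452 aa) rows.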
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.829865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.308497 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGCA TCCTCTTCCT GTTGTTTAGC CTCCTGTCGG CCTTCATGCT TCCTGTCTCG CCGGCTCCGG CAGCCGAGCG GAAAGCAATC GTCCTGCACG TAAACGGCGC CATCAGTCCG GCCACGGCGG AATATGTAAC GCGCGGCCTG CGGCGGGCCA GGGACCGCGG TGTCGCGCTG GTGGTCCTGC AGATGGATAC GCCCGGCGGA CTGGATACGT CGATGCGCGA AATCATACGC GCCATCCTCG ATTCACCCGT GCCGGTCGCG AGTTTCGTCG CGCCCAGCGG AGCGCGGGCG GCAAGCGCCG GCACCTATAT TCTTTATGCG AGCCACATCG CGGCCATGGC GCCCGGAACC AATCTGGGCG CCGCCACGCC GATCGCCATC GGCGGTGGAC TTTTTGGCGA TGACGAGCGG GACGGCGAGG AAACGCCGGG CGAACCGGAC AAGCAGGAAC CCCGCAAGCC GGCCGATGCC GGCGAGGCGA AACTTATCAA CGATGCAATC GCTTATATCC GCGGCCTTGC GGAGTTGCGC GGTCGCAATA TCGACTGGGC GGAGCGCGCC GTGCGCGAGG CTGCCAGCCT TTCCTCGGCC GCAGCCGCGC GCGAACAGGT CATCGACTTC ACCGCGATCA ACCTCAATGA CCTCCTCAAG CAGGCACACG GTCGCTCCGT TCGCATCGGT CAATCCGACG TCCGGCTCGA TACTGCAGGG CTCTTCATCG AGGACTTGCC GCCCGATTGG CGCACGCAGC TCTTATCGGT GATCACCAAT CCCAATGTCG CTCTCCTTCT GATGATGGTC GGCATCTATG GGCTCATCTT CGAGTTTCTC TCACCCGGCA CCGTTGTGCC CGGGACCATT GGCGGCATAA GTCTCCTGCT CGGTCTCTAC GCCCTGGCGG TGCTGCCTGT GAGCTATGCC GGCGTTGCCC TCATCCTGCT CGGAGCCGGG CTGCTGGTCG CGGAAGCGCA TGCGCCGTCT TTCGGCGTTC TCGGCCTTGG CAGCGCCGTC GCGCTGGTGC TCGGTGCCGC AATTCTTTTC GACACGGACG TACCGGGACT GCAGGTGTCC TGGCCGGTTC TGAGCGGCAT CGGGTTCGCA AGCCTGGCTT TCGGCCTGCT GGTCGCCCGT CTCGCTCTTC TCTCGAGCCG ACACAAGATC CTCACCGGAG CGGAGGAGAT GATCGGCATC TCCGGAAAGG TCGACAGCTG GGAGGGAGCG GGCGGCTACG TGATTGCCCA CGGCGAGCGG TGGAGCGCAG TCAGCAATGA ACCGCTCGGT CCGGGAGAGG ACGTCATGGT CGTCGGCCGT CAGAGTTTGA CGCTGGAGGT GGCGCGCAAG CCAACTTGA
|
Protein sequence | MARILFLLFS LLSAFMLPVS PAPAAERKAI VLHVNGAISP ATAEYVTRGL RRARDRGVAL VVLQMDTPGG LDTSMREIIR AILDSPVPVA SFVAPSGARA ASAGTYILYA SHIAAMAPGT NLGAATPIAI GGGLFGDDER DGEETPGEPD KQEPRKPADA GEAKLINDAI AYIRGLAELR GRNIDWAERA VREAASLSSA AAAREQVIDF TAINLNDLLK QAHGRSVRIG QSDVRLDTAG LFIEDLPPDW RTQLLSVITN PNVALLLMMV GIYGLIFEFL SPGTVVPGTI GGISLLLGLY ALAVLPVSYA GVALILLGAG LLVAEAHAPS FGVLGLGSAV ALVLGAAILF DTDVPGLQVS WPVLSGIGFA SLAFGLLVAR LALLSSRHKI LTGAEEMIGI SGKVDSWEGA GGYVIAHGER WSAVSNEPLG PGEDVMVVGR QSLTLEVARK PT
|
| |
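The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be handled, the code below strips the spacing, computes a GC fraction, and translates DNA to protein. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and all function names are hypothetical. Only the first three blocks of the gene sequence are used here, so paste the full sequence to check the whole record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGGCTCGCA TCCTCTTCCT GTTGTTTAGC"
print(translate(first_blocks))  # prints: MARILFLLFS
```

The translated fragment matches the first block of the protein sequence row. Applying `gc_fraction` to the full gene sequence gives a value to compare with the GC content row.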