Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4517 |
Symbol | |
ID | 5318080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1001730 |
End bp | 1003130 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640776318 |
Product | hypothetical protein |
Protein accession | YP_001313250 |
Protein GI | 150376654 |
COG category | [S] Function unknown |
COG ID | [COG4529] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.198903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTC GTGGATTGTT TGCCCGCAGG TCAGCAATGG CCGCCTTGTC GAAGCCGGAA ATCGCGATCG TTGGCCGCGG TTTTTCGGGC ATCATGATGG CGATCGCGTT GATGAAATCG ATGCGGGTAT CTTTCCACCT CACCATGTAT GATCCGCATC CATCGATCAG CGGGGGCCAG GTATTTTCCG CCGCTCAGCG CTGCGAGATA CTCAACAGCC GCGTCCGCGA TCTTTCGGTC GCGGCCGGCC AGCCGGATGA TTTCAACGAT TGGCTCTGCG CCAATGATGA GTTCCGAACG GCGGTTCCGG CGGCCATTCC CGGCTTCCGG CAGATTTTCG TGCCGAAAGG CATCTTCAGC GATTACGTGC ACCAGCGCTT CTCCGAGGCG CTTGCCCGGC GCTCCGACAT CACGGTGAGG TTCTCACACG AGCCGGTCAC AGGCCTGCGA AAGCTCGCAA GTGGTCGCTT CAGCCTTGTG CGGCACGGTG GCAACGACGA GAGCGACATT GTCATCCTCG CTACGGGCTT CGGCATGCGC CCGCGGGACC TCGAGGTTTC CGAGGAGGAG CGGCCGCTTG TGCGCACTCG GCGCCTCGTC GATCCACGCC ACGCGGTGCT GCTCGGTAGC GGGATACGTG TGGTCGACCA GTTGTTCCAG ATGCGCGACA ACGGCTATGC GGGAAAGGTC ACGCTCATTT CGCGGCACGG TTTCCTGCCG CAGGCGCACA CGCAGCGCGC GGCATCGCCG AGCTTTCCCG TCGATCCGCT GCCGCAAGGC CTGGGCCGTA TCGTGCGCTT CGTGCGTCAG GCCTGCGCCG AGGCCGAAGC GAACGGGCAG GGATGGCAAT CTGCGATGAA CGGCCTCCGC CGCCGCGCTC GCTCTCTCTG GCAATCGCTC TCCGCACAGG AGAAGCGGCA GTTCAACCGT CACCTGCGTG CAATTTACGA CAGCCACAGG AACCGCCTGC CAGCGGCCGT TCACGCGCGG CTGCAGCAGG AACTTGGCGA GGGTCGGACC GTGCTTCGCC GCGGCCGGGC GGGTCGACGC CTGCCCGAAG GTATCCTCGT GCGATGGGCC GGCCAGGATA CCGAGGAACT GCTGAGGGCT GATCAGGTGA TCGAATGTCG CTGTTCAGCT CCGGACCTCG GAACGCCGTT GCTTCGGAGC CTTATTGCGG GCGGGCTTGC CCAACCAGAC GAACTCGAGC TCGGCATTGC TGTCGCCCCG ACGGGCGAAG TCTTGAGCTC GAGCGGACAC ACGCCGAACC TCTTCGCCAT CGGTCCGTTG GGATTGGGAA GCCTTCCCGA CATCGACCTC GTACCGGAAA TCGTCACGCA GACCTATGCG GCATCACGGC TGATAGCGAC AGGAAAGCGC ATGACGCTGA AAGCTGGATA G
|
Protein sequence | MTVRGLFARR SAMAALSKPE IAIVGRGFSG IMMAIALMKS MRVSFHLTMY DPHPSISGGQ VFSAAQRCEI LNSRVRDLSV AAGQPDDFND WLCANDEFRT AVPAAIPGFR QIFVPKGIFS DYVHQRFSEA LARRSDITVR FSHEPVTGLR KLASGRFSLV RHGGNDESDI VILATGFGMR PRDLEVSEEE RPLVRTRRLV DPRHAVLLGS GIRVVDQLFQ MRDNGYAGKV TLISRHGFLP QAHTQRAASP SFPVDPLPQG LGRIVRFVRQ ACAEAEANGQ GWQSAMNGLR RRARSLWQSL SAQEKRQFNR HLRAIYDSHR NRLPAAVHAR LQQELGEGRT VLRRGRAGRR LPEGILVRWA GQDTEELLRA DQVIECRCSA PDLGTPLLRS LIAGGLAQPD ELELGIAVAP TGEVLSSSGH TPNLFAIGPL GLGSLPDIDL VPEIVTQTYA ASRLIATGKR MTLKAG
|
| |