Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4731 |
Symbol | |
ID | 5319077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1251363 |
End bp | 1252442 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640776529 |
Product | hypothetical protein |
Protein accession | YP_001313461 |
Protein GI | 150376865 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.226742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00240822 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAAGA GTATTCCCGT GCGGGACGAA GACGAGAACT CGAACCGCCT TTTTACGCCG CGCCGCTCGC GCACTTTGGC GGTCCTTGTC GCCCTTCTCG TAGGATTGGT CGTGTGCACG TTCCTCACCG TACCGTTCCT GCCAGCCCTC ACCTGGGCGC TCGTGTTGGC TGTGATGTTC CAGCCGCTGC ATCGCCGGGT CAAAAGCCGC TTCCGCTATC CGGACGCAGC AGCCGCGGCG ACCGTCGCCA TCGCTGTCTT CGTCGTCGCC GTGCCGCTTA CCTTCATGGC GGAGCGGTTG GTAAACGAGG CTGCCAAGGG CGCAAAGATC ATCGAGGAGG CGCTGCGGTC CGGCACCTGG CGCGACGCGC TCGCCAATTA TCCACGTCTC GCGCCGGCCG TCGTCTGGAT CGAGACGCAG CTCGATCTGG CCGGGATCGC CGGAAGCGCC ACCAGCTGGC TTACCAACTT CAGCGCCTCG TTCGTACGCG GTTCCGTTGC CCAGATCATC GATGCCGTCC TGACATTCTA TTTCCTCTTC TACTTCATGC GTGACGGCCG GCAAGTGCTG GCCGCGCTGA AGGAACATTC CCCACTCAGC GAGCAGGATA TGAACCGGCT GTTCACGCGG GTACACGAAA CGGTGCATGC CGTTGTCTTC GGCACAGTGG CCGTTGCAGC CGTGCAGGGC GCCATGGGCG GCCTGATGTT CTGGTTGCTC GGCTTACCGG CGCCCGTGGT ATGGGGGCTC GCCATGGGCT TGCTCGCGGT CGTACCGGTG CTCGGCGCTT TCATCGTCTG GCTACCGGCC GCACTATCGC TGGCGTTGAG CGGCGAATGG GGAAAGGCGT TGATACTCGC CGGTTGGGGC GCGGGGGTGG TCGCGACCAT CGACAATCTT CTGTACCCCA TCTTCGTGGG GGACCGCCTG AAGCTCCACA CCCTCACGGC CTTCATGAGC ATGATCGGCG GCATCATCGT ATTCGGGTCG GCAGGGTTGG TGATCGGGCC GGTCGCCTTC ACCGTCACGC TGCTGCTTCT GGACATCTGG CGCCAACACA ATACGGAGCC CAGGGCCTGA
|
Protein sequence | MAKSIPVRDE DENSNRLFTP RRSRTLAVLV ALLVGLVVCT FLTVPFLPAL TWALVLAVMF QPLHRRVKSR FRYPDAAAAA TVAIAVFVVA VPLTFMAERL VNEAAKGAKI IEEALRSGTW RDALANYPRL APAVVWIETQ LDLAGIAGSA TSWLTNFSAS FVRGSVAQII DAVLTFYFLF YFMRDGRQVL AALKEHSPLS EQDMNRLFTR VHETVHAVVF GTVAVAAVQG AMGGLMFWLL GLPAPVVWGL AMGLLAVVPV LGAFIVWLPA ALSLALSGEW GKALILAGWG AGVVATIDNL LYPIFVGDRL KLHTLTAFMS MIGGIIVFGS AGLVIGPVAF TVTLLLLDIW RQHNTEPRA
|
| |