Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5539 |
Symbol | |
ID | 5319841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 503715 |
End bp | 504923 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640777290 |
Product | hypothetical protein |
Protein accession | YP_001314222 |
Protein GI | 150377627 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.188295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0555001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGAC GAGTGGCCGA TCAACTCACA ATCGTTCTCG CAGGATCCGC TTCTCTTGCT ATCGCAATGG GTGTTGGTCG TTTTGCGTTC ACCCCAATCT TGCCGATGAT GCTGCACGAC GGAACGGTCG ATCTCACTGT TGCCGGAAGG CTCGCAACCG CAAACTACGT CGGGTACCTC GTGGGAGCGT TGGCGGCGAT GCTAATACCG AAGGAGTGGC CGCAGACGAC GATTATCAAA TCGGCTCTTC TCCTGACCGT CGTGCTGACT GCCTTGATGG CGCTTCCGTT CCCAACGACA TGGACAGGAC TACGATTTCT CGCAGGCGTA GCATCGGCAA TCGGCTTTGT ATTCACGTCA GGTTGGTGCC TGGCGGAGCT TACCGGTTCA ACCGCCTCGA TCGGCAGTGC GATCTTTACC GGGCCCGGAG CGGGGATCGC CGTTTCTGGT CTAGCAGCCA GCGTGATGAC AGCATTGAGC TTCACTGGGC AGGGGGCCTG GACGACATTT GCCGTGATGT CGGCGGCGAT CAGCGCACTT ATCTGGCGTG TCTTTCGAAA GCCCGGCAGA GACGTGGAGC CATTTTATGT TGGTAAAGCG AGGACGTCGG GTCGGGTCCC GCGAACCGAA ATGCCCATGT TCGCCGTTGC GTACGGGCTC GCTGGCTTCG GTTACATCGT GACTGCTACA TACCTGCCTG TGATCGCGAA GAACGCAATA CCGGATTCCC CTTTGCTGAC CGTCTTCTGG CCGCTGTTTG GCATTTCAGC CGTAGTGGGT TCACTTCTGG CATCGCGTGT GCGGAAGAGC GCCGACGCTC GCCTATACCT CATTGGTTCG TATCTCGTGC AAGCGGCAGG CGTAGCAATG TCTGTCGTGT GGGAGGACGC AGTCGGTCTT GCCTTAAGCA GCATTCTAGT CGGTGTCCCG TTCACCGCGA TCAGCTATTT TGCCATGAAT GAGGTCCGAC GGATCCGATC AAGTCATCAC GCCCGCTACA TGGGGCTGCT GACGGCGGTG TTCGCCGTCG GCCAGATCAT GGGGCCGCCG GTCGTAGGCG CAATATTCGC CCGCCAGGTG AACTCCGACA GTGCGTTTGC TCTCGCCCTA GGCATTGCAA GCATAACGTT GGTCGTAGGG GCGGTGATCC TGGTGGCGAT GATCGTGATG TTCCCGGCCA GGGTTCATCA CAGGCCTCAG GAGCGCTAG
|
Protein sequence | MSRRVADQLT IVLAGSASLA IAMGVGRFAF TPILPMMLHD GTVDLTVAGR LATANYVGYL VGALAAMLIP KEWPQTTIIK SALLLTVVLT ALMALPFPTT WTGLRFLAGV ASAIGFVFTS GWCLAELTGS TASIGSAIFT GPGAGIAVSG LAASVMTALS FTGQGAWTTF AVMSAAISAL IWRVFRKPGR DVEPFYVGKA RTSGRVPRTE MPMFAVAYGL AGFGYIVTAT YLPVIAKNAI PDSPLLTVFW PLFGISAVVG SLLASRVRKS ADARLYLIGS YLVQAAGVAM SVVWEDAVGL ALSSILVGVP FTAISYFAMN EVRRIRSSHH ARYMGLLTAV FAVGQIMGPP VVGAIFARQV NSDSAFALAL GIASITLVVG AVILVAMIVM FPARVHHRPQ ER
|
| |