Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4624 |
Symbol | |
ID | 5318935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1126767 |
End bp | 1128005 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776423 |
Product | hypothetical protein |
Protein accession | YP_001313355 |
Protein GI | 150376759 |
COG category | [S] Function unknown |
COG ID | [COG3748] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.449143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGAGT ATGCCATCGC CTGGGATTGG CTGACATTTG CGGTGAGATG GCTGCATGTC ATCACCGGCA TCGCCTGGAT AGGCTCATCC TTTTACTTCG TTGCGCTCGA CCTGGGCCTG AAGCAGCGTC CGGGCCTGCC GGTCGGCGCC TATGGCGAAG AGTGGCAGGT GCATGGCGGC GGTTTCTACC ACATCCAGAA ATATCTGGTG GCGCCGGCAA ACATGCCGGA GCACCTGATC TGGTTCAAAT GGGAATCCTA CGTCACCTGG CTCTCCGGTT TCGGCATGCT TGCGCTCGTC TATTATGCCG GCGCGGACCT CTACCTCATC GATCCGAACG TGCTCGACGT TTCGAAGCCG ATGGCGATCG CCATCTCGCT CGCCTCGCTC GGCTTCGGCT GGCTCGCCTA CGACATGATC TGCCGATCCC CATTCGGCAA TGACAATACG CGGCTGATGG TGCTGCTCTA TTTCATTCTC GTCGCCGTCG CCTGGGGTTA CACCCAGCTG TTCACGGGGC GTGCCGCCTA TCTGCATCTA GGCGCCTTCA CGGCGACCAT CATGTCGGCG AACGTGTTCT TCATCATCAT CCCGAACCAG AAGAAGGTCG TTGCCGACCT GATCGCAGGC CGCACGCCCG ATCCTGCTCT CGGAAAGCAG GCGAAGCAGC GTTCGACGCA CAACAACTAT CTGACGCTGC CCGTGCTGTT CCTGATGCTG TCGAACCATT ACCCGCTCGC CTTCGGCACG CAGTATAACT GGATCATCGC CTCGCTGGTT TTCCTCATGG GGGTCACGAT TCGCCACTGG TTTAACACCC GCCACGCCAA CAAAGGCAGC CCGACCTGGA CCTGGCTCGC GACCGTGCTC CTGTTCATCG CCATCATGTG GCTTTCCACC GTGCCCAAGG TCCTCTCCGA GGGCGGAGAG GCAAGGGCAG CGACGGCGGC CGAGGCGGTG GTCGCGTCTC CGGATTTCTC CAAAGTGCGC GACACCGTGC TTGGTCGCTG TTCGATGTGC CATGCGCGGG AGCCCGGCTG GGAGGGTATC ATCGTGCCGC CAAAGGGCGT GATCCTCGAA TCCGACGGCG ACATTGTTGC GCATGCCCGT GAAATCTACC TGCAGGCGGG GCGTTCGCAT GCCATGCCGC CTGCCAATGT CACCGGCATC ACCGAGGAGG AACGCCAGCT CATCGCCTCC TGGTACGAGA GGACGATAAA GGAAGGAAAA GTCCAATGA
|
Protein sequence | MYEYAIAWDW LTFAVRWLHV ITGIAWIGSS FYFVALDLGL KQRPGLPVGA YGEEWQVHGG GFYHIQKYLV APANMPEHLI WFKWESYVTW LSGFGMLALV YYAGADLYLI DPNVLDVSKP MAIAISLASL GFGWLAYDMI CRSPFGNDNT RLMVLLYFIL VAVAWGYTQL FTGRAAYLHL GAFTATIMSA NVFFIIIPNQ KKVVADLIAG RTPDPALGKQ AKQRSTHNNY LTLPVLFLML SNHYPLAFGT QYNWIIASLV FLMGVTIRHW FNTRHANKGS PTWTWLATVL LFIAIMWLST VPKVLSEGGE ARAATAAEAV VASPDFSKVR DTVLGRCSMC HAREPGWEGI IVPPKGVILE SDGDIVAHAR EIYLQAGRSH AMPPANVTGI TEEERQLIAS WYERTIKEGK VQ
|
| |