Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5729 |
Symbol | |
ID | 5320031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 698757 |
End bp | 700283 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640777447 |
Product | transcriptional regulator domain-containing protein |
Protein accession | YP_001314379 |
Protein GI | 150377784 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.816557 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGAAC CGCTATATAC TTTCGGCGAC TTCGTGCTGG ATCCCCGCAG CGGGCTACTG CGGCGGAAGG GAGAGTCGGT GGAAATCGGC TCCCGCGGTC TCGCGCTTCT CCAAGCATTG CTGGAAGCCA GGGGCGGGAT CGTATCCAAG GCCGAACTGA TCGAACGAGG TTGGCCGAAC ACGATTGTCG AGGACAGCAA CCTTACCGTA CAGATCGCAA GTTTGCGAAA GGCGCTTGGA CCGTCTCCCG ACGGGCGTGA GTGGATCACG ACAATACCTC GGGCTGGCTA TCGCCTAGTC ACGGCTTATG CGGGAGCTTC CAGTCTGACG TCAACCAGGC CCGCACTGGC GGTTCTGCCA TTTATCAATC TTGATGGCGG TTCTGACCAG GGTTATTTCG CGGACGGCGT TGTCAATGAG ATCATCACCG CGCTCAGCCG CTTCAGAAGC TTCGCCGTGG TCGCGCGCAA CGGATCCTTT TTCAACAATT TGCGATTTCC TGACGTACGA TTGGTCGCCA AGGAACTCGG TGTCGGCTAT ATGCTCCAAG GCAACATAAG GCGTCCGGGC AGCCGATTGA GAATCTCGGT ACAGCTCGTT GACGGCAGTG GCACACATCT CTGGGCGCAT AGCTTCGATG AGGAACTTGA TGATGTGTCC GTTTTCCAGG ACCGAATAGC CGAGAGCGTC GTGTCACTGG TTGAGCCGCA TATTCAAGCG GCCGAGATCG AACGCTCACG TCGAGACCGG CCGGGGAGCA CTGCCTCTTA CGATATCTAT CTGCAAGCGC TGGCAAAAAT CTCAACGGAG TCTGAACTTG ACAATGCGGA GGCTTACGCC CTCCTCATGA GAGGCATTGA GGCCGAGCCC GACAATGCTC TTCTGCTCGC CCATGCCGCG TGGGCGCTCG AGCATCGGCA CACGATGGGT TGGCCATCGC TTGGTGAGGA CGATGTAGGA GAGTGTGTGG CGCTGGCGCG GCGCGGGCTC GAACATGCTG CGGGAGATGC GATGGTGATG GCGCATTGCG GCGTCGCCCT GCTGCAAACC GCAAAGGACT ACGATTGGGC CTTGGCGGTC CTGCAATCTG CGGCGGAGGC TAATCCAAAC AACCTGATGG TCGTAGTCCG GGCCGGCCTT GCGCACCTGC ATTGCGGGAG CCTTGACGAG GCCCTGACCC ACTTTCAAAG GGCAAGCCGG TTGAGCCCGG GGGACCGCGG CGCTCACTTC TCGCTCTGCG GTATTGCAGA CGTGCACTTG ATCCACGGCA ACTATGCCGA GGCGATTACC TGGGCCGCTC GCGCGCTCGC GAGCAATCCG AATTTCGATC CGAACCTCTG GGTGCTGATC GCAGCGAATG CCCATCTTGG CCGAATGGAA GAGGCGGATC GGTATCTGCG TGAGCTAAGG CGACGTGTTC CGGAGGTCAC AATCGCCCGC ATCAAGGCCG GACAACCTCG CAAGGACCCG ACGAGGACCG CCGCCTTGCT CGAAGGGTTA CGCAAGGTCG GCCTGAAGGA AGGCTGA
|
Protein sequence | MVEPLYTFGD FVLDPRSGLL RRKGESVEIG SRGLALLQAL LEARGGIVSK AELIERGWPN TIVEDSNLTV QIASLRKALG PSPDGREWIT TIPRAGYRLV TAYAGASSLT STRPALAVLP FINLDGGSDQ GYFADGVVNE IITALSRFRS FAVVARNGSF FNNLRFPDVR LVAKELGVGY MLQGNIRRPG SRLRISVQLV DGSGTHLWAH SFDEELDDVS VFQDRIAESV VSLVEPHIQA AEIERSRRDR PGSTASYDIY LQALAKISTE SELDNAEAYA LLMRGIEAEP DNALLLAHAA WALEHRHTMG WPSLGEDDVG ECVALARRGL EHAAGDAMVM AHCGVALLQT AKDYDWALAV LQSAAEANPN NLMVVVRAGL AHLHCGSLDE ALTHFQRASR LSPGDRGAHF SLCGIADVHL IHGNYAEAIT WAARALASNP NFDPNLWVLI AANAHLGRME EADRYLRELR RRVPEVTIAR IKAGQPRKDP TRTAALLEGL RKVGLKEG
|
| |