Gene Information |
Locus tag | Smed_5238 |
Symbol | |
ID | 5319540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 199984 |
End bp | 201030 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640777015 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001313947 |
Protein GI | 150377352 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
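The coordinate and length fields above agree with each other if the coordinates are 1-based and inclusive and the gene length includes the stop codon, which is not counted in the protein length. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that check follows; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(199984, 201030)
print(length_bp, protein_length(length_bp))  # prints "1047 348"
```

The printed values match the "Gene Length" and "Protein Length" fields of this record.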
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACAA TGCCAGCGTC GAGTGTCGGG CGGGGTATTG ACGTCAAATT TGCGTGCTCG ATCGATGTAA GTACCCAGTC CTATCCTCAG CAAGATCAGT TTGAAATATT CAAGAACGCA CATGCCGGCG TCGCGGATCT AACCTTCTGC AAGAGTGCGG ACGGGTCCTT TCCAGCCCGG CAGATGGTGT GGATCCTGGG CTCAATGGTA ATTGTCTCCA GTATGCTGCC TGGAGCAGGT TACGCACACG AGTGGCGGCA TTTAAAGAAG CCGGCCTTAG ACAACTGGTA CCTGTGGATT CCACGACGGT CCGTCGATCA GGGCGTTGGC GCGCGGACCA TGCCTCATCT GCACTGTCTG GCGAAGCCAT TTCACGCCAT CGTCGAGGAC GAGGGCGCAT GTGCGATCTA TTTTCCGTCA GAGGGGTTCG TTCCTGCATC GATCCTTGAT TGCTTGCTCG ATAGATCTGT CCAGGGCGCG TCGGGGCGTC TCCTCACTGA CTATCTCATG CTCCTGGTTC GATCACTGCC CGACATGACC GTGGCAGAAG TTCCATACGT CGTCGAAGCA ACACGCAATC TCGTCGTTGC ATGTTTGGCG CCTTCGCCTG ATCGTGTTGC AGACGCTCAA AGACCGATTG CGGCGGTCGT TCTGGAGCGG GCCAAACGCA TGATAACCTC GAGACTTGTT GATCGCTCGC TCACCCCCGA GGCGATTTGC TGCGAAATAG GCATCTCGCG CTCGAGGTTG TACAGGTTGT TCGAGCCGCT TGGTGGCGTC GCGGCCTACA TCCGACACCA GCGCTTGGTT CGGACCCGCA GCGCTATTTC CAGTATTGAA GACGTCCGGC CGATATCTCG CATCGCGGAG GAATGGGGGT TCGACGATCC TTCAGCATTC AGCCGGGCCT TTAAGCACGA GTTTGGAATG ACCCCTAAGG AAGTAAGGGA GGTGGGATGG AACGGGGCTG CTGCGCACGT GCGCAGGGAG AGGTTTCGCG AAGGAGCCCC GACTACACTC CGCCAACTCC TTCAAGGCAT AGCGTGA
|
Protein sequence | MNTMPASSVG RGIDVKFACS IDVSTQSYPQ QDQFEIFKNA HAGVADLTFC KSADGSFPAR QMVWILGSMV IVSSMLPGAG YAHEWRHLKK PALDNWYLWI PRRSVDQGVG ARTMPHLHCL AKPFHAIVED EGACAIYFPS EGFVPASILD CLLDRSVQGA SGRLLTDYLM LLVRSLPDMT VAEVPYVVEA TRNLVVACLA PSPDRVADAQ RPIAAVVLER AKRMITSRLV DRSLTPEAIC CEIGISRSRL YRLFEPLGGV AAYIRHQRLV RTRSAISSIE DVRPISRIAE EWGFDDPSAF SRAFKHEFGM TPKEVREVGW NGAAAHVRRE RFREGAPTTL RQLLQGIA
|
| |
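The sequences above are displayed in space-separated blocks of ten characters, so the blocks must be joined before any analysis. The following Python sketch joins the blocks, computes a GC percentage, and translates the open reading frame. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons are permitted. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, assumed to apply to table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(display_seq: str) -> str:
    """Remove the whitespace that separates the displayed 10-character blocks."""
    return "".join(display_seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already joined sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two displayed blocks of the gene sequence in this record.
prefix = join_blocks("ATGAATACAA TGCCAGCGTC")
print(translate(prefix))  # prints "MNTMPA"
```

The translated prefix matches the first six residues of the protein sequence listed above. Applying `gc_percent` to the full joined gene sequence gives a value to compare against the "GC content" field.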