Gene Information |
Locus tag | Smed_3674 |
Symbol | |
ID | 5318781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 113778 |
End bp | 114926 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640775487 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001312420 |
Protein GI | 150375824 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
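The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; neither convention is stated in the record, but both reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Smed_3674.
length_bp = gene_length(113778, 114926)
print(length_bp)                  # 1149
print(protein_length(length_bp))  # 382
```

Under these assumptions, 114926 - 113778 + 1 gives the 1149 bp gene length, and 1149 / 3 = 383 codons, one of which is the stop codon, gives the 382 aa protein length.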
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.833381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00584343 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCAAGG ACTCCGGATC AGGAGATGAC CCCGTCGAAG TTCCGGCTTG GCCCGCCCAT ATGGAGCGGC GCCGCTGGCC ACGCGACCCA TCAGCTCGGA AGGATCGTTC GGCCGAACGA TCACCGGCGC TCGTGCCTTC ACGGTTCTCG ACGCGTGAGC TGCCGTTGGA GGAGCAATTC AAGTCCTGGC GGGCGCACAT GGCGCCTCTC GTGGACGTGC ATTTGCCGGA CGGGGTAACC GAAGAACACG GCTTCCCGGC TGAGCTGGTT GGTTGGCACC TCGGTGATTT GCTGGTCGTT CAACAGAGTA CGCCCGCACA TAGCTATGAG CGTTCTCAGG CGATGTTGCG TTCAAGCCCG ATTGACCATT GGAATGTCGG GCTCTTTCGC GCCGGACGAT CGTGGACCGA AGCCGACCGG CGCGTGACGG AAACAGGCCC CGGAGAGTTC TTCTTCCGAT CGCTCGGCTA TCCCTATCGC GGACGCATGA CGGACGCTGC GGCTATCCTT CTGTTCATGC CCTATGAACT GCTTGCAGAT GATGTGGGTA AGCTCGAAGG TGCCAACAAT TCGGTCTTGA CGGGTAGTCT CGCCGATCTG CTCGCGAACT ACCTCAACGG CATGGAAGAA AACCTCGGCA ACATCACCGT GGAAGAAGTG CCGCGCATCG TCCGTACGAT ACGCGACATG GTCGTTACAT GCGTCGCGGC AGTGAGACCG GATACCGGAG GCTCCCAGGC GAAGATGGGA GTAATGGAGC GGGCGCACCG ATATATCCAT CTCAACCTGC ACTCGGCCGA CCTCACACCC GAATCGATTT GTCGGGAGCT CGGCGTCTCC AGGACACGCC TTTATCAACT TTTCGAGCCG AGCGGGGGCG TACTCAACTA TATTCGAAGA CGCCGGCTGC TGCAGGCCCA CGCTGAACTC AGCGATCCAA CGAACTACCG CCCGATCGCG GAGATTGCGG AAACAGCCGG CTTCGACCTG GCCGCCAACT TTACTCGTGC CTTCAGCCAT GAATTCGGCG TAAGCCCGCG CGAAGTCCGC AAGGCGGCAG CCGCCGATCG TCTTGTCACC CCCGTTGCCG TGCCGGAGCG CGACCGCGGG TTAACGATCG GCGACTGGCT CAGATCGATA CAAGGATGA
|
Protein sequence | MTKDSGSGDD PVEVPAWPAH MERRRWPRDP SARKDRSAER SPALVPSRFS TRELPLEEQF KSWRAHMAPL VDVHLPDGVT EEHGFPAELV GWHLGDLLVV QQSTPAHSYE RSQAMLRSSP IDHWNVGLFR AGRSWTEADR RVTETGPGEF FFRSLGYPYR GRMTDAAAIL LFMPYELLAD DVGKLEGANN SVLTGSLADL LANYLNGMEE NLGNITVEEV PRIVRTIRDM VVTCVAAVRP DTGGSQAKMG VMERAHRYIH LNLHSADLTP ESICRELGVS RTRLYQLFEP SGGVLNYIRR RRLLQAHAEL SDPTNYRPIA EIAETAGFDL AANFTRAFSH EFGVSPREVR KAAAADRLVT PVAVPERDRG LTIGDWLRSI QG
|
| |
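The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content as a fraction, and translates a CDS with the amino acid assignments of translation table 11 (which match the standard code; table 11 differs in its permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove display whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence as displayed in the record.
prefix = clean_sequence("ATGACCAAGG ACTCCGGATC A")
print(translate(prefix))  # MTKDSGS
```

Passing the full cleaned gene sequence to `translate` and `gc_content` in the same way gives a direct check against the protein sequence and the GC content listed in the record.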