Gene Information |
Locus tag | Smed_5098 |
Symbol | |
ID | 5319400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 46329 |
End bp | 47372 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640776876 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001313808 |
Protein GI | 150377213 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
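The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive and that the reported protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 46329, 47372

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1044
print(protein_length_aa(length))  # 347
```

Both results agree with the Gene Length and Protein Length rows, which is consistent with the inclusive-coordinate assumption.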
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.109281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.521536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGCC GGATGATCGC TCCATGTTTC ATGGACGACA CGCTCGAATG CCTTAGGCGC CATGGAGTCG ACGCCGGGCC CCTTCTTGCG CAAGTCGGCC TCCCGCCAGT CATCACCGGC CCTGTGCCGG CCGAGCAATA CGGCGCATTG TGGCATGCGG TGGCTCAGGT CATGGATGAC GAGTTTTTCG GCGAAGGCGC ACGGCCGATG CGCGCCGGCA GCTTCGCATT GCTGTGCCAT GCCATTCTTT CGACGGTGAC CCTCGAACAC GCGCTGCGGC GAGCCCTGCG CTTTCTAAGG ATCGTCCTCG ACGATCCCCA TGGCGAACTG GTCGTCGAGG ACGGACTGGC GCAGATCGTC CTCGAGGATG CCGGCGCGAC ACGCTCCGCC TTTGCCTACC GCACCTTCTG GATCATCCTG CATGGCGTCA ATTGCTGGCT GATCGGACGC CGGTTACCGA TCCGCTGGGT CGACTTCCGC TGCAGCGCGC CGCCGGCCGG AACCGATTAC CGGCTCTTCT TCGGTGCGCC CGTCCGCTTC GACCAGCCCC GGACGCGGCT CGTCTTCGAT GCCGAGTACC TGAAACTGCC GCCCATTCGA GACGAGCGGG CGCTGAAACA CTTCCTCCGG CATGCGCCGG CGAACATTCT CGTGCGCTAT CGCCACGATG CGGGCTTGTC CACGGCAATA CGCAGGCGGC TCCAGGCCTT GGATCCGTCC GCCTGGCCCG GTTTCGAGAC GCTCGCGGCG CGGATGCGGA TTGCAGCGCC GACCCTTCGC CGGCGATTGA AGCAGGAAGG TCAGACCTAC CGATCAATCA AGGAAGACCT GCGCCGGACA CTTGCCATGG AAGCGCTTGC CGAGCGCGGA CAGAACGTGG CGCAGCTCGC CGTCGAACTC GGCTTTTCCG AACCAAGCGC CTTTCATCGC GCATTCCGCA AATGGACGGG GAAATCCCCG GCGCAATTCC GGCGCAGTGC CAGCGAAACA GGTCTGGCGG AAAGCGGCTC GTTGAAGAAA CGCACTCAAC TCCAGGAGAG CTGA
|
Protein sequence | MERRMIAPCF MDDTLECLRR HGVDAGPLLA QVGLPPVITG PVPAEQYGAL WHAVAQVMDD EFFGEGARPM RAGSFALLCH AILSTVTLEH ALRRALRFLR IVLDDPHGEL VVEDGLAQIV LEDAGATRSA FAYRTFWIIL HGVNCWLIGR RLPIRWVDFR CSAPPAGTDY RLFFGAPVRF DQPRTRLVFD AEYLKLPPIR DERALKHFLR HAPANILVRY RHDAGLSTAI RRRLQALDPS AWPGFETLAA RMRIAAPTLR RRLKQEGQTY RSIKEDLRRT LAMEALAERG QNVAQLAVEL GFSEPSAFHR AFRKWTGKSP AQFRRSASET GLAESGSLKK RTQLQES
|
| |
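The sequences above are printed in space-separated blocks of 10 characters. As an illustration only, the sketch below strips that formatting, computes a GC fraction, and translates a nucleotide string with the codon assignments of translation table 11, which match the standard genetic code. All function names are hypothetical. The sketch does not handle alternative start codons or ambiguous bases, and it is shown only on the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record above.
prefix = clean_sequence("ATGGAACGCC GGATGATCGC TCCATGTTTC")
print(len(prefix))        # 30
print(translate(prefix))  # MERRMIAPCF
```

The translated prefix matches the first block of the protein sequence in the record. Applying the same functions to the full gene sequence would be the natural way to cross-check the GC content and protein length rows.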