Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4168 |
Symbol | |
ID | 5319197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 642350 |
End bp | 643339 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640775973 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001312906 |
Protein GI | 150376310 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.565249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.28666 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGACC ATTCGTCCAA AACCTCCTCT TCCGCTTCTG AGGATTTTCT GACCGAGCTG TTGCGCGGGC TCCGTCTCGA CGGGGTGGAT TACGTCCGCT GCGAATTGAC GGCACCCTGG GGGATCTCAT TTCCGGCGCA GGAGACGGCG CGCTTCCATT TCATCTGCGG GGATTGCTGG CTGCGCGTCG CCGACGGGGA CTGGATCGAG TTGAAGCGCG GCGATGCCGT GCTCCTGCCG CGCGGCGGCG AGCACGCGCT GGCAAGCATG CCAGGCGAGA AACTCGCTCC GCTCGACGCC TATTCGGTCC AGGAAGTATG CCATTGCGTC TACAATGTCT GCGGCGGCGG GCGCGGCGAG ACCACCATTC TTTTCTGCGG CAGCCTCAGG TTCAACATGG ATTCCATGCA TCCGCTGCTG CGCATGATGC CGGACGTGAT GCGAATCAAC GCACTGACCG CCAGCGAGCC GGCTATCCCG CACATGCTCG ACGCCATGGC GCGGGAAGTC GGCGCCAGCC GCGTCGGTTC CGGCGGTGTC CTGGCGCGGC TCGCCGACGT GCTCGCGGCC CTCATCATCC GTTCCTGGGT TGAACACGGA TGCGGCAATA CCAGCGGCTG GGTGGCGGCG GTCCGCCACC CCGGCCTCGG CCGGGTCATC GCGGCCATGC ACCTCGACCC GGAAAAGGCC TGGACCGTCG ACTCCCTCGC CAGGCTGATG GGCGCCTCGC GTTCCGGCTT CGCTCAGCAA TTCGCCAGCG TGGTCGGCGA GACGCCGGCC CGCTACCTTG CGCAAGTGCG TATGCACCAG GCACGTCAGT GGCTGACCCG CGACCGCATG CGTATCTCGG TCGTGGCACG TCGCCTCGGC TATGATTCGG AAGCCTCCTT CAGCCGCGCC TTCAAGCGCG TGATCGGCCA GCCGCCGAGT CATTATCGTG GCGCCGACCC GGCCGAGGTC TCCACATTCG CTGGCGAGAG CAGACCCTGA
|
Protein sequence | MLDHSSKTSS SASEDFLTEL LRGLRLDGVD YVRCELTAPW GISFPAQETA RFHFICGDCW LRVADGDWIE LKRGDAVLLP RGGEHALASM PGEKLAPLDA YSVQEVCHCV YNVCGGGRGE TTILFCGSLR FNMDSMHPLL RMMPDVMRIN ALTASEPAIP HMLDAMAREV GASRVGSGGV LARLADVLAA LIIRSWVEHG CGNTSGWVAA VRHPGLGRVI AAMHLDPEKA WTVDSLARLM GASRSGFAQQ FASVVGETPA RYLAQVRMHQ ARQWLTRDRM RISVVARRLG YDSEASFSRA FKRVIGQPPS HYRGADPAEV STFAGESRP
|
| |