Gene Information |
Locus tag | Smed_5827 |
Symbol | |
ID | 5320129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 793149 |
End bp | 794099 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640777523 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001314455 |
Protein GI | 150377860 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
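The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive replicon coordinates and that the CDS ends in a single untranslated stop codon; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 793149
end_bp = 794099

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (951 bp) and Protein Length (316 aa) fields.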
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.112164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.283627 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTGC CTTCCGCCAG CGAACCTGCC GAAAGCTCTC CGGCTGAGAC AATCGACGCA CATCGGCACG AGATGCTCGA TGTTAACCGG AATCCCGCGA ACGAAATGTC ACGCGTTCTC GGGACTGAGC CGGTCCATCT AGCGGCAGAC CCGTCCGGCG GCGCAATCGC CCATTGGCAT CACGACGCCT TGCATGACGT CGTCGAGCCT ATGACCGATC ACGTCATCAT GGCTTATAAC GGCGCGATAC AACGCATGGA AAGGCGGTCC GGAAGATCGA TCGAGAGCGG AACGTTTCGT CGCGGGGTGG TGATCATCAT TCCAGCTGGG TCAAGCTCCC GCTGGGATAT TCCAAAGCCG GTGGATGTCG TGCAGCTCTA TCTTCCCCAC GCGACACTGA CGCGCATTGC CGACGAAACC GATACATTCA CTCCGACCGA CCTCCTGGAG CGAACCGCGC ATCCGGACCA CATTACATCC CGATTGCTCC TGAGTGCGGC CGATGTCTTA GAAGGCAATA CGGCACTGGA TACACTGTTC AGGCAGCAGT TGACCGACCT TCTGGCCACG CGCCTGCTGG CTGCGCACAC TGGCAGGGCG CCGAGCTATC GGCCGGCGGT CGGCGGCCTC GCGCCGAGCG TTCTGCGCAG GGCCGTCGAA CGATTACGGT CGGACGCGGA CGCCGACGTC TCCCTTGCGG CCCTTGCTGC CGATTCCGGG CTGTCGCGGT TTCATTTCTG TCGGGCCTTC AAGGAAAGCA CGGGGCTTTC ACCTCACAAC TGGCTGCGTC AATATCGACT CGAGCAGGCC ATAAACATGC TACGTGATCC TCAACAATCG ATCGCGTTGG TCGCAGCTTC CCTCGGGTAT GCCTCGCAAA CTGCATTTGC AGCTGCGTTC CGCAAATTGA CCGGTGAAAC ACCGACAGAT TGGCGCCGCC GCCACGGCTG A
|
Protein sequence | MSLPSASEPA ESSPAETIDA HRHEMLDVNR NPANEMSRVL GTEPVHLAAD PSGGAIAHWH HDALHDVVEP MTDHVIMAYN GAIQRMERRS GRSIESGTFR RGVVIIIPAG SSSRWDIPKP VDVVQLYLPH ATLTRIADET DTFTPTDLLE RTAHPDHITS RLLLSAADVL EGNTALDTLF RQQLTDLLAT RLLAAHTGRA PSYRPAVGGL APSVLRRAVE RLRSDADADV SLAALAADSG LSRFHFCRAF KESTGLSPHN WLRQYRLEQA INMLRDPQQS IALVAASLGY ASQTAFAAAF RKLTGETPTD WRRRHG
|
| |
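The sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any computation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be reproduced from the Gene sequence field. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied despite the minus strand), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code; alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the Gene sequence field, pasted with their spacing.
first_blocks = "ATGAGTTTGC CTTCCGCCAG"
print(translate(first_blocks))
```

Pasting the full Gene sequence value into `translate` and `gc_fraction` allows the result to be compared against the Protein sequence and GC content fields.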