Gene Information |
Locus tag | Smed_5109 |
Symbol | |
ID | 5319411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 56192 |
End bp | 57184 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776887 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001313819 |
Protein GI | 150377224 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
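The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(56192, 57184)
print(length)                     # 993
print(protein_length_aa(length))  # 330
```

Under these assumptions, the computed values agree with the listed Gene Length (993 bp) and Protein Length (330 aa).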
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0905505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0237076 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGATA CGCAGGAGAT CCGCTTTGCG GTTCTGATGT TTTCGAACTT TCCGCTGATG GCTTTCAGTT CGGTCATCGA ACCGCTGCGC GCGGCCAACA CGCTCTCCGG CAGGCGCTGC TATTCCTGGG TCACGGTCGC CGCCGGCGAG AAGGTCACGG CCTCGAACGG CATCCGCATC GAGCCCGATT TTGCTGTGCG CAACGCACCC GAGGTCGACC GCATCGTCGT GTGTTCGGGC GGCGACGCCG AGCATCTCGT GGCGGACGAC GAAATGGCTT GGATCCGCAA GAACCTTCGC TCCGGTGCGC AACTCGGCGC CGTGGCCGAT GCGGCCTTCT TCCTTGCCCG CAAGGGCCTT CTCGACGGAC ATGCATGCAC GCTTCACTGG ACCAGTCAGC CGGCGTTCAA GGAAGCGTTT CCGCATCTCG ACATGCGCTC GGACCTCTAT GTGGTCGATC GCCGCCGCTT CACCTCGGTC GGCGGCGTCG GCAGCCTCGA CATGATGCTG GATATGATCG GACGGGATTA TGGCGCCGAG CTCGCCGACG CTGTCGCGGG TTGGTACATG CACAGTCCTT TGCGGCCGAA TGCCGACAGG CAAAAGCTGA CCTTACGTAT CCGAAGCGGT ATCGTCGACG ACCTCGTCCT TTCCGCAGTG GCGATGATGG AGGATGCCAT CGAGGACGTG CTGCGGATCG AGGATCTGGC CGCCCGGCTG AACGTCTCTT CCGACAAACT CGAGCGCGCC TTCAAAGCCG CGCTGGGTGT GTCGCCCAAC AGCTATTATC GCAGTCTGAG GCTTGGTCAT GCCGCAGACA TGTTGACGCA TTCCAATCTC AAGGTGAACG AAGTCGCCGT CGCCTGTGGG TTCGCGAACG CCGCGAACTT CTCCCGTGCC TTCAAGGAGC AGTTCGGATA CGTGCCGCAT AGCATCCGCC GCCGTATCTC GCGGGCCGGC GAGGCGCCGC GCGCGGTTGC GGGGCTGAAA TAA
|
Protein sequence | MRDTQEIRFA VLMFSNFPLM AFSSVIEPLR AANTLSGRRC YSWVTVAAGE KVTASNGIRI EPDFAVRNAP EVDRIVVCSG GDAEHLVADD EMAWIRKNLR SGAQLGAVAD AAFFLARKGL LDGHACTLHW TSQPAFKEAF PHLDMRSDLY VVDRRRFTSV GGVGSLDMML DMIGRDYGAE LADAVAGWYM HSPLRPNADR QKLTLRIRSG IVDDLVLSAV AMMEDAIEDV LRIEDLAARL NVSSDKLERA FKAALGVSPN SYYRSLRLGH AADMLTHSNL KVNEVAVACG FANAANFSRA FKEQFGYVPH SIRRRISRAG EAPRAVAGLK
|
| |
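The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal Python sketch of how such a block-formatted gene sequence could be joined, checked for GC content, and translated for comparison with the listed protein sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA) and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names are illustrative and not part of the source database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = join_blocks("ATGAGGGATA CGCAGGAGAT CCGCTTTGCG")
print(translate(prefix))  # MRDTQEIRFA
```

Applied to the full gene sequence, `gc_fraction` can be compared against the listed GC content, and the output of `translate` against the listed protein sequence after passing the latter through `join_blocks`.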