Gene Information |
Locus tag | EcSMS35_3825 |
Symbol | |
ID | 6147080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3892979 |
End bp | 3893950 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618651 |
Product | LysR family transcriptional regulator |
Protein accession | YP_001745791 |
Protein GI | 170682485 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
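The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon (neither is stated in the record). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcSMS35_3825.
length = gene_length(3892979, 3893950)
print(length)                  # 972
print(protein_length(length))  # 323
```

Both results match the Gene Length (972 bp) and Protein Length (323 aa) fields.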
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.18343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTAGAAA AAACAATAAA CAATGCGATA TGCGCGCTAC TGTTTCGCTG TGAACAACAA TCGGTCAAAG AAATGGATAA AATTCACGCA ATGCAGTTGT TCATCAAAGT CGCTGAGCTG GAAAGTTTTT CCCGCGCAGC GGATTTCTTT GCTTTGCCAA AGGGAAGTGT TTCGCGCCAG ATACAGGCAC TGGAACATCA ACTTGGCACC CAGCTTCTCC AGCGCACCAC GCGGCGGGTC AAACTCACGC CAGAAGGCAT GACCTATTAT CAACGGGCAA AAGATGTGTT GAGTAATCTC AACGAACTGG ACGGTCTGTT TCAACAGGAT GCCACCAGTA TCAGCGGTAA ATTACGCATC GACATCCCGC CAGGAATCGC GAAAAGCCTG TTACTGCCGC GCCTGTCGGA ATTTCTCTAT CTGCATCCGG GAATTGAGCT GGAGCTGAGT AGCCATGACC GTCCGGTAGA TATTCTTCAC GATGGTTTTG ATTGCGTGAT ACGCACTGGC GCATTACCGG AAGATGGCGT TATCGCCCGT CCCCTCGGCA AACTGACCGT GGTCAATTGT GCCAGTCCAC ACTATCTGAC GCGCTTTGGT TATCCTCAAA GCCCCGACGA TCTGACTTCG CACGCTATAG TGCGTTACAC ACCGCACCTG GGTGTACATC CGTTAGGTTT TGAGGTTGCC AGCGTTAATG GCGTCCAGTG GTTTAAGTCT GGCGGCATGT TGACGGTAAA CAGTCGCGAA AACTATCTCG CCGCCGGTCT TGCCGGTCTG GGGATTATTC AGATCCCGCG CATTGCCGTG CGCGAAGCCC TGCGTGCCGG GCGGCTTATT GAAGTGTTGC CTGGCTACCG TGCCGAGCCG CTGTCCCTTT CACTGGTTTA TCCGCAGCGT CGGGAGCTTT CCCGGCGTGT AAACCTGTTT ATGCAGTGGC TGGCTGGCGT AATGAAAGAG TACCTGGACT GA
Protein sequence | MLEKTINNAI CALLFRCEQQ SVKEMDKIHA MQLFIKVAEL ESFSRAADFF ALPKGSVSRQ IQALEHQLGT QLLQRTTRRV KLTPEGMTYY QRAKDVLSNL NELDGLFQQD ATSISGKLRI DIPPGIAKSL LLPRLSEFLY LHPGIELELS SHDRPVDILH DGFDCVIRTG ALPEDGVIAR PLGKLTVVNC ASPHYLTRFG YPQSPDDLTS HAIVRYTPHL GVHPLGFEVA SVNGVQWFKS GGMLTVNSRE NYLAAGLAGL GIIQIPRIAV REALRAGRLI EVLPGYRAEP LSLSLVYPQR RELSRRVNLF MQWLAGVMKE YLD
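The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below, using only the Python standard library, strips the spacing, computes GC content, and translates a CDS. It assumes the amino acid assignments of translation table 11 (which match the standard code; the two tables differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGTTAGAAA AAACAATAAA CAATGCGATA"))  # MLEKTINNAI
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` allows the GC content and protein sequence fields to be checked in the same way.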