Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1802 |
Symbol | |
ID | 6146752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1821947 |
End bp | 1822945 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641616678 |
Product | LacI family transcription regulator |
Protein accession | YP_001743856 |
Protein GI | 170682008 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCCTA CTATTTATGA TATTGCCAGG GTTGCAGGCG TATCAAAATC CACCGTATCA CGCGTGCTGA ATAAGCAAAC CAATATCTCC CCGGAAGCGC GCGAAAAAGT GTTACGGGCC ATTGAAGAAT TACAGTATCA ACCAAACAAG CTGGCACGCG CGCTGACCTC TTCGGGTTTT GATGCCATTA TGGTGATTTC TACCCGTTCG ACCAAAACTA CGGCGGGTAA TCCGTTTTTC TCGGAAGTTT TACATGCCAT CACCGCCAAA GCTGAAGAAG AAGGTTTCGA CGTGATATTG CAGACGTCGC ACAACCTGGC AGAAGACTTA CAAAAATGCG AAAGCAAAAT TAAGCAGAAA ATGATTAAAG GCATTATTAT GCTAAGTTCG CCAGCGGATG AGTCATTTTT TGCCCAACTC GATAAATATG ATATTCCGGT AGTGGTAATT GGCAAAGTTG AAGGTCAATA CAGCCATGTT TATTCCGTCG ATACCGATAA TTATGGCGAC AGTATTGCGC TGACCAATGC GTTAATTGAA AGCGGGCATA AGAATATTGC CTGCCTGCAT GCGCCGCTTG ATGTTCATGT TTCAGTGGAT CGGGTAAATG GTTATAAGCA AAGCCTGGCT ACGCATAATA TTGCAGTGCG TGATGAATGG ATTGTTGACG GCGGTTATAC CCATGAAACA GCCTTGCAAG CCGCACGGCA ATTATTAAGC CAGTCGCCGT TGCCAGAAGC CGTATTTGCC ACTGACAGCC TGAAATTAAT GAGCATTTAT CGTGCGGCAG CAGAGAAAAA TATTGCTATT CCGCAGCAGT TAGCGGTGGT GGGTTATAGC AATGAAACGC TGTCATTTAT TTTAACGCCT GCACCGGGCG GCATCGATGT TCCGACGCAG GAGTTAGGGC AACAAAGCTG CGAGTTATTA TTCCGCTTAA TTGCCGGAAA ACCGTCACCA CAAAATATTA CCGTTGCCAC GCATATGTCG TTGAAATAA
|
Protein sequence | MSPTIYDIAR VAGVSKSTVS RVLNKQTNIS PEAREKVLRA IEELQYQPNK LARALTSSGF DAIMVISTRS TKTTAGNPFF SEVLHAITAK AEEEGFDVIL QTSHNLAEDL QKCESKIKQK MIKGIIMLSS PADESFFAQL DKYDIPVVVI GKVEGQYSHV YSVDTDNYGD SIALTNALIE SGHKNIACLH APLDVHVSVD RVNGYKQSLA THNIAVRDEW IVDGGYTHET ALQAARQLLS QSPLPEAVFA TDSLKLMSIY RAAAEKNIAI PQQLAVVGYS NETLSFILTP APGGIDVPTQ ELGQQSCELL FRLIAGKPSP QNITVATHMS LK
|
| |