Gene Information |
Locus tag | EcSMS35_0376 |
Symbol | lacI |
ID | 6145584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 390646 |
End bp | 391728 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615272 |
Product | lac repressor |
Protein accession | YP_001742479 |
Protein GI | 170679780 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
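The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 390646
END_BP = 391728


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp)  # 1083
print(length_aa)  # 360
```

Both results agree with the Gene Length (1083 bp) and Protein Length (360 aa) rows.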
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000396722 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACCAG TAACGCTATA CGATGTCGCA GAGTATGCCG GTGTCTCTTA TCAGACCGTT TCCCGCGTGG TGAACCAGGC CAGCCACGTT TCTGCGAAAA CGCGGGAAAA AGTGGAAGCG GCGATGGCGG AGCTGAATTA CATTCCCAAC CGCGTGGCAC AACAACTGGC AGGCAAACAG TCGTTGCTGA TTGGCGTTGC CACCTCCAGT CTGGCCCTGC ACGCGCCGTC GCAAATTGTC GCCGCGATTA AATCTCGCGC CGATCAACTG GGTGCCAGCG TGGTGGTGTC GATGGTAGAA CGAAGCGGCG TCGAAGCCTG TAAAGCGGCG GTACACAATC TCCTCGCGCA ACGCGTCAGT GGGCTGATCA TTAACTATCC GCTGGATGAC CAGGATGCCA TTGCTGTGGA AGCTGCCTGC GCTAATGTTC CGGCTTTATT TCTTGATGTC TCTGACCAGA CACCCATCAA CAGTATTATT TTCTCCCATG AAGACGGTAC GCGACTGGGC GTGGAGCATC TGGTCGCATT GGGTCACCAG CAAATCGCGC TGTTAGCGGG TCCATTAAGT TCTGTCTCGG CACGTCTGCG TCTGGCGGGC TGGCATAAAT ATCTCACTCG CAATCAAATT CAGCCGATAG CGGAACGGGA AGGCGACTGG AGTGCCATGT CCGGTTTTCA ACAAACCATG CAAATGCTGA ATGAGGACAT CGTTCCTACT GCGATGCTGG TTGCCAACGA TCAGATGGCG CTGGGCGCAA TGCGCGCCAT TACCGAGTCC GGGCTGCGCG TTGGTGCGGA TGTCTCGGTA GTGGGATACG ACGATACCGA AGACAGCTCG TGTTATATCC CGCCGTTAAC CACCATCAAA CAGGATTTTC GCCTGCTGGG GCAAACCAGC GTGGACCGCT TGCTGCAACT CTCTCAGGGC CAGGCGGTGA AGGGCAATCA GCTGTTGCCC GTCTCACTGG TGAAAAGAAA AACCACCCTT CCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCAG ATTCTTTAAT GCAGCTGGCA CGACAAGTTT CCCGACTGGA AAGCGGGCAG TGA
|
Protein sequence | MKPVTLYDVA EYAGVSYQTV SRVVNQASHV SAKTREKVEA AMAELNYIPN RVAQQLAGKQ SLLIGVATSS LALHAPSQIV AAIKSRADQL GASVVVSMVE RSGVEACKAA VHNLLAQRVS GLIINYPLDD QDAIAVEAAC ANVPALFLDV SDQTPINSII FSHEDGTRLG VEHLVALGHQ QIALLAGPLS SVSARLRLAG WHKYLTRNQI QPIAEREGDW SAMSGFQQTM QMLNEDIVPT AMLVANDQMA LGAMRAITES GLRVGADVSV VGYDDTEDSS CYIPPLTTIK QDFRLLGQTS VDRLLQLSQG QAVKGNQLLP VSLVKRKTTL PPNTQTASPR ALADSLMQLA RQVSRLESGQ
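The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with GTG and ends with TGA), and it uses the amino-acid assignments of translation table 11, which match the standard code. In table 11 an initiating GTG is read as methionine, which is why the protein starts with M. The helper `translate` and its `is_cds_start` parameter are hypothetical names introduced for this example, and only a short prefix and suffix of the sequence are used.

```python
# Codon table in the compact NCBI ordering (first, second, third base over "TCAG").
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# A subset of the table 11 initiation codons; enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Spaces (as used in the 10-base groups above) are ignored. When
    is_cds_start is True, a recognised start codon is read as 'M'.
    """
    seq = dna.replace(" ", "").upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon: end of the protein
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base groups of the gene sequence.
print(translate("GTGAAACCAG TAACGCTATA CGATGTCGCA"))  # MKPVTLYDVA
# Last four codons of the gene sequence, including the TGA stop.
print(translate("AGCGGGCAGTGA", is_cds_start=False))  # SGQ
```

The two outputs match the first ten and last three residues of the protein sequence listed above.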