Gene Information |
Locus tag | EcSMS35_1104 |
Symbol | |
ID | 6147129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1119437 |
End bp | 1120468 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615988 |
Product | periplasmic sugar binding transcriptional regulator |
Protein accession | YP_001743180 |
Protein GI | 170680901 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
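
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is taken to be a stop codon. A minimal sketch of that arithmetic follows. The function names are hypothetical and not part of the source database, and the coordinate convention is an assumption inferred from the listed values.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(1119437, 1120468)
print(length, protein_length_aa(length))
```

With the record's coordinates this gives 1032 bp and 343 aa, matching the Gene Length and Protein Length fields.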
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.242225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACTCAAT CACATTCACA ACGCGTAACA CGTTCTGACG TGGCAAAAGA AGCAGGGACA TCCGTCGCTG TCGTTAGTTA CGTTATTAAT AATGGCCCGC GCCCCGTTGC CGAAGCGACC CGACAACGTG TACTACAGGC CATTAAGAAA ACCGGTTATC GACCCAATGG CATCGCGCGT GCGCTGGCTT CAGGGAGTAC GCAAACTTAT GGGCTAGTCG CTCCGGACAT CTCGAACCCG TTTATCGCCT CTATGGCTCA TGCCCTGCAA CACGAAGCCT TTGCTGATGG CAAAGTACTC CTGCTGGGCG ACGCAGGCGA TAGTAGCTGC CGTGAACGTG AACTCATTAA TAATATGCTA CACCGCCAGG TTGACGGGCT GATCTACACC AGTGTTGATC GCCATCCTTA TATCAATTTG ATTCAGGAGA GCGGTACCCC CTGCGTCATG CTCGATCGTG TGGATCCTGG GCTTAACGTC AGCGTTATCC AGGTTGATGA ACAACTGGCG GCGATGCAGG TAACACAACA TCTTATTGAT CACGGATACC GCGACATCGG CATCATCTGC GGCCCGCGTG AAATGCTGAA TACTCAGGAT CGTATCCGGG GCTGGCAGCA GGCGCTGGAA GCCTCATCGT TAGTGGTCAA TCCCTCATGG ATTTTTTCGA CCAACTATAC CCGCGCCGGC GGTTACGAGG CGACAAAACG CATGCTTGAG CACCAACTGC CACGCGCCCT GTTCGCGACA AATGAACAAC AGGCTCTCGG CTGTTTACGC GCTTTGGCAG AACATGGGCT GCGCGTTCCT GAAGATGTTG CGCTGGTCTG CTTTAACGCA ACACAAGAAT CGGCTTATAA CGTCCCTTCT TTAACCGCCG TTCGACAACC TGTCGATAAG ATGGCCCGTG CGGCAATTGA CATGCTGAAA AATTGGGATG GAGAAGTTCG CCGTGTTGAA TTTGAGTTTT ATTTACGAAC AGGAGAGTCG TGTGGCTGCC AAGGGCATGA AGTTCAACCC GAGAAAAAGT GA
Protein sequence | MTQSHSQRVT RSDVAKEAGT SVAVVSYVIN NGPRPVAEAT RQRVLQAIKK TGYRPNGIAR ALASGSTQTY GLVAPDISNP FIASMAHALQ HEAFADGKVL LLGDAGDSSC RERELINNML HRQVDGLIYT SVDRHPYINL IQESGTPCVM LDRVDPGLNV SVIQVDEQLA AMQVTQHLID HGYRDIGIIC GPREMLNTQD RIRGWQQALE ASSLVVNPSW IFSTNYTRAG GYEATKRMLE HQLPRALFAT NEQQALGCLR ALAEHGLRVP EDVALVCFNA TQESAYNVPS LTAVRQPVDK MARAAIDMLK NWDGEVRRVE FEFYLRTGES CGCQGHEVQP EKK
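
The relationship between the two sequences can be sketched as a codon-by-codon translation under translation table 11. The gene starts with GTG, which encodes valine internally but is read as methionine when it serves as the initiator codon, which is why the protein begins with M. The code below is an illustrative sketch, not the database's own procedure: `translate_cds` is a hypothetical helper, and it simply assumes the first codon is a valid start codon rather than checking it against the table 11 start-codon list.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon assignments as the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for index in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is an initiator and is read as methionine.
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First 18 bases of the gene sequence above, spacing as in the record.
print(translate_cds("GTGACTCAAT CACATTCA"))
```

The six codons GTG ACT CAA TCA CAT TCA translate to MTQSHS, the first six residues of the listed protein sequence.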