Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4870 |
Symbol | |
ID | 6144844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4982418 |
End bp | 4983329 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619674 |
Product | putative DNA-binding transcriptional regulator |
Protein accession | YP_001746781 |
Protein GI | 170679932 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGGCT GTGGTGCGGT TTTGCATAAT ATTGAAACAA AATGGCTTTA TGATTTTCTG ACCCTGGAAA AATGCCGCAA CTTTTCCCAG GCGGCAGTCA GTCGCAACGT CTCGCAACCG GCATTCAGCC GCCGCATCCG TGCACTGGAA CAGGCGATTG GCGTTGAATT GTTTAACCGC CAGGTGACGC CGCTGCAACT CTCGGAACAA GGGAAAATCT TTCATTCGCA GATCCGCCAT CTGTTGCAAC AGTTAGAAAG CAACCTGGCA GAGCTGCGTG GCGGCAGCGA TTACGCGCAA CGTAAAATCA AGATAGCCGC TGCACACTCT CTTTCCCTCG GTCTGTTACC GTCCATTATC AGCCAGATGC CGCCGCTCTT TACCTGGGCG ATTGAAGCCA TTGATGTCGA TGAAGCGGTC GATAAACTGC GTGAAGGGCA AAGTGACTGT ATTTTTTCCT TTCACGACGA AGATTTGCTG GAAGCACCAT TTGACCACAT TCGCTTATTT GAATCTCAAT TGTTCCCCGT CTGCGCCAGT GACGAACACG GAGAAGCACT TTTTAACCTC GCGCAGCCAC ACTTTCCGTT ACTGAATTAC AGCCGCAATT CCTACATGGG GCGATTGATT AATCGCACCC TGACGCGCCA CAGTGAGTTA AGTTTCAGCA CCTTTTTTGT CTCTTCGATG AGCGAGCTTT TAAAGCAGGT TGCCCTCGAC GGCTGTGGGA TTGCCTGGCT ACCGGAGTAC GCCATACAAC AAGAAATTCG CAGCGGGCAG CTCGTTGTGC TTAATCGGGA CGAACTGGTG ATCCCGATTC AGGCTTACGC ATACAGGATG AACACCCGTA TGAATCCCGT TGCCGAACGC TTCTGGCGTG AACTGCGCGA GCTGGAGATT GTGCTTAGCT GA
|
Protein sequence | MDGCGAVLHN IETKWLYDFL TLEKCRNFSQ AAVSRNVSQP AFSRRIRALE QAIGVELFNR QVTPLQLSEQ GKIFHSQIRH LLQQLESNLA ELRGGSDYAQ RKIKIAAAHS LSLGLLPSII SQMPPLFTWA IEAIDVDEAV DKLREGQSDC IFSFHDEDLL EAPFDHIRLF ESQLFPVCAS DEHGEALFNL AQPHFPLLNY SRNSYMGRLI NRTLTRHSEL SFSTFFVSSM SELLKQVALD GCGIAWLPEY AIQQEIRSGQ LVVLNRDELV IPIQAYAYRM NTRMNPVAER FWRELRELEI VLS
|
| |