Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2534 |
Symbol | |
ID | 6145210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2591937 |
End bp | 2592794 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617406 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001744577 |
Protein GI | 170680686 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAC CAGGCTTGCC CGCCGATCAA CAATTTTTCG CCGATCTGTT CAGCGGCCTG GTGCTTAACC CGCAACTACT GGGGCGCGTC TGGTTTGCCA GCCAGCCTGC CTCATTGCCA GTGGGCAGTT TATGTATTGA TTTTCCCCGT CTGGATATCG TGCTACGCGG CGAATACGGC AATCTGCTGG AAGCAAAGCA GCAACGTATG GTGGAAGGAG AAATGCTGTT TATTCCGGCA CGCGCGGCTA ATTTACCGGT CAACAACAAA CCGGTGATGC TGTTAAGCCT GGTGTTCGCA CCGACCTGGC TTGGGTTATC GTTTTACGAT AGCCGCACCA CGTCGTTGTT GCATCCTGCC CGCCAGATCC AGCTTCCCAG CCTGCAACGC GGCGAAGGTG AAGCGATGCT TACCGCCCTC ACCCATCTTA GCCGTTCGCC GCTGGAGCAG AATATCATTC AGCCACTGGT GTTAAGTTTG CTGCATCTTT GCCGCAACGT GGTGAATATG CCACCGGGCA ATTCGCAGCC GCGCGGCGAT TTTCTCTATC ACAGCATTTG TAACTGGGTT CAGGATAATT ATGCCCAGCA GCTCACCCGC GAGAGCGTGG CTCAGTTTTT TAATATCACG CCCAATCATC TGTCAAAACT GTTTGCTCAG CATGGAACAA TGCGTTTTAT CGAGTATGTT CGCTGGGTAC GAATGGCAAA GGCAAGGATG ATTTTGCAGA AATATCATCT GTCGATTCAT GAAGTGGCAC AGCGTTGCGG TTTTCCGGAC AGCGACTATT TTTGTCGCAT TTTCCGGCGT CAGTTTGGCC TGACGCCGGG AGAGTACAGC GCCCGTTTTC AGGGCTAA
|
Protein sequence | MKAPGLPADQ QFFADLFSGL VLNPQLLGRV WFASQPASLP VGSLCIDFPR LDIVLRGEYG NLLEAKQQRM VEGEMLFIPA RAANLPVNNK PVMLLSLVFA PTWLGLSFYD SRTTSLLHPA RQIQLPSLQR GEGEAMLTAL THLSRSPLEQ NIIQPLVLSL LHLCRNVVNM PPGNSQPRGD FLYHSICNWV QDNYAQQLTR ESVAQFFNIT PNHLSKLFAQ HGTMRFIEYV RWVRMAKARM ILQKYHLSIH EVAQRCGFPD SDYFCRIFRR QFGLTPGEYS ARFQG
|
| |