Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3997 |
Symbol | |
ID | 6142601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4075489 |
End bp | 4077042 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641618822 |
Product | putative PTS regulatory protein |
Protein accession | YP_001745961 |
Protein GI | 170680552 |
COG category | [K] Transcription |
COG ID | [COG3711] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATGA TTACCTCGCG ACAAAACAGA TTATTACGAT TCCTTCTACC ACGAAGGGAA TATACGACTA TTGTTACAAT TGCCGGCTAT TTAAATGTTT CGGAAAAAAC CATTCAACGT GATTTACGTT TACTTGAGCA ATGGCTGGGG CAATGGAGAA TAAATGTTGA GAAGCGTGCT GGCGCGGGTG TGATGTTAAG CGCGGAGAAT ATTGCTGATT TGCTGCATCT TGATCATTTG TTGGGAGCAG AATGTGAAGA GATTGATGGT GTAATGAATA ATGCCAGGCG CGTTAAAATA GCGTCGCAGT TATTAAGTGA AACACCGAAT GAAACGTCGA TCAGTAAATT GTCAGAACGC TACTTTATCA GCGGAGCCTC TATTGTTAAT GACCTGAGAG TAATTGAGTC CTGGCTTGCG CCGTTGGGGT TATCATTGAT CCGCAGCCCA AGTGGTACGC ATATTGAAGG TAGTGAAGGG CAAGTCCGAC AGGCAATGGC ATTACTGATT AACGGCATTA TTAACCATAA TGAGCCGCAA GGTGTCGTGT ATTCACGTCT GGATCCCGGA AGCTATAAAG CATTAGTCCA TTATTTTGGA GAGGAAGAAG TGTTATTTGT CCAGTCATTG TTACTGGATA TGGAAAATGA ATTAAGTTGG TCTTTGGGAG AACCTTATTA CGTTAACATT TTTACTCACA TCCTTATTAT GATGTATCGC AACACGCACG GGAATGCGTT ATCAAGAGAA GAAGATCAAA CCAGGCAATA TGATGAAAAT ATCTTTAATG TTGCCAGTCA GATGATTCAT AAGATAGAAC AACGAATTGC ACATACATTG CCCGATGATG AAGTCTGGTT TATTTATCAA TATATCATTT CATCAGGTGT GGCGATTGAT GGACAAAAAG ATGTGAGCAT TATTTCACAT ATGCAGGCCA GCAATGAAGC GCGTCTGATT ACCTGGCGTT TAATTACGGT ATTCAGTGAC ATCGTGGACT GCGATTTTAG TGAAGACAGC GCATTATATG ATGGCTTAAT GGTGCATATT AAACCGCTGA TTAACCGACT AAATTATCGT ATTCATATCC GTAATCCATT GTTGGAAGAT ATTAAAGCAG AACTAGCGGA TGTCTGGCGG TTGACGCAAT ATGTGGTGAA TCAGGTATTT AAAACCTGGG GTGAGAATGC AGTGAGCGAG GATGAAGTGG GTTACCTGAC CGTTCATTTT CAGGCTGCGA TGGAGCGGCA AATTGCCCGT AAACGTGTAT TACTGGTCTG TTCAACCGGA ATCGGAACTT CGCATCTACT GAAAAGCCGT ATTCTGCGAG CATTTCCTGA ATGGACGATT GTTGATGTTA TTTCAGCAGC GAATTTATCA CAGGTTTTGC CTGACAATAT CGAACTGATT ATTTCGACAA TTAATTTGCC TACAGTCACT ATGCCGGTCG CTTATGTTAC CGCTTTTTTT AATGATGCCG ATATTAAGCG GGTCACTGAA ATGGTGATTA CGGAAAAATT ACATCATGCG ACGTCTCGGG TCGTTGAAAT TTAA
|
Protein sequence | MQMITSRQNR LLRFLLPRRE YTTIVTIAGY LNVSEKTIQR DLRLLEQWLG QWRINVEKRA GAGVMLSAEN IADLLHLDHL LGAECEEIDG VMNNARRVKI ASQLLSETPN ETSISKLSER YFISGASIVN DLRVIESWLA PLGLSLIRSP SGTHIEGSEG QVRQAMALLI NGIINHNEPQ GVVYSRLDPG SYKALVHYFG EEEVLFVQSL LLDMENELSW SLGEPYYVNI FTHILIMMYR NTHGNALSRE EDQTRQYDEN IFNVASQMIH KIEQRIAHTL PDDEVWFIYQ YIISSGVAID GQKDVSIISH MQASNEARLI TWRLITVFSD IVDCDFSEDS ALYDGLMVHI KPLINRLNYR IHIRNPLLED IKAELADVWR LTQYVVNQVF KTWGENAVSE DEVGYLTVHF QAAMERQIAR KRVLLVCSTG IGTSHLLKSR ILRAFPEWTI VDVISAANLS QVLPDNIELI ISTINLPTVT MPVAYVTAFF NDADIKRVTE MVITEKLHHA TSRVVEI
|
| |