Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2910 |
Symbol | |
ID | 5589342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2914257 |
End bp | 2915378 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640926563 |
Product | putative type I restriction-modification system, S subunit |
Protein accession | YP_001463945 |
Protein GI | 157159064 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAGT GGATAAAAAC TAAACTTGGT GAGATTGTCA TTCTTAATTA TGGTAAAGCA TTAAAGGCAC AGGATAGGAA TGCTGGCTCT ATACCTGTAT ATAGTTCAGG TGGATTAACT GGTTGGCATA ACAAAGCATT AATAAACGAA CAAGGAATTA TTATTGGTCG TAAAGGAACT GTTGGGAAAG CGTACTTAAC TTATGGACCA TTTTGGTGTA TTGATACTGC TTATTATATT TTACCTAACC CTTCGAAATA TGATTTTGTT TTTTTATTCT ATTTGTTGAA AACATTAGGG CTGGAAGAAC TGAACGAAGA TAGCGCTGTT CCTGGACTAA ACAGGGATAC CGCTTATAGT CAAGAGATCC TATTACCTTC ACTGCCAGAG CAAAAAACCA TCGCCTCTGT GCTGTCCTCT CTTGATGACA AAATAGACTT GCTTCATCGC CAGAATAAAA CTCTGGAATC CATGGCCGAA ACCCTGTTTA GGCAGTGGTT TATTTTAGAT AGTACTGGTG TATCAGTTAG TATAGATCAG ATAATTGACT TTAATCCAAA GCGAACTTTA ATTAAAAGCC AAGACGCTAC TTATTTGGAC ATGGCTGGAC TGAGCACTGT TATTTTTCGT GCAAATGGAT ATTACAGACG TCCATTTTCT TCAGGAACTA AATTTACTAA ACGAGATACC TTGCTAGCCA GAATTACCCC ATGTTTAGAA AATGGGAAAG CGGCCTATAT TGATTTTTTA GACGACAATG AAACTGGATG GGGTTCTACC GAATTCATAG TTATGCGTCC TAAAAAAGAA ATCCATCCTT TTATCTCATA TATCATGTGC CGAAACCCGG ACTTCAAAGA ATATGCAGAA AGCTGTATGG AAGGTTCAAC TGGCAGACAA CGAGTTAACT TGGATCATCT TAAGAAGTTC AATGTTAATC TGCCAACAGA AGCTTCGCTA CGTATAATTA ATGAGCTATT AGATTCATTT GAAAGTAAAC TTATTAACAA CTCAAAACAA ATTGATAGCT TAGAAAAACT CCGCGATACC TTACTTCCCA AACTGATGAG CGGTGAAGTA CGGGTTCAGT ATGCAGAAGA AGCAATCGCA TCAGTAGCAT AA
|
Protein sequence | MSQWIKTKLG EIVILNYGKA LKAQDRNAGS IPVYSSGGLT GWHNKALINE QGIIIGRKGT VGKAYLTYGP FWCIDTAYYI LPNPSKYDFV FLFYLLKTLG LEELNEDSAV PGLNRDTAYS QEILLPSLPE QKTIASVLSS LDDKIDLLHR QNKTLESMAE TLFRQWFILD STGVSVSIDQ IIDFNPKRTL IKSQDATYLD MAGLSTVIFR ANGYYRRPFS SGTKFTKRDT LLARITPCLE NGKAAYIDFL DDNETGWGST EFIVMRPKKE IHPFISYIMC RNPDFKEYAE SCMEGSTGRQ RVNLDHLKKF NVNLPTEASL RIINELLDSF ESKLINNSKQ IDSLEKLRDT LLPKLMSGEV RVQYAEEAIA SVA
|
| |