Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4761 |
Symbol | |
ID | 6147429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4859524 |
End bp | 4860810 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641619574 |
Product | type I restriction modification DNA specificity domain-containing protein |
Protein accession | YP_001746681 |
Protein GI | 170680371 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCTG GCAACATAGT AGATGCTTTC GTATCTGACT CGAAATGGAA CTCAGTACCA GCTAAACGTT TATTTACGAG TAGCAAAGAA ATCAATCAAG GAATGAAAGA ATCTAATCGT CTTGCACTAA CAATGAAAGG TGTCATAAAC CGCTCGTTAG ACGATTTACA AGGACTTCAA TCTTCTGATT ACTCTGTTTA TCAGATTTTT GAAAAAGATG ACTTAGTTTT TAAGCTCATA GATCTTGAAA ATATAAAGAC TAGCCGAGTG GGGATTGTGC ATGAACGAGG AATAATGAGC CCTGCATACA TCCGTGTTTC AGCGAGTTCA AATAGCATCT ACCCAAGGTT CTATTACTGG TACTTTTTTG CTCTATACCT AACCAATATT TATAACAAAC TGGGTGGAGG AGTTCGTCAG AACCTAACAG CTGGTGATCT TTTAGAAATA CCTGTTCCAC TGATTGATAT TTCATTACAG AAGCAGGTTA GCACATTCCT TGATCGCGAA ACCCAGCGCA TCGACAGTCT GATCGAGGAA AAGCAGACCT TTATTAAGCT GCTCAAAGAA AAACGCCAAG CACTGATCAG CCATGTGGTC ACAAAGGGGC TCTATCCCAA TGTGGAGATG CAGGACTCTG GCATCGAGTG GATTGGGCAA GTGCCGAAGC ATTGGGAAGT CAAAAAGATA AAGCATATTT GCTCCAACTT TATGTATGGA ACCTCTCAAG ACTGTAACCA GTCTGATGTT GGCTATCCTG TTCTCAGGAT ACCAAACATC AAGAGTACCA ATGTTGATTT TGAAGATCTC AAATATGCAA ATATCAGTGA TGTTGATGCT TTAACTTATC TTTTATCAAG AGGCGACATT TTAGTCATCA GAACTAACGG TAACCCTAAT CTGGTCGGCC AAAGCGCGCT TTTTGATTCA AATGGGCAGT ATTTATTTGC GTCGTACCTA ATAAAGTTAA CACCAAAGCA AGGAGTGGAT ACTAGCTTCT TAGTGGAAGC GATGAACTCA CTATCTGTTC GTCAAGCATT AACATTTCAG TCAAGAACAT CGGTAGGTAA CTACAACCTG AGCATTCCTT CACTAGCTAA TACAAGTATT GCTATCCCAC CAATTGATGA GCAAAAGACA ATAACGAATT ACTTGAGTGC AGCAACGATA AACATTGATT TACTGATTCA AGAGACTGAT AAATCGATTG ATCTTCTAAA AGAACACCGC ACATCGCTGA TCAACGCGGC TGTCACAGGA AAAATAGACG TCAGGGAGGC GGTGTAA
|
Protein sequence | MNAGNIVDAF VSDSKWNSVP AKRLFTSSKE INQGMKESNR LALTMKGVIN RSLDDLQGLQ SSDYSVYQIF EKDDLVFKLI DLENIKTSRV GIVHERGIMS PAYIRVSASS NSIYPRFYYW YFFALYLTNI YNKLGGGVRQ NLTAGDLLEI PVPLIDISLQ KQVSTFLDRE TQRIDSLIEE KQTFIKLLKE KRQALISHVV TKGLYPNVEM QDSGIEWIGQ VPKHWEVKKI KHICSNFMYG TSQDCNQSDV GYPVLRIPNI KSTNVDFEDL KYANISDVDA LTYLLSRGDI LVIRTNGNPN LVGQSALFDS NGQYLFASYL IKLTPKQGVD TSFLVEAMNS LSVRQALTFQ SRTSVGNYNL SIPSLANTSI AIPPIDEQKT ITNYLSAATI NIDLLIQETD KSIDLLKEHR TSLINAAVTG KIDVREAV
|
| |