Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3719 |
Symbol | |
ID | 6068714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4070347 |
End bp | 4072104 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641603136 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001726656 |
Protein GI | 170021702 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTGG AAAAGCTGAT CGTTGATCAT ATGGAAACCT GGACCTCGGC GTTGCAAACC CGTTCCACCG CCGGGCGCGG CAGTTCCGGA AAAATTGATT TATATGGCAT TAAGAAATTA CGTGAGCTGA TTCTGGAACT GGCTGTGCGC GGTAAACTGG TGCCGCAGGA CCCGAACGAT GAACCGGCGT CGGAGCTGCT GAAGCGTATT GCGGCGGAAA AAGCGGAGCT GGTGAAGCAA GGTAAAATTA AAAAGCAAAA ACCGCTGCCG GAAATTAGCG AGGAAGAGAA GCCGTTTGAA TTGCCGGAGG GATGGGAGTG GGCAAGAATC AATGACATAG CTTCATTTAC AAATGGATAT GCATTTAAGA GTAGCGAGTT TCAGAATTCT GGAGTTGGTA TAGTCAAAAT AGGTGATATC GACAGTTCCG GGTTTATTTC TACCGCAGGT ATGTCATATG TTAGTGAAAA AAAAATAAAT GTATTACCCG AAGAGATGAG GGTTAATCCT GGTGATATGG TTATTGCTAT GTCTGGAGCA ACAACAGGGA AACTAGGGTT TAATAAAACA AAAAGCACGT TCCTTCTAAA TCAACGAGTC GGTAAAATTG TGACTTACTC AGTTGACAAA GAGTTCATTT ATCACTATTT ATCTACAAGA ATTGAAGAAA ATCTATCTAT TTCTTTAGGT AGTGCAATAC CAAATATTAG CACAGCACAA ATAAACAATA TCATTATTCC AATCCCCCCA TCAGATGAAC AAGTTAAAAT AATAGCCAGA GTTAAGTTAC TTATTTCCCT ATGCGACCAA CTGGAACAAC AATCCCTGAC CAGTCAGGAC GCACATCAGC AACTGGTTGA AACCCTATTG GGAACACTTA CAGACAGCCA AAACGCCGAG GAACTGGCTG AAAACTGGGC GCGTATTAGC GAGCATTTCG ACACACTATT TACCACAGAA GCCAGCGTGG ATGCGTTAAA ACAGACCATT CTGCAACTGG CCGTAATGGG TAAACTTGTG CCGCAGGATC CGAATGATGA ACCAGCCTCT GAACTGCTCA AACGAATTGC GCAGGAAAAA GCTCAACTGG TGAAAGAAGG AAAAATAAAA AAACCGTTGC CGCCAATTAG CGATGAGGAA AAACCGTTTG AACTTCCGGA AGGGTGGGAG TGGTGTTTAT TTGAAGATAT TATTGATATT CAAAGTGGTA TCACTAAAGG AAGAAATTTA TCAAATAGAA CTTTGGTAAA AGTTCCTTAT TTACGTGTTG CAAACGTCCA ACGCGGATAT CTTGATCTTA CGGAAATTAA ACAGATTGAA ATCCCTATTG AGGAAAAAGA AAAATATCAA GTAGTCAAGG GAGATTTATT GATCACAGAA GGCGGCGACT GGGATACAGT CGGGAGAACT ACAGTATGGT GTCATGACTG GTATATAGCA AATCAAAACC ATGTATTCAA AGGACGAAAT ATAGGGCAAT ATGTTGATCC ATATTGGTTA GAAACATATA TGAATAGCCC ATTCTCAAGA CAATATTTTG CTAACGCAAG TAAGCAAACC ACTAATTTAG CTTCTATTAA TAAAACCCAG CTCAGAGGTT GTCCTGTTGC TATTCCTCCT AGCTCAGAAG CAAAAAAAAT AATGAGTAAA CTACATATTT TTTATAAACT ATGTGAAGAA TTAAAGAATC ATATCCAATC CGCCCAGCAA ACCCAACTGC ACCTTGCAGA TGCACTCACT GACGCGGCGG TAAACTAA
|
Protein sequence | MSVEKLIVDH METWTSALQT RSTAGRGSSG KIDLYGIKKL RELILELAVR GKLVPQDPND EPASELLKRI AAEKAELVKQ GKIKKQKPLP EISEEEKPFE LPEGWEWARI NDIASFTNGY AFKSSEFQNS GVGIVKIGDI DSSGFISTAG MSYVSEKKIN VLPEEMRVNP GDMVIAMSGA TTGKLGFNKT KSTFLLNQRV GKIVTYSVDK EFIYHYLSTR IEENLSISLG SAIPNISTAQ INNIIIPIPP SDEQVKIIAR VKLLISLCDQ LEQQSLTSQD AHQQLVETLL GTLTDSQNAE ELAENWARIS EHFDTLFTTE ASVDALKQTI LQLAVMGKLV PQDPNDEPAS ELLKRIAQEK AQLVKEGKIK KPLPPISDEE KPFELPEGWE WCLFEDIIDI QSGITKGRNL SNRTLVKVPY LRVANVQRGY LDLTEIKQIE IPIEEKEKYQ VVKGDLLITE GGDWDTVGRT TVWCHDWYIA NQNHVFKGRN IGQYVDPYWL ETYMNSPFSR QYFANASKQT TNLASINKTQ LRGCPVAIPP SSEAKKIMSK LHIFYKLCEE LKNHIQSAQQ TQLHLADALT DAAVN
|
| |