Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0330 |
Symbol | |
ID | 5590907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 335287 |
End bp | 336495 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640919516 |
Product | DNA methylase family protein |
Protein accession | YP_001457102 |
Protein GI | 157159784 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.00332873 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGTA ACTATATGCC ATTCACACAA CGAGATAGTC TCGGAAGATA CTATACAAAA GAATCCATAA GTGCATTATT GGTTTCTCAG ATGAAAGCTG AAAAAGTCAA TAATATTATT GATTTAGCTT CTGGTGAGGG GAGCTTAACC TATGCTGCTT TAGACCGATG GAAGAATGCA GAAGCATATT CTTTGGATAT AGAATCTCGG ATGTCAAAAA AAGTATGTGA TAATCTTACT CATATTGTAA CAGATGCCCT TGTTCATTCA TTTCCTGAAA TGCTTGCCCG ACATCAAGGA AATTTTGATG TTGCAGTATG TAATCCTCCA TTCACTCTCC CTGAATGGAG GGATGATTAT TTTAAAATCA TTAGTGAGAT TGGCGCAGAT AAATATATAT CTGTCTCAAA ATATGTCCCA GCAGAGATAA TATTTATATC TCAAGTCATC AGGTTTCTTA AAAAGGGCGG TGAGGCTGGA ATAATTTTAC CTGATGGTAT ATTTACAGCA AGAAAGTTTA TAGGTTTAAG ACGATATTTA TTGAATGAAC ATTCAATTAC AAAAGTCATT GAATTGCCTA GGAATATCTT CAAAAGGACA GAGGCTAAAA CACATATTTT AATTTTTAAT AAAAAAATTA TGCCTCATCA TAAAATACAA TTACATTGTA TAACTAAAGA TGGGGAATTG TCGCCTCCTG TTTTAATTAG AAAAGAAGAT GCGGTTGAGA GAATGGACTA CTCTTATCAT TATAATAAAA ATGAAGGTAA AGGGTTTAGC ACAATAGGGA TGCTTAAAAA TATTTCAATT TTTAGAGGTA GGTTTAATTC AAAGGAAATT ACGGAACATG TTTTTCATAC GACAAAATTT AGTGGTGATG AAAAGTACAT TAAATTCCAC TGCAACTCTG TAGAAGAATT GAAGCCATCA AAATTAGATG TCATTGCTAA GCCTGGTGAT ATATTAATAG CAAGAGTTGG GCGAAATTTT CATAAAAAAA TATTGTTTGT TGAGAGCGGC TATTCTTATA TCAGTGACTG TATTTTTCTG ATACGAGCCT CTGGTGGAGA TAAGAAAAAA CTATTTGATT TTCTTTGTTC TCAAGATGGG CAAGAGGAAT TATCTCGAGC GAGTAGTGGT GTAGCCGCAC AACATATTAC AATGGATGCA TTAAAAAAAA TACATCTTGT AAGGATTAAA CATGACTGA
|
Protein sequence | MNSNYMPFTQ RDSLGRYYTK ESISALLVSQ MKAEKVNNII DLASGEGSLT YAALDRWKNA EAYSLDIESR MSKKVCDNLT HIVTDALVHS FPEMLARHQG NFDVAVCNPP FTLPEWRDDY FKIISEIGAD KYISVSKYVP AEIIFISQVI RFLKKGGEAG IILPDGIFTA RKFIGLRRYL LNEHSITKVI ELPRNIFKRT EAKTHILIFN KKIMPHHKIQ LHCITKDGEL SPPVLIRKED AVERMDYSYH YNKNEGKGFS TIGMLKNISI FRGRFNSKEI TEHVFHTTKF SGDEKYIKFH CNSVEELKPS KLDVIAKPGD ILIARVGRNF HKKILFVESG YSYISDCIFL IRASGGDKKK LFDFLCSQDG QEELSRASSG VAAQHITMDA LKKIHLVRIK HD
|
| |