Gene Information |
Locus tag | EcSMS35_0730 |
Symbol | nei |
ID | 6146476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 735503 |
End bp | 736294 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615619 |
Product | endonuclease VIII |
Protein accession | YP_001742818 |
Protein GI | 170683193 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | |
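The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 735503
end_bp = 736294
gene_length_bp = 792
protein_length_aa = 263


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 792
print(coding_codons(gene_length_bp))   # 263
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the record.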
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.307896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.698141 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAG GCCCGGAGAT CCGCCGTGCA GCGGATAACC TGGAGGCGGC GATCAAAGGC AAACCACTAA CTGATGTCTG GTTTGCCTTC CCGCAGTTAA AATCTTATCA ATCACGACTT ATCGGTCAAC ACGTTACCCA TGTGGAAACG CGTGGTAAGG CGTTGTTAAC TCATTTTTCC AACGACTTAA CGCTCTACAG CCATAATCAG CTTTACGGCG TCTGGCGCGT GGTTGAAACC GGCGAAGAGC CGCAGACCAC GCGAGTATTG CGGGTAAAAC TGCAAACGGC TGACAAAACC ATTCTGCTTT ATAGCGCCTC GGATATTGAG ATGTTGACCC CGGAACAACT GACCACGCAT CCGTTTTTAC AACGCGTTGG TCCCGATGTG CTGGATCCGA AACTGACGCC GGAGGTGGCG AAAGAACGGT TATTGTCGCC GCGCTTTCGT AACCGTCAGT TTGCTGGATT ACTGCTCGAT CAGGCGTTTC TGGCAGGACT TGGCAATTAT TTGCGGGTGG AGATCCTCTG GCAGGTTGGG TTGACCGGAA ATCATAAAGC GAAAGATCTC AATGCGGCGC AACTGGACGC ACTCGCACAC GCGTTACTGG ATATTCCGCG ACTTTCTTAC GCTACGCGGG GGCAGGTGGA TGAAAACAAA TATCATGGGG CGCTGTTTCG CTTTAAGGTT TTTCATCGTG ATGGCGAACC TTGCGAACGC TGTGGCGGCA TCATTGAGAA AACCACGCTG TCATCGCGCC CGTTTTACTG GTGCCCTGGC TGCCAGCACT AG
Protein sequence | MPEGPEIRRA ADNLEAAIKG KPLTDVWFAF PQLKSYQSRL IGQHVTHVET RGKALLTHFS NDLTLYSHNQ LYGVWRVVET GEEPQTTRVL RVKLQTADKT ILLYSASDIE MLTPEQLTTH PFLQRVGPDV LDPKLTPEVA KERLLSPRFR NRQFAGLLLD QAFLAGLGNY LRVEILWQVG LTGNHKAKDL NAAQLDALAH ALLDIPRLSY ATRGQVDENK YHGALFRFKV FHRDGEPCER CGGIIEKTTL SSRPFYWCPG CQH
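The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a field might be processed, the following joins the blocks, computes a GC fraction, and translates codons. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; alternative start codons are not handled, which does not matter here because this gene begins with ATG. All function names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the "Gene sequence" field in this record.
gene_prefix = join_blocks("ATGCCTGAAG GCCCGGAGAT CCGCCGTGCA")
print(translate(gene_prefix))    # MPEGPEIRRA
print(gc_fraction(gene_prefix))  # GC fraction of these 30 bases only
```

The translated prefix matches the first block of the Protein sequence field. The GC fraction printed here covers only the 30-base prefix, so it is not expected to equal the 53% reported for the whole gene.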