Gene Information |
Locus tag | EcSMS35_4690 |
Symbol | |
ID | 6144764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4789107 |
End bp | 4789967 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619506 |
Product | NmrA family protein |
Protein accession | YP_001746614 |
Protein GI | 170683021 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
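The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(4789107, 4789967)
print(length_bp)                  # 861
print(protein_length(length_bp))  # 286
```

Both values agree with the Gene Length and Protein Length fields of this record.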
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.557719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCAA TTACCGGCGC AACCGGCCAA CTTGGTCACT ATGTTATTGA ATCCTTGATG AAAACGGTTC CTGCCAGCCA AATAGTGGCT ATCGTTCGTA ATCCGGCAAA AGCCCAGGCA CTGACAGCAC AAGGCATTAC CGTGCGTCAG GCTGATTACG GCGATGAAGC CGCACTGACA TCTGCGCTTC AGGGAGTGGA AAAACTACTG CTGATCTCTT CCAGCGAAGT GGGTCAACGT GCCCCGCAGC ATCGTAATGT TATTAATGCC GCAAAGGCGG CTGGTGTGAA ATTTATCGCT TATACCAGTC TGCTTCACGC GGATAAATCT CCGCTCGGCC TCGCCGATGA GCACATTGAG ACGGAGAAAA TGTTGGCTGA TTCTGGCATC GTTTACACCC TGCTGCGCAA CGGCTGGTAC AGCGAAAACT ACCTCGCCAG CGCCCCGGCA GCACTGAAAC ACGGCGTATT TATCGGTGCG GCGGGCGATG GCAAAATCGC CTCAGCAACG CGGGCAGATT ATGCGGCAGC TGCGGCACGC GTGATTAGCG AAGCAGGTCA CGAAGGCAAG GTTTATGAAC TGGCAGGTGA TAGCGCCTGG ACACTGACAC AGTTAGCGGC GGAACTCACC AAACAGAGCG GCAAACAGGT TACCTATCAA AATCTGAGTG AAGCTGATTT CGCCGCGGCA CTGAAAAGCG TCGGACTGCC TGACGGACTG GCGGATATGC TGGCGGATTC TGACGTTGGC GCATCGAAAG GTGGTCTGTT TGATAACAGC AAAACGCTTA GCAAATTGAT TGGCCGCCCA ACGACAACGT TAGCCGAAAG CGTAAGCCAT CTTTTTAATG TTAATAACTA G
|
Protein sequence | MIAITGATGQ LGHYVIESLM KTVPASQIVA IVRNPAKAQA LTAQGITVRQ ADYGDEAALT SALQGVEKLL LISSSEVGQR APQHRNVINA AKAAGVKFIA YTSLLHADKS PLGLADEHIE TEKMLADSGI VYTLLRNGWY SENYLASAPA ALKHGVFIGA AGDGKIASAT RADYAAAAAR VISEAGHEGK VYELAGDSAW TLTQLAAELT KQSGKQVTYQ NLSEADFAAA LKSVGLPDGL ADMLADSDVG ASKGGLFDNS KTLSKLIGRP TTTLAESVSH LFNVNN
|
| |
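The sequences above are listed in space-separated blocks of 10, so they need cleaning before use. The sketch below strips the whitespace, computes GC content, and translates a coding sequence. It relies on two assumptions: the gene sequence is shown in coding orientation (it starts with ATG and ends with TAG, although the gene lies on the minus strand), and the codon-to-amino-acid assignments of translation table 11 match the standard code, differing only in the permitted start codons. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence in this record.
print(translate("ATGATCGCAA TTACCGGCGC AACCGGCCAA"))  # MIAITGATGQ
print(translate("CTTTTTAATG TTAATAACTA G"))           # LFNVNN
```

The two printed fragments match the first and last residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.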