Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3956 |
Symbol | rfaD |
ID | 6147033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4033982 |
End bp | 4034914 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618782 |
Product | ADP-L-glycero-D-mannoheptose-6-epimerase |
Protein accession | YP_001745921 |
Protein GI | 170683119 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000259209 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATCG TTACCGGCGG CGCGGGCTTT ATCGGCAGCA ACATCGTTAA AGCCCTGAAT GACAAAGGCA TCACCGATAT TCTGGTGGTG GATAACCTGA AAGACGGCAC CAAGTTTGTG AACCTGGTGG ATCTGGATAT CGCGGACTAT ATGGATAAGG AAGACTTCCT GATCCAGATT ATGGCTGGCG AAGAGTTCGG CGATGTCGAA GCGATTTTCC ACGAAGGCGC GTGCTCTTCC ACCACCGAGT GGGACGGCAA GTATATGATG GATAACAACT ATCAATACTC CAAAGAGCTG CTGCACTACT GCCTGGAGCG CGAAATCCCG TTGCTGTACG CTTCTTCTGC AGCCACTTAC GGCGGACGCA CTTCCGACTT TATTGAATCC CGCGAGTACG AAAAACCACT TAACGTCTAC GGTTATTCGA AGTTCCTGTT TGATGAATAT GTTCGCCAAA TCCTGCCGGA AGCCAACTCG CAGATTGTTG GCTTCCGTTA TTTCAACGTT TATGGACCGC GTGAAGGCCA TAAAGGCAGC ATGGCGAGCG TCGCTTTCCA TCTCAACACC CAGCTTAATA ACGGTGAATC ACCGAAGCTG TTCGAAGGTA GCGAGAACTT CAAACGTGAC TTCGTCTATG TGGGCGATGT GGCAGATGTA AACCTGTGGT TCCTGGAAAA TGGCGTTTCC GGCATCTTCA ACCTCGGCAC CGGTCGTGCG GAATCCTTCC AGGCAGTAGC AGATGCTACA CTTGCTTATC ACAAGAAAGG CCAAATCGAA TACATTCCGT TCCCGGATAA ACTGAAAGGC CGCTACCAGG CGTTCACGCA GGCAGATCTG ACGAATCTGC GCGCGGCGGG TTACGATAAA CCGTTCAAAA CCGTTGCCGA AGGCGTAACG GAATACATGG CCTGGCTGAA TCGTGACGCA TAA
|
Protein sequence | MIIVTGGAGF IGSNIVKALN DKGITDILVV DNLKDGTKFV NLVDLDIADY MDKEDFLIQI MAGEEFGDVE AIFHEGACSS TTEWDGKYMM DNNYQYSKEL LHYCLEREIP LLYASSAATY GGRTSDFIES REYEKPLNVY GYSKFLFDEY VRQILPEANS QIVGFRYFNV YGPREGHKGS MASVAFHLNT QLNNGESPKL FEGSENFKRD FVYVGDVADV NLWFLENGVS GIFNLGTGRA ESFQAVADAT LAYHKKGQIE YIPFPDKLKG RYQAFTQADL TNLRAAGYDK PFKTVAEGVT EYMAWLNRDA
|
| |