Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3664 |
Symbol | damX |
ID | 6143251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3722809 |
End bp | 3724095 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618491 |
Product | hypothetical protein |
Protein accession | YP_001745631 |
Protein GI | 170683150 |
COG category | [S] Function unknown |
COG ID | [COG3266] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00402866 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.034061 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAAT TCAAACCAGA AGACGAGCTG AAACCCGATC CCAGCGATCG TCGTACTGGT CGTTCTCGTC AATCTTCTGA ACGTTCTGAG CGTACTGAAC GTGGCGAACC GCAGATCAAT TTTGATGATA TTGAACTTGA TGACACTGAC GATCGCCGTC CGACTCGTGC GCAAAAAGAG CGCAATGATG AGCCGGAAAT CGAAGAAGAA ATTGACGAAT CCGAAGATGA AATCGTGGAT GAAGAGCGCG TAGAGCGTCG TCCGCGTAAG CGCAAAAAAG CAGCCAGTAA ACCCGCTTCT CGTCAGTATA TGATGATGGG GGTCGGCATT CTGGTTCTCC TACTGTTGAT CATCGGTATC GGTTCTGCGC TAAAAGCCCC CTCGACCTCT TCCAGCGATC AAACCGCGTC TGGCGAGAAG AGTATTGATC TTGCAGGCAA TGCGACCGAT CAGGCGAATG GTGTGCAGCC AGCGCCGGGA ACCACGTCTG CGGAAAATAC TCAGCAGGAT GTTTCTCTGC CGCCGATCTC TTCTACGCCG ACTCAAGGGC AAACCCCGGC GGCAACGGAT GGTCAACAAC GTGTTGAAGT GCAGGGTGAC CTGAACAATG CGCTGACCCA GCCACAAAAT CAGCAACAGT TGAATAATGT GGCGGTCAAT TCCACGTTGC CGACTGAACC CGCGACGGTC GCGCCTGTTC GCAATGGCAA TGCATCGCGT GACACGGCGA AAACGCAAAC CGCTGAACGT CCGGCCACTA CGCGTCCAGC TCGCCAGCAG GCGGTGATTG AACCGAAAAA ACCGCAAGCA ACCGTGAAAA CGGAGCCGAA GCCGGTAGCA CAGACGCCGA AGCGTACTGA ACCAGCTGCT CCTGTGGCAA GCACGAAGGC ACCGGCTGCG ACCTCTACGC CAGCACCAAA AGAGACCGCA ACTACGGCTC CAGTACAGAC GGCATCCCCG GCGCAAACCA CGGCAACACC AGCCGCTGGA GGGAAGACCG CAGGTAATGT GGGTTCGTTG AAATCGGCAC CGTCCAGCCA TTACACTCTG CAGCTGAGCA GTTCCTCTAA CTACGACAAC CTGAACGGTT GGGCGAAGAA AGAGAATCTG AAAAACTACG TTGTCTATGA AACGACGCGT AATGGTCAGC CGTGGTATGT CCTGGTTTCT GGCGTGTACG CTTCGAAAGA AGAGGCGAAA AAAGCGGTAT CTACATTGCC AGCCGATGTC CAGGCCAAAA ACCCGTGGGC GAAACCGCTG CGTCAGGTAC AGGCCGATCT GAAGTAA
|
Protein sequence | MDEFKPEDEL KPDPSDRRTG RSRQSSERSE RTERGEPQIN FDDIELDDTD DRRPTRAQKE RNDEPEIEEE IDESEDEIVD EERVERRPRK RKKAASKPAS RQYMMMGVGI LVLLLLIIGI GSALKAPSTS SSDQTASGEK SIDLAGNATD QANGVQPAPG TTSAENTQQD VSLPPISSTP TQGQTPAATD GQQRVEVQGD LNNALTQPQN QQQLNNVAVN STLPTEPATV APVRNGNASR DTAKTQTAER PATTRPARQQ AVIEPKKPQA TVKTEPKPVA QTPKRTEPAA PVASTKAPAA TSTPAPKETA TTAPVQTASP AQTTATPAAG GKTAGNVGSL KSAPSSHYTL QLSSSSNYDN LNGWAKKENL KNYVVYETTR NGQPWYVLVS GVYASKEEAK KAVSTLPADV QAKNPWAKPL RQVQADLK
|
| |