Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3950 |
Symbol | |
ID | 6145382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4028055 |
End bp | 4029314 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618776 |
Product | hypothetical protein |
Protein accession | YP_001745915 |
Protein GI | 170684238 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG4942] Membrane-bound metallopeptidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.50769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGGG CCGTGAAACC GCGCAGGTTT GCAATCAGGC CCATCATCTA CGCCAGCGTT CTGAGCGCTG GCGTATTGTT GTGCGCCTTT TCCGCCCACG CGGATGAGCG TGACCAACTC AAATCCATTC AGGCTGACAT CGCCGCGAAA GAGCGCGCGG TACGCCAAAA GCAACAACAA CGCGCAAGCC TGCTCGCACA ATTGAAAAAG CAGGAAGAAG CGATCTCTGA AGCCACCCGT AAACTGCGCG AAACGCAAAA CACGCTTAAT CAACTGAATA AACAGATTGA TGAGATGAAC GCGTCGATTG CCAAACTGGA GCAGCAAAAA GCCGCCCAGG AGCGCAGCCT CGCCGCGCAA CTGGATGCCG CGTTTCGTCA GGGTGAACAT ACCGGTATTC AGCTGATTCT CAGCGGTGAA GAAAGCCAGC GTGGGCAGCG GTTACAGGCT TATTTCGGCT ATCTCAACCA GGCGCGACAA GAAACCATTG CTCAGTTGAA ACAAACGCGT GAAGAAGTCG CTATGCAGCG TGCCGAACTG GAAGAGAAAC AGAGCGAGCA ACAAACGCTT TTATATGAGC AGCGCGCCCA ACAGGCGAAG CTGACCCAGG CGTTGAGCGA GCGTAAAAAG ACGCTGGCAG GGCTGGAGTC TTCCATCCAG CAAGGTCAGC AACAGTTGAG CGAGCTGCGC GCCAACGAAT CCCGCCTGCG TAACAGCATT GCCCGTGCGG AAGCTGCGGC GAAAGCGCGT GCTGAACGTG AAGCGCGCGA AGCCCAGGCG GTTCGCGACC GCCAGAAAGA AGCGACGCGC AAAGGCACCA CCTACAAGCC GACCGAAAGC GAAAAATCGC TGATGTCCCG TACCGGTGGT CTGGGCGCAC CGCGCGGTCA GGCATTCTGG CCAGTTCGCG GGCCAACGCT GCATCGCTAT GGCGAACAGC TACAGGGTGA ATTACGCTGG AAAGGGATGG TGATTGGTGC CTCTGAAGGT ACTGAAGTTA AAGCGATTGC CGACGGCCGG GTGATTCTGG CTGACTGGCT GCAAGGCTAC GGTCTGGTGG TGGTGGTTGA GCACGGTAAA GGCGACATGA GTCTTTACGG CTATAATCAG AGCGCACTGG TGAGCGTTGG TTCGCAGGTT CGCGCGGGCC AGCCAATTGC ACTGGTGGGC AGCAGTGGCG GTCAGGGTCG GCCTTCACTC TATTTCGAAA TTCGCCGCCA GGGTCAGGCG GTCAATCCAC AGCCGTGGTT GGGAAGATAA
|
Protein sequence | MTRAVKPRRF AIRPIIYASV LSAGVLLCAF SAHADERDQL KSIQADIAAK ERAVRQKQQQ RASLLAQLKK QEEAISEATR KLRETQNTLN QLNKQIDEMN ASIAKLEQQK AAQERSLAAQ LDAAFRQGEH TGIQLILSGE ESQRGQRLQA YFGYLNQARQ ETIAQLKQTR EEVAMQRAEL EEKQSEQQTL LYEQRAQQAK LTQALSERKK TLAGLESSIQ QGQQQLSELR ANESRLRNSI ARAEAAAKAR AEREAREAQA VRDRQKEATR KGTTYKPTES EKSLMSRTGG LGAPRGQAFW PVRGPTLHRY GEQLQGELRW KGMVIGASEG TEVKAIADGR VILADWLQGY GLVVVVEHGK GDMSLYGYNQ SALVSVGSQV RAGQPIALVG SSGGQGRPSL YFEIRRQGQA VNPQPWLGR
|
| |