Gene Information |
Locus tag | EcSMS35_1330 |
Symbol | |
ID | 6145926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1318470 |
End bp | 1319792 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616208 |
Product | hypothetical protein |
Protein accession | YP_001743388 |
Protein GI | 170682487 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
| |
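The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1318470
end_bp = 1319792
gene_length_bp = 1323
protein_length_aa = 440

# Assumption: coordinates are 1-based and inclusive on both ends.
computed_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one untranslated stop codon.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)               # coordinates agree with Gene Length
print(computed_protein_length == protein_length_aa)    # gene length agrees with Protein Length
```

Under these assumptions the listed gene length (1323 bp) and protein length (440 aa) are consistent with the coordinates.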
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0147983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAACAGA TAGCCCGCTC TGTCGCCCTG GCGTTTAATA ATTTACCGCG ACCACACCGC GTTATGTTGG GGTCGCTCAC CGTTCTTACT CTGGCCGTCG CTGTCTGGCG GCCCTATGTT TATCACCGCG ACGCCACGCC AATTGTCAAA ACCATTGAGC TGGAACAGAA CGAAATTCGT TCGCTCTTAC CTGAAGCCAG TGAGCCGATT GATCAAGCTG CACAAGAAGA TGAAGCCATT CCCCAGGATG AACTGGATGA CAAAATCGCC GGTGAAGCGG GCGTGCATGA ATATGTTGTT TCCACTGGCG ATACGCTAAG CAGCATTCTC AATCAGTATG GTATTGATAT GGGTGATATC ACCCAACTGG CTGCTGCCGA CAAAGAATTG CGTAACCTGA AAATCGGTCA ACAACTCTCC TGGACATTAA CCGCGGACGG CGAACTGCAA CGCCTCACCT GGGAAGTGTC TCGTCGTGAA ACCCGAACCT ATGACCGTAC TGCCGCTAAC GGTTTTAAAA TGACCAGCGA AATGCAGCAA GGAGAGTGGG TCAACAATCT GCTGAAAGGT ACCGTCGGGG GAAGCTTTGT TGCCAGCGCC AGAAACGCCG GTTTAACCAG CGCCGAAGTG AGCGCAGTGA TTAAAGCCAT GCAGTGGCAA ATGGATTTCC GCAAACTGAA AAAAGGCGAT GAATTTGCGG TGTTAATGTC ACGAGAAATG CTTGATGGTA AACGTGAGCA AAGCCAGCTG CTGGGCGTAC GTTTGCGTTC AGAAGGTAAA GATTATTACG CAATCCGCGC TGAAGATGGC AAATTCTACG ATCGTAACGG TACTGGTCTG GCGAAAGGAT TCTTGCGATT CCCGACGGCG AAACAGTTCC GTATCTCATC TAACTTTAAC CCGCGTCGTA CTAATCCGGT GACCGGTCGC GTTGCGCCAC ACAGAGGTGT TGATTTTGCC ATGCCACAAG GTACGCCAGT GCTTTCAGTG GGTGACGGTG AAGTGGTGGT TGCCAAACGC AGTGGCGCAG CAGGTTATTA TGTGGCTATT CGTCATGGTC GCAGCTACAC CACGCGTTAT ATGCACTTGC GCAAGATCCT GGTGAAACCG GGACAGAAGG TGAAACGTGG CGACCGTATC GCGCTTTCCG GTAATACCGG ACGTTCAACC GGGCCGCATC TGCACTATGA AGTATGGATA AACCAGCAGG CCGTAAACCC GCTGACGGCA AAACTGCCGC GTACCGAAGG GCTGACCGGC TCCGATCGTC GCGAATTCCT GGCACAGGCC AAAGAGATTG TGCCGCAGCT ACGGTTTGAT TAA
|
Protein sequence | MQQIARSVAL AFNNLPRPHR VMLGSLTVLT LAVAVWRPYV YHRDATPIVK TIELEQNEIR SLLPEASEPI DQAAQEDEAI PQDELDDKIA GEAGVHEYVV STGDTLSSIL NQYGIDMGDI TQLAAADKEL RNLKIGQQLS WTLTADGELQ RLTWEVSRRE TRTYDRTAAN GFKMTSEMQQ GEWVNNLLKG TVGGSFVASA RNAGLTSAEV SAVIKAMQWQ MDFRKLKKGD EFAVLMSREM LDGKREQSQL LGVRLRSEGK DYYAIRAEDG KFYDRNGTGL AKGFLRFPTA KQFRISSNFN PRRTNPVTGR VAPHRGVDFA MPQGTPVLSV GDGEVVVAKR SGAAGYYVAI RHGRSYTTRY MHLRKILVKP GQKVKRGDRI ALSGNTGRST GPHLHYEVWI NQQAVNPLTA KLPRTEGLTG SDRREFLAQA KEIVPQLRFD
|
| |
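The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch using translation table 11 (bacterial), as listed in the record. It assumes that table 11 shares the standard amino acid assignments and that an alternative start codon such as GTG is read as methionine at the first position. The function name `translate` and the constants are hypothetical helpers, not part of the database.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
gene_start = "GTGCAACAGA TAGCCCGCTC TGTCGCCCTG"
print(translate(gene_start))  # MQQIARSVAL
```

The output matches the first ten residues of the protein sequence listed above, including the leading M encoded by the GTG start codon.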