Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3835 |
Symbol | |
ID | 6145279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3904983 |
End bp | 3906479 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618661 |
Product | M16 family peptidase |
Protein accession | YP_001745801 |
Protein GI | 170682195 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.173464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGCA CAAAAATTCG ACTTTTAGCG GGCGGTTTGC TGATGATGGC CACTGCTGGC TATGTGCAGG CAGATGCGCT CCAGCCTGAT CCAGCATGGC AACAGGGGAC GCTTTCCAAC GGTTTACAGT GGCAAGTGCT GACTACACCC CAGCGTCCCA GCGATCGTGT TGAAATTCGC CTGCTGGTTA ATACCGGTTC GCTCGCCGAA AGTACACAAC AGAGCGGTTA CAGTCACGCC ATCCCTCGTA TTGCGCTAAC GCAAAGCGGT GGCCTTGACG CAGCGCAGGC GCGTTCATTG TGGCAGCAGG GGATCGACCC TAAACGCCCG ATGCCGCCGG TAATTGTCTC TTATGACACC ACGCTGTTTA ACCTGAGTTT GCCCAATAAC CGTAACGACT TGCTGAAAGA AGCGCTCTCT TATCTGGCAA ATGCCACTGG CAAACTGACT ATCACGCCAG AAACCATCAA CCACGCGCTG CAAAGTCAGG ACATGGTGGC AACCTGGCCT GCCGATACTA AAGAGGGCTG GTGGCGTTAT CGTCTGAAAG GATCAACCTT GTTAGGTCAC GATCCTGCCG ATCCGCTGAA ACAACCCGTT GAAGCGGAAA AGATTAAAGA TTTCTATCAG AAATGGTACA CCCCGGATGC AATGACGCTG CTGGTGGTGG GAAACGTGGA TGCGCGCTCG GTCGTCGACC AAATCAATAA AACGTTTGGC GAACTGAAAG GCAAACGTGA AACACCGGCT CCGGTGCCGA CGCTTTCTCC GCTGCGTGCG GAAGCGGTGA GTATTATGAC CGACGCGGTG CGTCAGGACC GGTTATCTAT CATGTGGGAT ACGCCGTGGC AGCCGATTCG TGAATCAGCC GCACTGCTGC GCTACTGGCG TGCGGACCTG GCCCGTGAGG CGCTGTTCTG GCACGTTCAG CAAGCGTTAA GCGCCAGTAA CAACAAAGAA ATCGGTCTTG GATTTGATTG CCGTGTGCTG TATCTGCGTG CGCAGTGTGC CATCAACATC GAATCACCAA ACGACAAGCT GAACAGCAAC CTTAATCTGG TGGCGCGTGA ACTGGCAAAG GTTCGCGATA AAGGTCTGCC GGAAGAAGAG TTCAATGCGT TAGTGGCGCA AAAGAAACTG GAGCTGCAGA AACTGTTTGC CGCCTATGCG CGAGCTGATA CCGATATTCT GATGGGGCAG CGGATGCGTT CGTTGCAAAA TCAGGTCGTC GATATCGCGC CGGAGCAATA TCAGAAACTG CGTCAGGATT TCCTTAACAG CCTGACGGTG GAGATGTTAA ATCAGGATCT GCGTCAGCAG TTGTCGAATG ATATGGCGTT AATACTGCTG CAGCCGAAAG GCGAGCCGGA ATTTAACATG AAAGCGTTGC AGGCGGCCTG GGATCAAATC ATGGCCCCAT CGACTGCGGC TGCCGCCACA TCTGTCGCCA CGGATGACGT ACATCCTGAA GTGACGGATA TTCCACCCGC ACAGTGA
|
Protein sequence | MQGTKIRLLA GGLLMMATAG YVQADALQPD PAWQQGTLSN GLQWQVLTTP QRPSDRVEIR LLVNTGSLAE STQQSGYSHA IPRIALTQSG GLDAAQARSL WQQGIDPKRP MPPVIVSYDT TLFNLSLPNN RNDLLKEALS YLANATGKLT ITPETINHAL QSQDMVATWP ADTKEGWWRY RLKGSTLLGH DPADPLKQPV EAEKIKDFYQ KWYTPDAMTL LVVGNVDARS VVDQINKTFG ELKGKRETPA PVPTLSPLRA EAVSIMTDAV RQDRLSIMWD TPWQPIRESA ALLRYWRADL AREALFWHVQ QALSASNNKE IGLGFDCRVL YLRAQCAINI ESPNDKLNSN LNLVARELAK VRDKGLPEEE FNALVAQKKL ELQKLFAAYA RADTDILMGQ RMRSLQNQVV DIAPEQYQKL RQDFLNSLTV EMLNQDLRQQ LSNDMALILL QPKGEPEFNM KALQAAWDQI MAPSTAAAAT SVATDDVHPE VTDIPPAQ
|
| |