Gene EcSMS35_3835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3835 
Symbol 
ID6145279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3904983 
End bp3906479 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content54% 
IMG OID641618661 
ProductM16 family peptidase 
Protein accessionYP_001745801 
Protein GI170682195 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.173464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGCA CAAAAATTCG ACTTTTAGCG GGCGGTTTGC TGATGATGGC CACTGCTGGC 
TATGTGCAGG CAGATGCGCT CCAGCCTGAT CCAGCATGGC AACAGGGGAC GCTTTCCAAC
GGTTTACAGT GGCAAGTGCT GACTACACCC CAGCGTCCCA GCGATCGTGT TGAAATTCGC
CTGCTGGTTA ATACCGGTTC GCTCGCCGAA AGTACACAAC AGAGCGGTTA CAGTCACGCC
ATCCCTCGTA TTGCGCTAAC GCAAAGCGGT GGCCTTGACG CAGCGCAGGC GCGTTCATTG
TGGCAGCAGG GGATCGACCC TAAACGCCCG ATGCCGCCGG TAATTGTCTC TTATGACACC
ACGCTGTTTA ACCTGAGTTT GCCCAATAAC CGTAACGACT TGCTGAAAGA AGCGCTCTCT
TATCTGGCAA ATGCCACTGG CAAACTGACT ATCACGCCAG AAACCATCAA CCACGCGCTG
CAAAGTCAGG ACATGGTGGC AACCTGGCCT GCCGATACTA AAGAGGGCTG GTGGCGTTAT
CGTCTGAAAG GATCAACCTT GTTAGGTCAC GATCCTGCCG ATCCGCTGAA ACAACCCGTT
GAAGCGGAAA AGATTAAAGA TTTCTATCAG AAATGGTACA CCCCGGATGC AATGACGCTG
CTGGTGGTGG GAAACGTGGA TGCGCGCTCG GTCGTCGACC AAATCAATAA AACGTTTGGC
GAACTGAAAG GCAAACGTGA AACACCGGCT CCGGTGCCGA CGCTTTCTCC GCTGCGTGCG
GAAGCGGTGA GTATTATGAC CGACGCGGTG CGTCAGGACC GGTTATCTAT CATGTGGGAT
ACGCCGTGGC AGCCGATTCG TGAATCAGCC GCACTGCTGC GCTACTGGCG TGCGGACCTG
GCCCGTGAGG CGCTGTTCTG GCACGTTCAG CAAGCGTTAA GCGCCAGTAA CAACAAAGAA
ATCGGTCTTG GATTTGATTG CCGTGTGCTG TATCTGCGTG CGCAGTGTGC CATCAACATC
GAATCACCAA ACGACAAGCT GAACAGCAAC CTTAATCTGG TGGCGCGTGA ACTGGCAAAG
GTTCGCGATA AAGGTCTGCC GGAAGAAGAG TTCAATGCGT TAGTGGCGCA AAAGAAACTG
GAGCTGCAGA AACTGTTTGC CGCCTATGCG CGAGCTGATA CCGATATTCT GATGGGGCAG
CGGATGCGTT CGTTGCAAAA TCAGGTCGTC GATATCGCGC CGGAGCAATA TCAGAAACTG
CGTCAGGATT TCCTTAACAG CCTGACGGTG GAGATGTTAA ATCAGGATCT GCGTCAGCAG
TTGTCGAATG ATATGGCGTT AATACTGCTG CAGCCGAAAG GCGAGCCGGA ATTTAACATG
AAAGCGTTGC AGGCGGCCTG GGATCAAATC ATGGCCCCAT CGACTGCGGC TGCCGCCACA
TCTGTCGCCA CGGATGACGT ACATCCTGAA GTGACGGATA TTCCACCCGC ACAGTGA
 
Protein sequence
MQGTKIRLLA GGLLMMATAG YVQADALQPD PAWQQGTLSN GLQWQVLTTP QRPSDRVEIR 
LLVNTGSLAE STQQSGYSHA IPRIALTQSG GLDAAQARSL WQQGIDPKRP MPPVIVSYDT
TLFNLSLPNN RNDLLKEALS YLANATGKLT ITPETINHAL QSQDMVATWP ADTKEGWWRY
RLKGSTLLGH DPADPLKQPV EAEKIKDFYQ KWYTPDAMTL LVVGNVDARS VVDQINKTFG
ELKGKRETPA PVPTLSPLRA EAVSIMTDAV RQDRLSIMWD TPWQPIRESA ALLRYWRADL
AREALFWHVQ QALSASNNKE IGLGFDCRVL YLRAQCAINI ESPNDKLNSN LNLVARELAK
VRDKGLPEEE FNALVAQKKL ELQKLFAAYA RADTDILMGQ RMRSLQNQVV DIAPEQYQKL
RQDFLNSLTV EMLNQDLRQQ LSNDMALILL QPKGEPEFNM KALQAAWDQI MAPSTAAAAT
SVATDDVHPE VTDIPPAQ