Gene EcSMS35_3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3950 
Symbol 
ID6145382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4028055 
End bp4029314 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID641618776 
Producthypothetical protein 
Protein accessionYP_001745915 
Protein GI170684238 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG4942] Membrane-bound metallopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.50769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGGG CCGTGAAACC GCGCAGGTTT GCAATCAGGC CCATCATCTA CGCCAGCGTT 
CTGAGCGCTG GCGTATTGTT GTGCGCCTTT TCCGCCCACG CGGATGAGCG TGACCAACTC
AAATCCATTC AGGCTGACAT CGCCGCGAAA GAGCGCGCGG TACGCCAAAA GCAACAACAA
CGCGCAAGCC TGCTCGCACA ATTGAAAAAG CAGGAAGAAG CGATCTCTGA AGCCACCCGT
AAACTGCGCG AAACGCAAAA CACGCTTAAT CAACTGAATA AACAGATTGA TGAGATGAAC
GCGTCGATTG CCAAACTGGA GCAGCAAAAA GCCGCCCAGG AGCGCAGCCT CGCCGCGCAA
CTGGATGCCG CGTTTCGTCA GGGTGAACAT ACCGGTATTC AGCTGATTCT CAGCGGTGAA
GAAAGCCAGC GTGGGCAGCG GTTACAGGCT TATTTCGGCT ATCTCAACCA GGCGCGACAA
GAAACCATTG CTCAGTTGAA ACAAACGCGT GAAGAAGTCG CTATGCAGCG TGCCGAACTG
GAAGAGAAAC AGAGCGAGCA ACAAACGCTT TTATATGAGC AGCGCGCCCA ACAGGCGAAG
CTGACCCAGG CGTTGAGCGA GCGTAAAAAG ACGCTGGCAG GGCTGGAGTC TTCCATCCAG
CAAGGTCAGC AACAGTTGAG CGAGCTGCGC GCCAACGAAT CCCGCCTGCG TAACAGCATT
GCCCGTGCGG AAGCTGCGGC GAAAGCGCGT GCTGAACGTG AAGCGCGCGA AGCCCAGGCG
GTTCGCGACC GCCAGAAAGA AGCGACGCGC AAAGGCACCA CCTACAAGCC GACCGAAAGC
GAAAAATCGC TGATGTCCCG TACCGGTGGT CTGGGCGCAC CGCGCGGTCA GGCATTCTGG
CCAGTTCGCG GGCCAACGCT GCATCGCTAT GGCGAACAGC TACAGGGTGA ATTACGCTGG
AAAGGGATGG TGATTGGTGC CTCTGAAGGT ACTGAAGTTA AAGCGATTGC CGACGGCCGG
GTGATTCTGG CTGACTGGCT GCAAGGCTAC GGTCTGGTGG TGGTGGTTGA GCACGGTAAA
GGCGACATGA GTCTTTACGG CTATAATCAG AGCGCACTGG TGAGCGTTGG TTCGCAGGTT
CGCGCGGGCC AGCCAATTGC ACTGGTGGGC AGCAGTGGCG GTCAGGGTCG GCCTTCACTC
TATTTCGAAA TTCGCCGCCA GGGTCAGGCG GTCAATCCAC AGCCGTGGTT GGGAAGATAA
 
Protein sequence
MTRAVKPRRF AIRPIIYASV LSAGVLLCAF SAHADERDQL KSIQADIAAK ERAVRQKQQQ 
RASLLAQLKK QEEAISEATR KLRETQNTLN QLNKQIDEMN ASIAKLEQQK AAQERSLAAQ
LDAAFRQGEH TGIQLILSGE ESQRGQRLQA YFGYLNQARQ ETIAQLKQTR EEVAMQRAEL
EEKQSEQQTL LYEQRAQQAK LTQALSERKK TLAGLESSIQ QGQQQLSELR ANESRLRNSI
ARAEAAAKAR AEREAREAQA VRDRQKEATR KGTTYKPTES EKSLMSRTGG LGAPRGQAFW
PVRGPTLHRY GEQLQGELRW KGMVIGASEG TEVKAIADGR VILADWLQGY GLVVVVEHGK
GDMSLYGYNQ SALVSVGSQV RAGQPIALVG SSGGQGRPSL YFEIRRQGQA VNPQPWLGR