Gene EcSMS35_3357 details

Gene Information

Locus tag: EcSMS35_3357
Symbol: gcp
ID: 6145887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3435048
End bp: 3436061
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 57%
IMG OID: 641618186
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_001745336
Protein GI: 170682206
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
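
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon. Neither convention is stated in the record itself.

```python
# Values copied from the gene record.
START_BP = 3435048
END_BP = 3436061
GENE_LENGTH_BP = 1014
PROTEIN_LENGTH_AA = 337


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the reported fields.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the reported gene length and protein length.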


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000189454
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT 
GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC
GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CACGTGCGTA AAACCGTACC GTTGATCCAG
GAGGCGCTGA AAGAGTCTGG TTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA
GGCCCAGGAT TAGTCGGCGC ACTGCTGGTT GGCGCGACCG TGGGGCGTTC TCTGGCGTTT
GCCTGGGACG TTCCGGCAAT CCCGGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG
CTGGAAGATA ACCCACCGGC ATTTCCGTTT GTGGCGCTGC TGGTTTCCGG CGGTCATACG
CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TTGGCGAGTC TATCGATGAT
GCTGCTGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGG
CCATTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTCTT CCCGCGCCCG
ATGACCGACC GTCCGGGGCT GGATTTCAGC TTTTCTGGTC TGAAAACCTT CGCGGCGAAC
ACGATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA
GATGCGGTGG TCGATACGTT GATGATTAAG TGTAAGCGAG CACTGGATCA GACGGGCTTT
AAGCGACTGG TAATGGCAGG CGGCGTGAGT GCTAACCGCA CGTTACGGGC GAAGCTGGCT
GAAATGATGA AAAAACGCCG TGGCGAAGTG TTCTACGCGC GTCCGGAATT TTGTACTGAT
AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT
CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCAGAGT TACCGGCCGC GTAA
 
Protein sequence
MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ 
EALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWDVPAIPVH HMEGHLLAPM
LEDNPPAFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG
PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE
DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD
NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
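
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for joining such blocks into one contiguous string and computing a GC fraction follows. The helper names are hypothetical, and the GC calculation assumes the simple definition (G plus C over total length), which may differ from how the record's GC content value was derived.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, used here only as a short demonstration.
    first_line = "ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT"
    seq = join_blocks(first_line)
    print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `join_blocks` and `gc_fraction` gives a value that can be compared with the reported GC content.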