Gene EcolC_2356 details


Gene Information

Locus tag: EcolC_2356
Symbol:
ID: 6064794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2596246
End bp: 2597295
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 50%
IMG OID: 641601759
Product: putative periplasmic protease
Protein accession: YP_001725318
Protein GI: 170020364
COG category: [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
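
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function name cds_lengths is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon encodes no residue
    return gene_len, protein_len


print(cds_lengths(2596246, 2597295))  # (1050, 349)
```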


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.203735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000387336
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAATTGT TGTCTGAATA TGGTTTGTTT TTGGCGAAAA TCGTTACCGT TGTGCTAGCG 
ATTGCGGCGA TTGCCGCCAT TATTGTCAAT GTTGCTCAAC GTAATAAACG CCAGCGTGGC
GAGTTACGGG TCAACAATCT CAGCGAACAG TATAAGGAGA TGAAAGAAGA ACTGGCCGCG
GCGCTGATGG ACTCACATCA GCAAAAACAG TGGCACAAAG CGCAGAAGAA AAAGCATAAG
CAAGAAGCGA AAGCAGCAAA AGCGAAAGCC AAACTGGGAG AGGTGGCAAC TGACAGTAAA
CCCCGCGTCT GGGTGCTGGA TTTTAAAGGC AGCATGGACG CCCATGAAGT GAACTCGCTA
CGTGAAGAGA TAACGGCTGT ACTCGCAGCA TTCAAACCGC AGGATCAGGT TGTACTACGT
CTGGAAAGCC CTGGTGGCAT GGTGCATGGT TACGGGTTGG CGGCTTCGCA GCTGCAGCGT
CTGCGTGATA AAAACATTCC TTTAACTGTT ACGGTAGACA AAGTCGCTGC CAGCGGCGGT
TACATGATGG CCTGTGTGGC GGACAAAATT GTTTCCGCAC CGTTTGCTAT TGTGGGTTCC
ATTGGGGTGG TGGCGCAAAT GCCCAACTTT AACCGCTTCC TGAAAAGCAA AGATATTGAT
ATCGAACTTC ACACCGCCGG GCAGTATAAG CGTACGCTGA CCTTGCTGGG TGAAAATACC
GAAGAAGGGC GGGAGAAATT CCGCGAAGAG CTGAACGAAA CGCATCAGTT ATTTAAAGAT
TTTGTGAAGC GTATGCGTCC GTCTCTGGAT ATTGAACAGG TGGCAACGGG TGAACACTGG
TACGGACAAC AGGCGGTAGA GAAAGGCCTG GTTGATGAAA TCAACACCAG TGATGAAGTT
ATTCTTAGCC TGATGGAAGG CCGTGAAGTG GTCAATGTAC GCTATATGCA GCGTAAACGA
CTCATTGACC GACTCACCGG CAGCGCGGCA GAGAGCGCCG ATCGATTGTT GTTACGCTGG
TGGCAGCGGG GGCAAAAGCC ATTGATGTAA
 
Protein sequence
MELLSEYGLF LAKIVTVVLA IAAIAAIIVN VAQRNKRQRG ELRVNNLSEQ YKEMKEELAA 
ALMDSHQQKQ WHKAQKKKHK QEAKAAKAKA KLGEVATDSK PRVWVLDFKG SMDAHEVNSL
REEITAVLAA FKPQDQVVLR LESPGGMVHG YGLAASQLQR LRDKNIPLTV TVDKVAASGG
YMMACVADKI VSAPFAIVGS IGVVAQMPNF NRFLKSKDID IELHTAGQYK RTLTLLGENT
EEGREKFREE LNETHQLFKD FVKRMRPSLD IEQVATGEHW YGQQAVEKGL VDEINTSDEV
ILSLMEGREV VNVRYMQRKR LIDRLTGSAA ESADRLLLRW WQRGQKPLM
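
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch of translation under table 11. The gene starts with GTG, which is read as methionine at the initiation position. The amino acid assignments below are those of the standard genetic code, which table 11 shares; the set of alternative start codons is taken from the NCBI description of table 11 and should be treated as an assumption here. The helper names (clean, gc_percent, translate) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiation codons for table 11; a start codon is translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"                 # initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":                # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGGAATTGT TGTCTGAATA TGGTTTGTTT"))  # MELLSEYGLF
```

Passing the full gene sequence to gc_percent and translate would allow the GC content and protein sequence fields to be compared against the listed values.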