Gene EcolC_2641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2641 
Symbol 
ID6065885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2892061 
End bp2893821 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content52% 
IMG OID641602048 
Productpeptidase S16 lon domain-containing protein 
Protein accessionYP_001725598 
Protein GI170020644 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000712327 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000606295 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGACCATTA CGAAACTTGC ATGGCGTGAC CTGGTTCCTG ATACCGATAG CTATCAGGAA 
ATATTTGCTC AGCCACATTT GATTGACGAA AACGATCCTT TATTCAGTGA TACTCAACCG
CGGCTGCAAT TTGCGCTGGA GCAGTTGCTG CATACGCGAG CATCCTCCTC TTTTATGCTG
GCGAAGGCCC CGGAAGAGTC TGAGTATCTG AATCTTATTG CCAATGCCGC GCGTACGCTA
CAAAGCGATG CAGGCCAACT GGTGGGCGGT CACTATGAGG TTTCCGGCCA CTCCATCCGC
TTACGTCACG CAGTGAGTGC AGATGATAAT TTTGCGACTT TAACGCAAGT TGTCGCTGCC
GACTGGGTAG AAGCGGAGCA ACTCTTTGGC TGCCTGCGCC AGTTTAATGG CGACATTACC
CTGCAGCCTG GTCTGGTGCA TCAGGCAAAT GGCGGTATTC TCATCATCTC TTTGCGTACA
CTGCTGGCGC AACCTCTGCT GTGGATGCGG CTGAAAAATA TCGTTAACCG CGAGCGTTTT
GACTGGGTTG CGTTTGATGA GTCGCGCCCT CTCCCCGTCT CTGTGCCTTC GATGCCATTG
AAGCTGAAAG TCATTCTGGT AGGCGAACGC GAATCATTGG CTGATTTCCA GGAGATGGAG
CCAGAGCTTT CAGAGCAGGC TATTTATAGC GAATTTGAAG ATACTCTGCA GATTGTCGAT
GCGGAGTCAG TAACCCAGTG GTGTCGCTGG GTGACATTTA CCGCCAGACA TAATCACTTA
CCTGCACCGG GAGCGGATGC CTGGCCGATA CTTATCCGCG AAGCAGCACG CTACACCGGT
GAACAAGAAA CACTTCCGCT TAGCCCGCAG TGGATCCTCC GCCAGTGTAA AGAGGTCGCC
TCCCTGTGTG ATGGCGACAC CTTCTCCGGC GAGCAGCTAA ACTTAATGCT GCAGCAGCGT
GAATGGCGCG AAGGTTTCCT CGCTGAACGT ATGCAGGATG AGATCCTTCA GGAGCAAATC
CTGATTGAAA CCGAAGGCGA ACGCATCGGG CAAATTAACG CCCTTTCGGT CATTGAATTT
CCGGGTCATC CACGCGCTTT TGGCGAACCT TCTCGCATTA GTTGCGTTGT GCATATTGGC
GATGGTGAAT TCACCGACAT CGAACGCAAA GCGGAGCTTG GCGGCAATAT CCATGCGAAA
GGGATGATGA TCATGCAAGC GTTCCTGATG TCGGAACTAC AGCTTGAGCA ACAGATCCCC
TTCTCAGCAT CGCTGACATT TGAGCAGTCA TACAGTGAAG TTGATGGAGA TAGTGCCTCG
ATGGCTGAAC TCTGCGCCCT GATAAGCGCC CTCGCCGATG TGCCGGTGAA TCAGAGTATC
GCTATCACAG GTTCAGTCGA TCAGTTCGGT CGCGCCCAGC CGGTCGGTGG TTTAAATGAG
AAAATCGAAG GCTTCTTTGC TATTTGCCAG CAACGTGAGT TAACCGGGAA ACAAGGTGTC
ATTATCCCCA CAGCTAACGT TCGCCATTTA AGTCTTCACA GTGAACTGGT GAAAGCGGTA
GAAGAAGGCA AATTCACCAT CTGGGCAGTA GACGATGTGA CTGACGCACT GCCGTTATTA
TTAAATCTGG TGTGGGATGG CGAAGGCCAA ACGACGCTGA TGCAAACCAT CCAGGAACGT
ATCGCGCAAG CATCGCAACA GGAAGGACGT CACCGTTTTC CATGGCCATT ACGTTGGCTG
AACTGGTTTA TTCCGAACTG A
 
Protein sequence
MTITKLAWRD LVPDTDSYQE IFAQPHLIDE NDPLFSDTQP RLQFALEQLL HTRASSSFML 
AKAPEESEYL NLIANAARTL QSDAGQLVGG HYEVSGHSIR LRHAVSADDN FATLTQVVAA
DWVEAEQLFG CLRQFNGDIT LQPGLVHQAN GGILIISLRT LLAQPLLWMR LKNIVNRERF
DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELSEQAIYS EFEDTLQIVD
AESVTQWCRW VTFTARHNHL PAPGADAWPI LIREAARYTG EQETLPLSPQ WILRQCKEVA
SLCDGDTFSG EQLNLMLQQR EWREGFLAER MQDEILQEQI LIETEGERIG QINALSVIEF
PGHPRAFGEP SRISCVVHIG DGEFTDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP
FSASLTFEQS YSEVDGDSAS MAELCALISA LADVPVNQSI AITGSVDQFG RAQPVGGLNE
KIEGFFAICQ QRELTGKQGV IIPTANVRHL SLHSELVKAV EEGKFTIWAV DDVTDALPLL
LNLVWDGEGQ TTLMQTIQER IAQASQQEGR HRFPWPLRWL NWFIPN