Gene EcolC_3193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3193 
Symbol 
ID6066618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3498403 
End bp3500757 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content52% 
IMG OID641602608 
ProductDNA-binding ATP-dependent protease La 
Protein accessionYP_001726142 
Protein GI170021188 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000724307 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000504746 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCCTG AGCGTTCTGA ACGCATTGAA ATCCCCGTAT TGCCGCTGCG CGATGTGGTG 
GTTTATCCGC ACATGGTCAT CCCCTTATTT GTCGGGCGGG AAAAATCTAT CCGTTGTCTG
GAAGCGGCGA TGGACCATGA TAAAAAAATT ATGCTGGTCG CGCAGAAAGA AGCTTCAACG
GATGAGCCGG GTGTAAACGA TCTTTTCACC GTCGGGACCG TGGCCTCTAT ATTGCAGATG
CTGAAACTGC CTGACGGCAC CGTCAAAGTG CTGGTCGAGG GGTTACAGCG CGCGCGTATT
TCTGCGCTCT CTGACAATGG CGAACACTTT TCTGCGAAGG CGGAGTATCT GGAGTCGCCG
ACCATTGATG AGCGGGAACA GGAAGTGCTG GTGCGTACTG CAATCAGCCA GTTCGAAGGC
TACATCAAGC TGAACAAAAA AATCCCACCA GAAGTGCTGA CGTCGCTGAA TAGCATCGAC
GATCCGGCGC GTCTGGCGGA TACCATTGCT GCACATATGC CGCTGAAACT GGCTGACAAA
CAGTCTGTTC TGGAGATGTC CGACGTTAAC GAACGTCTGG AATATCTGAT GGCAATGATG
GAATCGGAAA TCGATCTGCT GCAGGTTGAG AAACGCATTC GCAACCGCGT TAAAAAGCAG
ATGGAGAAAT CCCAGCGTGA GTACTATCTG AACGAGCAAA TGAAAGCTAT TCAGAAAGAA
CTCGGTGAAA TGGACGACGC GCCGGACGAA AACGAAGCCC TGAAGCGCAA AATCGACGCG
GCGAAGATGC CGAAAGAGGC AAAAGAGAAA GCGGAAGCAG AGTTGCAGAA GCTGAAAATG
ATGTCTCCGA TGTCGGCAGA AGCGACCGTA GTGCGTGGTT ATATCGACTG GATGGTACAG
GTGCCGTGGA ATGCGCGTAG CAAGGTCAAA AAAGACCTGC GTCAGGCGCA GGAAATCCTT
GATACCGACC ATTATGGTCT GGAGCGCGTG AAAGATCGAA TCCTTGAGTA TCTTGCGGTT
CAAAGCCGTG TCAACAAAAT CAAGGGACCG ATCCTCTGCC TGGTAGGGCC GCCGGGGGTA
GGTAAAACCT CTCTTGGTCA GTCCATTGCC AAAGCCACCG GGCGTAAATA TGTCCGTATG
GCGCTGGGCG GCGTGCGTGA TGAAGCGGAA ATCCGTGGTC ACCGCCGTAC TTACATCGGT
TCTATGCCGG GTAAACTGAT CCAGAAAATG GCGAAAGTGG GCGTGAAAAA CCCGCTGTTC
CTGCTCGATG AGATCGACAA AATGTCTTCT GACATGCGTG GCGATCCGGC CTCTGCACTG
CTTGAAGTGC TGGATCCAGA GCAGAACGTA GCGTTCAGCG ACCACTACCT GGAAGTGGAT
TACGATCTCA GCGACGTGAT GTTTGTCGCG ACGTCGAACT CCATGAACAT TCCGGCACCG
CTGCTGGATC GTATGGAAGT GATTCGCCTC TCCGGTTATA CCGAAGATGA AAAACTGAAC
ATCGCCAAAC GTCACCTGCT GCCGAAGCAG ATTGAACGTA ATGCACTGAA AAAAGGTGAG
CTGACCGTCG ACGATAGCGC CATTATCGGC ATTATTCGTT ACTACACCCG TGAGGCGGGC
GTGCGTGGTC TGGAGCGTGA AATCTCCAAA CTGTGTCGCA AAGCGGTTAA GCAGTTACTG
CTCGATAAGT CATTAAAACA TATCGAAATT AACGGCGATA ACCTGCATGA CTATCTCGGT
GTTCAGCGTT TCGACTATGG TCGCGCGGAT AACGAAAACC GTGTCGGTCA GGTAACCGGT
CTGGCGTGGA CGGAAGTGGG CGGTGACTTG CTGACCATTG AAACCGCATG TGTTCCGGGT
AAAGGCAAAC TGACCTATAC CGGTTCGCTC GGCGAAGTGA TGCAGGAGTC CATTCAGGCG
GCGTTAACGG TGGTTCGTGC GCGTGCGGAA AAACTGGGGA TCAACCCTGA TTTTTACGAA
AAACGCGACA TCCACGTCCA CGTACCGGAA GGTGCGACGC CGAAAGATGG TCCGAGTGCC
GGTATTGCTA TGTGCACCGC GCTGGTTTCT TGCCTGACCG GTAACCCGGT TCGTGCCGAT
GTGGCAATGA CCGGTGAGAT CACTCTGCGT GGTCAGGTAC TGCCGATCGG TGGTTTGAAA
GAAAAACTCC TGGCAGCGCA TCGCGGCGGG ATTAAAACAG TGTTAATTCC GTTCGAAAAT
AAACGCGATC TGGAAGAGAT TCCTGACAAC GTAATTGCCG ATCTGGACAT TCATCCTGTG
AAGCGCATTG AGGAAGTTCT GACTCTGGCG CTGCAAAATG AACCGTCTGG CATGCAGGTT
GTGACTGCAA AATAG
 
Protein sequence
MNPERSERIE IPVLPLRDVV VYPHMVIPLF VGREKSIRCL EAAMDHDKKI MLVAQKEAST 
DEPGVNDLFT VGTVASILQM LKLPDGTVKV LVEGLQRARI SALSDNGEHF SAKAEYLESP
TIDEREQEVL VRTAISQFEG YIKLNKKIPP EVLTSLNSID DPARLADTIA AHMPLKLADK
QSVLEMSDVN ERLEYLMAMM ESEIDLLQVE KRIRNRVKKQ MEKSQREYYL NEQMKAIQKE
LGEMDDAPDE NEALKRKIDA AKMPKEAKEK AEAELQKLKM MSPMSAEATV VRGYIDWMVQ
VPWNARSKVK KDLRQAQEIL DTDHYGLERV KDRILEYLAV QSRVNKIKGP ILCLVGPPGV
GKTSLGQSIA KATGRKYVRM ALGGVRDEAE IRGHRRTYIG SMPGKLIQKM AKVGVKNPLF
LLDEIDKMSS DMRGDPASAL LEVLDPEQNV AFSDHYLEVD YDLSDVMFVA TSNSMNIPAP
LLDRMEVIRL SGYTEDEKLN IAKRHLLPKQ IERNALKKGE LTVDDSAIIG IIRYYTREAG
VRGLEREISK LCRKAVKQLL LDKSLKHIEI NGDNLHDYLG VQRFDYGRAD NENRVGQVTG
LAWTEVGGDL LTIETACVPG KGKLTYTGSL GEVMQESIQA ALTVVRARAE KLGINPDFYE
KRDIHVHVPE GATPKDGPSA GIAMCTALVS CLTGNPVRAD VAMTGEITLR GQVLPIGGLK
EKLLAAHRGG IKTVLIPFEN KRDLEEIPDN VIADLDIHPV KRIEEVLTLA LQNEPSGMQV
VTAK