Gene EcolC_2224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2224 
Symbol 
ID6064980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2442599 
End bp2444602 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content51% 
IMG OID641601630 
Productpeptidase U32 
Protein accessionYP_001725189 
Protein GI170020235 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.787396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTAAAA TAGCCGCCAT TTTTCAGCTA CTGGATAAGA ATGTGACCGT ATCTTCTCAT 
CGACTTGAAC TGTTAAGCCC GGCACGCGAT GCCGCCATTG CCCGCGAAGC TATTTTGCAC
GGTGCCGATG CTGTTTATAT CGGCGGCCCT GGTTTTGGTG CCCGTCATAA TGCCAGTAAT
AGCTTGAAAG ATATTGCCGA GCTGGTGCCG TTTGCCCATC GTTATGGTGC AAAAATTTTC
GTCACGCTTA ACACCATTTT GCATGATGAT GAGCTGGAAC CCGCGCAACG GCTGATTACT
GACCTCTACC AGACCGGTGT CGATGCGCTG ATTGTTCAGG ATATGGGGAT TCTGGAACTT
GATATTCCGC CGATTGAACT GCACGCCAGT ACGCAGTGCG ACATTCGTAC AGTTGAAAAA
GCGAAGTTCC TCTCTGATGT TGGCTTCACG CAGATTGTGC TGGCGCGAGA GCTGAATCTT
GATCAGATCC GCGCGATTCA CCAGGCTACG GACGCGACCA TTGAATTCTT TATTCATGGG
GCACTGTGCG TGGCCTATTC GGGTCAGTGC TACATTTCTC ATGCGCAAAC AGGGCGTAGC
GCCAACCGTG GCGATTGCTC GCAGGCGTGC CGTTTGCCAT ACACATTGAA AGACGATCAG
GGGCGGGTGG TTTCCTATGA AAAACATCTG CTGTCGATGA AAGATAACGA TCAGACTGCC
AACCTCGGCG CGCTGATTGA TGCTGGTGTA CGCTCCTTCA AGATTGAAGG GCGTTACAAA
GATATGAGCT ACGTGAAGAA TATCACCGCC CATTATCGCC AGATGCTTGA TGCCATTATT
GAAGAACGTG GCGATCTGGC GCGCGCTTCA TCAGGTCGTA CTGAACATTT CTTTGTTCCA
TCGACGGAAA AGACTTTCCA CCGTGGTAGC ACAGATTATT TTGTGAATGC CCGTAAAGGC
GATATTGGCG CGTTCGATTC GCCGAAATTT ATCGGCCTGC CGGTAGGCGA AGTATTGAAA
GTGGCGAAAG ATCATCTCGA TGTTGCCGTT ACCGAGCCAC TGGCAAATGG CGATGGCCTG
AACGTGTTGA TTAAACGTGA AGTCGTCGGT TTTCGTGCCA ATACGGTCGA GAAAACCGGA
GAAAATCAGT ACCGCGTCTG GCCCAATGAA ATGCCAGCAG ATTTGCACAA AATTCGTCCA
CATCACCCAC TAAACCGTAA TCTTGATCAT AACTGGCAGC AGGCACTGAC AAAAACCTCC
AGCGAACGTC GGGTGGCGGT AGACATTGAA CTGGGCGGCT GGCAGGAACA ACTGATTCTG
ACCCTCACCA GTGAAGAGGG TGTCAGCATC ACGCATACGC TGGACGGGCA GTTCGACGAA
GCCAATAACG CCGAAAAAGC AATGAACAAT CTGAAGGATG GTCTGGCAAA ACTGGGGCAA
ACCCTCTATT ACGCCCGCGA TGTGCAAATT AATTTGCCGG GGGCGCTGTT TGTACCAAAC
AGTCTGTTAA ACCAGTTCCG CCGTGAAGCT GCTGACATGC TGGATGCTGC GCGTCTTGCC
AGTTACCAGC GCGGCAGCCG TAAACCGGTT GCTGATCCTG CGCCGGTTTA TCCGCAAACG
CATCTGAGTT TCCTCGCGAA CGTATACAAC CAGAAAGCGC GTGAATTTTA TCATCGCTAT
GGTGTGCAGC TGATTGACGC GGCGTATGAA GCACATGAAG AGAAGGGCGA AGTCCCGGTG
ATGATCACCA AGCATTGTCT GCGCTTTGCC TTTAATCTGT GCCCGAAACA GGCGAAAGGC
AATATCAAAA GCTGGAAGGC GACGCCAATG CAACTGGTTA ACGGCGATGA AGTATTAACG
CTAAAGTTTG ATTGCCGCCC ATGCGAGATG CACGTCATTG GCAAAATCAA AAATCACATA
CTGAAAATGC CGTTACCGGG AAGCGTAGTG GCATCCGTAA GTCCGGATGA GCTGCTGAAA
ACATTGCCTA AGCGAAAAGG GTAA
 
Protein sequence
MAKIAAIFQL LDKNVTVSSH RLELLSPARD AAIAREAILH GADAVYIGGP GFGARHNASN 
SLKDIAELVP FAHRYGAKIF VTLNTILHDD ELEPAQRLIT DLYQTGVDAL IVQDMGILEL
DIPPIELHAS TQCDIRTVEK AKFLSDVGFT QIVLARELNL DQIRAIHQAT DATIEFFIHG
ALCVAYSGQC YISHAQTGRS ANRGDCSQAC RLPYTLKDDQ GRVVSYEKHL LSMKDNDQTA
NLGALIDAGV RSFKIEGRYK DMSYVKNITA HYRQMLDAII EERGDLARAS SGRTEHFFVP
STEKTFHRGS TDYFVNARKG DIGAFDSPKF IGLPVGEVLK VAKDHLDVAV TEPLANGDGL
NVLIKREVVG FRANTVEKTG ENQYRVWPNE MPADLHKIRP HHPLNRNLDH NWQQALTKTS
SERRVAVDIE LGGWQEQLIL TLTSEEGVSI THTLDGQFDE ANNAEKAMNN LKDGLAKLGQ
TLYYARDVQI NLPGALFVPN SLLNQFRREA ADMLDAARLA SYQRGSRKPV ADPAPVYPQT
HLSFLANVYN QKAREFYHRY GVQLIDAAYE AHEEKGEVPV MITKHCLRFA FNLCPKQAKG
NIKSWKATPM QLVNGDEVLT LKFDCRPCEM HVIGKIKNHI LKMPLPGSVV ASVSPDELLK
TLPKRKG