Gene EcE24377A_1614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1614 
Symbol 
ID5588801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1606713 
End bp1608716 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content51% 
IMG OID640925302 
ProductU32 family peptidase 
Protein accessionYP_001462707 
Protein GI157157126 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTAAAA TAGCCGCCAT TTTTCAGCTA CTGGATAAGA ATGTGACCGT ATCTTCTCAT 
CGACTTGAAC TGTTAAGCCC GGCACGCGAT GCCGCCATTG CCCGCGAAGC TATTTTGCAC
GGTGCCGATG CTGTTTATAT CGGCGGCCCT GGTTTTGGTG CCCGTCATAA TGCCAGTAAT
AGCCTGAAAG ATATTGCCGA GCTGGTGCCG TTTGCCCATC GTTATGGTGC AAAAATTTTC
GTCACGCTTA ACACCATTTT GCATGATGAT GAGCTGGAAC CCGCGCAACG GCTGATTACT
GACCTCTACC AGACCGGTGT CGATGCGCTG ATTGTTCAGG ATATGGGGAT TCTGGAACTT
GATATTCCGC CGATTGAACT GCACGCCAGT ACGCAGTGCG ACATTCGTAC AGTTGAAAAA
GCGAAGTTCC TCTCTGATGT TGGCTTCACG CAGATTGTGC TGGCGCGAGA GCTGAATCTT
GATCAGATCC GCGCGATTCA CCAGGCTACG GACGCGACCA TTGAATTCTT TATTCATGGG
GCACTGTGCG TGGCCTATTC GGGTCAGTGC TACATTTCTC ATGCGCAAAC AGGGCGTAGC
GCCAACCGTG GCGATTGCTC GCAGGCGTGC CGTTTGCCAT ACACATTGAA AGACGATCAG
GGGCGGGTGG TTTCCTATGA AAAACATCTG CTGTCGATGA AAGATAACGA TCAGACTGCC
AACCTCGGCG CGCTGATTGA TGCTGGTGTA CGCTCCTTCA AGATTGAAGG GCGTTACAAA
GATATGAGCT ACGTGAAGAA TATCACCGCC CATTATCGCC AGATGCTTGA TGCCATTATT
GAAGAACGTG GCGATCTGGC GCGCGCTTCA TCAGGTCGTA CTGAACATTT CTTTGTTCCA
TCGACGGAAA AGACTTTCCA CCGTGGTAGC ACAGATTATT TTGTGAATGC CCGTAAAGGC
GATATTGGCG CGTTCGATTC GCCGAAATTT ATCGGCCTGC CGGTAGGCGA AGTATTGAAA
GTGGCGAAAG ATCATCTCGA TGTTGCCGTT ACCGAGCCAC TGGCAAATGG CGATGGCCTG
AACGTGTTGA TTAAACGTGA AGTCGTCGGT TTTCGTGCCA ATACGGTCGA GAAAACCGGA
GAAAATCAGT ACCGCGTCTG GCCCAATGAA ATGCCAGCAG ATTTGCACAA AATTCGTCCA
CATCACCCAC TAAACCGTAA TCTTGATCAT AACTGGCAGC AGGCACTGAC AAAAACCTCC
AGCGAACGTC GGGTGGCGGT AGACATTGAA CTGGGCGGCT GGCAGGAACA ACTGATTCTG
ACCCTCACCA GTGAAGAGGG TGTCAGCATC ACGCATACGC TGGACGGGCA GTTCGACGAA
GCCAATAACG CCGAAAAAGC AATGAACAAT CTGAAGGATG GTCTGGCAAA ACTGGGGCAA
ACCCTCTATT ACGCCCGCGA TGTGCAAATT AATTTGCCGG GGGCGCTGTT TGTACCAAAC
AGTCTGTTAA ACCAGTTCCG CCGTGAAGCT GCTGACATGC TGGATGCTGC GCGTCTTGCC
AGTTACCAGC GCGGCAGCCG TAAACCGGTT GCTGATCCTG CGCCGGTTTA TCCGCAAACG
CATCTGAGTT TCCTCGCGAA CGTATACAAC CAGAAAGCGC GTGAATTTTA TCATCGCTAT
GGTGTGCAGC TGATTGACGC GGCGTATGAA GCACATGAAG AGAAGGGCGA AGTCCCGGTG
ATGATCACCA AGCATTGTCT GCGCTTTGCC TTTAATCTGT GCCCGAAACA GGCGAAAGGC
AATATCAAAA GCTGGAAGGC GACGCCAATG CAACTGGTTA ACGGCGATGA AGTATTAACG
CTAAAGTTTG ATTGCCGCCC ATGCGAGATG CACGTCATTG GCAAAATCAA AAATCACATA
CTGAAAATGC CGTTACCGGG AAGCGTAGTG GCATCCGTAA GTCCGGATGA GCTGCTGAAA
ACATTGCCTA AGCGAAAAGG GTAA
 
Protein sequence
MAKIAAIFQL LDKNVTVSSH RLELLSPARD AAIAREAILH GADAVYIGGP GFGARHNASN 
SLKDIAELVP FAHRYGAKIF VTLNTILHDD ELEPAQRLIT DLYQTGVDAL IVQDMGILEL
DIPPIELHAS TQCDIRTVEK AKFLSDVGFT QIVLARELNL DQIRAIHQAT DATIEFFIHG
ALCVAYSGQC YISHAQTGRS ANRGDCSQAC RLPYTLKDDQ GRVVSYEKHL LSMKDNDQTA
NLGALIDAGV RSFKIEGRYK DMSYVKNITA HYRQMLDAII EERGDLARAS SGRTEHFFVP
STEKTFHRGS TDYFVNARKG DIGAFDSPKF IGLPVGEVLK VAKDHLDVAV TEPLANGDGL
NVLIKREVVG FRANTVEKTG ENQYRVWPNE MPADLHKIRP HHPLNRNLDH NWQQALTKTS
SERRVAVDIE LGGWQEQLIL TLTSEEGVSI THTLDGQFDE ANNAEKAMNN LKDGLAKLGQ
TLYYARDVQI NLPGALFVPN SLLNQFRREA ADMLDAARLA SYQRGSRKPV ADPAPVYPQT
HLSFLANVYN QKAREFYHRY GVQLIDAAYE AHEEKGEVPV MITKHCLRFA FNLCPKQAKG
NIKSWKATPM QLVNGDEVLT LKFDCRPCEM HVIGKIKNHI LKMPLPGSVV ASVSPDELLK
TLPKRKG