Gene ECH74115_1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1568 
SymbolpepT 
ID6972211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1532992 
End bp1534260 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content49% 
IMG OID643385533 
Productpeptidase T 
Protein accessionYP_002270027 
Protein GI209398016 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000893895 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTTCTTA ATAATGTTGT CACAAAAAGT GAGGGTGACT ACATGGATAA ACTACTTGAG 
CGATTTTTGA ACTACGTGTC TCTGGATACC CAATCAAAAG CAGGGGTGAG ACAGGTTCCC
AGCACGGAAG GCCAATGGAA GTTATTGCAT CTGCTGAAAG AGCAGCTCGA AGAGATGGGG
CTTATCAATG TGACCTTAAG TGAGAAGGGC ACTTTGATGG CGACGTTACC GGCTAACGTC
CCTGGCGATA TCCCGGCGAT TGGCTTTATT TCTCATGTGG ATACCTCACC GGATTGCAGC
GGCAAAAATG TGAATCCGCA AATTGTTGAA AACTATCGCG GTGGCGATAT TGCGCTGGGT
ATCGGCGATG AAGTTTTATC ACCGGTTATG TTCCCGGTGC TGCATCAGCT ACTGGGTCAG
ACGCTGATTA CCACCGATGG TAAAACCTTG TTAGGTGCCG ATGACAAAGC AGGTATTGCA
GAAATCATGA CCGCGCTGGC GGTATTGCAA CAGAAAAACA TTCCGCATGG TGATATTCGC
GTCGCCTTTA CCCCGGATGA AGAAGTGGGC AAAGGGGCGA AACATTTTGA TGTTGATGCC
TTCGATGCCC GCTGGGCTTA CACTGTTGAT GGTGGTGGCG TAGGCGAACT GGAGTTTGAA
AACTTCAACG CCGCATCGGT CAATATCAAA ATTGTCGGTA ACAATGTTCA TCCGGGCACG
GCGAAAGGAG TGATGGTAAA TGCGCTGTCG CTGGCGGCAC GTATTCATGC GGAAGTTCCG
GCGGATGAAA GCCCGGAAAT GACAGAAGGC TATGAAGGTT TCTATCACCT GGCGAGCATG
AAAGGCACCG TTGAACGGGC CGATATGCAC TACATCATCC GTGATTTCGA CCGTAAACAG
TTTGAAGCGC GTAAACGTAA AATGATGGAG ATCGCCAAAA AAGTGGGCAA AGGGTTACAT
CCTGATTGCT ACATTGAATT GGTGATTGAA GACAGTTACT ACAATATGCG CGAGAAAGTG
GTTGAGCATC CGCATATTCT CGATATCGCC CAGCAGGCGA TGCGTGACTG CGATATTGAA
CCGGAACTGA AACCGATCCG CGGCGGTACC GACGGCGCGC AGTTGTCGTT TATGGGATTA
CCGTGCCCGA ACCTGTTCAC TGGCGGTTAC AACTATCATG GTAAGCATGA GTTTGTGACT
CTGGAAGGTA TGGAAAAAGC GGTGCAGGTG ATCGTCCGTA TTGCCGAGTT AACGGCGCAA
CGGAAGTAA
 
Protein sequence
MLLNNVVTKS EGDYMDKLLE RFLNYVSLDT QSKAGVRQVP STEGQWKLLH LLKEQLEEMG 
LINVTLSEKG TLMATLPANV PGDIPAIGFI SHVDTSPDCS GKNVNPQIVE NYRGGDIALG
IGDEVLSPVM FPVLHQLLGQ TLITTDGKTL LGADDKAGIA EIMTALAVLQ QKNIPHGDIR
VAFTPDEEVG KGAKHFDVDA FDARWAYTVD GGGVGELEFE NFNAASVNIK IVGNNVHPGT
AKGVMVNALS LAARIHAEVP ADESPEMTEG YEGFYHLASM KGTVERADMH YIIRDFDRKQ
FEARKRKMME IAKKVGKGLH PDCYIELVIE DSYYNMREKV VEHPHILDIA QQAMRDCDIE
PELKPIRGGT DGAQLSFMGL PCPNLFTGGY NYHGKHEFVT LEGMEKAVQV IVRIAELTAQ
RK