Gene EcSMS35_1115 details


Gene Information

Locus tag: EcSMS35_1115
Symbol:
ID: 6143756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1129875
End bp: 1131131
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 46%
IMG OID: 641615995
Product: peptidase T
Protein accession: YP_001743187
Protein GI: 170683987
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01882] peptidase T
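
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1129875
end_bp = 1131131

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1257
print(protein_length)  # 418
```

Under these assumptions the computed values agree with the reported Gene Length (1257 bp) and Protein Length (418 aa).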


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATTG TCGAACGCTT TATCAATTAT ACAAAAATTA ATACCACTAC CTCACGAGAA 
AACGGCGCAA AAGGGATCAT GCCTTCATCT CCTGGTCAGA TGAAACTTGC GAAATTGTTG
GTGAGTGAAC TTGAAGCATT GGGTATGGAG GATATTATTC TCAGAGAGAA TGCTATCGTA
ACAGCAACAC TCCCTGCAAA TACTGACGAA ACAATCCCCG TCGTTTCCTT TTTTGGACAT
CTGGACACCA GTGCCGAACA GACAGCCGAT ACAAAAGCGC AACGATTACC TTATAATGGG
GGCGATCTCT GTCTGAACCC TGAACTGAAT ATTTATTTGC GTGAGAGTGA ATTCCCGGAG
CTCAAAAATT ATATTGGCGA TGACCTTATA GTGACAGATG GTACCAGCCT TCTGGGGGCC
GATGATAAAG CGGCGCTGGC GGCCATAATG AACGCACTTC AATTTCTGAT TTCTCATCCT
GAAATCAGGC ATGGAGAAGT TAAAGTTGGG TTTGTGCCAG ATGAAGAGCA GGGGTTACGG
GGGGCCAAAG CCTTCGATGT TTCAGAGTTT GGCGCAGATT TTGGCTACAC TCTGGATTGC
TGCGGTATTG GAGAATTTGT TTACGAAAAC TGGAATGCGG GTGATGCAGA AATTATCTTT
ACCGGCCAGT CTGCACACCC TATGTCAGCG AAAGGCAAGC TTAAGAACTC TCTTTTGATG
GCTCACAAAT TCATTTCGAT GTTGCCAGGA GGAGAAGCGC CTGAATATAC AGAAGGACGC
GAAGGCTATT ATTGGGTGAA ACAGTTACAG GGAAACAGTG CCAGAACTGT ACTAAAACTG
GATATACGAG ATTTCAGCGA GGAAGGATAT CACGCCCGGA AGACATTTGT ACGCCAGCTT
GCAGAAAGTG CTTGTGCATT ATGGGGAGAA GGAAGCGTGA TCTGCCAACT GAGCGATCGG
TACGCCAATG TCTTTAACAG TCTGCAGGGA GAGGGGCATT ATCCCATCGA CATTGCCCTG
CGAGCTTACC AACGATGTGG TATTACTCCG ACACCGGTGG CGATGCGTGG AGGATATGAT
GGCGCTGTTC TTTCACAAAA AGGATTACCC TGCCCGAACA TTTTTACCGG CGCACATAAT
TTCCATTCTA TCTATGAATA TCTTCCAGTC CGTTCACTTC GGGCGGCAAG TGATGTGGTT
ATTGCCATCA TTCAGGAGAC ATTCAATGGG TTCACCACAG GGAATCGTGA GTCATGA
 
Protein sequence
MDIVERFINY TKINTTTSRE NGAKGIMPSS PGQMKLAKLL VSELEALGME DIILRENAIV 
TATLPANTDE TIPVVSFFGH LDTSAEQTAD TKAQRLPYNG GDLCLNPELN IYLRESEFPE
LKNYIGDDLI VTDGTSLLGA DDKAALAAIM NALQFLISHP EIRHGEVKVG FVPDEEQGLR
GAKAFDVSEF GADFGYTLDC CGIGEFVYEN WNAGDAEIIF TGQSAHPMSA KGKLKNSLLM
AHKFISMLPG GEAPEYTEGR EGYYWVKQLQ GNSARTVLKL DIRDFSEEGY HARKTFVRQL
AESACALWGE GSVICQLSDR YANVFNSLQG EGHYPIDIAL RAYQRCGITP TPVAMRGGYD
GAVLSQKGLP CPNIFTGAHN FHSIYEYLPV RSLRAASDVV IAIIQETFNG FTTGNRES
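
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of NCBI translation table 11, which match those of the standard genetic code (table 11 differs from it only in the permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of this record or of any library, and only the first and last stretches of the gene are translated here.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
first_bases = "ATGGATATTG TCGAACGCTT TATCAATTAT"
last_bases = "TTCACCACAG GGAATCGTGA GTCATGA"

print(translate(first_bases))  # MDIVERFINY
print(translate(last_bases))   # FTTGNRES
```

The two outputs match the first block and the last block of the protein sequence above; the terminal TGA is read as a stop codon and produces no residue.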