Gene EcSMS35_2537 details


Gene Information

Locus tag: EcSMS35_2537
Symbol: ypdF
ID: 6144040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2596354
End bp: 2597439
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 58%
IMG OID: 641617409
Product: aminopeptidase
Protein accession: YP_001744580
Protein GI: 170683336
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
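
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so explicitly, but the stated 1086 bp length agrees with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2596354
end_bp = 2597439

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1086
print(protein_length)  # 361
```

Both values match the "Gene Length" and "Protein Length" fields of the record.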


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTAC TCGCTTCGCT GCGCGACTGG CTTAAGGCGC AACAACTGGA TGCGGTGCTT 
CTCTCCTCAC GGCAGAACAA ACAGCCGCAT CTGGGGATCT CCACCGGATC AGGCTATGTG
CTGATTAGCC GTGAAAGTGC GCACATTCTG GTGGACTCGC GCTATTACGC GGATGTAGAA
GCCCGCACGC AAGGCTACCA GCTGCATTTG CTTGACGCGA CGCACACGCT TGCAACCATC
GCCAGGCAAA TCATTGCCGA TGAGCAGTTA AAAACGCTCG GTTTTGAAGG CCAGCAGGTG
AGTTGGGAAA CCGCGCACCG CTGGCAGTCT GAACTCAATG CGAAACTGGT AAGCGCCACG
CCGGATGTGC TGCGGCAAAT CAAAACGCCA GAGGAGGTGG AGAAAATCCG CCTCGCCTGT
GGGATTGCCG ATCGCGGTGC AGAGCATATT CGCCGCTTTA TTCAGGCGGG AATGAGCGAG
CGCGAGATAG CCGCTGAACT GGAGTGGTTT ATGCGCCAGC AGGGCGCAGA AAAAGCCTCT
TTCGATACCA TTGTCGCCAG CGGCTGGCGT GGGGCGCTGC CGCACGGCAA AGCCAGCGAC
AAGATTGTTG CAGCGGGCGA GTTTGTCACT CTTGATTTTG GTGCGCTGTA TCAGGGCTAC
TGCTCTGATA TGACGCGCAC CTTGCTGGTG AATGGCGAAG GGGTGAGCGC CGAATCTCAC
CCGCTGTTTG ACGTCTATCA GATTGTTTTG CAGGCACAGC TCGCGGCAAT CTCCGCGATT
CGCCCCGGCG TGCGCTGCCA GCAGGTTGAC GACGCCGCGC GCCGGGTGAT TACCGAGGCT
GGATTTGGCG ACTATTTCGG TCATAACACC GGTCATGCTA TCGGCATTGA AGTCCATGAA
GGTCCGCGTT TTTCACCGCG GGACACCACG ACGCTACAGC CAGGCATGTT ACTGACCGTG
GAGCCGGGGA TTTATTTGCC AGGGCAAGGG GGCGTGCGCA TCGAAGATGT TGTGCTGGTC
ACCCCGCAAG GCGCAGAAGT GCTCTACGCC ATGCCGAAAA CAGTGTTGCT CACGGGAGAG
GCATAA
 
Protein sequence
MTLLASLRDW LKAQQLDAVL LSSRQNKQPH LGISTGSGYV LISRESAHIL VDSRYYADVE 
ARTQGYQLHL LDATHTLATI ARQIIADEQL KTLGFEGQQV SWETAHRWQS ELNAKLVSAT
PDVLRQIKTP EEVEKIRLAC GIADRGAEHI RRFIQAGMSE REIAAELEWF MRQQGAEKAS
FDTIVASGWR GALPHGKASD KIVAAGEFVT LDFGALYQGY CSDMTRTLLV NGEGVSAESH
PLFDVYQIVL QAQLAAISAI RPGVRCQQVD DAARRVITEA GFGDYFGHNT GHAIGIEVHE
GPRFSPRDTT TLQPGMLLTV EPGIYLPGQG GVRIEDVVLV TPQGAEVLYA MPKTVLLTGE
A
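
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool that generated this record: `translate` is a hypothetical helper, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). It strips the spaces and line breaks used in the sequence layout and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACATTAC TCGCTTCGCT GCGCGACTGG"))  # MTLLASLRDW
```

Passing the full gene sequence to the same function should yield the complete protein sequence listed above.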