Gene EcSMS35_2536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2536 
Symbol 
ID6143891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2595317 
End bp2596354 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content57% 
IMG OID641617408 
Productexoaminopeptidase 
Protein accessionYP_001744579 
Protein GI170679878 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTAT CGCTATTAAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG 
GAAGTACGGC AGATCCTGCT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAC
GGCCTGGGAT CGGTGCTGAT CCGCCTCAAT GAATCGACAG GTCCGAAGGT GATGATCTGT
GCACATATGG ATGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG GGCGATTGAT
GTGCTGCCAG TTGGCAACGT GCGCATGGCT GCCCGACAGC TTCAGCCGGT GCGCATCACC
ACCCGTGAAG AGTGCAAAAT ACCTGGCCTG CTTGACGGCG ACCGGCAGGG GAATGACGTC
AGCACCATGC GCGTGGACAT TGGCGCGCGC TCGTGTGACG AAGTGATGCA GGCGGGAATT
CGTCCAGGCG ATCGCGTCAC CTTCGATACC ACTTTTCAGG TTCTCCCTCA CCAGCGGGTG
ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTACCTGC TGGTGACGTT ACTACGTGAG
CTGCACGACG CCGAACTGCC TGCGGAAGTA TGGCTGGTGG CAAGTTCCAG CGAAGAGGTG
GGATTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGATGTCGC CATTGTGCTT
GATACCGCCT GCTGGGCGAA AAACTTTGAT TATGGCGCGG CTAACCATCG TCAGATTGGT
AACGGGCCGA TGCTGGTGTT AAGCGATAAA TCGCTGATTG CGCCGCCAAA ACTTACCGCG
TGGATTGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTAC AGGCAGATAT GTTCAGCAAC
GGCGGCACGG ACGGTGGAGC GGTGCATTTA ACCGGCACTG GCGTACCCAC AGTGGTGATG
GGGCCAGCCA CCCGCCACGG ACATTGCGCG GCATCGATTG CCGATTGTCG CGACATTTTG
CAGATGCAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CGCGTGAGAC GGTTGTTCAA
CTGACGGATT TCAGATGA
 
Protein sequence
MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC 
AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV
STMRVDIGAR SCDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE
LHDAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFD YGAANHRQIG
NGPMLVLSDK SLIAPPKLTA WIETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM
GPATRHGHCA ASIADCRDIL QMQQLLSALI QRLTRETVVQ LTDFR