Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2536 |
Symbol | |
ID | 6143891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2595317 |
End bp | 2596354 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641617408 |
Product | exoaminopeptidase |
Protein accession | YP_001744579 |
Protein GI | 170679878 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTAT CGCTATTAAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG GAAGTACGGC AGATCCTGCT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAC GGCCTGGGAT CGGTGCTGAT CCGCCTCAAT GAATCGACAG GTCCGAAGGT GATGATCTGT GCACATATGG ATGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG GGCGATTGAT GTGCTGCCAG TTGGCAACGT GCGCATGGCT GCCCGACAGC TTCAGCCGGT GCGCATCACC ACCCGTGAAG AGTGCAAAAT ACCTGGCCTG CTTGACGGCG ACCGGCAGGG GAATGACGTC AGCACCATGC GCGTGGACAT TGGCGCGCGC TCGTGTGACG AAGTGATGCA GGCGGGAATT CGTCCAGGCG ATCGCGTCAC CTTCGATACC ACTTTTCAGG TTCTCCCTCA CCAGCGGGTG ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTACCTGC TGGTGACGTT ACTACGTGAG CTGCACGACG CCGAACTGCC TGCGGAAGTA TGGCTGGTGG CAAGTTCCAG CGAAGAGGTG GGATTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGATGTCGC CATTGTGCTT GATACCGCCT GCTGGGCGAA AAACTTTGAT TATGGCGCGG CTAACCATCG TCAGATTGGT AACGGGCCGA TGCTGGTGTT AAGCGATAAA TCGCTGATTG CGCCGCCAAA ACTTACCGCG TGGATTGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTAC AGGCAGATAT GTTCAGCAAC GGCGGCACGG ACGGTGGAGC GGTGCATTTA ACCGGCACTG GCGTACCCAC AGTGGTGATG GGGCCAGCCA CCCGCCACGG ACATTGCGCG GCATCGATTG CCGATTGTCG CGACATTTTG CAGATGCAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CGCGTGAGAC GGTTGTTCAA CTGACGGATT TCAGATGA
|
Protein sequence | MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV STMRVDIGAR SCDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE LHDAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFD YGAANHRQIG NGPMLVLSDK SLIAPPKLTA WIETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM GPATRHGHCA ASIADCRDIL QMQQLLSALI QRLTRETVVQ LTDFR
|
| |