Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1285 |
Symbol | |
ID | 6068696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1405538 |
End bp | 1406575 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641600706 |
Product | exoaminopeptidase |
Protein accession | YP_001724278 |
Protein GI | 170019324 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.836873 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTAT CGCTATTAAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG GAAGTGCGGC AGATCCTGCT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAT GGTCTGGGAT CGGTGCTGAT CCGCCTGAAT GAATCGACAG GTCCGAAGGT GATGATCTGT GCGCATATGG ACGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG GGCGATTGAT GTGCTGCCGG TTGGCAACGT ACGCATGGCT GCCCGCCAGC TGCAGCCGGT GCGCATCACC ACCCGTGAAG AGTGCAAAAT TCCAGGCCTG CTTGACGGCG ACCGGCAGGG GAATGACGTC AGCGCCATGC GCGTGGACAT TGGTGCGCGC TCCTATGACG AAGTGATGCA GGCGGGAATT CGTCCCGGCG ATCGCGTCAC GTTTGATACC ACTTTTCAGG TTCTCCCTCA CCAGCGAGTG ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTATCTGC TGGTGACGTT ACTGCGCGAA CTGCACGACG CCGAACTACC TGCGGAAGTG TGGCTGGTGG CAAGTTCCAG CGAAGAGGTG GGATTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGACGTCGC CATTGTGCTT GATACCGCCT GCTGGGCGAA AAACTTTGAT TATGGCGCGG CTAACCATCG CCAGATTGGT AACGGGCCGA TGCTGGTGTT AAGCGACAAG TCGCTGATTG CGCCGCCAAA ACTTACCGCC TGGGTCGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTGC AGGCAGATAT GTTCAGCAAC GGCGGCACGG ACGGCGGGGC GGTGCACTTA ACCGGCACCG GCGTGCCCAC AGTGGTGATG GGGCCAGCAA CCCGCCATGG ACATTGCGCC GCATCGATTG CCGATTGCCG CGACATTTTG CAGATGCAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CGCGTGAGAC GGTTGTTCAA CTGACGGATT TCAGATGA
|
Protein sequence | MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV SAMRVDIGAR SYDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE LHDAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFD YGAANHRQIG NGPMLVLSDK SLIAPPKLTA WVETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM GPATRHGHCA ASIADCRDIL QMQQLLSALI QRLTRETVVQ LTDFR
|
| |