Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2674 |
Symbol | |
ID | 5590350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2668016 |
End bp | 2669053 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640926330 |
Product | exoaminopeptidase |
Protein accession | YP_001463723 |
Protein GI | 157157890 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTAT CGCTATTAAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG GAAGTGCGGC AGATCCTACT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAT GGTCTGGGAT CGGTGCTGAT CCGCCTCAAT GAATCGACAG GTCCGAAGGT GATGATCTGC GCCCATATGG ACGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG CGCTATTGAC GTGCTGCCGG TTGGCAACGT GCGCATGGCT GCCCGCCAGC TGCAGCCGGT GCGCATCACC ACCCGTGAAG AGTGCAAAAT TCCAGGCCTG CTTGACGGCG ACCGGCAGGG GAATGACGTC AGCGCCATGC GCGTGGACAT TGGTGCGCGC TCCTATGACG AAGTGATGCA GGCGGGAATT CGTCCAGGCG ATCGCGTCAC CTTTGATACA ACTTTTCAGG TTCTCCCTCA CCAGCGGGTG ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTATCTGC TGGTGACGTT ACTACGTGAG CTGCACGACG CCGAACTGCC TGCGGAGGTG TGGCTGGTGG CAAGTTCCAG CGAAGAGGTG GGATTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGATGTCGC CATTGTGCTT GATACCGCCT GCTGGGCGAA AAACTTTGAT TATAGCGCGG CTAACCATCG CCAGATTGGT AACGGGCCGA TGCTGGTGTT AAGCGACAAG TCGCTGATTG CGCCGCCAAA ACTTACCGCC TGGATCGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTGC AGGCGGATAT GTTCAGCAAC GGCGGCACGG ACGGCGGAGC GGTGCATTTA ACCGGCACCG GCGTGCCCAC GGTGGTGATG GGGCCAGCGA CGCGCCACGG ACATTGTGCG GCATCGATTG CCGATTGCCG CGACATTTTG CAGATGCAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CCCGTGAGAC GGTTGTTCAA CTGACGGATT TCAGATGA
|
Protein sequence | MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV SAMRVDIGAR SYDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE LHDAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFD YSAANHRQIG NGPMLVLSDK SLIAPPKLTA WIETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM GPATRHGHCA ASIADCRDIL QMQQLLSALI QRLTRETVVQ LTDFR
|
| |