Gene EcolC_1285 details


Gene Information

Locus tag: EcolC_1285
Symbol:
ID: 6068696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1405538
End bp: 1406575
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 58%
IMG OID: 641600706
Product: exoaminopeptidase
Protein accession: YP_001724278
Protein GI: 170019324
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
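
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1405538, 1406575)
print(length)                     # 1038
print(protein_length_aa(length))  # 345
```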


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.836873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTAT CGCTATTAAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG 
GAAGTGCGGC AGATCCTGCT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAT
GGTCTGGGAT CGGTGCTGAT CCGCCTGAAT GAATCGACAG GTCCGAAGGT GATGATCTGT
GCGCATATGG ACGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG GGCGATTGAT
GTGCTGCCGG TTGGCAACGT ACGCATGGCT GCCCGCCAGC TGCAGCCGGT GCGCATCACC
ACCCGTGAAG AGTGCAAAAT TCCAGGCCTG CTTGACGGCG ACCGGCAGGG GAATGACGTC
AGCGCCATGC GCGTGGACAT TGGTGCGCGC TCCTATGACG AAGTGATGCA GGCGGGAATT
CGTCCCGGCG ATCGCGTCAC GTTTGATACC ACTTTTCAGG TTCTCCCTCA CCAGCGAGTG
ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTATCTGC TGGTGACGTT ACTGCGCGAA
CTGCACGACG CCGAACTACC TGCGGAAGTG TGGCTGGTGG CAAGTTCCAG CGAAGAGGTG
GGATTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGACGTCGC CATTGTGCTT
GATACCGCCT GCTGGGCGAA AAACTTTGAT TATGGCGCGG CTAACCATCG CCAGATTGGT
AACGGGCCGA TGCTGGTGTT AAGCGACAAG TCGCTGATTG CGCCGCCAAA ACTTACCGCC
TGGGTCGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTGC AGGCAGATAT GTTCAGCAAC
GGCGGCACGG ACGGCGGGGC GGTGCACTTA ACCGGCACCG GCGTGCCCAC AGTGGTGATG
GGGCCAGCAA CCCGCCATGG ACATTGCGCC GCATCGATTG CCGATTGCCG CGACATTTTG
CAGATGCAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CGCGTGAGAC GGTTGTTCAA
CTGACGGATT TCAGATGA
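
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined into one string before lengths or base composition can be checked against the fields above (1038 bp, 58% GC). The following is a minimal sketch of that cleanup and of a GC fraction calculation. The helper names are hypothetical, and only the first sequence line is used in the demonstration.

```python
def clean_sequence(blocked):
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed on this page.
first_line = "ATGGATTTAT CGCTATTAAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this line only
```

Passing the full 18-line sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the reported GC content.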
 
Protein sequence
MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC 
AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV
SAMRVDIGAR SYDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE
LHDAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFD YGAANHRQIG
NGPMLVLSDK SLIAPPKLTA WVETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM
GPATRHGHCA ASIADCRDIL QMQQLLSALI QRLTRETVVQ LTDFR
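
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a hedged sketch, the code below builds the codon-to-amino-acid mapping in the usual TCAG ordering; table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which is not modeled here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the last line of the gene sequence on this page.
print(translate("ATGGATTTAT CGCTATTAAA AGCGTTGAGC"))  # MDLSLLKALS
print(translate("CTGACGGATT TCAGATGA"))               # LTDFR
```

Both outputs match the start and end of the protein sequence listed above.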