Gene EcE24377A_2674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2674 
Symbol 
ID5590350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2668016 
End bp2669053 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content58% 
IMG OID640926330 
Productexoaminopeptidase 
Protein accessionYP_001463723 
Protein GI157157890 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTAT CGCTATTAAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG 
GAAGTGCGGC AGATCCTACT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAT
GGTCTGGGAT CGGTGCTGAT CCGCCTCAAT GAATCGACAG GTCCGAAGGT GATGATCTGC
GCCCATATGG ACGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG CGCTATTGAC
GTGCTGCCGG TTGGCAACGT GCGCATGGCT GCCCGCCAGC TGCAGCCGGT GCGCATCACC
ACCCGTGAAG AGTGCAAAAT TCCAGGCCTG CTTGACGGCG ACCGGCAGGG GAATGACGTC
AGCGCCATGC GCGTGGACAT TGGTGCGCGC TCCTATGACG AAGTGATGCA GGCGGGAATT
CGTCCAGGCG ATCGCGTCAC CTTTGATACA ACTTTTCAGG TTCTCCCTCA CCAGCGGGTG
ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTATCTGC TGGTGACGTT ACTACGTGAG
CTGCACGACG CCGAACTGCC TGCGGAGGTG TGGCTGGTGG CAAGTTCCAG CGAAGAGGTG
GGATTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGATGTCGC CATTGTGCTT
GATACCGCCT GCTGGGCGAA AAACTTTGAT TATAGCGCGG CTAACCATCG CCAGATTGGT
AACGGGCCGA TGCTGGTGTT AAGCGACAAG TCGCTGATTG CGCCGCCAAA ACTTACCGCC
TGGATCGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTGC AGGCGGATAT GTTCAGCAAC
GGCGGCACGG ACGGCGGAGC GGTGCATTTA ACCGGCACCG GCGTGCCCAC GGTGGTGATG
GGGCCAGCGA CGCGCCACGG ACATTGTGCG GCATCGATTG CCGATTGCCG CGACATTTTG
CAGATGCAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CCCGTGAGAC GGTTGTTCAA
CTGACGGATT TCAGATGA
 
Protein sequence
MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC 
AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV
SAMRVDIGAR SYDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE
LHDAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFD YSAANHRQIG
NGPMLVLSDK SLIAPPKLTA WIETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM
GPATRHGHCA ASIADCRDIL QMQQLLSALI QRLTRETVVQ LTDFR