Gene EcolC_2803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2803 
Symbol 
ID6064990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3064148 
End bp3065350 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content52% 
IMG OID641602209 
ProductD-alanyl-D-alanine carboxypeptidase fraction C 
Protein accessionYP_001725758 
Protein GI170020804 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.93217 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAT ACTCCTCTCT CCTTCGTGGT CTTGCAGCGG GTTCTGCATT TTTATTCCTT 
TTTGCCCCAA CGGCATTCGC GGCGGAACAA ACCGTTGAAG CGCCGAGCGT GGATGCGCGT
GCATGGATTT TAATGGATTA CGCCAGCGGT AAAGTGCTGG CAGAAGGCAA CGCGGATGAG
AAACTGGATC CCGCGAGCCT GACTAAAATC ATGACCAGCT ATGTGGTTGG GCAGGCGCTT
AAGGCCGATA AGATTAAACT CACCGATATG GTGACGGTCG GTAAAGATGC CTGGGCGACG
GGAAATCCGG CACTGCGTGG TTCATCGGTA ATGTTCCTCA AACCGGGCGA TCAGGTTTCG
GTGGCAGACT TGAACAAAGG TGTGATTATC CAGTCCGGTA ATGACGCCTG TATTGCGCTG
GCCGATTACG TTGCCGGGAG CCAGGAGTCA TTTATTGGTT TGATGAATGG TTATGCCAAA
AAACTGGGTC TGACCAACAC TACCTTCCAG ACGGTGCACG GTCTGGATGC GCCGGGGCAG
TTCAGTACCG CGCGCGATAT GGCATTGCTG GGCAAAGCAT TGATCCACGA TGTGCCGGAA
GAGTACGCCA TTCATAAAGA GAAAGAGTTC ACCTTCAACA AAATTCGTCA GCCTAACCGT
AACCGTCTGC TGTGGAGCAG CAATCTGAAT GTTGATGGCA TGAAGACAGG AACCACGGCA
GGCGCGGGAT ATAATCTGGT TGCTTCGGCT ACCCAGGGCG ATATGCGTTT AATCTCCGTA
GTGCTGGGGG CGAAAACCGA CCGTATCCGT TTTAATGAGT CTGAGAAATT ATTGACCTGG
GGTTTCCGCT TCTTTGAAAC TGTGACGCCA ATTAAACCTG ATGCCACCTT TGTGACTCAG
CGCGTCTGGT TTGGTGATAA GAGCGAAGTG AATCTCGGGG CAGGCGAAGC GGGCTCCGTG
ACCATACCGC GTGGGCAGCT GAAAAACCTG AAAGCGAGTT ATACGTTAAC GGAACCGCAG
CTTACCGCAC CGCTGAAAAA AGGTCAGGTT GTCGGGACCA TTGATTTCCA GCTTAACGGT
AAATCCATTG AGCAGCGTCC GCTGATCGTG ATGGAAAATG TGGAAGAGGG CGGATTCTTT
GGTCGGGTGT GGGATTTCGT GATGATGAAA TTCCATCAGT GGTTCGGCAG CTGGTTCTCT
TAA
 
Protein sequence
MTQYSSLLRG LAAGSAFLFL FAPTAFAAEQ TVEAPSVDAR AWILMDYASG KVLAEGNADE 
KLDPASLTKI MTSYVVGQAL KADKIKLTDM VTVGKDAWAT GNPALRGSSV MFLKPGDQVS
VADLNKGVII QSGNDACIAL ADYVAGSQES FIGLMNGYAK KLGLTNTTFQ TVHGLDAPGQ
FSTARDMALL GKALIHDVPE EYAIHKEKEF TFNKIRQPNR NRLLWSSNLN VDGMKTGTTA
GAGYNLVASA TQGDMRLISV VLGAKTDRIR FNESEKLLTW GFRFFETVTP IKPDATFVTQ
RVWFGDKSEV NLGAGEAGSV TIPRGQLKNL KASYTLTEPQ LTAPLKKGQV VGTIDFQLNG
KSIEQRPLIV MENVEEGGFF GRVWDFVMMK FHQWFGSWFS