Gene EcolC_3013 details


Gene Information

Locus tag: EcolC_3013
Symbol:
ID: 6065984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3293199
End bp: 3294482
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 50%
IMG OID: 641602430
Product: D-alanyl-D-alanine carboxypeptidase fraction A
Protein accession: YP_001725965
Protein GI: 170021011
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1686] D-alanyl-D-alanine carboxypeptidase
TIGRFAM ID:
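
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 3293199
end_bp = 3294482

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1284 0 427
```

Under these assumptions the result agrees with the listed Gene Length (1284 bp) and Protein Length (427 aa).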


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00459886
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGCCG GTAGTGCATT TGCTATAGTA GGGCACTTTT TTAATTCCAT CACGGATGTC 
GTAGTTCAGA CCATGAATAC CATTTTTTCC GCTCGTATCA TGAAGCGCCT GGCGCTCACC
ACGGCTCTTT GCACAGCCTT TATCTCTGCT GCACATGCCG ATGACCTGAA TATCAAAACT
ATGATCCCGG GTGTACCGCA GATCGATGCG GAGTCCTACA TCCTGATTGA CTATAACTCC
GGCAAAGTGC TCGCCGAACA GAACGCAGAT GTCCGCCGCG ATCCTGCCAG CCTGACCAAA
ATGATGACCA GTTACGTTAT CGGCCAGGCA ATGAAAGCCG GTAAATTTAA AGAAACTGAT
TTAGTCACTA TCGGCAACGA CGCATGGGCC ACCGGTAACC CGGTGTTTAA AGGTTCTTCG
CTGATGTTCC TCAAACCGGG CATGCAGGTT CCGGTTTCTC AGCTGATCCG CGGTATTAAC
CTGCAATCGG GTAACGATGC TTGTGTCGCC ATGGCCGATT TTGCCGCTGG TAGCCAGGAC
GCTTTTGTTG GCTTGATGAA CAGCTACGTT AACGCACTGG GCCTGAAAAA TACCCACTTC
CAGACGGTAC ATGGTCTGGA TGCTGATGGT CAGTACAGCT CCGCGCGAGA TATGGCGCTG
ATCGGCCAGG CGTTGATCCG TGACGTACCG AATGAATACT CGATCTATAA AGAAAAAGAA
TTTACGTTTA ACGGTATTCG CCAGCTGAAC CGTAACGGCC TGTTATGGGA TAACAGCCTG
AATGTCGACG GCATCAAAAC CGGACACACT GACAAAGCAG GTTACAACCT TGTTGCTTCT
GCGACTGAAG GCCAGATGCG CTTGATCTCT GCGGTGATGG GCGGACGTAC TTTTAAAGGC
CGTGAAGCCG AAAGTAAAAA ACTGCTAACC TGGGGCTTCC GTTTCTTCGA AACCGTTAAC
CCACTGAAAG TAGGTAAAGA GTTCGCCTCT GAACCGGTTT GGTTTGGTGA TTCTGATCGC
GCTTCGTTAG GGGTTGATAA AGACGTGTAC CTGACCATTC CGCGTGGTCG CATGAAAGAT
CTGAAAGCCA GCTATGTGCT GAACAGCAGT GAATTGCATG CGCCGCTGCA AAAGAATCAG
GTCGTCGGAA CTATCAACTT CCAGCTTGAT GGCAAAACGA TCGAGCAACG CCCGCTGGTT
GTGTTGCAAG AAATCCCGGA AGGTAACTTC TTCGGCAAAA TCATTGATTA CATTAAATTA
ATGTTCCATC ACTGGTTTGG CTAA
 
Protein sequence
MPAGSAFAIV GHFFNSITDV VVQTMNTIFS ARIMKRLALT TALCTAFISA AHADDLNIKT 
MIPGVPQIDA ESYILIDYNS GKVLAEQNAD VRRDPASLTK MMTSYVIGQA MKAGKFKETD
LVTIGNDAWA TGNPVFKGSS LMFLKPGMQV PVSQLIRGIN LQSGNDACVA MADFAAGSQD
AFVGLMNSYV NALGLKNTHF QTVHGLDADG QYSSARDMAL IGQALIRDVP NEYSIYKEKE
FTFNGIRQLN RNGLLWDNSL NVDGIKTGHT DKAGYNLVAS ATEGQMRLIS AVMGGRTFKG
REAESKKLLT WGFRFFETVN PLKVGKEFAS EPVWFGDSDR ASLGVDKDVY LTIPRGRMKD
LKASYVLNSS ELHAPLQKNQ VVGTINFQLD GKTIEQRPLV VLQEIPEGNF FGKIIDYIKL
MFHHWFG
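
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the database's own method. It assumes the amino-acid assignments of translation table 11 (identical to the standard code for sense and stop codons), builds the codon table in the conventional TCAG order, and translates only the first and last lines of the gene sequence. The `translate` function and the variable names are hypothetical. The sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
# Assumption: table 11 assigns amino acids as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
# The last line starts at base 1261, so it is in frame.
first_bases = "ATGCCTGCCG GTAGTGCATT TGCTATAGTA"
last_bases = "ATGTTCCATC ACTGGTTTGG CTAA"

print(translate(first_bases))  # prints: MPAGSAFAIV
print(translate(last_bases))   # prints: MFHHWFG
```

Both results match the corresponding ends of the protein sequence listed above.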