Gene EcolC_3349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3349 
Symbol 
ID6067406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3672099 
End bp3673154 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content55% 
IMG OID641602763 
ProductDNA polymerase IV 
Protein accessionYP_001726295 
Protein GI170021341 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAA TCATTCATGT GGATATGGAC TGCTTTTTCG CCGCAGTGGA GATGCGCGAC 
AATCCCGCCC TGCGCGATAT CCCTATTGCT ATTGGCGGCA GCCGCGAACG TCGGGGGGTG
ATCAGCACCG CCAATTATCC CGCGCGTAAA TTTGGCGTAC GTAGCGCTAT GCCGACAGGG
ATGGCGCTCA AATTATGCCC GCATCTCACC TTGCTTCCGG GGCGCTTTGA CGCCTACAAA
GAAGCCTCAA ATCATATCCG CGAAATCTTC TCGCGCTACA CCTCGCGTAT TGAACCGTTG
TCACTGGATG AGGCTTATCT CGACGTCACC GATAGCGTCC ATTGCCACGG TTCTGCGACC
CTCATCGCCC AGGAAATCCG CCAGACGATT TTCAACGAGC TGCAACTGAC GGCGTCTGCG
GGCGTGGCAC CCGTAAAGTT TCTCGCCAAA ATCGCCTCCG ACATGAATAA ACCCAACGGC
CAGTTTGTGA TTACGCCGGC AGAAGTTCCG GCATTTTTAC AAACCTTACC ACTGGCAAAA
ATCCCCGGCG TCGGCAAAGT CTCGGCGGCA AAACTGGAAG CGATGGGGCT ACGAACCTGC
GGTGATGTAC AAAAGTGTGA TCTGGTGATG CTGCTTAAAC GCTTTGGCAA ATTTGGCCGC
ATTTTGTGGG AGCGTAGTCA GGGGATTGAC GAGCGCGACG TTAACAGCGA ACGGTTGCGA
AAATCCGTCG GCGTGGAACG CACGATGGCG GAAGATATCC ACCACTGGTC TGAATGTGAA
GCGATTATCG AGCGGCTGTA TCCGGAACTT GAACGCCGTC TGGCAAAGGT AAAACCTGAT
TTACTGATTG CCCGCCAGGG GGTGAAATTA AAGTTCGACG ATTTTCAGCA AACCACTCAG
GAGCACGTCT GGCCGCGGCT GAATAAAGCT GACTTAATCG CCACCGCGCG TAAAACCTGG
GATGAACGCC GCGGCGGGCG CGGTGTGCGA CTGGTGGGGC TGCATGTGAC GTTGCTTGAT
CCGCAAATGG AAAGACAACT GGTGCTGGGA TTATGA
 
Protein sequence
MRKIIHVDMD CFFAAVEMRD NPALRDIPIA IGGSRERRGV ISTANYPARK FGVRSAMPTG 
MALKLCPHLT LLPGRFDAYK EASNHIREIF SRYTSRIEPL SLDEAYLDVT DSVHCHGSAT
LIAQEIRQTI FNELQLTASA GVAPVKFLAK IASDMNKPNG QFVITPAEVP AFLQTLPLAK
IPGVGKVSAA KLEAMGLRTC GDVQKCDLVM LLKRFGKFGR ILWERSQGID ERDVNSERLR
KSVGVERTMA EDIHHWSECE AIIERLYPEL ERRLAKVKPD LLIARQGVKL KFDDFQQTTQ
EHVWPRLNKA DLIATARKTW DERRGGRGVR LVGLHVTLLD PQMERQLVLG L