Gene EcSMS35_0285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0285 
SymboldinB 
ID6146033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp294541 
End bp295596 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content53% 
IMG OID641615183 
ProductDNA polymerase IV 
Protein accessionYP_001742392 
Protein GI170679693 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.32449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAA TCATTCATGT GGATATGGAC TGCTTTTTCG CAGCGGTGGA GATGCGCGAC 
AATCCCGCCC TGCGCGATAT CCCTATTGCT ATTGGCGGCA GCCGCGAACG TCGGGGGGTG
ATCAGCACCG CCAATTATCC CGCGCGTAAA TTTGGCGTAC GCAGCGCTAT GCCGACAGGG
ATGGCACTCA AATTATGCCC ACATCTCACC TTGCTTCCGG GGCGCTTTGA CGCCTACAAA
GAAGCCTCAA ATCATATCCG CGAAATTTTC TCGCGCTATA CCTCGCGCAT TGAACCGTTA
TCACTGGATG AAGCCTATCT CGACGTCACC GATAGTGTGC ATTGCCACGG TTCTGCGACC
CTCATCGCCA AGGAAATCCG TCAGACGATC TTTAACGAGC TGCAACTGAC GGCATCTGCG
GGCGTTGCGC CGGTAAAGTT TCTCGCCAAA ATTGCTTCCG ACATGAATAA ACCCAACGGT
CAGTTTGTGA TTACTCCGGC AGAAGTTCCG GCATTTCTGC AAACCTTACC GCTGGCAAAA
ATTCCCGGCG TCGGCAAAGT CTCGGCGGCA AAACTGGAAG CGATGGGGCT GCGGACCTGC
GGTGATGTGC AAAAGTGTGA TCTGGTGACT CTGCTCAAAC GTTTTGGCAA ATTTGGCCGC
ATTTTGTGGG AGCGTAGTCA GGGGATTGAC GAGCGCGACG TTAACAGTGA ACGGTTGCGA
AAATCCGTCG GCGTGGAACG CACGATGGCG GAAGATATTC ATCACTGGTC TGAATGTGAA
GCGATTATCG AGCGGCTGTA TCCTGAACTT GAACGCCGTC TGGCAAAGGT AAAACCTGAT
TTACTGATTG CTCGCCAGGG GGTGAAATTA AAGTTTGATG ATTTTCAGCA AACCACTCAG
GAGCACGTCT GGCCGCGGCT GAATAAAGCT GATCTAATCG CCACCGCGCG TAAAACCTGG
GATGAACGTC GCGGCGGGCG CGGTGTGCGT CTGGTGGGGC TGCATGTGAC GTTACTTGAC
CCTCAAATGG AAAGACAACT GGTGCTGGGA TTATGA
 
Protein sequence
MRKIIHVDMD CFFAAVEMRD NPALRDIPIA IGGSRERRGV ISTANYPARK FGVRSAMPTG 
MALKLCPHLT LLPGRFDAYK EASNHIREIF SRYTSRIEPL SLDEAYLDVT DSVHCHGSAT
LIAKEIRQTI FNELQLTASA GVAPVKFLAK IASDMNKPNG QFVITPAEVP AFLQTLPLAK
IPGVGKVSAA KLEAMGLRTC GDVQKCDLVT LLKRFGKFGR ILWERSQGID ERDVNSERLR
KSVGVERTMA EDIHHWSECE AIIERLYPEL ERRLAKVKPD LLIARQGVKL KFDDFQQTTQ
EHVWPRLNKA DLIATARKTW DERRGGRGVR LVGLHVTLLD PQMERQLVLG L