Gene EcSMS35_3717 details


Gene Information

Locus tag: EcSMS35_3717
Symbol:
ID: 6143927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3786095
End bp: 3787297
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 50%
IMG OID: 641618543
Product: putative DNA protecting protein DprA
Protein accession: YP_001745683
Protein GI: 170683496
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
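
The start, end, gene length and protein length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (which matches the stated length) and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 3786095
END_BP = 3787297


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1203 400
```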


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTTT CAGCCAATGC ACAAGCGACC CTCCTGCTAA CCAGCGATTT TTCTCGCGCG 
GCGGCGAGTG AGTATAAACC TCTTAGTAAT AGTGAATGGG GGAAGTTTGC ATTATGGCTG
AAGCACCAAC GTATCAGTCC CGCCGATCTT CTGGTGCCGC AACCGCAAGA GAAACTTACA
GGCTGGAGCG ATCCGCGTAT TTCTCAGGAG CGTATTCTTG GCTTGCTGGC GCGTGGTCAT
AGTTTGGCGT TGGCGGTAGA TAAGTGGCAA CGCGCCGGTT TATGGATCTT AACCCGCGGA
GATGCCGATT ATCCCGTTCG CTTGAAAAAC CGATTGCGAA CGGATGCACC TCCCGTTTTA
TTTGGCTGCG GGAATAAAGC ATTACTGCAA GCGGAAGGTA TGGCGATTGT TGGCTCGCGA
GATGCTCCGA CTGCCGATTT GCGCTATACC CAACAACTGG CAGCGAAACT GGCCCAACAG
GGGATTTGCG TTATCTCTGG TGGTGCGCGA GGTATTGATG AATGTGCAAT GGCGTCGGCA
CTGGAGGCCG GGGGAACTGC CGTTGGCGTA TTAGCTGATA GCTTGTTAAA AACGAGTACG
TTAGTGAAAT GGCGTGAAGG GCTTATAGCA GGCAACCTGG TGTTGATTTC GCCGTTTTAC
CCAGAGGTAC GTTTCACCGT CGGCAATGCG ATGGCGCGAA ATAAATATAT TTATTGCCTT
GCTGAAAGCG CAATGGTTGT ACGTGCGGGA ATGACCGGTG GAACGATAAC CGGGGCGATG
GAGGCATTAA AACATCAGTG GCTGCCTGTG CAGGTTAAAC CAAATCAGGA TATGCAATCA
GCCAATTCAC GATTAGTAGA AAATGGGGCG TCATGGAGTA CTGAACAGGC TGAGAATGTG
ACGTTCAGAC TGCCAGACGT TACTGGGCTG ATGTATGACA AAGCACTCCG TAACGCTCAA
CCAGAATTGT TTTCGCTGCA CGAAGATGAC GCAAATTACG CAGTGATGCC CTCGCATACC
CCTGTCGATT TTTATCAACT TTTTGTGGCG GAACTGGCGA TCCTTGCAAA GGAATCGATA
AGTATTGAAA GGCTGGCGTC TTGTACTGGT TTAACCATCG AACAAATTAG TGTGTGGCTG
AACCGCGCAG AAGAAGAGGG AAGGGTTATC CGATTGGGCG AAGGTCATTA TCAGTTCAGG
TAA
 
Protein sequence
MNLSANAQAT LLLTSDFSRA AASEYKPLSN SEWGKFALWL KHQRISPADL LVPQPQEKLT 
GWSDPRISQE RILGLLARGH SLALAVDKWQ RAGLWILTRG DADYPVRLKN RLRTDAPPVL
FGCGNKALLQ AEGMAIVGSR DAPTADLRYT QQLAAKLAQQ GICVISGGAR GIDECAMASA
LEAGGTAVGV LADSLLKTST LVKWREGLIA GNLVLISPFY PEVRFTVGNA MARNKYIYCL
AESAMVVRAG MTGGTITGAM EALKHQWLPV QVKPNQDMQS ANSRLVENGA SWSTEQAENV
TFRLPDVTGL MYDKALRNAQ PELFSLHEDD ANYAVMPSHT PVDFYQLFVA ELAILAKESI
SIERLASCTG LTIEQISVWL NRAEEEGRVI RLGEGHYQFR
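
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace stripped before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). All function names are illustrative assumptions.

```python
import itertools

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in the record.
first_line = "ATGAATCTTT CAGCCAATGC ACAAGCGACC CTCCTGCTAA CCAGCGATTT TTCTCGCGCG"
dna = clean_sequence(first_line)
print(translate(dna))  # prints: MNLSANAQATLLLTSDFSRA
print(round(gc_fraction(dna), 2))
```

Running the same functions on the full gene sequence should reproduce the listed protein sequence, provided the record is internally consistent.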