Gene EcSMS35_2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2333 
Symbol 
ID6145767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2366292 
End bp2368052 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content53% 
IMG OID641617207 
Productputative helicase 
Protein accessionYP_001744380 
Protein GI170682105 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00564696 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0326725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTTA CACTCCGCCC ATATCAGCAA GAAGCCGTGG ATGCCACGCT CAACCATTTT 
CGTCGTCATA AAACCCCTGC CGTAATCGTG CTGCCCACCG GTGCAGGTAA AAGCCTGGTG
ATAGCGGAAC TGGCGCGGCT GGCACGTGGT CGCGTGCTGG TGCTGGCACA CGTTAAAGAA
CTGGTGGCGC AAAACCATGC AAAATATCAG GCGCTGGGGC TGGAAGCCGA TATTTTTGCC
GCCGGGCTAA AGCGCAAAGA GAGCCACGGT AAAGTGGTAT TTGGCAGCGT GCAGTCGGTC
GCCCGTAATC TTGATGCCTT TCAGGGTGAA TTTTCGCTGT TGATTGTCGA TGAATGTCAC
CGTATTGGTG ACGATGAAGA GAGCCAGTAT CAGCAAATCC TCACTCACCT GACCAAAGTG
AATCCCCACT TACGCCTGCT GGGGCTGACT GCCACGCCTT TTCGACTGGG CAAAGGCTGG
ATTTATCAGT TCCATTATCA CGGCATGGTA CGCGGCGATG AGAAAGCCCT TTTCCGTGAC
TGCATTTATG AGCTGCCGCT GCGTTATATG ATTAAACACG GCTATCTGAC GCCGCCAGAA
CGACTGGATA TGCCAGTAGT GCAATACGAT TTCAGCCGCT TGCAGGCACA GAGTAACGGG
CTGTTCAGCG AAGCCGATCT CAACCGTGAG CTGAAAAAAC AACAACGTAT TACCCCGCAC
ATCATCAACC AGATTATGGA GTTTGCTGCA ACGCGCAAAG GGGTGATGAT TTTCGCCGCC
ACCGTTGAAC ACGCAAAAGA GATTGTGGGA TTACTACCCA CCGAAGATGC TGCACTGATT
ACTGGCGACA CCCCCGGCGC TGAGCGCGAT GTGTTAATTG AAGATTTTAA AGCCCAGCGT
TTTCGCTATC TGGTCAACGT CGCGGTACTG ACCACCGGAT TTGACGCCCC GCACGTCGAT
CTTATCGCCA TTCTGCGCCC TACCGAATCG GTGAGTCTTT ACCAACAAAT TGTCGGGCGA
GGTCTGCGTC TCGCTCCTGG CAAGACTGAT TGCTTAATTC TTGATTATGC GGGTAATCCT
CACGATCTCT ACGCGCCGGA AGTTGGTACA CCGAAAGGCA AAAGTGACAA CGTTCCGGTA
CAGGTTTTCT GCCCTGCCTG CGGTTTTGCC AACACCTTTT GGGGGAAAAC GACCGCCGAC
GGGACATTGA TTGAACACTT TGGTCGCCGC TGTCAGGGAT GGTTTGAAGA TGACGACGGT
CATCGCGAAC AGTGTGACTT CCGTTTCCGT TTTAAAAATT GCCCGCAATG TAACGCAGAA
AATGATATTG CCGCCCGCCG TTGCCGCGAG TGTGACACCA TTCTGGTTGA CCCGGATGAT
ATGTTAAAAG CGGCGCTACG ACTGAAAGAC GCGCTGGTAT TACGCTGTAG CGGCATGTCT
TTGCAGCATG GGCACGACGA AAAAGGCGAA TGGTTGAAAA TCACCTATTA CGATGAAGAC
GGCGCGGATG TGAGTGAGCG TTTCCGTCTG CAAACGCCCG CCCAGCGAAC TGCCTTCGAG
CAGCTTTTTA TCCGCCCGCA TACGCGCACA CCGGGCATCC CGCTGCGCTG GATCACCGCC
GCCGATATCC TCGCCCAGCA AGCCTTATTG CGACACCCGG ATTTTGTCGT TGCCCGCATG
AAAGGTCAGT ACTGGCAAGT GCGTGAAAAA GTGTTCGATT ACGAAGGTCG TTTTCGTCGG
GCGCACGAAT TACGCGGTTA A
 
Protein sequence
MIFTLRPYQQ EAVDATLNHF RRHKTPAVIV LPTGAGKSLV IAELARLARG RVLVLAHVKE 
LVAQNHAKYQ ALGLEADIFA AGLKRKESHG KVVFGSVQSV ARNLDAFQGE FSLLIVDECH
RIGDDEESQY QQILTHLTKV NPHLRLLGLT ATPFRLGKGW IYQFHYHGMV RGDEKALFRD
CIYELPLRYM IKHGYLTPPE RLDMPVVQYD FSRLQAQSNG LFSEADLNRE LKKQQRITPH
IINQIMEFAA TRKGVMIFAA TVEHAKEIVG LLPTEDAALI TGDTPGAERD VLIEDFKAQR
FRYLVNVAVL TTGFDAPHVD LIAILRPTES VSLYQQIVGR GLRLAPGKTD CLILDYAGNP
HDLYAPEVGT PKGKSDNVPV QVFCPACGFA NTFWGKTTAD GTLIEHFGRR CQGWFEDDDG
HREQCDFRFR FKNCPQCNAE NDIAARRCRE CDTILVDPDD MLKAALRLKD ALVLRCSGMS
LQHGHDEKGE WLKITYYDED GADVSERFRL QTPAQRTAFE QLFIRPHTRT PGIPLRWITA
ADILAQQALL RHPDFVVARM KGQYWQVREK VFDYEGRFRR AHELRG