Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2333 |
Symbol | |
ID | 6145767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2366292 |
End bp | 2368052 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617207 |
Product | putative helicase |
Protein accession | YP_001744380 |
Protein GI | 170682105 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00564696 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0326725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTTA CACTCCGCCC ATATCAGCAA GAAGCCGTGG ATGCCACGCT CAACCATTTT CGTCGTCATA AAACCCCTGC CGTAATCGTG CTGCCCACCG GTGCAGGTAA AAGCCTGGTG ATAGCGGAAC TGGCGCGGCT GGCACGTGGT CGCGTGCTGG TGCTGGCACA CGTTAAAGAA CTGGTGGCGC AAAACCATGC AAAATATCAG GCGCTGGGGC TGGAAGCCGA TATTTTTGCC GCCGGGCTAA AGCGCAAAGA GAGCCACGGT AAAGTGGTAT TTGGCAGCGT GCAGTCGGTC GCCCGTAATC TTGATGCCTT TCAGGGTGAA TTTTCGCTGT TGATTGTCGA TGAATGTCAC CGTATTGGTG ACGATGAAGA GAGCCAGTAT CAGCAAATCC TCACTCACCT GACCAAAGTG AATCCCCACT TACGCCTGCT GGGGCTGACT GCCACGCCTT TTCGACTGGG CAAAGGCTGG ATTTATCAGT TCCATTATCA CGGCATGGTA CGCGGCGATG AGAAAGCCCT TTTCCGTGAC TGCATTTATG AGCTGCCGCT GCGTTATATG ATTAAACACG GCTATCTGAC GCCGCCAGAA CGACTGGATA TGCCAGTAGT GCAATACGAT TTCAGCCGCT TGCAGGCACA GAGTAACGGG CTGTTCAGCG AAGCCGATCT CAACCGTGAG CTGAAAAAAC AACAACGTAT TACCCCGCAC ATCATCAACC AGATTATGGA GTTTGCTGCA ACGCGCAAAG GGGTGATGAT TTTCGCCGCC ACCGTTGAAC ACGCAAAAGA GATTGTGGGA TTACTACCCA CCGAAGATGC TGCACTGATT ACTGGCGACA CCCCCGGCGC TGAGCGCGAT GTGTTAATTG AAGATTTTAA AGCCCAGCGT TTTCGCTATC TGGTCAACGT CGCGGTACTG ACCACCGGAT TTGACGCCCC GCACGTCGAT CTTATCGCCA TTCTGCGCCC TACCGAATCG GTGAGTCTTT ACCAACAAAT TGTCGGGCGA GGTCTGCGTC TCGCTCCTGG CAAGACTGAT TGCTTAATTC TTGATTATGC GGGTAATCCT CACGATCTCT ACGCGCCGGA AGTTGGTACA CCGAAAGGCA AAAGTGACAA CGTTCCGGTA CAGGTTTTCT GCCCTGCCTG CGGTTTTGCC AACACCTTTT GGGGGAAAAC GACCGCCGAC GGGACATTGA TTGAACACTT TGGTCGCCGC TGTCAGGGAT GGTTTGAAGA TGACGACGGT CATCGCGAAC AGTGTGACTT CCGTTTCCGT TTTAAAAATT GCCCGCAATG TAACGCAGAA AATGATATTG CCGCCCGCCG TTGCCGCGAG TGTGACACCA TTCTGGTTGA CCCGGATGAT ATGTTAAAAG CGGCGCTACG ACTGAAAGAC GCGCTGGTAT TACGCTGTAG CGGCATGTCT TTGCAGCATG GGCACGACGA AAAAGGCGAA TGGTTGAAAA TCACCTATTA CGATGAAGAC GGCGCGGATG TGAGTGAGCG TTTCCGTCTG CAAACGCCCG CCCAGCGAAC TGCCTTCGAG CAGCTTTTTA TCCGCCCGCA TACGCGCACA CCGGGCATCC CGCTGCGCTG GATCACCGCC GCCGATATCC TCGCCCAGCA AGCCTTATTG CGACACCCGG ATTTTGTCGT TGCCCGCATG AAAGGTCAGT ACTGGCAAGT GCGTGAAAAA GTGTTCGATT ACGAAGGTCG TTTTCGTCGG GCGCACGAAT TACGCGGTTA A
|
Protein sequence | MIFTLRPYQQ EAVDATLNHF RRHKTPAVIV LPTGAGKSLV IAELARLARG RVLVLAHVKE LVAQNHAKYQ ALGLEADIFA AGLKRKESHG KVVFGSVQSV ARNLDAFQGE FSLLIVDECH RIGDDEESQY QQILTHLTKV NPHLRLLGLT ATPFRLGKGW IYQFHYHGMV RGDEKALFRD CIYELPLRYM IKHGYLTPPE RLDMPVVQYD FSRLQAQSNG LFSEADLNRE LKKQQRITPH IINQIMEFAA TRKGVMIFAA TVEHAKEIVG LLPTEDAALI TGDTPGAERD VLIEDFKAQR FRYLVNVAVL TTGFDAPHVD LIAILRPTES VSLYQQIVGR GLRLAPGKTD CLILDYAGNP HDLYAPEVGT PKGKSDNVPV QVFCPACGFA NTFWGKTTAD GTLIEHFGRR CQGWFEDDDG HREQCDFRFR FKNCPQCNAE NDIAARRCRE CDTILVDPDD MLKAALRLKD ALVLRCSGMS LQHGHDEKGE WLKITYYDED GADVSERFRL QTPAQRTAFE QLFIRPHTRT PGIPLRWITA ADILAQQALL RHPDFVVARM KGQYWQVREK VFDYEGRFRR AHELRG
|
| |