Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4891 |
Symbol | |
ID | 6142914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 5008273 |
End bp | 5010123 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641619694 |
Product | hypothetical protein |
Protein accession | YP_001746801 |
Protein GI | 170683581 |
COG category | [R] General function prediction only |
COG ID | [COG3972] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.412698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.98473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAA TAATCCCAAC AATCAGTAGC TGCAACGATA AAATTACTGC CGGAGAGAAA AGGCTGGCAA GAATTCTGGA AAGAGGACTA GGCGAAGAGT GCTGCTGTTG GTATGACATT CCCACGGGTG ATAAGCATCT TCACCCAGAT TTTATGATAC TTTCGCCAGA AAAAGGCATT ATTTTTCTCG AGGTAAAAGA TTGGTTTATT ACAAAAATAA AAAAAGCCAA CAAGTCTAAT GTGTTGTATG AGACACAAAA TGGTATTCAA TCTTTAAAGA ATCCGATAGA ACAAGTTAGG CGATATGCTT TCGAAGCTGT AAACCAGTTA AAAAAAGACC CCCTATTGTG CCAACAGGAT GATCGTTACA ATGGCAATCT TCTTTTCCCC TATGGATATG GCGTCTACTT TAGTAATATT ACACGAGAAG CATTAAATAA TAAATTTACT CAAGAAGAGC TGCTCGGGAT CTTCCCATCA GATCTCATCA TCTGCAAAGA CGAAATAAAT GAATTCATGT CAAAAGAGGA TGTTTCATCC AAAATACATT CGTTGATTAA ATATGACTTC AAATGTCATG CTACACAAGA ACAGTTGAAC AGGATTAGAT GGCACTTATA CCCCGATGTC AGAATAGAAA GGGAAAAGAG AAGTAAAAAC CGTGATGAAT TTAGTGTCGA TGCACCAACC ATCATTTCTA TCCTGGATAC TCAACAGGAA CAACTTGCCA GAAGTATGAG AGATGGCCAC CGGATCATTC ATGGTGTTGC AGGTTCAGGG AAGACATTGA TCCTCTATCA TCGCTGTCAG GAATTAGCGA AAAAAAATGA CAGCGAAAAG CCAATTCTTG TCATTTGCTA CAATATTACT TTAGCCAAAA AACTTCGGTC GATGTTCATC AATCATCCCT TTGCAGAGAA AATAAAGGTT AAAAATTTTC ATGCTTGGTG TTTCCAACAA ATTAAAGAAA ACAAGATCAC TATTCCAGAA GGGGATAATA TTTTTGAAAA TATGGAGCTA GCGTTGACTG ATGGTTTCAA GACAGGAAAA ATAAAACCAG AACAATACAG CGCTGTATTA ATTGATGAAG GACATGATTT TAAGCCAGAA TGGTTAAAAA TCCTGGCAAA AATGACTGAC TCTCAGGACG AGAATTTGCT ATTCCTTTAT GATGATGCTC AGTCAATTTA CCAAAAAAAG AAAGCATTAG ACTTTACACT GTCCAGCGTA GATATTAAAG CGACAGGACG CACAACCATT CTTAATATAA ATTATAGAAA CACACAGCAA ATCCTACACT TTGCCAGTTG CATTGCATTT AATTATCTTA ATAGCCATAT CGATAGTTCT TTAACCTATC ATAAACCTGC TGCGGGAGGA ATGAATGGCG ATTATCCCAA CCTTGAACAT TTCGACACAC AGGACGAAGA AATAGCACGA GCAGTCGAAT GGATTACAGA GCAAAATGAA CAAGGGATTC CGTGGTCTGA AATTGCTATA CTAAGTCCAT CAACTCACAC CCTTTCTAAC TCTCTAAGCT CTGTACTTGA ATCGAAAAAC ATCCCCTTTA ATCTTATTGT AAGCTCTGCA GATAAAAAGT CCTGGACGCC AGAACAAGAG CGTATTTCTG TTATGCCATT ACCCAGCAGT AAAGGACTTG AGTTTCACTC AGTTGTCATT ATTGATGCAG CGAGAACACG GGATAATAGC GATGATTTGA GTGAGGATAT AAAGAGACTC TATGTAGGTT TTACCCGAGC ACGTTGCAAT CTCTTGGTAA CCTTGCACGG TGAAGGTGCG TTAAATGAGC ATCTACTAAA AACCTATGAG CAAAGCCCAC AATATATGTA A
|
Protein sequence | MATIIPTISS CNDKITAGEK RLARILERGL GEECCCWYDI PTGDKHLHPD FMILSPEKGI IFLEVKDWFI TKIKKANKSN VLYETQNGIQ SLKNPIEQVR RYAFEAVNQL KKDPLLCQQD DRYNGNLLFP YGYGVYFSNI TREALNNKFT QEELLGIFPS DLIICKDEIN EFMSKEDVSS KIHSLIKYDF KCHATQEQLN RIRWHLYPDV RIEREKRSKN RDEFSVDAPT IISILDTQQE QLARSMRDGH RIIHGVAGSG KTLILYHRCQ ELAKKNDSEK PILVICYNIT LAKKLRSMFI NHPFAEKIKV KNFHAWCFQQ IKENKITIPE GDNIFENMEL ALTDGFKTGK IKPEQYSAVL IDEGHDFKPE WLKILAKMTD SQDENLLFLY DDAQSIYQKK KALDFTLSSV DIKATGRTTI LNINYRNTQQ ILHFASCIAF NYLNSHIDSS LTYHKPAAGG MNGDYPNLEH FDTQDEEIAR AVEWITEQNE QGIPWSEIAI LSPSTHTLSN SLSSVLESKN IPFNLIVSSA DKKSWTPEQE RISVMPLPSS KGLEFHSVVI IDAARTRDNS DDLSEDIKRL YVGFTRARCN LLVTLHGEGA LNEHLLKTYE QSPQYM
|
| |