Gene EcSMS35_4891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4891 
Symbol 
ID6142914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5008273 
End bp5010123 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content38% 
IMG OID641619694 
Producthypothetical protein 
Protein accessionYP_001746801 
Protein GI170683581 
COG category[R] General function prediction only 
COG ID[COG3972] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.412698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.98473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA TAATCCCAAC AATCAGTAGC TGCAACGATA AAATTACTGC CGGAGAGAAA 
AGGCTGGCAA GAATTCTGGA AAGAGGACTA GGCGAAGAGT GCTGCTGTTG GTATGACATT
CCCACGGGTG ATAAGCATCT TCACCCAGAT TTTATGATAC TTTCGCCAGA AAAAGGCATT
ATTTTTCTCG AGGTAAAAGA TTGGTTTATT ACAAAAATAA AAAAAGCCAA CAAGTCTAAT
GTGTTGTATG AGACACAAAA TGGTATTCAA TCTTTAAAGA ATCCGATAGA ACAAGTTAGG
CGATATGCTT TCGAAGCTGT AAACCAGTTA AAAAAAGACC CCCTATTGTG CCAACAGGAT
GATCGTTACA ATGGCAATCT TCTTTTCCCC TATGGATATG GCGTCTACTT TAGTAATATT
ACACGAGAAG CATTAAATAA TAAATTTACT CAAGAAGAGC TGCTCGGGAT CTTCCCATCA
GATCTCATCA TCTGCAAAGA CGAAATAAAT GAATTCATGT CAAAAGAGGA TGTTTCATCC
AAAATACATT CGTTGATTAA ATATGACTTC AAATGTCATG CTACACAAGA ACAGTTGAAC
AGGATTAGAT GGCACTTATA CCCCGATGTC AGAATAGAAA GGGAAAAGAG AAGTAAAAAC
CGTGATGAAT TTAGTGTCGA TGCACCAACC ATCATTTCTA TCCTGGATAC TCAACAGGAA
CAACTTGCCA GAAGTATGAG AGATGGCCAC CGGATCATTC ATGGTGTTGC AGGTTCAGGG
AAGACATTGA TCCTCTATCA TCGCTGTCAG GAATTAGCGA AAAAAAATGA CAGCGAAAAG
CCAATTCTTG TCATTTGCTA CAATATTACT TTAGCCAAAA AACTTCGGTC GATGTTCATC
AATCATCCCT TTGCAGAGAA AATAAAGGTT AAAAATTTTC ATGCTTGGTG TTTCCAACAA
ATTAAAGAAA ACAAGATCAC TATTCCAGAA GGGGATAATA TTTTTGAAAA TATGGAGCTA
GCGTTGACTG ATGGTTTCAA GACAGGAAAA ATAAAACCAG AACAATACAG CGCTGTATTA
ATTGATGAAG GACATGATTT TAAGCCAGAA TGGTTAAAAA TCCTGGCAAA AATGACTGAC
TCTCAGGACG AGAATTTGCT ATTCCTTTAT GATGATGCTC AGTCAATTTA CCAAAAAAAG
AAAGCATTAG ACTTTACACT GTCCAGCGTA GATATTAAAG CGACAGGACG CACAACCATT
CTTAATATAA ATTATAGAAA CACACAGCAA ATCCTACACT TTGCCAGTTG CATTGCATTT
AATTATCTTA ATAGCCATAT CGATAGTTCT TTAACCTATC ATAAACCTGC TGCGGGAGGA
ATGAATGGCG ATTATCCCAA CCTTGAACAT TTCGACACAC AGGACGAAGA AATAGCACGA
GCAGTCGAAT GGATTACAGA GCAAAATGAA CAAGGGATTC CGTGGTCTGA AATTGCTATA
CTAAGTCCAT CAACTCACAC CCTTTCTAAC TCTCTAAGCT CTGTACTTGA ATCGAAAAAC
ATCCCCTTTA ATCTTATTGT AAGCTCTGCA GATAAAAAGT CCTGGACGCC AGAACAAGAG
CGTATTTCTG TTATGCCATT ACCCAGCAGT AAAGGACTTG AGTTTCACTC AGTTGTCATT
ATTGATGCAG CGAGAACACG GGATAATAGC GATGATTTGA GTGAGGATAT AAAGAGACTC
TATGTAGGTT TTACCCGAGC ACGTTGCAAT CTCTTGGTAA CCTTGCACGG TGAAGGTGCG
TTAAATGAGC ATCTACTAAA AACCTATGAG CAAAGCCCAC AATATATGTA A
 
Protein sequence
MATIIPTISS CNDKITAGEK RLARILERGL GEECCCWYDI PTGDKHLHPD FMILSPEKGI 
IFLEVKDWFI TKIKKANKSN VLYETQNGIQ SLKNPIEQVR RYAFEAVNQL KKDPLLCQQD
DRYNGNLLFP YGYGVYFSNI TREALNNKFT QEELLGIFPS DLIICKDEIN EFMSKEDVSS
KIHSLIKYDF KCHATQEQLN RIRWHLYPDV RIEREKRSKN RDEFSVDAPT IISILDTQQE
QLARSMRDGH RIIHGVAGSG KTLILYHRCQ ELAKKNDSEK PILVICYNIT LAKKLRSMFI
NHPFAEKIKV KNFHAWCFQQ IKENKITIPE GDNIFENMEL ALTDGFKTGK IKPEQYSAVL
IDEGHDFKPE WLKILAKMTD SQDENLLFLY DDAQSIYQKK KALDFTLSSV DIKATGRTTI
LNINYRNTQQ ILHFASCIAF NYLNSHIDSS LTYHKPAAGG MNGDYPNLEH FDTQDEEIAR
AVEWITEQNE QGIPWSEIAI LSPSTHTLSN SLSSVLESKN IPFNLIVSSA DKKSWTPEQE
RISVMPLPSS KGLEFHSVVI IDAARTRDNS DDLSEDIKRL YVGFTRARCN LLVTLHGEGA
LNEHLLKTYE QSPQYM