Gene EcSMS35_0990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0990 
Symbol 
ID6146757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1003964 
End bp1005910 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content51% 
IMG OID641615877 
Producthypothetical protein 
Protein accessionYP_001743069 
Protein GI170681285 
COG category[R] General function prediction only 
COG ID[COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.169795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACAA ATATAAAAGT ATTTACATCG ACAGATGAAT TGACCACTCT CGGGCGTGAA 
CTGGGCAAAG GCGGCGAAGG TGCGGTTTAT GATATCGAGG AGTTTGTCGA TAGCGTCGCC
AAGATTTATC ACACGCCGCC ACCCGCCTTA AAACAGGACA AACTTGCCTT TATGGCTGCG
ACAGCTGACG CGCAATTGTT GAATTATGTC GCCTGGCCGC AGGCAACGCT TCACGGTGGA
CGAGGCGGAA AAGTAATCGG TTTTATGATG CCAAAAGTTT CTGGTAAAGA ACCGATTCAT
ATGATCTATA GCCCGGCACA TCGTCGTCAG AGTTACCCTC ATTGTGCGTG GGATTTTCTC
CTCTATGTTG CGCGCAATAT TGCTTCATCT TTTGCTACGG TTCACGAGCA CGGGCACGTT
GTGGGTGACG TAAACCAGAA CAGCTTTATG GTAGGCCGTG ACAGCAAAGT GGTGTTGATT
GATAGCGACT CCTTTCAGAT TAACGCCAAT GGAACGCTGC ATTTATGCGA AGTGGGCGTG
TCGCATTTTA CGTCGCCAGA GCTGCAAACC TTGTCGTCAT TTGTTGGCTT TGAACGCACC
GCGAATCACG ATAATTTTGG CCTTGCGTTG CTGATTTTTC ACGTCTTGTT TGGTGGTCGG
CATCCTTATT CCGGTGTACC GCTTATCTCT GATGCGGGTA ATGCGCTGGA GACGGATATT
GCCCATTTCC GTTATGCCTA CGCGTCAGAT AATCAGCGAC GTGGTTTAAA ACCGCCGCCA
CGATCTATTC CGCTGTCGAT GTTACCGGGC GATGTTGAAG CCATGTTTCA GCAGGCATTC
ACGGAAAGTG GCGTGGCAAC CGGGCGTCCG ACGGCAAAAG CGTGGGTAGC GGCACTTGAT
TCTCTACGCC AACAATTAAA GAAATGTACC GTTTCGGCAA TGCATGTTTA TCCCGGTCAT
TTGGCTGACT GCCCGTGGTG TGCTCTGGAT AATCAAGGCG TTATCTATTT TATTGATCTC
GGCGAAGAGG TCATTACCAC CAGCGGTGAT TTTGTGCTGG CGAAAGTCTG GGCGATGGTG
ATGGCGTCAG TAGCACCGCC AGCATTGCAA CTGCCATTAC CCGATCATTT CCAACCGACT
GGCAGGCCGC TTCCTTTAGG CCTGTTAAGG CGTGAATACA TCATTCTGAT TGAGATCGCA
CTGTCAGCGT TATCGCTGTT GCTTTGCGGC CTTCAGACAG AACCACGTTA TATTATTTTG
GTTCCTGTGC TGTCGGCTAT CTGGATTATT GGCAGCCTGA CAAGCAAAGC GTACAAAGCA
GAAATCCAGC AACGAAGAGA GGCTTTTAAT CGCGCAAAAA TGGACTATGA CCATTTAGTC
AGCCAGATCC AACAGTTGGG CGGGCTGGAA GGTTTTATCG CCAAACGGAC GATGCTCGAA
AAAATGAAGG ACGAAATTCT TGGGTTACCG GAAGAAGAAA AGCGCGATCT GGCAGCACTT
CAGGACACCG CAAGGGAACG GCAGAAGCAG AAGTTTCTGG AGGGATTTTT TATTGATGTT
GCCTCTATTC CCGGCGTTGG CCCTGCGCGT AAAGCGGCGT TACGGTCCTT TGGTATTGAA
ACGGCGGCAG ACGTTACCCG ACGGAGCGTT AAGCAAGTAA AAGGTTTTGG TGATCATCTG
ACCCAGGCGG TTATCGACTG GAAAGCGAGT TGCGAACGCC GTTTTGTTTT CAGGCCGAAC
GAAGCGGTAA CGCCTGCAGA CAGACAAGCG GTACTGACTA AAATGGCCGC CAAACGACAT
CGGCTGGAAT CGGCGTTGAC TGTCGGTGCG ACAGAGTTGC AGCGATTCCG CCTTCATGCT
CCAGCACGGA CCATGCCGTT GATGGAGCCG CTACGTCAGG CGGCAGAAAA ACTGGCTCAG
GCGCAGGCTG ATTTAAGCCG CTGCTGA
 
Protein sequence
MKTNIKVFTS TDELTTLGRE LGKGGEGAVY DIEEFVDSVA KIYHTPPPAL KQDKLAFMAA 
TADAQLLNYV AWPQATLHGG RGGKVIGFMM PKVSGKEPIH MIYSPAHRRQ SYPHCAWDFL
LYVARNIASS FATVHEHGHV VGDVNQNSFM VGRDSKVVLI DSDSFQINAN GTLHLCEVGV
SHFTSPELQT LSSFVGFERT ANHDNFGLAL LIFHVLFGGR HPYSGVPLIS DAGNALETDI
AHFRYAYASD NQRRGLKPPP RSIPLSMLPG DVEAMFQQAF TESGVATGRP TAKAWVAALD
SLRQQLKKCT VSAMHVYPGH LADCPWCALD NQGVIYFIDL GEEVITTSGD FVLAKVWAMV
MASVAPPALQ LPLPDHFQPT GRPLPLGLLR REYIILIEIA LSALSLLLCG LQTEPRYIIL
VPVLSAIWII GSLTSKAYKA EIQQRREAFN RAKMDYDHLV SQIQQLGGLE GFIAKRTMLE
KMKDEILGLP EEEKRDLAAL QDTARERQKQ KFLEGFFIDV ASIPGVGPAR KAALRSFGIE
TAADVTRRSV KQVKGFGDHL TQAVIDWKAS CERRFVFRPN EAVTPADRQA VLTKMAAKRH
RLESALTVGA TELQRFRLHA PARTMPLMEP LRQAAEKLAQ AQADLSRC