Gene EcSMS35_0303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0303 
Symbol 
ID6145078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp311216 
End bp314491 
Gene Length3276 bp 
Protein Length1091 aa 
Translation table11 
GC content30% 
IMG OID641615200 
Producthypothetical protein 
Protein accessionYP_001742408 
Protein GI170682597 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0198659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0000318594 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAAAAAA ATTCTTTAAA TGTGATTTTG GATACTGTTT TCTCGGCGAA AAGCATCAAT 
GGTGAGAAAT CTCATTCTTT TATATCGTAT AAATTAGTTA AAGCTGTAGA AAATTTAAGT
GGATATGACT TGGATCTTTT CCTCAATCGT GCGATAAACC ATCCTAACTT CCCTAGCAAT
ATGGATTTTT CCTTACGAAC AGTTTTTACA ATTGAACAAT CTAACACTAT TGAATTTAAA
AATTTCATTT CAGAATTGGT GTGGTACAAG CATATTTTCA TACGATATAA AAAACACCTT
AATGATATAT TATCGGCAAA GGCAAGACTT GAGAAGTTAG TTCTTTTCGC AACAGGTAGT
GAATGTATAA AATATTTAGA TGAAATAGAA AGCTCTTATG GAGTTAGTTT TTGGAGCATT
GATGCTCGTC TCTTAATAAA TAAAGTTTTG CTTAACGAAA GCAATGGGTT ATATGTTAAA
TCAATTCTAT CTAAAACCCA CTATAAACTT ACAGAGTTTA TGTTACAGCA GCTATTATTT
AAGCATCAAG TACATAACTT TGATGATTTT TCAAAGAACC TTATAAAAAT ACTAGATGAT
ATGCGCACCA CTCAAGATAA TGGGTATGCT AGAAATTTAG CTGATGGTAT TTCATCCTTT
TTAATACCTT TAGAGTTTGA TAAAAATATA GATCTCAAAA ATAGAACCTT AGCACCATTT
ACCAACCTTC CTTTAATAGA TCAATTTATC ATATTCCAGA GAACTATAAG CGATTTACAA
CTTCATGGAA ATGGTTTAAG AAATAGCGAG TTAAGCTTAA TTGCAGAAAT TAACAGCTTT
ATTAAAAATA ACTATCTCAA CAATATTTTA GATGGAAGAG AGCGTAATAT AGAGAATATT
GATGAACACT ATAAAAGTGT TATTCGACGT TATACTATAG GTGATTATAC TGGGTGTATT
TCAGAAATAA ATTCAGTTCG AACGCAGAAG ACAATTTTGC CATTAATTGA AATATATGCA
AAATCCCATG TGTATCTTGA TAAGAAAATT TCAGACTCAT GCATATTCAA TAAATTAACA
AACTGGCTTA TAGCAATTAT AAAATGCGAC AGACGCGCTG GAAAATACAT CGATGAATTT
GAACTTTTAG CATCTAAGAT ATATTTCAAT TCAAATTCAT CCTCTCTATT CTTCACATTA
TATAATTTAA TAAGTGATAA CCATAATAAA TTAAACATAT CTCGTATAAA TCTAACCAAA
AATGGTTCCT TTGTAACATC CCTTCATTTG AATATAACAA TCAAGGAACT ATGGACTGAA
CTTGGAATAA AAGAGAATGA GATACCAAAA TATCGATTAT TTAAATTCAA AGACCTAGCA
AATCTATCCT TTGATGAAAT CAAAAAACAC TACGAAGTAT ACAATGATAA TGTAATAATT
CGCTCTGAAT ATTTAAAAGA TTATACTAAC TTTTTATTAG AAAATGAAAA GTATGAATTA
TGCATCCAAT TTATAGCTGA AAACTGCATC GGCAACCCCG CAAATTATTA CTACTTCCCA
ATCCGAAAAA TAATTGATTT TGTAAAAGAC AATATTTTTG ACTATGCAAG CGTGGCCTTA
TCTATATTGA TAGATATTTT TGTCAAAAGC ACATCAAACA CTTCAAATGA GCTTCTTCTC
GAAAGTTATG AGAGTTTAGT AGATATTTCA AATATTAATA GGCCATCTAC CCATTATTGC
TCTAAGCAAT TAACTCCATT AGAGCACTAT TTCCTTAAGT TTATCTGTAT ACCTTCCGTG
ATGGATTCAG ACCCTTCATT CACAGGCACG GACGACTTGA AGAAGGAAAG AATTGCTATT
ATAGATCTAT TATTAAAAGA ACACGATGAT GCAGATTTAT TGAAAGAGAA AGACGAAATT
ATAGATGAAA TATCATTCGA AGAAATAAAA ACAAAATATG AGACAGGAAA AATATTTGTT
GATATAGAAA ATTTAAAACG GGCAAAGCTT GAAAGATACA GATATTATTT TGATGCACTT
AAGGATTCGC TTATTCTCGG ACTAGAACCA CCTGAAGACT TTGCTTTAAT TACAGACGAT
GGGAATATCA CAGCCATTCC ATCTGGCGAT ACAAACTCTA TAATCCATGA ACTTCTAAAA
GAACTAATCT CCGATTTTGT TAAAAATGAA AATTACGGTC TTGATAAATA TTTGAGTGCA
GATATACGGC ATGGTGTTTT TGAAAATCAA CTCAGATCAA GTGCTGAAAA ATCGCAATTA
ATAACTGATA TGGATGGTAC TGGACAGTAT TCGAAAAGCA ATTTAATTAT TGAGACATAT
CCATTAATTA ACCCTATTAT AAATAACGAA ATAGGGACAG CAATAGCTGA TTTCTCTTAT
CACTTTGACA TTGAATTGGC TAAAGCCAAT TCCTGGTTCA ATGTCAAGAC AATGCTCATA
AGTGAATCAC AGCAAGGTAT GATTGATTTC CTCATTTCTG TTGATATGTT TAATTCATTT
AAAGCAGCAG TTTCAAATCA AGCATCTTTT GAGAAGTTTT TTGACGCTTG TATTAACTTC
ATGTGGGAAA GAACGTTCAG ATGTTTGAAT GATATTAAAG AGCGATTGCA TTACGAATTT
AAGAGGAATA TTTTAGATTT AATTGCCACT CTTAGGCATC AAATTGATAT CTATAGAAGA
CGTAGTTCCA TGAGAGAAAT CCAGGAGAAA ATTGACCTCT TGGCCGAAAG CATTAATAAA
GAAATTGCAA CAGTCTGCGA TTGGTTAAAT GTACTTGAGT ATAACGAGGA GAAAATATAT
AAAATATCAT CCGTGATGCA GGCATGTAGG AAAACTTTTT TTAATATACA CCATTGTTCT
GATGATGCTA TCGTTTTTGA TTCAATGTAT CATGATGATG AACCAAAATT ATCTTATAAA
GAAGCCAAGC CTCTAATCAC ATCAATAATA ACAGCGTTAA ACAATGCTAT GGCCTATGGT
AATAAAAAAA TCTTTATCAG TATTAAGCCT GAAGAAAAGT CATGGAAAAT AACCATCAGA
AATTTGATAA TTGAGACAAA GAACAGAACT ACCAACCAAA TTTTACAAGA GATTGATGAT
AAAATAAAAC GAGGGGATAA TAGTCTAAAT ATAAAAGAAG GCGGAGCTGG AATATATAAA
ATATATGATT TATTATGTTC ATTACCGCAA AGATTTAATG TGAATCATTG CATCCAAAAT
AACGAATTCA TTTTAAATAT TGAGATAAAG AAATGA
 
Protein sequence
MKKNSLNVIL DTVFSAKSIN GEKSHSFISY KLVKAVENLS GYDLDLFLNR AINHPNFPSN 
MDFSLRTVFT IEQSNTIEFK NFISELVWYK HIFIRYKKHL NDILSAKARL EKLVLFATGS
ECIKYLDEIE SSYGVSFWSI DARLLINKVL LNESNGLYVK SILSKTHYKL TEFMLQQLLF
KHQVHNFDDF SKNLIKILDD MRTTQDNGYA RNLADGISSF LIPLEFDKNI DLKNRTLAPF
TNLPLIDQFI IFQRTISDLQ LHGNGLRNSE LSLIAEINSF IKNNYLNNIL DGRERNIENI
DEHYKSVIRR YTIGDYTGCI SEINSVRTQK TILPLIEIYA KSHVYLDKKI SDSCIFNKLT
NWLIAIIKCD RRAGKYIDEF ELLASKIYFN SNSSSLFFTL YNLISDNHNK LNISRINLTK
NGSFVTSLHL NITIKELWTE LGIKENEIPK YRLFKFKDLA NLSFDEIKKH YEVYNDNVII
RSEYLKDYTN FLLENEKYEL CIQFIAENCI GNPANYYYFP IRKIIDFVKD NIFDYASVAL
SILIDIFVKS TSNTSNELLL ESYESLVDIS NINRPSTHYC SKQLTPLEHY FLKFICIPSV
MDSDPSFTGT DDLKKERIAI IDLLLKEHDD ADLLKEKDEI IDEISFEEIK TKYETGKIFV
DIENLKRAKL ERYRYYFDAL KDSLILGLEP PEDFALITDD GNITAIPSGD TNSIIHELLK
ELISDFVKNE NYGLDKYLSA DIRHGVFENQ LRSSAEKSQL ITDMDGTGQY SKSNLIIETY
PLINPIINNE IGTAIADFSY HFDIELAKAN SWFNVKTMLI SESQQGMIDF LISVDMFNSF
KAAVSNQASF EKFFDACINF MWERTFRCLN DIKERLHYEF KRNILDLIAT LRHQIDIYRR
RSSMREIQEK IDLLAESINK EIATVCDWLN VLEYNEEKIY KISSVMQACR KTFFNIHHCS
DDAIVFDSMY HDDEPKLSYK EAKPLITSII TALNNAMAYG NKKIFISIKP EEKSWKITIR
NLIIETKNRT TNQILQEIDD KIKRGDNSLN IKEGGAGIYK IYDLLCSLPQ RFNVNHCIQN
NEFILNIEIK K