Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0303 |
Symbol | |
ID | 6145078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 311216 |
End bp | 314491 |
Gene Length | 3276 bp |
Protein Length | 1091 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641615200 |
Product | hypothetical protein |
Protein accession | YP_001742408 |
Protein GI | 170682597 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0198659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000318594 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAAAAAA ATTCTTTAAA TGTGATTTTG GATACTGTTT TCTCGGCGAA AAGCATCAAT GGTGAGAAAT CTCATTCTTT TATATCGTAT AAATTAGTTA AAGCTGTAGA AAATTTAAGT GGATATGACT TGGATCTTTT CCTCAATCGT GCGATAAACC ATCCTAACTT CCCTAGCAAT ATGGATTTTT CCTTACGAAC AGTTTTTACA ATTGAACAAT CTAACACTAT TGAATTTAAA AATTTCATTT CAGAATTGGT GTGGTACAAG CATATTTTCA TACGATATAA AAAACACCTT AATGATATAT TATCGGCAAA GGCAAGACTT GAGAAGTTAG TTCTTTTCGC AACAGGTAGT GAATGTATAA AATATTTAGA TGAAATAGAA AGCTCTTATG GAGTTAGTTT TTGGAGCATT GATGCTCGTC TCTTAATAAA TAAAGTTTTG CTTAACGAAA GCAATGGGTT ATATGTTAAA TCAATTCTAT CTAAAACCCA CTATAAACTT ACAGAGTTTA TGTTACAGCA GCTATTATTT AAGCATCAAG TACATAACTT TGATGATTTT TCAAAGAACC TTATAAAAAT ACTAGATGAT ATGCGCACCA CTCAAGATAA TGGGTATGCT AGAAATTTAG CTGATGGTAT TTCATCCTTT TTAATACCTT TAGAGTTTGA TAAAAATATA GATCTCAAAA ATAGAACCTT AGCACCATTT ACCAACCTTC CTTTAATAGA TCAATTTATC ATATTCCAGA GAACTATAAG CGATTTACAA CTTCATGGAA ATGGTTTAAG AAATAGCGAG TTAAGCTTAA TTGCAGAAAT TAACAGCTTT ATTAAAAATA ACTATCTCAA CAATATTTTA GATGGAAGAG AGCGTAATAT AGAGAATATT GATGAACACT ATAAAAGTGT TATTCGACGT TATACTATAG GTGATTATAC TGGGTGTATT TCAGAAATAA ATTCAGTTCG AACGCAGAAG ACAATTTTGC CATTAATTGA AATATATGCA AAATCCCATG TGTATCTTGA TAAGAAAATT TCAGACTCAT GCATATTCAA TAAATTAACA AACTGGCTTA TAGCAATTAT AAAATGCGAC AGACGCGCTG GAAAATACAT CGATGAATTT GAACTTTTAG CATCTAAGAT ATATTTCAAT TCAAATTCAT CCTCTCTATT CTTCACATTA TATAATTTAA TAAGTGATAA CCATAATAAA TTAAACATAT CTCGTATAAA TCTAACCAAA AATGGTTCCT TTGTAACATC CCTTCATTTG AATATAACAA TCAAGGAACT ATGGACTGAA CTTGGAATAA AAGAGAATGA GATACCAAAA TATCGATTAT TTAAATTCAA AGACCTAGCA AATCTATCCT TTGATGAAAT CAAAAAACAC TACGAAGTAT ACAATGATAA TGTAATAATT CGCTCTGAAT ATTTAAAAGA TTATACTAAC TTTTTATTAG AAAATGAAAA GTATGAATTA TGCATCCAAT TTATAGCTGA AAACTGCATC GGCAACCCCG CAAATTATTA CTACTTCCCA ATCCGAAAAA TAATTGATTT TGTAAAAGAC AATATTTTTG ACTATGCAAG CGTGGCCTTA TCTATATTGA TAGATATTTT TGTCAAAAGC ACATCAAACA CTTCAAATGA GCTTCTTCTC GAAAGTTATG AGAGTTTAGT AGATATTTCA AATATTAATA GGCCATCTAC CCATTATTGC TCTAAGCAAT TAACTCCATT AGAGCACTAT TTCCTTAAGT TTATCTGTAT ACCTTCCGTG ATGGATTCAG ACCCTTCATT CACAGGCACG GACGACTTGA AGAAGGAAAG AATTGCTATT ATAGATCTAT TATTAAAAGA ACACGATGAT GCAGATTTAT TGAAAGAGAA AGACGAAATT ATAGATGAAA TATCATTCGA AGAAATAAAA ACAAAATATG AGACAGGAAA AATATTTGTT GATATAGAAA ATTTAAAACG GGCAAAGCTT GAAAGATACA GATATTATTT TGATGCACTT AAGGATTCGC TTATTCTCGG ACTAGAACCA CCTGAAGACT TTGCTTTAAT TACAGACGAT GGGAATATCA CAGCCATTCC ATCTGGCGAT ACAAACTCTA TAATCCATGA ACTTCTAAAA GAACTAATCT CCGATTTTGT TAAAAATGAA AATTACGGTC TTGATAAATA TTTGAGTGCA GATATACGGC ATGGTGTTTT TGAAAATCAA CTCAGATCAA GTGCTGAAAA ATCGCAATTA ATAACTGATA TGGATGGTAC TGGACAGTAT TCGAAAAGCA ATTTAATTAT TGAGACATAT CCATTAATTA ACCCTATTAT AAATAACGAA ATAGGGACAG CAATAGCTGA TTTCTCTTAT CACTTTGACA TTGAATTGGC TAAAGCCAAT TCCTGGTTCA ATGTCAAGAC AATGCTCATA AGTGAATCAC AGCAAGGTAT GATTGATTTC CTCATTTCTG TTGATATGTT TAATTCATTT AAAGCAGCAG TTTCAAATCA AGCATCTTTT GAGAAGTTTT TTGACGCTTG TATTAACTTC ATGTGGGAAA GAACGTTCAG ATGTTTGAAT GATATTAAAG AGCGATTGCA TTACGAATTT AAGAGGAATA TTTTAGATTT AATTGCCACT CTTAGGCATC AAATTGATAT CTATAGAAGA CGTAGTTCCA TGAGAGAAAT CCAGGAGAAA ATTGACCTCT TGGCCGAAAG CATTAATAAA GAAATTGCAA CAGTCTGCGA TTGGTTAAAT GTACTTGAGT ATAACGAGGA GAAAATATAT AAAATATCAT CCGTGATGCA GGCATGTAGG AAAACTTTTT TTAATATACA CCATTGTTCT GATGATGCTA TCGTTTTTGA TTCAATGTAT CATGATGATG AACCAAAATT ATCTTATAAA GAAGCCAAGC CTCTAATCAC ATCAATAATA ACAGCGTTAA ACAATGCTAT GGCCTATGGT AATAAAAAAA TCTTTATCAG TATTAAGCCT GAAGAAAAGT CATGGAAAAT AACCATCAGA AATTTGATAA TTGAGACAAA GAACAGAACT ACCAACCAAA TTTTACAAGA GATTGATGAT AAAATAAAAC GAGGGGATAA TAGTCTAAAT ATAAAAGAAG GCGGAGCTGG AATATATAAA ATATATGATT TATTATGTTC ATTACCGCAA AGATTTAATG TGAATCATTG CATCCAAAAT AACGAATTCA TTTTAAATAT TGAGATAAAG AAATGA
|
Protein sequence | MKKNSLNVIL DTVFSAKSIN GEKSHSFISY KLVKAVENLS GYDLDLFLNR AINHPNFPSN MDFSLRTVFT IEQSNTIEFK NFISELVWYK HIFIRYKKHL NDILSAKARL EKLVLFATGS ECIKYLDEIE SSYGVSFWSI DARLLINKVL LNESNGLYVK SILSKTHYKL TEFMLQQLLF KHQVHNFDDF SKNLIKILDD MRTTQDNGYA RNLADGISSF LIPLEFDKNI DLKNRTLAPF TNLPLIDQFI IFQRTISDLQ LHGNGLRNSE LSLIAEINSF IKNNYLNNIL DGRERNIENI DEHYKSVIRR YTIGDYTGCI SEINSVRTQK TILPLIEIYA KSHVYLDKKI SDSCIFNKLT NWLIAIIKCD RRAGKYIDEF ELLASKIYFN SNSSSLFFTL YNLISDNHNK LNISRINLTK NGSFVTSLHL NITIKELWTE LGIKENEIPK YRLFKFKDLA NLSFDEIKKH YEVYNDNVII RSEYLKDYTN FLLENEKYEL CIQFIAENCI GNPANYYYFP IRKIIDFVKD NIFDYASVAL SILIDIFVKS TSNTSNELLL ESYESLVDIS NINRPSTHYC SKQLTPLEHY FLKFICIPSV MDSDPSFTGT DDLKKERIAI IDLLLKEHDD ADLLKEKDEI IDEISFEEIK TKYETGKIFV DIENLKRAKL ERYRYYFDAL KDSLILGLEP PEDFALITDD GNITAIPSGD TNSIIHELLK ELISDFVKNE NYGLDKYLSA DIRHGVFENQ LRSSAEKSQL ITDMDGTGQY SKSNLIIETY PLINPIINNE IGTAIADFSY HFDIELAKAN SWFNVKTMLI SESQQGMIDF LISVDMFNSF KAAVSNQASF EKFFDACINF MWERTFRCLN DIKERLHYEF KRNILDLIAT LRHQIDIYRR RSSMREIQEK IDLLAESINK EIATVCDWLN VLEYNEEKIY KISSVMQACR KTFFNIHHCS DDAIVFDSMY HDDEPKLSYK EAKPLITSII TALNNAMAYG NKKIFISIKP EEKSWKITIR NLIIETKNRT TNQILQEIDD KIKRGDNSLN IKEGGAGIYK IYDLLCSLPQ RFNVNHCIQN NEFILNIEIK K
|
| |