Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0913 |
Symbol | |
ID | 6145239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 921240 |
End bp | 922157 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615801 |
Product | quaternary amine ABC transporter periplasmic substrate-binding protein |
Protein accession | YP_001742993 |
Protein GI | 170683010 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.768792 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACTCT CAAAGGTCTG GGCAGGTTCA CTGGTTTTGT TGGCAGCCGT GAGCCTGCCG CTGCACGCGG CTTCCCCCGT TAAAGTCGGT TCAAAAATCG ATACCGAAGG CGCGCTGCTC GGCAATATCA TTTTGCAGGT GCTGGAAAGT CACGGCGTTC CTACGGTAAA TAAAGTGCAA TTGGGAACGA CTCCCGTAGT GCGCGGGGCG ATCACTTCTG GTGAGCTGGA TATCTATCCG GAATATACTG GCAATGGCGC GTTTTTCTTT AAAGATGAAA ACGATACGGC GTGGAAAAAT GCCGGGCAAG GCTACGAGAA AGTCAAAAAG CTCGATGCAG AGCAAAACAA GTTAATCTGG CTGACGCCTG CGCCTGCAAA TAACACCTGG ACCATCGCCG TGCGTCAGGA TGTGGCAGAG AAAAACAAAC TCACTTCGCT TGCTGACCTG AGTCGTTATC TGCAAGAGGG CGGCACCTTC AAACTGGCTG CCTCAGCAGA GTTTATCGAA CGCGCCGATG CGTTACCCGC CTTTGAAAAA GCCTATGGCT TTAAGCTCGG TCAGGATCAG TTGCTGTCAC TGGCTGGTGG CGACACGGCG GTGACGATCA AGGCCGCTGC CCAGCAAACA TCCGGCGTTA ATGCCGCAAT GGCTTACGGC ACTGACGGTC CGGTCGCGGC GCTGGGGCTG CAAACCTTAA GCGATCCGCA AGGCGTTCAG CCTATCTACG CGCCTGCACC AGTGGTGCGA GAGTCGGTGT TGAAAGAGTA TCCGCAAATG GCACAGTGGC TACAGCCAGT CTTCGCCAGC CTCGATGAAA AAACATTACA GCAACTGAAT GCCAGCATTG CCGTTGAAGG ACTGGATGCC AAAAAAGTGG CTGCCGACTA CCTGAAACAA AAAGGGTGGA CGAAGTAA
|
Protein sequence | MPLSKVWAGS LVLLAAVSLP LHAASPVKVG SKIDTEGALL GNIILQVLES HGVPTVNKVQ LGTTPVVRGA ITSGELDIYP EYTGNGAFFF KDENDTAWKN AGQGYEKVKK LDAEQNKLIW LTPAPANNTW TIAVRQDVAE KNKLTSLADL SRYLQEGGTF KLAASAEFIE RADALPAFEK AYGFKLGQDQ LLSLAGGDTA VTIKAAAQQT SGVNAAMAYG TDGPVAALGL QTLSDPQGVQ PIYAPAPVVR ESVLKEYPQM AQWLQPVFAS LDEKTLQQLN ASIAVEGLDA KKVAADYLKQ KGWTK
|
| |