Gene Information |
Locus tag | EcSMS35_2329 |
Symbol | |
ID | 6144241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2361958 |
End bp | 2363547 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617203 |
Product | ABC transporter, ATP-binding protein |
Protein accession | YP_001744376 |
Protein GI | 170680600 |
COG category | [R] General function prediction only |
COG ID | [COG4172] ABC-type uncharacterized transport system, duplicated ATPase component |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
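The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the last codon is a stop codon that does not appear in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2361958
end_bp = 2363547

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the final codon is assumed to be a stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1590 529
```

Both results match the Gene Length (1590 bp) and Protein Length (529 aa) rows.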
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.787685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0242694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAA CTCTGTTAGC GATTGAAAAT TTGTCGGTGG GTTTTCGCCA TCAGCAAACC GTACGTACAG TAGTCAATGA TGTTTCACTT CAGATTGAGG CTGGCGAAAC GCTGGCGCTG GTGGGTGAGT CAGGTTCAGG CAAAAGCGTT ACCGCGCTGT CAATTTTACG CCTGCTCCCT TCCCCGCCGG TTGAATATCT CTCCGGCGAT ATTCGTTTTC ATGGCGAATC GCTGCTTCAT GCCAGCGATC AAACGTTACG CGGTGTACGC GGTAATAAGA TTGCCATGAT TTTTCAGGAG CCGATGGTGT CATTAAATCC ATTGCATACC CTGGAAAAAC AGCTTTATGA AGTGCTTTCA CTCCACCGCG GGATGCGTCG GGAAGCGGCT CGTGGCGAAA TTCTTAACTG CCTTGATCGC GTCGGTATCC GCCAGGCGGC AAAACGGCTA ACAGATTATC CGCATCAGCT CTCCGGCGGC GAACGCCAGC GGGTGATGAT TGCGATGGCG CTGTTAACGC GACCGGAATT ATTAATTGCC GATGAACCGA CCACCGCGCT GGACGTCTCT GTCCAGGCGC AGATTTTACA GCTGTTGCGC GAACTGCAAG GCGAGCTGAA TATGGGCATG CTGTTTATTA CTCATAACCT CAGCATTGTC AGAAAACTGG CCCACCGCGT GGCGGTAATG CAAAACGGTC GCTGTGTCGA GCAAAATAAC GCTGCTACGC TATTTGCCTC ACCCACTCAT CCTTACACAC AAAAGCTACT CAACAGTGAA CCATCTGGCG ACCCGGTGCC ATTGCCAGAA CCTGCCTCTA CGTTGCTGGA TGTTGAACAG CTTCAGGTTG CCTTCCCCAT TCGCAAAGGG ATTTTGAAGC GCATTGTGGA TCATAATGTG GTGGTGAAAA ACATCAGTTT TACGCTACGG GCGGGTGAAA CACTGGGTTT AGTGGGCGAG TCCGGTTCCG GGAAAAGCAC GACGGGACTG GCGCTGCTGC GACTGATTAA TTCTCATGGC AGCATCGTCT TTGACGGTCA GCCACTGCAA AATTTAAATC GCCGCCAGCT GTTACCTATT CGTCATCGCA TTCAGGTGGT ATTTCAGGAT CCAAACTCCT CACTCAACCC ACGACTCAAC GTTTTGCAAA TTATTGAGGA AGGCTTACGG GTTCACCAGC CGACGCTTTC TGCCGCACAA CGCGAACAAC AAGTGATAGC CGTGATGCAT GAAGTGGGAT TAGATCCTGA AACACGCCAC CGTTATCCGG CGGAGTTCTC TGGTGGTCAG CGGCAACGTA TTGCGATTGC CAGGGCGTTA ATTCTTAAGC CCTCGCTGAT CATTCTTGAT GAACCAACAT CATCACTCGA CAAAACGGTG CAGGCGCAAA TATTGACGCT ATTGAAATCA TTGCAACAAA AGCATCAACT GGCCTATTTG TTTATCAGTC ACGATTTGCA CGTTGTCCGC GCGTTATGTC ATCAGGTTAT CGTACTGCGA CAAGGGGAAG TAGTGGAACA AGGACCGTGC GCGCGCGTGT TTGCCGCACC GCAGCAGGAG TATACGCGTC AGCTACTGGC GTTGAGCTGA
|
Protein sequence | MTQTLLAIEN LSVGFRHQQT VRTVVNDVSL QIEAGETLAL VGESGSGKSV TALSILRLLP SPPVEYLSGD IRFHGESLLH ASDQTLRGVR GNKIAMIFQE PMVSLNPLHT LEKQLYEVLS LHRGMRREAA RGEILNCLDR VGIRQAAKRL TDYPHQLSGG ERQRVMIAMA LLTRPELLIA DEPTTALDVS VQAQILQLLR ELQGELNMGM LFITHNLSIV RKLAHRVAVM QNGRCVEQNN AATLFASPTH PYTQKLLNSE PSGDPVPLPE PASTLLDVEQ LQVAFPIRKG ILKRIVDHNV VVKNISFTLR AGETLGLVGE SGSGKSTTGL ALLRLINSHG SIVFDGQPLQ NLNRRQLLPI RHRIQVVFQD PNSSLNPRLN VLQIIEEGLR VHQPTLSAAQ REQQVIAVMH EVGLDPETRH RYPAEFSGGQ RQRIAIARAL ILKPSLIILD EPTSSLDKTV QAQILTLLKS LQQKHQLAYL FISHDLHVVR ALCHQVIVLR QGEVVEQGPC ARVFAAPQQE YTRQLLALS
|
| |
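The gene and protein sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before they can be processed. The following is a minimal sketch of how the Gene sequence field might be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 uses the same amino acid assignments and differs only in which codons may act as start codons, which this sketch does not model (the gene here starts with ATG).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field above.
prefix = "ATGACGCAAA CTCTGTTAGC GATTGAAAAT"
print(translate(prefix))  # MTQTLLAIEN
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein sequence and GC content rows.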