Gene EcSMS35_2329 details

Gene Information

Locus tag: EcSMS35_2329
Symbol:
ID: 6144241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2361958
End bp: 2363547
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 51%
IMG OID: 641617203
Product: ABC transporter, ATP-binding protein
Protein accession: YP_001744376
Protein GI: 170680600
COG category: [R] General function prediction only
COG ID: [COG4172] ABC-type uncharacterized transport system, duplicated ATPase component
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
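
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2361958, 2363547)
print(length, "bp")                      # gene length from the coordinates
print(protein_length_aa(length), "aa")   # protein length from the gene length
```

Under these assumptions the coordinates give 1590 bp and 529 aa, matching the Gene Length and Protein Length fields.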


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.787685
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0242694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA CTCTGTTAGC GATTGAAAAT TTGTCGGTGG GTTTTCGCCA TCAGCAAACC 
GTACGTACAG TAGTCAATGA TGTTTCACTT CAGATTGAGG CTGGCGAAAC GCTGGCGCTG
GTGGGTGAGT CAGGTTCAGG CAAAAGCGTT ACCGCGCTGT CAATTTTACG CCTGCTCCCT
TCCCCGCCGG TTGAATATCT CTCCGGCGAT ATTCGTTTTC ATGGCGAATC GCTGCTTCAT
GCCAGCGATC AAACGTTACG CGGTGTACGC GGTAATAAGA TTGCCATGAT TTTTCAGGAG
CCGATGGTGT CATTAAATCC ATTGCATACC CTGGAAAAAC AGCTTTATGA AGTGCTTTCA
CTCCACCGCG GGATGCGTCG GGAAGCGGCT CGTGGCGAAA TTCTTAACTG CCTTGATCGC
GTCGGTATCC GCCAGGCGGC AAAACGGCTA ACAGATTATC CGCATCAGCT CTCCGGCGGC
GAACGCCAGC GGGTGATGAT TGCGATGGCG CTGTTAACGC GACCGGAATT ATTAATTGCC
GATGAACCGA CCACCGCGCT GGACGTCTCT GTCCAGGCGC AGATTTTACA GCTGTTGCGC
GAACTGCAAG GCGAGCTGAA TATGGGCATG CTGTTTATTA CTCATAACCT CAGCATTGTC
AGAAAACTGG CCCACCGCGT GGCGGTAATG CAAAACGGTC GCTGTGTCGA GCAAAATAAC
GCTGCTACGC TATTTGCCTC ACCCACTCAT CCTTACACAC AAAAGCTACT CAACAGTGAA
CCATCTGGCG ACCCGGTGCC ATTGCCAGAA CCTGCCTCTA CGTTGCTGGA TGTTGAACAG
CTTCAGGTTG CCTTCCCCAT TCGCAAAGGG ATTTTGAAGC GCATTGTGGA TCATAATGTG
GTGGTGAAAA ACATCAGTTT TACGCTACGG GCGGGTGAAA CACTGGGTTT AGTGGGCGAG
TCCGGTTCCG GGAAAAGCAC GACGGGACTG GCGCTGCTGC GACTGATTAA TTCTCATGGC
AGCATCGTCT TTGACGGTCA GCCACTGCAA AATTTAAATC GCCGCCAGCT GTTACCTATT
CGTCATCGCA TTCAGGTGGT ATTTCAGGAT CCAAACTCCT CACTCAACCC ACGACTCAAC
GTTTTGCAAA TTATTGAGGA AGGCTTACGG GTTCACCAGC CGACGCTTTC TGCCGCACAA
CGCGAACAAC AAGTGATAGC CGTGATGCAT GAAGTGGGAT TAGATCCTGA AACACGCCAC
CGTTATCCGG CGGAGTTCTC TGGTGGTCAG CGGCAACGTA TTGCGATTGC CAGGGCGTTA
ATTCTTAAGC CCTCGCTGAT CATTCTTGAT GAACCAACAT CATCACTCGA CAAAACGGTG
CAGGCGCAAA TATTGACGCT ATTGAAATCA TTGCAACAAA AGCATCAACT GGCCTATTTG
TTTATCAGTC ACGATTTGCA CGTTGTCCGC GCGTTATGTC ATCAGGTTAT CGTACTGCGA
CAAGGGGAAG TAGTGGAACA AGGACCGTGC GCGCGCGTGT TTGCCGCACC GCAGCAGGAG
TATACGCGTC AGCTACTGGC GTTGAGCTGA
 
Protein sequence
MTQTLLAIEN LSVGFRHQQT VRTVVNDVSL QIEAGETLAL VGESGSGKSV TALSILRLLP 
SPPVEYLSGD IRFHGESLLH ASDQTLRGVR GNKIAMIFQE PMVSLNPLHT LEKQLYEVLS
LHRGMRREAA RGEILNCLDR VGIRQAAKRL TDYPHQLSGG ERQRVMIAMA LLTRPELLIA
DEPTTALDVS VQAQILQLLR ELQGELNMGM LFITHNLSIV RKLAHRVAVM QNGRCVEQNN
AATLFASPTH PYTQKLLNSE PSGDPVPLPE PASTLLDVEQ LQVAFPIRKG ILKRIVDHNV
VVKNISFTLR AGETLGLVGE SGSGKSTTGL ALLRLINSHG SIVFDGQPLQ NLNRRQLLPI
RHRIQVVFQD PNSSLNPRLN VLQIIEEGLR VHQPTLSAAQ REQQVIAVMH EVGLDPETRH
RYPAEFSGGQ RQRIAIARAL ILKPSLIILD EPTSSLDKTV QAQILTLLKS LQQKHQLAYL
FISHDLHVVR ALCHQVIVLR QGEVVEQGPC ARVFAAPQQE YTRQLLALS
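
The relationship between the two sequences above and the GC content field can be checked with a short script. This is a minimal sketch, not the method the database used: it assumes the sequence is pasted as text in the 10-base blocks shown above, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because this gene begins with ATG). The names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGACGCAAA CTCTGTTAGC GATTGAAAAT"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) is one way to confirm that the two listings agree; `gc_percent` on the same input can be compared with the rounded GC content field.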