Gene EcolC_1467 details


Gene Information

Locus tag: EcolC_1467
Symbol:
ID: 6067246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1617285
End bp: 1618874
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 51%
IMG OID: 641600887
Product: ABC transporter related
Protein accession: YP_001724457
Protein GI: 170019503
COG category: [R] General function prediction only
COG ID: [COG4172] ABC-type uncharacterized transport system, duplicated ATPase component
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
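
The coordinate and length fields above are mutually consistent, which a short check makes explicit. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record for EcolC_1467.
length_bp = gene_length(1617285, 1618874)
print(length_bp, "bp")                   # 1590 bp
print(protein_length(length_bp), "aa")   # 529 aa
```

Both results match the Gene Length and Protein Length fields of the record.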


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000937636
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.218677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA CTCTGTTAGC GATTGAAAAT TTGTCGGTGG GTTTTCGCCA TCAGCAAACC 
GTACGTACAG TAGTCAATGA TGTTTCACTA CAGATTGAGG CTGGCGAAAC GCTGGCGCTG
GTGGGTGAGT CAGGTTCAGG CAAAAGCGTT ACCGCGCTGT CAATTTTACG CCTGCTCCCT
TCCCCGCCGG TTGAATATCT CTCCGGCGAT ATTCGTTTTC ATGGCGAATC GCTGCTTCAC
GCCAGCGATC AAACGTTGCG CGGTGTACGC GGTAATAAGA TCGCCATGAT TTTTCAGGAA
CCGATGGTGT CGTTAAATCC ATTGCATACC CTGGAAAAAC AGCTTTATGA AGTGCTTTCA
CTCCACCGCG GGATGCGTCG GGAAGCGGCT CGTGGCGAAA TTCTTAACTG CCTTGATCGC
GTTGGTATCC GCCAGGCGGC AAAACGGCTG ACAGATTATC CGCATCAGCT CTCCGGCGGC
GAACGGCAGC GGGTGATGAT TGCGATGGCG CTGTTAACGC GACCGGAATT ATTAATTGCC
GATGAACCGA CCACCGCACT GGACGTCTCT GTCCAGGCGC AGATTTTACA GCTGTTGCGC
GAACTGCAAG GCGAGCTGAA TATGGGCATG CTGTTTATTA CTCATAACCT CAGCATTGTC
AGAAAACTGG CCCACCGCGT GGCGGTAATG CAAAACGGTC GCTGTGTCGA GCAAAATTAC
GCCGCTACGC TATTTGCATC ACCCACTCAT CCTTACACAC AAAAGCTACT CAACAGTGAA
CCGTCAGGCG ATCCAGTGCC GTTGCCAGAA CCTGCCTCAA CGTTGCTGGA TGTTGAACAG
CTTCAGGTTG CCTTCCCCAT TCGCAAAGGG ATTTTGAAGC GCATTGTGGA TCATAATGTG
GTGGTGAAAA ACATCAGTTT TACGCTACGA GCGGGTGAAA CACTGGGTTT AGTGGGCGAG
TCCGGTTCCG GGAAAAGTAC GACGGGACTG GCGCTGCTGC GACTGATTAA TTCTCAGGGC
AGCATCATCT TTGACGGTCA GCCACTGCAA AATTTAAATC GCCGCCAGCT GTTACCTATT
CGTCATCGCA TTCAGGTGGT ATTTCAGGAT CCAAACTCCT CGCTCAACCC ACGACTCAAC
GTTTTGCAGA TTATTGAGGA AGGCTTACGG GTTCACCAGC CGACGCTTTC TGCCGCACAA
CGCGAACAAC AAGTGATAGC CGTGATGCAT GAAGTGGGAT TAGATCCTGA AACACGCCAC
CGTTATCCGG CGGAGTTCTC TGGTGGTCAG CGACAACGTA TTGCGATTGC CAGGGCATTA
ATTCTTAAGC CCTCGCTGAT CATACTTGAT GAACCGACAT CATCACTCGA CAAAACGGTA
CAGGCGCAAA TATTGACGCT ATTGAAATCA TTGCAACAAA AGCATCAACT GGCCTATTTG
TTTATCAGCC ACGATTTGCA CGTTGTCCGC GCGTTATGTC ATCAGGTTAT CATACTGCGA
CAAGGGGAAG TAGTGGAACA AGGACCGTGC GCGCGCGTGT TTGCCACACC GCAGCAGGAG
TATACGCGTC AGCTACTGGC GTTGAGCTGA
 
Protein sequence
MTQTLLAIEN LSVGFRHQQT VRTVVNDVSL QIEAGETLAL VGESGSGKSV TALSILRLLP 
SPPVEYLSGD IRFHGESLLH ASDQTLRGVR GNKIAMIFQE PMVSLNPLHT LEKQLYEVLS
LHRGMRREAA RGEILNCLDR VGIRQAAKRL TDYPHQLSGG ERQRVMIAMA LLTRPELLIA
DEPTTALDVS VQAQILQLLR ELQGELNMGM LFITHNLSIV RKLAHRVAVM QNGRCVEQNY
AATLFASPTH PYTQKLLNSE PSGDPVPLPE PASTLLDVEQ LQVAFPIRKG ILKRIVDHNV
VVKNISFTLR AGETLGLVGE SGSGKSTTGL ALLRLINSQG SIIFDGQPLQ NLNRRQLLPI
RHRIQVVFQD PNSSLNPRLN VLQIIEEGLR VHQPTLSAAQ REQQVIAVMH EVGLDPETRH
RYPAEFSGGQ RQRIAIARAL ILKPSLIILD EPTSSLDKTV QAQILTLLKS LQQKHQLAYL
FISHDLHVVR ALCHQVIILR QGEVVEQGPC ARVFATPQQE YTRQLLALS
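
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The Python sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the NCBI amino acid string for translation table 11 (the same as the standard code for internal codons), ignores the alternative start codons that table 11 permits, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGACGCAAA CTCTGTTAGC GATTGAAAAT"))  # MTQTLLAIEN
```

Applied to the full gene sequence, `translate` is expected to return the 529-residue protein sequence listed above, and `gc_percent` can be compared against the GC content field of the record.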