Gene EcSMS35_0844 details


Gene Information

Locus tag: EcSMS35_0844
Symbol:
ID: 6142991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 847642
End bp: 849234
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 53%
IMG OID: 641615732
Product: ABC transporter, ATP-binding protein
Protein accession: YP_001742924
Protein GI: 170681928
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
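
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 847642
end_bp = 849234

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.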


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTAGTTT CCAGTAACGT CACCATGCAG TTCGGCAGTA AGCCGTTGTT TGAAAACATT 
TCCGTCAAGT TTGGCGGCGG CAACCGTTAC GGCCTGATTG GCGCGAACGG TAGTGGTAAA
TCCACCTTTA TGAAGATCCT CGGCGGCGAC TTAGAGCCGA CGCTGGGTAA CGTTTCCCTC
GATCCCAACG AGCGCATTGG TAAATTGCGT CAGGATCAGT TTGCCTTTGA AGAGTTCACC
GTGCTGGATA CGGTGATCAT GGGGCATAAA GAGTTGTGGG AAGTGAAGCA GGAGCGCGAC
CGTATCTATG CTTTGCCGGA AATGAGCGAA GAAGACGGTT ATAAAGTGGC CGATCTGGAA
GTAAAATACG GCGAAATGGA CGGTTACTCT GCGGAAGCTC GCGCCGGTGA ACTGTTGCTT
GGCGTGGGAA TTCCAGTGGA ACAGCACTAC GGCCCGATGA GTGAAGTTGC TCCTGGCTGG
AAGCTGCGTG TGCTGCTGGC GCAGGCGCTG TTTGCTGATC CGGACATTCT CCTGCTCGAC
GAACCGACCA ACAACCTCGA CATCGACACC ATTCGTTGGC TGGAACAGGT GCTGAACGAG
CGTGACAGCA CCATGATCAT CATCTCGCAC GACCGTCACT TCCTTAACAT GGTTTGTACC
CATATGGCGG ATCTGGATTA CGGCGAGCTG CGCGTTTATC CAGGTAACTA CGATGAGTAC
ATGACGGCGG CGACCCAGGC GCGTGAACGT CTGCTGGCTG ATAATGCCAA GAAGAAAGCG
CAGATTGCCG AGCTGCAATC TTTCGTTAGC CGCTTTAGCG CCAACGCCTC GAAATCTCGC
CAGGCAACTT CGCGTGCGCG CCAGATCGAT AAAATCAAAC TGGAAGAGGT GAAAGCTTCC
AGCCGTCAGA ACCCGTTCAT CCGTTTTGAG CAGGATAAGA AACTGTTCCG TAACGCGCTG
GAAGTGGAAG GTCTGACCAA AGGGTTTGAT AACGGTCCAC TGTTTAAAAA TCTCAACCTG
CTGCTGGAAG TCGGTGAAAA ACTGGCGGTA CTGGGTACCA ACGGCGTCGG TAAATCAACG
CTGCTGAAAA CGCTGGTGGG CGATCTGCAA CCGGACAGCG GCACCGTAAA ATGGTCTGAG
AACGCGCGCA TTGGTTACTA CGCCCAGGAC CATGAATATG AGTTCGAAAA TGATCTGACC
GTGTTCGAAT GGATGAGCCA GTGGAAGCAG GAAGGCGATG ACGAGCAGGC GGTACGCAGC
ATTCTCGGTC GTTTGCTGTT CAGCCAGGAC GATATCAAAA AGCCAGCGAA AGTGCTTTCC
GGTGGGGAGA AAGGGCGGAT GCTGTTTGGT AAGTTAATGA TGCAGAAGCC GAACATTCTG
ATCATGGACG AACCGACCAA CCACCTGGAT ATGGAATCTA TTGAGTCGCT GAACATGGCG
CTGGAACTGT ATCAGGGCAC GCTGATCTTT GTTTCACACG ACCGTGAGTT CGTAAGCTCC
CTGGCGACCC GTATTCTGGA AATCACCCCG GAACGCGTGA TCGACTTTAG CGGTAATTAC
GAAGATTATC TGCGTAGTAA AGGGATCGAG TAA
 
Protein sequence
MLVSSNVTMQ FGSKPLFENI SVKFGGGNRY GLIGANGSGK STFMKILGGD LEPTLGNVSL 
DPNERIGKLR QDQFAFEEFT VLDTVIMGHK ELWEVKQERD RIYALPEMSE EDGYKVADLE
VKYGEMDGYS AEARAGELLL GVGIPVEQHY GPMSEVAPGW KLRVLLAQAL FADPDILLLD
EPTNNLDIDT IRWLEQVLNE RDSTMIIISH DRHFLNMVCT HMADLDYGEL RVYPGNYDEY
MTAATQARER LLADNAKKKA QIAELQSFVS RFSANASKSR QATSRARQID KIKLEEVKAS
SRQNPFIRFE QDKKLFRNAL EVEGLTKGFD NGPLFKNLNL LLEVGEKLAV LGTNGVGKST
LLKTLVGDLQ PDSGTVKWSE NARIGYYAQD HEYEFENDLT VFEWMSQWKQ EGDDEQAVRS
ILGRLLFSQD DIKKPAKVLS GGEKGRMLFG KLMMQKPNIL IMDEPTNHLD MESIESLNMA
LELYQGTLIF VSHDREFVSS LATRILEITP ERVIDFSGNY EDYLRSKGIE
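
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record. The helper names `translate_cds` and `gc_fraction` are hypothetical, and the start-codon set is a simplified subset of those allowed by translation table 11. The codon assignments are those of the standard genetic code, which table 11 shares. The only special handling is that an alternative start codon such as GTG is read as methionine at the first position.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: simplified subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is always read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence shown above.
print(translate_cds("GTGTTAGTTT CCAGTAACGT CACCATGCAG"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MLVSSNVTMQ, which matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.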