Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0844 |
Symbol | |
ID | 6142991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 847642 |
End bp | 849234 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615732 |
Product | ABC transporter, ATP-binding protein |
Protein accession | YP_001742924 |
Protein GI | 170681928 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTAGTTT CCAGTAACGT CACCATGCAG TTCGGCAGTA AGCCGTTGTT TGAAAACATT TCCGTCAAGT TTGGCGGCGG CAACCGTTAC GGCCTGATTG GCGCGAACGG TAGTGGTAAA TCCACCTTTA TGAAGATCCT CGGCGGCGAC TTAGAGCCGA CGCTGGGTAA CGTTTCCCTC GATCCCAACG AGCGCATTGG TAAATTGCGT CAGGATCAGT TTGCCTTTGA AGAGTTCACC GTGCTGGATA CGGTGATCAT GGGGCATAAA GAGTTGTGGG AAGTGAAGCA GGAGCGCGAC CGTATCTATG CTTTGCCGGA AATGAGCGAA GAAGACGGTT ATAAAGTGGC CGATCTGGAA GTAAAATACG GCGAAATGGA CGGTTACTCT GCGGAAGCTC GCGCCGGTGA ACTGTTGCTT GGCGTGGGAA TTCCAGTGGA ACAGCACTAC GGCCCGATGA GTGAAGTTGC TCCTGGCTGG AAGCTGCGTG TGCTGCTGGC GCAGGCGCTG TTTGCTGATC CGGACATTCT CCTGCTCGAC GAACCGACCA ACAACCTCGA CATCGACACC ATTCGTTGGC TGGAACAGGT GCTGAACGAG CGTGACAGCA CCATGATCAT CATCTCGCAC GACCGTCACT TCCTTAACAT GGTTTGTACC CATATGGCGG ATCTGGATTA CGGCGAGCTG CGCGTTTATC CAGGTAACTA CGATGAGTAC ATGACGGCGG CGACCCAGGC GCGTGAACGT CTGCTGGCTG ATAATGCCAA GAAGAAAGCG CAGATTGCCG AGCTGCAATC TTTCGTTAGC CGCTTTAGCG CCAACGCCTC GAAATCTCGC CAGGCAACTT CGCGTGCGCG CCAGATCGAT AAAATCAAAC TGGAAGAGGT GAAAGCTTCC AGCCGTCAGA ACCCGTTCAT CCGTTTTGAG CAGGATAAGA AACTGTTCCG TAACGCGCTG GAAGTGGAAG GTCTGACCAA AGGGTTTGAT AACGGTCCAC TGTTTAAAAA TCTCAACCTG CTGCTGGAAG TCGGTGAAAA ACTGGCGGTA CTGGGTACCA ACGGCGTCGG TAAATCAACG CTGCTGAAAA CGCTGGTGGG CGATCTGCAA CCGGACAGCG GCACCGTAAA ATGGTCTGAG AACGCGCGCA TTGGTTACTA CGCCCAGGAC CATGAATATG AGTTCGAAAA TGATCTGACC GTGTTCGAAT GGATGAGCCA GTGGAAGCAG GAAGGCGATG ACGAGCAGGC GGTACGCAGC ATTCTCGGTC GTTTGCTGTT CAGCCAGGAC GATATCAAAA AGCCAGCGAA AGTGCTTTCC GGTGGGGAGA AAGGGCGGAT GCTGTTTGGT AAGTTAATGA TGCAGAAGCC GAACATTCTG ATCATGGACG AACCGACCAA CCACCTGGAT ATGGAATCTA TTGAGTCGCT GAACATGGCG CTGGAACTGT ATCAGGGCAC GCTGATCTTT GTTTCACACG ACCGTGAGTT CGTAAGCTCC CTGGCGACCC GTATTCTGGA AATCACCCCG GAACGCGTGA TCGACTTTAG CGGTAATTAC GAAGATTATC TGCGTAGTAA AGGGATCGAG TAA
|
Protein sequence | MLVSSNVTMQ FGSKPLFENI SVKFGGGNRY GLIGANGSGK STFMKILGGD LEPTLGNVSL DPNERIGKLR QDQFAFEEFT VLDTVIMGHK ELWEVKQERD RIYALPEMSE EDGYKVADLE VKYGEMDGYS AEARAGELLL GVGIPVEQHY GPMSEVAPGW KLRVLLAQAL FADPDILLLD EPTNNLDIDT IRWLEQVLNE RDSTMIIISH DRHFLNMVCT HMADLDYGEL RVYPGNYDEY MTAATQARER LLADNAKKKA QIAELQSFVS RFSANASKSR QATSRARQID KIKLEEVKAS SRQNPFIRFE QDKKLFRNAL EVEGLTKGFD NGPLFKNLNL LLEVGEKLAV LGTNGVGKST LLKTLVGDLQ PDSGTVKWSE NARIGYYAQD HEYEFENDLT VFEWMSQWKQ EGDDEQAVRS ILGRLLFSQD DIKKPAKVLS GGEKGRMLFG KLMMQKPNIL IMDEPTNHLD MESIESLNMA LELYQGTLIF VSHDREFVSS LATRILEITP ERVIDFSGNY EDYLRSKGIE
|
| |