Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4706 |
Symbol | |
ID | 6143827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4805756 |
End bp | 4807258 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619522 |
Product | putative sugar ABC transporter, ATP-binding protein |
Protein accession | YP_001746630 |
Protein GI | 170681344 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.706122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACTA ACCAACACCA GGAAATCCTC CGCACCGAAG GATTAAGTAA ATTTTTCCCC GGCGTCAAAG CGTTAGACAA CGTTGATTTC AGCCTGCGTC GCGGCGAAAT TATGGCGCTG CTCGGTGAAA ACGGGGCGGG AAAATCAACG CTAATCAAAG CCTTAACCGG TGTCTACCAC GCCGATCGTG GCACCATCTG GCTGGAAGGC CAGACTATCT CACCGAAAAA CACCGCCCAC GCACAACAAC TCGGCATCGG CACCGTCTAT CAGGAAGTCA ACCTGCTACC CAATATGTCG GTCGCTGATA ATCTATTCAT AGGTCGCGAA CCCAAACGTT TCGGCCTTCT ACGCCGTAAA GAGATGGAAA AGCGCGCCAC CGAACTGATG GCATCTTACG GTTTCTCCCT CGACGTGCGC GAACCGCTCA ACCGCTTTTC AGTCGCGATG CAGCAAATCG TCGCCATTTG CCGGGCCATC GATCTCTCCG CCAAAGTGCT GATCCTCGAT GAACCCACCG CCAGCCTCGA CACTCAGGAA GTAGAGTTAC TGTTTGGCCT GATGCGTCAG TTGCGCGATC GCGGCGTCAG CCTGATCTTC GTTACTCACT TTCTCGATCA GGTCTATCAG GTCAGCGATC GGATCACCGT CTTACGCAAC GGCAGTTTCG TAGGCTGTCG GGAAACCCGC GAGCTACCAC AGATCGAACT GGTAAAAATG ATGCTGGGGC GCGAGCTGGA CACCCACGCG CTACAGCGTG CCGGACGAAC ATTGTTGAGC GACAAACCCG TCGCCGCGTT CAAAAATTAC GGCAAAAAAG GAATGATCGC ACCGTTTGAT CTCGAAGTGC GCCCCGGCGA GATCGTCGGT CTGGCAGGCT TGCTGGGATC AGGACGTACC GAAACCGCCG AAGTGATCTT CGGTATTAAA CCTGCTGACA GCGGCACGGC GTTGATCAAA GGCAAACCGC AAACCCTGCG ATCGCCACAT CAGGCTTCGG TACTTGGCAT TGGCTTTTGC CCGGAAGACA GGAAAACCGA TGGCATCATC GCCGCCGCCT CGGTGCGGGA AAATATCATT CTCGCGCTAC AAGCCCAGCG CGGCTGGCTG CGACCGATCT CCCGCAAAGA ACAGCAAGAG ATTGCCGAAC GCTTTATCCG CCAGCTTGGC ATTCGCACCC CTTCCACTGA ACAACCGATT GAATTTCTCT CCGGCGGCAA TCAGCAAAAA GTGCTGCTCT CTCGCTGGCT GCTGACTCGT CCGCAATTTC TGATCCTCGA CGAGCCAACG CGTGGAATTG ACGTTGGTGC GCACGCCGAG ATCATCCGCC TGATTGAAAC GCTATGCGCT GACGGTCTGG CGCTGCTGGT GATCTCCTCT GAACTGGAAG AGCTGGTGGG CTATGCCGAT CGGGTGATTA TCATGCGCGA TCGCAAACAG GTGGCGGAGA TCCCGCTGGC AGCGCTTTCC GTTCCGGCGA TCATGAACGC CATTGCGGCG TAA
|
Protein sequence | MNTNQHQEIL RTEGLSKFFP GVKALDNVDF SLRRGEIMAL LGENGAGKST LIKALTGVYH ADRGTIWLEG QTISPKNTAH AQQLGIGTVY QEVNLLPNMS VADNLFIGRE PKRFGLLRRK EMEKRATELM ASYGFSLDVR EPLNRFSVAM QQIVAICRAI DLSAKVLILD EPTASLDTQE VELLFGLMRQ LRDRGVSLIF VTHFLDQVYQ VSDRITVLRN GSFVGCRETR ELPQIELVKM MLGRELDTHA LQRAGRTLLS DKPVAAFKNY GKKGMIAPFD LEVRPGEIVG LAGLLGSGRT ETAEVIFGIK PADSGTALIK GKPQTLRSPH QASVLGIGFC PEDRKTDGII AAASVRENII LALQAQRGWL RPISRKEQQE IAERFIRQLG IRTPSTEQPI EFLSGGNQQK VLLSRWLLTR PQFLILDEPT RGIDVGAHAE IIRLIETLCA DGLALLVISS ELEELVGYAD RVIIMRDRKQ VAEIPLAALS VPAIMNAIAA
|
| |