Gene Information |
Locus tag | EcSMS35_0915 |
Symbol | |
ID | 6147103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 923314 |
End bp | 924240 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615803 |
Product | quaternary amine ABC transporter ATP-binding protein |
Protein accession | YP_001742995 |
Protein GI | 170684202 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1125] ABC-type proline/glycine betaine transport systems, ATPase components |
TIGRFAM ID | |
|
|
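The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


length = gene_length_bp(923314, 924240)
print(length)                     # 927
print(protein_length_aa(length))  # 308
```

Both values agree with the Gene Length and Protein Length fields of this record.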
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.530478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGAAT TTAGCCATGT CAGCAAACTG TTCGGCGCAC AAAAAGCCGT TAACGATCTC AATCTCAATT TTCAGGAAGG GAGTTTTTCG GTGCTGATTG GCACATCTGG CTCCGGCAAA TCCACCACCC TGAAAATGAT TAACCGCCTG GTGGAGCATG ACAGCGGCGT GATCCGCTTT GCCGGAGAAG AAATTCGCTC GCTGCCAGTA CTGGAGTTGC GCCGCCGGAT GGGCTATGCC ATTCAATCTA TTGGCCTGTT CCCCCACTGG AGCGTGGCGC AAAACATCGC TACCGTGCCG CAATTACAAA AATGGTCGCG GGCGCGGATT GACGATCGTA TCGACGAATT AATGGCGCTA CTGGGGCTGG AGTCAAATTT ACGTGAGCGT TATCCGCATC AGCTTTCCGG TGGTCAGCAG CAACGTGTGG GAGTGGCGCG CGCACTGGCT GCCGATCCGC AAGTCTTACT GATGGATGAA CCTTTTGGCG CACTGGACCC GGTAACGCGC GGCGCGTTGC AACAAGAGAT GACGCGCATT CACCGTTTGC TGGGGCGCAC TATTGTGCTG GTCACTCATG ATATTGATGA GGCGCTACGG CTGGCAGAAC ATCTGGTATT GATGGATCAC GGTGAAGTGG TGCAGCAGGG CAATCCGCTG ACGATGCTGA CTCGTCCGGC GAATGATTTT GTTCGCCAGT TTTTTGGACG TAGTGAACTG GGTGTGCGCC TGCTTTCGTT ACGTAGTGTG GCGGATTACG TGCGTCGCGA AGAACGGGCA GAAGGTGAGG CGCTGGCAGA AGAGATGACG CTACGCGATG CGCTCTCCCT GTTTGTCGCG CGGGGATGTG AAGTGCTGCC GGTGGTGAAC ACACAGGGCG AGCCTTGCGG CACGCTGCAT TTTCAGGATC TGCTGGTGGA GGCGTAA
|
Protein sequence | MIEFSHVSKL FGAQKAVNDL NLNFQEGSFS VLIGTSGSGK STTLKMINRL VEHDSGVIRF AGEEIRSLPV LELRRRMGYA IQSIGLFPHW SVAQNIATVP QLQKWSRARI DDRIDELMAL LGLESNLRER YPHQLSGGQQ QRVGVARALA ADPQVLLMDE PFGALDPVTR GALQQEMTRI HRLLGRTIVL VTHDIDEALR LAEHLVLMDH GEVVQQGNPL TMLTRPANDF VRQFFGRSEL GVRLLSLRSV ADYVRREERA EGEALAEEMT LRDALSLFVA RGCEVLPVVN TQGEPCGTLH FQDLLVEA
|
| |
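The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted, which this sketch does not model). The `translate` helper is hypothetical and hand-written here; it is not taken from any library. Whitespace is stripped first because the record prints the sequence in blocks of 10 bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGATTGAAT TTAGCCATGT CAGCAAACTG"))  # MIEFSHVSKL
```

The output matches the first block of the protein sequence listed above. Passing the full gene sequence to the same function is one way to check the remaining residues.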