Gene Information |
Locus tag | EcSMS35_3311 |
Symbol | |
ID | 6145653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3388889 |
End bp | 3390496 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618140 |
Product | putative binding protein |
Protein accession | YP_001745290 |
Protein GI | 170680624 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
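
The coordinates and lengths in the table above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3388889, 3390496)
print(length)                           # 1608
print(expected_protein_length(length))  # 535
```

Both values agree with the Gene Length (1608 bp) and Protein Length (535 aa) fields of the record.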
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.43332 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.120975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTATACGC GAAATTTATT ATGGCTGGTC AGCCTGGTAA GTGCGGCTCC TCTCTACGCT GCTGACGTTC CCGCCAACAC ACCGCTCGCC CCGCAACAAG TCTTTCGTTA CAACAATCAT AGCGACCCAG GCACGCTCGA CCCGCAAAAG GTGGAGGAGA ATACTGCCGC GCAGATTGTG CTGGATCTGT TTGAAGGTCT GGTATGGATG GACGGTGAAG GCCAGGTGCA GCCCGCTCAG GCTGAACGCT GGGAGATACT GGACGGCGGC AAGCGCTATA TTTTCCATCT GCGTAGCGGT TTGCAGTGGT CAGACGGTCA GCCTCTGACG GCAGAGGATT TTGTCCTCGG CTGGCAGCGC GCGGTTGACC CGAAAACGGC AAGCCCTTTT GCTGGCTATC TGGCACAGGC GCACATTAAC AATGCCGCGG CTATTGTTGC GGGTAAAGCA GATGTTACAT CGCTGGGTGT CAAAGCGACG GATGATCGTA CTCTTGAAGT TACGCTTGAG CAGCCAGTTC CTTGGTTCAC GACGATGCTC GCCTGGCCGA CGCTGTTCCC GGTTCCTCAT CATGTCATCG CTAAACATGG CGATAGCTGG AGTAAGCCAG AGAACATGGT TTACAACGGT GCCTTTGTGC TTGATCAGTG GGTAGTTAAC GAAAAGATTA CTGCACGCAA AAATCCAAAG TACCGCGATG CGCAACATAC AGTATTGCAA CAGGTTGAGT ATCTGGCGCT GGATAATTCG GTCACCGGCT ATAACCGCTA TCGCGCGGGA GAGGTCGATC TCACCTGGGT TCCGGCGCAG CAAATTCCCG CCATTGAAAA ATCACTGCCT GGCGAGCTAC GAATTATTCC GCGTCTGAAC AGCGAATATT ACAACTTCAA CCTTGAGAAA CCGCCATTTA ACGATGTGCG GGTGCGTCGG GCGCTATATC TTACGGTTGA TCGACAGCTT ATTGCGCAAA AGGTACTGGG GTTGAGAACG CCCGCAACTA CGCTGACGCC GCCAGAGGTA AAAGGCTTTA GCGCGACGAC GTTCGATGAA CTGCAAAAGC CGCTGAGTGA GCGCGTCGCG ATGGCAAAAG CCTTGTTGAA ACAGGCGGGA TACGACGCCT CTCATCCGCT TCGCTTTGAG CTGTTCTACA ACAAGTACGA TCTGCATGAA AAGACCGCGA TAGCGTTGTC TTCCGAATGG AAAAAATGGC TGGGTGCACA GGTGACGCTG CGCACAATGG AGTGGAAAAC TTATCTTGAT GCCCGACGAG CCGGTGATTT CATGTTGTCT CGGCAGTCGT GGGATGCGAC GTACAATGAT GCCTCCACCT TTCTGAACAC GCTTAAGAGC GATAGCGAGG AAAATGTCGG CCACTGGAAA AACGCGCAGT ATGACGCCTT ACTAAACCAG GCGGCACAAA CGGCTGATGC AACAAAGCGT AATGCGTTGT ATCAGCAGGC AGAAGTGATC ATCAACCAGC AGGCACCGCT CATTCCTGTC TATTATCAGC CGTTAATCAA ACTGCTTAAA CCCTACGTTG GCGGTTTTCC GCTGCATAAT CCCCAGGATT ATGTCTACAG CAAAGAGTTG TATATCAAGG CACATTGA
Protein sequence | MYTRNLLWLV SLVSAAPLYA ADVPANTPLA PQQVFRYNNH SDPGTLDPQK VEENTAAQIV LDLFEGLVWM DGEGQVQPAQ AERWEILDGG KRYIFHLRSG LQWSDGQPLT AEDFVLGWQR AVDPKTASPF AGYLAQAHIN NAAAIVAGKA DVTSLGVKAT DDRTLEVTLE QPVPWFTTML AWPTLFPVPH HVIAKHGDSW SKPENMVYNG AFVLDQWVVN EKITARKNPK YRDAQHTVLQ QVEYLALDNS VTGYNRYRAG EVDLTWVPAQ QIPAIEKSLP GELRIIPRLN SEYYNFNLEK PPFNDVRVRR ALYLTVDRQL IAQKVLGLRT PATTLTPPEV KGFSATTFDE LQKPLSERVA MAKALLKQAG YDASHPLRFE LFYNKYDLHE KTAIALSSEW KKWLGAQVTL RTMEWKTYLD ARRAGDFMLS RQSWDATYND ASTFLNTLKS DSEENVGHWK NAQYDALLNQ AAQTADATKR NALYQQAEVI INQQAPLIPV YYQPLIKLLK PYVGGFPLHN PQDYVYSKEL YIKAH
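
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one way to strip the blocks, compute GC content, and translate the gene sequence. It assumes the gene sequence is given in the coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with table 1 for all non-initiator codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI base order (T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments;
# it differs from table 1 only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not remapped to M here;
    this record starts with ATG, so that case does not arise.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGTATACGC GAAATTTATT ATGGCTGGTC")
print(translate(prefix))  # MYTRNLLWLV
```

The translated prefix matches the first block of the protein sequence. Applying `gc_content` and `translate` to the full cleaned gene sequence should allow the GC content and Protein sequence fields to be checked in the same way.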