Gene Information |
Locus tag | EcSMS35_1682 |
Symbol | |
ID | 6143883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1683419 |
End bp | 1684633 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616558 |
Product | putative lipoprotein |
Protein accession | YP_001743736 |
Protein GI | 170683453 |
COG category | [S] Function unknown |
COG ID | [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
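The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1683419
END_BP = 1684633
GENE_LENGTH_BP = 1215
PROTEIN_LENGTH_AA = 404


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones listed in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the derived lengths agree with the Gene Length and Protein Length rows.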
| |
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACAC CACCAGCGGG TTCAAAGCCA CCAGCCACGA CGCAACAATC GTCACAACCG ATGCGTGGCA TCTGGCTGGC CACGGTGTCT CGCCTCGACT GGCCACCGGT TTCCTCGGTT AACATTAGTA ACCCCACCAG CCGGGCCCGT GTACAACAAC AGGCGATGAT CGACAAACTG GATCATCTGC AACGTCTCGG CATAAACACG GTCTTTTTCC AGGTCAAGCC AGACGGTACC GCCCTGTGGC CATCGAAAAT TTTGCCGTGG TCCGATCTTA TGACCGGTAA GATTGGTGAA AATCCGGGTT ACGATCCGCT GCAATTCATG CTCGACGAAG CCCACAAGCG TGGGATGAAA GTACACGTCT GGTTTAACCC CTATCGCGTA TCGGTTAATA CGAAGCCCGG TACTATCAGG GAACTGAATA GCACCCTGTC TCAACAACCG GCGAGCGTCT ATGTGCAACA CCGCGACTGG ATCAGAACGT CCGGCGATCG CTTTGTCCTC GACCCGGGCA TACCTGAGGT TCAGGACTGG ATCACATCAA TAGTTGCAGA AGTGGTTTCC CGCTATCCGG TAGATGGCGT GCAGTTTGAC GACTATTTCT ATACTGAGTC GCCAGGTTCA CGGCTAAATG ATAATGAAAC GTACCGCAAA TACGGCGGCG CATTTGCGTC AAAAGCAGAC TGGCGGCGCA ACAATACTCA GCAGTTAATT GCGAAGGTAT CGCACACCAT TAAAAGCATT AAGCCGGAAG TCGAATTTGG CGTTAGCCCG GCAGGCGTGT GGCGTAACCG ATCACACGAT CCGCTCGGTT CCGATACCCG AGGCGCGGCA GCCTATGACG AATCCTACGC TGATACCCGT CGATGGGTGG AACAAGGGTT GCTGGATTAC ATTGCTCCCC AAATTTACTG GCCATTCTCA CGGAGTGCCG CGCGTTATGA CGTGTTGGCA AAATGGTGGG CGGATGTCGT TAAACCGACC AGGACCCGCC TGTATATCGG TATCGCCTTC TATAAAGTGG GTGAACCTTC AAAGATAGAG CCAGACTGGA TAATTAACGG CGGCGTACCG GAACTGAAAA AGCAGCTCGA TCTTAACGAT GCCGTGCCAG AAATTAGCGG CACAATCTTG TTCCGTGAGG ACTATCTGAA TAAGCCACAG ACTCAACAAG CGGTCAGCTA TCTGCAAAGT CGTTGGGGCA GTTAA
|
Protein sequence | MVTPPAGSKP PATTQQSSQP MRGIWLATVS RLDWPPVSSV NISNPTSRAR VQQQAMIDKL DHLQRLGINT VFFQVKPDGT ALWPSKILPW SDLMTGKIGE NPGYDPLQFM LDEAHKRGMK VHVWFNPYRV SVNTKPGTIR ELNSTLSQQP ASVYVQHRDW IRTSGDRFVL DPGIPEVQDW ITSIVAEVVS RYPVDGVQFD DYFYTESPGS RLNDNETYRK YGGAFASKAD WRRNNTQQLI AKVSHTIKSI KPEVEFGVSP AGVWRNRSHD PLGSDTRGAA AYDESYADTR RWVEQGLLDY IAPQIYWPFS RSAARYDVLA KWWADVVKPT RTRLYIGIAF YKVGEPSKIE PDWIINGGVP ELKKQLDLND AVPEISGTIL FREDYLNKPQ TQQAVSYLQS RWGS
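Both sequences are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before the sequences are used elsewhere. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are hypothetical, and the example input is only the first two blocks of the gene sequence. Running it on the full gene sequence should give a value comparable to the GC content field, depending on how that field was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace that splits the sequence into display blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two display blocks of the gene sequence, copied from the record.
    first_blocks = "ATGGTGACAC CACCAGCGGG"
    print(clean_sequence(first_blocks))
    print(gc_fraction(first_blocks))
```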