Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2664 |
Symbol | |
ID | 6145451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2733435 |
End bp | 2734613 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617535 |
Product | outer membrane protein assembly complex subunit YfgL |
Protein accession | YP_001744700 |
Protein GI | 170681614 |
COG category | [S] Function unknown |
COG ID | [COG1520] FOG: WD40-like repeat |
TIGRFAM ID | [TIGR03300] outer membrane assembly lipoprotein YfgL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.280231 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTGC GTAAATTACT GCTGCCAGGA CTGCTTTCCG TTACCCTTTT AAGCGGCTGT TCGCTGTTTA ACAGCGAAGA AGATGTGGTA AAGATGTCCC CATTGCCAAC CGTTGAAAAC CAGTTTACGC CGTCTACTGC GTGGAGCACT TCCGTTGGTA GCGGCATTGG CAACTTCTAT TCCAATCTTC ATCCGGCACT GGCGGACAAC GTTGTCTATG CTGCGGACCG CGCTGGTTTA GTAAAAGCGC TGAATGCGGA TGACGGCAAA GAAATCTGGT CTGTCAACCT GGCCGAGAAA GATGGCTGGT TCTCTAAAGA TCCTGCATTA CTTTCTGGCG GCGTGACCGT GTCTGGTGGG CATGTCTACA TTGGCACCGA AAAGGCGCAG GTTTACGCGC TGAATACCAG CGATGGTACT GTGGCATGGC AAACTAAAGT CGCGGGTGAA GCACTTTCGC GCCCGGTGGT CAGCGACGGT CTGGTGTTAA TCCACACCAG TAACGGTCAG TTACAAGCGC TGAACGAAGC TGACGGCGCT GTCAAATGGA CAGTTAACCT CGATATGCCT TCGCTCTCTT TGCGTGGCGA GTCTGCGCCG GCAACGGCTT TTGGTGCGGC CGTCGTGGGT GGCGATAATG GTCGCGTCAG CGCAGTGCTG ATGGAACAGG GCCAGATGAT TTGGCAGCAG CGTATTTCCC AGGCGACCGG TTCTACCGAA ATTGACCGTC TGAGCGATGT TGACACGACA CCCGTCGTTG TTAACGGCGT TGTTTTCGCG CTGGCCTATA ATGGTAACCT GACGGCGCTT GATCTGCGCA GTGGTCAGAT TATGTGGAAA CGCGAACTGG GTTCGGTGAA TGATTTCATC GTCGACGGCA ATCGCATCTA TCTGGTCGAT CAAAATGACC GGGTGATGGC GTTGACCATT GATGGCGGCG TTACGCTGTG GACACAAAGC GATCTGCTGC ATCGCCTGCT GACTTCTCCG GTGCTGTATA ATGGCAACCT GGTGGTAGGT GACAGTGAAG GTTATCTGCA CTGGATTAAC GTCGAAGATG GTCGTTTCGT TGCCCAGCAA AAAGTTGATA GTTCCGGTTT CCAGACTGAA CCGGTTGCCG CTGACGGCAA ACTGCTGATC CAGGCAAAAG ACGGAACCGT GTACTCTATT ACACGTTAA
|
Protein sequence | MQLRKLLLPG LLSVTLLSGC SLFNSEEDVV KMSPLPTVEN QFTPSTAWST SVGSGIGNFY SNLHPALADN VVYAADRAGL VKALNADDGK EIWSVNLAEK DGWFSKDPAL LSGGVTVSGG HVYIGTEKAQ VYALNTSDGT VAWQTKVAGE ALSRPVVSDG LVLIHTSNGQ LQALNEADGA VKWTVNLDMP SLSLRGESAP ATAFGAAVVG GDNGRVSAVL MEQGQMIWQQ RISQATGSTE IDRLSDVDTT PVVVNGVVFA LAYNGNLTAL DLRSGQIMWK RELGSVNDFI VDGNRIYLVD QNDRVMALTI DGGVTLWTQS DLLHRLLTSP VLYNGNLVVG DSEGYLHWIN VEDGRFVAQQ KVDSSGFQTE PVAADGKLLI QAKDGTVYSI TR
|
| |