Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2013 |
Symbol | |
ID | 6147057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2036139 |
End bp | 2037098 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616889 |
Product | LysM/ErfK/YbiS/YcfS/YnhG family protein |
Protein accession | YP_001744065 |
Protein GI | 170681577 |
COG category | [S] Function unknown |
COG ID | [COG1376] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.00035713 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCAAAA CGCATTTTTC TCGCTGGCTA ACGTTTTTTA CGTTCGCCGC TGCCGTGGCG CTGGCGCTAC CGGCAAAAGC CAACACCTGG CCGCTGCCGC CTGCGGGCAG TCGTCTGGTT GGTGAAAACA AATTTCATGT TGTGGAAAAT GACGGTGGTT CTCTGGAAGC CATCGCCAAA AAATATAACG TCGGCTTTCT CGCGCTGTTA CAGGCTAACC CCGGCGTTGA TCCTTACGTA CCGCGCGCGG GTAGCGTGTT AACGATCCCG TTGCAAACCC TACTTCCAGA CGCGCCGCGT GAAGGCATTG TGATCAACAT TGCGGAGCTG CGTCTCTATT ACTACCCGCC GGGTAAAAAT TCGGTAACCG TGTATCCCAT CGGCATTGGT CAGTTGGGTG GTGACACGCT GACACCAACA ATGGTGACCA CCGTTTCAGA CAAACGTGCA AACCCAACCT GGACGCCAAC GGCAAACATC CGCGCCCGTT ATAAAGCACA GGGAATTGAG TTGCCTGCGG TAGTGCCGGC TGGACCGGAT AACCCAATGG GCCATCATGC GATTCGTCTG GCGGCCTATG GCGGCGTTTA TTTGCTTCAT GGTACGAACG CCGATTTCGG CATTGGCATG CGGGTAAGTT CTGGCTGTAT TCGTCTGCGG GATGACGATA TCAAAACACT CTTTAGCCAG GTCACCCCAG GCACCAAAGT GAATATCATC AACACTCCGA TAAAAGTCTC TGCCGAACCA AACGGTGCGC GTCTGGTTGA AGTACATCAG CCGCTGTCTG AGAAGATTGA TGACGATCCG CAGCTGCTGC CAATTACGCT GAATAGCGCA ATGCAATCAT TTAAAGATGC AGCACAAACT GACGCTGAAG TGATGCAACA TGTGATGGAT GTCCGTTCCG GGATGCCGGT GGATGTCCGC CGTCATCAAG TGAGCCCACA AACTCTGTAA
|
Protein sequence | MIKTHFSRWL TFFTFAAAVA LALPAKANTW PLPPAGSRLV GENKFHVVEN DGGSLEAIAK KYNVGFLALL QANPGVDPYV PRAGSVLTIP LQTLLPDAPR EGIVINIAEL RLYYYPPGKN SVTVYPIGIG QLGGDTLTPT MVTTVSDKRA NPTWTPTANI RARYKAQGIE LPAVVPAGPD NPMGHHAIRL AAYGGVYLLH GTNADFGIGM RVSSGCIRLR DDDIKTLFSQ VTPGTKVNII NTPIKVSAEP NGARLVEVHQ PLSEKIDDDP QLLPITLNSA MQSFKDAAQT DAEVMQHVMD VRSGMPVDVR RHQVSPQTL
|
| |