Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2299 |
Symbol | |
ID | 6143981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2327524 |
End bp | 2328681 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617173 |
Product | hypothetical protein |
Protein accession | YP_001744346 |
Protein GI | 170681337 |
COG category | [S] Function unknown |
COG ID | [COG2311] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00383736 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCGCA ACGTCACGCT CGATTTTGTT CGCGGCGTCG CCATTCTGGG GATCCTGCTA TTAAACATCA GCGCCTTTGG GCTACCAAAG GCGGCTTATC TCAATCCCGC CTGGTACGGC GCTATTACGT CGCAGGATGC ATGGACCTGG GCATTTCTCG ATCTCATCGG CCAGGTGAAA TTCCTCACGC TTTTTGCTCT GCTGTTTGGT GCGGGCCTGC AAATGTTGCT GCCCCGTGGC AGACGCTGGA TCCAGTCGCG GTTAACGCTG TTAGTCTTGC TGGGCTTTAT TCACGGTTTA TTGTTCTGGG ACGGCGATAT TCTGCTGGCT TACGGGCTGG TGGGCTTAAT CTGCTGGCGG CTGGTGCGCG ATGCGCCATC GGTAAAAAGC CTGTTTAATA CAGGCGTCAT GCTTTATCTG GTGGGGCTTG GCGTTTTGCT GTTATTGGGG CTGATTTCCG ACAGCCAGAC CAGCCGCGCC TGGACGCCGG ATGCATCGGC TATTTTGTAT GAAAAATACT GGAAGCTTCA CGGCGGCGTT GAAGCGATCA GTAATCGTGC CGATGGTGTT GGCAACAGTT TACTGGCACT GGGCGCACAG TATGGCTGGC AACTGGCAGG GATGATGCTC ATTGGTGCCG CATTGATGCG CAGCGGCTGG CTGAAAGGGC AGTTCAGCTT ACGTCACTAT CGTCGTACTG GTTTTGTGCT GGTGGCGATT GGGGTGACCA TTAACCTTCC TGCCATCGCC CTGCAATGGC AGCTGGACTG GGCGTATCGC TGGTGTGCAT TCTTACTTCA AATGCCGCGG GAACTGAGTG CGCCGTTTCA GGCGATTGGC TATGCGTCGC TGTTTTATGG CTTCTGGCCG CAATTAAGCC GCTTTAAGCT GGTGCTGGCA ATCGCCTGTG TCGGACGGAT GGCACTGACC AACTATCTAT TGCAAACGCT GATTTGTACC ACGCTTTTTT ACCACCTCGG TTTGTTTATG CTGTTTGACC GTCTGGAATT GCTGGCGTTT GTTATTCCGG TATGGCTGGC GAATATTCTC TTCTCTGTTA TCTGGCTGCG TTACTTCCGC CAGGGGCCGG TGGAATGGCT CTGGCGTCAG TTAACTTTGC GTGCAGCTGG GCCGGCAAGA TCTAAAACAT CAAGATAA
|
Protein sequence | MERNVTLDFV RGVAILGILL LNISAFGLPK AAYLNPAWYG AITSQDAWTW AFLDLIGQVK FLTLFALLFG AGLQMLLPRG RRWIQSRLTL LVLLGFIHGL LFWDGDILLA YGLVGLICWR LVRDAPSVKS LFNTGVMLYL VGLGVLLLLG LISDSQTSRA WTPDASAILY EKYWKLHGGV EAISNRADGV GNSLLALGAQ YGWQLAGMML IGAALMRSGW LKGQFSLRHY RRTGFVLVAI GVTINLPAIA LQWQLDWAYR WCAFLLQMPR ELSAPFQAIG YASLFYGFWP QLSRFKLVLA IACVGRMALT NYLLQTLICT TLFYHLGLFM LFDRLELLAF VIPVWLANIL FSVIWLRYFR QGPVEWLWRQ LTLRAAGPAR SKTSR
|
| |