Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2286 |
Symbol | |
ID | 5592659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2284037 |
End bp | 2285194 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921414 |
Product | hypothetical protein |
Protein accession | YP_001458950 |
Protein GI | 157161632 |
COG category | [S] Function unknown |
COG ID | [COG2311] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 70 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCGCA ACGTCACGCT CGATTTTGTT CGCGGCGTCG CCATTCTGGG GATCCTGCTA TTAAACATCA GCGCCTTTGG GCTACCAAAG GCGGCTTATC TTAATCCCGC CTGGTACGGC GCTATTACGC CGCAGGATGC ATGGACCTGG GCATTTCTTG ATCTCGTCGG CCAGGTGAAA TTCCTCACGC TTTTTGCGCT GCTGTTTGGT GCGGGCCTGC AAATGTTGCT GCCCCGTGGC AGACGCTGGA TCCAGTCGCG GTTAACGCTG TTAGTCTTGT TGGGCTTTAT TCACGGTTTA CTGTTCTGGG ACGGCGATAT TCTGCTGGCT TACGGGCTGG TGGGCTTAAT CTGCTGGCGG CTGGTGCGCG ATGCGCCATC GGTAAAAAGC CTGTTTAATA CAGGCGTCAT GCTTTATCTG GTGGGGCTTG GCGTTTTGCT GTTATTGGGG TTGATTTCCG ATAGCCAGAC CAGCCGCGCC TGGACGCCGG ATGCATCGGC TATTTTGTAT GAAAAATACT GGAAGCTTCA CGGCGGCGTT GATGCGATCA GTAATCGTGC CGATGGTGTT GGCAACAGTT TACTGGCACT GGGCGCACAG TATGGCTGGC AACTGGCTGG GATGATGCTC ATTGGTGCCG CATTGATGCG CAGTGGCTGG CTGAAAGGGC AGTTCAGCTT ACGTCACTAT CGTCGTACTG GTTTTGTGCT GGTGGCGATT GGGGTGATCA TTAACCTTCC TGCCATCGCC CTGCAATGGC GGCTGGACTG GGCATATCGC TGGTGCGCCT TCTTACTTCA GATGCCGCGG GAACTGAGTG CGCCGTTTCA GGCGATAGGC TATGCGTCGC TGTTTTATGG TTTCTGGCCG CAATTGAGCC GCTTTAAGCT GGTGCTTGCG ATCGCCTGCG TCGGACGGAT GGCGCTGACC AACTATCTAT TGCAAACGCT GATTTGTACC ACGCTTTTTT ACCACCTCGG TTTGTTTATG CATTTTGACC GCCTGGAGCT GCTGGCGTTT GTTATTCCGG TATGGCTGGC GAATATCCTC TTCTCTGTTA TCTGGCTGCG TTTCTTCCGC CAGGGGCCGG TGGAATGGCT CTGGCGTCAG TTAACTTTGC GTGCTGCCGG ACCGGCAATA TCTAAAACAT CAAGATAA
|
Protein sequence | MERNVTLDFV RGVAILGILL LNISAFGLPK AAYLNPAWYG AITPQDAWTW AFLDLVGQVK FLTLFALLFG AGLQMLLPRG RRWIQSRLTL LVLLGFIHGL LFWDGDILLA YGLVGLICWR LVRDAPSVKS LFNTGVMLYL VGLGVLLLLG LISDSQTSRA WTPDASAILY EKYWKLHGGV DAISNRADGV GNSLLALGAQ YGWQLAGMML IGAALMRSGW LKGQFSLRHY RRTGFVLVAI GVIINLPAIA LQWRLDWAYR WCAFLLQMPR ELSAPFQAIG YASLFYGFWP QLSRFKLVLA IACVGRMALT NYLLQTLICT TLFYHLGLFM HFDRLELLAF VIPVWLANIL FSVIWLRFFR QGPVEWLWRQ LTLRAAGPAI SKTSR
|
| |