Gene Information |
Locus tag | EcHS_A2386 |
Symbol | |
ID | 5591857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2397886 |
End bp | 2398788 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640921513 |
Product | hypothetical protein |
Protein accession | YP_001459047 |
Protein GI | 157161729 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
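The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption: stop is included).
    return gene_len_bp // 3 - 1


length = gene_length(2397886, 2398788)
print(length)                  # 903
print(protein_length(length))  # 300
```

Both values match the Gene Length (903 bp) and Protein Length (300 aa) fields of this record.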
| |
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAT CAACAACCTC CTCCCCGCAT GATGCGGTAT TTAAAACCTT TATGTTCACA CCCGAAACCG CACGGGATTT TCTCGAAATA CATTTACCAG AACCACTGCG CAAGCTTAGC AACCTGCAAA CCTTACGCCT GGAACCCACT AGTTTTATTG AAAAAAGTTT ACGCGCTTAC TACTCGGATG TTTTGTGGTC CGTGGAAACC AGCGACGGTG ACGGCTATAT CTACTGCGTG ATTGAACATC AAAGCTCTGC AGAAAAGAAT ATGGCTTTTC GGCTAATGCG CTATGCCACT GCCGCCATGC AGCGTCACCT GGATAAAGGC TATGACAGAG TTCCGCTGGT GGTGCCATTG CTGTTTTATC ATGGCGAAAC ATCGCCCTAC CCGTACTCAC TTAACTGGCT GGATGAGTTT GACGATCCGC AACTTGCCCG GCAGTTGTAC ACCGAAGCTT TTCCGTTGGT GGATATCACC ATCGTACCTG ACGATGAGAT CATGCAACAT CGGCGTATAG CTCTGCTGGA ACTGATTCAA AAGCATATTC GCGACCGCGA TTTAATCGGC ATGGTCGACA GGATCACCAC GCTTTTGGTT AGAGGCTTCA CTAATGACAG CCAGCTACAA ACACTGTTTA ATTATCTGCT GCAATGCGGC GATACCTCCC GTTTCACCCG TTTTATTGAG GAGATTGCCG AACGTTCACC ACTACAAAAG GAGAGATTAA TGACTATTGC TGAACGGCTA CGGCAGGAAG GGCATCAAAT TGGCTGGCAG GAAGGTATGC ATGAACAAGC CATTAAAATT GCTTTGCGCA TGCTGGAACA GGGCTTTGAT CGTGACCAGG TGCTCGCGGC CACCCAGCTA AGCGAAGCCG ATCTGGCAGC GAATAACCAC TAA
Protein sequence | MTESTTSSPH DAVFKTFMFT PETARDFLEI HLPEPLRKLS NLQTLRLEPT SFIEKSLRAY YSDVLWSVET SDGDGYIYCV IEHQSSAEKN MAFRLMRYAT AAMQRHLDKG YDRVPLVVPL LFYHGETSPY PYSLNWLDEF DDPQLARQLY TEAFPLVDIT IVPDDEIMQH RRIALLELIQ KHIRDRDLIG MVDRITTLLV RGFTNDSQLQ TLFNYLLQCG DTSRFTRFIE EIAERSPLQK ERLMTIAERL RQEGHQIGWQ EGMHEQAIKI ALRMLEQGFD RDQVLAATQL SEADLAANNH
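The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It assumes the sequence contains only A, C, G and T, strips the spaces that separate the 10-character blocks, and uses the standard codon assignments, which translation table 11 shares for amino acids; alternative start codons are not handled because this gene starts with ATG. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in this record.
prefix = "ATGACCGAAT CAACAACCTC CTCCCCGCAT"
print(translate(prefix))  # MTESTTSSPH
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be compared against the values listed in this record.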