Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4566 |
Symbol | |
ID | 5594538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4573994 |
End bp | 4574938 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640923662 |
Product | hypothetical protein |
Protein accession | YP_001461102 |
Protein GI | 157163784 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACT TCACAACCAG CACGCCGCAT GACGCATTAT TTAAATCCTT TCTCACGCAC CCTGACACCG CGCGGGATTT TATGGAGATC CACTTACCCA AAGATTTACG TGAACTGTGC GATCTCGACA GCTTAAAACT GGAATCCGCC AGCTTCGTCG ATGAAAAATT GCGGGCGCTA CACTCCGATA TTCTGTGGTC GGTAAAGACC CGTGAAGGTG ATGGTTATAT TTACGTAGTG ATTGAACATC AGAGCCGCGA GGATATCCAT ATGGCCTTTC GCCTGATGCG ATATTCCATG GCGGTGATGC AGCGCCATAT CGAGCATGAT AAACGCCGGC CGCTACCGCT GGTCATCCCG ATGCTGTTTT ATCACGGTAG CCGTAGTCCT TATCCCTGGT CCCTGTGCTG GCTGGACGAA TTTGCTGACC CGACCACCGC ACGGAAGCTT TATACCGCAG CGTTCCCGCT GGTGGATGTC ACTGTCGTGC CAGACGACGA GATTGTGCAG CACCGCAGAG TCGCCCTGTT GGAGTTGATC CAAAAGCATA TTCGCCAGCG CGATCTGATG GGGCTTATCG ATCAACTGGT AATATTACTG GTTACAGAGT GTGCTAATGA CAGCCAGATA ACTGCGCTGT TAAATTACAT TTTACTGACT GGCGATGAAG CGCGTTTTAA GAAGTTTATC AGCGAACTTA CCCGTCGAAT GCCACAACAC AGGGAGCGAA TAATGACGAT TGCAGAGCGA ATTTATAATG ATGGATGGCT GTTGGGGATG GAAAAGGGGA AAGAAGAAGG GGAACAACGC CTCCTTAGAT TGTTGTTGCA GAATGGGGCA GATCCTGAAT GGATACAAAA GATTACCGGA CTTTCGACAG AGCAAATGCA GGCATTAGAG CAGCCCTTGC CTGAAATCAA GCGCGATCCA TGGATCGAAT ACTAA
|
Protein sequence | MTNFTTSTPH DALFKSFLTH PDTARDFMEI HLPKDLRELC DLDSLKLESA SFVDEKLRAL HSDILWSVKT REGDGYIYVV IEHQSREDIH MAFRLMRYSM AVMQRHIEHD KRRPLPLVIP MLFYHGSRSP YPWSLCWLDE FADPTTARKL YTAAFPLVDV TVVPDDEIVQ HRRVALLELI QKHIRQRDLM GLIDQLVILL VTECANDSQI TALLNYILLT GDEARFKKFI SELTRRMPQH RERIMTIAER IYNDGWLLGM EKGKEEGEQR LLRLLLQNGA DPEWIQKITG LSTEQMQALE QPLPEIKRDP WIEY
|
| |