Gene Information |
Locus tag | EcHS_A4564 |
Symbol | |
ID | 5595312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4570974 |
End bp | 4572254 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640923660 |
Product | hypothetical protein |
Protein accession | YP_001461100 |
Protein GI | 157163782 |
COG category | [S] Function unknown |
COG ID | [COG2733] Predicted membrane protein |
TIGRFAM ID | |
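The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 4570974, 4572254

length = gene_length(start_bp, end_bp)
print(length)                  # 1281, matching "Gene Length"
print(protein_length(length))  # 426, matching "Protein Length"
```

The strand field does not affect these lengths; it only determines which strand of the replicon is read.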
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG CTGTTTCGCC GCGTGCCGAT TCCGATCATT TCTCGTCATA CGGCGATTAT CCCGCGTAAT AAAGACCGGA TTGGCGAAAA TCTCGGCCAG TTCGTGCAGG AAAAATTTCT CGATACCCAG TCGCTGGTGG CATTGATTCG ACGCCACGAA CCGGTGTTGT TGATTGGCAA CTGGTTTAGT CAGCCAGAAA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG CGGTTTTCTT GAACTGACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCGGTCCA TCGGGCGATT GATAAGGTCG ATCTTTCCGG CACCAGTGCG TTGATGCTGG AGAGTATGAC CAAAAACGAT CGTCATCAGG TGCTACTGGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT AAATCGCGCA AGTTTATTGC CCAGCAAATT GTTCGCTGGC TGGAGAGTGA GCATCCACTG AAAGCCAAAA TTCTCCCCAC CGAATGGTTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC GCGGTGAATT CTTTGCTTGA TGATATCAGC CGTGATCGTG CGCATCAGAT CCGTCATGCG TTTGATCGCG CCACTTTTGC CCTGATCGAC AAGTTGAAAA ACGATCCGGA AATGGCAGCG CGAGCCGATG CCGTAAAAAG TTATCTGAAA GAAGATGAAG CTTTTAACCG CTATCTCAGT GAATTGTGGG GGGATTTACG GGAGTGGCTG AAAGCGGATA TCAACAGTGA AGATTCTCGT GTGAAAGAAC GTATCGCGCG GGCGGGTCAA TGGTTTGGCG AAACGTTAAT TGCCGATGAT GCCTTGCGGG CGTCGTTAAA TGGTCACCTG GAACAAGCCG CGCACCGCGT CGCGCCTGAG TTTTCCGCAT TCCTGACGCG CCACATCAGC GATACAGTAA AAAGCTGGGA TGCGCGAGAT ATGTCGCGGC AAATCGAGTT AAATATCGGC AAAGATCTGC AGTTTATCCG TGTCAACGGT ACGCTGGTTG GCGGTTGTAT TGGGCTAATT TTATATTTGT TGTCGCAGCT CCCGGCCTTG TTCCCCCTCA GCAATTTTTA G
Protein sequence | MNKLIELRRA KMLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PVLLIGNWFS QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD AVNSLLDDIS RDRAHQIRHA FDRATFALID KLKNDPEMAA RADAVKSYLK EDEAFNRYLS ELWGDLREWL KADINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRVAPE FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL FPLSNF