Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2103 |
Symbol | |
ID | 5594448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2090879 |
End bp | 2091931 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640921242 |
Product | hypothetical protein |
Protein accession | YP_001458784 |
Protein GI | 157161466 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00000172399 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTTAG CATCATTGTT GAGACGTATT GCATTTAGTT ACTACGATTA TAAAGCTTAT AATTTCAATA TTGAAAAAAC AGACTTTGTT GTCATCCATA TTCCCGATCA GATTGGCGAT GCTATGGCCA TCTTTCCTGT TATTCGGGCG CTTGAATTGC ATAAAATTAA GCATCTTTTA ATTGTAATGT CGACAATTAA TTTAGAAGTC TTTAATGCGC TTAAACTTGA ACAGACTAAA TTAACATTAG TCACAATGAC TATGCAGGAT CACGCAACAT TAAAAGAAAT AAAAGATTTA GCAAAGAACA TAACACAGCA ATACGGTACG CCGGATCTTT GCATTGAGGG GATGCGTAAA AAGAACCTGA AAACGATGTT ATTTATCAGT CAGTTGAAAG CAAAAACGAA TTTTCAGGTT GTTGGTATAA CCATGAATTG CTTCTCCCCT TTGTGCAAGA ACGCGTCCCG TATGGATCAG AAACTCCGGG CTCCCGTACC TATGACATGG GCATTTATGA TGCGTGAGGC GGGTTTTCCA GCAGTCAGGC CAATATATGA ATTGCCACTA AGTGAGGATG TACTCGATGA GGTGCGCGAG GAAATGCGAT CGTTAGGATC TTACATTGCG TTCAATTTAG AAGGTAGCTC GCAGGAACGT ACATTTTCAT TATCGATTGC AGAAAATCTA ATAGCAAAAA TTCAAAGTGA AACAGATATG CCAATAGTGA TCGTTCATGG ACCCAAAGGT GAAGATAAAG CCAGGGCATT AGTGGATTGT TATAATAATG TCTACCGTTT ATCCTTACCA CCCTCGATTA AACGTTCAGC AGCAATCATA AAAGATGCTT ATATCGCAAT AACTCCTGAC ACCTCAATAT TACATATGGC AAGTGCCTAT AATACTCCCG TTGTTGCAAT TTATGCTGAT TACAAAACGC GATGGCCCGC AATGGCCGAT GTTTCGGAGT CAGTCGTCGT TGGGCAAAAA ATTGACAATA TAAGTCTGGA TGAATTCGCA AAGGCATTAA AAAGTGTTTT GGCGAGAATA TGA
|
Protein sequence | MFLASLLRRI AFSYYDYKAY NFNIEKTDFV VIHIPDQIGD AMAIFPVIRA LELHKIKHLL IVMSTINLEV FNALKLEQTK LTLVTMTMQD HATLKEIKDL AKNITQQYGT PDLCIEGMRK KNLKTMLFIS QLKAKTNFQV VGITMNCFSP LCKNASRMDQ KLRAPVPMTW AFMMREAGFP AVRPIYELPL SEDVLDEVRE EMRSLGSYIA FNLEGSSQER TFSLSIAENL IAKIQSETDM PIVIVHGPKG EDKARALVDC YNNVYRLSLP PSIKRSAAII KDAYIAITPD TSILHMASAY NTPVVAIYAD YKTRWPAMAD VSESVVVGQK IDNISLDEFA KALKSVLARI
|
| |