Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4519 |
Symbol | |
ID | 5593982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4524687 |
End bp | 4526189 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640923615 |
Product | hypothetical protein |
Protein accession | YP_001461056 |
Protein GI | 157163738 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00000000859541 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAC CCCTGTTAAT TGCCCGCACG CCGGACACAG AACTGTTTTT ACTGCCGGGA ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC CGTTACGCTG CAAAAACTGG CGGAGTCATT GTCGGAAATC GGCGTGCCGG TGTTTATGGC TGATGTGAAA GGCGATCTGA CCGGTATCGC GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CGCAAGGCTT AAAAATATCG GCGTCAATGA CTGGCAACCG CATGCCAATC CGGTGGTGGT GTGGGATATC TTTGGCGAGA AAGGCCATCC GGTGCGGGCG ACGGTTTCGG ATCTGGGGCC GCTGTTGCTG GCGCGGCTGT TGAATCTCAA CGATGTGCAA TCTGGCGTGC TGAATATCAT CTTCCGTATT GCTGACGATC AGGGATTGTT GCTGCTCGAC TTTAAAGATC TGCGGGCGAT TACCCAGTAC ATCGGCGATA ACGCCAAATC TTTCCAGAAT CAGTACGGTA ATATCAGTAG CGCATCGGTT GGTGCCATCC AGCGCGGACT GTTGTCGCTG GAACAGCAAG GCGCAGCACA CTTCTTTGGC GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATA CCAACGGTAA AGGCGTTATC AATATCCTCA GCGCCGAGAA GCTTTATCAG ATGCCGAAAC TGTACGCCGC CAGCCTGCTG TGGATGCTTT CAGAGTTGTA TGAACAATTG CCGGAAGCAG GCGATCTGGA GAAACCAAAA CTGGTGTTTT TCTTCGACGA GGCACATCTG CTGTTTAACG ATGCACCGCA GGTACTGCTG GATAAGATTG AGCAGGTGAT ACGGCTTATT CGCTCAAAAG GCGTAGGCGT CTGGTTCGTT TCGCAAAACC CGTCTGATAT TCCGGATAAT GTGCTCGGGC AGCTCGGTAA TCGCGTTCAA CACGCTTTGC GTGCTTTTAC GCCCAAAGAT CAGAAAGCGG TAAAAGCTGC GGCGCAAACC ATGCGGGCCA ATCCGGCATT TGATACCGAA AAGGCGATTC AGGAACTGGG CACCGGCGAG GCGTTGATCT CTTTTCTGGA TGCGAAAGGA AGCCCTTCTG TGGTGGAGCG TGCGATGGTG ATCGCGCCTT GTTCGCGGAT GGGGCCGGTG ACGGAAGATG AGCGTAATGG CTTGATTAAT CACTCTCCGG TGTATGGCAA ATATGAGGAT GAGGTGGACC GGGAATCCGC CTATGAGATG TTGCAAAAAG GCTTTCAGGC CAGTACCGAG CAGCAAAATA ATCCCCCCGC GAAAGGGAAA GAGGTAGCGG TGGATGACGG CATTCTTGGT GGATTGAAGG ATATTTTGTT CGGCACTACC GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCCAAAAG CGCCGCTCGC CAGGTGACGA ATCAGATTGT GCGTGGAATG TTGGGGAGTT TGCTGGGGGG GAGAAGAAGG TAA
|
Protein sequence | MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK GDLTGIAQAG TASEKLLARL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL ARLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV GAIQRGLLSL EQQGAAHFFG EPMLDIKDWM RTDTNGKGVI NILSAEKLYQ MPKLYAASLL WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE ALISFLDAKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED EVDRESAYEM LQKGFQASTE QQNNPPAKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR QVTNQIVRGM LGSLLGGRRR
|
| |