Gene Information |
Locus tag | EcHS_A3011 |
Symbol | eprH |
ID | 5594817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3017221 |
End bp | 3018402 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640922129 |
Product | type III secretion apparatus protein EprH |
Protein accession | YP_001459632 |
Protein GI | 157162314 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02554] type III secretion system protein PrgH/EprH |
|
|
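The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names `gene_length` and `expected_protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above (the strand does not affect the length).
length_bp = gene_length(3017221, 3018402)
print(length_bp)                           # 1182
print(expected_protein_length(length_bp))  # 393
```

Both results agree with the Gene Length (1182 bp) and Protein Length (393 aa) rows of the record.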
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00000773817 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAATA ATGATAAATT CTTATCACAA GACTTATTGG AATCTTATGC CATTCGTTTG TTGAGCGGAC CATTGAATGG ATGCGAGTAT GAAATACTTA ATGGGCGTCT ACTTGTTATC ATTGGTAATG ATGTTTCGTT AGGAAGAAGT GATGCTTTTT CTGAGTTACC AGAAAACACA ATAGTCGTTC CTTATGGCGA GCTTACAGGT AGTTTTGAGA TAATAATTAC TACCGACCCT GATATAGTAG TAACAATCAG AGAATTAACA GCACAAGAAC CTGAAGATAG AACGTTAACA TTCAATCAGC AAGTAGAAGT TTTAGGTCTC AAATTTGCTG TAAAAGAAAA AAATGAAGTT TGGCAGTATT CATTGCCAGG CATTATTGAA AATAACATTA TTTCTACAAA ACAACATTTC TTTAGCAGTA AGCTGTTTAA GTATGTAATG CTTTTTTTTC TTTTTGCTAT CATTTTCTTT GCTTTTTATA TTGTTAATGC CAGTAATGAT CCGCAGCTGA GACATATCGA TAAAATTCTT GTAAACAAAA ACAGGAATTA TGAAATTTTA TATGGTAGAG ATCATGTTAT CTATATCAAT ACCAATAGTT TGGATGAAGC AGTCTGGGTC AAACAAGCAC TGGAAAAAAA TCAACCTGGA AAGCCAGTAC GGGTGATAAA TCCTGATGAT GAATCGATAC GAATTTTTTC ATGGCTTGCT GATAATTTCC CTGATTTACA ATATTTTAAA CTTCAGCTAT TAGATGCCAG TAATCCCAGA CTGACCGTGA GTAAGCAACG GAATGCTATC ACACAGCAAC TAATCGACAA TCTTATTAAA GGGTTACTAC AAACTATGCC ATATGCCAGC AATATAAGTA TTGCGGTATT AGATGATAAT GTATTGGAAA GTCAGGCTAT TGAAACATTG TCAGCGATAG GTCTTTCTTA TGAAAAATAT AAAACAGCTA ACAATGTGTA TTTCAATATC ATTGGTACGT TAAGTGACAG TGAATTAAAT AAAATTAATA ACTATGTTGA CGAATATTAT AAACAATGGG GTAAACAATA TGTAAGATTT AATGTGAATT TGAAAAATCA GGACACAAAT AATAGTTCAT TTAGCTACGG AGATAACCGA TTCGAGAAGT CTCAAGGTAG CAACTGGACG TTTCAGGAAT AA
|
Protein sequence | MENNDKFLSQ DLLESYAIRL LSGPLNGCEY EILNGRLLVI IGNDVSLGRS DAFSELPENT IVVPYGELTG SFEIIITTDP DIVVTIRELT AQEPEDRTLT FNQQVEVLGL KFAVKEKNEV WQYSLPGIIE NNIISTKQHF FSSKLFKYVM LFFLFAIIFF AFYIVNASND PQLRHIDKIL VNKNRNYEIL YGRDHVIYIN TNSLDEAVWV KQALEKNQPG KPVRVINPDD ESIRIFSWLA DNFPDLQYFK LQLLDASNPR LTVSKQRNAI TQQLIDNLIK GLLQTMPYAS NISIAVLDDN VLESQAIETL SAIGLSYEKY KTANNVYFNI IGTLSDSELN KINNYVDEYY KQWGKQYVRF NVNLKNQDTN NSSFSYGDNR FEKSQGSNWT FQE
|
| |
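As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates a space-separated DNA string codon by codon and computes its GC fraction. It is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose codon assignments translation table 11 shares (table 11 differs only in the start codons it permits). The gene sequence shown above is assumed to be given in coding orientation, since it begins with ATG and ends with TAA.

```python
# Standard genetic code in the usual compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record's 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGAAAATA ATGATAAATT CTTATCACAA"
print(translate(first_blocks))  # MENNDKFLSQ
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way should allow the Protein sequence and GC content fields to be compared against the record.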