Gene EcHS_A3011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3011 
SymboleprH 
ID5594817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3017221 
End bp3018402 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content33% 
IMG OID640922129 
Producttype III secretion apparatus protein EprH 
Protein accessionYP_001459632 
Protein GI157162314 
COG category 
COG ID 
TIGRFAM ID[TIGR02554] type III secretion system protein PrgH/EprH 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00000773817 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATA ATGATAAATT CTTATCACAA GACTTATTGG AATCTTATGC CATTCGTTTG 
TTGAGCGGAC CATTGAATGG ATGCGAGTAT GAAATACTTA ATGGGCGTCT ACTTGTTATC
ATTGGTAATG ATGTTTCGTT AGGAAGAAGT GATGCTTTTT CTGAGTTACC AGAAAACACA
ATAGTCGTTC CTTATGGCGA GCTTACAGGT AGTTTTGAGA TAATAATTAC TACCGACCCT
GATATAGTAG TAACAATCAG AGAATTAACA GCACAAGAAC CTGAAGATAG AACGTTAACA
TTCAATCAGC AAGTAGAAGT TTTAGGTCTC AAATTTGCTG TAAAAGAAAA AAATGAAGTT
TGGCAGTATT CATTGCCAGG CATTATTGAA AATAACATTA TTTCTACAAA ACAACATTTC
TTTAGCAGTA AGCTGTTTAA GTATGTAATG CTTTTTTTTC TTTTTGCTAT CATTTTCTTT
GCTTTTTATA TTGTTAATGC CAGTAATGAT CCGCAGCTGA GACATATCGA TAAAATTCTT
GTAAACAAAA ACAGGAATTA TGAAATTTTA TATGGTAGAG ATCATGTTAT CTATATCAAT
ACCAATAGTT TGGATGAAGC AGTCTGGGTC AAACAAGCAC TGGAAAAAAA TCAACCTGGA
AAGCCAGTAC GGGTGATAAA TCCTGATGAT GAATCGATAC GAATTTTTTC ATGGCTTGCT
GATAATTTCC CTGATTTACA ATATTTTAAA CTTCAGCTAT TAGATGCCAG TAATCCCAGA
CTGACCGTGA GTAAGCAACG GAATGCTATC ACACAGCAAC TAATCGACAA TCTTATTAAA
GGGTTACTAC AAACTATGCC ATATGCCAGC AATATAAGTA TTGCGGTATT AGATGATAAT
GTATTGGAAA GTCAGGCTAT TGAAACATTG TCAGCGATAG GTCTTTCTTA TGAAAAATAT
AAAACAGCTA ACAATGTGTA TTTCAATATC ATTGGTACGT TAAGTGACAG TGAATTAAAT
AAAATTAATA ACTATGTTGA CGAATATTAT AAACAATGGG GTAAACAATA TGTAAGATTT
AATGTGAATT TGAAAAATCA GGACACAAAT AATAGTTCAT TTAGCTACGG AGATAACCGA
TTCGAGAAGT CTCAAGGTAG CAACTGGACG TTTCAGGAAT AA
 
Protein sequence
MENNDKFLSQ DLLESYAIRL LSGPLNGCEY EILNGRLLVI IGNDVSLGRS DAFSELPENT 
IVVPYGELTG SFEIIITTDP DIVVTIRELT AQEPEDRTLT FNQQVEVLGL KFAVKEKNEV
WQYSLPGIIE NNIISTKQHF FSSKLFKYVM LFFLFAIIFF AFYIVNASND PQLRHIDKIL
VNKNRNYEIL YGRDHVIYIN TNSLDEAVWV KQALEKNQPG KPVRVINPDD ESIRIFSWLA
DNFPDLQYFK LQLLDASNPR LTVSKQRNAI TQQLIDNLIK GLLQTMPYAS NISIAVLDDN
VLESQAIETL SAIGLSYEKY KTANNVYFNI IGTLSDSELN KINNYVDEYY KQWGKQYVRF
NVNLKNQDTN NSSFSYGDNR FEKSQGSNWT FQE