Gene EcHS_A3019 details

Gene Information

Locus tag: EcHS_A3019
Symbol: epaO
ID: 5594445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3022339
End bp: 3023286
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 35%
IMG OID: 640922136
Product: surface presentation of antigens protein SpaO
Protein accession: YP_001459639
Protein GI: 157162321
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1886] Flagellar motor switch/type III secretory pathway protein
TIGRFAM ID: [TIGR02551] type III secretion system apparatus protein YscQ/HrcQ
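
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3022339, 3023286)
    print(length, "bp")                     # gene length from the coordinates
    print(protein_length_aa(length), "aa")  # residues, excluding the stop codon
```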


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.000335444
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGGTC TGAGGAAAGT AAATCGAAAC ACTCATACGT TTGAAACCAC TTTTCAAAAT 
TGGAAAGAAA ATGGCGAAGA TGTTGCATTA TTAATGCCCG AATTTAGTGC TAAATGGCTA
CCGATAGCAG AAGAGAGTGG CAGCTGGTCT GGATGGGTAT TGCTTGGAGA AATATTTCCC
TTAATTTCAT CTGAGCTCGC TGGGATGGCT TTAATGCCAG AAACGGAAAG GTTAATAGGG
GAATGGCTCA GTTTATCCAG TTCCCCGTTA AATCTAAAGC ATCCTGAACT AAAATATAAT
CGTTTGCGCG TAGGTAAAGT ATTTGATGGA GTATTGAGCT CTGCTCAACC ACTAATAAGA
ATATGGACAG GGGAATTGAA TCTTTGGTTA GATAAAGTCA CAGTCTGCCA ATACGGAAAC
GCTCCAGCGT TAGACAAAAA ATCGTTATAT TGCTCCATTC ATTTTGTAAT TGGATTTAGT
AAAACATGTT ACAGAAGTCT TGTTGATATT GAAGTTGGTG ATGTTTTATT AATCTCAAAT
AATTTGGCTT ATGCGGTTAT TTATAATACA AAAATTTTTG ATTTAATTTA TCCAGAGGAG
TTAAAAATGG CTGATCATTT TGAGTACGAG GAAGATTTTG AAACAGATGA TTTTGATATC
AAAAAAAACG AGAGTGAGAT TTATGATGAA AATGACGATC AGATGATTAA TAGTTTTGAA
GACTTGCCTG TAAAAATTGA GTTTGTTCTT GGTAAGAAAA TAATGAATCT ATATGAAATA
GACGAACTCT GTGCAAAAAG AATAATATCT CTGCTTTCTG AATCTGAGAA AAATATAGAA
ATACGTGTAA ATGGCGCGCT AACTGGCTAT GGAGAACTTG TTGAAGTGGA TGATAAATTG
GGCGTAGAGA TTCATTCTTG GTTATCAGGG CATAATAATG TCAAATAG
 
Protein sequence
MFGLRKVNRN THTFETTFQN WKENGEDVAL LMPEFSAKWL PIAEESGSWS GWVLLGEIFP 
LISSELAGMA LMPETERLIG EWLSLSSSPL NLKHPELKYN RLRVGKVFDG VLSSAQPLIR
IWTGELNLWL DKVTVCQYGN APALDKKSLY CSIHFVIGFS KTCYRSLVDI EVGDVLLISN
NLAYAVIYNT KIFDLIYPEE LKMADHFEYE EDFETDDFDI KKNESEIYDE NDDQMINSFE
DLPVKIEFVL GKKIMNLYEI DELCAKRIIS LLSESEKNIE IRVNGALTGY GELVEVDDKL
GVEIHSWLSG HNNVK
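
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch of that check, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ only in the permitted start codons), and it strips the spaces that group the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 60 bases of the gene sequence, copied from the record above.
    first_block = (
        "ATGTTCGGTC TGAGGAAAGT AAATCGAAAC ACTCATACGT TTGAAACCAC TTTTCAAAAT"
    )
    print(translate(first_block))
    print(round(gc_percent(first_block), 1))
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content listed in the record.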