Gene ECH74115_1266 details


Gene Information

Locus tag: ECH74115_1266
Symbol: pgaA
ID: 6968718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1275095
End bp: 1277518
Gene Length: 2424 bp
Protein Length: 807 aa
Translation table: 11
GC content: 47%
IMG OID: 643385256
Product: outer membrane protein PgaA
Protein accession: YP_002269751
Protein GI: 209397134
COG category:
COG ID:
TIGRFAM ID:
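
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein. The record does not state either assumption, but both are consistent with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1275095
end_bp = 1277518

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2424 0 807
```

A remainder of 0 means the coding sequence is a whole number of codons, as expected for an unspliced CDS that is not a pseudogene.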


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.632553
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATTCAA GTAGCAGAAA AAGGTGCCCG AAAACCAAAT GGGCTTTGAA ACTTCTTACT 
GCCGCATTTT TAGCAGCGAG TCCCGCGGCG AAGAGTGCTG TTAATAACGC CTATGATGCA
TTGATTATTG AAGCTCGCAA GGGTAATACT CAGCCAGCTT TGTCATGGTT TGCACTAAAA
TCAGCACTCA GCAATAACCA AATTGCTGAC TGGTTACAGA TTGCCTTATG GGCCGGGCAA
GATAAACAGG TTATTACCGT TTACAACCGC TACCGTCATC AGCAATTACC AGCGCGTGGT
TATGCAGCTG TCGCCGTCGC TTATCGTAAC CTGCAACAAT GGCAAAACTC GCTTACACTG
TGGCAAAAGG CGCTCTCTCT GGAGCCGCAA AATAAGGATT ATCAACGGGG ACAAATTTTA
ACCCTGGCAG ATGCTGGTCA CTATGATACT GCGCTGGTTA AACTTAAGCA GCTTAACTCT
GGAGCACCGG ACAAAGCCAA TTTACTCGCA GAAGCCTATA TCTATAAACT GGCGGGGCGT
CATCAGGATG AATTACGGGC GATGACAGAG TCATTACCTG AAAATGCATC TACGCAACAA
TATCCCACAG AATACGTGCA GGCATTACGT AATAATCAAC TTGCTGCCGC GATTGACGAT
GCCAATTTAA CGCCAGATAT TCGCGCTGAT ATTCATGCCG AACTGGTCAG ACTGTCGTTT
ATGCCTACGC GCAGTGAAAG TGAACGTTAT GCCATTGCCG ATCGCGCCCT CGCCCAATAC
GCTGCATTAG AAATTCTGTG GCACGATAAC CCAGACCGTA CTGCCCAGTA CCAGCGTATT
CAGGTTGATC ATCTTGGCGC GTTATTAACT CGCGATCGTT ATAAAGACGT TATTTCTCAC
TATCAGCGAT TAAAAAAGAC GGGGCAAATT ATTCCGCCCT GGGGGCAATA TTGGGTTGCA
TCGGCTTATC TCAAAGATCA TCAGCCGAAA AAAGCACAGT CAATAATGAC CGAGCTCTTT
TATCACAAGG AGACCATTGC CCCGGATTTA TCCGATGAAG AACTTGCGGA TCTCTTTTAC
AGCCACCTGG AGAGTGAAAA TTATCCGGGC GCGCTAACTG TCACCCAACA TACCATTAAT
ACTTCGCCGC CTTTCCTTCG GTTAATGGGC ACGCCTACGA GCATCCCGAA TGATACCTGG
TTACAGGGGC ATTCGTTTCT CTCAACCGTA GCAAAATATA GTAATGATCT TCCTCAGGCT
GAAATGACAG CCAGAGAGCT TGCTTATAAC GCACCAGGAA ATCAGGGACT GCGCATTGAT
TACGCGAGTG TGTTACAAGC CCGCGGTTGG CCTCGTGCAG CAGAAAATGA ATTAAAAAAA
GCAGAAGTGA TCGAGCCACG TAATATTAAT CTGGAGGTTG AACAAGCCTG GACAGCATTA
ACGTTACAAG AATGGCAGCA GGCAGCTGTC TTAACGCACG ATGTTGTCGA ACGTGAACCG
CAAGATCCCG GCGTTGTACG ATTAAAACGT GCGGTTGATG TACATAATCT TGCAGAGCTT
CGTATCGCTG GCTCAACAGG AATTGATGCC GAAGGCCCGG ATAGTGGTAA ACATGATGTC
GACTTAACCA CCATCGTTTA TTCACCACCG CTGAAGGATA ACTGGCGCGG TTTTGCTGGA
TTCGGTTATG CCGATGGACA ATTTAGCGAA GGAAAAGGGA TTGTTCGCGA CTGGCTTGCG
GGTGTTGAGT GGCGGTCACG TAATATCTGG CTCGAGGCAG AGTACGCTGA ACGCGTTTTC
AATCATGAGC ATAAACCCGG CGCGCGCCTG TCTGGCTGGT ATGATTTTAA TGATAACTGG
CGTATTGGTT CGCAACTGGA ACGCCTCTCT CACCGCGTTC CATTACGGGC AATGAAAAAT
GGTGTTACAG GCAACAGTGC TCAGGCTTAT GTTCGCTGGT ATCAAAATGA GCGGCGTAAG
TACGGTGTCT CCTGGGCTTT CACTGATTTT TCCGACAGTA ACCAGCGTCA TGAAGTCTCA
CTTGAGGGTC AGGAACGCAT CTGGTCTTCA CCATATTTGA TTGTCGATTT CCTACCCAGT
CTGTATTACG AACAAAATAC AGAACACGAT ACCCCATACT ACAACCCTAT AAAAACGTTC
GATATTGTTC CGGCATTTGA GGCAAGCCAT TTGTTATGGC GAAGCTATGA AAATAGCTGG
GAGCAAATAT TCAGCGCAGG TGTTGGTGCC TCCTGGCAAA AACATTATGG CACGGATGTC
GTCACCCAAC TCGGCTACGG GCAACGCATT AGCTGGAATG ACGTGATTGA TGCTGGCGCA
ACGCTACGCT GGGAAAAACG ACCTTATGAC GGTGACAGAG AACACAACTT ATACGTTGAA
TTCGATATGA CATTCAGATT TTAA
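
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the demo uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Demo on the first two blocks of the gene sequence only.
fragment = "ATGTATTCAA GTAGCAGAAA"
print(len(clean_sequence(fragment)), gc_fraction(fragment))  # 20 0.3
```

Running `gc_fraction` on the complete gene sequence text, instead of the fragment, would give the value to compare against the 47% reported in the record.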
 
Protein sequence
MYSSSRKRCP KTKWALKLLT AAFLAASPAA KSAVNNAYDA LIIEARKGNT QPALSWFALK 
SALSNNQIAD WLQIALWAGQ DKQVITVYNR YRHQQLPARG YAAVAVAYRN LQQWQNSLTL
WQKALSLEPQ NKDYQRGQIL TLADAGHYDT ALVKLKQLNS GAPDKANLLA EAYIYKLAGR
HQDELRAMTE SLPENASTQQ YPTEYVQALR NNQLAAAIDD ANLTPDIRAD IHAELVRLSF
MPTRSESERY AIADRALAQY AALEILWHDN PDRTAQYQRI QVDHLGALLT RDRYKDVISH
YQRLKKTGQI IPPWGQYWVA SAYLKDHQPK KAQSIMTELF YHKETIAPDL SDEELADLFY
SHLESENYPG ALTVTQHTIN TSPPFLRLMG TPTSIPNDTW LQGHSFLSTV AKYSNDLPQA
EMTARELAYN APGNQGLRID YASVLQARGW PRAAENELKK AEVIEPRNIN LEVEQAWTAL
TLQEWQQAAV LTHDVVEREP QDPGVVRLKR AVDVHNLAEL RIAGSTGIDA EGPDSGKHDV
DLTTIVYSPP LKDNWRGFAG FGYADGQFSE GKGIVRDWLA GVEWRSRNIW LEAEYAERVF
NHEHKPGARL SGWYDFNDNW RIGSQLERLS HRVPLRAMKN GVTGNSAQAY VRWYQNERRK
YGVSWAFTDF SDSNQRHEVS LEGQERIWSS PYLIVDFLPS LYYEQNTEHD TPYYNPIKTF
DIVPAFEASH LLWRSYENSW EQIFSAGVGA SWQKHYGTDV VTQLGYGQRI SWNDVIDAGA
TLRWEKRPYD GDREHNLYVE FDMTFRF