Gene ECH74115_1265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1265 
SymbolpgaB 
ID6966846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1273068 
End bp1275086 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content44% 
IMG OID643385255 
Productouter membrane N-deacetylase 
Protein accessionYP_002269750 
Protein GI209397733 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.798831 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACGTA ATGGAAATAA ATATCTCCTG ATGCTGGTGA GTATAATTAT GCTCACCGCG 
TGCATTAGCC AGTCAAGAAC ATCATTTATA CCGCCACAGG ATCGCAAATC TTTACTCGCC
GAGCAACCGT GGCCGCATAA TGGTTTTGTA GCGATTTCAT GGCATAACGT TGAAGACGAA
GCTGCCGACC AGCGTTTTAT GTCAGTGCGG ACATCAGCAC TGCGTGAACA ATTTGCCTGG
CTGCGCGAGA ACGGTTATCA ACCGGTCAGT ATTGCTCAAA TTCGTGAAGC ACATCGAGGA
GGAAAACCGC TACCGGAAAA AGCTGTAGTG CTGACTTTTG ATGACGGCTA CCAGAGTTTT
TATACCCGCG TCTTCCCAAT TCTTCAGGCC TTCCAGTGGC CTGCTGTATG GGCCCCCGTC
GGCAGTTGGG TCGATACGCC AGCGGATAAA CAAGTAAAAT TTGGCGATGA GTTGGTCGAT
CGAGAATATT TTGCCACGTG GCAACAAGTG CGAGAAGTTG CGCGTTCCCG GCTCGTTGAG
CTCGCTTCTC ATACATGGAA TTCTCACTAC GGTATTCAGG CTAATGCCAC CGGCAGCTTA
TTGCCTGTAT ATGTAAATCG TGCATATTTT ACTGACCACG CACGGTATGA AACCGCAGCA
GAATACCGGG AAAGAATTCG TCTGGATGCT GTAAAAATGA CGGAATACCT GCGTACAAAG
GTTGAGGTAA ATCCACACGT TTTTATTTGG CCTTATGGCG AAGCGAATGG CATAGCGATA
GAGGAATTAA AAAAACTCGG TTATGACATG TTCTTCACCC TTGAATCAGG TTTGGCAAAT
GCGTCGCAAT TGGATTCCAT TCCGCGGGTA TTAATCGCCA ATAATCCCTC ATTAAAAGAG
TTTGCCCAGC AAATTATTAC CGTACAGGAA AAATCACCAC AACGGATAAT GCATATCGAT
CTTGATTACG TTTATGACGA AAACCTCCAG CAAATGGATC GCAATATTGA TGTGCTAATT
CAGCGGGTGA AAGATATGCA AATATCAACC GTGTATTTGC AGGCATTTGC TGATCCCGAT
GGTGATGGGC TGGTCAAAGA GGTCTGGTTT CCAAATCGTT TGCTACCAAT GAAAGCAGAT
ATTTTTAGTC GGGTTGCCTG GCAATTACGT ACCCGCTCAG GTGTAAACAT CTATGCGTGG
ATGCCGGTAT TAAGCTGGGA TTTAGATCCC ACATTAACGC GAGTAAAATA CTTACCAACA
GGGGAGAAAA AAGCACAAAT TCATCCTGAA CAATATCACC GTCTCTCTCC TTTCGATGAC
AGAGTCAGAG CACAAGTTGG CATGTTATAT GAAGATCTTG CCGGACATGC TGCTTTTGAT
GGCATATTGT TCCACGATGA TGCTTTGCTT TCAGATTATG AAGATGCCAG TGCACCGGCT
ATCACGGCTT ATCAGCAAGC AGGCTTTAGC GGGAGTCTGA GCGAAATTCG ACAAAACCCG
GAGCAATTTA AACAGTGGGC CCGCTTTAAA AGTCGTGCGT TAACTGACTT CACTTTAGAA
CTTAGTGCGC GCGTAAAAGC CATTCGCGGT CCACATATTA AAACTGCACG AAATATTTTT
GCACTTCCGG TAATACAACC TGAAAGTGAA GCCTGGTTTG CACAGAATTA TGCTGATTTC
CTAAAAAGCT ATGACTGGAC CGCTATTATG GCTATGCCTT ATCTGGAAGG TGTCGCAGAA
AAATCGGCTG ACCAATGGTT AATACAATTG ACCAATCAAA TTAAAAACAT CCCTCAGGCT
AAAGACAAAT CTATTTTAGA ATTACAGGCA CAAAACTGGC AGAAAAATGG TCAGCATCAG
GCTATTTCTT CGCAACAACT CGCTCACTGG ATGAGCCTAT TACAACTGAA TGGAGTGAAA
AACTATGGTT ATTATCCCGA CAATTTTCTG CATAACCAAC CTGAAATAGA CCTTATTCGT
CCTGAGTTTT CAACAGCCTG GTATCCGAAA AATGATTAA
 
Protein sequence
MLRNGNKYLL MLVSIIMLTA CISQSRTSFI PPQDRKSLLA EQPWPHNGFV AISWHNVEDE 
AADQRFMSVR TSALREQFAW LRENGYQPVS IAQIREAHRG GKPLPEKAVV LTFDDGYQSF
YTRVFPILQA FQWPAVWAPV GSWVDTPADK QVKFGDELVD REYFATWQQV REVARSRLVE
LASHTWNSHY GIQANATGSL LPVYVNRAYF TDHARYETAA EYRERIRLDA VKMTEYLRTK
VEVNPHVFIW PYGEANGIAI EELKKLGYDM FFTLESGLAN ASQLDSIPRV LIANNPSLKE
FAQQIITVQE KSPQRIMHID LDYVYDENLQ QMDRNIDVLI QRVKDMQIST VYLQAFADPD
GDGLVKEVWF PNRLLPMKAD IFSRVAWQLR TRSGVNIYAW MPVLSWDLDP TLTRVKYLPT
GEKKAQIHPE QYHRLSPFDD RVRAQVGMLY EDLAGHAAFD GILFHDDALL SDYEDASAPA
ITAYQQAGFS GSLSEIRQNP EQFKQWARFK SRALTDFTLE LSARVKAIRG PHIKTARNIF
ALPVIQPESE AWFAQNYADF LKSYDWTAIM AMPYLEGVAE KSADQWLIQL TNQIKNIPQA
KDKSILELQA QNWQKNGQHQ AISSQQLAHW MSLLQLNGVK NYGYYPDNFL HNQPEIDLIR
PEFSTAWYPK ND