Gene EcHS_A1138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1138 
SymbolpgaB 
ID5593558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1150138 
End bp1152156 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content44% 
IMG OID640920301 
Productouter membrane N-deacetylase 
Protein accessionYP_001457865 
Protein GI157160547 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.0115575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACGTA ATGGAAATAA ATATCTCCTG ATGCTGGTGA GTATAATTAT GCTCACCGCG 
TGCATTAGCC AGTCAAGAAC ATCATTTATA CCGCCACAGG ATCGCGAATC TTTACTCGCC
GAGCAACCGT GGCCGCATAA TGGTTTTGTA GCGATTTCAT GGCATAACGT TGAAGACGAA
GCTGCCGACC AGCGTTTTAT GTCAGTGCGG ACATCAGCAC TGCGTGAACA ATTTGCCTGG
CTGCGCGAGA ACGGTTATCA ACCGGTCAGT ATTGCTCAAA TTCGTGAAGC ACGTCGAGGA
GGAAAACCGC TACCGGAAAA AGCTGTAGTG CTGACTTTTG ATGACGGCTA CCAGAGTTTT
TATACCCGCG TCTTCCCAAT TCTTCAGGCC TTCCAGTGGC CTGCTGTATG GGCCCCCGTC
GGCAGTTGGG TCGATACGCC AGCGGATAAA CAAGTAAAAT TTGGCGATGA GTTGGTCGAT
CGAGAATATT TTGCCACGTG GCAACAAGTG CGAGAAGTTG CGCGTTCCCG GCTCGTTGAG
CTCGCTTCTC ATACATGGAA TTCTCACTAC GGTATTCAGG CTAATGCCAC CGGCAGCTTA
TTGCCTGTAT ATGTAAATCG TGCATATTTT ACTGACCACG CACGGTATGA AACCGCAGCA
GAATACCGGG AAAGAATTCG TCTGGATGCT GTAAAAATGA CGGAATACCT GCGTACAAAG
GTTGAGGTAA ATCCACACGT TTTTGTTTGG CCTTATGGCG AAGCGAATGG CATAGCGATA
GAGGAATTAA AAAAACTCGG TTATGACATG TTCTTCACCC TTGAATCAGG TTTGGCAAAT
GCGTCGCAAT TGGATTCCAT TCCGCGGGTA TTAATCGCCA ATAATCCCTC ATTAAAAGAG
TTTGCCCAGC AAATTATTAC CGTACAGGAA AAATCACCAC AACGGATAAT GCATATCGAT
CTTGATTACG TTTATGACGA AAACCTCCAG CAAATGGATC GCAATATTGA TGTGCTAATT
CAGCGGGTGA AAGATATGCA AATATCAACC GTGTATTTGC AGGCATTTGC TGATCCCGAT
GGTGATGGGC TGGTCAAAGA GGTCTGGTTT CCAAATCGTT TGCTACCAAT GAAAGCAGAT
ATTTTTAGTC GGGTTGCCTG GCAATTACGT ACCCGCTCAG GTGTAAACAT CTATGCGTGG
ATGCCGGTAT TAAGCTGGGA TTTAGATCCC ACATTAACGC GAGTAAAATA CTTACCAACA
GGGGAGAAAA AAGCACAAAT TCATCCTGAA CAATATCACC GTCTCTCTCC TTTCGATGAC
AGAGTCAGAG CACAAGTTGG CATGTTATAT GAAGATCTTG CCGGACATGC TGCTTTTGAT
GGCATATTGT TCCACGATGA TGCTTTGCTT TCAGATTATG AAGATGCCAG TGCACCGGCT
ATCACGGCTT ATCAGCAAGC AGGCTTTAGC GGGAGTCTGA GCGAAATTCG ACAAAACCCG
GAGCAATTTA AACAGTGGGC CCGCTTTAAA AGTCGTGCGT TAACTGACTT CACTTTAGAA
CTTAGTGCGC GCGTAAAAGC CATTCGCGGT CCACATATTA AAACTGCACG AAATATTTTT
GCACTTCCGG TAATACAACC TGAAAGTGAA GCCTGGTTTG CACAGAATTA TGCTGATTTC
CTAAAAAGCT ATGACTGGAC CGCTATTATG GCTATGCCTT ATCTGGAAGG TGTCGCAGAA
AAATCGGCTG ACCAATGGTT AATACAATTG ACCAATCAAA TTAAAAACAT CCCTCAGGCT
AAAGACAAAT CTATTTTAGA ATTACAGGCA CAAAACTGGC AGAAAAATGG TCAGCATCAG
GCTATTTCTT CGCAACAACT CGCTCACTGG ATGAGCCTAT TACAACTGAA TGGAGTGAAA
AACTATGGTT ATTATCCCGA CAATTTTCTG CATAACCAAC CTGAAATAGA CCTTATTCGT
CCTGAGTTTT CAACAGCCTG GTATCCGAAA AATGATTAA
 
Protein sequence
MLRNGNKYLL MLVSIIMLTA CISQSRTSFI PPQDRESLLA EQPWPHNGFV AISWHNVEDE 
AADQRFMSVR TSALREQFAW LRENGYQPVS IAQIREARRG GKPLPEKAVV LTFDDGYQSF
YTRVFPILQA FQWPAVWAPV GSWVDTPADK QVKFGDELVD REYFATWQQV REVARSRLVE
LASHTWNSHY GIQANATGSL LPVYVNRAYF TDHARYETAA EYRERIRLDA VKMTEYLRTK
VEVNPHVFVW PYGEANGIAI EELKKLGYDM FFTLESGLAN ASQLDSIPRV LIANNPSLKE
FAQQIITVQE KSPQRIMHID LDYVYDENLQ QMDRNIDVLI QRVKDMQIST VYLQAFADPD
GDGLVKEVWF PNRLLPMKAD IFSRVAWQLR TRSGVNIYAW MPVLSWDLDP TLTRVKYLPT
GEKKAQIHPE QYHRLSPFDD RVRAQVGMLY EDLAGHAAFD GILFHDDALL SDYEDASAPA
ITAYQQAGFS GSLSEIRQNP EQFKQWARFK SRALTDFTLE LSARVKAIRG PHIKTARNIF
ALPVIQPESE AWFAQNYADF LKSYDWTAIM AMPYLEGVAE KSADQWLIQL TNQIKNIPQA
KDKSILELQA QNWQKNGQHQ AISSQQLAHW MSLLQLNGVK NYGYYPDNFL HNQPEIDLIR
PEFSTAWYPK ND