Gene EcolC_2573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2573 
SymbolpgaB 
ID6064986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2819874 
End bp2821892 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content44% 
IMG OID641601980 
Productouter membrane N-deacetylase 
Protein accessionYP_001725531 
Protein GI170020577 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0269291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACGTA ATGGAAATAA ATATCTCCTG ATGCTGGTGA GTATAATTAT GCTCACCGCG 
TGCATTAGCC AGTCAAGAAC ATCATTTATA CCGCCACAGG ATCGCGAATC TTTACTCGCC
GAGCAACCGT GGCCGCATAA TGGTTTTGTA GCGATTTCAT GGCATAACGT TGAAGACGAA
GCTGCCGACC AGCGTTTTAT GTCAGTGCGG ACATCAGCAC TGCGTGAACA ATTTGCCTGG
CTGCGCGAGA ACGGTTATCA ACCGGTCAGT ATTGCTCAAA TTCGTGAAGC ACATCGAGGA
GGAAAACCGC TACCGGAAAA AGCTGTAGTG CTGACTTTTG ATGACGGCTA CCAGAGTTTT
TATACCCGCG TCTTCCCAAT TCTTCAGGCC TTCCAGTGGC CTGCTGTATG GGCCCCCGTC
GGCAGTTGGG TCGATACGCC AGCGGATAAA CAAGTAAAAT TTGGCGATGA GTTGGTCGAT
CGAGAATATT TTGCCACGTG GCAACAAGTG CGAGAAGTTG CGCGTTCCCG GCTCGTTGAG
CTCGCTTCTC ATACATGGAA TTCTCACTAC GGTATTCAGG CTAATGCCAC CGGCAGCTTA
TTGCCTGTAT ATGTAAATCG TGCATATTTT ACTGACCACG CACGGTATGA AACCGCAGCA
GAATACCGGG AAAGAATTCG TCTGGATGCT GTAAAAATGA CGGAATACCT GCGTACAAAG
GTTGAGGTAA ATCCACACGT TTTTGTTTGG CCTTATGGCG AAGCGAATGG CATAGCGATA
GAGGAATTAA AAAAACTCGG TTATGACATG TTCTTCACCC TTGAATCAGG TTTGGCAAAT
GCGTCGCAAT TGGATTCCAT TCCGCGGGTA TTAATCGCCA ATAATCCCTC ATTAAAAGAG
TTTGCCCAGC AAATTATTAC CGTACAGGAA AAATCACCAC AACGGATAAT GCATATCGAT
CTTGATTACG TTTATGACGA AAACCTCCAG CAAATGGATC GCAATATTGA TGTGCTAATT
CAGCGGGTGA AAGATATGCA AATATCAACC GTGTATTTGC AGGCATTTGC TGATCCCGAT
GGTGATGGGC TGGTCAAAGA GGTCTGGTTT CCAAATCGTT TGCTACCAAT GAAAGCAGAT
ATTTTTAGTC GGGTTGCCTG GCAATTACGT ACCCGCTCAG GTGTAAACAT CTATGCGTGG
ATGCCGGTAT TAAGCTGGGA TTTAGATCCC ACATTAACGC GAGTAAAATA CTTACCAACA
GGGGAGAAAA AAGCACAAAT TCATCCTGAA CAATATCACC GTCTCTCTCC TTTCGATGAC
AGAGTCAGAG CACAAGTTGG CATGTTATAT GAAGATCTTG CCGGACATGC TGCTTTTGAT
GGCATATTGT TCCACGATGA TGCTTTGCTT TCAGATTATG AAGATGCCAG TGCACCGGCT
ATCACGGCTT ATCAGCAAGC AGGCTTTAGC GGGAGTCTGA GCGAAATTCG ACAAAACCCG
GAGCAATTTA AACAGTGGGC CCGCTTTAAA AGTCGTGCGT TAACTGACTT CACTTTAGAA
CTTAGTGCGC GCGTAAAAGC CATTCGCGGT CCACATATTA AAACTGCACG AAATATTTTT
GCACTTCCGG TAATACAACC TGAAAGTGAA GCCTGGTTTG CACAGAATTA TGCTGATTTC
CTAAAAAGCT ATGACTGGAC CGCTATTATG GCTATGCCTT ATCTGGAAGG TGTCGCAGAA
AAATCGGCTG ACCAATGGTT AATACAATTG ACCAATCAAA TTAAAAACAT CCCTCAGGCT
AAAGACAAAT CTATTTTAGA ATTACAGGCA CAAAACTGGC AGAAAAATGG TCAGCATCAG
GCTATTTCTT CGCAACAACT CGCTCACTGG ATGAGCCTAT TACAACTGAA TGGAGTGAAA
AACTATGGTT ATTATCCCGA CAATTTTCTG CATAACCAAC CTGAAATAGA CCTTATTCGT
CCTGAGTTTT CAACAGCCTG GTATCCGAAA AATGATTAA
 
Protein sequence
MLRNGNKYLL MLVSIIMLTA CISQSRTSFI PPQDRESLLA EQPWPHNGFV AISWHNVEDE 
AADQRFMSVR TSALREQFAW LRENGYQPVS IAQIREAHRG GKPLPEKAVV LTFDDGYQSF
YTRVFPILQA FQWPAVWAPV GSWVDTPADK QVKFGDELVD REYFATWQQV REVARSRLVE
LASHTWNSHY GIQANATGSL LPVYVNRAYF TDHARYETAA EYRERIRLDA VKMTEYLRTK
VEVNPHVFVW PYGEANGIAI EELKKLGYDM FFTLESGLAN ASQLDSIPRV LIANNPSLKE
FAQQIITVQE KSPQRIMHID LDYVYDENLQ QMDRNIDVLI QRVKDMQIST VYLQAFADPD
GDGLVKEVWF PNRLLPMKAD IFSRVAWQLR TRSGVNIYAW MPVLSWDLDP TLTRVKYLPT
GEKKAQIHPE QYHRLSPFDD RVRAQVGMLY EDLAGHAAFD GILFHDDALL SDYEDASAPA
ITAYQQAGFS GSLSEIRQNP EQFKQWARFK SRALTDFTLE LSARVKAIRG PHIKTARNIF
ALPVIQPESE AWFAQNYADF LKSYDWTAIM AMPYLEGVAE KSADQWLIQL TNQIKNIPQA
KDKSILELQA QNWQKNGQHQ AISSQQLAHW MSLLQLNGVK NYGYYPDNFL HNQPEIDLIR
PEFSTAWYPK ND