Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2573 |
Symbol | pgaB |
ID | 6064986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2819874 |
End bp | 2821892 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641601980 |
Product | outer membrane N-deacetylase |
Protein accession | YP_001725531 |
Protein GI | 170020577 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0269291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACGTA ATGGAAATAA ATATCTCCTG ATGCTGGTGA GTATAATTAT GCTCACCGCG TGCATTAGCC AGTCAAGAAC ATCATTTATA CCGCCACAGG ATCGCGAATC TTTACTCGCC GAGCAACCGT GGCCGCATAA TGGTTTTGTA GCGATTTCAT GGCATAACGT TGAAGACGAA GCTGCCGACC AGCGTTTTAT GTCAGTGCGG ACATCAGCAC TGCGTGAACA ATTTGCCTGG CTGCGCGAGA ACGGTTATCA ACCGGTCAGT ATTGCTCAAA TTCGTGAAGC ACATCGAGGA GGAAAACCGC TACCGGAAAA AGCTGTAGTG CTGACTTTTG ATGACGGCTA CCAGAGTTTT TATACCCGCG TCTTCCCAAT TCTTCAGGCC TTCCAGTGGC CTGCTGTATG GGCCCCCGTC GGCAGTTGGG TCGATACGCC AGCGGATAAA CAAGTAAAAT TTGGCGATGA GTTGGTCGAT CGAGAATATT TTGCCACGTG GCAACAAGTG CGAGAAGTTG CGCGTTCCCG GCTCGTTGAG CTCGCTTCTC ATACATGGAA TTCTCACTAC GGTATTCAGG CTAATGCCAC CGGCAGCTTA TTGCCTGTAT ATGTAAATCG TGCATATTTT ACTGACCACG CACGGTATGA AACCGCAGCA GAATACCGGG AAAGAATTCG TCTGGATGCT GTAAAAATGA CGGAATACCT GCGTACAAAG GTTGAGGTAA ATCCACACGT TTTTGTTTGG CCTTATGGCG AAGCGAATGG CATAGCGATA GAGGAATTAA AAAAACTCGG TTATGACATG TTCTTCACCC TTGAATCAGG TTTGGCAAAT GCGTCGCAAT TGGATTCCAT TCCGCGGGTA TTAATCGCCA ATAATCCCTC ATTAAAAGAG TTTGCCCAGC AAATTATTAC CGTACAGGAA AAATCACCAC AACGGATAAT GCATATCGAT CTTGATTACG TTTATGACGA AAACCTCCAG CAAATGGATC GCAATATTGA TGTGCTAATT CAGCGGGTGA AAGATATGCA AATATCAACC GTGTATTTGC AGGCATTTGC TGATCCCGAT GGTGATGGGC TGGTCAAAGA GGTCTGGTTT CCAAATCGTT TGCTACCAAT GAAAGCAGAT ATTTTTAGTC GGGTTGCCTG GCAATTACGT ACCCGCTCAG GTGTAAACAT CTATGCGTGG ATGCCGGTAT TAAGCTGGGA TTTAGATCCC ACATTAACGC GAGTAAAATA CTTACCAACA GGGGAGAAAA AAGCACAAAT TCATCCTGAA CAATATCACC GTCTCTCTCC TTTCGATGAC AGAGTCAGAG CACAAGTTGG CATGTTATAT GAAGATCTTG CCGGACATGC TGCTTTTGAT GGCATATTGT TCCACGATGA TGCTTTGCTT TCAGATTATG AAGATGCCAG TGCACCGGCT ATCACGGCTT ATCAGCAAGC AGGCTTTAGC GGGAGTCTGA GCGAAATTCG ACAAAACCCG GAGCAATTTA AACAGTGGGC CCGCTTTAAA AGTCGTGCGT TAACTGACTT CACTTTAGAA CTTAGTGCGC GCGTAAAAGC CATTCGCGGT CCACATATTA AAACTGCACG AAATATTTTT GCACTTCCGG TAATACAACC TGAAAGTGAA GCCTGGTTTG CACAGAATTA TGCTGATTTC CTAAAAAGCT ATGACTGGAC CGCTATTATG GCTATGCCTT ATCTGGAAGG TGTCGCAGAA AAATCGGCTG ACCAATGGTT AATACAATTG ACCAATCAAA TTAAAAACAT CCCTCAGGCT AAAGACAAAT CTATTTTAGA ATTACAGGCA CAAAACTGGC AGAAAAATGG TCAGCATCAG GCTATTTCTT CGCAACAACT CGCTCACTGG ATGAGCCTAT TACAACTGAA TGGAGTGAAA AACTATGGTT ATTATCCCGA CAATTTTCTG CATAACCAAC CTGAAATAGA CCTTATTCGT CCTGAGTTTT CAACAGCCTG GTATCCGAAA AATGATTAA
|
Protein sequence | MLRNGNKYLL MLVSIIMLTA CISQSRTSFI PPQDRESLLA EQPWPHNGFV AISWHNVEDE AADQRFMSVR TSALREQFAW LRENGYQPVS IAQIREAHRG GKPLPEKAVV LTFDDGYQSF YTRVFPILQA FQWPAVWAPV GSWVDTPADK QVKFGDELVD REYFATWQQV REVARSRLVE LASHTWNSHY GIQANATGSL LPVYVNRAYF TDHARYETAA EYRERIRLDA VKMTEYLRTK VEVNPHVFVW PYGEANGIAI EELKKLGYDM FFTLESGLAN ASQLDSIPRV LIANNPSLKE FAQQIITVQE KSPQRIMHID LDYVYDENLQ QMDRNIDVLI QRVKDMQIST VYLQAFADPD GDGLVKEVWF PNRLLPMKAD IFSRVAWQLR TRSGVNIYAW MPVLSWDLDP TLTRVKYLPT GEKKAQIHPE QYHRLSPFDD RVRAQVGMLY EDLAGHAAFD GILFHDDALL SDYEDASAPA ITAYQQAGFS GSLSEIRQNP EQFKQWARFK SRALTDFTLE LSARVKAIRG PHIKTARNIF ALPVIQPESE AWFAQNYADF LKSYDWTAIM AMPYLEGVAE KSADQWLIQL TNQIKNIPQA KDKSILELQA QNWQKNGQHQ AISSQQLAHW MSLLQLNGVK NYGYYPDNFL HNQPEIDLIR PEFSTAWYPK ND
|
| |