Gene Information |
Locus tag | B21_02953 |
Symbol | agaA |
ID | 8116240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3148176 |
End bp | 3149330 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644849138 |
Product | hypothetical protein |
Protein accession | YP_003000711 |
Protein GI | 251786407 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGTC GAGGAAGGAA TATGACACAC GTTCTGCGCG CCAGAAGGCT GCTGACTGAA GAGGGATGGC TCGATGACCA TCAGTTGCGT ATTGCTGACG GTGTCATCGC AGCAATCGAA CCGATTCCAG CGGGCGTGAC TGAACGCGAT GCGGAACTGC TCTGCCCTGC TTACATCGAC ACCCATGTAC ACGGTGGTGC GGGCGTTGAT GTAATGGATG ACGCGCCGGA TGTACTCGAC AAGCTGGCAA TGCACAAGGC ACGCGAAGGT GTCGGCAGTT GGTTACCGAC CACCGTAACC GCGCCGCTTA ATACCATTCA TGCGGCGCTG AAACGTATTG CTCAACGTTG CCAACGCGGC GGACCTGGTG CGCAAGTGCT GGGGAGTTAT CTCGAAGGAC CGTACTTCAC GCCGCAGAAT AAAGGCGCGC ATCCGCCGGA GTTGTTTCGC GAGCTTGAAA TTGCCGAGCT GGATCAGTTG ATTGCCGTTT CTCAGCACAC CTTACGCGTG GTAGCGCTGG CACCGGAAAA AGAGGGGGCA TTGCAGGCCA TCCGCCATCT TAAACAGCAA AATGTACGAG TGATGCTGGG GCATAGCGCG GCGACCTGGC AACAAACTCG CGCCGCGTTT GATGCTGGTG CCGACGGCCT GGTGCATTGC TATAACGGGA TGACAGGTTT ACATCACCGC GAACCGGGAA TGGTTGGCGC GGGATTAACG GACAAGCGCG CCTGGCTGGA ACTGATAGCC GATGGTCATC ATGTGCATCC GGCGGCGATG TCGCTGTGTT GTTGCTGTGC AAAAGAGAGA ATCGTGATGA TCACCGACGC GATGCAGGCA GCCGGGATGC CGGATGGTCG CTATACGTTA TGTGGCGAAG AAGTGCAGAT GCACGGTGGC GTTGTCCGTA CCGCGTCCGG TGGGCTGGCG GGCAGTACGC TGTCTGTTGA TGCGGCAGTG CGCAACATGG TCGAGTTGAC GGGCGTAACG CCTGCGGAAG CCATTCATAT GGCATCGCTG CATCCGGCGC GAATGCTGGG TGTTGATGGT GTTCTGGGAT CGCTTAAACC GGGCAAACGC GCCAGCATCG TTGCGCTGGA TAGCGGGCTG CATGTGCAAC AAATCTGGAT TCAGAGTCAA TTAGCTTCGT TTTGA
|
Protein sequence | MSGRGRNMTH VLRARRLLTE EGWLDDHQLR IADGVIAAIE PIPAGVTERD AELLCPAYID THVHGGAGVD VMDDAPDVLD KLAMHKAREG VGSWLPTTVT APLNTIHAAL KRIAQRCQRG GPGAQVLGSY LEGPYFTPQN KGAHPPELFR ELEIAELDQL IAVSQHTLRV VALAPEKEGA LQAIRHLKQQ NVRVMLGHSA ATWQQTRAAF DAGADGLVHC YNGMTGLHHR EPGMVGAGLT DKRAWLELIA DGHHVHPAAM SLCCCCAKER IVMITDAMQA AGMPDGRYTL CGEEVQMHGG VVRTASGGLA GSTLSVDAAV RNMVELTGVT PAEAIHMASL HPARMLGVDG VLGSLKPGKR ASIVALDSGL HVQQIWIQSQ LASF
|
| |
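The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below is a minimal consistency check, not part of the source record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon (the record lists "Is gene spliced | No" and the gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 3148176
END_BP = 3149330


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # prints: 1155 384
```

Under these assumptions the computed values agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields of the record.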