Gene Information |
Locus tag | B21_01991 |
Symbol | yegU |
ID | 8114664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2077202 |
End bp | 2078206 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644848204 |
Product | hypothetical protein |
Protein accession | YP_002999777 |
Protein GI | 251785473 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1397] ADP-ribosylglycohydrolase |
TIGRFAM ID | |
|
|
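The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in one stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2077202
end_bp = 2078206

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1005 bp) and Protein Length (334 aa), and the length is a whole number of codons.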
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAG AACGTATTCT CGGTGCTCTT TATGGGCAGG CGTTAGGGGA TGCGATGGGA ATGCCCTCCG AGCTTTGGCC ACGCAGCCGC GTTAAAGCAC ACTTTGGCTG GATTGACCGT TTTCTTCCTG GACCAAAGGA GAATAACGCG GCCTGTTATT TTAACCGCGC CGAATTCACC GACGATACCT CGATGGCGCT GTGTCTGGCG GATGCGTTGC TGGAACGTAA AGGCAAGATC GATCCGGATC TGATTGGGCG TAATATTCTC GACTGGGCGC TGCGTTTCGA CGCCTTTAAC AAAAACGTGC TAGGTCCGAC CTCGAAGATT GCGCTTAACG CCATTCGCGA CGGTAAACCC GTTGCTGAAC TGGAAAACAA CGGCGTGACC AACGGCGCAG CGATGCGCGT CTCGCCATTA GGTTGTTTGC TTCCGGCGCG TGATGTTGAT TCCTTTATTG ATGATGTGGC GCTGGCGTCG AGCCCAACCC ATAAATCCGA TCTGGCAGTT GCAGGTGCGG TAGTCATCGC ATGGGCGATT TCTCGTGCCA TTGACGGAGA AAGCTGGTCA GCAATTGTTG ATTCACTGCC TTCAATTGCG CGACATGCAC AGCAAAAACG CATCACTACC TTCAGCGCCT CACTGGCAGC ACGTCTGGAG ATTGCGCTGA AAATTGTGCG CAATGCCGAC GGCACCGAAT CCGCCAGCGA ACAGCTTTAC CAGGTCGTTG GCGCAGGTAC CAGCACTATT GAGTCCGTTC CGTGCGCCAT TGCGCTGGTT GAACTGGCAC AAACCGACCC GAATCGCTGC GCCGTCCTGT GCGCTAACCT TGGCGGCGAC ACAGACACCA TCGGTGCTAT GGCGACGGCA ATTTGCGGCG CGTTGCATGG CGTTAACGCT ATCGATCCTG CGTTAAAGGC GGAACTGGAT GCGGTAAATC AGCTTGATTT CAACCGCTAT GCCACAGCGC TGGCGAAATA TCGTCAACAA CGGGAGGCGG TATGA
|
Protein sequence | MKTERILGAL YGQALGDAMG MPSELWPRSR VKAHFGWIDR FLPGPKENNA ACYFNRAEFT DDTSMALCLA DALLERKGKI DPDLIGRNIL DWALRFDAFN KNVLGPTSKI ALNAIRDGKP VAELENNGVT NGAAMRVSPL GCLLPARDVD SFIDDVALAS SPTHKSDLAV AGAVVIAWAI SRAIDGESWS AIVDSLPSIA RHAQQKRITT FSASLAARLE IALKIVRNAD GTESASEQLY QVVGAGTSTI ESVPCAIALV ELAQTDPNRC AVLCANLGGD TDTIGAMATA ICGALHGVNA IDPALKAELD AVNQLDFNRY ATALAKYRQQ REAV
|
| |
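The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that spacing, computes a GC fraction, and translates codons using the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
# Codon table built from the standard base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
prefix = "ATGAAAACAG AACGTATTCT CGGTGCTCTT"
print(translate(prefix))  # MKTERILGAL
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field reported in the record.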