Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02017 |
Symbol | yehX |
ID | 8114680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2110657 |
End bp | 2111583 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644848229 |
Product | hypothetical protein |
Protein accession | YP_002999802 |
Protein GI | 251785498 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1125] ABC-type proline/glycine betaine transport systems, ATPase components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAT TTAGCCATGT CAGCAAACTG TTCGGCGCAC AAAAAGCCGT TAACGATCTC AATCTCAATT TTCAGGAAGG GAGTTTTTCG GTGCTGATTG GCACATCTGG CTCCGGCAAA TCCACCACCC TGAAAATGAT TAACCGCCTG GTGGAGCATG ACAGCGGCGT GATCCGCTTT GCCGGAGAAG AAATTCGCTC GCTGCCAGTG CTGGAGTTGC GCCGCCGGAT GGGCTATGCC ATTCAATCTA TTGGCCTGTT CCCCCACTGG AGCGTGGCGC AAAACATCGC CACCGTGCCG CAATTACAAA AATGGTCACG GGCGCGGATC GATGATCGTA TCGACGAATT AATGGCGCTA CTGGGGCTGG AGCCAAATTT GCGTGAGCGT TATCCGCATC AGCTTTCCGG TGGTCAGCAG CAACGTGTGG GAGTGGCGCG TGCACTGGCT GCCGATCCGC AAGTCTTACT GATGGATGAA CCTTTTGGCG CACTGGACCC GGTAACGCGC GGCGCGTTGC AACAAGAGAT GACGCGCATT CACCGTTTGC TGGGGCGTAC CATTGTGCTG GTCACTCATG ATATTGATGA GGCGCTACGG CTGGCAGAAC ATCTGGTATT GATGGATCAC GGTGAAGTAG TGCAGCAGGG CAATCCGCTG ACGATGCTGA CTCGTCCGGC GAATGATTTT GTCCGCCAGT TTTTTGGACG TAGTGAACTG GGTGTGCGCC TGCTTTCGTT ACGTAGTGTG GCGGATTACG TGCGTCGCGA AGAACGAGCA GATGGTGAGG CACTGGCAGA AGAGATGACG CTACGCGATG CGCTCTCTCT GTTTGTTGCG CGGGGATGCG AGGTGCTGCC GGTGGTGAAC ATGCAGGGCC AGCCTTGCGG CACGCTGCAT TTTCAGGATC TGCTGGTGGA GGCGTAA
|
Protein sequence | MIEFSHVSKL FGAQKAVNDL NLNFQEGSFS VLIGTSGSGK STTLKMINRL VEHDSGVIRF AGEEIRSLPV LELRRRMGYA IQSIGLFPHW SVAQNIATVP QLQKWSRARI DDRIDELMAL LGLEPNLRER YPHQLSGGQQ QRVGVARALA ADPQVLLMDE PFGALDPVTR GALQQEMTRI HRLLGRTIVL VTHDIDEALR LAEHLVLMDH GEVVQQGNPL TMLTRPANDF VRQFFGRSEL GVRLLSLRSV ADYVRREERA DGEALAEEMT LRDALSLFVA RGCEVLPVVN MQGQPCGTLH FQDLLVEA
|
| |