Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02026 |
Symbol | yohG |
ID | 8115826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2121540 |
End bp | 2122628 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644848238 |
Product | hypothetical protein |
Protein accession | YP_002999811 |
Protein GI | 251785507 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCAG AAGGCTTAAT GGGGCCGTTT GCTCTGAACG ATCCGGCCGC AGGTACGACC GGCCCGTGGT ACACCAACGG TACTTTTGGC TTAACGGCGG GCTGGCATCT CGATATCTGG GGAAAGAATC GGGCGGAGGT TACTGCCCGC CTGGGTACGG TTAAAGCACG GGCGGCGGAA CGCGAGCAAA CCCGCCAATT GCTGGCTGGC AGCGTAGCCC GCCTGTACTG GGAGTGGCAA ACCCAGGCGG CGTTAAACAC GGTCTTGCAG CAAATAGAAA AAGAGCAGAA CACCATTATC GCGACCGATC GCCAGCTATA TCAGAACGGG ATTACTTCTT CAGTTGAAGG TGTGGAAACC GATATTAATG CCAGCAAAAC CCGGCAGCAG CTCAACGATG TCGCGGGGAA AATGAAAATT ATTGAGGCAC GGTTAAGCGC ACTTACAAAT AACCAGACAA AGTCATTGAA GCTTAAACCG GTCGCGTTGC CGAAAGTGGC AAGCCAGCTT CCTGATGAAC TGGGGTACTC CTTACTGGCC CGGCGGGCAG ATTTGCAGGC GGCGCACTGG TACGTTGAGT CATCGCTAAG CACCATTGAT GCGGCAAAAG CGGCATTTTA TCCTGACATC AACCTGATGG CCTTCCTGCA ACAGGATGCG TTGCACTTAA GCGATCTGTT CCGTCATTCC GCGCAGCAAA TGGGCGTTAC GGCAGGCCTG ACGCTACCCA TTTTCGATAG TGGTCGTCTT AACGCCAATC TCGATATCGC AAAAGCCGAA AGCAACTTGT CTATCGCCAG CTACAACAAA GCGGTGGTTG AAGCGGTGAA TGACGTGGCG CGGGCAGCCA GTCAGGTTCA GACACTGGCG GAGAAAAACC AGCATCAGGC GCAAATTGAG CGCGATGCCT TGCGTGTGGT AGGTCTTGCG CAGGCGCGCT TTAACGCGGG CATCATTGCT GGTTCCCGCG TCAGCGAAGC CAGAATCCCC GCGCTGCGTG AGCGGGCCAA TGGCCTGTTA TTGCAAGGGC AGTGGCTGGA TGCCTCCATT CAACTCACTG GTGCGTTGGG CGGGGGTTAC AAACGCTGA
|
Protein sequence | MSAEGLMGPF ALNDPAAGTT GPWYTNGTFG LTAGWHLDIW GKNRAEVTAR LGTVKARAAE REQTRQLLAG SVARLYWEWQ TQAALNTVLQ QIEKEQNTII ATDRQLYQNG ITSSVEGVET DINASKTRQQ LNDVAGKMKI IEARLSALTN NQTKSLKLKP VALPKVASQL PDELGYSLLA RRADLQAAHW YVESSLSTID AAKAAFYPDI NLMAFLQQDA LHLSDLFRHS AQQMGVTAGL TLPIFDSGRL NANLDIAKAE SNLSIASYNK AVVEAVNDVA RAASQVQTLA EKNQHQAQIE RDALRVVGLA QARFNAGIIA GSRVSEARIP ALRERANGLL LQGQWLDASI QLTGALGGGY KR
|
| |