Gene Information |
Locus tag | B21_03161 |
Symbol | yhfK |
ID | 8116124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3345602 |
End bp | 3347704 |
Gene Length | 2103 bp |
Protein Length | 700 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644849343 |
Product | hypothetical protein |
Protein accession | YP_003000916 |
Protein GI | 251786612 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR01667] integral membrane protein, YccS/YhfK family |
|
|
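The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the stop codon; the record does not state either convention, so both are assumptions. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
start_bp = 3345602
end_bp = 3347704
gene_length_bp = 2103
protein_length_aa = 700


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of codons excluding the final stop codon.

    Assumes the CDS length includes exactly one terminal stop codon.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

Under these assumptions the 2103 bp span holds 701 codons, one of which is the stop codon, which agrees with the listed protein length of 700 aa.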
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0228744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTCCCC CGATGTGGCG CAGACTGATT TATCACCCCG ATATCAACTA TGCACTTCGA CAAACGCTGG TGCTATGTTT GCCCGTGGCC GTTGGGTTAA TGCTTGGCGA ATTACGATTC GGTCTGCTCT TCTCCCTCGT TCCTGCCTGT TGCAATATTG CGGGCCTTGA TACACCTCAT AAACGTTTTT TCAAACGCTT AATCATTGGT GCGTCGCTGT TTGCCACCTG TAGCTTGCTG ACACAGCTAC TACTGGCAAA AGATGTCCCC CTGCCCTTTT TGCTGACCGG ATTAACGCTG GTACTTGGCG TCACTGCTGA GCTGGGGCCA TTGCACGCAA AATTGCTTCC CGCATCGCTG CTCGCCGCCA TTTTTACCCT CAGTCTGGCG GGATACATGC CGGTCTGGGA ACCATTGCTC ATCTATGCGT TGGGCACTCT CTGGTACGGA TTGTTTAACT GGTTTTGGTT CTGGATCTGG CGCGAACAAC CGCTGCGCGA GTCACTAAGT CTGCTGTACC GTGAACTGGC AGATTATTGT GAAGCCAAAT ACAGCCTGCT TACCCAGCAC ACCGACCCTG AAAAAGCGCT GCCGCCGCTG CTGGTGCGCC AGCAAAAAGC GGTCGATTTA ATTACCCAGT GCTATCAGCA AATGCATATG CTTTCCGCGC AAAACAATAC CGATTACAAG CGGATGCTGC GTATTTTCCA GGAGGCGCTG GACTTGCAGG AACATATTTC GGTCAGTTTG CATCAGCCGG AAGAGGTGCA AAAACTGGTC GAGCGTAGCC ATGCGGAAGA AGTTATCCGC TGGAATGCGC AAACCGTCGC CGCTCGCCTG CGCGTGCTGG CTGATGACAT TCTTTACCAT CGCCTGCCAA CGCGTTTTAC GATGGAAAAG CAAATTGGCG CACTGGAAAA AATCGCCCGC CAGCATCCGG ATAATCCGGT TGGGCAATTC TGCTACTGGC ATTTCAGCCG CATCGCCCGC GTGCTGCGCA CCCAAAAACC GCTCTATGCC CGTGACTTAC TGGCCGATAA ACAGCGGCGA ATGCCATTAC TTCCGGCGCT GAAAAGTTAT CTGTCACTAA AGTCTCCGGC GCTACGCAAT GCCGGACGAC TCAGTGTGAT GTTAAGCGTT GCCAGCCTGA TGGGCACCGC GCTGCATCTG CCGAAGTCGT ACTGGATCCT GATGACGGTA TTGCTGGTGA CACAAAATGG CTATGGCGCA ACCCGTCTGA GGATTGTGAA TCGCTCCGTG GGAACCGTGG TCGGGTTAAT CATTGCGGGC GTGGCGCTGC ACTTTAAAAT TCCCGAAGGT TACACCCTGA CGTTGATGCT GATTACCACC CTCGCCAGCT ACCTGATATT GCGCAAAAAC TACGGCTGGG CGACGGTCGG TTTTACTATT ACCGCAGTGT ATACCCTGCA ACTATTGTGG TTGAACGGCG AGCAATACAT CCTTCCGCGT CTTATCGATA CCATTATTGG TTGTTTAATT GCTTTCGGCG GTACTGTCTG GCTGTGGCCG CAGTGGCAGA GCGGGTTATT GCGTAAAAAC GCCCATGATG CTTTAGAAGC CTATCAGGAA GCGATTCGCT TGATTCTTAG CGAGGATCCG CAACCTACGC CACTGGCCTG GCAGCGAATG CGGGTAAATC AGGCACATAA CACTCTGTAT AACTCATTGA ATCAGGCGAT GCAGGAACCG GCGTTTAACA GCCATTATCT GGCAGATATG AAACTGTGGG TAACGCACAG CCAGTTTATT GTTGAGCATA TTAATGCCAT GACCACGCTG 
GCGCGGGAAC ACCGGGCATT GACACCTGAA CTGGCACAAG AGTATTTACA GTCTTGTGAA ATCGCCATTC AGCGTTGTCA GCAGCGACTG GAGTATGACG AACCGGGTAG TTCTGGCGAT GCCAATATCA TGGATGCGCC GGAGATGCAG CCGCACGAAG GCGCGGCAGG TACGCTGGAG CAGCATTTAC AGCGGGTTAT TGGTCATCTG AACACCATGC ACACCATTTC GTCGATGGCA TGGCGTCAGC GACCGCATCA CGGGATTTGG CTGAGTCGCA AGTTGCGGGA TTCGAAGGCG TAA
|
Protein sequence | MFPPMWRRLI YHPDINYALR QTLVLCLPVA VGLMLGELRF GLLFSLVPAC CNIAGLDTPH KRFFKRLIIG ASLFATCSLL TQLLLAKDVP LPFLLTGLTL VLGVTAELGP LHAKLLPASL LAAIFTLSLA GYMPVWEPLL IYALGTLWYG LFNWFWFWIW REQPLRESLS LLYRELADYC EAKYSLLTQH TDPEKALPPL LVRQQKAVDL ITQCYQQMHM LSAQNNTDYK RMLRIFQEAL DLQEHISVSL HQPEEVQKLV ERSHAEEVIR WNAQTVAARL RVLADDILYH RLPTRFTMEK QIGALEKIAR QHPDNPVGQF CYWHFSRIAR VLRTQKPLYA RDLLADKQRR MPLLPALKSY LSLKSPALRN AGRLSVMLSV ASLMGTALHL PKSYWILMTV LLVTQNGYGA TRLRIVNRSV GTVVGLIIAG VALHFKIPEG YTLTLMLITT LASYLILRKN YGWATVGFTI TAVYTLQLLW LNGEQYILPR LIDTIIGCLI AFGGTVWLWP QWQSGLLRKN AHDALEAYQE AIRLILSEDP QPTPLAWQRM RVNQAHNTLY NSLNQAMQEP AFNSHYLADM KLWVTHSQFI VEHINAMTTL AREHRALTPE LAQEYLQSCE IAIQRCQQRL EYDEPGSSGD ANIMDAPEMQ PHEGAAGTLE QHLQRVIGHL NTMHTISSMA WRQRPHHGIW LSRKLRDSKA
|
| |
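The gene sequence starts with TTG while the protein sequence starts with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below is a minimal, dependency-free illustration, not the pipeline that produced this record: `translate` is a hypothetical helper, and only the first and last blocks of the sequence shown above are translated.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_cds_start=True):
    """Translate a DNA string, stopping at the first stop codon.

    Spaces are ignored so the 10-base blocks used in the record can be
    pasted directly. If is_cds_start is True and the first codon is a
    permitted start codon, it is translated as M.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("TTGTTTCCCC CGATGTGGCG CAGACTGATT"))

# Last 15 bases of the gene sequence (not the start of the CDS).
print(translate("GATTCGAAGGCGTAA", is_cds_start=False))
```

The first call yields the opening ten residues of the listed protein sequence, and the second yields its final four residues before the TAA stop codon.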