Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_04104 |
Symbol | yjhB |
ID | 8112940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 4412406 |
End bp | 4413623 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 644850251 |
Product | hypothetical protein |
Protein accession | YP_003001824 |
Protein GI | 251787520 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACAG CATGGTATAA ACAAGTTAAT CCACCACAAC GGAAAGCTCT TTTTTCCGCA TGGCTTGGAT ATGTATTTGA TGGCTTTGAT TTTATGATGA TATTTTACAT TCTTCATATT ATAAAAGCAG ATCTTGGCAT TACGGATATT CAGGCTACTT TAATAGGGAC AGTGGCCTTC ATAGCCAGAC CTATTGGAGG TGGTTTTTTT GGTGCCATGG CTGATAAATA TGGTCGTAAG CCAATGATGA TGTGGGCAAT TTTCATTTAC TCAGTCGGAA CAGGCCTTAG CGGTATTGCT ACAAACTTAT ATATGCTCGC AGTTTGCCGT TTTATTGTTG GCTTAGGGAT GTCTGGTGAA TATGCATGTG CTTCAACTTA TGCGGTAGAA AGTTGGCCTA AAAATCTTCA ATCTAAAGCT AGTGCTTTTT TGGTAAGTGG TTTTTCTGTT GGAAATATTA TTGCGGCACA AATAATCCCT CAGTTTGCTG AAGTATATGG ATGGAGAAAC TCTTTTTTTA TAGGCCTGTT ACCAGTTTTA CTAGTTCTTT GGATCAGAAA AAGTGCTCCA GAAAGTCAGG AGTGGATTGA AGATAAATAT AAGGATAAAT CAACATTTTT GTCTGTCTTC AGAAAACCAC ATCTTTCAAT CTCTATGATC GTTTTCCTCG TCTGTTTTTG TCTATTTGGT GCAAACTGGC CGATAAACGG ACTACTTCCT TCCTACCTGG CAGATAATGG AGTTAATACA GTGGTCATTT CAACTCTGAT GACAATAGCA GGTTTAGGAA CACTGACAGG TACAATATTT TTTGGTTTTG TTGGTGATAA GATTGGTGTA AAAAAAGCCT TTGTAGTCGG TCTAATAACT TCATTTATTT TCCTTTGTCC TCTTTTTTTT ATTTCTGTGA AAAACTCTTC TCTTATAGGA TTATGTCTCT TTGGATTAAT GTTTACAAAT TTAGGTATTG CAGGGTTGGT TCCAAAATTT ATATATGATT ACTTTCCAAC AAAATTAAGA GGATTAGGGA CCGGTCTTAT TTATAACTTA GGGGCAACTG GAGGAATGGC CGCACCTGTA TTAGCTACAT ACATTTCAGG ATATTATGGC TTAGGTGTTT CATTATTCAT TGTTACGGTT GCATTCTCTG CCTTATTAAT TTTGTTAGTT GGTTTTGATA TTCCAGGTAA AATTTATAAA CTATCCGTGG CTAAATGA
|
Protein sequence | MATAWYKQVN PPQRKALFSA WLGYVFDGFD FMMIFYILHI IKADLGITDI QATLIGTVAF IARPIGGGFF GAMADKYGRK PMMMWAIFIY SVGTGLSGIA TNLYMLAVCR FIVGLGMSGE YACASTYAVE SWPKNLQSKA SAFLVSGFSV GNIIAAQIIP QFAEVYGWRN SFFIGLLPVL LVLWIRKSAP ESQEWIEDKY KDKSTFLSVF RKPHLSISMI VFLVCFCLFG ANWPINGLLP SYLADNGVNT VVISTLMTIA GLGTLTGTIF FGFVGDKIGV KKAFVVGLIT SFIFLCPLFF ISVKNSSLIG LCLFGLMFTN LGIAGLVPKF IYDYFPTKLR GLGTGLIYNL GATGGMAAPV LATYISGYYG LGVSLFIVTV AFSALLILLV GFDIPGKIYK LSVAK
|
| |