Gene Information |
Locus tag | B21_01617 |
Symbol | ydhP |
ID | 8116779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1680704 |
End bp | 1681873 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644847845 |
Product | hypothetical protein |
Protein accession | YP_002999418 |
Protein GI | 251785114 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
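The length fields in the record can be cross-checked against the coordinates. The sketch below is a minimal consistency check, not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative.

```python
# Values copied from the Gene Information record above.
start_bp = 1680704
end_bp = 1681873

# Assumption: coordinates are 1-based and inclusive at both ends,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which does not
# contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1170 bp) and Protein Length (389 aa) fields.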
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAATTA ACTATCCGTT GCTGGCGCTG GCGATTGGCG CGTTTGGTAT CGGGACAACG GAGTTCTCGC CAATGGGCTT GTTGCCCGTC ATTGCGCGCG GTGTGGATGT CTCGATTCCC GCTGCCGGAA TGTTAATCAG TGCCTATGCA GTTGGCGTAA TGGTGGGCGC GCCGCTGATG ACGCTTCTAC TTTCTCATCG TGCCCGCCGC AGTGCGTTGA TTTTCCTGAT GGCAATTTTC ACGCTAGGCA ACGTTCTTTC CGCCATCGCG CCGGATTATA TGACCCTGAT GCTTTCACGC ATTTTGACCA GCCTGAATCA CGGAGCATTT TTTGGTTTGG GTTCAGTCGT GGCCGCAAGC GTGGTGCCAA AACATAAACA GGCCAGCGCA GTTGCCACTA TGTTTATGGG GTTAACCCTG GCAAATATCG GTGGCGTGCC GGCGGCGACC TGGTTGGGTG AAACCATCGG CTGGCGGATG TCATTTCTGG CAACGGCGGG GCTGGGAGTG ATTTCAATGG TAAGTCTGTT CTTCTCATTA CCTAAAGGTG GTGCAGGGGC ACGACCTGAA GTGAAAAAAG AGCTGGCGGT ATTAATGCGT CCGCAGGTGC TGTCTGCATT GCTGACGACG GTACTGGGAG CTGGTGCAAT GTTTACTCTC TACACCTATA TCTCTCCGGT ACTGCAAAGT ATTACCCACG CAACACCGGT GTTCGTCACG GCAATGCTGG TGCTGATTGG TGTCGGATTC TCTATCGGTA ACTATCTCGG CGGCAAACTG GCAGATCGTT CAGTTAACGG CACGTTGAAA GGCTTTTTGT TGCTGCTGAT GGTGATTATG CTGGCAATCC CGTTCCTGGC CCGCAATGAG TTCGGCGCAG CTATTAGCAT GGTGGTGTGG GGCGCAGCAA CCTTTGCGGT CGTACCGCCG TTACAGATGC GCGTGATGCG TGTCGCCAGT GAAGCGCCGG GTCTGTCTTC ATCAGTCAAT ATTGGTGCCT TTAATCTTGG AAATGCGCTG GGAGCAGCTG CTGGTGGTGC GGTAATTTCC GCTGGGCTGG GATACAGCTT TGTGCCGGTG ATGGGAGCGA TTGTCGCGGG ACTGGCATTA TTGCTGGTGT TTATGTCAGC CAGAAAACAA CCTGAAACAG TTTGCGTTGC TAACAGCTAA
Protein sequence | MKINYPLLAL AIGAFGIGTT EFSPMGLLPV IARGVDVSIP AAGMLISAYA VGVMVGAPLM TLLLSHRARR SALIFLMAIF TLGNVLSAIA PDYMTLMLSR ILTSLNHGAF FGLGSVVAAS VVPKHKQASA VATMFMGLTL ANIGGVPAAT WLGETIGWRM SFLATAGLGV ISMVSLFFSL PKGGAGARPE VKKELAVLMR PQVLSALLTT VLGAGAMFTL YTYISPVLQS ITHATPVFVT AMLVLIGVGF SIGNYLGGKL ADRSVNGTLK GFLLLLMVIM LAIPFLARNE FGAAISMVVW GAATFAVVPP LQMRVMRVAS EAPGLSSSVN IGAFNLGNAL GAAAGGAVIS AGLGYSFVPV MGAIVAGLAL LLVFMSARKQ PETVCVANS
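The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions rather than the pipeline used to build this record: the helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand using the amino acid assignments of translation table 11, which are the same as those of the standard code. Only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Amino acid assignments for translation table 11, listed in
# TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(text: str) -> str:
    """Remove the whitespace between ten-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate seq codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
head = ungroup("ATGAAAATTA ACTATCCGTT GCTGGCGCTG")
print(len(head), translate(head))  # 30 MKINYPLLAL
```

The translated excerpt matches the first block of the protein sequence. Applying the same functions to the full gene sequence would be the natural next step for checking the reported GC content and protein length.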