Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00547 |
Symbol | ybdA |
ID | 8115440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 582588 |
End bp | 583838 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644846825 |
Product | hypothetical protein |
Protein accession | YP_002998398 |
Protein GI | 251784094 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC AATCCTGGCT GCTTAACCTC AGCCTGTTGA AAACGCACCC GGCGTTTCGC GCAGTATTCC TCGCTCGTTT TATCTCAATT GTGTCTCTGG GTTTGCTCGG CGTCGCGGTG CCGGTGCAGA TCCAGATGAT GACGCATTCT ACCTGGCAGG TGGGGCTTTC GGTGACGCTG ACCGGCGGCG CGATGTTTGT TGGCCTGATG GTTGGCGGTG TGCTGGCGGA TCGCTATGAA CGCAAAAAAG TGATTTTGCT GGCGCGCGGC ACCTGTGGCA TTGGCTTCAT TGGACTGTGC CTGAACGCGC TGCTGCCGGA GCCGTCATTG CTGGCAATCT ATTTACTTGG TTTATGGGAT GGTTTTTTCG CATCACTTGG TGTTACGGCG CTACTGGCGG CAACACCTGC ACTGGTAGGG CGTGAAAACT TAATGCAGGC CGGGGCGATC ACCATGTTGA CCGTGCGTCT GGGGTCGGTG ATTTCGCCCA TGATTGGCGG TTTATTGCTG GCGACCGGCG GCGTAGCCTG GAACTACGGG CTGGCGGCGG CGGGCACGTT TATTACCTTG CTACCGTTGT TAAGCCTTCC GGCGTTGCCA CCGCCACCGC AGCCGCGCGA GCATCCGTTG AAATCATTAC TGGCAGGATT TCGTTTTCTG CTCGCCAGCC CGCTGGTAGG AGGGATTGCG TTGCTGGGTG GTTTATTGAC GATGGCGAGC GCGGTGCGGG TACTGTATCC GGCGCTGGCT GACAACTGGC AGATGTCGGC GGCACAGATT GGTTTTCTCT ATGCGGCGAT CCCGCTCGGT GCGGCTATTG GCGCGTTAAC CAGTGGGAAG CTGGCACATA GTGCGCGACC AGGGTTATTG ATGCTGCTCT CCACGCTGGG ATCGTTCCTC GCCATTGGTC TGTTTGGCCT GATGCCGATG TGGATTTTAG GCGTGGTTTG TCTGGCGCTG TTCGGCTGGT TGAGTGCGGT CAGCTCGTTG CTGCAATACA CAATGCTGCA AACGCAAACC CCGGAAGCGA TGTTAGGGCG GATTAACGGT TTGTGGACGG CGCAAAACGT GACGGGCGAT GCCATAGGTG CGGCGCTGTT AGGCGGTCTG GGAGCGATGA TGACACCGGT TGCTTCTGCA AGCGCGAGCG GTTTTGGTTT GTTGATTATC GGCGTGTTGT TGTTGCTGGT GCTGGTGGAG TTGCGACGTT TTCGCCAGAC GCCGCCGCAG GTGACAGCGT CCGACAGTTA A
|
Protein sequence | MNKQSWLLNL SLLKTHPAFR AVFLARFISI VSLGLLGVAV PVQIQMMTHS TWQVGLSVTL TGGAMFVGLM VGGVLADRYE RKKVILLARG TCGIGFIGLC LNALLPEPSL LAIYLLGLWD GFFASLGVTA LLAATPALVG RENLMQAGAI TMLTVRLGSV ISPMIGGLLL ATGGVAWNYG LAAAGTFITL LPLLSLPALP PPPQPREHPL KSLLAGFRFL LASPLVGGIA LLGGLLTMAS AVRVLYPALA DNWQMSAAQI GFLYAAIPLG AAIGALTSGK LAHSARPGLL MLLSTLGSFL AIGLFGLMPM WILGVVCLAL FGWLSAVSSL LQYTMLQTQT PEAMLGRING LWTAQNVTGD AIGAALLGGL GAMMTPVASA SASGFGLLII GVLLLLVLVE LRRFRQTPPQ VTASDS
|
| |