Gene Information |
Locus tag | B21_04038 |
Symbol | ybl215 |
ID | 8115077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4337899 |
End bp | 4338690 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644850188 |
Product | hypothetical protein |
Protein accession | YP_003001761 |
Protein GI | 251787457 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
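The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions (an assumption, though it agrees with the listed Gene Length) and that Protein Length excludes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4337899, 4338690)
print(length)                     # 792
print(protein_length_aa(length))  # 263
```

Both results match the Gene Length (792 bp) and Protein Length (263 aa) fields. The strand does not affect this check, since the coordinates describe the same interval on either strand.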
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00473885 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCCA CTCGTCCTGT TGCCGTGATT ACCGGAGCCG CTCGTGGTAT TGGAAAGGGC TGTGCGCTGG AGCTTGCTCG TGGCGGATTT AACCTGCTGA TTAACGATCT TCCCGATGCC GACAGCGTAG AAAAACTGCA TATCACACAA CAAGAATGTA TAGCTGAAGG CGTGGAAGTG ATTTGCTTCC CTGCCGATGT CGGCGATCTC TCCTTGCATG AAGAGATGCT CGACGCGGCG CAAAACCTGT GGGGACGTCT GGACTGCCTG CTCAACAACG CAGGCATTTC AGTGAAAAAA CGCGGTGACC TCCTCGACCT CGAACCAGAC AGCTTCGATC AAAACATTGC CATTAACACC CGCGCGCCGT TCTTCCTTGC TCAGGCATTC AGTAAGCGCT TACTGGCACA GCCGAAACCG GAAGCGGAGC TGCCGCACCG CTCGATTATT TTCGTCAGTT CCATCAACGC CATCATGCTG GCGATGAATC GCGGCGAATA CACCATTGCC AAAACCGCCG TTTCTGCCGC TGCTCGTCTG TTTGCTGCTC GCCTTTGTAA TGAACAAATT GGCGTTTATG AAGTGCGCCC TGGTCTCATC AAAACCGATA TGACTATTCC AGCAACGGCT TATTACGACG AACTGATTGC CAAAGGTCTG GTGCCGTGGG GACGTTGGGG TTATCCGGCT GATATTGCTT CCACCGTCCG CGCGATGGCG GAAGGTAAAC TGATTTACAC CTGTGGTCAA GCGGTCGCCA TCGACGGCGG CTTAAGTATG CCGCGCTTCT GA
|
Protein sequence | MKSTRPVAVI TGAARGIGKG CALELARGGF NLLINDLPDA DSVEKLHITQ QECIAEGVEV ICFPADVGDL SLHEEMLDAA QNLWGRLDCL LNNAGISVKK RGDLLDLEPD SFDQNIAINT RAPFFLAQAF SKRLLAQPKP EAELPHRSII FVSSINAIML AMNRGEYTIA KTAVSAAARL FAARLCNEQI GVYEVRPGLI KTDMTIPATA YYDELIAKGL VPWGRWGYPA DIASTVRAMA EGKLIYTCGQ AVAIDGGLSM PRF
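The sequences are displayed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch of how one might strip the grouping, compute the GC fraction, and run a basic sanity check on a coding sequence. The helper names are hypothetical; the stop codons listed are those of translation table 11, and how the database rounds its GC content field is not stated in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, between 0.0 and 1.0."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_cds(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in stop_codons


# Short illustration using the first two blocks of the gene sequence.
fragment = clean_sequence("ATGAAATCCA CTCGTCCTGT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.45
```

Applying `clean_sequence` to the full Gene sequence field and then calling `gc_fraction` and `looks_like_cds` would allow a comparison with the GC content and Gene Length fields listed above.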