Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_00742 |
Symbol | bioB |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 793337 |
End bp | 794377 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | biotin synthase |
Protein accession | ACT42643 |
Protein GI | 253976973 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000391904 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCACC GCCCACGCTG GACATTGTCG CAAGTCACAG AATTATTTGA AAAACCGTTG CTGGATCTGC TGTTTGAAGC GCAGCAGGTG CATCGCCAGC ATTTCGATCC TCGTCAGGTG CAGGTCAGCA CGTTGCTGTC GATTAAGACC GGAGCTTGTC CGGAAGATTG CAAATACTGC CCGCAAAGCT CGCGCTACAA AACCGGGCTG GAAGCCGAGC GGTTGATGGA AGTTGAACAG GTGCTGGAGT CGGCGCGCAA AGCGAAAGCG GCAGGATCGA CGCGCTTCTG TATGGGCGCG GCGTGGAAGA ATCCCCACGA ACGCGATATG CCGTACCTGG AACAAATGGT GCAGGGGGTA AAAGCGATGG GGCTGGAGGC GTGTATGACG CTGGGCACGT TGAGTGAATC TCAGGCGCAG CGCCTCGCGA ACGCCGGGCT GGATTACTAC AACCACAACC TGGACACCTC GCCGGAGTTT TACGGCAATA TCATCACCAC ACGCACTTAT CAGGAACGCC TCGATACGCT GGAAAAAGTG CGCGATGCCG GGATCAAAGT CTGTTCTGGC GGCATTGTGG GCTTAGGCGA AACGGTAAAA GATCGCGCCG GATTATTGCT GCAACTGGCA AACCTGCCGA CGCCGCCGGA AAGCGTGCCA ATCAACATGC TGGTGAAGGT GAAAGGCACG CCGCTTGCCG ATAACGATGA TGTCGATGCC TTTGATTTTA TTCGCACCAT TGCGGTCGCG CGGATCATGA TGCCAACCTC TTACGTGCGC CTTTCTGCCG GACGCGAGCA GATGAACGAA CAGACTCAGG CGATGTGCTT TATGGCAGGC GCAAACTCGA TTTTCTACGG TTGCAAACTG CTGACCACGC CGAATCCGGA AGAAGATAAA GACCTGCAAC TGTTCCGCAA ACTGGGGCTA AATCCGCAGC AAACTGCCGT GCTGGCGGGC GATAACGAAC AACAGCAGCG TCTGGAACAG GCACTGATGA CCCCGGACAC TGACGAATAT TACAACGCGG CAGCACTATG A
|
Protein sequence | MAHRPRWTLS QVTELFEKPL LDLLFEAQQV HRQHFDPRQV QVSTLLSIKT GACPEDCKYC PQSSRYKTGL EAERLMEVEQ VLESARKAKA AGSTRFCMGA AWKNPHERDM PYLEQMVQGV KAMGLEACMT LGTLSESQAQ RLANAGLDYY NHNLDTSPEF YGNIITTRTY QERLDTLEKV RDAGIKVCSG GIVGLGETVK DRAGLLLQLA NLPTPPESVP INMLVKVKGT PLADNDDVDA FDFIRTIAVA RIMMPTSYVR LSAGREQMNE QTQAMCFMAG ANSIFYGCKL LTTPNPEEDK DLQLFRKLGL NPQQTAVLAG DNEQQQRLEQ ALMTPDTDEY YNAAAL
|
| |