Gene Information |
Locus tag | SbBS512_E2577 |
Symbol | bioB |
ID | 6272237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2379205 |
End bp | 2380245 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641726559 |
Product | biotin synthase |
Protein accession | YP_001881039 |
Protein GI | 187732174 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.000942413 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCACC GCCCACGCTG GACATTGTCG CAAGTCGCAG AATTATTTGA AAAACCGTTG CTGGATCTGC TGTTTGAAGC GCAGCAGGTG CATCGCCAGC ATTTCGATCC TCGTCAGGTG CAGGTCAGCA CGTTGCTGTC GATTAAAACC GGTGCTTGCC CGGAAGATTG CAAATATTGC CCGCAAAGCT CGCGCTATAA AACCGGGCTG GAAGCCGAGC GGTTGATGGA AGTTGAACAG GTGCTGGAGT CGGCGCGCAA AGCGAAAGCG GCGGGATCGA CGCGTTTTTG CATGGGCGCG GCGTGGAAGA ATCCCAACGA ACGCGATATG CCGTACCTGG AGCAGATGGT GCAGGGGGTA AAAGATTTAG GGCTGGAGGC GTGTATGACG CTGGGCACGT TGAGTGAATC TCAGGCGCAG CGCCTCGCGA ACGCCGGGCT GGATTACTAC AACCACAACC TCGACACCTC GCCGGAGTTT TACGGCAATA TCATCACCAC CCGCACTTAT CAGGAACGCC TCGATACGCT GGAAAAAGTG CGCGATGCCG GGATCAAAGT CTGTTCTGGC GGCATTGTGG GCTTAGGCGA AACGGTAAAA GATCGCGCCG GATTATTGCT GCAACTGGCA AACCTGCCGA CGCCGCCGGA AAGCGTGCCA ATCAACATGC TGGTGAAGGT GAAAGGCACG CCGCTTGCCG ATAACGATGA CGTTGATGCC TTTGATTTTA TTCGCACCAT TGCGGTCGCG CGGATCATGA TGCCGACCTC TTACGTGCGC CTTTCTGCCG GACGCGAGCA GATGAACGAA CAGACTCAGG CGATGTGCTT TATGGCAGGC GCAAACTCGA TTTTCTACGG TTGCAAACTG CTGACCACGC CGAATCCGGA AGAAGATAAA GACCTGCAAC TGTTCCGCAA ACTGGGGCTA AATCCGCAGC AAACTGCCGT GCTGGCAGGG GATAACGAAC AACAGCAACG TCTTGAACAG GCGCTGATGA CCCCGGATAC TGACGAATAT TACAACGCGG CAGCACTATG A
Protein sequence | MAHRPRWTLS QVAELFEKPL LDLLFEAQQV HRQHFDPRQV QVSTLLSIKT GACPEDCKYC PQSSRYKTGL EAERLMEVEQ VLESARKAKA AGSTRFCMGA AWKNPNERDM PYLEQMVQGV KDLGLEACMT LGTLSESQAQ RLANAGLDYY NHNLDTSPEF YGNIITTRTY QERLDTLEKV RDAGIKVCSG GIVGLGETVK DRAGLLLQLA NLPTPPESVP INMLVKVKGT PLADNDDVDA FDFIRTIAVA RIMMPTSYVR LSAGREQMNE QTQAMCFMAG ANSIFYGCKL LTTPNPEEDK DLQLFRKLGL NPQQTAVLAG DNEQQQRLEQ ALMTPDTDEY YNAAAL
|
| |
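The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the source database: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon (the gene sequence above ends in TGA).

```python
# Values copied from the record above.
START_BP = 2379205
END_BP = 2380245
GENE_LENGTH_BP = 1041
PROTEIN_LENGTH_AA = 346


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace used to group bases is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence string to `gc_percent` gives a value to compare with the GC content field. Because the gene lies on the minus strand, the listed sequence is presumably the coding strand, i.e. the reverse complement of the replicon's forward strand over these coordinates.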