Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0798 |
Symbol | bioB |
ID | 6144002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 801192 |
End bp | 802232 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615686 |
Product | biotin synthase |
Protein accession | YP_001742878 |
Protein GI | 170680651 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000218975 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCACC GCCCACGCTG GACATTGTCG CAAGTCACAG AATTATTTGA AAAACCGTTG CTGGATCTGC TGTTTGAAGC GCAGCAGGTG CATCGTCAGC ATTTCGATCC ACGTCAGGTG CAGGTCAGCA CTTTGCTGTC GATTAAGACC GGAGCTTGTC CGGAAGATTG CAAATACTGC CCGCAAAGCT CGCGCTACAA AACCGGGCTG GAAGCCGAGC GGTTGATGGA AGTTGAACAG GTGCTGGAGT CGGCGCGCAA AGCGAAAGCG GCAGGATCGA CGCGCTTCTG CATGGGCGCG GCGTGGAAGA ATCCCCACGA ACGCGATATG CCCTACCTGG AACAAATGGT GCAGGGAGTA AAAGCGATGG GGCTGGAGGC GTGTATGACG CTGGGTACGT TGAGTGAATC TCAGGCGCAG CGCCTTGCGA ACGCCGGGCT GGATTACTAC AACCACAACC TCGACACCTC GCCGGAGTTT TACGGCAATA TCATCACCAC GCGTACCTAT CAGGAACGCC TCGATACGCT GGAAAAAGTG CGCGACGCCG GGATCAAAGT CTGCTCGGGC GGCATTGTGG GCTTAGGCGA AACAGTAAAA GATCGCGCCG GATTATTGCT GCAACTGGCA AACCTGCCAA CGCCGCCGGA AAGCGTACCA ATCAACATGC TGGTGAAGGT GAAAGGCACG CCGCTTGCTG ATAACGATGA TGTCGATGCC TTTGATTTTA TTCGCACCAT TGCAGTTGCG CGGATCATGA TGCCGACCTC TTACGTACGC CTTTCTGCCG GACGCGAGCA GATGAACGAA CAGACTCAGG CGATGTGCTT TATGGCTGGC GCAAACTCGA TTTTCTACGG TTGCAAACTG CTGACCACGC CGAATCCGGA AGAAGATAAA GACCTGCAAC TGTTCCGCAA ACTGGGGCTA AATCCGCAGC AAACTGCCGT GCTGGCGGGC GATAACGAAC AACAGCAGCG TCTGGAGCAG GCGCTGATGA CCCCGGACAC TGACGAATAT TACAACGCGG CAGCACTATG A
|
Protein sequence | MAHRPRWTLS QVTELFEKPL LDLLFEAQQV HRQHFDPRQV QVSTLLSIKT GACPEDCKYC PQSSRYKTGL EAERLMEVEQ VLESARKAKA AGSTRFCMGA AWKNPHERDM PYLEQMVQGV KAMGLEACMT LGTLSESQAQ RLANAGLDYY NHNLDTSPEF YGNIITTRTY QERLDTLEKV RDAGIKVCSG GIVGLGETVK DRAGLLLQLA NLPTPPESVP INMLVKVKGT PLADNDDVDA FDFIRTIAVA RIMMPTSYVR LSAGREQMNE QTQAMCFMAG ANSIFYGCKL LTTPNPEEDK DLQLFRKLGL NPQQTAVLAG DNEQQQRLEQ ALMTPDTDEY YNAAAL
|
| |