Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4421 |
Symbol | birA |
ID | 6143412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4518654 |
End bp | 4519619 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641619241 |
Product | biotin--protein ligase |
Protein accession | YP_001746361 |
Protein GI | 170682429 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0340] Biotin-(acetyl-CoA carboxylase) ligase |
TIGRFAM ID | [TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region [TIGR00122] BirA biotin operon repressor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000434302 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.00072275 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGGATA ATACCGTTCC GCTGAAATTA ATCGCGCTGT TAGCGAATGG CGAATTTCAC TCTGGTGAGC AGTTGGGTGA AACGTTAGGA ATGAGCCGGG CGGCTATTAA TAAACACATT CAGACACTGC GTGACTGGGG CGTTGATGTC TTTACCGTTC CGGGTAAAGG ATACAGCCTG CCTGAGCCTA TCCAGTTACT TAATGCTGAA CAGATATTGG GTCAGCTGGA TGGCGGTAGT GTAACTGTGC TGCCCGTTAT TGACTCCACG AATCAGTACC TTCTTGATCG TATCGGAGAG CTTAAATCGG GCGATGCCTG TGTTGCAGAA TACCAGCAGG CTGGCCGTGG TCGCCGGGGT CGGAAATGGT TTTCGCCTTT TGGCGCAAAC TTATATTTGT CGATGTTCTG GCGTCTGGAA CAAGGCCCGG CGGCGGCGAT TGGTTTAAGT CTGGTTATCG GTATCGTGAT GGCGGAAGTA TTACGCAAGC TGGGTGCAGA TAAAGTTCGT GTTAAATGGC CTAATGACCT CTATTTGCAG GATCGCAAGC TGGCAGGCAT TCTTGTGGAG CTAACTGGCA AAACCGGCGA TGCGGCGCAA ATAGTCATTG GAGCCGGGAT CAACATGGCA ATGCGTCGTG TTGAAGAGAG TGTCGTTAAT CAAGGGTGGA TCACGCTGCA GGAAGCGGGA ATCAATCTCG ATCGTAATAC GTTGGCGGCC ATGCTCATAC GTGAATTACG CGCGGCGCTG GAACTCTTCG AGCAAGAAGG ATTGGCACCT TATCTTTCGC GCTGGGAAAA GCTGGATAAT TTTATTAATC GCCCAGTGAA ACTTATCATT GGTGATAAAG AAATATTTGG CATTTCACGT GGAATAGACA AACAGGGCGC TTTATTGCTT GAGCAGGATG GAATAATAAA ACCCTGGATG GGCGGTGAAA TATCCCTGCG TAGTGCGGAA AAATAA
|
Protein sequence | MKDNTVPLKL IALLANGEFH SGEQLGETLG MSRAAINKHI QTLRDWGVDV FTVPGKGYSL PEPIQLLNAE QILGQLDGGS VTVLPVIDST NQYLLDRIGE LKSGDACVAE YQQAGRGRRG RKWFSPFGAN LYLSMFWRLE QGPAAAIGLS LVIGIVMAEV LRKLGADKVR VKWPNDLYLQ DRKLAGILVE LTGKTGDAAQ IVIGAGINMA MRRVEESVVN QGWITLQEAG INLDRNTLAA MLIRELRAAL ELFEQEGLAP YLSRWEKLDN FINRPVKLII GDKEIFGISR GIDKQGALLL EQDGIIKPWM GGEISLRSAE K
|
| |