Gene ECH74115_5438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5438 
SymbolbirA 
ID6967399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5085212 
End bp5086177 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content49% 
IMG OID643389088 
Productbiotin--protein ligase 
Protein accessionYP_002273493 
Protein GI209399876 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0340] Biotin-(acetyl-CoA carboxylase) ligase 
TIGRFAM ID[TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
[TIGR00122] BirA biotin operon repressor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000183056 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00134266 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGATA ACACCGTGCC ACTGAAATTA ATCGCGCTGT TAGCGAATGG CGAATTTCAC 
TCAGGTGAGC AGTTGGGTGA AACGTTAGGA ATGAGCCGGG CGGCTATTAA TAAACACATT
CAGACACTGC GTGACTGGGG CGTTGATGTC TTTACCGTTC CGGGTAAAGG ATACAGCCTG
CCTGAGCCTA TCCAGTTACT TAATGCTGAA CAGATATTGG GTCAGCTGGA TGGCGGTAGT
GTAACTGTGC TGCCAGTGAT TGACTCCACG AATCAGTACC TTCTTGATCG TATCGGAGAG
CTTAAATCGG GCGATGCCTG CGTTGCAGAA TACCAGCAGG CTGGCCGTGG TCGCCGGGGT
CGGAAATGGT TTTCGCCTTT TGGCGCAAAC TTATATTTGT CGATGTTCTG GCGTCTGGAA
CAAGGCCCGG CGGCGGCGAT TGGTTTAAGT CTGGTTATCG GTATCGTGAT GGCGGAAGTA
TTACGCAAGC TGGGAGCAGA TAAAGTTCGT GTCAAATGGC CTAATGACCT CTATCTGCAG
GATCGCAAGC TGGCAGGCAT TCTTGTGGAG CTGACTGGCA AAACCGGCGA TGCGGCGCAA
ATAGTCATTG GAGCCGGGAT CAACATGGCA ATGCGTCGTG TTGAAGAGAG TGTCGTTAAT
CAGGGGTGGA TCACGCTGCA GGAAGCGGGG ATCAATCTCG ATCGTAATAC GTTGGCGGCC
ATGCTAATAC GTGAATTACG CGCGGCGCTG GAACTCTTCG AACAAGAAGG ATTGGCACCT
TATCTTTCGC GCTGGGAAAA GCTGGATAAT TTTATTAATC GCCCAGTGAA ACTTATCATT
GGTGATAAAG AAATATTTGG CATTTCACGC GGAATAGACA AACAAGGCGC TTTATTGCTT
GAGCAGGATG GAATAATAAA ACCCTGGATG GGCGGTGAAA TATCCCTGCG TAGTGCAGAA
AAATAA
 
Protein sequence
MKDNTVPLKL IALLANGEFH SGEQLGETLG MSRAAINKHI QTLRDWGVDV FTVPGKGYSL 
PEPIQLLNAE QILGQLDGGS VTVLPVIDST NQYLLDRIGE LKSGDACVAE YQQAGRGRRG
RKWFSPFGAN LYLSMFWRLE QGPAAAIGLS LVIGIVMAEV LRKLGADKVR VKWPNDLYLQ
DRKLAGILVE LTGKTGDAAQ IVIGAGINMA MRRVEESVVN QGWITLQEAG INLDRNTLAA
MLIRELRAAL ELFEQEGLAP YLSRWEKLDN FINRPVKLII GDKEIFGISR GIDKQGALLL
EQDGIIKPWM GGEISLRSAE K