Gene EcSMS35_4421 details

Gene Information

Locus tag: EcSMS35_4421
Symbol: birA
ID: 6143412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4518654
End bp: 4519619
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 49%
IMG OID: 641619241
Product: biotin--protein ligase
Protein accession: YP_001746361
Protein GI: 170682429
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0340] Biotin-(acetyl-CoA carboxylase) ligase
TIGRFAM ID: [TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
            [TIGR00122] BirA biotin operon repressor domain
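
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either point explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4518654
END_BP = 4519619
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under those assumptions the four fields agree with one another: 4519619 - 4518654 + 1 = 966 bp, and 966 / 3 - 1 = 321 aa.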


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000434302
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.00072275
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGGATA ATACCGTTCC GCTGAAATTA ATCGCGCTGT TAGCGAATGG CGAATTTCAC 
TCTGGTGAGC AGTTGGGTGA AACGTTAGGA ATGAGCCGGG CGGCTATTAA TAAACACATT
CAGACACTGC GTGACTGGGG CGTTGATGTC TTTACCGTTC CGGGTAAAGG ATACAGCCTG
CCTGAGCCTA TCCAGTTACT TAATGCTGAA CAGATATTGG GTCAGCTGGA TGGCGGTAGT
GTAACTGTGC TGCCCGTTAT TGACTCCACG AATCAGTACC TTCTTGATCG TATCGGAGAG
CTTAAATCGG GCGATGCCTG TGTTGCAGAA TACCAGCAGG CTGGCCGTGG TCGCCGGGGT
CGGAAATGGT TTTCGCCTTT TGGCGCAAAC TTATATTTGT CGATGTTCTG GCGTCTGGAA
CAAGGCCCGG CGGCGGCGAT TGGTTTAAGT CTGGTTATCG GTATCGTGAT GGCGGAAGTA
TTACGCAAGC TGGGTGCAGA TAAAGTTCGT GTTAAATGGC CTAATGACCT CTATTTGCAG
GATCGCAAGC TGGCAGGCAT TCTTGTGGAG CTAACTGGCA AAACCGGCGA TGCGGCGCAA
ATAGTCATTG GAGCCGGGAT CAACATGGCA ATGCGTCGTG TTGAAGAGAG TGTCGTTAAT
CAAGGGTGGA TCACGCTGCA GGAAGCGGGA ATCAATCTCG ATCGTAATAC GTTGGCGGCC
ATGCTCATAC GTGAATTACG CGCGGCGCTG GAACTCTTCG AGCAAGAAGG ATTGGCACCT
TATCTTTCGC GCTGGGAAAA GCTGGATAAT TTTATTAATC GCCCAGTGAA ACTTATCATT
GGTGATAAAG AAATATTTGG CATTTCACGT GGAATAGACA AACAGGGCGC TTTATTGCTT
GAGCAGGATG GAATAATAAA ACCCTGGATG GGCGGTGAAA TATCCCTGCG TAGTGCGGAA
AAATAA
 
Protein sequence
MKDNTVPLKL IALLANGEFH SGEQLGETLG MSRAAINKHI QTLRDWGVDV FTVPGKGYSL 
PEPIQLLNAE QILGQLDGGS VTVLPVIDST NQYLLDRIGE LKSGDACVAE YQQAGRGRRG
RKWFSPFGAN LYLSMFWRLE QGPAAAIGLS LVIGIVMAEV LRKLGADKVR VKWPNDLYLQ
DRKLAGILVE LTGKTGDAAQ IVIGAGINMA MRRVEESVVN QGWITLQEAG INLDRNTLAA
MLIRELRAAL ELFEQEGLAP YLSRWEKLDN FINRPVKLII GDKEIFGISR GIDKQGALLL
EQDGIIKPWM GGEISLRSAE K
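
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG).

```python
# Standard codon table built in TCAG order; table 11 uses the same
# amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above (60 nt, 20 codons).
first_row = "ATGAAGGATA ATACCGTTCC GCTGAAATTA ATCGCGCTGT TAGCGAATGG CGAATTTCAC"
print(translate(first_row))  # MKDNTVPLKLIALLANGEFH
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and the GC content field.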