Gene EcolC_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4047 
Symbol 
ID6065006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4458282 
End bp4459247 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content50% 
IMG OID641603466 
Productbiotin--protein ligase 
Protein accessionYP_001726973 
Protein GI170022019 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0340] Biotin-(acetyl-CoA carboxylase) ligase 
TIGRFAM ID[TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
[TIGR00122] BirA biotin operon repressor domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0641292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00297103 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGATA ACACCGTGCC ACTGAAATTG ATTGCCCTGT TAGCGAACGG TGAATTTCAC 
TCTGGCGAGC AGTTGGGTGA AACGCTGGGA ATGAGCCGGG CGGCTATTAA TAAACACATT
CAGACACTGC GTGACTGGGG CGTTGATGTC TTTACCGTTC CGGGTAAAGG ATACAGCCTG
CCTGAGCCCA TCCAGTTACT TAATGCTGAA CAGATATTGG GTCAGCTGGA TGGCGGTAGT
GTAGCCGTGC TGCCAGTTAT TGACTCCACG AATCAGTACC TTCTTGATCG TATCGGAGAG
CTTAAATCGG GCGATGCCTG TGTTGCAGAA TACCAGCAGG CTGGCCGTGG TCGCCGGGGG
CGGAAATGGT TTTCGCCTTT TGGCGCAAAC TTATATTTGT CGATGTTCTG GCGTCTGGAA
CAAGGCCCGG CGGCGGCGAT TGGTTTAAGT CTGGTTATCG GTATCGTGAT GGCGGAAGTA
TTACGCAAGC TGGGAGCAGA TAAAGTTCGT GTCAAATGGC CTAATGACCT CTATCTGCAG
GATCGCAAGC TGGCAGGCAT TCTTGTGGAG CTGACTGGCA AAACTGGCGA TGCGGCGCAA
ATAGTCATTG GAGCCGGGAT CAACATGGCA ATGCGCCGTG TTGAAGAGAG TGTCGTTAAT
CAGGGGTGGA TCACGCTGCA GGAAGCGGGG ATCAATCTCG ATCGTAATAC GTTGGCGGCC
ATGCTAATAC GTGAATTACG TGCTGCGTTG GAACTCTTCG AACAAGAAGG ATTGGCACCT
TATCTGTCGC GCTGGGAAAA GCTGGATAAT TTTATTAATC GCCCAGTGAA ACTTATCATT
GGTGATAAAG AAATATTTGG CATTTCACGC GGAATAGACA AACAGGGGGC TTTATTACTT
GAGCAGGATG GAATAATAAA ACCCTGGATG GGCGGTGAAA TATCCCTGCG TAGTGCAGAA
AAATAA
 
Protein sequence
MKDNTVPLKL IALLANGEFH SGEQLGETLG MSRAAINKHI QTLRDWGVDV FTVPGKGYSL 
PEPIQLLNAE QILGQLDGGS VAVLPVIDST NQYLLDRIGE LKSGDACVAE YQQAGRGRRG
RKWFSPFGAN LYLSMFWRLE QGPAAAIGLS LVIGIVMAEV LRKLGADKVR VKWPNDLYLQ
DRKLAGILVE LTGKTGDAAQ IVIGAGINMA MRRVEESVVN QGWITLQEAG INLDRNTLAA
MLIRELRAAL ELFEQEGLAP YLSRWEKLDN FINRPVKLII GDKEIFGISR GIDKQGALLL
EQDGIIKPWM GGEISLRSAE K