Gene EcHS_A4207 details


Gene Information

Locus tag: EcHS_A4207
Symbol: birA
ID: 5593140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4202664
End bp: 4203629
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 50%
IMG OID: 640923310
Product: biotin--protein ligase
Protein accession: YP_001460764
Protein GI: 157163446
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0340] Biotin-(acetyl-CoA carboxylase) ligase
TIGRFAM ID: [TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
            [TIGR00122] BirA biotin operon repressor domain
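
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4202664, 4203629)
print(length)                     # 966
print(protein_length_aa(length))  # 321
```

Under these assumptions the computed values agree with the Gene Length (966 bp) and Protein Length (321 aa) fields of the record.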


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000000658318
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGATA ACACCGTGCC ACTGAAATTG ATTGCCCTGT TAGCGAACGG TGAATTTCAC 
TCTGGCGAGC AGTTGGGTGA AACGCTGGGA ATGAGCCGGG CGGCTATTAA TAAACACATT
CAGACACTGC GTGACTGGGG CGTTGATGTC TTTACCGTTC CGGGTAAAGG ATACAGCCTG
CCTGAGCCCA TCCAGTTACT TAATGCTGAA CAGATATTGG GTCAGCTGGA TGGCGGTAGT
GTAGCCGTGC TGCCAGTTAT TGACTCCACG AATCAGTACC TTCTTGATCG TATCGGAGAG
CTTAAATCGG GCGATGCCTG TGTTGCAGAA TACCAGCAGG CTGGCCGTGG TCGCCGGGGG
CGGAAATGGT TTTCGCCTTT TGGCGCAAAC TTATATTTGT CGATGTTCTG GCGTCTGGAA
CAAGGCCCGG CGGCGGCGAT TGGTTTAAGT CTGGTTATCG GTATCGTGAT GGCGGAAGTA
TTACGCAAGC TGGGAGCAGA TAAAGTTCGT GTCAAATGGC CTAATGACCT CTATCTGCAG
GATCGCAAGC TGGCAGGCAT TCTTGTGGAG CTGACTGGCA AAACTGGCGA TGCGGCGCAA
ATAGTCATTG GAGCCGGGAT CAACATGGCA ATGCGCCGTG TTGAAGAGAG TGTCGTTAAT
CAGGGGTGGA TCACGCTGCA GGAAGCGGGG ATCAATCTCG ATCGTAATAC GTTGGCGGCC
ATGCTAATAC GTGAATTACG TGCTGCGTTG GAACTCTTCG AACAAGAAGG ATTGGCACCT
TATCTGTCGC GCTGGGAAAA GCTGGATAAT TTTATTAATC GCCCAGTGAA ACTTATCATT
GGTGATAAAG AAATATTTGG CATTTCACGC GGAATAGACA AACAGGGGGC TTTATTACTT
GAGCAGGATG GAATAATAAA ACCCTGGATG GGCGGTGAAA TATCCCTGCG TAGTGCAGAA
AAATAA
 
Protein sequence
MKDNTVPLKL IALLANGEFH SGEQLGETLG MSRAAINKHI QTLRDWGVDV FTVPGKGYSL 
PEPIQLLNAE QILGQLDGGS VAVLPVIDST NQYLLDRIGE LKSGDACVAE YQQAGRGRRG
RKWFSPFGAN LYLSMFWRLE QGPAAAIGLS LVIGIVMAEV LRKLGADKVR VKWPNDLYLQ
DRKLAGILVE LTGKTGDAAQ IVIGAGINMA MRRVEESVVN QGWITLQEAG INLDRNTLAA
MLIRELRAAL ELFEQEGLAP YLSRWEKLDN FINRPVKLII GDKEIFGISR GIDKQGALLL
EQDGIIKPWM GGEISLRSAE K
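
The protein sequence is expected to be the translation of the gene sequence under translation table 11. A minimal sketch of that check follows, assuming the sequences are pasted in the 10-character blocks shown above. The codon table is built from the standard NCBI base ordering (TCAG); table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which does not matter here because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAAGGATA ACACCGTGCC ACTGAAATTG"
print(translate(first_blocks))  # MKDNTVPLKL
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean` allows a direct string comparison, and `gc_fraction` on the full gene sequence can be compared with the GC content field.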