Gene Mlg_0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0440 
Symbol 
ID4270384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp495081 
End bp496091 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content73% 
IMG OID638125175 
Productbiotin--acetyl-CoA-carboxylase ligase 
Protein accessionYP_741284 
Protein GI114319601 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0340] Biotin-(acetyl-CoA carboxylase) ligase 
TIGRFAM ID[TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
[TIGR00122] BirA biotin operon repressor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.028543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.823385 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGA GCGAGGGCAG GCTCGCCGCG AGGGGCTCCG CGCAGGCGGT CCTGGAACGG 
CTGTCGGCCG GCGACTGCTG GTCCGGTGAG GCGCTGGGCC GCGAGCTGGG CATCTCCCGG
GCAGCGGTCT GGAAGGCGGT GGCGACACTC CGGGGCCTCG GAGTGCCTGT CGAGGCGGTC
GCTGGGAAGG GCTACCGGTT GCCCGGGCCG GTCGAGGTGC TGGATCGGCA GCGGATTGTC
GCCGAGCTGC GGCGCGCGGG TGTGGCGCCG CTGCCCTCGG TGGACGTCTG GCTGTCGACC
CCGTCCACCA ACCTGTGTGT GCTGGGGTCC CAGGCAGGCA CGCCGCGCGC CGCATTTGCC
GAGGTCCAGA CCGCCGGCCG GGGCCGCCAG GGGCGGCGCT GGTGGTCGGC CTTTGGCGAG
CAGGTTCAGT TCTCGCTGGC CTGGCATTAC CAGGCCCTGC CCGCACCGGT GCCCGGGCTG
AGCCTCGCGG TCGGTGTCGA ATTGGCCGAG ACGCTGAGTG GGCTCGGCGC CCGGGGCCTG
CAGTTGAAGT GGCCCAACGA CCTGCTGTGT AAGGAGGGGC GGAAGCTCGC CGGGATCCTG
ATTGAGCTCG AGGGCCAGGT GCTGGGCCCG TGCCGGGTGG TGGTGGGGGT CGGGGTCAAC
CACGGTCGCG GTGCCGGCGG CGCTGAGGCG GACCGGCCGG TGGCCAGCCT GGCCGAGGCG
GGCCTGGAGG GCGTCGCGCG CAATCGATTG GCCGCGCTCC TGCTCAGCGC GGTGATCCGG
GGCTGCCAGC GGTTCGGGGT CACCGGGCTC GACGACTACC GTGAGGGCTG GGCACGCTGG
GATGCCCTGC GCGACCGGCC GCTGTCCGTG GTCCAGGCGG GGGCGACCTT ACGGGGCTGG
GGCGCGGGGA TCAGCGAGGA CGGGGCCTTG GTCCTGACGC TGGCCTCGGG CGGGCAGCGT
GTCTTGCATG CGGGGGAGGT GCATATCGGT GCCGCACTGG CTGATTCTTG A
 
Protein sequence
MSGSEGRLAA RGSAQAVLER LSAGDCWSGE ALGRELGISR AAVWKAVATL RGLGVPVEAV 
AGKGYRLPGP VEVLDRQRIV AELRRAGVAP LPSVDVWLST PSTNLCVLGS QAGTPRAAFA
EVQTAGRGRQ GRRWWSAFGE QVQFSLAWHY QALPAPVPGL SLAVGVELAE TLSGLGARGL
QLKWPNDLLC KEGRKLAGIL IELEGQVLGP CRVVVGVGVN HGRGAGGAEA DRPVASLAEA
GLEGVARNRL AALLLSAVIR GCQRFGVTGL DDYREGWARW DALRDRPLSV VQAGATLRGW
GAGISEDGAL VLTLASGGQR VLHAGEVHIG AALADS