Gene Mlg_2436 details

Gene Information

Locus tag: Mlg_2436
Symbol:
ID: 4268742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2768536
End bp: 2769549
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 69%
IMG OID: 638127194
Product: biotin synthase
Protein accession: YP_743266
Protein GI: 114321583
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
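
The start, end, gene length, and protein length fields above are linked by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids encoded by a CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2768536, 2769549)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # prints "1014 bp; 337 aa"
```

Both results agree with the Gene Length and Protein Length fields listed for Mlg_2436.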


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.113139
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGCCA CCAAATCAAT GACCGCCACA ACCCAAACAC CCCGCCACGA CTGGTCTAAA 
GACGAAGTCC TGGCCCTGTT CGAGCAGCCC TTCAACGACC TGCTCCACCA AGCCCAGACC
ACCCACCGGG CCCACTTCGA CCCCAACACC GTCCAGGTCA GCACCCTGCT CAGCATCAAG
ACCGGCGCCT GCCCGGAGGA CTGCAAATAC TGCCCCCAGA GCGTGCGCTA CGACACCGGC
CTGGAACGCG AGCAAATCCT GGCCGTGGAA GAGGTGGTGG CGGCCGCCCG CCGGGCCCGG
GACGCCGGTG CTACCCGCTT CTGTATGGGC GCCGCCTGGC GCTCGCCCAA AGACCGCGAC
CTGGAGACCG TCGAGGCCAT GGTGCGCGAG GTCAAGGCCC TGGGCCTCGA GACCTGCCTC
ACCCTCGGCA TGCTCCGCGA CGGCCAGGCC GAACGCCTGC GCGAGGCCGG GCTCGACTAC
TACAACCACA ACCTCGACAC CTCCGAGGAC TACTACGACG AAATCATCAC CACCCGCAGC
TACCAGGACC GCCTCGACAC CCTCGCCCGG GTGCGCGACG CCGGCCTCAA GACCTGCTGC
GGCGGCATCA TCGGCATGGG CGAAACCCGC CAGGACCGCG CCGAACTGCT GCGTACCCTG
GCCAGCCTGC CGGTGCAGCC GCAGAGCGTC CCCATCAACC AGCTCGTCCA GGTCCCCGGC
ACCCCGCTGC ACGGCGTCGA GCCCCCCGAC CCCTTCGAAT TCGTCCGCAC CATCGCCGTC
GCCCGCATCC TCATGCCGGC CAGCTACGTC CGCCTCTCCG CCGGCCGCGA GCAGATGTCC
GACGAACTGC AGGCCCTCTG CTTCCTGGCC GGCGCCAACA GCATCTTCTA CGGCGACAAG
CTGCTCACCA CCGGCAACCC GGAGGCCGAC AAGGACCGCC GCCTGCTGGC CCGTCTGGGC
ATGGGGTTTG AGGCGCACGC CTGCGCCCAG GCCGAGGCCG AGGACCTGGG ATGA
 
Protein sequence
MGATKSMTAT TQTPRHDWSK DEVLALFEQP FNDLLHQAQT THRAHFDPNT VQVSTLLSIK 
TGACPEDCKY CPQSVRYDTG LEREQILAVE EVVAAARRAR DAGATRFCMG AAWRSPKDRD
LETVEAMVRE VKALGLETCL TLGMLRDGQA ERLREAGLDY YNHNLDTSED YYDEIITTRS
YQDRLDTLAR VRDAGLKTCC GGIIGMGETR QDRAELLRTL ASLPVQPQSV PINQLVQVPG
TPLHGVEPPD PFEFVRTIAV ARILMPASYV RLSAGREQMS DELQALCFLA GANSIFYGDK
LLTTGNPEAD KDRRLLARLG MGFEAHACAQ AEAEDLG
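
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any value such as GC content can be recomputed from it. The following is a minimal sketch of that cleanup plus two basic checks. All function names are hypothetical, and the start-codon test is a simplification: translation table 11 also permits alternative start codons, which this check ignores.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq):
    """Rough check: length divisible by 3, starts with ATG, ends with a stop codon.

    Simplification: alternative start codons such as GTG or TTG are not accepted here.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First line of the gene sequence as printed in the record.
first_line = "ATGGGAGCCA CCAAATCAAT GACCGCCACA ACCCAAACAC CCCGCCACGA CTGGTCTAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To check the whole record, paste all lines of the gene sequence into one string and pass it through `clean_sequence` before calling `gc_percent` and `looks_like_complete_cds`.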