Gene Mlg_1236 details

Gene Information

Locus tag: Mlg_1236
Symbol:
ID: 4269020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1440656
End bp: 1441891
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 68%
IMG OID: 638125986
Product: tryptophan synthase subunit beta
Protein accession: YP_742075
Protein GI: 114320392
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
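
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1440656, 1441891)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Both values agree with the Gene Length and Protein Length fields of this record.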


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.617865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.527603
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCA AGGTGACTGA CGTGATTGAA CGCATCCCCG GCTTCGACCG CTACCCGGAC 
GAGGCGGGGC ATTTTGGGCC CTACGGCGGG CGCTTCGTTT CCGAGACCCT GATGGCTCCG
CTGGACGAGC TGGCGCAGGC CTACGATCAC TACCGCAATG ACCCGGAGTT TCTGGCCGAG
ATCGACCGGG ATTTGCAGGA CTTTGTCGGT CGGCCCAGCC CGCTGTACCT GGCGGAGCGC
TGGACCCAGC GGATCGGTGG TGCGCGGATC TACTTCAAGC GTGAAGACCT CAACCACACC
GGCGCCCACA AGATCAACAA CACCGTGGGC CAGGCGCTGC TGGCTAAGCG GATGGGCAAG
ACCCGGGTCA TCGCTGAGAC CGGTGCCGGG CAGCACGGCG TGGCCAGCGC CACGGTGGCG
GCGCGCCTGG GCATGCAGTG CGTGGTGTAT ATGGGCGCGG ACGACGTCAA GCGTCAGGCG
GTCAATGTTT TCCGCATGCG CCTGCTCGGC GCCGAGGTGC GGCCGGTGGA CGCCGGCACC
CGGACGCTCA AGGACGCCCT CAACGAGGCA ATGCGCGACT GGGTGGCCCA TGTGGACGAC
ACCTTCTACA TCATCGGCAC CGTCGCCGGC CCCCACCCCT ACCCGATGAT GGTGCGCGAC
TTTCAGACCG TGATCGGGCG GGAGGCGCGG CGCCAGATGC TCGAGCGCGA GGGCCGGCTG
CCCGATGCCC TGGTGGCCTG TGTGGGGGGC GGCTCCAACG CCATTGGCCT GTTCCACCCC
TTCCTGGCGG ACCAGGCCGT GGCCATCTAC GGGGTCGAGG CCGGCGGCGA AGGGGTGGAG
AGCGGGCGGC ATGCCGCGCC CCTGTGCGCC GGCCGCTCCG GGGTGCTGCA CGGCAACCGC
ACCTACCTGA TGATGAACGA CTCCGGCCAG ATCCAGGGGA CCCACTCGAT CTCTGCCGGG
CTCGACTACC CGGGGGTCGG GCCGGAGCAT GCCTGGCTGA AGGACTCCGG CCGTGCCCAA
TACGTCAGCG TGACCGATGA CGAGGCCCTG GAGGCGTTCC ACGAGGTGAC CCGCTGCGAG
GGCATCATGC CAGCCCTGGA GACCGCCCAT GCCCTGGCCT ATGCCCGCAA GCTGGCCGCC
GGGATGAGCC CGGAGCAGAG CGTGGTGGTG AGCCTGTCCG GGCGGGGTGA CAAGGATATT
GCGACGGTGG CCGAGCTGGA GGGCATTGAG CTATGA
 
Protein sequence
MSTKVTDVIE RIPGFDRYPD EAGHFGPYGG RFVSETLMAP LDELAQAYDH YRNDPEFLAE 
IDRDLQDFVG RPSPLYLAER WTQRIGGARI YFKREDLNHT GAHKINNTVG QALLAKRMGK
TRVIAETGAG QHGVASATVA ARLGMQCVVY MGADDVKRQA VNVFRMRLLG AEVRPVDAGT
RTLKDALNEA MRDWVAHVDD TFYIIGTVAG PHPYPMMVRD FQTVIGREAR RQMLEREGRL
PDALVACVGG GSNAIGLFHP FLADQAVAIY GVEAGGEGVE SGRHAAPLCA GRSGVLHGNR
TYLMMNDSGQ IQGTHSISAG LDYPGVGPEH AWLKDSGRAQ YVSVTDDEAL EAFHEVTRCE
GIMPALETAH ALAYARKLAA GMSPEQSVVV SLSGRGDKDI ATVAELEGIE L
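
The protein sequence follows from the gene sequence by codon translation, and the GC content field can be recomputed from the same nucleotides. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares with the standard code. It does not handle table 11's alternative start codons, which is acceptable here because this gene starts with ATG. The helper names `clean`, `gc_fraction` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTCCACCA AGGTGACTGA CGTGATTGAA CGCATCCCCG GCTTCGACCG CTACCCGGAC"
print(translate(first_line))  # MSTKVTDVIERIPGFDRYPD
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input gives a value to compare against the GC content field.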