Gene Mlg_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2341 
Symbol 
ID4269097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2652074 
End bp2653213 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content67% 
IMG OID638127099 
Productglycosyl transferase, group 1 
Protein accessionYP_743171 
Protein GI114321488 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.244619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGGT CCGGGGTGGT GGCGCTGGTC TCGAACACCT CTTGGTATCT CTACAACTTC 
CGGCGCGGGA CGCTGGCGGC GTTACGCGAT GCGGGATACC GGGTGGTGTG CCTCGCGCCG
CCGGATGCCT ATTCCAGGCG GCTGGAGGAG GAACTGGGCG CAGAGCACCT GCCCCTGGCC
ATGGAGGGGA AGAGCACCCG GCTATGGGAC GAGGCCAGGA GCCTGCTGAC ACTGGCCAAC
ACGCTGCGCC GCCTGCGCCC TGCCTTCGTC TTCAACTACA CCGTCAAGGC GAACATCTAC
TCCGGCCTGG CCTGCCAGGC GCTGCGCATT CCCTACGCCA ACAATGTCTC CGGGCTGGGT
ACCGCCTTTA TCCACGACGG TTGGCTGTTC CGGCGCGTGC GTTCGCTCTA CGGCCTGGCC
AACCGGGGCG CGCGGCAGGT GTTCCTCCAG AACCCCGATG ACCACGCCCT CCTTCAGTCC
CACGGCCTGT TGCGCAACTC CCCGACGATG GTGCTCCCCG GCTCAGGCAT AGACACCGCG
CGGTTCGACT TTTCGCACCT GCCGGACGCG TCACCCTTCA CCTTTGTGAT GATCGCGCGC
CTGCTCGGGG ACAAGGGGGT ACGAGAGTAC GTCCAGGCCG CCGAGTTGGT GCGTGAACAG
CACCCCGATA CCCGCTTTCT GCTCGTCGGT CCGCACGGCG CCAGCAACCG CACCGCAATC
CCCGAGGAAG AGGTGCGCGC CTGGCAGGCA CGGGGTGTGG TGGAGTACCT GGGTGAGCAG
GAGGATGTTC GCCCTTTCAT CAGGCAATCG CACATCCTGG TACTGCCCTC CTATCGTGAG
GGCATGCCGC GGACCGTACT GGAGGCGGCT GCCATGGGGC GCCCGGCCAT CGTGTCCGAT
GTGCCTGGGT GCCGGCACGC CGTGGTTGAT GGCGAAACCG GCTGGCTGGC CCCGGTTAAG
CGCCCGGAGG CTCTCGCCCG GCAAATGGTC GACTGCGCGG CGCTGCCGCG CGAAGCGCTG
GCGCAGGCGG GCAGCACAGC ACGCCAACGT ATCGAGCGGG TGTTCGATGA GCGTGTTGTG
GTGGCGGCCA CGCTGGCCTG CCTACGGGAT GAGCCTGTGG GCGCCAGCGG CTCGCAATGA
 
Protein sequence
MGRSGVVALV SNTSWYLYNF RRGTLAALRD AGYRVVCLAP PDAYSRRLEE ELGAEHLPLA 
MEGKSTRLWD EARSLLTLAN TLRRLRPAFV FNYTVKANIY SGLACQALRI PYANNVSGLG
TAFIHDGWLF RRVRSLYGLA NRGARQVFLQ NPDDHALLQS HGLLRNSPTM VLPGSGIDTA
RFDFSHLPDA SPFTFVMIAR LLGDKGVREY VQAAELVREQ HPDTRFLLVG PHGASNRTAI
PEEEVRAWQA RGVVEYLGEQ EDVRPFIRQS HILVLPSYRE GMPRTVLEAA AMGRPAIVSD
VPGCRHAVVD GETGWLAPVK RPEALARQMV DCAALPREAL AQAGSTARQR IERVFDERVV
VAATLACLRD EPVGASGSQ