Gene Mlg_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0139 
Symbol 
ID4269832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp160425 
End bp161633 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content73% 
IMG OID638124863 
Productglycosyl transferase, group 1 
Protein accessionYP_740984 
Protein GI114319301 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03088] sugar transferase, PEP-CTERM/EpsH1 system associated 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.211857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGG CCCGCCCCCT GGCAACGGCG GCCCCGGGCG ACGAGCGCCC GCTGGTGGCC 
CATATCATCC ACCGCCTGGA CGTGGGCGGC ATGGAGAACG GCCTGGTCAA CCTGATCAAC
CACATGCCGG CCGAGCGCTA CCGCCACGCC ATCGTCTGCA TGACCCGGTA CACCGACTTC
AGCCAGCGCA TCCACCGCGA TGATGTGAGC CTGCACGCCC TGCACAAGCG CGAGGGCAAG
GACCTGGGGG TGCATCGGCG CCTGCACCGG CTGCTGCGGT CGTTGCGCCC GGCCATCGTC
CACACCCGCA ACCTCGCCAC CCTGGAGGCC CAGGCCACCG CCGCGGCGGC CGGCGTGCGG
GCACGCATCC ACGGTGAGCA CGGCTGGGAT ATCGGCGATC TCGACGGCGC CCGCACCAAA
CACCGCCTGA TGCGCCGCCT GGCCCGACCG TTGGTGGGGC GCTATATCGC CCTGTCGCGC
CAGCAGCTGG ACTACCTGGC CGGTGCCATC GGCGTGCCGG AGGGGCGGTT GCACCACGTC
TGCAACGGTG TGGACACCCA CCGCTTCAGG CCCCGCCGTC GGGACGAGGC CTCGCCACTG
CCGGACGGCT TCGCGCCGGA GGGCAGCCTG GTGGTGGGCA GCGTGATGCG CATGCAGGCG
GTCAAGGCCC CGGAGGATCT CGTTGATGCC TTCATCGCGC TGCGCGAACG GGCACCCGCC
CGCTTCCCCC GCCTGCGGCT GGTGCTGGTG GGCGACGGCC CCCTGAGCGA GCGCGTCGCC
CGGCGGCTGG CGGAGGCCGG GGTGGCGGAT CAGGCCTGGC TGCCCGGCGC CCGGGACGAT
GTGGCGGCGG TGATGCGCGC CCTGGACCTG TTCGTGTTGC CGTCACTCGC CGAGGGCATC
TGCAACACCG TCCTGGAGGC CATGGCCTGC GGGCTGCCAG TGGTCGCCAC CGAGGTGGGC
GGCAACCCGG ACCTGGTGCG GCCCGGCGAG ACCGGCACGC TGGTCCCGGC AGGCGATCCG
TCAACCCTCG CCCGGCACCT CCAGGCCTAC CTGGACGACC CGGAACGGCG GCAGCGCGAG
GGCGAGGCCG CGCGGGCCCG GGCGGAGGCG GTATTCAGCA TGGAGGCCAT GGTGGAGGGC
TACATGAGGG TCTACGATCA GGCGCTGGCC GAACACCCCT TGCCGGCCGT GCCGGGGAGG
CGGGGCTAG
 
Protein sequence
MSAARPLATA APGDERPLVA HIIHRLDVGG MENGLVNLIN HMPAERYRHA IVCMTRYTDF 
SQRIHRDDVS LHALHKREGK DLGVHRRLHR LLRSLRPAIV HTRNLATLEA QATAAAAGVR
ARIHGEHGWD IGDLDGARTK HRLMRRLARP LVGRYIALSR QQLDYLAGAI GVPEGRLHHV
CNGVDTHRFR PRRRDEASPL PDGFAPEGSL VVGSVMRMQA VKAPEDLVDA FIALRERAPA
RFPRLRLVLV GDGPLSERVA RRLAEAGVAD QAWLPGARDD VAAVMRALDL FVLPSLAEGI
CNTVLEAMAC GLPVVATEVG GNPDLVRPGE TGTLVPAGDP STLARHLQAY LDDPERRQRE
GEAARARAEA VFSMEAMVEG YMRVYDQALA EHPLPAVPGR RG