Gene Mlg_0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0137 
Symbol 
ID4269830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp157284 
End bp158519 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content72% 
IMG OID638124861 
Productglycosyl transferase, group 1 
Protein accessionYP_740982 
Protein GI114319299 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0776455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATGA AAGTGCTCCA CATCCTCGAC CATTCCCTGC CCCTGCACAG CGGCTACACC 
TTCCGCACGG CCGCCATCCT GCGCGAGCAG CACCGGCTGG GCTGGGAGAC CGTGCACCTG
ACCAGCCCGA AACACGGGGT GGCGGCCGGC GCGGACGCCG ACCGCGAGGA GTGGGCGGAG
GGGTTGCACT TCCACCGCAC CCCCCACACC CCCCTGCGGG TCCCCGGACT GGGCGAGTGG
ACCCTGATGG ATGCGCTCAC CCGACGCCTG CACCAGGTGG CCGCAGAGAC CCGACCGGAC
GTCCTGCACG CCCACTCGCC GGCCCTGAAC GCCATCCCCG CCCTGCGGGT GGGCCGGCGG
CTGGGCATCC CGGTGGTCTA CGAGGTGCGG GCCTTCTGGG AGGACGCCGC GGTGGACCAT
GGCACCAGCC GCGATCAGGG GCTGCGTTAC CGGCTCACCC GCGGCCTGGA GACCCGCGCC
CTGCGACGCG CCGACCATGT CACAACGATC TGCGAGGGGC TGCGCCAGGA CATCATCACC
CGCGGCATCG CCCCGGGCCG GGTCACCGTC ATCCCCAACG CCGTGGACGC GGAGCGGTTC
CAACTCGGCG GTACCGCCGA CCCGGCCCTG AAGGCGGAAC TGGGCCTGGA GGGCTGCCGG
GTGCTCGGTT TCATCGGTTC CTTCTACGCC TACGAGGGGC TGGACCTGCT GCTGCAGGCC
TTCCCACGCA TCCACGACCA GGCCCCCGAT GTCCGCATCC TGCTGGTGGG CGGGGGCAGC
CAGGCGGAGG CGCTCAAGGC CCAGGCCCGG GACCTGGGCA TCGCCGACCA GGTGGTCTTC
ACCGGCCGGG TCCCCCATGA CCAGGTCAAC CGCTACTATG ACCTGGTGGA CCTGCTGGTC
TACCCGCGCC ATTCCATGCG CCTGACCGAG CTGGTCACCC CGCTCAAGCC GCTGGAGGCC
ATGGCCCAGG GCCGGCTGCT GGTGGCCTCC GACGTGGGGG GCCACCGGGA GCTGATCCGC
GATGGCGAGA CCGGCTGGCT GTTCCCGGCC GGCGACCCCA AGGCCCTGGC CGATACCGTC
CTGCACACCC TCGCCCGCGC CGCGGACTGG CCGCAGGTGC GCGCCAATGG CCGCCGATTT
GTCGAGGAGG AGCGCAACTG GCCGGCGAGC GTGGCCCGCT ATCAGGCCAT CTACCGCCGC
CTGACAGGGC TCGGGGAGGC CGCCCGTGCC GGCTGA
 
Protein sequence
MPMKVLHILD HSLPLHSGYT FRTAAILREQ HRLGWETVHL TSPKHGVAAG ADADREEWAE 
GLHFHRTPHT PLRVPGLGEW TLMDALTRRL HQVAAETRPD VLHAHSPALN AIPALRVGRR
LGIPVVYEVR AFWEDAAVDH GTSRDQGLRY RLTRGLETRA LRRADHVTTI CEGLRQDIIT
RGIAPGRVTV IPNAVDAERF QLGGTADPAL KAELGLEGCR VLGFIGSFYA YEGLDLLLQA
FPRIHDQAPD VRILLVGGGS QAEALKAQAR DLGIADQVVF TGRVPHDQVN RYYDLVDLLV
YPRHSMRLTE LVTPLKPLEA MAQGRLLVAS DVGGHRELIR DGETGWLFPA GDPKALADTV
LHTLARAADW PQVRANGRRF VEEERNWPAS VARYQAIYRR LTGLGEAARA G