Gene Mlg_2793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2793 
Symbol 
ID4269727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3175949 
End bp3177082 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content70% 
IMG OID638127555 
Productglycosyl transferase, group 1 
Protein accessionYP_743623 
Protein GI114321940 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.09685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCTGG CCTTCGCCCT GTTCAAGTAC TTCCCCCACG GCGGCCTGCA GCGCAACTTC 
CGGCGCATCA CCGAGCTGGC GCTGGAGCGC GGCCATCGGG TCGATGTCTA CACCCTGGCC
TGGAGTGGCT GGACGCCGGA ACATCCGAAC CTGACGGTGG AGGTGGTGAA GGTGCCCGGC
TGGCGCAACC ACACCCGCTA CCGCCGCTTT GGTACCCACG TGCAGAAACG CCTGGCGGAC
AACCCGCCCG ACCGGGTGGT GGGCTTCAAC AAGATGCCGG GGCTCGATGT CTACTACAAC
GCCGACCCCT GCTTCGTGGA GCGGGCGCAG GCGCGCCACC CGCTCTACCG CTGGTCGGGG
CGCTACCGCC AGCACGCCGC CTTTGAGCAG GCCGTCTTCC GCGCCGACGC CCGCAACCAC
ATCCTGCTGC TATCCGAAGC GGAGAAGCCG CTCTTCCAGC GCTGGTACGC CACCCCCGAC
GACCGTTTCC ACCTGATGCC GCCCTACGTC TCCACCGACC GCTTCGCCGG CCCCGAGGCC
CCGCACATCG GCGCCGGCCT GCGCCGGGAG CTGGGCCTGG GCGAGGCGGA CCGCATGCTG
CTGATGGTGG GCTCCGACTT CCGTCGCAAG GGGGTGGATC GCAGCATCCG TGCACTGGCC
GCCCTGCCCG AATCGCCGCG GCGACGCACC CATCTCTATG TGCTGGGCAA GGGTCGTGCG
GCAACCCAGG AAGCCCTGGC CCGGGGGCTC GGCGTGGCGG ACCAGGTGCA CTTCCTGCAG
GGCCGGGACG ACGTGGCACG CTTCCTCTTC GCCGCCGACC TGCTGCTCCA CCCGGCCTAC
CAGGAGAACA CCGGCACCGC CATTGTCGAG GCCATCGCCG CCGGGTTGCC CGCACTGGTG
ACCGGGAATT GCGGCTACGC CTTCCACATT GAGCGCGCCG GCAGCGGCCG GGTCCTGCCA
CCGCCCTTCA CCCAGGCGGC CATGGACGAG GCCCTGGCTT CGATGATCGA CAGCCCCGAG
CAACCCCGCT GGCGCGAATA CGCCCGGACC TACGCCCGCC GGACCGAACT GGGCAGCCGC
GCCGAGCACG CCCTGCGGGT CATCGAAGGC CCCCGCTACG GGGAGCATGG GTGA
 
Protein sequence
MHLAFALFKY FPHGGLQRNF RRITELALER GHRVDVYTLA WSGWTPEHPN LTVEVVKVPG 
WRNHTRYRRF GTHVQKRLAD NPPDRVVGFN KMPGLDVYYN ADPCFVERAQ ARHPLYRWSG
RYRQHAAFEQ AVFRADARNH ILLLSEAEKP LFQRWYATPD DRFHLMPPYV STDRFAGPEA
PHIGAGLRRE LGLGEADRML LMVGSDFRRK GVDRSIRALA ALPESPRRRT HLYVLGKGRA
ATQEALARGL GVADQVHFLQ GRDDVARFLF AADLLLHPAY QENTGTAIVE AIAAGLPALV
TGNCGYAFHI ERAGSGRVLP PPFTQAAMDE ALASMIDSPE QPRWREYART YARRTELGSR
AEHALRVIEG PRYGEHG