Gene Mlg_2358 details

Gene Information

Locus tag: Mlg_2358
Symbol:
ID: 4268456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2674854
End bp: 2676044
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 76%
IMG OID: 638127116
Product: putative glycosyltransferase protein
Protein accession: YP_743188
Protein GI: 114321505
COG category: [R] General function prediction only
COG ID: [COG4671] Predicted glycosyl transferase
TIGRFAM ID:
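
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. Neither convention is stated in this record, although both are consistent with the numbers shown. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2674854
END_BP = 2676044
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 2676044 - 2674854 + 1 = 1191 bp, and 1191 / 3 - 1 = 396 aa.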


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0125042
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.920267
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGCC TGCGCATCCT CTTCCACTGC CAGCACCTCA GCGGGGTCGG CCACTACATG 
CGCGGCCTCG CGCTGACCCG GGAGCTGGCG CGCCACCACA CGGTCTGGCT CAGCGACGGC
GGTCGGTCCA TTCCCGGCGC CGACCCCGGT GGCGGACGGC TGCTGGCACT GCCCCGCCTC
CGGCGCCGGG AGGGCCGGCT GGAGACCCTG GACGGCAGGC CCGCCGCCGC TGTCTGGCCG
GAGCGCGCCC GGCGGCTGGC CGGCGCGGTG GCCCGCTTGG CCCCGGACGT GGTCATCGTC
GAGCACTACC CCTTCAGCAA GTGGGGCCTG GGGGCGGAGG TCACCGGGCT GCTGGCGGCC
GCCCGCCGGC GCAACCCGGC CCTGCTCGCC GTCTGCTCGG TGCGGGATAT CCCGCTGCAG
ACCCGCCACG AGCCAGTCCC GGCCGGCGAC TACGTCCGGG AGGTGCTGGC GCGGCTGCAC
CAGTGGTTCG ACGCCGTGAT GGTCCACGCC GACCCCGGCC TGTGCCGGCT GGAGGAGGCC
TTTCCGGCCG CCGGCCGCAT CCGCCTGTCG GTGGGCCATA CCGGGCTGGT CCCACCAGGT
CCGGACCTGG CCCCGGATGC GCCGGCAGGG GCGGCCGGCG CCACTGCCGG TCCGTCGCCC
TACGCCGTGG CCAGCATCGG CGGCGGGCGC GACGCCGCGG CGCTGCTCGC CCGGCTGGCG
GCGCACTGGC CGGCCATCCG TCGTCGGGCC GGCCTGGATG ATCTGCCGCT GGCGCTGTTC
AGCGGCCTGG GCCCGCCGGA CCCGGCCCTG GCGCGCGCAG TGGCGGGGCA GCCGGCGCTG
TCGTTGCACC CCTTCGGCCC AGCGTACAGC GCCTGGCTGC GCGGGGCGGC GCTGTCCATC
AGTTGTGCCG GCTACAACAC CTGTGCTCAG TTGCTGCAGC TTCGCCGGCC GGCCCTACTG
GTCCCCAATA CCGCCATGTC CGACCAGTTA CGCCGGGCGG AACGGCTGCA GGCACGGGGG
CTGGCGCGGC TGCTTCGCCC GGAGGCCTTC TCGGTGGCGG CAGTGGCCGA TGAGCTGCGG
GCGCTGCGGG ACGCGCCGCC CGCGGACCCC GGCGTCGATC TGGACGGGGC CCGCGGTGCC
CGGCGCTTTA TTGAAGGGCT GGCCGGGGCC GGCGGTCACT CAGATCGGTG A
 
Protein sequence
MTRLRILFHC QHLSGVGHYM RGLALTRELA RHHTVWLSDG GRSIPGADPG GGRLLALPRL 
RRREGRLETL DGRPAAAVWP ERARRLAGAV ARLAPDVVIV EHYPFSKWGL GAEVTGLLAA
ARRRNPALLA VCSVRDIPLQ TRHEPVPAGD YVREVLARLH QWFDAVMVHA DPGLCRLEEA
FPAAGRIRLS VGHTGLVPPG PDLAPDAPAG AAGATAGPSP YAVASIGGGR DAAALLARLA
AHWPAIRRRA GLDDLPLALF SGLGPPDPAL ARAVAGQPAL SLHPFGPAYS AWLRGAALSI
SCAGYNTCAQ LLQLRRPALL VPNTAMSDQL RRAERLQARG LARLLRPEAF SVAAVADELR
ALRDAPPADP GVDLDGARGA RRFIEGLAGA GGHSDR
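
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch, the following Python shows one way to strip that blocking, compute GC content, and translate the coding sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares. As a simplification, only ATG, GTG and TTG are treated here as start codons that are read as methionine at the first position. `FIRST_LINE` is the first line of the gene sequence, copied from this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplifying assumption: only these start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}

# First line of the gene sequence from this record.
FIRST_LINE = "GTGACCCGCC TGCGCATCCT CTTCCACTGC CAGCACCTCA GCGGGGTCGG CCACTACATG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display blocking."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # a start codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(FIRST_LINE))
    print(round(gc_content(FIRST_LINE), 3))
```

Translating `FIRST_LINE` yields `MTRLRILFHCQHLSGVGHYM`, which matches the first twenty residues of the protein sequence listed above.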