Gene Mlg_2798 details


Gene Information

Locus tag: Mlg_2798
Symbol:
ID: 4269141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3180641
End bp: 3181813
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 64%
IMG OID: 638127560
Product: glycosyl transferase, group 1
Protein accession: YP_743628
Protein GI: 114321945
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
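
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3180641
END_BP = 3181813
GENE_LENGTH_BP = 1173
PROTEIN_LENGTH_AA = 390


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 3181813 - 3180641 + 1 gives 1173 bp, and 1173 / 3 - 1 gives 390 aa, matching the listed lengths.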


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGACC TGCCACTAGC AGGCCGCCGT ATCGGTTTTC TTCTCGAGCA CCCCGTGACC 
GTGGGCGCGG ACGTCCAGGA ACAGATGGCG GCCTGCCGGC GGCTGGGTGC CCATGTCACC
GCCTGCTTTT TGTCCGGGAC ACCCTCCGAG TTCCCCGGCC ACGGCTCAAG CGCTGATGAC
TTGACGGGCC TGAATCTGCC CGTGGAGGCG ATCTCTGGCC ATCGTATCGG CGCAGCGCGA
CGACTGCGCC ATCTGCTCAG GAAACGGCCG CTCGACACCC TCGTTTGCGA TCAGTACAAG
GCCATCAGCA CCGCAGCATT GGCCACGTTC TTCCCACACG CCGAGGCACC CGCCATTGTT
GCTCTCCTTC GCGGCTATCA CGCGGTGTCC TCACCCAGCC GGCGGCGGTT CTACCGGCTG
TTCGGGCGGC ACATCCGCGC TTTTATCACG CTCAGCGCAG CGCAACAGCA ACAAATCCGG
AACCTGCTGA CCTGGTTCCC CCCGGATCGC ATCCACGTCG TCCCGGTCCA CCTCGACCTG
GACCGGCTGG CCGGTGCCAT GTTGCCGGCG AACGAGGCCC GCCGACAACT GTGCCTGCCC
GGGTCGGCCG TGCTATTCGG CTGCATTTCG CGCCTGCACC CCAGTAAGCG GGTCATGGAC
CTCGTTGAGG CCACCAGGCT ATTGCGGGAA CGAGGCGTGG AATTCCACTT GGTCATTATC
GGTGGCGGTA AGCAGGAGGA TGCGCTGAGA AAACGCATCG CGGAAGCCGA CCTTGACGGC
ACCGTCCGGC TGACCGGGCG CCTGGAGTCG GCGCACCGCT ATATGCCCGC CTTTGATGCC
TTCATCCTGC CCTCCCCCTG CGAATCCTTC GGACGGGTGT TCCTGGAGGC CCACGCCGCC
AAGACCCCCA TTATCGCGGC CGATGGTGCT GCCGCTCCCG AGGTGGCCGG TCCCGCCGCG
CTCCTCTTTC AGCCCACGAA TGCGGCAGAC CTCGCCGACA GGATGCTCGA ATTCATGGCG
CAAGGGCCTG AGGAGCAACT GAACGCAGGA CAGATTGGGG AGGAGTACGC CAGGCGGCAT
TTCAGCCAGG AAGCTCTGGA CCGGCATTTG CAAAAGGCCC TGGAAAAAAA AGGATTACTG
GCGGAATCCC GAGACCCTGG CGCTCATCGG TGA
 
Protein sequence
MNDLPLAGRR IGFLLEHPVT VGADVQEQMA ACRRLGAHVT ACFLSGTPSE FPGHGSSADD 
LTGLNLPVEA ISGHRIGAAR RLRHLLRKRP LDTLVCDQYK AISTAALATF FPHAEAPAIV
ALLRGYHAVS SPSRRRFYRL FGRHIRAFIT LSAAQQQQIR NLLTWFPPDR IHVVPVHLDL
DRLAGAMLPA NEARRQLCLP GSAVLFGCIS RLHPSKRVMD LVEATRLLRE RGVEFHLVII
GGGKQEDALR KRIAEADLDG TVRLTGRLES AHRYMPAFDA FILPSPCESF GRVFLEAHAA
KTPIIAADGA AAPEVAGPAA LLFQPTNAAD LADRMLEFMA QGPEEQLNAG QIGEEYARRH
FSQEALDRHL QKALEKKGLL AESRDPGAHR
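
The sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be joined and checked against the listed GC content and translation table. The helper names are hypothetical, and only the first and last lines of the gene sequence are used in the demonstration; paste the full block to check the whole gene.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First and last lines of the gene sequence, copied from this record.
    first_line = "ATGAATGACC TGCCACTAGC AGGCCGCCGT ATCGGTTTTC TTCTCGAGCA CCCCGTGACC"
    last_line = "GCGGAATCCC GAGACCCTGG CGCTCATCGG TGA"

    print(clean_sequence(first_line).startswith("ATG"))  # begins with an ATG start codon
    print(clean_sequence(last_line)[-3:] in STOP_CODONS)  # ends with the TGA stop codon
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the listed GC content of 64%.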