Gene Mlg_2806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2806 
Symbol 
ID4269149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3190011 
End bp3191099 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content71% 
IMG OID638127568 
Productglycosyl transferase family protein 
Protein accessionYP_743636 
Protein GI114321953 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.629835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAG CCGAACTGCC GCTGACCACA CCGCCCCGCT CGCTCTGCAT CCTGCGCTTT 
TCCGCGCTGG GGGACGTCAC CCACATGACC CCGGTGGTGC GTACCCTGCA GCGGGAATGG
CCGGAGACCC GCCTGACCTG GATCGTCGGC AAGGCCGAAC ACACCCTGGT GGGGGATATC
CCCGGTGTGG ACTTCGCGGT CTTCGACAAG GCCGCTGGCT GGGCCGGTTA TCGGGACCTG
TGGCGGCAAC TGCGCGGACA GCGGTTCGAC GTGCTGCTGC ACAACCAGTT CGCCCTGCGG
GCCAATATCG CCAGCCTGGG CATCCGCGCG GACCTGCGGC TGGGTTACGA CCGGGCCCGC
TCCCGGGACC TGCACGGGCT GTTCATCAAC GCCCGCATCC CGCCCCACCC GGGCCAGCAC
GTCATCGACA TCTACTTCAG TTTCATCGAA ACCCTGGGGC TCCGGCGCCG GCACATGGTC
TGGGACATTC CCGTGCCGGA GGCGGCCGAG GCCCGTGCCC GGGCACTGAC CCCGGACGAC
ACCCCCACGC TCGTGATCAG CCCCTGCTCC AGCCACGCCC TGCGCAACTG GACGGTGGCG
GGCTGCGCCC GGGTCGCGGA TCACGCCGCA CGCCGCCACG GACTGCGCGT GCTGATCACC
GGCGGCCCCT CTGAGGTGGA GCGGGAGACG GGCGCGGCCA TCGCCGCGCA GGCAGAAACG
GCGCCGGAGA ACCTGGTGGG CCAGACCTCC ATCAAGGAGA TGCTCGCCCT GTTGGGCCGC
GCCACGGCGG TGGTGAGCCC CGATTCCGGC CCGGCGCACA TGGCCAACGC CATGGGCACG
CCCGTGATCG GGCTCTACGC CTGCACTAAC CCCGGTCGGG CGCGGCCCTA TTACAGCGGC
CAGTGGTGCG TTGATCGCTA TGACGAGGCC TCAAGGCGGG AGCTGGGCAG GCCCGCCAGC
GAGATCCGCT GGGGCACCAA GATCGAGCGC CCGGGTGTGA TGGCGCTGAT CACCCCGGAG
GACGTGATCG AACGGCTGGA TGCCCTGATG GCCGCCGGTG CCCCGCGCGC CATTCCGCCG
GAGACCTGA
 
Protein sequence
MARAELPLTT PPRSLCILRF SALGDVTHMT PVVRTLQREW PETRLTWIVG KAEHTLVGDI 
PGVDFAVFDK AAGWAGYRDL WRQLRGQRFD VLLHNQFALR ANIASLGIRA DLRLGYDRAR
SRDLHGLFIN ARIPPHPGQH VIDIYFSFIE TLGLRRRHMV WDIPVPEAAE ARARALTPDD
TPTLVISPCS SHALRNWTVA GCARVADHAA RRHGLRVLIT GGPSEVERET GAAIAAQAET
APENLVGQTS IKEMLALLGR ATAVVSPDSG PAHMANAMGT PVIGLYACTN PGRARPYYSG
QWCVDRYDEA SRRELGRPAS EIRWGTKIER PGVMALITPE DVIERLDALM AAGAPRAIPP
ET