Gene Mlg_2342 details


Gene Information

Locus tag: Mlg_2342
Symbol:
ID: 4269098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2653230
End bp: 2654645
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 64%
IMG OID: 638127100
Product: glycosyl transferase family protein
Protein accession: YP_743172
Protein GI: 114321489
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase
TIGRFAM ID:
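
The coordinate and length fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2653230
END_BP = 2654645
GENE_LENGTH_BP = 1416
PROTEIN_LENGTH_AA = 471


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1416
print(expected_protein_length(GENE_LENGTH_BP))  # 471
```

Both results agree with the Gene Length and Protein Length fields listed in the record.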


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCTGG AGCTTGGTTT GCAAACGCTA ACAGATCCTG CCGTGACGGC GATCCTGACC 
TCCCTTCTCA TTGGTTGGGT GCTGGTCGCG GCGGAACCTG CGTTATCCCG TCTGACGCGA
GACCGGAACG ACCTGGATGC CGTGCAGGCC TCGCACACCG GCGAGGTGCT GCGGTTGGGC
GGGGTCGCGA TCTTTGGCGG GGTGCTGGCC GGCGCCCTGG TCCTGAGCGG GACGACCGAT
ATCAGCTTCA CCATGCTCCT CCTGCTGACA GCCTTGCCGG TGCTGATGGC GGGGCTGGCG
GAGGACTTGG GCTATCCTGT CTCACCCCGG GGCCGCTTGA TGGCCGCCGC CATATCGGCA
GCGGCCTGCG TCCTGATCCT GGGCCTTTGG GTGCCGAGGG CCGACTTACC AGGCATTGAC
CTGCTGATGA CCTTCATGCC CCTGGCGATC GTTCTCACGG TGCTGGGCGC GGCCGGTTTT
TGCCACGCGG TCAATCTGAT TGACGGTATG AATGGCTTGG CGGCCTTCAC CGCACTCGTG
GCGGCTGCCG GGCTAAGCGC GATCGCTTAC CAGGCTGGCG AGCCCGAGAT AAGCCTCTTT
GCCATGCTGC TGGGGGCGGC TTGCCTGGGG TTCCTGGCGT GGAATTGGCC GCTGGGCAGG
CTGTTCCTTG GGGATGCGGG CTCCTACGGC ATTGGGCACC TGCTGGCTTG GCTGGCCATC
GCCCTGGTTA TGCTGGCCCC AGCGGTTGCG TTTGCTGCGG TGGTTCTCGT CCTTTTCTGG
CCACTTGCAG ACACCCTGCA CACCATCCTC CGCAGGTTCC TGGCGCGGCA GCGCATTGCT
GAACCCGACA AGATGCACCT GCACCAGAAG ATCCGGCGGT GCCTGGAGGT GGTCTGGTTC
GGCTCCAACC GCCGTGAACT GACCAATCCA CTGGCGACGC TGGTGATGGC GCCGATAATT
GCGCTACCGG TGGCCACCGG CGTTATACTC TGGAATCAGG CAGTGGCCGC GTACGTGGCG
CTTGCCTTCT TCGCATTGGC CTTTGGCGGG CTGCATCTGA TGATCATGCG GCTTGCAACG
CTCTACCGTC GTGCCAGGTG GCCCTTCAGC GCCCTCAACC GGAGACAGGA TGCGGTGGCT
TCGCCCGACT CACTTAACCC TCCGCTGATC GCCGTGCGCG TCGACTCGGA CTATTCCGGA
ATGTTCATTC AGGATGGTCT GGCCGTCGAC GTGCGAATCT TCCGGTACGC CAAGGACACC
CACTGGACCC TGGAGACCTA TGATGGTGTA AACCCACCTG TTCAGTGGAG CCAGCAATTC
GATACGGAAC GGGCCGCCTG GGACGCGTTC ATGCGGGCGG TTCGCGAAGA TACGATGGAC
ACCCTGGCGA GGGGCTACCA GGTCCGGCCT CGCTAA
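
The GC content field of the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in with the same 10-base spacing and line breaks as shown above; `gc_percent` is a hypothetical helper name, and the example call uses only the first 10-base group of the gene, not the full sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base group of the gene sequence above: 6 of 10 bases are G or C.
print(gc_percent("ATGTCGCTGG"))  # 60.0
```

Applying the same function to the whole gene sequence would give the value to compare against the GC content listed in the record.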
 
Protein sequence
MSLELGLQTL TDPAVTAILT SLLIGWVLVA AEPALSRLTR DRNDLDAVQA SHTGEVLRLG 
GVAIFGGVLA GALVLSGTTD ISFTMLLLLT ALPVLMAGLA EDLGYPVSPR GRLMAAAISA
AACVLILGLW VPRADLPGID LLMTFMPLAI VLTVLGAAGF CHAVNLIDGM NGLAAFTALV
AAAGLSAIAY QAGEPEISLF AMLLGAACLG FLAWNWPLGR LFLGDAGSYG IGHLLAWLAI
ALVMLAPAVA FAAVVLVLFW PLADTLHTIL RRFLARQRIA EPDKMHLHQK IRRCLEVVWF
GSNRRELTNP LATLVMAPII ALPVATGVIL WNQAVAAYVA LAFFALAFGG LHLMIMRLAT
LYRRARWPFS ALNRRQDAVA SPDSLNPPLI AVRVDSDYSG MFIQDGLAVD VRIFRYAKDT
HWTLETYDGV NPPVQWSQQF DTERAAWDAF MRAVREDTMD TLARGYQVRP R
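
The protein sequence can be checked against the gene sequence by translating the coding region. The sketch below builds the codon table by hand rather than relying on a library; it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). `translate` is a hypothetical helper name, and the example translates only the first 30 bases of the gene.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    # Step through complete codons; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCGCTGG AGCTTGGTTT GCAAACGCTA"))  # MSLELGLQTL
```

The output matches the first ten residues of the protein sequence listed above.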