Gene Mlg_0132 details

Gene Information

Locus tag: Mlg_0132
Symbol:
ID: 4269825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 150706
End bp: 152499
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 68%
IMG OID: 638124856
Product: glycosyl transferase family protein
Protein accession: YP_740977
Protein GI: 114319294
COG category: [I] Lipid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis; [COG2267] Lysophospholipase
TIGRFAM ID: [TIGR03101] hydrolase, ortholog 2, exosortase system type 1 associated
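
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 150706, End bp 152499
print(cds_lengths(150706, 152499))  # (1794, 597)
```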


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCCG GATTTCTAGA AGGGCCCCAA GGGCCACTCT TCCATATCCT TCACCCACCG 
GAGGCGGAGC CCCCCAAGGG TTGCGTGCTC TATGCTCCGC CGTTTGCGGA GGAGCTGAAC
AAATCCCGCC GCATGGTGGC GGAGCAGGCG CGCAGGCTGG CGGCTGCCGG CTACGCCGTG
CTGCTGCCCG ACCTGTATGG CTGCGGCGAC AGCGCCGGTG AGTTGCAGGA TGCCCGCTGG
GAGGCCTGGC TGGACGACCT GCAGCGGTGC GCGGAGACGC TATGCGCCCG TTTTCCGGCC
CCGCTGCACC TGTGGGGGCT GCGCAGCGGC TGCCTGCTGG CCAGCGCCCT GGCCCACCGC
CTGGAGACCC CACCCCGCTC ACTGCTCTAC TGGCAACCGG TCGGCAACGG CAAGCTCTTC
CTGACCCAGT TCCTGCGTCT GCGCGTGGCC GCCGGGATGA TGAGCGGCGG AAAGGAGACC
ACCGCGGCAT TGCGTGAGCG CCTGGCCGGC GGCGAGACCC TGGAGATCGC CGGCTACCCA
CTGGCACCGG CCCTGGCCCA GGCGCTGGAG CAGGCCCGTT TGCAGCAACC GCCCGATGGC
GTCGAGGTGC ACTGGGTGGA AGTGATGCAG GGGGATGCCC CGCAGCTGCC TCCTGCCAGT
CAGCGGCTGG TGGACGACTG GCAGGAGGCC GGTATCGCGG TGCAGGCCGC AGTGGTGCCC
GGCGAACCCT TCTGGTCCAC CCAGGAGATC CGCACGGTAC CTGCGCTGTG GCAACGGACG
CTGGGTTGCC TCCAGCGCGG CCCGGCGGCA GCAGCGCAAG CGGCCGATGC AAGCGCCCAG
CCCCTGGTTT CGGTGATCAT GCCGGCGTTC AACGCCGCCA GTTACATCGA GGAGGCCATC
GACAGCGTCC TGGCCCAGGA CTACCCGCAC AAGGAGCTAA TTGTCATCGA CGATGGCTCC
AGTGACGACA CGGTGGCCCG GGTGCAAGCC TACGGTGACC GGGTACGGCT GTTGACCCAG
GCCAACCAGG GCTCGGCGGT GGCCCGGAAC CAGGGCCTGG ATGCCGCCCA GGGGGAGTAC
ATCGCCTTTC TGGATTCCGA CGACGTGTGG CTGCCGGGCA AGCTGACGGC GCAGGTGGGG
TACCTGGAGG CGCACCCGGA TGTGGGCATG ATCTACTCGG ACTGGCTGCC CTGGAAACGG
GACAAGCAGT CCAAGGCCTT CCCCCCACCC GAAGCCCTGG CACCGGCAAC ACCTGATACC
GGGGTACCTC CGGAAGAGAT CCCGCTGCTG ACCGAAGGCT CCGGCTGGCT CTACAACCGG
CTGCTCTTTG GCTCGCTACT GCACACCATC ACGGTCATGG CCCGCCGTGA GCTGATCGAG
CAGGTCGGCC GGTTCGATCC CGAACTGAAA CGGGGTCAGG ATTACGACTA CTGGCTGCGG
GCCTCCCGCC ACACCGAGAT CCACCAGCTG GACCGGGTGT TCGCGCTGTA CCGATTGCAC
GGCAGCGGCT GCATCACCCA ATGGCCGGAC ATCAACTACG AAAAGCTGGT GGTGGAAAAG
GCGTTGGCCC GCTGGGGGCT GGAGGGACCC ACCGGTGAAC GCTCCGACCG CAAGGCCGTC
GAGCGACGCC TGGCCGGCAC CTGCTTTGAC TTTGGCTATC ACCACTACTG GAGCGGTAAC
CCCCGCAGGG CCAGCCGGTC CTTCCTGGAG GCGCTGCGCC ACCACCCCCG CCACCTGGGC
AGTTGGCGCT ACGCCGGGAT GAGCCTGGCC ATGGGTCTCT TCAAGGGGCG TTAA
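
The GC content field reported for this gene is the percentage of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows. The helper `gc_percent` is a hypothetical name, and it assumes that the spaces and line breaks in the listing are formatting only and should be ignored.

```python
def gc_percent(seq):
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other non-ACGT characters are skipped, so the
    10-base blocks of the listing can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; the record's GC content
# refers to the full 1794 bp gene, not to this 60-base fragment.
first_row = "ATGGAAGCCG GATTTCTAGA AGGGCCCCAA GGGCCACTCT TCCATATCCT TCACCCACCG"
print(round(gc_percent(first_row), 1))
```

Passing the complete gene sequence instead of `first_row` gives the figure to compare against the GC content field.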
 
Protein sequence
MEAGFLEGPQ GPLFHILHPP EAEPPKGCVL YAPPFAEELN KSRRMVAEQA RRLAAAGYAV 
LLPDLYGCGD SAGELQDARW EAWLDDLQRC AETLCARFPA PLHLWGLRSG CLLASALAHR
LETPPRSLLY WQPVGNGKLF LTQFLRLRVA AGMMSGGKET TAALRERLAG GETLEIAGYP
LAPALAQALE QARLQQPPDG VEVHWVEVMQ GDAPQLPPAS QRLVDDWQEA GIAVQAAVVP
GEPFWSTQEI RTVPALWQRT LGCLQRGPAA AAQAADASAQ PLVSVIMPAF NAASYIEEAI
DSVLAQDYPH KELIVIDDGS SDDTVARVQA YGDRVRLLTQ ANQGSAVARN QGLDAAQGEY
IAFLDSDDVW LPGKLTAQVG YLEAHPDVGM IYSDWLPWKR DKQSKAFPPP EALAPATPDT
GVPPEEIPLL TEGSGWLYNR LLFGSLLHTI TVMARRELIE QVGRFDPELK RGQDYDYWLR
ASRHTEIHQL DRVFALYRLH GSGCITQWPD INYEKLVVEK ALARWGLEGP TGERSDRKAV
ERRLAGTCFD FGYHHYWSGN PRRASRSFLE ALRHHPRHLG SWRYAGMSLA MGLFKGR
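
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first row of the gene. It assumes the standard codon assignments, which translation table 11 shares (the tables differ in which codons may act as start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace in the input is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence give the first 20 residues.
first_row = "ATGGAAGCCG GATTTCTAGA AGGGCCCCAA GGGCCACTCT TCCATATCCT TCACCCACCG"
print(translate(first_row))  # MEAGFLEGPQGPLFHILHPP
```

The same function applied to the last twelve bases of the gene, `AAGGGGCGTTAA`, yields `KGR` and stops at the TAA codon, matching the end of the protein sequence.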