Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0132 |
Symbol | |
ID | 4269825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 150706 |
End bp | 152499 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638124856 |
Product | glycosyl transferase family protein |
Protein accession | YP_740977 |
Protein GI | 114319294 |
COG category | [I] Lipid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG2267] Lysophospholipase |
TIGRFAM ID | [TIGR03101] hydrolase, ortholog 2, exosortase system type 1 associated |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCCG GATTTCTAGA AGGGCCCCAA GGGCCACTCT TCCATATCCT TCACCCACCG GAGGCGGAGC CCCCCAAGGG TTGCGTGCTC TATGCTCCGC CGTTTGCGGA GGAGCTGAAC AAATCCCGCC GCATGGTGGC GGAGCAGGCG CGCAGGCTGG CGGCTGCCGG CTACGCCGTG CTGCTGCCCG ACCTGTATGG CTGCGGCGAC AGCGCCGGTG AGTTGCAGGA TGCCCGCTGG GAGGCCTGGC TGGACGACCT GCAGCGGTGC GCGGAGACGC TATGCGCCCG TTTTCCGGCC CCGCTGCACC TGTGGGGGCT GCGCAGCGGC TGCCTGCTGG CCAGCGCCCT GGCCCACCGC CTGGAGACCC CACCCCGCTC ACTGCTCTAC TGGCAACCGG TCGGCAACGG CAAGCTCTTC CTGACCCAGT TCCTGCGTCT GCGCGTGGCC GCCGGGATGA TGAGCGGCGG AAAGGAGACC ACCGCGGCAT TGCGTGAGCG CCTGGCCGGC GGCGAGACCC TGGAGATCGC CGGCTACCCA CTGGCACCGG CCCTGGCCCA GGCGCTGGAG CAGGCCCGTT TGCAGCAACC GCCCGATGGC GTCGAGGTGC ACTGGGTGGA AGTGATGCAG GGGGATGCCC CGCAGCTGCC TCCTGCCAGT CAGCGGCTGG TGGACGACTG GCAGGAGGCC GGTATCGCGG TGCAGGCCGC AGTGGTGCCC GGCGAACCCT TCTGGTCCAC CCAGGAGATC CGCACGGTAC CTGCGCTGTG GCAACGGACG CTGGGTTGCC TCCAGCGCGG CCCGGCGGCA GCAGCGCAAG CGGCCGATGC AAGCGCCCAG CCCCTGGTTT CGGTGATCAT GCCGGCGTTC AACGCCGCCA GTTACATCGA GGAGGCCATC GACAGCGTCC TGGCCCAGGA CTACCCGCAC AAGGAGCTAA TTGTCATCGA CGATGGCTCC AGTGACGACA CGGTGGCCCG GGTGCAAGCC TACGGTGACC GGGTACGGCT GTTGACCCAG GCCAACCAGG GCTCGGCGGT GGCCCGGAAC CAGGGCCTGG ATGCCGCCCA GGGGGAGTAC ATCGCCTTTC TGGATTCCGA CGACGTGTGG CTGCCGGGCA AGCTGACGGC GCAGGTGGGG TACCTGGAGG CGCACCCGGA TGTGGGCATG ATCTACTCGG ACTGGCTGCC CTGGAAACGG GACAAGCAGT CCAAGGCCTT CCCCCCACCC GAAGCCCTGG CACCGGCAAC ACCTGATACC GGGGTACCTC CGGAAGAGAT CCCGCTGCTG ACCGAAGGCT CCGGCTGGCT CTACAACCGG CTGCTCTTTG GCTCGCTACT GCACACCATC ACGGTCATGG CCCGCCGTGA GCTGATCGAG CAGGTCGGCC GGTTCGATCC CGAACTGAAA CGGGGTCAGG ATTACGACTA CTGGCTGCGG GCCTCCCGCC ACACCGAGAT CCACCAGCTG GACCGGGTGT TCGCGCTGTA CCGATTGCAC GGCAGCGGCT GCATCACCCA ATGGCCGGAC ATCAACTACG AAAAGCTGGT GGTGGAAAAG GCGTTGGCCC GCTGGGGGCT GGAGGGACCC ACCGGTGAAC GCTCCGACCG CAAGGCCGTC GAGCGACGCC TGGCCGGCAC CTGCTTTGAC TTTGGCTATC ACCACTACTG GAGCGGTAAC CCCCGCAGGG CCAGCCGGTC CTTCCTGGAG GCGCTGCGCC ACCACCCCCG CCACCTGGGC AGTTGGCGCT ACGCCGGGAT GAGCCTGGCC ATGGGTCTCT TCAAGGGGCG TTAA
|
Protein sequence | MEAGFLEGPQ GPLFHILHPP EAEPPKGCVL YAPPFAEELN KSRRMVAEQA RRLAAAGYAV LLPDLYGCGD SAGELQDARW EAWLDDLQRC AETLCARFPA PLHLWGLRSG CLLASALAHR LETPPRSLLY WQPVGNGKLF LTQFLRLRVA AGMMSGGKET TAALRERLAG GETLEIAGYP LAPALAQALE QARLQQPPDG VEVHWVEVMQ GDAPQLPPAS QRLVDDWQEA GIAVQAAVVP GEPFWSTQEI RTVPALWQRT LGCLQRGPAA AAQAADASAQ PLVSVIMPAF NAASYIEEAI DSVLAQDYPH KELIVIDDGS SDDTVARVQA YGDRVRLLTQ ANQGSAVARN QGLDAAQGEY IAFLDSDDVW LPGKLTAQVG YLEAHPDVGM IYSDWLPWKR DKQSKAFPPP EALAPATPDT GVPPEEIPLL TEGSGWLYNR LLFGSLLHTI TVMARRELIE QVGRFDPELK RGQDYDYWLR ASRHTEIHQL DRVFALYRLH GSGCITQWPD INYEKLVVEK ALARWGLEGP TGERSDRKAV ERRLAGTCFD FGYHHYWSGN PRRASRSFLE ALRHHPRHLG SWRYAGMSLA MGLFKGR
|
| |