Gene Information |
Locus tag | Mlg_2358 |
Symbol | |
ID | 4268456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2674854 |
End bp | 2676044 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 638127116 |
Product | putative glycosyltransferase protein |
Protein accession | YP_743188 |
Protein GI | 114321505 |
COG category | [R] General function prediction only |
COG ID | [COG4671] Predicted glycosyl transferase |
TIGRFAM ID | |
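The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2674854, 2676044)
print(length, protein_length_aa(length))  # 1191 396
```

Under these assumptions the result agrees with the Gene Length (1191 bp) and Protein Length (396 aa) fields.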
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0125042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.920267 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCGCC TGCGCATCCT CTTCCACTGC CAGCACCTCA GCGGGGTCGG CCACTACATG CGCGGCCTCG CGCTGACCCG GGAGCTGGCG CGCCACCACA CGGTCTGGCT CAGCGACGGC GGTCGGTCCA TTCCCGGCGC CGACCCCGGT GGCGGACGGC TGCTGGCACT GCCCCGCCTC CGGCGCCGGG AGGGCCGGCT GGAGACCCTG GACGGCAGGC CCGCCGCCGC TGTCTGGCCG GAGCGCGCCC GGCGGCTGGC CGGCGCGGTG GCCCGCTTGG CCCCGGACGT GGTCATCGTC GAGCACTACC CCTTCAGCAA GTGGGGCCTG GGGGCGGAGG TCACCGGGCT GCTGGCGGCC GCCCGCCGGC GCAACCCGGC CCTGCTCGCC GTCTGCTCGG TGCGGGATAT CCCGCTGCAG ACCCGCCACG AGCCAGTCCC GGCCGGCGAC TACGTCCGGG AGGTGCTGGC GCGGCTGCAC CAGTGGTTCG ACGCCGTGAT GGTCCACGCC GACCCCGGCC TGTGCCGGCT GGAGGAGGCC TTTCCGGCCG CCGGCCGCAT CCGCCTGTCG GTGGGCCATA CCGGGCTGGT CCCACCAGGT CCGGACCTGG CCCCGGATGC GCCGGCAGGG GCGGCCGGCG CCACTGCCGG TCCGTCGCCC TACGCCGTGG CCAGCATCGG CGGCGGGCGC GACGCCGCGG CGCTGCTCGC CCGGCTGGCG GCGCACTGGC CGGCCATCCG TCGTCGGGCC GGCCTGGATG ATCTGCCGCT GGCGCTGTTC AGCGGCCTGG GCCCGCCGGA CCCGGCCCTG GCGCGCGCAG TGGCGGGGCA GCCGGCGCTG TCGTTGCACC CCTTCGGCCC AGCGTACAGC GCCTGGCTGC GCGGGGCGGC GCTGTCCATC AGTTGTGCCG GCTACAACAC CTGTGCTCAG TTGCTGCAGC TTCGCCGGCC GGCCCTACTG GTCCCCAATA CCGCCATGTC CGACCAGTTA CGCCGGGCGG AACGGCTGCA GGCACGGGGG CTGGCGCGGC TGCTTCGCCC GGAGGCCTTC TCGGTGGCGG CAGTGGCCGA TGAGCTGCGG GCGCTGCGGG ACGCGCCGCC CGCGGACCCC GGCGTCGATC TGGACGGGGC CCGCGGTGCC CGGCGCTTTA TTGAAGGGCT GGCCGGGGCC GGCGGTCACT CAGATCGGTG A
|
Protein sequence | MTRLRILFHC QHLSGVGHYM RGLALTRELA RHHTVWLSDG GRSIPGADPG GGRLLALPRL RRREGRLETL DGRPAAAVWP ERARRLAGAV ARLAPDVVIV EHYPFSKWGL GAEVTGLLAA ARRRNPALLA VCSVRDIPLQ TRHEPVPAGD YVREVLARLH QWFDAVMVHA DPGLCRLEEA FPAAGRIRLS VGHTGLVPPG PDLAPDAPAG AAGATAGPSP YAVASIGGGR DAAALLARLA AHWPAIRRRA GLDDLPLALF SGLGPPDPAL ARAVAGQPAL SLHPFGPAYS AWLRGAALSI SCAGYNTCAQ LLQLRRPALL VPNTAMSDQL RRAERLQARG LARLLRPEAF SVAAVADELR ALRDAPPADP GVDLDGARGA RRFIEGLAGA GGHSDR
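Although the record lists the strand as "-", the gene sequence shown begins with GTG and ends with TGA, so it appears to be given in coding orientation already. The sketch below shows one way to translate such a sequence under translation table 11 and to compute its GC percentage. It assumes the input is the coding strand, contains only A, C, G and T, and that an alternative start codon such as GTG is read as methionine at the first position. The function names are hypothetical, and the table constants follow the NCBI description of table 11.

```python
from itertools import product

# Table 11 (bacterial, archaeal and plant plastid code) assigns the same
# amino acids as the standard code; it differs in the permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete coding-strand CDS; the first codon becomes M if it is a valid start."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # initiator codon is read as methionine
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First six codons of the gene sequence above, followed by its stop codon.
print(translate_cds("GTGACCCGCC TGCGCATC TGA"))  # MTRLRI
```

Passing the full gene sequence to `translate_cds` and comparing the result with the listed protein sequence is a quick consistency check on the record; `gc_percent` can be compared with the GC content field in the same way.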