Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2799 |
Symbol | |
ID | 4269142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3181816 |
End bp | 3182901 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638127561 |
Product | hypothetical protein |
Protein accession | YP_743629 |
Protein GI | 114321946 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.741691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGAGG AAGTAGCGAG TCCGGTCGTC CTGTTCGTGC CGATCAGTGG GCGCGAGGGC TCCGGTGAGT ATTACCGGCT GTTGACCCTT GCGAAGGGCC TGTCGCAACG GTGCCCGGAT TGGTCGCTGC AGTTCGTGGT CAACCGGAGC GCCCGGGTGG AGCGCCCCGC CTTTATCCGC GTCCATGAGC TCGAGGGGAC GCCCAAGCGG AATCCGAACC GGCTCAGCAA CCTTATCCAG CAGTTGCGGC CGCAGGTCGT GGTGTTCGAC AGCACCCTGA GGCCTTGGAT GCTTAGGGCG GCGCGGTCTG TGGGCGCCCG CACCGTCTAT GTCAGCTGGC GCCCGCGCAC GCGCCGGCTG GGTTTTGCCC GCAAGCACTT GCGTTTGCTG GATGAACACT GGCTGGTGGG CGATCCCGGA GACCTGCGCC TGACGTTCGG TGAGCGTTGC CGGCTCATGG CGGCGGGGAG CCGCACGGCG ATCCGGTTCT TCTCAGCCCT GATGCCGCTG CCCGACCAGC AGGCCGCGGA TACCCTGCTC CGGCGCCTGG CGCCCGGGGC GGACGGGTAT GTCTTTTTTT GTTCCGGCGG TGGCGGGACC TCCGTGGGCG GACGACCGTC CAGCGAGATC TTCCAGCGGG CGGCGCGTCT GTTCCACGCC CGGACCGGAG TGCCTGTCGT GTTTGTGGCC GGGCCCCTCA GTTCTCACCG CCTGAGTGAT CAGCCGGGTC GGTTGGAATT GCGGTCGGTG CCCTCGGAGG TGTTCGTGGC CCTGGTGGCC CGTGCCCGGT TGGCGGTGAG TGGCGGGGGC AGCATGATTC AGCAGGTGTT GTCGGCCCGG GTGCCCTGTG TGGGGGTGGC CGCTGGCGGC AACGACCAGC CCCAGCGCAT CGAAAACCTG GCGGCGCAAG GCCGGATCGT GCCGGCGGAC GCGGACGACG AGGGCATCGC GAGTGCCGTG GAGGCACTGC ATGCCGACCC CGACCGCCAA GCCCGGATCT TGGCCAATGG GGCGGAACGG GCTTACGCCA ATGGCACGCC CGAGGCGGTG GACCGGCTCA TCGCCCTGAT ACCGGAACGT CCCTGA
|
Protein sequence | MVEEVASPVV LFVPISGREG SGEYYRLLTL AKGLSQRCPD WSLQFVVNRS ARVERPAFIR VHELEGTPKR NPNRLSNLIQ QLRPQVVVFD STLRPWMLRA ARSVGARTVY VSWRPRTRRL GFARKHLRLL DEHWLVGDPG DLRLTFGERC RLMAAGSRTA IRFFSALMPL PDQQAADTLL RRLAPGADGY VFFCSGGGGT SVGGRPSSEI FQRAARLFHA RTGVPVVFVA GPLSSHRLSD QPGRLELRSV PSEVFVALVA RARLAVSGGG SMIQQVLSAR VPCVGVAAGG NDQPQRIENL AAQGRIVPAD ADDEGIASAV EALHADPDRQ ARILANGAER AYANGTPEAV DRLIALIPER P
|
| |