Gene Information |
Locus tag | Mlg_0228 |
Symbol | |
ID | 4268928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 262798 |
End bp | 263793 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638124952 |
Product | cobalamin synthesis protein, P47K |
Protein accession | YP_741073 |
Protein GI | 114319390 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
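The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 996 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above (Start bp, End bp).
gene_len = gene_length_bp(262798, 263793)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 996 331
```

Both results match the Gene Length and Protein Length rows, so the record is internally consistent under these assumptions.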
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTCC CCTGCAATCT GATTACCGGC GCCCTGGGTG TCGGCAAGTC CACGGCCATA CGCCACCTGC TCAAGCATCA TCGCCCCAAA GGGGAGCGCT GGGCGGTGTT GGTCAACGAG GTCGGGGCCG TGCCCGTCGA TCAAGCGGCC CTGTCGGTGG ACGACAACGT GGTGGTGGCA GACCTGCCCG GCGGCTGCCT GTGCTGTACC CTGGGCGCGC CTTTTGATCG CACCCTCAAG CGACTGCTGC GCCGGGAGCG GCCCGACCGC CTGCTGATCG AGCCCACCGG TTTGGGCCAT CCGGCCCGCA TCCTGCAGAC CCTGCGGGAG GGGCCCGTGG CGGCGTCGGT CCGGTTGGGT GCCACCATTA CGCTGGTGGA TCCGCGGCAA TGGCGCAGTG GCGAATTGGC GGATAATCCG GCCTGGTGGG ACCAGATCGA GCTCGCCGAC GTCCTCGTGG CCAACAAGGC CGACCTCGCC CCCTCGGGCG ATGTCGCCGC CTTCATGGGG TGGGCGGCGG ACCTGTTCCC GCCCAAGGCG CGGGTGGAGA TCACCCGCCA CGGCCAGCTC AACCCGGAGT GGCTATCCGA ACCCTGCGAT TCCGGACGCT CGCCGCTGTT CCCCGATGCC CATCAGGTGG CCGCTGGGGA CTATGTCCAG CAGGGTGCCG AGCCGGTGGA TGAAGGGGTC TGGCGGGCCT GCGGCCGGTC ACTGGGCCAA CGCAGCGTCG GCTGGGTGTT CGCGGCAGCC ACGGTGTTTG ACCGTCAACG CCTGCTGCGC ACGCTCAACG AACTGCGCCC GGCGCAGCGC CTGAAGGGCG TCTTTCGCAC CGGACGGGAC TGGCTGCTGG TGAATGCCGA TCGGGACGGG GTGCGGGCCG AGGTCTGTGA CTGGCGTCGC GACAGTCGTC TGGAGGTCAT CGGCCACGAG GATGCCCGGG CCCTGGAGGC GGCGCTGCTG GCCTGCCGGC GCCAACGGGA GAAAGACCCG GTCTGA
|
Protein sequence | MSVPCNLITG ALGVGKSTAI RHLLKHHRPK GERWAVLVNE VGAVPVDQAA LSVDDNVVVA DLPGGCLCCT LGAPFDRTLK RLLRRERPDR LLIEPTGLGH PARILQTLRE GPVAASVRLG ATITLVDPRQ WRSGELADNP AWWDQIELAD VLVANKADLA PSGDVAAFMG WAADLFPPKA RVEITRHGQL NPEWLSEPCD SGRSPLFPDA HQVAAGDYVQ QGAEPVDEGV WRACGRSLGQ RSVGWVFAAA TVFDRQRLLR TLNELRPAQR LKGVFRTGRD WLLVNADRDG VRAEVCDWRR DSRLEVIGHE DARALEAALL ACRRQREKDP V
|
| |
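The gene and protein sequences above can be related by translating the nucleotide sequence codon by codon. The sketch below is illustrative only: it strips the spaces that group the sequence into blocks of ten, computes a GC fraction, and translates with the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which is not relevant here because this gene starts with ATG). Only the first 30 bases of the gene sequence are used as input, and the helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def normalize(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = normalize(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = normalize(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGTCTGTCC CCTGCAATCT GATTACCGGC"
print(translate(prefix))  # MSVPCNLITG
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein and the reported GC content.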