Gene Information |
Locus tag | Mlg_1948 |
Symbol | infB |
ID | 4268116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2214700 |
End bp | 2217351 |
Gene Length | 2652 bp |
Protein Length | 883 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126702 |
Product | translation initiation factor IF-2 |
Protein accession | YP_742780 |
Protein GI | 114321097 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR00487] translation initiation factor IF-2 |
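The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed in the Gene Information table.
    length = gene_length_bp(2214700, 2217351)
    print(length, protein_length_aa(length))
```

With the Start bp and End bp values from this record, the two functions reproduce the listed Gene Length (2652 bp) and Protein Length (883 aa).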
| |
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0536071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAG ACAAGGTAAG AGAGTTCGCG GAAACGGTCG GCATCCCGGT GGAGCGCCTG GTGTCCCAAC TGGAGGCGGC GGGCATCAGT GGGCGTGGGC CGGAGGACCC CCTTTCCGAT CTGGACAAGG CCACCCTGCT CGAATACCTG CGCAAGGGGC GTGACGGCGG TGAGCAGGAC GACGACCAGG CGCCCAGCAA GATCACCCTC CGGCGCAAGA AGGTCAGTAC GCTGAAGATG CCCGCCAGCG GGGGCGGGGG CAGCGGCGCC CGGGGCCCCC GGCAGACGCG CACGGTCAAC GTGGAGGTGC GCAAGAAGCG CACCTACGTG AAACGCAGCG TGGTGGAGGC CGAAGAGTCC AAACACGACG TCGAGCGCCT GGAGCGGGCC CTGATCGAGG ACCGCAAGCG GGCGGAGGAG CGGGCGCGTC GGGAGGCCGA GGAGGCTGAG GCCCGCCGCC GTGAGCAGGA GGAGGCCGAG CGCCGACAGG CGGAAGCCGA GGCGCTCCGT CAGGCGGAGG CCGAACGTGA GGCCACGGCC GAGACGGCCG GCGTCGCCGA CGAGGCGGAC AAGGCCGAAC CCCAGCCCGA TCCGGAAGCG GCACGACTGG CTGCCGAGAA GGAAGAGGCG CGCAGGCGCG AGGAAGAGAA GGAGCGCCGC CGGCTGGAGC AGGAGGCCCG GCGCGAACGC GAGGCCGAGG AGCGCGCGGC CCGTAAGACC GGCGCCACCG CGCCTGCTGC CAAGGGCAAG CAGAAAAAGG GCCGGGAGAG CCTCTCCATG GGCGCCGGCA AGCCCGGCCG CCGGGGGGGC AAGAAGGGCG GCAGGCGTGC CGCCTCCGGT GGTGAGGCGG CCAAGCAGCT ACAGCATGGC TTCGCCAAAC CCACCCAGCC GGTGGTGCGT GAGGTGGAGA TCCCTGAGAG CATCACCGTC GGTGATCTGG CACAGAAGAT GAGCGTCAAG GCCGCCGTGC TCATCAAGGA GATGATGAAG CAGGGGGTGA TGGCGACCAT CAACCAGGCC CTGGACCAGG ACACCGCGGT CCTGCTGGTC GAAGAGATGG GCCACAAGCC GGTGATTGTG CGTGCGGACG CCCTTGAGGA GGAGGTGCTG CAGGATACCT CCCAGGCCCA GGAGGGCGAC AAGGCCCCGC GGCCGCCGGT GGTCACCGTC ATGGGCCACG TCGACCACGG CAAGACCTCG CTGTTGGACA ACATCCGCCG GGCCAAGGTG GCGGACGCCG AGGCCGGGGG GATCACCCAG CACATCGGCG CCTACCACGT GGAGACCGAC CGCGGCATGG TCACCTTCCT GGACACCCCG GGACACGAGG CCTTCACCGC CATGCGTGCC CGTGGTGCCC AGTTGACCGA CATCGTGGTG CTGGTGGTGG CGGCCGACGA CGGCGTCATG CCCCAGACCG AAGAGGCGGT GCGCCATGCC AAGGCGGCCG AGGTGCCGAT GGTGGTGGCG GTCAACAAGA TCGACAAGCC GGATGCGGAC CCGGACCGGG TCAAGCAGGA GCTCTCGCAG ATGGAGGTCA TCCCCGAGGA GTGGGGCGGT GATGTCCAGT TCATCCACGT CTCGGCAAAA CAGGGCGAGG GTCTGGATGA TCTGCTGGAG GCGATCCTGC TGCAGGCCGA GCTGATGGAG CTGGGGGCCG TGGCCGAGGG CAACGCCTCC GGTATCGTGC TGGAGTCCAG TCTGGACAAG GGGCGCGGCC CGGTGGCCAC GGTGCTGGTG CAGAGCGGCC TGCTGAAGAA GGGCGACAGC CTGCTCTGCG GCACCGAATA CGGCCGCGTG 
CGTGCGCTGA TCGACGAGAC CGGCAAGCGG GTCGACGAGG CCGGGCCCTC GATCCCCGTG GTGGTGCTTG GGCTCTCCGG TCTGCCCTCC GCCGGCGACG ACATGGTGGT GGTCGACGAC GAGAAGAAGG CCCGTGAGGT GGCCGAGATG CGCAAGGAGC GTCAGCGCGA CAAGCGTCTG GCCCAGCAGC AGGCGGCCCG CATGGAGAAC CTCTTCAACC AGATGAAGGA GGATGAGGTC AACACGGTCA ACCTGGTGGT CAAGGCGGAC GTCCAGGGCA GTGCCGAGGC GCTGCAGCAA TCGCTGGCCA ACCTCTCCAC CGACGATATC CAGGTCAAGG TGATCTCCTC CGGCGTGGGC GCCATCAACG AGTCCGATGT CAACCTGGCC CTGGCCTCCA ACGCCATCCT GATCGGCTTC AACGTGCGTG CCGATGCGGC TGCCCGGCGT CTGGTCCAGG AGAACGACGT CGACCTGCAC TACTACAGCG TCATCTACGA CGCCATCGAG CAGGTGAAAA ACGCCATCTC CGGGATGTTG GAACCGGAAC TCGAGGAGCA CATCATCGGC CTGGCAGAGG TCAAGGACGT GTTCCGCTCC TCCAAGCTTG GCGCGGTGGC CGGCTGTCTG GTCACCGAGG GGGCAGTGCG CCGGAAGAAC CCCATCCGCG TGCTGCGCGA CAACGTGGTC ATCTACGAGG GCGAGCTGGA GTCCCTGCGC CGCCACAAGG ACGACGTCAC CGAGGTCAAG TCCGGCACCG AGTGTGGTAT CGGCGTTAAG AACTACAACG ACGTCCGCAT TGGCGACCAG ATCGAGTGCT ACGAGCGCGT GGAAGTGCGC CGCGAGCTGT GA
|
Protein sequence | MAEDKVREFA ETVGIPVERL VSQLEAAGIS GRGPEDPLSD LDKATLLEYL RKGRDGGEQD DDQAPSKITL RRKKVSTLKM PASGGGGSGA RGPRQTRTVN VEVRKKRTYV KRSVVEAEES KHDVERLERA LIEDRKRAEE RARREAEEAE ARRREQEEAE RRQAEAEALR QAEAEREATA ETAGVADEAD KAEPQPDPEA ARLAAEKEEA RRREEEKERR RLEQEARRER EAEERAARKT GATAPAAKGK QKKGRESLSM GAGKPGRRGG KKGGRRAASG GEAAKQLQHG FAKPTQPVVR EVEIPESITV GDLAQKMSVK AAVLIKEMMK QGVMATINQA LDQDTAVLLV EEMGHKPVIV RADALEEEVL QDTSQAQEGD KAPRPPVVTV MGHVDHGKTS LLDNIRRAKV ADAEAGGITQ HIGAYHVETD RGMVTFLDTP GHEAFTAMRA RGAQLTDIVV LVVAADDGVM PQTEEAVRHA KAAEVPMVVA VNKIDKPDAD PDRVKQELSQ MEVIPEEWGG DVQFIHVSAK QGEGLDDLLE AILLQAELME LGAVAEGNAS GIVLESSLDK GRGPVATVLV QSGLLKKGDS LLCGTEYGRV RALIDETGKR VDEAGPSIPV VVLGLSGLPS AGDDMVVVDD EKKAREVAEM RKERQRDKRL AQQQAARMEN LFNQMKEDEV NTVNLVVKAD VQGSAEALQQ SLANLSTDDI QVKVISSGVG AINESDVNLA LASNAILIGF NVRADAAARR LVQENDVDLH YYSVIYDAIE QVKNAISGML EPELEEHIIG LAEVKDVFRS SKLGAVAGCL VTEGAVRRKN PIRVLRDNVV IYEGELESLR RHKDDVTEVK SGTECGIGVK NYNDVRIGDQ IECYERVEVR REL
|
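The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted string might be joined and its GC fraction computed. The helper names are hypothetical, and the sample is only the first three blocks of the gene sequence, not the full record.

```python
def parse_blocks(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence, used only as a short sample.
    sample = "ATGGCTGAAG ACAAGGTAAG AGAGTTCGCG"
    seq = parse_blocks(sample)
    print(len(seq), gc_fraction(seq))
```

Applied to the full gene sequence, the result can be compared with the GC content field in the Gene Information table. No value for the full sequence is asserted here.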