Gene Information |
Locus tag | Mlg_1437 |
Symbol | |
ID | 4269247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1641369 |
End bp | 1643621 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638126193 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_742276 |
Protein GI | 114320593 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein; [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
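The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1641369, 1643621)   # Start bp, End bp from the record
    print(length, protein_length_aa(length))    # compare with Gene Length / Protein Length
```

With the values listed for Mlg_1437, the two functions return 2253 and 750, which agree with the "Gene Length" and "Protein Length" fields.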
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.715322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCG GGATGTCGGG CTTCGCCCTG GGCGTGGTCG CCCTGCAGCA GCTGCCGTCA CTGCCCGGCG GGCCCTGGCT GCCGGCGGCC CTCCTGGCAC TGCCGCTGCT CGCGCTGCGC GCCCCGGCCG TGCGACTGGT GGCGGGGCTC GCCCTTGGCT TGGGCTGGGC CACCTTGCAC GCTCACCACG GTCTGGCCCA TCGGTTGCCC CTCGCCCTGG AGGGGCAGGA CCTCATCCTT ACGGGTAGTG TCGCAGACCT GAGCGAACCC CGCGGCCACA GCACCCGGTT CGTCTTTGCG CCGGATCAGG CCCGCACCCC GAACGGGGAG CCCGTCGACG CCCGCCTCCC GCGGCGGATC CGGCTGAGTG CCTATGGACT CGCTACGCCG CCCGCCGCCG GGGAGCGCTG GCGGCTGACC GTGCGCCTGC GGCCCCCTGC GGGCGCCTTG AACGACGGCG GCTTCGATTA TGAGCGTTGG CTGCACCAGA ATCGGTTTGA CGCCACCGGC TACGTGCGGG CCGAGCCCGC TCCGCAACGC CTGACGGAGG GGCGGGGGCT CCATGCCCTG CGCGAACAAA TCGCCGGAGC GATCCGCGAG CGCGTGGGGC AGGGTGGCGC GGCGACCCTG CTGCCCGCGC TCGCGGTCGC CGATCGCAGT GGCATGACCG AGGCCCAGTG GTCGGTGCTC GGGGCGACGG GCACCGGTCA CCTGCTGGCC ATCTCCGGGC TCCATATCGG CCTGGTGGCC GGGTTTGGCT TTGTCGTCGG AGGCGGGGTC TGGCGATGTC TGCCGGCCCT GGCCCGGCGC TCGCCGGCGC GAATGACTGG CGCGGCCTTT GCCCTGCTTC TGGCGGCGGG GTATGCGGCC CTGGCCGGCT TCACCCTGCC CACCCAGCGG GCCCTGATTA TGCTCGGCGT GGCGCTGGGG GCCCTGATGC TACGGCGCCG CCCGACGGTC TCCCACGGCC TGCTCGTGGC GCTGACCGCG GTACTGATCC TGGACCCGCT GGCACCGCTG GGTCCCGGGT TCTGGCTGTC GTTTGGCGCG GTCGCCATCA TCTTCCTGCT GGCTGCTCAT CGGCGGGCCG ACCGATCGGG GTGGTGGGTG GGGCTGCGCC TACACGCCCT GATCAGCCTG GCGCTGCTCC CGGTGATCGG CTGGTGGTTC GACGAACTGC CCCTGATCTC GGCACCGGCC AACATGCTGG CCATCCCGGT GGTCGCCTTC CTGGTGGTGC CGCCGCTGCT CCTGGGCGTG CCGTTGCTGG CCTTGCTGCC ACCCCTGAGC GAGGCCTTGC TGGTCTTCAG CCTCGGCGTG CTCGAGGGGC TGATGCAGGG GCTGGCCTGG CTTGCCGAGT ACGGCCAATG GGGCGATCCG GCGGGGGTCC GGCAGGGGCT CTGGGTGGCG GCGGCAGGCG CGCTCCTGCT GTTATTGCCG CCCGGCTGGT TCGGGCGCTG GCTGGGGCTA CCGCTGCTCG CCTTGCCGGT GGCGGCCGGG CCCCAGCCGG ACGCAGCATC CGCCCCGTCG CAGCTCGCCG TGCTCGATGC CGGTCGCGGA CTGATCAGTG TGCTCGTTGT GGCGGACCGG GTTCTGGTCT ATGGCAGCGG GGCACGAGTG GGCCGTGGCG CGACCGCGGC CGAGCGCACC CTGGTGCCCT GGTTGGAGGT TCGGGGGCTG CGCCCGGACT ACCTGATCCC CGGGGGCCGC GGCTCGGCCT GGACCGGAGG GCTGGAGGCC CTGCGGGCGC GCTACCCCCA GGCGCACCCG GTAACAACCT GCGAGGCCGG CGGCGCCCTG 
CCACCCGGTG TCCGGCTGCG GCCGGTGGCC GGGGGCTGTG CCCTGGAGAC GACCCTGGGG CAAGCGCGGG TGCGACTCAC CCCGTCGGAG CGGCCGCACC ACAACGGGGC GCCGCCGATG GCGGTGCTCG TCGCCCCGCT CACCCATCTG CAGCAGCTCG AGGCCGGGGG GCACTCGCCC CGGTACCGAA TCGGGTACCC GGTGCGGCGC GAGGAGGCCA CGCTGGGCAA TGCCAGGGCC ACCGCGCACC TGGGCCACAA CCCGGCGGTG CTGGGCACCG TCCTCGTCCG ACCCGGGGCA GAGGGGCTGC GGCTGGAGTG CTGGTTGCGG GATCAGGGGC GCTACTTCCA CCGCCCGTGG CCGCCGGGGC AGGGGAGCAC TGACACGGGC GAGGGCGCTC CAAGTCGGTG GGCAAATCCT TTATCATCGC GGTCATGCGA ACCGGGCCCC TGA
|
Protein sequence | MRAGMSGFAL GVVALQQLPS LPGGPWLPAA LLALPLLALR APAVRLVAGL ALGLGWATLH AHHGLAHRLP LALEGQDLIL TGSVADLSEP RGHSTRFVFA PDQARTPNGE PVDARLPRRI RLSAYGLATP PAAGERWRLT VRLRPPAGAL NDGGFDYERW LHQNRFDATG YVRAEPAPQR LTEGRGLHAL REQIAGAIRE RVGQGGAATL LPALAVADRS GMTEAQWSVL GATGTGHLLA ISGLHIGLVA GFGFVVGGGV WRCLPALARR SPARMTGAAF ALLLAAGYAA LAGFTLPTQR ALIMLGVALG ALMLRRRPTV SHGLLVALTA VLILDPLAPL GPGFWLSFGA VAIIFLLAAH RRADRSGWWV GLRLHALISL ALLPVIGWWF DELPLISAPA NMLAIPVVAF LVVPPLLLGV PLLALLPPLS EALLVFSLGV LEGLMQGLAW LAEYGQWGDP AGVRQGLWVA AAGALLLLLP PGWFGRWLGL PLLALPVAAG PQPDAASAPS QLAVLDAGRG LISVLVVADR VLVYGSGARV GRGATAAERT LVPWLEVRGL RPDYLIPGGR GSAWTGGLEA LRARYPQAHP VTTCEAGGAL PPGVRLRPVA GGCALETTLG QARVRLTPSE RPHHNGAPPM AVLVAPLTHL QQLEAGGHSP RYRIGYPVRR EEATLGNARA TAHLGHNPAV LGTVLVRPGA EGLRLECWLR DQGRYFHRPW PPGQGSTDTG EGAPSRWANP LSSRSCEPGP
|
| |
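The gene and protein sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, recompute a GC fraction, and translate the coding sequence so it can be compared with the listed protein. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, the sequence is assumed to be given 5'→3' on the coding strand (as displayed), and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code; its alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest); '*' marks stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    prefix = "ATGCGCGCCG GGATGTCGGG CTTCGCCCTG"
    print(translate(prefix))  # MRAGMSGFAL, the first block of the protein sequence
```

Passing the full "Gene sequence" value to `translate` and comparing the result with `clean` applied to the "Protein sequence" value is a quick consistency check; `gc_content` on the same input can be compared with the "GC content" field.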