Gene Mlg_1437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1437 
Symbol 
ID4269247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1641369 
End bp1643621 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content74% 
IMG OID638126193 
ProductComEC/Rec2-related protein 
Protein accessionYP_742276 
Protein GI114320593 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.715322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCCG GGATGTCGGG CTTCGCCCTG GGCGTGGTCG CCCTGCAGCA GCTGCCGTCA 
CTGCCCGGCG GGCCCTGGCT GCCGGCGGCC CTCCTGGCAC TGCCGCTGCT CGCGCTGCGC
GCCCCGGCCG TGCGACTGGT GGCGGGGCTC GCCCTTGGCT TGGGCTGGGC CACCTTGCAC
GCTCACCACG GTCTGGCCCA TCGGTTGCCC CTCGCCCTGG AGGGGCAGGA CCTCATCCTT
ACGGGTAGTG TCGCAGACCT GAGCGAACCC CGCGGCCACA GCACCCGGTT CGTCTTTGCG
CCGGATCAGG CCCGCACCCC GAACGGGGAG CCCGTCGACG CCCGCCTCCC GCGGCGGATC
CGGCTGAGTG CCTATGGACT CGCTACGCCG CCCGCCGCCG GGGAGCGCTG GCGGCTGACC
GTGCGCCTGC GGCCCCCTGC GGGCGCCTTG AACGACGGCG GCTTCGATTA TGAGCGTTGG
CTGCACCAGA ATCGGTTTGA CGCCACCGGC TACGTGCGGG CCGAGCCCGC TCCGCAACGC
CTGACGGAGG GGCGGGGGCT CCATGCCCTG CGCGAACAAA TCGCCGGAGC GATCCGCGAG
CGCGTGGGGC AGGGTGGCGC GGCGACCCTG CTGCCCGCGC TCGCGGTCGC CGATCGCAGT
GGCATGACCG AGGCCCAGTG GTCGGTGCTC GGGGCGACGG GCACCGGTCA CCTGCTGGCC
ATCTCCGGGC TCCATATCGG CCTGGTGGCC GGGTTTGGCT TTGTCGTCGG AGGCGGGGTC
TGGCGATGTC TGCCGGCCCT GGCCCGGCGC TCGCCGGCGC GAATGACTGG CGCGGCCTTT
GCCCTGCTTC TGGCGGCGGG GTATGCGGCC CTGGCCGGCT TCACCCTGCC CACCCAGCGG
GCCCTGATTA TGCTCGGCGT GGCGCTGGGG GCCCTGATGC TACGGCGCCG CCCGACGGTC
TCCCACGGCC TGCTCGTGGC GCTGACCGCG GTACTGATCC TGGACCCGCT GGCACCGCTG
GGTCCCGGGT TCTGGCTGTC GTTTGGCGCG GTCGCCATCA TCTTCCTGCT GGCTGCTCAT
CGGCGGGCCG ACCGATCGGG GTGGTGGGTG GGGCTGCGCC TACACGCCCT GATCAGCCTG
GCGCTGCTCC CGGTGATCGG CTGGTGGTTC GACGAACTGC CCCTGATCTC GGCACCGGCC
AACATGCTGG CCATCCCGGT GGTCGCCTTC CTGGTGGTGC CGCCGCTGCT CCTGGGCGTG
CCGTTGCTGG CCTTGCTGCC ACCCCTGAGC GAGGCCTTGC TGGTCTTCAG CCTCGGCGTG
CTCGAGGGGC TGATGCAGGG GCTGGCCTGG CTTGCCGAGT ACGGCCAATG GGGCGATCCG
GCGGGGGTCC GGCAGGGGCT CTGGGTGGCG GCGGCAGGCG CGCTCCTGCT GTTATTGCCG
CCCGGCTGGT TCGGGCGCTG GCTGGGGCTA CCGCTGCTCG CCTTGCCGGT GGCGGCCGGG
CCCCAGCCGG ACGCAGCATC CGCCCCGTCG CAGCTCGCCG TGCTCGATGC CGGTCGCGGA
CTGATCAGTG TGCTCGTTGT GGCGGACCGG GTTCTGGTCT ATGGCAGCGG GGCACGAGTG
GGCCGTGGCG CGACCGCGGC CGAGCGCACC CTGGTGCCCT GGTTGGAGGT TCGGGGGCTG
CGCCCGGACT ACCTGATCCC CGGGGGCCGC GGCTCGGCCT GGACCGGAGG GCTGGAGGCC
CTGCGGGCGC GCTACCCCCA GGCGCACCCG GTAACAACCT GCGAGGCCGG CGGCGCCCTG
CCACCCGGTG TCCGGCTGCG GCCGGTGGCC GGGGGCTGTG CCCTGGAGAC GACCCTGGGG
CAAGCGCGGG TGCGACTCAC CCCGTCGGAG CGGCCGCACC ACAACGGGGC GCCGCCGATG
GCGGTGCTCG TCGCCCCGCT CACCCATCTG CAGCAGCTCG AGGCCGGGGG GCACTCGCCC
CGGTACCGAA TCGGGTACCC GGTGCGGCGC GAGGAGGCCA CGCTGGGCAA TGCCAGGGCC
ACCGCGCACC TGGGCCACAA CCCGGCGGTG CTGGGCACCG TCCTCGTCCG ACCCGGGGCA
GAGGGGCTGC GGCTGGAGTG CTGGTTGCGG GATCAGGGGC GCTACTTCCA CCGCCCGTGG
CCGCCGGGGC AGGGGAGCAC TGACACGGGC GAGGGCGCTC CAAGTCGGTG GGCAAATCCT
TTATCATCGC GGTCATGCGA ACCGGGCCCC TGA
 
Protein sequence
MRAGMSGFAL GVVALQQLPS LPGGPWLPAA LLALPLLALR APAVRLVAGL ALGLGWATLH 
AHHGLAHRLP LALEGQDLIL TGSVADLSEP RGHSTRFVFA PDQARTPNGE PVDARLPRRI
RLSAYGLATP PAAGERWRLT VRLRPPAGAL NDGGFDYERW LHQNRFDATG YVRAEPAPQR
LTEGRGLHAL REQIAGAIRE RVGQGGAATL LPALAVADRS GMTEAQWSVL GATGTGHLLA
ISGLHIGLVA GFGFVVGGGV WRCLPALARR SPARMTGAAF ALLLAAGYAA LAGFTLPTQR
ALIMLGVALG ALMLRRRPTV SHGLLVALTA VLILDPLAPL GPGFWLSFGA VAIIFLLAAH
RRADRSGWWV GLRLHALISL ALLPVIGWWF DELPLISAPA NMLAIPVVAF LVVPPLLLGV
PLLALLPPLS EALLVFSLGV LEGLMQGLAW LAEYGQWGDP AGVRQGLWVA AAGALLLLLP
PGWFGRWLGL PLLALPVAAG PQPDAASAPS QLAVLDAGRG LISVLVVADR VLVYGSGARV
GRGATAAERT LVPWLEVRGL RPDYLIPGGR GSAWTGGLEA LRARYPQAHP VTTCEAGGAL
PPGVRLRPVA GGCALETTLG QARVRLTPSE RPHHNGAPPM AVLVAPLTHL QQLEAGGHSP
RYRIGYPVRR EEATLGNARA TAHLGHNPAV LGTVLVRPGA EGLRLECWLR DQGRYFHRPW
PPGQGSTDTG EGAPSRWANP LSSRSCEPGP