Gene Mlg_2681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2681 
Symbol 
ID4269556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3035976 
End bp3037223 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content69% 
IMG OID638127440 
Productpeptidase M42 family protein 
Protein accessionYP_743511 
Protein GI114321828 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000914613 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCAACC ACGCCATTCC GCCACCCTGG GTCCGGCCCA TGCCCGAGGC GCAGTTTCAG 
CTCATGCGCC GGATCCTTGC AGCGCCCAGC CCCGTGGGAC TGGAGGCGGC CATGACGGAG
GGCGTGCTCT GCCCCCATTT CCGCTCCTTT GCCCCAGAGA GCTGGCAACT GCAGCGGTTC
CAGGGACACG CCGGTATCGT GCTGGACACC CATCCCGGTG ACGACGAGCG CTTCCGGCTC
ATGATTGTCG GTCACGCCGA CAAGATCCGC CTGCAGGTCC GGAGTATCGG CGACGACGGC
AAGGTCTGGA TCAACAGCGA CGGCTTTCTG CCCGCCACCC TGATCGGCCA CGAGGTGCGC
CTGTTCACCG AGAACCCCGG CCGTCCGGGC CACTACCGGG TGATCGACGG GGGCACGGTG
GAGGCCCTCG GCGCCATCCA CTTCGCCGAG CCGGAGCTGC GCAGTGGCGA AAAGGGGGTG
AAGAAAGAAC AGCTCTACCT GGAGCTGCAG GTGCACGGCG ATGACCGCAA GGCCCAGGTG
GAGCACCTGG GGGTCCGGCC CGGCGATACC CTGCTGCTCA ACCGGCCCAT CCGGCGCGGC
TTCAGCCCGA ATACCTTTTA CGGCGCCTAC CTGGACAACG GCCTGGGCAG CTTCGTCACG
GCCGAGGCCG CCCGGCTGCT GGTGGAGGCC GGCGCCCCGG CGAGCATCCG TGTGTTGTTC
GCCATTGCCG GGTATGAGGA GATCGGCTGC TTCGGCAGCC GGGTGCTGGC GGCCCACTAC
CGGCCGGATG CCCTGATCGC GGTGGACGTG GAGCATGATT ATCGGGCCGC CCCGCAGGTG
AGTGACCGGC GGCTGCCGCC CTTGGAGATG GGCAAGGGCT TTAGCCTGTC GGTGGGCTCC
ATCGTCAGTG AGCAGTTGAA CCAAGTGATC GAGGAGGGGG CTCGGAGCCG TGGGATCCCC
AGCCAGCGTG ACGTGGTGGG TCCGGATACC GGCACGGACG GCATGGCCGG GGTGCTGGCC
AATGTTGACT GCGCCGCTGC CTCAGTGGGC ATCCCCATCC GCAACATGCA CACCATCTCC
GAGACCGGCC ACACCTCGGA TGTGCTCGCG GCCCTGCACG GTGTGGTGGA GGCGGCCCTG
GCGCTGGACG CCGCCGGCAC GGACCCGGAG GCGCTGCGTC GGCGCTTTCG CGAACACCAT
CCGCGGCTGG ACCAGGCCGC CCCCCTGCGC CACCCGGGCC CCGCCTGA
 
Protein sequence
MPNHAIPPPW VRPMPEAQFQ LMRRILAAPS PVGLEAAMTE GVLCPHFRSF APESWQLQRF 
QGHAGIVLDT HPGDDERFRL MIVGHADKIR LQVRSIGDDG KVWINSDGFL PATLIGHEVR
LFTENPGRPG HYRVIDGGTV EALGAIHFAE PELRSGEKGV KKEQLYLELQ VHGDDRKAQV
EHLGVRPGDT LLLNRPIRRG FSPNTFYGAY LDNGLGSFVT AEAARLLVEA GAPASIRVLF
AIAGYEEIGC FGSRVLAAHY RPDALIAVDV EHDYRAAPQV SDRRLPPLEM GKGFSLSVGS
IVSEQLNQVI EEGARSRGIP SQRDVVGPDT GTDGMAGVLA NVDCAAASVG IPIRNMHTIS
ETGHTSDVLA ALHGVVEAAL ALDAAGTDPE ALRRRFREHH PRLDQAAPLR HPGPA