Gene Mlg_1913 details


Gene Information

Locus tag: Mlg_1913
Symbol:
ID: 4270114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2178996
End bp: 2180264
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 67%
IMG OID: 638126669
Product: peptidase M42 family protein
Protein accession: YP_742747
Protein GI: 114321064
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
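
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2178996
END_BP = 2180264


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1269 422
```

Under those assumptions the result agrees with the listed 1269 bp and 422 aa.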


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATG CCCCCTGGAC GAACCCCATG CCCGAGCCGC AATTCGAGCT GATGCGGCGC 
ATCCTGGCCG CCCCCAGCCC GGTGGGGCTG GAGGGCGCCA TGACCTACGG CGTGCTCAAG
CCCCACTTCG AGGGGTTCGC GCCCGCGGAC TGGCACCTGC ACCAGTTCAA GGGCCACGCC
GGCGTGGTGC TGGATACCCA CCCGGGCCGT GATGACCTGT TCAAGGTCAT GGTGATCGGC
CACGCGGACA AGATCCGCAT GCAGGTCCGC TCCATCGGCG ACGACGGCAA GATCTGGATC
AACACCGACG CCTTCCTGCC CAACGTACTG GTCGGCCACG AGGTCACGCT CTTCAGCGAG
GACCCCGAGG CCCCGGGCCA ATACCGGCGC ATCGAGGGCG GCACCGTGGA GGCGCTGGGC
GCCATCCACT TCTCCGACCC GAAGCAGCGC ACCGGCGAGC AGGGCATCAA GAAAGAGCAG
ATCTACCTGG ACCTGCAGAT CCACGGCGAA AACAAAAAGC AGCAGGTGGA GAACCTGGGC
GTGCGCCCCG GGGATTCGAT CCTGTTCAAC CGCCCCATCC GCCACGGTTT CAGCCCCGAC
ACCTTCTATG GCGCCTACCT GGACAACGGC CTGGGCTGCT TCGTCACCGC CGAGGTGGCC
CGGCTGATCG CCGAGGCCGG CGGCACGGAA AAGGTCAGGG TGTTGTTCGC CATCGCCAGC
TACGAGGAGA TCGGCCGCTT CGGCAGCCGG GTACTGGCCG GGGAGCTCAA GCCCGATGCC
ATCATCGCCG TGGACGTGAA CCACGACTAC GTGGCCGCCC CCGGTATCGG CGACCGGCGC
ATGCAGCCGC TGGAGATGGG TAAGGGCTTC ACCCTGTCGG TGGGTGCCGT GGCCAGCGAG
CAGCTCAACC GGATCATCGA AAGCACCGCC AAGGCGCAAC AGATCCCCAT GCAGCGCGAC
GTTGTGGGGA ACGACACCGG TACCGACGGC ATGGCCGGCG TGCTCGCCTC CGTGGACTGC
GTGGCCACCT CCATCGGCTT CCCGATCCGG AACATGCACA CCATCTCCGA GACCGGCAAC
ACCCGCGATG TGCTGGCGGC CATCCACGCC ATCACCCGCA GCCTGCAGGC GCTGGACGCC
CTGGCGGATC CGCATCGGGA GTTCCTGGAC AACCACCCAC GCCTGGACCA GGCCAATTCA
CTGGGCCATC AGGGCGGAGA GAAGCCGGAT GACGGCGAGC CGTCCACAAC GCCGGAGAAA
ACCACCTGA
 
Protein sequence
MSNAPWTNPM PEPQFELMRR ILAAPSPVGL EGAMTYGVLK PHFEGFAPAD WHLHQFKGHA 
GVVLDTHPGR DDLFKVMVIG HADKIRMQVR SIGDDGKIWI NTDAFLPNVL VGHEVTLFSE
DPEAPGQYRR IEGGTVEALG AIHFSDPKQR TGEQGIKKEQ IYLDLQIHGE NKKQQVENLG
VRPGDSILFN RPIRHGFSPD TFYGAYLDNG LGCFVTAEVA RLIAEAGGTE KVRVLFAIAS
YEEIGRFGSR VLAGELKPDA IIAVDVNHDY VAAPGIGDRR MQPLEMGKGF TLSVGAVASE
QLNRIIESTA KAQQIPMQRD VVGNDTGTDG MAGVLASVDC VATSIGFPIR NMHTISETGN
TRDVLAAIHA ITRSLQALDA LADPHREFLD NHPRLDQANS LGHQGGEKPD DGEPSTTPEK
TT
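
The relationship between the gene sequence and the protein sequence can be sketched in Python as follows. This is an illustrative sketch, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the usual TCAG ordering; amino acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAGCAATG CCCCCTGGAC GAACCCCATG"))  # MSNAPWTNPM
print(translate("ACCACCTGA"))                         # TT
```

Passing the full gene sequence block to `translate` and `gc_percent` should allow the protein sequence and the listed GC content to be compared against the record.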