Gene Mlg_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0143 
Symbol 
ID4269274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp165519 
End bp166451 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content71% 
IMG OID638124867 
Productpolysaccharide deacetylase 
Protein accessionYP_740988 
Protein GI114319305 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID[TIGR03006] polysaccharide deactylase family protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.839924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTA GGCAGGGACA AGGCCGCCAG GTCAGCGACA TGCAGGGGGT GGTGAACGCC 
CTCACCGTGG ACGTGGAGGA CTACTTCCAG GTCTCGGCGC TGGCGCCGCA CATCCCGCGC
GAGAGCTGGT CGCACCGCGC CTGCCGGGTG GAACGCAATC TCGATCGGAT CCTGGCTCTG
TTCGAGCGCC GGGATGCCCG GGCCACCTTC TTCACCCTGG GCTGGATCGC CGAGCGCTAC
CCGGCGGCGG TGCGCCGCAT CGTCGAGTGC GGCCACGAGC TGGCCAGCCA CGGCTACGGC
CATGAACGGG TCAGTGACCT GGGCCCGGCC CAGTTCCATG CCGACATCAC CCGCGCCAAG
GCCTTGCTGG AGGACATCGG CGGGGTCGCC GTCAAGGGCT ATCGCGCCCC CAGCTTCTCC
ATCGGCCGGA GCAACCTCTG GGCCCTGGAG GTGCTGGCGG AGACCGGGCA CCGCTACAGC
TCCAGCATCT ACCCGGTGCG CCACGACCAC TACGGCATGC CGGAGGCACC GCGCCACCCG
CACCGCCCCA CCGGGCGCGG CGGTATCCTG GAGCTGCCCC CCGCCACCCT GGCACTGGCC
GGGCGCAATC TTCCGGCCGC GGGCGGCGGC TACTTCCGGC TGCTGCCCTA CGCGGCCTCC
CGCGGCGCGC TGAACCGCAT CAACCGCGTC GAGGGCCAGC CGGCGGTCTT CTACTTCCAC
CCTTGGGAGA TCGACCCCGG CCAGCCCCGG ATCCCGGGCA TCGGCCTGAA GACCCGCTTT
CGCCACTACC TCAATCTGCA CCGCATGGAG GCGCGGCTGG AGCGGTTGCT GCACGACTTC
CGCTGGGACC GGATGGACCA GGTCTACGCC AGTGCCCTTG CCGGCGAACC ACAGCAGGCC
ACGCCGCCAG CCGCCGCGCC TGAACTCGCT TGA
 
Protein sequence
MTTRQGQGRQ VSDMQGVVNA LTVDVEDYFQ VSALAPHIPR ESWSHRACRV ERNLDRILAL 
FERRDARATF FTLGWIAERY PAAVRRIVEC GHELASHGYG HERVSDLGPA QFHADITRAK
ALLEDIGGVA VKGYRAPSFS IGRSNLWALE VLAETGHRYS SSIYPVRHDH YGMPEAPRHP
HRPTGRGGIL ELPPATLALA GRNLPAAGGG YFRLLPYAAS RGALNRINRV EGQPAVFYFH
PWEIDPGQPR IPGIGLKTRF RHYLNLHRME ARLERLLHDF RWDRMDQVYA SALAGEPQQA
TPPAAAPELA