Gene Mlg_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0801 
Symbol 
ID4270565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp895275 
End bp897281 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content68% 
IMG OID638125552 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_741645 
Protein GI114319962 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.950377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCATT ATCTTGCAGG TCTTCCGCGC CCCGTAAAGC AGGGGCTGAT GGTCCTGGCC 
GACGCTGCTG TAGTGGTCCT GGCCCTTTGG GCCGCTCTGG TCCTTCACGC GGGGGTCGGC
CCGGCCAGTT TTGATCCCAC CCTGTGGGCG TTGTATGCGC TGGTGCCCAT CGTGGCCATC
CCCCTGTTCG CACTCTTCGG TCTTTATCGC GCCGTGGTCC GATGGATGGG TGCCCACGCC
CTGGCCACCA TCACCAAGGG GGTGATGGCC TCCACCCTCG CCCTGACCGT AATCATTTCG
GTAATGCCGC AATGGACACT CAGCCTTCCG GTTGTCCTCA ATTTCGCCCT GCTCAGCCTG
TGTGCCGTTG GCGGCCTGCG CATCGTGGTG CGAGGCTGGT TCCGGCGCGC GGCCAACGGA
GAGGGAGATC GCCACCGGCA GCCGGTGGTG ATCTACGGGG CCGGCAACTG CGGCATCGAG
CTGGCGAGCA GCCTGATCGG CAATACCCAT TACCGCCCGG TGGCCTTTGT CGACGATGAC
CCGCGCAAGC AGGGCACCAT CATCCAGGGA GTCAGCGTGC ATCCTCCGCA ACGCCTGCCG
GCATTGATCG ACCGTGAGCG GGTTGCCTAC GTGCTGCTGG CCATGGCCCG ACTGGCCCGG
CCGGAGCGAC GCCGCATCGT CGAGCGGCTG GAGCCCCTCA ACGTGCACAT CCTCACCATC
CCGCCGCTGT CCGATCTGGT CACCCGACGC GCCGGCATGG AGGACGTGCG CGAGGTGGAG
GTGGAGGACC TGCTGGGCCG GGATGCGGTG GCCCCGCGGC CGGAACTGCT GCGGCGCTGT
ATCGCCGGTC ATGCAGTCAT GGTGACCGGG GCCGGCGGCT CCATTGGCTC TGAGCTCTGC
CGGCAGATCC TCCGCCTGCG GCCACGGCGA CTGGTCCTGG TGGAGCGCAG TGAGTTCGCG
CTCTACGCCA TTGAGAAGGA GCTCCGCTAC CAGATGGAGG GTCAGGCGCA CCAACCGGAG
CTGCACCCGG TGCTCGGGGA TGTGACCGAC GGCGAGCGCA TGCAGGCGCT GATGAGCGCC
TTCGCCATCC ACAGCGTCTA CCACGCCGCC GCCTACAAGC ACGTACCACT GGTCGAGGCG
AACAGCCTGC AGGGCATCCA TAACAACGTC TTCGGCACCC TGCGCACGGC GGAGGCCGCG
GCCGCCGCCG GCGTCCCCCA CTTCGTGCTG ATCAGTACCG ACAAGGCGGT GCGCCCCACC
AATGTCATGG GCGCGAGCAA GCGGATGGCG GAACAGGTGG TTCAGGACCT GGCCCGCCGG
CAAGGTGTCC CCACGGTGTT CTCCATGGTC CGCTTCGGCA ATGTCCTCGG CTCGTCCGGC
TCAGTCGTGC CGCTGTTCCG CGAACAGATT CGCAAGGGCG GTCCGGTCAC GGTCACCCAT
CCCGAGGTCA CCCGCTACTT CATGACCATC CCCGAGGCCG CCTCCCTGGT GATCCAGGCC
GGCGCCATGG CGCGGGGCGG CGAGGTGTTT GTGCTGGACA TGGGCGAGCC GGTCCGGATC
GACGACCTCG CCCGGCGGAT GATCCGGCTG TCCGGGCTGA CGGTGCGCGA CGAGGCGCGC
CCGGCGGGCG ATATCGAGAT CCGTTACACC GGCCTGCGCC CGGGCGAGAA GCTCTATGAG
GAATTGCTGC TGGGCGAGGC GGTGACCGGC ACCGACCACC CCATGATCTG CCGGGCCAGC
GAGGCGCGGC TACCCGCGGA TGGCCTGCAG CGCCTGCTGG AGGATCTGCG CCGGGCCGCA
GCCCGGTTCG ATTGTGAGGC CGCCCGCCAG TTGCTGGCCG GGGCGGTGGA GGGCTATGAG
GCCCCCGGCC CGTGCAACGA CGTGCTGGGC CGGCGATTGG CAGTGGCGGC GGCCAAGCCG
TCGCCGGCGA TCAGTCCTTG GCCCGCGCCT TGTACTGGTA CTGATAATAG TAGTAATCGC
TCGAGCGCCC ACTCCGTTTC GGGTTGA
 
Protein sequence
MHHYLAGLPR PVKQGLMVLA DAAVVVLALW AALVLHAGVG PASFDPTLWA LYALVPIVAI 
PLFALFGLYR AVVRWMGAHA LATITKGVMA STLALTVIIS VMPQWTLSLP VVLNFALLSL
CAVGGLRIVV RGWFRRAANG EGDRHRQPVV IYGAGNCGIE LASSLIGNTH YRPVAFVDDD
PRKQGTIIQG VSVHPPQRLP ALIDRERVAY VLLAMARLAR PERRRIVERL EPLNVHILTI
PPLSDLVTRR AGMEDVREVE VEDLLGRDAV APRPELLRRC IAGHAVMVTG AGGSIGSELC
RQILRLRPRR LVLVERSEFA LYAIEKELRY QMEGQAHQPE LHPVLGDVTD GERMQALMSA
FAIHSVYHAA AYKHVPLVEA NSLQGIHNNV FGTLRTAEAA AAAGVPHFVL ISTDKAVRPT
NVMGASKRMA EQVVQDLARR QGVPTVFSMV RFGNVLGSSG SVVPLFREQI RKGGPVTVTH
PEVTRYFMTI PEAASLVIQA GAMARGGEVF VLDMGEPVRI DDLARRMIRL SGLTVRDEAR
PAGDIEIRYT GLRPGEKLYE ELLLGEAVTG TDHPMICRAS EARLPADGLQ RLLEDLRRAA
ARFDCEAARQ LLAGAVEGYE APGPCNDVLG RRLAVAAAKP SPAISPWPAP CTGTDNSSNR
SSAHSVSG