Gene Mlg_0800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0800 
Symbol 
ID4270564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp893186 
End bp895351 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content67% 
IMG OID638125551 
Producthypothetical protein 
Protein accessionYP_741644 
Protein GI114319961 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAT ACGAACACCG CCCGACGTCT TCGGACCGTG ATGACGAGAT CGAGATCGGC 
CGTCTGGTCG AGATCCTGCT GAACGGCAAA TGGATCATTG CCGGTCTCAG TGCATTGGGG
CTGGCCCTGG GCGGGTTCTA TGCCTACACC AGCACACCGG TCTACAGCGG CGACATGCTG
CTGCAGGTGG AGAGTAAGCA GGCGACGCTA CCGGGGCTGT CGGAGATGCT GGGCGGCGAA
CAGCAGGTGC CCACCAGTGC GGAGGTGGAG CTGCTGCGAT CGCGCATGGT GGTGGGTTCG
GTGGTGGACC AACTGGACCT GGCCGTGCAG GCTCGGCCCT TGCCGACGGA CTGGCGCGAG
GCGGTGCTGG GTGTGAGCAA CCCCCGGCGC CACATTGAGG TGGGCCGCTT CGAGGTCTCG
CCGGCCCTGG AGGGCCAGAC GTTCATCGTC CGTGTGCTGG GCGAGGGGCA GTTCGCCCTG
CTTGAGGACG ACGGCGGGGC GGAACTGGCG CGCGGGCGGG TGGACGAGAC CCTGGTGCTG
GACAACCCGG ACGGCAGCTC GCTTCAGCTC TTTGTCCACG TGCTCACCGG CGAGCCGGGC
GACACCTTCG AGCTGGTGCG CCAGCCGCGG CTGCGCGCCA TTCAGAACCT GCGCAACGGC
TTGAACGTCA GTCAGCGCGG TGAGTCGGGG ATCCTGGAGG TCACTTACGA GCACCCGGAC
CGCCAGCTTA TCGAGGAGGT CCTCAACACC CTGGGTATGG TCTACGTGCG CCAGAACGTG
GAGCGTCGGA CCGAGGAGGC GGCCCGCAGC CTGGAGTTCC TGGAGGAGCA GCTGCCACAG
TTGCAGGACC GACTGCAGCA GGCCGAGGAT GCCTTCAACG CCTTCCGCCG CGAGCACCAG
GCGGTGGATA TGGATGAGAA CACCCGGGTG ATGCTCAGCC AGTTGGTCGA GGTGGAGAGC
GAACTCCAGG CCTTGCGCCT GGAGGAGAGC GAGAAGAGCC TGCGCTTTGG TCGCGAGCAC
CCGCAAATGC AGTCGCTGCG CCAACGGCGC CAATCGCTGG AGACCCTGCG TGCCGAGCTG
GAGGAGGAGC TGGGGGAACT GCCGGAGCGC CAGCAGCAGT TGGTGCGCCT GCGCCGCGAG
GTGGAGGTGA ACACCCAGCT CTACACCAAC CTGCTCAACA CCGCCCAGGA GTTGCGGATC
TCCCAGGCCG GCACGGTTGG CAATGCCCGC GTGGTGGATG ACGCGGCCGT GGGCTTCAAT
CCAGTGGCGC CGCAGACCAC CCTGATTATG GCGCTGAGCC TGCTCCTGGG CGGCATGCTG
GGGGTGGGCA CCGTGTTCGG CCGCGAGATG CTCCGTCGCG GGGTCGAAGA CCCGGACGCC
CTGGAACTGG AGGTGGGCAT CCCGGTCTAC GCCGTGGCGC CCCACAGCCC GGCGGCCCTG
CAGCTCGAGA AGAAGGGCCG TCGACAGCGG ACCCAGGTGC CGCTGCTGGC GGAGAAAAAC
AGCCAGGACC CGCTGGTGGA GAGCCTGCGC AGTCTGCGCA CCAGCCTGAA CTTCGCTCTG
CTTAACAACG CCCGTAACGT CCTGGCCCTG ACCAGTACCG GCCCGGGGGA GGGCAAGACC
ACCCTGTCCG TCAACTTGGC GGCGGTCCTG GCCCAGAGCG GCCAGCGGGT GCTGCTCATC
GATACCGACA TGCGCCGGGG CCATCTGCAT ATCTTTCTGC AGAACCGGCG GCGCGAGCCG
GGCCTGTCCG GGGTGCTGGC CGGGCAGGCC ACACTGGAGG AGGCGGTCTC ACGCATCAGG
GAGAATCTCG ACGTGCTGCC CGCCGGCACC TTCCCGCCGA ACCCCTCGGA GCTGCTGATG
CAGGAGGGAT TCGGCCGGCT GATCGAGGAG CAGCGCCAGC GCTATGACCT GGTGATCCTG
GACACCGCGC CAGTGATGCC GGTCACCGAC GGCGTCCTGG CGGCCGCCCA CGCCGGGCCG
GTGTTCCTGG TGGCCCGGGC GGGCTATGTG ACCACCCGCG CGGTGCAAAG CACCATCTGG
CGGCTGGAGA AGAACCAGAT CGACACCACC GGGCTCGTGG TCAACGACCT CAACCCGAAA
CGGAGTGGGC GCTCGAGCGA TTACTACTAT TATCAGTACC AGTACAAGGC GCGGGCCAAG
GACTGA
 
Protein sequence
MSQYEHRPTS SDRDDEIEIG RLVEILLNGK WIIAGLSALG LALGGFYAYT STPVYSGDML 
LQVESKQATL PGLSEMLGGE QQVPTSAEVE LLRSRMVVGS VVDQLDLAVQ ARPLPTDWRE
AVLGVSNPRR HIEVGRFEVS PALEGQTFIV RVLGEGQFAL LEDDGGAELA RGRVDETLVL
DNPDGSSLQL FVHVLTGEPG DTFELVRQPR LRAIQNLRNG LNVSQRGESG ILEVTYEHPD
RQLIEEVLNT LGMVYVRQNV ERRTEEAARS LEFLEEQLPQ LQDRLQQAED AFNAFRREHQ
AVDMDENTRV MLSQLVEVES ELQALRLEES EKSLRFGREH PQMQSLRQRR QSLETLRAEL
EEELGELPER QQQLVRLRRE VEVNTQLYTN LLNTAQELRI SQAGTVGNAR VVDDAAVGFN
PVAPQTTLIM ALSLLLGGML GVGTVFGREM LRRGVEDPDA LELEVGIPVY AVAPHSPAAL
QLEKKGRRQR TQVPLLAEKN SQDPLVESLR SLRTSLNFAL LNNARNVLAL TSTGPGEGKT
TLSVNLAAVL AQSGQRVLLI DTDMRRGHLH IFLQNRRREP GLSGVLAGQA TLEEAVSRIR
ENLDVLPAGT FPPNPSELLM QEGFGRLIEE QRQRYDLVIL DTAPVMPVTD GVLAAAHAGP
VFLVARAGYV TTRAVQSTIW RLEKNQIDTT GLVVNDLNPK RSGRSSDYYY YQYQYKARAK
D