Gene Mlg_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0533 
Symbol 
ID4268062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp576720 
End bp578396 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content70% 
IMG OID638125274 
Productsurface antigen (D15) 
Protein accessionYP_741377 
Protein GI114319694 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0729] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.478623 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCGG CGGCCCATGG CGTGGAGGTC CGGGTGGAGG GCGTCAGTGG CGCCCTGCGG 
GACAATGTCG AGGCCTGGCT GGGCGAGCCC GCCGGCGACA GCCGCCGGGC CCTGCGCACT
TACGAGCGCC AACTGCCGGA ACGGGCCGCT CAGGCCCTCC AGGCGCTGGG TCACTACCGG
CCGCAGATCG ACGTCGAGCG CGAGGAGACC GATAACGGGC CGCGGTTCAT CCTGCGCATC
GATCCGGGCG AGCCGGTGCG CATCGCCGCG GTGGATCTGC GCATCGAGGG CGAGGCGCGT
GACGACCCGG CTTTCGAGGG CATCCAGGCG CGGCTCGCCG TGCAGCCGGG TGACGTGCTG
CGCCATGACC GCTACGAGAC GGCCCGCCGG CAGCTGCAGA GCCTGGCGCT GGACCGGGGC
TACTTCGATG CCCGCTACAC CCGGCGGCGG GTGGAGGTGG ACGTGGCGGC CGGTGAGGCC
ACTGTGATGC TCCACTTCGA TACCGGTCGT CGCTATCGGC TCGGGGAGGT GACGTTCTCC
GAGACAGCGC TGGCCCCCTG GTTCCTTCAG CGGCTGGTGC CCTTCGAGCC CGGCGAGCCC
TACCGGGCAG AGCACATCAC CGCCCTCAAC CGGGCCCTCC GGGACAGCGG GTACTTTGCC
CGGGTCACCG TCCGCCCTGA GCCCCGGGAG GCCGACGAGG CCCTGCGGGT GCCGGTGGAG
GTGGAGCTGA CCGCCGAACG CGCCCACCAG GTCCGTCTGG GGGCGGGCTT CTCCACCGAT
GTCGGACCCC GCATCCGTGC CGGCTGGTCC CGGCCCTGGG TCAATCAACG GGGCCATAGC
CTGTCGGTGG ATACCGAGCT CTCGGAGCCG CGCCAAAACA TCTCCACCCG GTACAAGATC
CCGCTGGCCG ACCCGCTGCG CACCCAACTG ATCCTCCAGG CGGGTTTCCA GTTCGAGGAC
ATTGAGGACA CCGAGAGCGA GCTGCTGACC GTCTCCGTGC AGCACCAGCA CCGCTTCGAC
AGCGGTTGGC AGCAGAACCT GGGGCTCCGC TGGGACCGGG ACCGGTTCAC GGTCTCCGAC
GACACCCGCA CCACCACCCT CTATCTGCCC AGCGGCAGCT GGACCCGCAA CCGGGCCCGG
GGCGGCGCCG ACCCCTACTG GGGCGATCGC CTGCTGTTCA GTGTCGAGGG CACGGACGAG
TGGATGGGCT CCGATATCGA CCTGCTCCGG GTGCGCACCG GGGCCCGGCT GCTGCGGAGT
TTTGCGGACA ACCACCGGAT CCTGGTCCGT GGCGACTTGG GTGCGCTCAT CTCCAGCCAG
TTCGGCAAGG TGCCAACGTC CCTTCGCTTC TTTGCCGGCG GCGATCAGAG CGTGCGCGGT
TACCGCTACC AGACTCTGGG GCCGGAGGAT GCCGAAGGCG ATGTCATCGG CGGCCGCTAT
CTGGCGGTGG CCAGTGCCGA GTACGGCTAT ACCTTCCGGC CCCGCTGGCG GGCGGCCGTC
TTCGCCGATG CCGGCAACGC CTTTGACGAT CTGGACGACC CCGACCCACA GGTGGGGGCC
GGGTTCGGTA TCCGCTGGAT CTCGCCGGTG GGCCCGATCC GGCTGGACTT CGCCTCGGCG
CTCAGCAAAT CGGGCAACCC CTGGCGGCTG CACTTCTCCA TGGGGCCGGA GATATGA
 
Protein sequence
MAPAAHGVEV RVEGVSGALR DNVEAWLGEP AGDSRRALRT YERQLPERAA QALQALGHYR 
PQIDVEREET DNGPRFILRI DPGEPVRIAA VDLRIEGEAR DDPAFEGIQA RLAVQPGDVL
RHDRYETARR QLQSLALDRG YFDARYTRRR VEVDVAAGEA TVMLHFDTGR RYRLGEVTFS
ETALAPWFLQ RLVPFEPGEP YRAEHITALN RALRDSGYFA RVTVRPEPRE ADEALRVPVE
VELTAERAHQ VRLGAGFSTD VGPRIRAGWS RPWVNQRGHS LSVDTELSEP RQNISTRYKI
PLADPLRTQL ILQAGFQFED IEDTESELLT VSVQHQHRFD SGWQQNLGLR WDRDRFTVSD
DTRTTTLYLP SGSWTRNRAR GGADPYWGDR LLFSVEGTDE WMGSDIDLLR VRTGARLLRS
FADNHRILVR GDLGALISSQ FGKVPTSLRF FAGGDQSVRG YRYQTLGPED AEGDVIGGRY
LAVASAEYGY TFRPRWRAAV FADAGNAFDD LDDPDPQVGA GFGIRWISPV GPIRLDFASA
LSKSGNPWRL HFSMGPEI