Gene Mlg_2475 details


Gene Information

Locus tag: Mlg_2475
Symbol:
ID: 4268416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2815200
End bp: 2816240
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 71%
IMG OID: 638127233
Product: NlpBDapX lipoprotein
Protein accession: YP_743305
Protein GI: 114321622
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3317] Uncharacterized lipoprotein
TIGRFAM ID:
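
The start, end, and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions that happen to agree with the listed values; the record itself does not state them.

```python
# Values copied from the record.
start_bp = 2815200
end_bp = 2816240

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1041
print(protein_length)  # 346
```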


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACTGA TTATCCGATT TCTGGCCGCC TCCACGGCGG CCCTGGTGCT GGCGGCCTGT 
GGCCGGCCGG AGATGGACCC GGAACGCCTG GATTATCCGG AGCGTCTCAA GCTGCCACCG
GACCTGACCG CGCAGTACTC CGACGCCATG GGCGTGCCCG AGGTGCAGCG CGGCCGCCCC
GATAGCCCCG AGGCGCTGTC GCGCACGTTG CTGCCCGATC CCGCCGCCGA TGTGCGCGTC
CAGCGGGCCG GAGGCCAGCG CTGGCTGGAG CTGCAGGCCG CCCCGGAGGA CGTCTGGGAG
TGGCTGCAGC AGTTCCTGGA GGTGCAGGAT ATCCCGGTGG CCGAGGTCTC GCCTCCGCTG
GGGCTGGTCG AGACGGAGTG GCTGGAGGGG CGGGTGGATG CCGGTCGCGG TGTCTTCACC
CCGGTGACCG TGGATGAGGC CGGCGACGGT GCCCTGGCCG AGCGTTTCCG CTTCCGGCTG
GAGCCGGTCG ACGGCGGAAC CTCCGAGCTG CACGTTGCCC ACAGCGCCGC CGTCCGCGAG
GACGAACGCT GGCGATGGCG CCCTTCCGAC AGCTTTGCCG AGGCGGAGGT GCTGCGCGGC
TTCATGGTCT ATCTGGGCCT CCGGCAGACC GAGGCCGCCC GGCAACTGGG CACGCTGGAG
GCCACTCCGC GGGCCCGGCT CGACCAAGAC GACGAGGGCC GGGCGGTGCT GCTGCTCGAC
GACGAACCCG CGACCGCCTG GCGCCGGACC GGTCTGGCGC TGGACCAGCT CGGTTTCACC
GTGGACGATC GGGACCGCAG CGCGGGCGAG TATCTTATCC GCTATGATCC GCACGCGGAG
GAGGGGGCGC CGGAGCGGGG TTTCCTGGAT CGATTCGCGT TCTGGCGTGA GGACGAACCG
GAGGGGCCTC AGCCCTACCG CCTGTTGCTC GAGTCCCGGA CCGGGGGCAG TCAGCTGGTG
GTGGAGACCG TGGACGGGGC GCCTATTGAC GAGGACCTGG CCCAGCGCCT GTTGGCGTTG
CTCTCCGAGC AGATGCGTTA G
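
The GC content field can be checked against the gene sequence above. The following is a minimal sketch; the helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tooling. The sequence is printed in space-separated blocks of ten, so whitespace has to be removed before counting. Only the first line of the sequence is used here as a short demonstration; pasting the full sequence gives the gene-wide value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a demonstration input.
first_line = "ATGCGACTGA TTATCCGATT TCTGGCCGCC TCCACGGCGG CCCTGGTGCT GGCGGCCTGT"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after removing spaces
print(round(gc_content(seq), 1))   # percent GC of this fragment only
```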
 
Protein sequence
MRLIIRFLAA STAALVLAAC GRPEMDPERL DYPERLKLPP DLTAQYSDAM GVPEVQRGRP 
DSPEALSRTL LPDPAADVRV QRAGGQRWLE LQAAPEDVWE WLQQFLEVQD IPVAEVSPPL
GLVETEWLEG RVDAGRGVFT PVTVDEAGDG ALAERFRFRL EPVDGGTSEL HVAHSAAVRE
DERWRWRPSD SFAEAEVLRG FMVYLGLRQT EAARQLGTLE ATPRARLDQD DEGRAVLLLD
DEPATAWRRT GLALDQLGFT VDDRDRSAGE YLIRYDPHAE EGAPERGFLD RFAFWREDEP
EGPQPYRLLL ESRTGGSQLV VETVDGAPID EDLAQRLLAL LSEQMR
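
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, written without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The function name `translate` is hypothetical, and the sketch does not handle alternative start codons, which is acceptable here only because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and its last 12 bases.
print(translate("ATGCGACTGA TTATCCGATT TCTGGCCGCC"))  # MRLIIRFLAA
print(translate("CAGATGCGTTAG"))                      # QMR
```

Both outputs match the corresponding start and end of the protein sequence listed above.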