Gene Mlg_2498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2498 
Symbol 
ID4270817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2838749 
End bp2840062 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content67% 
IMG OID638127256 
Productaromatic hydrocarbon degradation membrane protein 
Protein accessionYP_743328 
Protein GI114321645 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.953303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC TTACCGCAAC CACTGTTCGC GGCCTGATCA CCGGCGCGGG TCTTGGGGGC 
CTGCTGATCC CGGGGCTGGC CCTCGCCACC AACGGCTACC AGCTCACCGG CCTGGGCAGC
CACGAAAAGT CGCTCGGCGG GGCGGTCACC GCCGCACCGC GCAGCGCGAT GACGGCCATC
AGCAACCCGG CCGGTATCGG CCGCATCGGG TCCCGCGTCG ACTTCTCCAT GGAGGTCTTC
AGCCCCGAGC GGAGCACCGA CTTCCGGGCC CTCGGGGGCG AGAAGGTCAC CAGCGACACC
GACACCTATA TCATTCCGAG CCTGGGCTGG GCGGCCCCCA TCACCGAAGA CCGCCGCCTG
TGGTTCGGCG GGGGCTTCTT CGGCACCTCC GGGCTCGGTG TCGATTACGC GGTGACAGAC
GTCATGCCCA ACGGGCAGCT CATGAACGGC CACACCCAGT GGGACGGCTA CAGCTCGATC
TTCTTCGCCC AGATGACGCC GGTGCTTTCA CTGCGGGTGA ACGACCGCCT CACCGTTGGC
GCCGGCCCGG TGCTCGCGCG CCAGCAGGTG GCCCTGAAAC AGCGCTTCCA CGACATGCCG
GTCGGGCCGG GCATGGTGAT GGACACCAAC TTTGACCTCA GCAAGGCCAG CAGCGCCCTT
GGTGCTGGTG TCAGCCTGGG CCTGATCTAC GACCTTGGCA CCCGGTGGCG GCTGGGCGCC
ACCTACCAGA GCAAGATCCA CTTCGAAGAC CTGCGCTACA ACCTGGCCGC CGGCGACATT
CATGGCCAGG ACAGCAACGG CGAGTTCGTC GACGGCGAGG CGGGCACCTG GCGGCTGGGC
CTCGACTACC CGCAACAGGC CAGCGTGGGC CTGGCCTGGG CGGCAAACAA CACCCTCACC
CTCTCCGCCG ACGTGAAGTG GCTCAACTGG TCCGACACCA TGGATGAGTT GACCGTAAAG
GGCCCCAATG GCAGCCGCTT CGCCCTGGAC CCCGGCTGGG ACGACCAGTG GGTCTTCGCC
GCCGGCGCGG AGTGGGTGGT GAACCCCGAG CGGCTCACCC TTCGCGCCGG CGTCAACTAC
GCCGAATCCC CCCTCGATGA CGAGGACGTG GCCACCAACC TCCTGCTACC GGCGGTGGTG
GAACGCCATG TCGCCCTCGG CGGCACAGTG CGGATGGTCA ACGGCTGGGA CCTGGGCTTC
CACCTCAAGC ACGCCCTGAA GAACAAGCAG ACCCAGGACG GCGGCCCCTT TGACGGCGTC
TCGGTGGAGA TGGACCAGTG GTCCGCCGGA CTCAATATCG GCTACGCCTT TTGA
 
Protein sequence
MKRLTATTVR GLITGAGLGG LLIPGLALAT NGYQLTGLGS HEKSLGGAVT AAPRSAMTAI 
SNPAGIGRIG SRVDFSMEVF SPERSTDFRA LGGEKVTSDT DTYIIPSLGW AAPITEDRRL
WFGGGFFGTS GLGVDYAVTD VMPNGQLMNG HTQWDGYSSI FFAQMTPVLS LRVNDRLTVG
AGPVLARQQV ALKQRFHDMP VGPGMVMDTN FDLSKASSAL GAGVSLGLIY DLGTRWRLGA
TYQSKIHFED LRYNLAAGDI HGQDSNGEFV DGEAGTWRLG LDYPQQASVG LAWAANNTLT
LSADVKWLNW SDTMDELTVK GPNGSRFALD PGWDDQWVFA AGAEWVVNPE RLTLRAGVNY
AESPLDDEDV ATNLLLPAVV ERHVALGGTV RMVNGWDLGF HLKHALKNKQ TQDGGPFDGV
SVEMDQWSAG LNIGYAF