Gene Mlg_2155 details

Gene Information

Locus tag: Mlg_2155
Symbol:
ID: 4270150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2450338
End bp: 2452149
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 66%
IMG OID: 638126911
Product: hypothetical protein
Protein accession: YP_742987
Protein GI: 114321304
COG category: [S] Function unknown
COG ID: [COG4655] Predicted membrane protein
TIGRFAM ID:
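
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose span includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    return gene_len_bp // 3 - 1


length = gene_length(2450338, 2452149)
print(length, protein_length(length))
```

Under these assumptions the values agree with the record: 1812 bp and 603 aa.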


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.230042
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0266885
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAGAC TGCAAGGACA GGGAATCGGT CCGGGGCGGC AGCGGGGTGC GATCGGCCTG 
GCGGCGGTCC TCCTGCTGGT GGTGGTGGTC GTGTTCCTGG CCCTGGCTCT GGATGCCGGC
CGGCTCTATA TGGAGCAGCG GAACCTGCAG CGGATCGCCG ATGCGACGGC CCTGGAAACG
GCCTGGAAGC ATACCGGCTG CACGGCGGAT CCTGCATCGG CGCTACAGAC GGCGCAGGCG
GTGGCGGAGC GCAACGGCTA CCAGGGTGAT GACCTCGTCA TAGGAGCGCA GGGGCTGTTG
CTGGGCCGTC TGGTGGAGGA CGGCGTCCTC CGGGTGTTCG AGACCATGCC GGATGTAGGC
CATCACGGGG TGGTGCCGGA GGCGGCGCGG GTGCACGTGG AGCACGAGGT GCCGCAGAGC
TTGGTCCTCG GGGGGCTGTT TGGCCAGCAG GCCACCCTGA GCGCCGAGGC GGTGGCCCGG
CGCATGCCGC TGGTGGGGAT CTCCGCCGGC TCCTGGGCCG CGCGGGTGGA CACCGAGAAC
TCCCCCCTGC TCAATGCCCT GCTTAACGGT CTGCTGGGGA CCAACCTGCA GTTGGACGCG
GTGGCCTTCG CCGGCCTGGT GGATACCTCA GTGACGCTGC TCCAGTTGGC TCAGGACCTG
GCGGTACTGG GGGTGGATCT CAGTGTCGCG ACGGTCGATG AGCTGCTGTC CGCAAATGTG
CGTTTGCTGG ATGTGCTGGA GGCAGCCGTC CGGGCGGTGG AACGGGAGGG TGTCCTCGAC
GTGAACGCCT CGGTCCTGCG CAACCAGCTC CTGAACATCG GGGTGGAAAA CCTGGAGCTG
CAGCTTGCCG ACATCCTCCA GGTCCAGGCC CCCTCCATGG ATCCGGACGC AGCCCTGGAC
GCCCAAGTCA ATGTGCTGGA CCTGATCATG ACCACGGCGA TGACCGCAAC CCGGGACCAC
GCCGTGGAGC TTGATGTTCT CCTGCCACTG AGTGATCTGA ATCTGCTGAA CCTGGTAGAT
GTTGACGCCC GGGTTAAGGC CACGATTGTC GAGCCGCCGC AGATTGTCAT CGGCCCGCCC
GGCCGGGGCC CCGACGGGGA GTGGCGGACC ATTGTGGATA CCGCGCAGGT GCGCCTCCAG
GCCGCCGCGG ACCTATCATT GAATGTGGGT ATCGCAGCGG TCGACGTCGA CCTGGGGGTG
GCGCTGCAGG CGGCGCAAGG GAGCGCCTGG GTTGAGGGGG TTGGCTGTCC TCCGGACACC
CCCGGGGCCA CGGAGGTGGC GGTCGGCACT CTGCCCGGGG TTGCTAATCT GGAGCTGGGG
GAGTTTGATG ACATCGCTGT CTCCGACCCG TCGGTGTTAC CGGTGGCGGT CGAGGTGAGG
GCCTTGGGGA TTCACATTGC CACCCTGGCG CTCGCTGCGA ACGCGCCGAT TCAGCCTGCT
GCCGGCGAAA CGCTTCATTT CCTGGTTGAG GACCGGGCGG CGTTGCCTAC AGAGGTGCAA
TCCGTCGCCA GCGGCTTGGG CGGAGCACTG GCGAACGGGT TACAGACCTT GGGCGAGAGT
ATCGATGTGG AGATCACCCT GGTCGAGGAT TTGGGAGTGC TCGCAACGTT GCTGGGACTG
ACCACGGCCG TGGTGGAAGC GCTGGTTAAT GAAGTGGTTG CGATCTTGCT GAGCCTGGTA
TTACCCCTCG TGCTGCAGCT ATTAGGGAGC GTCATTCTGG AGCCGCTGCT GAGCATGCTT
GGCGTTGGCG TGGGAGGGTT AGACGTCCAA GTGGTGGAGC TGCTTGAGGG CGGCGTCGAT
CTGGTCCGAT GA
 
Protein sequence
MVRLQGQGIG PGRQRGAIGL AAVLLLVVVV VFLALALDAG RLYMEQRNLQ RIADATALET 
AWKHTGCTAD PASALQTAQA VAERNGYQGD DLVIGAQGLL LGRLVEDGVL RVFETMPDVG
HHGVVPEAAR VHVEHEVPQS LVLGGLFGQQ ATLSAEAVAR RMPLVGISAG SWAARVDTEN
SPLLNALLNG LLGTNLQLDA VAFAGLVDTS VTLLQLAQDL AVLGVDLSVA TVDELLSANV
RLLDVLEAAV RAVEREGVLD VNASVLRNQL LNIGVENLEL QLADILQVQA PSMDPDAALD
AQVNVLDLIM TTAMTATRDH AVELDVLLPL SDLNLLNLVD VDARVKATIV EPPQIVIGPP
GRGPDGEWRT IVDTAQVRLQ AAADLSLNVG IAAVDVDLGV ALQAAQGSAW VEGVGCPPDT
PGATEVAVGT LPGVANLELG EFDDIAVSDP SVLPVAVEVR ALGIHIATLA LAANAPIQPA
AGETLHFLVE DRAALPTEVQ SVASGLGGAL ANGLQTLGES IDVEITLVED LGVLATLLGL
TTAVVEALVN EVVAILLSLV LPLVLQLLGS VILEPLLSML GVGVGGLDVQ VVELLEGGVD
LVR
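
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a rough sketch of both calculations, not the database's own method. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is built from the standard NCBI ordering of bases (`TCAG`), which table 11 shares with the standard code for amino acid assignments. The sketch does not handle the alternative initiation codons that table 11 permits; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codons in NCBI order (base order T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGTGAGAC TGCAAGGACA GGGAATCGGT"))
```

Applied to the first 30 bases, this yields `MVRLQGQGIG`, matching the start of the protein sequence; passing the full gene sequence to `gc_content` gives the fraction to compare against the GC content field.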