Gene Mlg_1169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1169 
Symbol 
ID4269108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1366243 
End bp1367139 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content69% 
IMG OID638125918 
Producthypothetical protein 
Protein accessionYP_742008 
Protein GI114320325 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.351045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.345968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG CCGTCCGCCG GGGGGCAACC CTAATCATCA TCGCCGAGTT ACTGCTGGCG 
ACCATGGCGG CGACCATCAA GGCGGCGTCG GCGGAACTGC CCAGCGAGAT GCTCGTCTTC
TTCCGCAACC TGTTCGGGTT GCTGCTGTTG CTGCCGCTGC TGGTCCGGGG CGGGCGCCGG
GGGCTGGCCA CCCGCGTGCC GCACCTGCAT CTGCTGCGCG GCCTGGCCGG CGTGGGCGCC
ATGTACTGCT TTTTCTGGAC CATCGCCCAT ATGCCGCTGG CGGAGGCGCT GCTGGTAAAG
CTCTCCGCCC CCTTCTTTCT ACCCCTGATC GCCTGGCTGT GGTTGCGCGA GACGCTGTCC
GGCCGTACCG TCCTAGCCAT CGCCGTCGGC TTTCTTGGGG TCTATTTCAT CCTGCAACCC
AACGGGGCCA TGCAGGGCGC CGCCCTGCAG GTGGGGGCAG TGGGCCTGGC CGGTGCCGCG
CTGGCGGCGC TGGCCAAGGT CACCATCCGG CGCATGGGCC CGGAGGAATC CAGCCGCCGG
GTGGTCTTCT GGTTCGGCGT GACCGCCACC ACGGTCTCCG CCCTGCCCCT GCCCTGGGTC
TGGCAGACAC CAACGGGTCA GACGCTGGTG CTGTTGGTGG TCCTTGGCGC CTGTGCCACA
TCGGCCCAGC TCCTGCTCAC CCGGGCTTTC GCCATCGCCC CGTCGGGGCG CCTGGGCCCC
TTCACCTACA TCTCGGTGGT CTTCGGTTCG CTCTACGGCT GGTGGATCTG GGGCGAGCTG
CTGGGCCCCA TGACCCTTCT GGGCATGGCC CTGGTCATCG GTGCCGGCCT GCTCAACCTG
ACCCTGCGCC GACCGGCGGC GCCGCCGGCC CGGCGGCCCG CGGAGGAACA CCCATGA
 
Protein sequence
MSDAVRRGAT LIIIAELLLA TMAATIKAAS AELPSEMLVF FRNLFGLLLL LPLLVRGGRR 
GLATRVPHLH LLRGLAGVGA MYCFFWTIAH MPLAEALLVK LSAPFFLPLI AWLWLRETLS
GRTVLAIAVG FLGVYFILQP NGAMQGAALQ VGAVGLAGAA LAALAKVTIR RMGPEESSRR
VVFWFGVTAT TVSALPLPWV WQTPTGQTLV LLVVLGACAT SAQLLLTRAF AIAPSGRLGP
FTYISVVFGS LYGWWIWGEL LGPMTLLGMA LVIGAGLLNL TLRRPAAPPA RRPAEEHP