Gene Mlg_1749 details


Gene Information

Locus tag: Mlg_1749
Symbol:
ID: 4270856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2005454
End bp: 2006608
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 70%
IMG OID: 638126507
Product: hypothetical protein
Protein accession: YP_742585
Protein GI: 114320902
COG category:
COG ID:
TIGRFAM ID:
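
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2005454
end_bp = 2006608

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1155
print(protein_length)  # 384
```

Both results agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields.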


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.698415
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCTCG ATCTGCTCAG CGGCGAACAG CGGGCCCCGG CGGAGTGCCG GATCTACCTC 
CACGACCAGG AGGTGCCGGA GTACTACCCC TTCCTCCGCG AGTTGGAGGT GGACACCAGC
CGGGAGGAGG CCTGGACCGC GACCCTGCGC CTGGCCACCG TGCGCGATGA GCTGGGGAGC
TGGGGTATCC AGGACGACGC GCTGTTGCGG CCCTGGGCGG AGATCACCCT GGCCATCGTC
TTCGGCGATG ACGAGGAGCG CCTGTTCAAG GGCTATATCC GCGAGGTCAA CGCCGACTTC
CCGGAGAGCG CCGGTGAGGC GGAGGTCGTG GTCGAGTGCC AGGACCAGTC CCTGCGCCTG
GACCGCACCC ACCAGCGCGA GCCCTGGGGC GACGAGGAGG CCCCGGTCAG CGACCGGGTG
ATCATCGAGG AGATCCTCAG CCACTACGGC CTCGCCCTCA GCCCGGACAG CAAGAGCGGC
CAGCAGGGGC TGGTGGAACT GCCGCAGGAC AGCTCCGATA TCCAGTTCCT GCGCAAGCGC
GCCGAGGAGA ACAACTACGA GCTGATCTTC TACCCGGACG AGGTCTACTT CGGGCCCTAC
CGCCTGGACG GCGCCGGGGC CCAGGCCACC ATCCAGGTCT ACGCCGGCCA GGCCACCAAC
TGCCTGCGGC TCAACGTCAG CGCCGACGGC CACCTGCCGG ACCTGCTCCA GGTGGAGGTG
CCGGCGGAGC GCGAGGATGA CGACTCGCGC ACGCTGCAGA TGTTCTCCAC GCTGCCCCCC
ATGGGGCCGG AGCGGGCCGA CAGCCGGGGC GGCGGCCTGG AGCCCAATGT AGACACCCTG
AGCGGGGAGG GGGGCGATGC CGCCGAGCAG CTCGAGGCCC GGGCCCAGGC TCGCATCAAC
GAATACGACC TGCACCGCCT GCAGGCCGAC GGCGAGCTGG ACGGCAGCCT CTACGGCCAC
GTGCTGCGCC CGGGCCGGCC GGTGCCGGTG GACGGCCTGG GCGAGCGGCT GAGCGGGCTT
TACTACGTCG ACCGGGTGGC CCATCACTTC AGCCCCGACG GCTACTTCCA GCGCTTTCAA
CTGCTGCGCA ACGCCTACGG CGACAACGTG GAGACCGCCG CCCCGGTCGC TTCCCGGCTG
GCGGGGGTGC TCTGA
 
Protein sequence
MVLDLLSGEQ RAPAECRIYL HDQEVPEYYP FLRELEVDTS REEAWTATLR LATVRDELGS 
WGIQDDALLR PWAEITLAIV FGDDEERLFK GYIREVNADF PESAGEAEVV VECQDQSLRL
DRTHQREPWG DEEAPVSDRV IIEEILSHYG LALSPDSKSG QQGLVELPQD SSDIQFLRKR
AEENNYELIF YPDEVYFGPY RLDGAGAQAT IQVYAGQATN CLRLNVSADG HLPDLLQVEV
PAEREDDDSR TLQMFSTLPP MGPERADSRG GGLEPNVDTL SGEGGDAAEQ LEARAQARIN
EYDLHRLQAD GELDGSLYGH VLRPGRPVPV DGLGERLSGL YYVDRVAHHF SPDGYFQRFQ
LLRNAYGDNV ETAAPVASRL AGVL
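
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which codons may act as starts, a point that does not matter here because this gene begins with ATG. The function name `translate` and the variable names are hypothetical helpers, not part of this record, and only the first and last blocks of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
# Amino acid assignments are those shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and final 15 bases of the gene sequence in this record.
first_block = "ATGGTGCTCG ATCTGCTCAG CGGCGAACAG"
last_block = "GCGGGGGTGC TCTGA"

print(translate(first_block))  # MVLDLLSGEQ
print(translate(last_block))   # AGVL*
```

The two outputs match the first ten and last four residues of the protein sequence, with the final TGA codon read as the stop.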