Gene Mlg_1158 details


Gene Information

Locus tag: Mlg_1158
Symbol:
ID: 4270664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1357091
End bp: 1358179
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 71%
IMG OID: 638125907
Product: hypothetical protein
Protein accession: YP_741997
Protein GI: 114320314
COG category:
COG ID:
TIGRFAM ID:
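
The Start bp, End bp, Gene Length and Protein Length values above are consistent with one another if the coordinates are read as 1-based and inclusive and the final codon is an untranslated stop codon. Both points are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with a hypothetical helper name `cds_lengths`:

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3             # includes the stop codon
    return gene_length, codons - 1        # drop the stop codon


# Values taken from the record above.
print(cds_lengths(1357091, 1358179))      # (1089, 362)
```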


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.946147
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCAGA CCGTACTGCT CGTTGGCACC CAAAAGGGCC TGTTCCGGCT GGAGGACACC 
CCTGACCGGA CCGGTTGGGA GCTGGCCGGG CCGCTGATCG CCGGATACGA GGTGCTGCAC
GCCTGGCTGG ACCCGCGCGA CCCGCAGCGG GGGCTGGCGG CGGTGGACCA CCCCGTGTGG
GGCGCGCATA TCTACCGCAC CGACGATGCC GGCCACCGCT GGGAACCGCT CGCCGGCGTG
CCCCTGCACC GCCCCGGGCT GTGGCCCAAG CGGATGAAGG CGGTCTGGCA CCTGGCGCCG
GGACCGGCCG AGGCCCCGGG GACGGTCTAC GCCGGCACCG ATCCGGCCGG CCTGTTCCGC
AGCGATGATT ACGGCCAGAG CTGGACGCCT GTGGCGTCAC TCAACGAACA CCCCACGCGG
GACACTTGGG AGCCGGCCCG CGGTGGCTTT TCGTTGCATT CCATCCTGAT CGACCCGCAA
TCGCCGCAAC GCCTCTACGT AAGCATCTCG GCCGGGGGAG TCTTCCGCAG CGACGACGGC
GGGCGAAGCT GGCGCCCCTG CAACGAGGGG GTCCGCGCCG AGAACCTGCC CGGCCGCTGC
GCCGTGACCG GCCACAACGT GCACCGCACG GTGCTCTGTC CGCGCCGGCC GGAGCGGCTC
TACCGGCAGT GTTACAACGG CGTCTACCGC AGCGACGACC GGGGCGGGCA CTGGACGGAG
ATCTCCTCCG GACTGCCCAG CGATTTCGGC TACGCCCTGG CCACCCCGCC GCAGGATCCG
GACACGGTGT ACGTCATCCC CATTGAGAGC AACCACCTGC GGACCTGCTG CGACGGCCGC
CTGCGCGTCT ACCGCAGCCG TGACGGTGGC CGACACTGGG CGCCCCTGAC CCGGGGATTG
CCGCAACGCC ACGCCTACGT CACCGTCCTG CGCGAGGCCA TGGCCCAGGA CGGTGCCGAT
CCGGCGGGGC TGTACTTTGG CACCTCCAGC GGCCACCTGT TCGCCAGCCG CGACGGCGGC
GAGCACTGGG AGACGGTGGC GGAGTTCCTG CCCCGGGTGC TCTCAGTGCA GGCCGCCCGC
TGTTACTAA
 
Protein sequence
MTQTVLLVGT QKGLFRLEDT PDRTGWELAG PLIAGYEVLH AWLDPRDPQR GLAAVDHPVW 
GAHIYRTDDA GHRWEPLAGV PLHRPGLWPK RMKAVWHLAP GPAEAPGTVY AGTDPAGLFR
SDDYGQSWTP VASLNEHPTR DTWEPARGGF SLHSILIDPQ SPQRLYVSIS AGGVFRSDDG
GRSWRPCNEG VRAENLPGRC AVTGHNVHRT VLCPRRPERL YRQCYNGVYR SDDRGGHWTE
ISSGLPSDFG YALATPPQDP DTVYVIPIES NHLRTCCDGR LRVYRSRDGG RHWAPLTRGL
PQRHAYVTVL REAMAQDGAD PAGLYFGTSS GHLFASRDGG EHWETVAEFL PRVLSVQAAR
CY
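
The gene sequence begins with GTG while the protein sequence begins with M, which fits translation table 11 (bacterial, archaeal and plant plastid code), where GTG can act as an initiator codon and is read as methionine. The sketch below shows one way to strip the 10-base grouping and translate such a CDS in plain Python. The helper names `clean` and `translate_cds` are hypothetical, and the amino acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; the initiator codon is read as M."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence above, plus its stop codon.
print(translate_cds("GTGACCCAGA CCGTACTGCT CGTTGGCACC TAA"))  # MTQTVLLVGT
```

Passing the full gene sequence block to `translate_cds` should allow the result to be compared with the protein sequence listed in the record once its spacing is removed with `clean`.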