Gene Mlg_0395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0395 
Symbol 
ID4269973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp441303 
End bp442334 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content68% 
IMG OID638125125 
ProductPhoH family protein 
Protein accessionYP_741239 
Protein GI114319556 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.503054 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCTCG GGGCCCGGCC CTGCCGAGAG ACAGTAGACG CCAAATTGAG TAATCGTCCG 
CAAGCCCTCG ATTTTGACCT GGAGCCCGCC GACAACGAGC GCCTGGCGCG TCTGTGCGGG
CAATTCGACG AGAATATCCG CCAGGTGGAG CGCCGCCTGG GGGTGGAGAT CCGCAACAAC
GGCCATCACT TCCGCGTTAT CGGCAACGGC GACTCCGTGG CCGCCGCCGA GCGTGTCCTG
CACGAACTCT ACGACGACTC GGCCCGCAAG CTGATCGAGC CCGAGGCGGT GCATCTGTGC
ATCCAGGATG TGGGGGTGGA GGACGAGCCC GAGGCCGTCG AGACCGACGA GGAGGAGGCG
GCGGACAAGG AAGGCGAGGT GATCATCCGC ACCCGCAAGG CGCAGGTCCG GGGCCGTGGC
CCCAACCAAC GTGCCTACCT GCGCCGGGTG CTCACCCACG ACCTCAACTT CGGTGTCGGC
CCCGCCGGCA CCGGCAAGAC CTATCTGGCC GTGGCCTGCG CCGTGCAGGC GCTGGAGGCG
GACGAGGTGC GCCGGGTGGT GCTGGTGCGC CCGGCGGTGG AGGCCGGCGA GCGCCTCGGT
TTCCTGCCCG GCGATATGGC CCAGAAGGTG GACCCCTACC TGCGCCCGCT CTACGACGCC
CTGTTCGAGA TGCTGGGTTT CGAGCGGGTG GGTCGGCTGA TCGAGCGGGG GGTGATCGAG
ATCGCCCCGC TCGCCTTCAT GCGCGGGCGC ACCCTCAACC ACAGCTTCAT CATCCTGGAC
GAGGCCCAGA ACGCCACCGT CGAGCAGATG AAGATGTTCC TCACCCGCAT CGGCTTCGGT
TCCACCGCCG TGGTCACCGG CGATGTCACC CAGATCGACC TGCCCCGGGA CAAGCCCTCG
GGGCTGCGCG ACGCGGTGGA CGTGCTGCGG GATGTGGACG GCGTCAGCTT CACCTTCTTC
ACCGCCCGCG ACGTGGTGCG CCACGCGTTG GTGCAGCGTA TCGTGCAGGC CTATGACAGC
CGCAGTGAAT GA
 
Protein sequence
MGLGARPCRE TVDAKLSNRP QALDFDLEPA DNERLARLCG QFDENIRQVE RRLGVEIRNN 
GHHFRVIGNG DSVAAAERVL HELYDDSARK LIEPEAVHLC IQDVGVEDEP EAVETDEEEA
ADKEGEVIIR TRKAQVRGRG PNQRAYLRRV LTHDLNFGVG PAGTGKTYLA VACAVQALEA
DEVRRVVLVR PAVEAGERLG FLPGDMAQKV DPYLRPLYDA LFEMLGFERV GRLIERGVIE
IAPLAFMRGR TLNHSFIILD EAQNATVEQM KMFLTRIGFG STAVVTGDVT QIDLPRDKPS
GLRDAVDVLR DVDGVSFTFF TARDVVRHAL VQRIVQAYDS RSE