Gene Information |
Locus tag | Mlg_0395 |
Symbol | |
ID | 4269973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 441303 |
End bp | 442334 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125125 |
Product | PhoH family protein |
Protein accession | YP_741239 |
Protein GI | 114319556 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase |
TIGRFAM ID | |
|
|
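The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch that assumes the start and end coordinates are 1-based and inclusive and that the stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 441303
end_bp = 442334
gene_length_bp = 1032
protein_length_aa = 343

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is the stop codon and is not counted as a residue.
codon_count = span // 3
expected_protein_length = codon_count - 1

print(span == gene_length_bp)
print(span % 3 == 0)
print(expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks hold for this record.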
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.503054 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCTCG GGGCCCGGCC CTGCCGAGAG ACAGTAGACG CCAAATTGAG TAATCGTCCG CAAGCCCTCG ATTTTGACCT GGAGCCCGCC GACAACGAGC GCCTGGCGCG TCTGTGCGGG CAATTCGACG AGAATATCCG CCAGGTGGAG CGCCGCCTGG GGGTGGAGAT CCGCAACAAC GGCCATCACT TCCGCGTTAT CGGCAACGGC GACTCCGTGG CCGCCGCCGA GCGTGTCCTG CACGAACTCT ACGACGACTC GGCCCGCAAG CTGATCGAGC CCGAGGCGGT GCATCTGTGC ATCCAGGATG TGGGGGTGGA GGACGAGCCC GAGGCCGTCG AGACCGACGA GGAGGAGGCG GCGGACAAGG AAGGCGAGGT GATCATCCGC ACCCGCAAGG CGCAGGTCCG GGGCCGTGGC CCCAACCAAC GTGCCTACCT GCGCCGGGTG CTCACCCACG ACCTCAACTT CGGTGTCGGC CCCGCCGGCA CCGGCAAGAC CTATCTGGCC GTGGCCTGCG CCGTGCAGGC GCTGGAGGCG GACGAGGTGC GCCGGGTGGT GCTGGTGCGC CCGGCGGTGG AGGCCGGCGA GCGCCTCGGT TTCCTGCCCG GCGATATGGC CCAGAAGGTG GACCCCTACC TGCGCCCGCT CTACGACGCC CTGTTCGAGA TGCTGGGTTT CGAGCGGGTG GGTCGGCTGA TCGAGCGGGG GGTGATCGAG ATCGCCCCGC TCGCCTTCAT GCGCGGGCGC ACCCTCAACC ACAGCTTCAT CATCCTGGAC GAGGCCCAGA ACGCCACCGT CGAGCAGATG AAGATGTTCC TCACCCGCAT CGGCTTCGGT TCCACCGCCG TGGTCACCGG CGATGTCACC CAGATCGACC TGCCCCGGGA CAAGCCCTCG GGGCTGCGCG ACGCGGTGGA CGTGCTGCGG GATGTGGACG GCGTCAGCTT CACCTTCTTC ACCGCCCGCG ACGTGGTGCG CCACGCGTTG GTGCAGCGTA TCGTGCAGGC CTATGACAGC CGCAGTGAAT GA
|
Protein sequence | MGLGARPCRE TVDAKLSNRP QALDFDLEPA DNERLARLCG QFDENIRQVE RRLGVEIRNN GHHFRVIGNG DSVAAAERVL HELYDDSARK LIEPEAVHLC IQDVGVEDEP EAVETDEEEA ADKEGEVIIR TRKAQVRGRG PNQRAYLRRV LTHDLNFGVG PAGTGKTYLA VACAVQALEA DEVRRVVLVR PAVEAGERLG FLPGDMAQKV DPYLRPLYDA LFEMLGFERV GRLIERGVIE IAPLAFMRGR TLNHSFIILD EAQNATVEQM KMFLTRIGFG STAVVTGDVT QIDLPRDKPS GLRDAVDVLR DVDGVSFTFF TARDVVRHAL VQRIVQAYDS RSE
|
| |
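The gene and protein sequences above can be related to each other with a small, dependency-free sketch. It assumes the spaces only mark 10-letter display blocks, and it builds the codon table from the amino-acid string used in the NCBI genetic-code listings (bases ordered T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so the same mapping is used here. The function names are hypothetical helpers, not part of any database API.

```python
# Assumption: bases ordered T, C, A, G, as in the NCBI genetic-code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the whitespace that splits the record into 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
head = clean_sequence("ATGGGCCTCG GGGCCCGGCC CTGCCGAGAG")
print(translate(head))  # MGLGARPCRE, the first block of the protein sequence

# Last twelve bases of the gene sequence above, ending in the TGA stop codon.
tail = clean_sequence("CGCAGTGAAT GA")
print(translate(tail))  # RSE, the last three residues of the protein sequence
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` or `translate` gives values that can be compared with the GC content and protein sequence fields of the record.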