Gene Mlg_1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1871 
Symbol 
ID4268089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2133649 
End bp2135142 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content69% 
IMG OID638126627 
ProductPpx/GppA phosphatase 
Protein accessionYP_742705 
Protein GI114321022 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.855928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.965208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGC CGGATATCCT GGCCGCTGTC GATCTGGGTT CCAACAGCTT CCATATGATC 
GTCGCCCGCC ACGAGGGCGG ACAGCTGCGC ATCCTGGACC GGTTGCGAGA GCCGGTCCGT
CTTGCCGAGG GACTGGAGAC CCACCGCACC CTGAGCGCGC CGGCTCGGGA GCGTGCTCTG
GCCTGTCTGC GCCGGTTCGG CCAGCGTTTG CGTGAGGTGG ATGCCGGCGG GGTTCGGGCA
GTGGGTACCA ATACCCTGCG CAGGGCCCGG CAGAAGCGGG GCTTCCTTGA GGAGGCCGAG
GCGGCCCTGG GGCACGGGAT TGAGATCATA CCGGGTTACG AGGAGGCGCG GCTGGTCTAC
CTGGGCGTGG CCCGAAGCCT GGATTTCGAT GGCCGCCCGC GGCTGGTGGT GGACATCGGG
GGAGGCAGCA CCGAATTGAT CATCGGGGCC GGGGAGCAGC CCCGAGCCAT GGACAGCCTG
CACATGGGCT GCGTGAGCTT TACCGAGCGC TATTTCCCGG GCGGCGAGAT TACTCCGGCG
TGCATGGAGC GGGCGTTGAC AGCCGCGGGT GTGGAGATGG AGCCGGTGGG CGCCCATTAT
CAGACGCCCC TTTGGGCGGA GGCCGTGGGG GCCTCGGGGA CGATCCGGGC CGTGGCCAAC
GTGGTCCAGG CGGCGGGCTG GTGCGACTAC GGCATCACCC GGGAGGCGCT GCAGCGGCTC
GGTGACACGC TCCTGAGCGC CGGCCACGTG GATCGGCTGC GGCTGGACGG GTTGAGCGAT
GACCGTCGGG TGGTGTTGCC CGGCGGCGTC GCTGTGCTCA GCGCGGTGTT CGAGACCCTG
GGTGTGGGCC GGATGGAGGT GGCCGAGGGG GCGCTGCGTG AGGGGCTGCT CTACGACCTG
GTGGGGCGCC TGGGGGAGGC GGACGTGCGC GCCGCCACGG TGGAGAGCCT GGTCCGGCGC
TACCATGTGG ATGAACTGCA GGCCGGACGG GTGGCCGCCA CCGCCCACTA TTGCCTGCAG
CAGGTGGCCG CTGATTGGTC ACTGCAGCGG CCCTTCTGGG CGCGCCTGTT GCGCTGGGCG
GCGCAACTGC ATGAGGTGGG CCTGGATATC TCCCACAGCC AGTACCACAA GCATGGGGCC
TACATCACCC GCCACGCGGA TATGGCCGGG TTTTCCAGCC TGGACCAGCA GCTGTTGGCG
GTGCTGGTCC GGGCCCACCG GCGCAAGTTC CCGGTCGTCG AGTTCGACGC CCTGCCGGGC
AGCCACCGGA TCCCGTTGAT TCGCCTGGGG GTGTTATTGC GGCTCGCCGT CCTGCTGCAC
CGCAGCCGCA GCGAGACCCC GCTGCCCGAT TTCCGCCTGC GTGGCAACGG CCACCACCTG
AAGCTGGAAC TCCCCTCGGA CTGGCTGGAG CAACACCCCC TGGTGCGCGC GGATCTGGAA
CAGGAACGCC GCTGGCTCAA GCGGGCGGAT CTGCGACTCT ATCTCGGCGC TTGA
 
Protein sequence
MGKPDILAAV DLGSNSFHMI VARHEGGQLR ILDRLREPVR LAEGLETHRT LSAPARERAL 
ACLRRFGQRL REVDAGGVRA VGTNTLRRAR QKRGFLEEAE AALGHGIEII PGYEEARLVY
LGVARSLDFD GRPRLVVDIG GGSTELIIGA GEQPRAMDSL HMGCVSFTER YFPGGEITPA
CMERALTAAG VEMEPVGAHY QTPLWAEAVG ASGTIRAVAN VVQAAGWCDY GITREALQRL
GDTLLSAGHV DRLRLDGLSD DRRVVLPGGV AVLSAVFETL GVGRMEVAEG ALREGLLYDL
VGRLGEADVR AATVESLVRR YHVDELQAGR VAATAHYCLQ QVAADWSLQR PFWARLLRWA
AQLHEVGLDI SHSQYHKHGA YITRHADMAG FSSLDQQLLA VLVRAHRRKF PVVEFDALPG
SHRIPLIRLG VLLRLAVLLH RSRSETPLPD FRLRGNGHHL KLELPSDWLE QHPLVRADLE
QERRWLKRAD LRLYLGA