Gene Mlg_1870 details


Gene Information

Locus tag: Mlg_1870
Symbol:
ID: 4268088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2131537
End bp: 2133621
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 65%
IMG OID: 638126626
Product: polyphosphate kinase
Protein accession: YP_742704
Protein GI: 114321021
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0855] Polyphosphate kinase
TIGRFAM ID:
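
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2131537
end_bp = 2133621
gene_length_bp = 2085
protein_length_aa = 694

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, so codons = residues + 1.
codons = gene_length_bp // 3


def record_is_consistent():
    """Return True if the coordinates, gene length, and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons, record_is_consistent())
```

Under these assumptions the span equals the stated gene length, and the codon count is one more than the stated protein length.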


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCAAA AAACGATCGA TCTCAAGCAA CCGGACCTCT ACTTCAACCG CCTGCTCAGC 
CTGCTGGAGT TCAACCGCCG GGTGCTCGCC CAGGCCAAGG ACACCGACAC CCCGCTGCTG
GAACGCCTCA AGTTCCTCTG TATCTGCACC TCGAACATGG ACGAGTTCTT CGAGGTGAGG
GTCTCGGGGG TGAAACACAA GGCCGAGGCC GGTTCGGTTC AGGCCGAGTC GGACAATCGC
AGCCCGCAGG AGACCCTGAA TGCCATCAGT GCCGTCAGCC ACGAGCTGGT AGCCGAGCAG
TACCGCGTCC TCAACGAGGA ATTGATCCCG GCCCTGGCGG AAGAGGACAT CCGCTTCATC
CGCCGGGCCG ATTGGACCGA CGCCCAGACG GAGTGGCTGC GCCGTTTTTT CGAGGACGAG
CTGCTGCCGG TGCTCAGCCC CCTGGGACTG GATCCGGCCC ACCCCTTCCC CAAGGTACTG
AACAAGAGCC TGAACTTCAT CGTCAGCCTG GAGGGCAAAG ATGCCTTCGG CCGCAACAGC
GGGTTCGCCA TCGTGCAGGC GCCGCGCGCC CTGCCGCGCC TGATTCAACT GCCCCGGGAG
GGTGAGGACA ACGGACCCTG GGACTTCGTC TTCCTGTCCT CGGTCATTCA CGCCTTCGTG
GATCAGCTCT TCCCGGGCAT GAAGATCAAG GGCTGTTATC AATTCCGGGT GACCCGCAAC
AGCGATCTGT TTGTTGACGA GGAGGAGGTG GACGACCTGC TGCGGGCGCT GGAGGGGGAG
CTGCTCTCCC GCCGGTACGG CGAGGCCATT CGCTTGGAGG TGGCGGCCAA CTGTTCCGAG
GACCTGGCCA ACTTCCTGCT GCGCAAGTTC GAGCTCGGGC CGGACGACCT CTACCAGGTG
GACGGGCCGG TCAACCTGAA CCGGATGATG GCGGTCTACG ACCTGGTGGA CCGCCCCGAT
CTGAAGTATC CGTCGTTCAC CCCCGGGCTG CCGGCGGATT TCAGCCACAG CGGCGATATC
TTCAAGGTCC TGCGCAAGCG CCAGGTGCTG CTGCACCACC CCTTCCAGTC CTTTGCCCCG
GTCATCGAGC TGGTGCGCCA GGCCTCGCTG GATCCGGACG TGCTCGCCAT CAAGCAGACC
CTCTACCGCA CCGGGCCCGA TTCCGCCATC GTCGATCACC TGGTGCGCGC GGCGCGGGAC
GGTAAGGAGG TCACCGTTAT CATCGAGTTG CGCGCCCGCT TTGACGAGGC GGCCAACATC
GCCCTCGCCA ACCGGCTGCA GGAGGCCGGC GTGCACGTGG TCTATGGGGT CGTGGGCCAC
AAGACCCATG CCAAGATGCT GCTGGTGGTG CGCCGCGAGG GGCGCAAGCT GCGCCACTAC
GTGCACCTGG GGACCGGCAA CTACCACTCA CGCACCGCGC GGCTCTACAC CGACTATGGC
CTGTTCACCC GAGACAAGCA TACGGGTGAG GATGTCCACC GGCTGTTCCT GCAAATGACC
AGCCTGGGGC GTTTCTCCGA GCTGAAACGC CTGCTGCAAT CGCCCTTCAC CTTGCGGGAG
GGGGTGATCC AGCGCATCCA ACGGGAGGCC GAACACGCCC TCGCCGGTCA CGAGGCCCGC
ATTATCGTCA AGGTCAATTC CCTCACCGAG CCTGGTGTCA TCCAGGCGCT CTACCAGGCC
TCGCAGGCAG GCGTCACCGT CGACCTGATC GTGCGCGGCA TGTGCTGTCT GCGCCCCGGG
GTACCGGGGG TCTCGGACAA CATTCAAGTC CGCTCCATCA TCGGCCGCTT CCTGGAACAT
ACCCGGGTGT TCTATTTCCA CAACCGGGGC GACAGCGACC TCTATGCCAG CAGCGCCGAT
TGGATGGAGC GCAATTTCTT CCGGCGGGTG GAGACGGCCT TCCCCCTCCT GGACGAGGAG
GCGCGCCGGC GGGTGCTGCT GGACCTGGAG TGCTATCTCA AGGACAACAC CCAGGCCTGG
CTGCTGCAAC CCGACGGCAG TTACGTGCGC CTCCAGCCCG CGGAGGGCGA GGAACCCTAC
TGCGCCCAGC GGGCCCTCCT CGCGTTGCTC GCCGACAGTG CCTGA
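
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks like the one above. This is a rough sketch rather than the method the database uses; `gc_fraction` is a hypothetical helper name, and the short input in the usage line is made up for illustration.

```python
def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Illustrative input, not taken from the record: 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence as one string and multiplying by 100 gives a percentage that can be compared with the GC content field.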
 
Protein sequence
MDQKTIDLKQ PDLYFNRLLS LLEFNRRVLA QAKDTDTPLL ERLKFLCICT SNMDEFFEVR 
VSGVKHKAEA GSVQAESDNR SPQETLNAIS AVSHELVAEQ YRVLNEELIP ALAEEDIRFI
RRADWTDAQT EWLRRFFEDE LLPVLSPLGL DPAHPFPKVL NKSLNFIVSL EGKDAFGRNS
GFAIVQAPRA LPRLIQLPRE GEDNGPWDFV FLSSVIHAFV DQLFPGMKIK GCYQFRVTRN
SDLFVDEEEV DDLLRALEGE LLSRRYGEAI RLEVAANCSE DLANFLLRKF ELGPDDLYQV
DGPVNLNRMM AVYDLVDRPD LKYPSFTPGL PADFSHSGDI FKVLRKRQVL LHHPFQSFAP
VIELVRQASL DPDVLAIKQT LYRTGPDSAI VDHLVRAARD GKEVTVIIEL RARFDEAANI
ALANRLQEAG VHVVYGVVGH KTHAKMLLVV RREGRKLRHY VHLGTGNYHS RTARLYTDYG
LFTRDKHTGE DVHRLFLQMT SLGRFSELKR LLQSPFTLRE GVIQRIQREA EHALAGHEAR
IIVKVNSLTE PGVIQALYQA SQAGVTVDLI VRGMCCLRPG VPGVSDNIQV RSIIGRFLEH
TRVFYFHNRG DSDLYASSAD WMERNFFRRV ETAFPLLDEE ARRRVLLDLE CYLKDNTQAW
LLQPDGSYVR LQPAEGEEPY CAQRALLALL ADSA
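
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below assumes that table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which codons may act as start codons; alternative start codons are not handled here. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGACCAAA AAACGATCGA TCTCAAGCAA CCGGACCTCT ACTTCAACCG CCTGCTCAGC"
print(translate(first_line))  # MDQKTIDLKQPDLYFNRLLS
```

The output matches the first 20 residues of the protein sequence listed above.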