Gene Mlg_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1024 
Symbol 
ID4270054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1166132 
End bp1167631 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content68% 
IMG OID638125776 
Productpolyphosphate:AMP phosphotransferase 
Protein accessionYP_741867 
Protein GI114320184 
COG category[S] Function unknown 
COG ID[COG2326] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0970051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.793106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAAAAC GCATCAACCG CCTGATCGAG GGTCAGGCGG ACAAAAAGGA GCGGAAAAAG 
ACGCTGCAGG AGCTGCGCCT GAAGCTGCTG CGCAGCCAGC TGGCCCTGTC CGAGGCTGCC
GAGATCCCCG TGGTGATCAT GGTCACCGGC CAAGTCGGTA GCGGCCGCGG CGAGACGGTG
AATCTGCTCA ACAAGTGGCT GGAGAACCGG GGCATGGAGA CCCATGCCTT CGGGCCGCCC
AATGACGAGG AACGCGCCCG CCCGCCCATG TGGCGGTACT GGCGTAGCCT GCCGCCGGCC
GGCCGTATAG GGATCTACGT CAACGGCTGG TATGGCGAGG CGGTGATCGA CCGTGTGCAG
GGTCTGATCG GCCCAGCGGT CCTGGAGACC CGGGTGGAGG AGGTCCGGGA GTTCGAGAAG
ACCCTGGCCG CCGAGGGCGC GCTGATCCTG AAGTTCTGGT ACCGAATCTC CCGCGAGCGG
CAGGCCGAGC GCCTGCACCA GCTGGAGTCC GACCCGGTCA ACCGCTGGCG GGTCAATGAG
TTGTCCTGGC TACGCCACGA GCAGTTTGAC GCCATCGACG AGACCGCCCA CCAGGTGGTG
GAGGCCACCG ACAGCGCCTG GGCCCCCTGG CACGTGCTGG AGGGCGGGCA CCCGGAGCGG
CAAACGCTGC AGACCGCCGC CATCATCCTG GACCGGATGC AGGACCGGCT GAGGGGTCGG
CGGGAGGAGG TGGAGAGCGC CCGCGCCCCG GTCAAATGCC GTCCCGCGCG CGATCCACAG
ACCCTGGAGG CGCTGGACCT GACCCAGACC CTGGACAAGA CCACCTACCA CGAAGAACTG
ACCCGCTGGC AGGACCGGCT CAGTCAACTG GTGCGGGATC CGGTGTTCCG TCGCGACTAC
GCCGTGGTGG CGGTCTTCGA GGGCCACGAC GCCGCCGGCA AGGGCGGCAG CATCCACCGG
GTCACCGCCG CCCTGGACGC CCGCCACTAC CGGGTGATCA GCGTGGCGGC GCCCACCGAC
GAGGAGCGCG CGCAGCCCTG GATTTGGCGC TTCTGGCGGC AACTGCCCTC GCACGGCCGG
ATGACCATCT TCGATCGCTC CTGGTACGGG CGCGTCCTGG TGGAACGGGT GGAGGGCTTC
GCCGAGCGGA CCGACTGGCG CCGCGCCTAT GGCGAGATCA ACCACTTTGA GCAGAACCTC
CTGAGGAGCA ATATTATCCT CGCCAAGTTC TGGCTCGCCA TCGACGCCGA CGAGCAGCTC
GCCCGCTTCC AGGCGCGCGC GGAGACACCG TGGAAGGCGC ACAAGCTGAC CGAGGAGGAC
TGGCGTAACC GGGAGCGGTG GGACGACTAC CAGGCCGCCA TCAACGACAT GCTGCGCTAC
ACCGATACCA CCGCCGCGCC CTGGCACGTG ATCGAGGCCA ATGACAAACG CTTCGCCCGG
GTCAAAGTGA TCAAGCGGCT CTGTGCCGCC ATTGAGGGAG CCATGGAGAG CGGCGGCTGA
 
Protein sequence
MQKRINRLIE GQADKKERKK TLQELRLKLL RSQLALSEAA EIPVVIMVTG QVGSGRGETV 
NLLNKWLENR GMETHAFGPP NDEERARPPM WRYWRSLPPA GRIGIYVNGW YGEAVIDRVQ
GLIGPAVLET RVEEVREFEK TLAAEGALIL KFWYRISRER QAERLHQLES DPVNRWRVNE
LSWLRHEQFD AIDETAHQVV EATDSAWAPW HVLEGGHPER QTLQTAAIIL DRMQDRLRGR
REEVESARAP VKCRPARDPQ TLEALDLTQT LDKTTYHEEL TRWQDRLSQL VRDPVFRRDY
AVVAVFEGHD AAGKGGSIHR VTAALDARHY RVISVAAPTD EERAQPWIWR FWRQLPSHGR
MTIFDRSWYG RVLVERVEGF AERTDWRRAY GEINHFEQNL LRSNIILAKF WLAIDADEQL
ARFQARAETP WKAHKLTEED WRNRERWDDY QAAINDMLRY TDTTAAPWHV IEANDKRFAR
VKVIKRLCAA IEGAMESGG