Gene Mlg_0690 details

Gene Information

Locus tag: Mlg_0690
Symbol:
ID: 4268849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 766727
End bp: 768361
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 71%
IMG OID: 638125439
Product: metal dependent phosphohydrolase
Protein accession: YP_741534
Protein GI: 114319851
COG category: [T] Signal transduction mechanisms
COG ID: [COG1639] Predicted signal transduction protein
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
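
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminating stop codon; both assumptions are consistent with the figures listed here, but the record itself does not state them. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 766727
END_BP = 768361


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1635, matches "Gene Length"
print(expected_protein_length(length_bp))  # 544, matches "Protein Length"
```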


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.218536
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGTCC CCGCCAATGA ATTCCTGCAC CGAAGCAGCT TGGCTGAACT GCCCGCGTTC 
CCGGCCGTGG TGCTGTCGTT GCTGGAGGCC ACCGCCCGGG GCGACGCCGG ACCACGGGAG
ATCGAGACCA TCATCGCCCG CGATAGCACC CTCGTCGCCC GGCTTCTCGG CGCCGCCAAT
GCCGCCGCCT ATACCCGCAG CACGCCCACC GACAGCCTGC GCGAGGCCAT CGCGCGGCTG
GGGCTGAGCC GGATCAACAC CCTGGCGGTC GCCACCGCCG TGCGCGGCTT TTTCCAGGCC
CTGGACAGGG GCCAGCCGGT CTGGGTGGAG GCCCTGTGGC GCCACGGGCT GCTTTGCGCT
CAGCTCGCCC GCGAACTGGC CCGGGCCCGG GCCAACAGTT GTCCGGAAAC CGCCTACCTG
GCCGGCCTGC TGCACGATGT GGGCAAACCA GTCATGGGTC TGGCCCGCCC GGAGGAATAC
CAACGACTGC TGGCGGACGC CGAGGCCTCC GACCGTGGCC TACAGGGGTT GGAACAGGCG
CGCTGGGGCT GCGACCACAG CCAGGTGGGG GCGGAACTGC TCGCCGACTG GGGCTTCCCG
GCCCTGCTCA TTGATGCCGT GCGCTACCAC CATCAGCCGG TACCGGCGCT GGAACACGCC
CACCCGCTGG TCCGCTGCGT GGCGCTCGCG AACCTGCTCT GTCACCACCC GGAAGTGGAC
GTGACCGGTC GCCAGGCCGC CGCCCGGTTG CTGGACCTGG GCCGGGAGGA CACCCAGACC
CTGGTGACCG CGGCGCAGCA CGAGCTGGCC AAGACCTGCA CGGCGCTCGG CGAGGACGAC
GCGGCATCGG ACCTGAAGGG GGCCCATGAG CGGGTGGTGC AACGGCTGGG GGACCAGGTC
CGCACCGCGG CGCTGACCGG GGGGCTCGGG GCCGCGCTGG CCGAGGGCGA CACCCTGGAG
AGCCTCGCCA TCTGTGTGGC CCTGCTGTTC GGGGTACGAC ACCTGCTGGT GCTGGAGGTG
GACGAGACCG GCGAGCGGCT GAGCGCTACC GCCATGCCCG CCAACGACCC GGCCATCAGC
GAACTCTCCC TGCCGCTGGA AACCGCGGCC AGCCCCATTG CCGCGTTACT GCTCGAGGAC
CGCATCGCCC CACTCAAGAC CCCCGACCCC GGCGCCATCG ACCAGCCGGT GGCGGACCGC
CAGGTACTCG ACCGGCTCCC TGGCGAAGCT GCCCTGGGGC TGCCCCTTTA CAGTGAGGGC
CGGCCGGTGG CGGCACTCAT CCTGGGGCTG AGCCACGGGC AACTCCAGGC CCTGCAAGCG
GAGGCCCCCC TGCTCCGCAC CTTCGCCGCC CAGGCCGGCG CCCTGCTCGC CCGCCGCCAG
CGGGAGGACC AGGCCGTACG GCAGGCGCGG GAGGAGGCCT CTGCGCAACA GCGGCGCCGC
GAAGCGGAAT TGGTCAAGGC AACCGCAAAC CCGATTACCG TGATGCAGAA TTACCTGAGC
GTATTACAAC AGCAGCTTGA GGCGGACCAC CCCGGACAGA GCGGGGTCAG CGCCCTTCGC
GACGAGGTCG AACGGATCCG CGAGCTGATC AGCGAACTGG AACCGACGGT CGACGGAAAT
CCGGCAAACC CCTGA
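
The record reports a GC content of 71%. A minimal sketch of how such a figure could be computed from a sequence laid out in space-separated blocks like the one above is given here; `gc_content` is a hypothetical helper, and the exact rounding used by the database is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Toy input, not taken from the record: 4 of 6 bases are G or C.
print(round(gc_content("GGCC AT") * 100))  # 67
```

Pasting the full gene sequence into `gc_content` and multiplying by 100 would give the percentage to compare against the listed value.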
 
Protein sequence
MTVPANEFLH RSSLAELPAF PAVVLSLLEA TARGDAGPRE IETIIARDST LVARLLGAAN 
AAAYTRSTPT DSLREAIARL GLSRINTLAV ATAVRGFFQA LDRGQPVWVE ALWRHGLLCA
QLARELARAR ANSCPETAYL AGLLHDVGKP VMGLARPEEY QRLLADAEAS DRGLQGLEQA
RWGCDHSQVG AELLADWGFP ALLIDAVRYH HQPVPALEHA HPLVRCVALA NLLCHHPEVD
VTGRQAAARL LDLGREDTQT LVTAAQHELA KTCTALGEDD AASDLKGAHE RVVQRLGDQV
RTAALTGGLG AALAEGDTLE SLAICVALLF GVRHLLVLEV DETGERLSAT AMPANDPAIS
ELSLPLETAA SPIAALLLED RIAPLKTPDP GAIDQPVADR QVLDRLPGEA ALGLPLYSEG
RPVAALILGL SHGQLQALQA EAPLLRTFAA QAGALLARRQ REDQAVRQAR EEASAQQRRR
EAELVKATAN PITVMQNYLS VLQQQLEADH PGQSGVSALR DEVERIRELI SELEPTVDGN
PANP
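
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as initiators). It assumes the sequence is given on the coding strand and in frame; `translate` is an illustrative helper, not part of any database tool.

```python
from itertools import product

# NCBI-style compact encoding: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 15 bases of the gene sequence in this record.
print(translate("ATGACTGTCC CCGCCAATGA ATTCCTGCAC"))  # MTVPANEFLH
print(translate("CCGGCAAACC CCTGA"))                  # PANP
```

Both fragments reproduce the corresponding ends of the listed protein sequence; each full line of the gene sequence holds 60 bases, so every line starts in frame.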