Gene Mlg_1929 details


Gene Information

Locus tag: Mlg_1929
Symbol:
ID: 4270130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2192740
End bp: 2194146
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 67%
IMG OID: 638126683
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_742761
Protein GI: 114321078
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
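
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2192740, 2194146)
print(length)                     # 1407
print(protein_length_aa(length))  # 468
```

Both results agree with the Gene Length and Protein Length fields in the record.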


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCCTG AAGCAACGTC GATGGATTGG GAACGGTTGC TGAGCAAGCA ACGGCTCGGG 
CGCCCCGACG AGGAAGGCAG TCTCGGTTTC CGCACCGATT TCCAGCGGGA TTTCGACCGT
ATCGTCTTCT CCTCGGCCTT TCGCCGGCTG CAGGACAAGA CCCAGGTCTT CCCGCTGGCG
GAGAGCGACT ACGTGCGCAC CCGGCTCACC CACAGCCTGG AGGTCTCCTG TGTGGGGCGC
TCCCTGGGCA CGCGGGTGGG CGAGGCGATC ACCCGTCGCG AGGGCTACAC CGAGGTGGCC
CCCGGCGACA TCGGCGCCAT CGTGGCAGCC GCCTGCCTGG CCCACGACAT CGGCAACCCG
CCCTTCGGCC ATGCCGGCGA GGACGCCATC CGCCACTGGG TGCGAACCAG TCCGGTGGCC
CGCCGGGCGC TGGACGAGTT AAGCCCGCCA CAGCGGGCGG AGTTCGAGCA CTTCGAGGGC
AATGCCCAGG GCTTCCGGGT GGTCACCCGG CTGCAGAACC CGGACAACCG CGGTGGTCTG
CAATTGACCT ACGCGACCCT CGGCGCCGCC CTGAAGTACC CCTGCCCCGC CCACGCCATC
GACCCCGGCT ACGGCATCAG CCGAAAGAAG TACGGCTATT TCGTCGCCGA AGCCGATCTC
TTCCGCGCTG TGGCCCAGAC CAACGGCCTG CTCAAGCAGG CCCCGCGCAC CTACTGCCGC
CACCCGCTGG CCTTCGTCAT GGAGGCGGCG GATAACATCG CCTACCTGAT CGTCGACTTC
GAGGATGCCT TCCGGCTCGG TATCCTGGAG TACCGCACGG TGCACGATCA CTTCCGCGCC
CTCCTCCGAG GCAAGGACCA GGGCACGGTG GAGCGACGCC TGGCCCGGCT GCGGGACGAC
AAGGAGCGGG TGGAGTACCT GCGTGCGCGC GCCATCAATG AGCTGGTGGA GGCCAGCGCC
CGCGCCTTCA TGGATCACGA GGCGGAGATC ATGACCGGTC GCTTCGAGCG GGAACTGACC
GACACCCTGC CATTCAGTGA GGCCCTGCGC GCTATCGCCG GGGTGTCTCA GGAGCGGATC
TACGACCACC TGGAGGTGCA GGGGGTGTGC GCGGCGGGTT ACAGCGTGAT CGGCGGCTTG
CTGGACCTGT TTCATGAGGC GGTCCACGAC ACCGCCGTGG CCCTGGAGGA GGGTAAGCAG
GCCCCACCGC GCTCGCGCAC TGTGACCAAC CTGGTGCCGG AGCAGTTCCT CTACCAGTAC
GACCCCGATT CCGGCCGGCG GTATCGGGTC ACCGACCCCT ACCTGTTGCT GCTCAATCTC
ACCGACTTCA TCGCCGGCAT GACCGACGGC TACGCGGTCT CGCTCTACAA GAAGCTGACC
GGGATGGCTC TGCCCCACCA CGGCTGA
 
Protein sequence
MEPEATSMDW ERLLSKQRLG RPDEEGSLGF RTDFQRDFDR IVFSSAFRRL QDKTQVFPLA 
ESDYVRTRLT HSLEVSCVGR SLGTRVGEAI TRREGYTEVA PGDIGAIVAA ACLAHDIGNP
PFGHAGEDAI RHWVRTSPVA RRALDELSPP QRAEFEHFEG NAQGFRVVTR LQNPDNRGGL
QLTYATLGAA LKYPCPAHAI DPGYGISRKK YGYFVAEADL FRAVAQTNGL LKQAPRTYCR
HPLAFVMEAA DNIAYLIVDF EDAFRLGILE YRTVHDHFRA LLRGKDQGTV ERRLARLRDD
KERVEYLRAR AINELVEASA RAFMDHEAEI MTGRFERELT DTLPFSEALR AIAGVSQERI
YDHLEVQGVC AAGYSVIGGL LDLFHEAVHD TAVALEEGKQ APPRSRTVTN LVPEQFLYQY
DPDSGRRYRV TDPYLLLLNL TDFIAGMTDG YAVSLYKKLT GMALPHHG
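
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table by hand rather than relying on an external library; the amino-acid assignments used here are those of the standard genetic code, which translation table 11 shares (the two tables differ in which codons may initiate translation). The helper name `translate` and the choice to translate only the first and last lines of the gene sequence are illustrative assumptions.

```python
# Codon table in the conventional TCAG ordering (standard code; table 11
# uses the same codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as listed in the record (60 bases).
first_line = "ATGGAGCCTG AAGCAACGTC GATGGATTGG GAACGGTTGC TGAGCAAGCA ACGGCTCGGG"
# Last line of the gene sequence (27 bases, ending in the TGA stop codon).
last_line = "GGGATGGCTC TGCCCCACCA CGGCTGA"

print(translate(first_line))  # MEPEATSMDWERLLSKQRLG
print(translate(last_line))   # GMALPHHG
```

The two outputs match the first 20 and last 8 residues of the protein sequence listed above.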