Gene Mlg_2860 details


Gene Information

Locus tag: Mlg_2860
Symbol:
ID: 4268598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3248549
End bp: 3251287
Gene Length: 2739 bp
Protein Length: 912 aa
Translation table: 11
GC content: 68%
IMG OID: 638127622
Product: DNA polymerase I
Protein accession: YP_743690
Protein GI: 114322007
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
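
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
start_bp, end_bp = 3248549, 3251287

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2739
print(protein_length_aa(length))  # 912
```

Both results agree with the Gene Length and Protein Length fields listed in the record.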


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.129264
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACG CCCGCAAGCC CCTGATTCTG GTGGACGGCT CCAGCTACCT CTATCGGGCC 
TTCCACGCCA TGCCGCCGCT CACCAACAAA GACGGTGAAC CCACCGGCGC CATGTACGGC
GTGCTCAACA TGGTCCGCAA GCTGCTCGAC GACTACCGGC CGGAGCGGAT CGCGGTGGTC
TTCGACGCCC CCGGGCGCAC CTTCCGTGAC GAGCTGTTCG ATCAGTACAA GGCCCACCGC
CCGCCAATGC CGGACGAGTT GCGCGCCCAG ATCCAGCCGC TGAAGGACAT TATCCGCGCC
ATGGGCCTGC CGCTGCTGGA GGTGCCCGGC GTGGAGGCCG ACGACGTCAT CGGCACCCTC
GCCCGGCAGG CCGCTGAGGC CGGTCAGCCG GTGGTCATCT CCACCGGCGA TAAGGACATG
GCCCAGCTGG TGGACGAGCA GGTCACCCTG CTCAATACCA TGAACGACAC CCGCCTGGAC
GAGGCCGGGG TGAAGGAGAA GTTCGGCGTG CCGCCGGAGC GGATCATCGA CTACCTGGCC
CTGGTCGGGG ACAGCTCCGA CAACATCCCC GGTGTCCCCC GCGTCGGCCC CAAGACCGCT
GCCAAGTGGC TCAATCAGTT CGGCTCGCTG GACGCCCTCA AGGCCCGCGC CGATGAGGTC
AAGGGCAAGG TGGGGGAGAG CCTGCGCGCG CACCTGGACG AGCTGGCGTT GAGCGAGGAC
CTGGCCACCA TCCGCTGCGA CCTGGACCTG GACCAGCGCC CGGAGGACCT GAAACCCGGC
GAATCGGACG TGGAGCGACT GCGCGAGTAC TACCAGCGGT ATGAGTTCCG CCGGCTGCTC
CGCGAGCTGC TGAATGGTGA CGGCGACAGC GGCGGCGAGG TGGCCCCGGC CGGGCCGGGC
GCGGCCGGTG GCGCCGGCGA GGGTGACGAC GCCCGCTACC ACACCGTGGA CGACGCGGAT
GCCTTCGACG ACTGGTTGCG CCGGCTGGAG TCAGCCGAGC TGTTCGCCTT CGACCTGGAG
ACCAGCAGCC TCAACTACAT GGATGCCGAG ATCGTCGGGG TGGCGCTGGC GGTGGGGGCG
GGTGAGGCCG CCTACGTCCC GCTGGCCCAT GAAGGCCCCG ATACCCCGAC GCAACTCGAC
CGCGACCGGG TGCTGGCCGC GCTCAAGCCG CTGCTCGAGG ACCCGGACCG CGCCAAGGTC
GGGCAGAACC TCAAGTACGA CATGAGCGTG CTGGCCCGCT ACGACATCCA CCTGGAGGGT
GTGGCCTACG ACACCATGCT CGAGTCCTAC GTGCTGGACT CCACCGCCAG CCGTCACGAT
ATGGACTCCC TGGCCCTCAA GTACCTGGGC CGTGCCACCG TGAAATACGA GGATGTCTGC
GGTAAGGGCG CCAAGCAGAT CCCCTTCGCC CAGGTGGCGG TGGAGACCGC CACACGCTAT
GCCGGGGAGG ATGCCGATAT CACCCTGCGT CTGCACCAGA CGCTCTATCC GAGGCTCGAG
GCCGAGGGAC GGCTGGTGCA GGTGTTCCAC GCTATCGAAA TGCCGTTGCT GCCGGTGCTC
TCGCGCATGG AGCGTCACGG GGTGAAGGTG GACCGAGCAC TGCTGGAGCA GCAGAGTACG
GAACTGGCCG AGGGCATGGC CGCGTTGGAA CAACGCGCCC ACGAGGAGGC GGAAGGACCC
TTCAACCTCT CCAGCCCCAA GCAGATTCAG GAGATCCTGT TCGAACGCAT GGGCCTGCCG
GTGCTGCAGA AGACCCCCAA GGGGGCGCCC TCCACCGCCG AGTCGGTGCT CGAGGAGCTG
GCGGCACGCG GTTACGAGCT GCCGCGGTTG ATCCTGGCTT ACCGCAGTCT GGCCAAGCTG
AAGACCACCT ACACCGACAA GCTGCCGCGG CTGATCCACC CGAAGACCGG CCGGGTGCAC
ACCAGCTATC ATCAGGCGGT GGCCGCCACC GGGCGGCTGT CCAGCTCCGA TCCCAACCTG
CAGAACATCC CGGTGCGTAC CGCCGAGGGC CGGCGCATCC GCAAGGCCTT CGTCGCCGAG
CCCGGCTGCA AGCTGCTGGC GGCGGACTAC TCCCAGGTGG AGCTGCGGAT CATGGCCCAC
CTGTCGGGGG ATGAGGGGCT GCGCCAAGCG TTCGCCGAGG GTGCGGACAT CCACAGCGCC
ACCGCCGCCG AGGTCTTCGG TCTCGCACCC GAACGGGTGG GCGGCGAGCA GCGGCGGGCG
GCCAAGGCCA TCAACTTTGG TCTGATCTAC GGCATGTCCG CCTACGGCCT GGCCCGGCAG
CTGGGCATCG AGCGCGGCGA GGCCCAGGCC TACGTGGACC GCTATTTTGA GCGCTACCCC
GGGGTCAAAG AGTACATGGA CCGCACCCGT GCCGAGGCCC GTGAACGCGG TTACGTGGAG
ACGCTGTTCG GTCGCCGGCT CTACCTGCCC GAGATCAATG CCCGCAACCG CCAGCGCCGG
GAGTATGCCG AGCGCACCGC CATCAATGCG CCGATGCAGG GCACCGCGGC CGATCTCATC
AAGCGCGCCA TGGTGGCGGT GGACGCCTGG CTGACGGAGG CGCATTCCAA GGCGCGTATG
GTCATGCAGG TCCACGACGA ACTGGTGCTG GAGGTGCCGG CAGCGGACGT GCCGGCGGTG
GCGGAGGGGC TGCGCGAGCG CATGCAGGCG GCCGGGGAAC TGGCCGTGCC GCTGGAGGTC
GATGTGGGCG TGGCCGACGA CTGGGAGGGG GCCCATTGA
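
The gene sequence above is laid out in space-separated blocks of ten bases. The following is a minimal sketch of how such a listing could be cleaned, its GC fraction computed, and its reading frame translated. It assumes the listing is the coding strand in frame from the first base. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs only in which codons may act as starts. All function names are illustrative.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base each cycle T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(raw: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean("ATGTCTGACG CCCGCAAGCC CCTGATTCTG")
print(translate(fragment))    # MSDARKPLIL
print(gc_fraction(fragment))  # 0.6
```

The translated fragment matches the first ten residues of the protein sequence in this record. Passing the full gene sequence through the same functions would give the GC fraction and translation for the whole gene.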
 
Protein sequence
MSDARKPLIL VDGSSYLYRA FHAMPPLTNK DGEPTGAMYG VLNMVRKLLD DYRPERIAVV 
FDAPGRTFRD ELFDQYKAHR PPMPDELRAQ IQPLKDIIRA MGLPLLEVPG VEADDVIGTL
ARQAAEAGQP VVISTGDKDM AQLVDEQVTL LNTMNDTRLD EAGVKEKFGV PPERIIDYLA
LVGDSSDNIP GVPRVGPKTA AKWLNQFGSL DALKARADEV KGKVGESLRA HLDELALSED
LATIRCDLDL DQRPEDLKPG ESDVERLREY YQRYEFRRLL RELLNGDGDS GGEVAPAGPG
AAGGAGEGDD ARYHTVDDAD AFDDWLRRLE SAELFAFDLE TSSLNYMDAE IVGVALAVGA
GEAAYVPLAH EGPDTPTQLD RDRVLAALKP LLEDPDRAKV GQNLKYDMSV LARYDIHLEG
VAYDTMLESY VLDSTASRHD MDSLALKYLG RATVKYEDVC GKGAKQIPFA QVAVETATRY
AGEDADITLR LHQTLYPRLE AEGRLVQVFH AIEMPLLPVL SRMERHGVKV DRALLEQQST
ELAEGMAALE QRAHEEAEGP FNLSSPKQIQ EILFERMGLP VLQKTPKGAP STAESVLEEL
AARGYELPRL ILAYRSLAKL KTTYTDKLPR LIHPKTGRVH TSYHQAVAAT GRLSSSDPNL
QNIPVRTAEG RRIRKAFVAE PGCKLLAADY SQVELRIMAH LSGDEGLRQA FAEGADIHSA
TAAEVFGLAP ERVGGEQRRA AKAINFGLIY GMSAYGLARQ LGIERGEAQA YVDRYFERYP
GVKEYMDRTR AEARERGYVE TLFGRRLYLP EINARNRQRR EYAERTAINA PMQGTAADLI
KRAMVAVDAW LTEAHSKARM VMQVHDELVL EVPAADVPAV AEGLRERMQA AGELAVPLEV
DVGVADDWEG AH