Gene Information |
Locus tag | Mlg_2860 |
Symbol | |
ID | 4268598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3248549 |
End bp | 3251287 |
Gene Length | 2739 bp |
Protein Length | 912 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638127622 |
Product | DNA polymerase I |
Protein accession | YP_743690 |
Protein GI | 114322007 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
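
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is a minimal check, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 2739 bp) and that the last codon is a stop codon not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3248549, 3251287)
print(length, protein_length(length))  # 2739 912
```

Both values match the Gene Length and Protein Length rows of the record.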
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.129264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTGACG CCCGCAAGCC CCTGATTCTG GTGGACGGCT CCAGCTACCT CTATCGGGCC TTCCACGCCA TGCCGCCGCT CACCAACAAA GACGGTGAAC CCACCGGCGC CATGTACGGC GTGCTCAACA TGGTCCGCAA GCTGCTCGAC GACTACCGGC CGGAGCGGAT CGCGGTGGTC TTCGACGCCC CCGGGCGCAC CTTCCGTGAC GAGCTGTTCG ATCAGTACAA GGCCCACCGC CCGCCAATGC CGGACGAGTT GCGCGCCCAG ATCCAGCCGC TGAAGGACAT TATCCGCGCC ATGGGCCTGC CGCTGCTGGA GGTGCCCGGC GTGGAGGCCG ACGACGTCAT CGGCACCCTC GCCCGGCAGG CCGCTGAGGC CGGTCAGCCG GTGGTCATCT CCACCGGCGA TAAGGACATG GCCCAGCTGG TGGACGAGCA GGTCACCCTG CTCAATACCA TGAACGACAC CCGCCTGGAC GAGGCCGGGG TGAAGGAGAA GTTCGGCGTG CCGCCGGAGC GGATCATCGA CTACCTGGCC CTGGTCGGGG ACAGCTCCGA CAACATCCCC GGTGTCCCCC GCGTCGGCCC CAAGACCGCT GCCAAGTGGC TCAATCAGTT CGGCTCGCTG GACGCCCTCA AGGCCCGCGC CGATGAGGTC AAGGGCAAGG TGGGGGAGAG CCTGCGCGCG CACCTGGACG AGCTGGCGTT GAGCGAGGAC CTGGCCACCA TCCGCTGCGA CCTGGACCTG GACCAGCGCC CGGAGGACCT GAAACCCGGC GAATCGGACG TGGAGCGACT GCGCGAGTAC TACCAGCGGT ATGAGTTCCG CCGGCTGCTC CGCGAGCTGC TGAATGGTGA CGGCGACAGC GGCGGCGAGG TGGCCCCGGC CGGGCCGGGC GCGGCCGGTG GCGCCGGCGA GGGTGACGAC GCCCGCTACC ACACCGTGGA CGACGCGGAT GCCTTCGACG ACTGGTTGCG CCGGCTGGAG TCAGCCGAGC TGTTCGCCTT CGACCTGGAG ACCAGCAGCC TCAACTACAT GGATGCCGAG ATCGTCGGGG TGGCGCTGGC GGTGGGGGCG GGTGAGGCCG CCTACGTCCC GCTGGCCCAT GAAGGCCCCG ATACCCCGAC GCAACTCGAC CGCGACCGGG TGCTGGCCGC GCTCAAGCCG CTGCTCGAGG ACCCGGACCG CGCCAAGGTC GGGCAGAACC TCAAGTACGA CATGAGCGTG CTGGCCCGCT ACGACATCCA CCTGGAGGGT GTGGCCTACG ACACCATGCT CGAGTCCTAC GTGCTGGACT CCACCGCCAG CCGTCACGAT ATGGACTCCC TGGCCCTCAA GTACCTGGGC CGTGCCACCG TGAAATACGA GGATGTCTGC GGTAAGGGCG CCAAGCAGAT CCCCTTCGCC CAGGTGGCGG TGGAGACCGC CACACGCTAT GCCGGGGAGG ATGCCGATAT CACCCTGCGT CTGCACCAGA CGCTCTATCC GAGGCTCGAG GCCGAGGGAC GGCTGGTGCA GGTGTTCCAC GCTATCGAAA TGCCGTTGCT GCCGGTGCTC TCGCGCATGG AGCGTCACGG GGTGAAGGTG GACCGAGCAC TGCTGGAGCA GCAGAGTACG GAACTGGCCG AGGGCATGGC CGCGTTGGAA CAACGCGCCC ACGAGGAGGC GGAAGGACCC TTCAACCTCT CCAGCCCCAA GCAGATTCAG GAGATCCTGT TCGAACGCAT GGGCCTGCCG GTGCTGCAGA AGACCCCCAA GGGGGCGCCC TCCACCGCCG AGTCGGTGCT CGAGGAGCTG 
GCGGCACGCG GTTACGAGCT GCCGCGGTTG ATCCTGGCTT ACCGCAGTCT GGCCAAGCTG AAGACCACCT ACACCGACAA GCTGCCGCGG CTGATCCACC CGAAGACCGG CCGGGTGCAC ACCAGCTATC ATCAGGCGGT GGCCGCCACC GGGCGGCTGT CCAGCTCCGA TCCCAACCTG CAGAACATCC CGGTGCGTAC CGCCGAGGGC CGGCGCATCC GCAAGGCCTT CGTCGCCGAG CCCGGCTGCA AGCTGCTGGC GGCGGACTAC TCCCAGGTGG AGCTGCGGAT CATGGCCCAC CTGTCGGGGG ATGAGGGGCT GCGCCAAGCG TTCGCCGAGG GTGCGGACAT CCACAGCGCC ACCGCCGCCG AGGTCTTCGG TCTCGCACCC GAACGGGTGG GCGGCGAGCA GCGGCGGGCG GCCAAGGCCA TCAACTTTGG TCTGATCTAC GGCATGTCCG CCTACGGCCT GGCCCGGCAG CTGGGCATCG AGCGCGGCGA GGCCCAGGCC TACGTGGACC GCTATTTTGA GCGCTACCCC GGGGTCAAAG AGTACATGGA CCGCACCCGT GCCGAGGCCC GTGAACGCGG TTACGTGGAG ACGCTGTTCG GTCGCCGGCT CTACCTGCCC GAGATCAATG CCCGCAACCG CCAGCGCCGG GAGTATGCCG AGCGCACCGC CATCAATGCG CCGATGCAGG GCACCGCGGC CGATCTCATC AAGCGCGCCA TGGTGGCGGT GGACGCCTGG CTGACGGAGG CGCATTCCAA GGCGCGTATG GTCATGCAGG TCCACGACGA ACTGGTGCTG GAGGTGCCGG CAGCGGACGT GCCGGCGGTG GCGGAGGGGC TGCGCGAGCG CATGCAGGCG GCCGGGGAAC TGGCCGTGCC GCTGGAGGTC GATGTGGGCG TGGCCGACGA CTGGGAGGGG GCCCATTGA
Protein sequence | MSDARKPLIL VDGSSYLYRA FHAMPPLTNK DGEPTGAMYG VLNMVRKLLD DYRPERIAVV FDAPGRTFRD ELFDQYKAHR PPMPDELRAQ IQPLKDIIRA MGLPLLEVPG VEADDVIGTL ARQAAEAGQP VVISTGDKDM AQLVDEQVTL LNTMNDTRLD EAGVKEKFGV PPERIIDYLA LVGDSSDNIP GVPRVGPKTA AKWLNQFGSL DALKARADEV KGKVGESLRA HLDELALSED LATIRCDLDL DQRPEDLKPG ESDVERLREY YQRYEFRRLL RELLNGDGDS GGEVAPAGPG AAGGAGEGDD ARYHTVDDAD AFDDWLRRLE SAELFAFDLE TSSLNYMDAE IVGVALAVGA GEAAYVPLAH EGPDTPTQLD RDRVLAALKP LLEDPDRAKV GQNLKYDMSV LARYDIHLEG VAYDTMLESY VLDSTASRHD MDSLALKYLG RATVKYEDVC GKGAKQIPFA QVAVETATRY AGEDADITLR LHQTLYPRLE AEGRLVQVFH AIEMPLLPVL SRMERHGVKV DRALLEQQST ELAEGMAALE QRAHEEAEGP FNLSSPKQIQ EILFERMGLP VLQKTPKGAP STAESVLEEL AARGYELPRL ILAYRSLAKL KTTYTDKLPR LIHPKTGRVH TSYHQAVAAT GRLSSSDPNL QNIPVRTAEG RRIRKAFVAE PGCKLLAADY SQVELRIMAH LSGDEGLRQA FAEGADIHSA TAAEVFGLAP ERVGGEQRRA AKAINFGLIY GMSAYGLARQ LGIERGEAQA YVDRYFERYP GVKEYMDRTR AEARERGYVE TLFGRRLYLP EINARNRQRR EYAERTAINA PMQGTAADLI KRAMVAVDAW LTEAHSKARM VMQVHDELVL EVPAADVPAV AEGLRERMQA AGELAVPLEV DVGVADDWEG AH
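
The protein sequence corresponds to the gene sequence read codon by codon from its first base. As a quick consistency check, the sketch below translates only the first 30 bases of the gene sequence. It uses a deliberately partial codon table covering just the codons found in that prefix; these assignments are identical in the standard genetic code and in translation table 11. The names `CODONS` and `translate` are illustrative, and a complete translation would need the full 64-codon table.

```python
# Partial codon table: only the codons present in the first 30 bases.
# Any other codon raises KeyError, which is intended for this sketch.
CODONS = {
    "ATG": "M", "TCT": "S", "GAC": "D", "GCC": "A", "CGC": "R",
    "AAG": "K", "CCC": "P", "CTG": "L", "ATT": "I",
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


prefix = "ATGTCTGACG CCCGCAAGCC CCTGATTCTG"  # first 30 bases of the gene sequence
print(translate(prefix))  # MSDARKPLIL
```

The result matches the first ten residues of the protein sequence listed in the record.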