Gene Information |
Locus tag | Mlg_1175 |
Symbol | |
ID | 4269114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1370677 |
End bp | 1373754 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638125924 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_742014 |
Protein GI | 114320331 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.286001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0989959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATCG ACTACGCCGA ACTCCACTGC CTGTCCTGCT TCAGCTTCCT GCGCGGTGCC TCTCAGCCGG CGGAACTGGT GCAGCGGGCC GCCGAGTTGG GCTACCGCGC CCTGGCCCTC ACCGACGCGT GCTCGGTGGC CGGGGCGGTG CGCGCCCACC AGGCGGCCAA AGAGACGGAC CTGCACCTGA TCCACGGCAG CGAGATCCGC ATTCACCAGG GCCCGCTGCT GGTCTTGCTC GCCCCCTGCC GCCGGGCCTG GGCGGAACTC TGCGCGCTTA TCAGCCTGGG CCGCAGTCAG GCCCGCAAGG GGGACTACCG TCTGGAACGG GAGCAGTTGG AGGGCACCCT GCCCCACTGC CTGGCCCTGT GGGTGCCTGA CGATGCACCG CACGACGCCG AACAGGGCCG ATGGTTCGCC CGCCATTTCG GCGATCGGGG CCACGTGGCA GTGGCCCTGC ACCACGGCCC CGACGACGAG GCCCGCCTGC AGCGGCTACT GGCCCTGGCG GATCGCTTCC GGCTCCCGGC GGTGGCCGCC GGCGGCGTGC TGATGCACCG GCGCGGCCGA CGCGCCCTGC AGGACACCCT CTCCGCCCTG CGCCACCGCC GCACCCTGGC GGCGATGGGC ACGGCGCTGG AGTGCAGCGG CGAGCGCCAT CTGCGGTCGC TGCACAGCCT GGCCCGGCTC TACCCCCCAG CGCTGCTGCG CCGCAGCGTG CACCTGGCGG ACCAGTGCCG CTTCAGCCTG GATAAACTGC GCTACGAATA CCCGGCCGAG CTGGTGCCCG CCGGCGAGAC CCCCGCCAGC TATCTGCGCC GGCTCACCCT GGAGGGCGCC CGCCGCCACT GGCCCCAAGG CATGCCCGAC AAGGTCGCCC ACCAGGTGGA CCACGAACTG GCCCTGATTG CCGAGATGGG TTACGAACCC TTCTTTCTCA CCGTGCACGA TGTGGTCGCC TTCGCCCGGC GCCGGGGCAT CCTCTGCCAG GGCCGGGGCT CCTCGGCCAA CTCGGCGGTC TGTTTCTGCC TGGGGATCAC CGCGGTGGAC CCGGCCCGCC AGTCGCTGCT CTTCGAGCGC TTCATCTCCA AGGAGCGGGG TGAGCCGCCG GACATCGACG TGGACTTCGA GCACGAACGG CGTGAGGAGG TCATCCAGTA CATCTACCGC AAGTACGGCC GCCACCGCGC CGCCCTGGCC GCCACGGTAA TCCGCTACCG CCCGCGCAGC GCCCTGCGCG ACGCCGGCCG CGCCCTGGGG CTGGACGCCG CGACCATCGA CCGGCTGGCC GGAAGCATCC AGTGGTGGGA CGGCAAGCGG GTGGACCCGG AGCGGCTGCG CGAGGCCGGC CTGAACCCGG ACGACCCGCG CCTGGCCCGC ACCGTGGCCA TCGCCGGGCA ACTGCTGGGT CTGCCCCGCC ACCTCTCCCA GCACGTGGGG GGGTTCGTGA TCTCGGAGGG CCCCATCAGC GAGCTGGTAC CCACCGAGAA CGCCGCCATG GCCGGGCGCA CCATCATCCA GTGGGACAAG GACGACCTGG AGGCCCTGGG CCTGCTCAAG GTGGACGTGC TGGCCCTGGG CATGCTCAGT TGCATCCGCC GCGCCTTCGA CCTGCTGGCC GGGTTCCGCG GTCGCCGGCT TACCCTGGCC GACGTGCCGG CGGAGGACCC GGCGGTCTAC CGCATGATCA GCGACGCCGA CACCATGGGC GTGTTCCAGA TCGAGTCCCG CGCCCAGATG GCCATGCTGC CCCGGCTGCG GCCGCAGACC TTCTACGACC TGGTGATCGA GGTGGCCATC 
GTCCGTCCCG GCCCCATCCA GGGGGACATG GTCCACCCCT ATCTGCGCCG CCGGGAGGGC CTGGAGCCGG TGGATTACCC CAGCGAAGCC GTGCGCGGGG TGCTGGCGCG CACCCTGGGC GTGCCCATCT TCCAGGAACA GGTAATGCAG TTGGCGGTGG TGGCGGCGGG CTTCACCCCC GGCGAGGCGG ACGCCCTGCG CCGGGCCATG GCCGCCTGGA AGCGCAAGGG CGGGCTGGGC CCCTTCCGGG ACAAGCTGCT CAAGGGCATG CGCCGCAATG GTTACTGCGA GGACTACGCC GAGCGGCTCT TCCGGCAGAT CCAGGGCTTC GGCGAGTACG GCTTCCCGGA GTCCCACGCC GCCAGCTTCG CCCTGCTGGT CTACGTCTCC GCCTGGCTCA AGTGCCATGA GCCGGCCCTC TTCACCTGCG CGCTGCTCAA CAGCCAGCCC ATGGGCTTCT ACGCCCCGGC CCAGTTACTG CGGGACGCCG AACGCCACGG GGTGGAGATC CGCCCGGTGG ACGTGCGCCA CAGCGACTGG GACTGCAGCC CGGAGCACCG CGGCGACGGC GAACCGGCCC TGCGCCTGGG CCTGCGCCTG GTGCGGGGCC TCAACCGCCG GGCGGCGGAC CGGCTGATCG CCGCCCGCGG CCGGCGCCCC TTCCGCGACG TGCAGGAGAT GGCCCGCCGC GCCGCCCTGC ACCGCCGGGA TCTGGAGACC CTGGCCCACG CCGGCGCCCT GCGCGGCCTG GCCGGTCACC GCCGCGCGGC CTGGTGGCAG GTGCTGGGCG CCGAAGCCGG CCTGCCGGTG TTTGAGGATC TGCACATCGA GGAGGCGGCG CCGGCGCTGG ACGCCCCCGC CGAGGGCGAG GACCTGGTGG CCGACTACAC CAGCCTGGGC TTCACCCTGG GCCGCCACCC CCTGGCCCTG CTCCGCCCGC AGTTGCGACG CCGCCGGCTG CTGACCGCTG CCGATCTGGC CAGCACCGGC CACGGCCGCC TGGTCCGCAC CGCCGGGTTG GTCATCAACC GCCAACGCCC CGGCTCTGCC GGCGGCGTCA CCTTCCTGAC ACTGGAGGAC GAGACCGGGC AGATCAACCT GGTGGTCTGG AAGGCCACCG CCGAGGCCCA GCGCCGCACC CTGCTGGCGG CCCGGCTGCT GATGGTCAGC GGCATCTGGG AGCGCAAGGG GGCGGTCACC CACCTGGTGG CCGGACGGCT GGAGGACTGG AGCGACTGGC TCGGGGCGTT GGACGTCCGT TCGCGGGATT TCCACTGA
|
Protein sequence | MPIDYAELHC LSCFSFLRGA SQPAELVQRA AELGYRALAL TDACSVAGAV RAHQAAKETD LHLIHGSEIR IHQGPLLVLL APCRRAWAEL CALISLGRSQ ARKGDYRLER EQLEGTLPHC LALWVPDDAP HDAEQGRWFA RHFGDRGHVA VALHHGPDDE ARLQRLLALA DRFRLPAVAA GGVLMHRRGR RALQDTLSAL RHRRTLAAMG TALECSGERH LRSLHSLARL YPPALLRRSV HLADQCRFSL DKLRYEYPAE LVPAGETPAS YLRRLTLEGA RRHWPQGMPD KVAHQVDHEL ALIAEMGYEP FFLTVHDVVA FARRRGILCQ GRGSSANSAV CFCLGITAVD PARQSLLFER FISKERGEPP DIDVDFEHER REEVIQYIYR KYGRHRAALA ATVIRYRPRS ALRDAGRALG LDAATIDRLA GSIQWWDGKR VDPERLREAG LNPDDPRLAR TVAIAGQLLG LPRHLSQHVG GFVISEGPIS ELVPTENAAM AGRTIIQWDK DDLEALGLLK VDVLALGMLS CIRRAFDLLA GFRGRRLTLA DVPAEDPAVY RMISDADTMG VFQIESRAQM AMLPRLRPQT FYDLVIEVAI VRPGPIQGDM VHPYLRRREG LEPVDYPSEA VRGVLARTLG VPIFQEQVMQ LAVVAAGFTP GEADALRRAM AAWKRKGGLG PFRDKLLKGM RRNGYCEDYA ERLFRQIQGF GEYGFPESHA ASFALLVYVS AWLKCHEPAL FTCALLNSQP MGFYAPAQLL RDAERHGVEI RPVDVRHSDW DCSPEHRGDG EPALRLGLRL VRGLNRRAAD RLIAARGRRP FRDVQEMARR AALHRRDLET LAHAGALRGL AGHRRAAWWQ VLGAEAGLPV FEDLHIEEAA PALDAPAEGE DLVADYTSLG FTLGRHPLAL LRPQLRRRRL LTAADLASTG HGRLVRTAGL VINRQRPGSA GGVTFLTLED ETGQINLVVW KATAEAQRRT LLAARLLMVS GIWERKGAVT HLVAGRLEDW SDWLGALDVR SRDFH
|
| |
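The coordinate and length fields in this record are related by simple arithmetic, and the following minimal sketch shows one way to cross-check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical helpers written for illustration; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(1370677, 1373754)
    print(length)                              # 3078, matching "Gene Length"
    print(expected_protein_length_aa(length))  # 1025, matching "Protein Length"
    # GC fraction of the first two 10-base blocks of the gene sequence only.
    print(gc_fraction("ATGCCCATCG ACTACGCCGA"))
```

Passing the full gene sequence string to `gc_fraction` would give a value to compare against the reported GC content, bearing in mind that the record rounds to a whole percent.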