Gene Mlg_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1849 
Symbol 
ID4269217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2106288 
End bp2109836 
Gene Length3549 bp 
Protein Length1182 aa 
Translation table11 
GC content68% 
IMG OID638126605 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_742683 
Protein GI114321000 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.134668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC AGAGCGGTTT CGTACACCTG AATGTCCACT CGGAGTTCTC CCTCTCCGAC 
GGTATCCTGC GTATCCCCGA GCTGGTGGAC GCGGTGGCCG GGCAGGGTAT GCCGGCGGTG
GCCATCACCG ATCAGGGCAA CCTGTTCGGC ATGGTCAAGT TCTACAAGGC GGCGCTGGCA
GCGGGGGTCA AGCCCATTGT CGGTGCCGAG ATCCGGGTTG AGGACGAGGA GGCGGAACGG
GACCACTCCG CGCTCACCCT CCTGTGCCGC GATCTGGACG GCTATGCCGC TCTGTCCAGG
CTGCTCTCGC TGAGCTACCA GCAGGGCCGC ACCGTGGACG ACACCCCCGT GGTCCAACGT
GCCTGGCTGG AGGCCGACCA CGAGGGCCTC ATCGTGCTCT CCGGTGGCAA GGACGGTGAT
GTGGGCAAGG CCCTGCTGGC CGGCAACACA CGCCGCGCCG AAAGCCTGGC CCGCTGGTAT
GCCGGGCACT TCCCCGACGC CTATTTCCTG GAGCTGCAGC GCACCGGCCG TCCCGGCGAT
GAGGACCACC TGCACGCCGC CGTGGCGCTG GCCACCGAGC TGGCCTTGCC GGTGGTGGCG
ACCAACGCCG TGCGCTTCCT CGCTGCCGAC GATTACGACG CCCATGAGGT GCGGGTCTGT
ATCCACGACG GCTACACCCT CGACGACCCA AAGCGCCCGC GCCGTTACAC CAACCAGCAG
TACCTGCGCA GCCCGGCGGA GATGGGGGCG CTGTTTTCCG ATCTGCCCGA GGCGCTGGAG
AATACCGTCG AGATCGCCCG GCGCTGCAGC CTGGATCTGA CCTTGGGCGA GAACGTCCTG
CCCGATTTCC CCATCCCCGA GGGATTGACC ATTGAGGAGT TCCTGCACCA GGAGGCCCTG
CGCGGGCTCG AGCAGCGGCT GGAGGAGCGC GGCCTGGCGG AGGGTGAGAC CGAGGAGGGC
GTGCGCGAGA CCTACCGGCA GCGGCTGGAC TGGGAGCTGG GGATCATCAA TCAGATGGGC
TTCCCCGGCT ACTTCCTGAT CGTGGCCGAT TTCATCCGCT GGGCCAAGGA ACACGATATC
CCGGTGGGCC CGGGGCGCGG CTCCGGGGCG GGGTCGCTGG TGGCCTACGC CCTGGCGATC
ACCGATCTCG ATCCGCTGCG CTACGATCTG CTGTTTGAGC GCTTTCTCAA TCCCGAACGC
GTCTCCATGC CCGACTTCGA CGTGGACTTC TGCATGGAGC AGCGTGACGA GGTCATCGAG
TACGTGGCCC GGCGCTACGG CCGCGAGAAG GTGTCGCAGA TCGCCACCCA CGGCACCATG
GCCGCCCGCG CCGTGGTGCG CGATGTGGGC CGGGTGCTGG GCCATGGCTA CGGCTACGTC
GACCGCATCG CCAAACTGGT GCCCTTCGAA CCGGGCATGC AGCTGACCAA GGCCTTCGAG
CTGGAGCCGG AGCTCAAGGC GCTCTATGAA AAGGACGAGG AGGCCGGCCC GCTGCTGAGC
ATGGCCCTCA AGCTGGAGGG CCTGGCCCGT AATGTCGGTA AGCACGCCGG CGGGGTGGTC
ATCGCCCCCA CGGCGCTGAC CGACTTCTCG CCGCTCTACT GCGAACCGGG CGGTGAGGGG
TTGGCCACCC AGTACGACAA GGACGATGTG GAGGAGGTGG GCCTGGTCAA GTTCGACTTC
CTGGGCCTGC GCACGCTCAC CATCATCGAC TGGACGGTGA AGGCGGTGAA TGCCCTGCGC
GAACAGCGGG GAGAGACCCC GCTGGATATC GCCCGGATCC CGTTGGACGA CCGCGCCACC
TTCGACCTGC TCAAGCGCTG CCAGACCACC GCGGTGTTCC AGTTGGAATC CCGCGGCATG
AAGGAGCTGA TCAAACGCCT GCAGCCCGAC AGCTTTGAGG ACATCATCGC CCTGGTGGCG
CTGTTCCGCC CCGGCCCGCT GCAGTCGGGC ATGGTGGACG ACTTTATCGA CCGCAAGCAC
GGCCGGGCGC AGGTGGCCTA CCCGACCCCG GAACTGCACC ACGACGACCT GGAGCCGATC
CTCAAGCCCA CTTACGGCGT CATCCTCTAC CAGGAGCAGG TGATGCAGAT CGCCCAGGCC
CTGGCCGGTT ACAGCCTGGG GGCGGCCGAT CTGCTGCGCC GGGCGATGGG CAAGAAGAAG
GCGGCCGAGA TGGCCAAGCA GCGGGAGATC TTCCTCAAGG GCGCGCAGGA GCACGGCCTG
AGCGAGGCGC ACGCCGGCGC CATCTTTGAC CTGATGGAGA AGTTCGCCGG TTACGGCTTC
AACAAATCCC ACTCCGCGGC CTACGCACTG CTCTCCTACC AGACCGCCTG GCTCAAGCAC
CACTACCCGG CGCCCTTCAT GGCCTCGGTG CTCTCCTCGG ACATGGATAA CACCGACAAG
GTGGTGATCT TCATCGAAGA GTGCCGGGAG ATGGGGCTCA CCGTGCGCCC GCCGGACGTG
AACCGCTCCG GGTACCGGTT CCGCGCCGAG GACGAACAGA CCATCATCTA CGGCCTCGGT
GCGGTGAAAG GGGTGGGGCA GTCGGCCCTG GACGCCATCA TTGAGGAGCG CGACGCCCAC
GGCCCGTTCA AGGACCTGCA GGACCTGTGC AACCGGGTGG ACCTGCGCAA GGTGAACCGC
CGGGTGCTGG AGGCCCTCTG CCGCTCCGGC AGCCTCGACA GCCTGATCCC CAACCGCGCC
ACCGGGATGG CCTGGCTACC GGAGGCCCTG GCGGCGGCCG AACAGAAGAC CCGTAACGCC
GCCGCCGGTC AGGAGGACCT GTTTGGCCTG CCCGGCGACG GCGCGGCCGT GGCGGAAGCG
GAGGAGCCGC ATTTTTCCAC CCCGGAGCAG GCGGAGTGGG ATGAGCACGA CCGGCTGGCC
GCCGAGAAGG AGACCCTGGG GCTGTTCCTG ACCGGCCATC CCATCGACCC CTACGAGGCC
GAACTGTCCC ATCTGGCCGA ACGGCGCATC GCCCAGGTGC TGGCAGCGGC CGGAGAGCCG
GGCGAGAAGC CGGAGAACGG CCGGGGCCGG CGGCGCAACG GCCCCAGCGT CCGGGTGGTG
GGGTTGGTGG TGTCCCTGCG TACCCGCAAC ACCGCCAGCG GCGGGCGCAT GGCCAGCCTG
GTGCTGGATG ACCGCACCGC GCGCATGGAG GCCATGCTCT TTCCTGAGGC CTACGAGCGG
CTGCGCGGGC TGATCGCGGT GGACCGGGTG CTGGTGGTCC AGGGCAGTCT CGATTACGAC
GACTTCGCCG GCGGCTGGCG GATCACCGTG GAGCAGTTGC AGGACATCAA CGACGCCCGC
GCCGAGCGGC TGCGCCAGGT GGTCATTGAG CTCTCGGCGG CCATTGCGGA CAACGGCTTC
GCCGACGCCC TGGAGGAAAC CCTGAAGCCC TATCGCCAGG GGCGGTGTGA CGTGCATCTG
GACTACCGTG GCGAGCGGGC CAGCGGGCGC CTGCGCCTGG GTGACGACTG GCGGGTGCAC
CCCAGTGACG ACCTGTTGCG CCGTCTCGAG CGACTGGCGG GCCCGGCGCG GGTGCGCCTG
GAGTATTGA
 
Protein sequence
MSAQSGFVHL NVHSEFSLSD GILRIPELVD AVAGQGMPAV AITDQGNLFG MVKFYKAALA 
AGVKPIVGAE IRVEDEEAER DHSALTLLCR DLDGYAALSR LLSLSYQQGR TVDDTPVVQR
AWLEADHEGL IVLSGGKDGD VGKALLAGNT RRAESLARWY AGHFPDAYFL ELQRTGRPGD
EDHLHAAVAL ATELALPVVA TNAVRFLAAD DYDAHEVRVC IHDGYTLDDP KRPRRYTNQQ
YLRSPAEMGA LFSDLPEALE NTVEIARRCS LDLTLGENVL PDFPIPEGLT IEEFLHQEAL
RGLEQRLEER GLAEGETEEG VRETYRQRLD WELGIINQMG FPGYFLIVAD FIRWAKEHDI
PVGPGRGSGA GSLVAYALAI TDLDPLRYDL LFERFLNPER VSMPDFDVDF CMEQRDEVIE
YVARRYGREK VSQIATHGTM AARAVVRDVG RVLGHGYGYV DRIAKLVPFE PGMQLTKAFE
LEPELKALYE KDEEAGPLLS MALKLEGLAR NVGKHAGGVV IAPTALTDFS PLYCEPGGEG
LATQYDKDDV EEVGLVKFDF LGLRTLTIID WTVKAVNALR EQRGETPLDI ARIPLDDRAT
FDLLKRCQTT AVFQLESRGM KELIKRLQPD SFEDIIALVA LFRPGPLQSG MVDDFIDRKH
GRAQVAYPTP ELHHDDLEPI LKPTYGVILY QEQVMQIAQA LAGYSLGAAD LLRRAMGKKK
AAEMAKQREI FLKGAQEHGL SEAHAGAIFD LMEKFAGYGF NKSHSAAYAL LSYQTAWLKH
HYPAPFMASV LSSDMDNTDK VVIFIEECRE MGLTVRPPDV NRSGYRFRAE DEQTIIYGLG
AVKGVGQSAL DAIIEERDAH GPFKDLQDLC NRVDLRKVNR RVLEALCRSG SLDSLIPNRA
TGMAWLPEAL AAAEQKTRNA AAGQEDLFGL PGDGAAVAEA EEPHFSTPEQ AEWDEHDRLA
AEKETLGLFL TGHPIDPYEA ELSHLAERRI AQVLAAAGEP GEKPENGRGR RRNGPSVRVV
GLVVSLRTRN TASGGRMASL VLDDRTARME AMLFPEAYER LRGLIAVDRV LVVQGSLDYD
DFAGGWRITV EQLQDINDAR AERLRQVVIE LSAAIADNGF ADALEETLKP YRQGRCDVHL
DYRGERASGR LRLGDDWRVH PSDDLLRRLE RLAGPARVRL EY