Gene Mlg_0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0452 
Symbol 
ID4270800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp509112 
End bp513392 
Gene Length4281 bp 
Protein Length1426 aa 
Translation table11 
GC content66% 
IMG OID638125192 
ProductDNA-directed RNA polymerase subunit beta' 
Protein accessionYP_741296 
Protein GI114319613 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACT TGCTGAATCT GTTCAAGCAA CCGGGCGCAC AGCTCGAGGA CTTCGACGCC 
ATCCGTATCG GACTGGCGTC GCCGGAGATG ATCCGGTCCT GGTCCTACGG CGAGGTGAAA
AAGCCCGAGA CCATCAACTA CCGCACCTTC AAGCCGGAGC GCGACGGCCT GTTCTGCGCC
AAGATCTTCG GCCCGGTGAA GGACTACGAG TGCCTTTGCG GCAAGTACAA GCGGCTCAAG
CACCGGGGCG TGGTGTGCGA GAAGTGCGGG GTGGAGGTCA CCATCGCCAA GGTGCGCCGG
GAGCGCATGG GTCATATTGA CCTGGCCAGC CCGGTCGCCC ACATCTGGTT CCTGAAGAGC
CTGCCCTCGC GCATCGGGCT GCTGCTGGAT ATGACCCTGC GGGACATCGA GCGCATCCTT
TACTTCGAGG CGTTCGTGGT CATCGAGCCG GGTATGACCC CGTTGGAGCG CGGCCAACTG
CTCTCCGACG AGGCCTACCT GGACGCCATC GAGCAGCACG GCGATGAGTT CGAGGCCAAG
ATGGGCGCCG AGGCGGTCCT CGACCTGCTC AAGAGCCTGG ACATGACCGG CGAGGCCCGG
ACCCTGCGCG AGGAGATCGA GGGCACCAAC TCCGAGTCCA AGATCAAGCG CCTGTCCAAG
CGGCTGAAAC TGATCGAGGC CTTCCTCGAG TCCGGCAACA AGCCGGAGTG GATGATCATG
GACGTGCTGC CGGTGCTGCC ACCGGACCTG CGTCCGCTGG TGCCGCTGGA CGGCGGCCGG
TTCGCCACTT CCGATCTGAA CGACCTGTAT CGCCGGGTCA TTAACCGGAA CAACCGGCTG
AAGCGGCTGC TGGAGCTGTC GGCCCCCGAC ATCATCGTGC GCAACGAAAA GCGCATGCTG
CAGGAGTCCG TGGACGCACT GCTGGACAAC GGCCGGCGCG GCCGGGCCAT TACCGGTACC
AACAAGCGCC CGCTGAAGTC CCTGGCCGAC ATGATCAAGG GCAAGCAGGG CCGGTTCCGG
CAGAACCTGC TGGGCAAGCG CGTCGATTAC TCCGGCCGTT CGGTCATCGT GGTCGGTCCG
ACCCTGCGCC TGCACCAGTG CGGCCTGCCC AAGCGCATGG CGCTGGAGCT GTTCAAGCCG
TTCATCTTCT CCAAGCTGCA ACTGCGCGGC CTGGCCACCA CCATCAAGGC GGCCAAGAAG
ATGGTCGAGC GCGAGACCGG CGAGGTCTGG GACATCCTCT CGGAGGTGAT CCGCGAGCAC
CCGGTCATGC TCAACCGTGC GCCCACGTTG CACCGCCTGG GTATTCAGGC GTTCGAGCCG
GTGCTCATCG AGGGCAAGGC CATCCAGCTC CACCCGCTGG TCTGCACCGC CTTCAACGCC
GACTTTGACG GCGACCAGAT GGCCGTGCAC GTGCCACTGT CGCTGGAGGC GCAGCTGGAA
GCCCGCGCCC TGATGATGTC CACCAACAAC ATCCTGTCGC CGGCCAGTGG TGAGCCCATC
ATCGTCCCCT CGCAGGACGT GGTGCTGGGC CTCTATTACA TGACCCGTGA ACGGCTGGAC
GCCAAGGGCC GGGGCATGGT CTTCACCGAC GTGCAGGAGG TGCACCGGGC CCACCAGAAC
GGGGTGCTGG ACCTGGGCGC CCGCGTTCAG GTGCGGATCC GCGAGGCCGT GTTCGACGAG
AACGGCGGCA TGAATGAGCG GGTGCACCGG GTCGAGACCG TGGCCGGCCG GGCGCTGCTC
TACGAGATCG TTCCCGACGG GCTGCCCTTC GAGCTGGTGG ATCGGGACAT GACCAAGAAG
GCCATTTCCG GCCTGGTCAA TGCCTGCTAC CGCCGGGTGG GCCTGAAGGG CACGGTGGTC
TTTGCCGACC AGCTCATGTA TATGGGCTTC TCCATGTCCA CCGGCGCCGG GGTCTCCATC
GGTGTCAACG ACATGGAAGT GCCGGCGGAG AAGGAGAAGA TCCTGGCCGA TGCCGAGGAA
GAGGTGAAGG ACATCGAGGA GCAGTACGCC TCGGGCCTGG TCACCAACGG CGAGCGCTAC
AACAAGGTGG TGGACATCTG GTCCCACACC AACGAGGCCG TGGCCAAGGC CATGATGGAG
AAGATGGGCA AGGACCTCGT CGAGGTGGAT GGCGAGCAGA AGGAGCAGAA GTCCTTCAAC
TCCATCTTTA TGATGGCCGA TTCCGGCGCG CGTGGCTCGG CGGCGCAGAT CCGGCAGCTG
GCGGGTATGC GCGGCCTGAT GGCGAAGCCG GACGGCTCCA TCATCGAGAC CCCCATCACG
GCGAACTTCC GTGAGGGGCT GAACGTGCTC CAGTACTTCA TCTCCACCCA CGGTGCCCGT
AAGGGCCTGG CCGACACGGC GCTGAAGACG GCCAACTCCG GGTATCTGAC CCGACGCCTG
GTGGACGTCT CCCAGGACCT GGTGGTCACC GAGGAGGATT GCGGCACCAC CGAAGGCCTG
CATATGACGC CCATCATCGA GGGTGGTGAT GTGGTGGAGA CCCTGGCCGA TCGCGTCCTC
GGGCGCGTGG TGGCGGAGGA TGTCTACAAG CCGGGCACTG ACGAAGTGGT CGCGGCGGCC
GGCACCCTGC TCGATGAGGA GTGGGTCGAG CACCTGGAGC AGCAGGGTGT GGACGAGATC
CGGGTCCGCT CGCCGATCAC CTGCCAGACT CGCCACGGCG TTTGCGCCCA GTGCTACGGT
CGCGACCTGG CGCGCGGGCA CGGCGTCAAC GTCGGTGAGG CGGTGGGTGT GATCGCCGCC
CAGTCCATCG GTGAGCCGGG CACCCAGCTG ACCATGCGGA CCTTCCACAT CGGTGGGGCC
GCGTCGCGGG CCGCGTCGAT CAACAACGTG CAGGTCCGCA ACTCGGGCTC GGTGCGGCTG
CACAACGTTA AGGTGGTCAA GCACCACTCC GGCAACTACG TGGCCGTCTC CCGTTCCGGC
GAGGTGACCG TCATGGACGA TCACGGCCGT GAGCGTGAGC GCTACAAGAT CCCCTACGGC
GCCGTGCTCT CGGTGGCCGA TGGCGATGCG GTGGAGTCCG GCCAGATCGT GGCCAACTGG
GATCCCCACA CCCACCCGAT CATCACCGAG GTGGAGGGTC GGGTGCGCTT CTACGACTTC
GTGGAGGGCG TCACCGTGGC CCGCGAGGTG GACGAGGTCA CCGGCCTCTC CAGTCTGGTG
GTCACCGATC CCAAGAGCCG TGGCAACGGC GAGCACCGGC GTATGGTGAC CGACGCCAGC
GGCAAGCAGG TGGAGGAGCG GGTCGCGTAC AAGGACTTGC GGCCCATGAT CAAGCTGGTG
GACGAGGACG GCAACGACCT CAACATCGCC GGCACCGACA TCCCGGCCCA CTACTTCCTG
CCCGCCGAGG CGATCATCAG TCTCGAGGAT GGGGCCGAGG TCCGGGTGGG CGACGCCCTG
GCGCGTATCC CGCAGGAGTC CTCCAAGACC CGCGATATCA CCGGTGGTCT GCCTCGCGTG
GCCGACCTGT TCGAGGCCCG CAAGCCGAAG GAGCCGGCCA TCCTGGCCGA GGTCTCCGGT
ACCGTGGGCT TCGGCAAGGA CACCAAGGGC AAGCAGCGCC TGGTGATCAC CAAGGCGGAC
GGTGAGACCT ACGAGGAGCT GATTCCCAAG TGGCGGACCG TCACGGTCTT CGAGGGTGAG
CACGTGGAGA AGGGTGAGGT GATCGCCGAC GGCGAGCCGA ACCCGCACGA CATCCTCCGC
CTGCTGGGGG TGACTGCGCT GGCCGCTTAC GTGGTCAAGG AGATCCAGGA CGTCTACCGT
CTGCAGGGCG TGAAGATCAA CGACAAGCAC ATCGAGGTCA TCTGCCGGCA GATGCTGCGC
AAGGTCGGCG TCAAGGACCC GGGCGAAAGC CACTTCCTGC GCGGCGAGCA GGTCGACCGG
GCCCGCGTCC TTGAGGCCAA TGATGCCCTG GAGGCGGCCG ACAAGACGCC GGCCACCTTC
GAGCCGTTGC TGCTGGGCAT CACCAAGGCC TCGCTGGCTA CCGAGTCGTT CATCTCCGCA
GCCTCGTTCC AGGAGACGAC CCGGGTGCTG ACCGAGGCCG CCACCCGCGG GGCCCGGGAC
GATCTGCGCG GGCTCAAGGA GAATGTCATC GTCGGTCGCC TGATCCCGGC GGGGACCGGC
TTCGCCTACC ACGAGGAGCG TCGGCGGGCA CAGGCCGACC CCATCGCGGC GGCGGAATCG
GCCATCGGTC TGGGTGGTGG CGAACAACCG GCCACCTCTG AGACCGGGGC TGGGGGCTCC
GACCCGTCGG AGGAAGGGTA A
 
Protein sequence
MKDLLNLFKQ PGAQLEDFDA IRIGLASPEM IRSWSYGEVK KPETINYRTF KPERDGLFCA 
KIFGPVKDYE CLCGKYKRLK HRGVVCEKCG VEVTIAKVRR ERMGHIDLAS PVAHIWFLKS
LPSRIGLLLD MTLRDIERIL YFEAFVVIEP GMTPLERGQL LSDEAYLDAI EQHGDEFEAK
MGAEAVLDLL KSLDMTGEAR TLREEIEGTN SESKIKRLSK RLKLIEAFLE SGNKPEWMIM
DVLPVLPPDL RPLVPLDGGR FATSDLNDLY RRVINRNNRL KRLLELSAPD IIVRNEKRML
QESVDALLDN GRRGRAITGT NKRPLKSLAD MIKGKQGRFR QNLLGKRVDY SGRSVIVVGP
TLRLHQCGLP KRMALELFKP FIFSKLQLRG LATTIKAAKK MVERETGEVW DILSEVIREH
PVMLNRAPTL HRLGIQAFEP VLIEGKAIQL HPLVCTAFNA DFDGDQMAVH VPLSLEAQLE
ARALMMSTNN ILSPASGEPI IVPSQDVVLG LYYMTRERLD AKGRGMVFTD VQEVHRAHQN
GVLDLGARVQ VRIREAVFDE NGGMNERVHR VETVAGRALL YEIVPDGLPF ELVDRDMTKK
AISGLVNACY RRVGLKGTVV FADQLMYMGF SMSTGAGVSI GVNDMEVPAE KEKILADAEE
EVKDIEEQYA SGLVTNGERY NKVVDIWSHT NEAVAKAMME KMGKDLVEVD GEQKEQKSFN
SIFMMADSGA RGSAAQIRQL AGMRGLMAKP DGSIIETPIT ANFREGLNVL QYFISTHGAR
KGLADTALKT ANSGYLTRRL VDVSQDLVVT EEDCGTTEGL HMTPIIEGGD VVETLADRVL
GRVVAEDVYK PGTDEVVAAA GTLLDEEWVE HLEQQGVDEI RVRSPITCQT RHGVCAQCYG
RDLARGHGVN VGEAVGVIAA QSIGEPGTQL TMRTFHIGGA ASRAASINNV QVRNSGSVRL
HNVKVVKHHS GNYVAVSRSG EVTVMDDHGR ERERYKIPYG AVLSVADGDA VESGQIVANW
DPHTHPIITE VEGRVRFYDF VEGVTVAREV DEVTGLSSLV VTDPKSRGNG EHRRMVTDAS
GKQVEERVAY KDLRPMIKLV DEDGNDLNIA GTDIPAHYFL PAEAIISLED GAEVRVGDAL
ARIPQESSKT RDITGGLPRV ADLFEARKPK EPAILAEVSG TVGFGKDTKG KQRLVITKAD
GETYEELIPK WRTVTVFEGE HVEKGEVIAD GEPNPHDILR LLGVTALAAY VVKEIQDVYR
LQGVKINDKH IEVICRQMLR KVGVKDPGES HFLRGEQVDR ARVLEANDAL EAADKTPATF
EPLLLGITKA SLATESFISA ASFQETTRVL TEAATRGARD DLRGLKENVI VGRLIPAGTG
FAYHEERRRA QADPIAAAES AIGLGGGEQP ATSETGAGGS DPSEEG