Gene Information |
Locus tag | Mlg_0452 |
Symbol | |
ID | 4270800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 509112 |
End bp | 513392 |
Gene Length | 4281 bp |
Protein Length | 1426 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638125192 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_741296 |
Protein GI | 114319613 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
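The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a single terminal stop codon counted in the gene length, and the function names `gene_length` and `protein_length` are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Expected protein length for an unspliced CDS.

    Assumes the gene length is a whole number of codons and, when
    includes_stop is True, that the last codon is a stop codon.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(509112, 513392)
print(length_bp)                  # 4281
print(protein_length(length_bp))  # 1426
```

Under these assumptions the computed values agree with the Gene Length (4281 bp) and Protein Length (1426 aa) fields of the record.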
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGACT TGCTGAATCT GTTCAAGCAA CCGGGCGCAC AGCTCGAGGA CTTCGACGCC ATCCGTATCG GACTGGCGTC GCCGGAGATG ATCCGGTCCT GGTCCTACGG CGAGGTGAAA AAGCCCGAGA CCATCAACTA CCGCACCTTC AAGCCGGAGC GCGACGGCCT GTTCTGCGCC AAGATCTTCG GCCCGGTGAA GGACTACGAG TGCCTTTGCG GCAAGTACAA GCGGCTCAAG CACCGGGGCG TGGTGTGCGA GAAGTGCGGG GTGGAGGTCA CCATCGCCAA GGTGCGCCGG GAGCGCATGG GTCATATTGA CCTGGCCAGC CCGGTCGCCC ACATCTGGTT CCTGAAGAGC CTGCCCTCGC GCATCGGGCT GCTGCTGGAT ATGACCCTGC GGGACATCGA GCGCATCCTT TACTTCGAGG CGTTCGTGGT CATCGAGCCG GGTATGACCC CGTTGGAGCG CGGCCAACTG CTCTCCGACG AGGCCTACCT GGACGCCATC GAGCAGCACG GCGATGAGTT CGAGGCCAAG ATGGGCGCCG AGGCGGTCCT CGACCTGCTC AAGAGCCTGG ACATGACCGG CGAGGCCCGG ACCCTGCGCG AGGAGATCGA GGGCACCAAC TCCGAGTCCA AGATCAAGCG CCTGTCCAAG CGGCTGAAAC TGATCGAGGC CTTCCTCGAG TCCGGCAACA AGCCGGAGTG GATGATCATG GACGTGCTGC CGGTGCTGCC ACCGGACCTG CGTCCGCTGG TGCCGCTGGA CGGCGGCCGG TTCGCCACTT CCGATCTGAA CGACCTGTAT CGCCGGGTCA TTAACCGGAA CAACCGGCTG AAGCGGCTGC TGGAGCTGTC GGCCCCCGAC ATCATCGTGC GCAACGAAAA GCGCATGCTG CAGGAGTCCG TGGACGCACT GCTGGACAAC GGCCGGCGCG GCCGGGCCAT TACCGGTACC AACAAGCGCC CGCTGAAGTC CCTGGCCGAC ATGATCAAGG GCAAGCAGGG CCGGTTCCGG CAGAACCTGC TGGGCAAGCG CGTCGATTAC TCCGGCCGTT CGGTCATCGT GGTCGGTCCG ACCCTGCGCC TGCACCAGTG CGGCCTGCCC AAGCGCATGG CGCTGGAGCT GTTCAAGCCG TTCATCTTCT CCAAGCTGCA ACTGCGCGGC CTGGCCACCA CCATCAAGGC GGCCAAGAAG ATGGTCGAGC GCGAGACCGG CGAGGTCTGG GACATCCTCT CGGAGGTGAT CCGCGAGCAC CCGGTCATGC TCAACCGTGC GCCCACGTTG CACCGCCTGG GTATTCAGGC GTTCGAGCCG GTGCTCATCG AGGGCAAGGC CATCCAGCTC CACCCGCTGG TCTGCACCGC CTTCAACGCC GACTTTGACG GCGACCAGAT GGCCGTGCAC GTGCCACTGT CGCTGGAGGC GCAGCTGGAA GCCCGCGCCC TGATGATGTC CACCAACAAC ATCCTGTCGC CGGCCAGTGG TGAGCCCATC ATCGTCCCCT CGCAGGACGT GGTGCTGGGC CTCTATTACA TGACCCGTGA ACGGCTGGAC GCCAAGGGCC GGGGCATGGT CTTCACCGAC GTGCAGGAGG TGCACCGGGC CCACCAGAAC GGGGTGCTGG ACCTGGGCGC CCGCGTTCAG GTGCGGATCC GCGAGGCCGT GTTCGACGAG AACGGCGGCA TGAATGAGCG GGTGCACCGG GTCGAGACCG TGGCCGGCCG GGCGCTGCTC TACGAGATCG TTCCCGACGG GCTGCCCTTC GAGCTGGTGG ATCGGGACAT GACCAAGAAG 
GCCATTTCCG GCCTGGTCAA TGCCTGCTAC CGCCGGGTGG GCCTGAAGGG CACGGTGGTC TTTGCCGACC AGCTCATGTA TATGGGCTTC TCCATGTCCA CCGGCGCCGG GGTCTCCATC GGTGTCAACG ACATGGAAGT GCCGGCGGAG AAGGAGAAGA TCCTGGCCGA TGCCGAGGAA GAGGTGAAGG ACATCGAGGA GCAGTACGCC TCGGGCCTGG TCACCAACGG CGAGCGCTAC AACAAGGTGG TGGACATCTG GTCCCACACC AACGAGGCCG TGGCCAAGGC CATGATGGAG AAGATGGGCA AGGACCTCGT CGAGGTGGAT GGCGAGCAGA AGGAGCAGAA GTCCTTCAAC TCCATCTTTA TGATGGCCGA TTCCGGCGCG CGTGGCTCGG CGGCGCAGAT CCGGCAGCTG GCGGGTATGC GCGGCCTGAT GGCGAAGCCG GACGGCTCCA TCATCGAGAC CCCCATCACG GCGAACTTCC GTGAGGGGCT GAACGTGCTC CAGTACTTCA TCTCCACCCA CGGTGCCCGT AAGGGCCTGG CCGACACGGC GCTGAAGACG GCCAACTCCG GGTATCTGAC CCGACGCCTG GTGGACGTCT CCCAGGACCT GGTGGTCACC GAGGAGGATT GCGGCACCAC CGAAGGCCTG CATATGACGC CCATCATCGA GGGTGGTGAT GTGGTGGAGA CCCTGGCCGA TCGCGTCCTC GGGCGCGTGG TGGCGGAGGA TGTCTACAAG CCGGGCACTG ACGAAGTGGT CGCGGCGGCC GGCACCCTGC TCGATGAGGA GTGGGTCGAG CACCTGGAGC AGCAGGGTGT GGACGAGATC CGGGTCCGCT CGCCGATCAC CTGCCAGACT CGCCACGGCG TTTGCGCCCA GTGCTACGGT CGCGACCTGG CGCGCGGGCA CGGCGTCAAC GTCGGTGAGG CGGTGGGTGT GATCGCCGCC CAGTCCATCG GTGAGCCGGG CACCCAGCTG ACCATGCGGA CCTTCCACAT CGGTGGGGCC GCGTCGCGGG CCGCGTCGAT CAACAACGTG CAGGTCCGCA ACTCGGGCTC GGTGCGGCTG CACAACGTTA AGGTGGTCAA GCACCACTCC GGCAACTACG TGGCCGTCTC CCGTTCCGGC GAGGTGACCG TCATGGACGA TCACGGCCGT GAGCGTGAGC GCTACAAGAT CCCCTACGGC GCCGTGCTCT CGGTGGCCGA TGGCGATGCG GTGGAGTCCG GCCAGATCGT GGCCAACTGG GATCCCCACA CCCACCCGAT CATCACCGAG GTGGAGGGTC GGGTGCGCTT CTACGACTTC GTGGAGGGCG TCACCGTGGC CCGCGAGGTG GACGAGGTCA CCGGCCTCTC CAGTCTGGTG GTCACCGATC CCAAGAGCCG TGGCAACGGC GAGCACCGGC GTATGGTGAC CGACGCCAGC GGCAAGCAGG TGGAGGAGCG GGTCGCGTAC AAGGACTTGC GGCCCATGAT CAAGCTGGTG GACGAGGACG GCAACGACCT CAACATCGCC GGCACCGACA TCCCGGCCCA CTACTTCCTG CCCGCCGAGG CGATCATCAG TCTCGAGGAT GGGGCCGAGG TCCGGGTGGG CGACGCCCTG GCGCGTATCC CGCAGGAGTC CTCCAAGACC CGCGATATCA CCGGTGGTCT GCCTCGCGTG GCCGACCTGT TCGAGGCCCG CAAGCCGAAG GAGCCGGCCA TCCTGGCCGA GGTCTCCGGT ACCGTGGGCT TCGGCAAGGA CACCAAGGGC AAGCAGCGCC TGGTGATCAC CAAGGCGGAC GGTGAGACCT 
ACGAGGAGCT GATTCCCAAG TGGCGGACCG TCACGGTCTT CGAGGGTGAG CACGTGGAGA AGGGTGAGGT GATCGCCGAC GGCGAGCCGA ACCCGCACGA CATCCTCCGC CTGCTGGGGG TGACTGCGCT GGCCGCTTAC GTGGTCAAGG AGATCCAGGA CGTCTACCGT CTGCAGGGCG TGAAGATCAA CGACAAGCAC ATCGAGGTCA TCTGCCGGCA GATGCTGCGC AAGGTCGGCG TCAAGGACCC GGGCGAAAGC CACTTCCTGC GCGGCGAGCA GGTCGACCGG GCCCGCGTCC TTGAGGCCAA TGATGCCCTG GAGGCGGCCG ACAAGACGCC GGCCACCTTC GAGCCGTTGC TGCTGGGCAT CACCAAGGCC TCGCTGGCTA CCGAGTCGTT CATCTCCGCA GCCTCGTTCC AGGAGACGAC CCGGGTGCTG ACCGAGGCCG CCACCCGCGG GGCCCGGGAC GATCTGCGCG GGCTCAAGGA GAATGTCATC GTCGGTCGCC TGATCCCGGC GGGGACCGGC TTCGCCTACC ACGAGGAGCG TCGGCGGGCA CAGGCCGACC CCATCGCGGC GGCGGAATCG GCCATCGGTC TGGGTGGTGG CGAACAACCG GCCACCTCTG AGACCGGGGC TGGGGGCTCC GACCCGTCGG AGGAAGGGTA A
|
Protein sequence | MKDLLNLFKQ PGAQLEDFDA IRIGLASPEM IRSWSYGEVK KPETINYRTF KPERDGLFCA KIFGPVKDYE CLCGKYKRLK HRGVVCEKCG VEVTIAKVRR ERMGHIDLAS PVAHIWFLKS LPSRIGLLLD MTLRDIERIL YFEAFVVIEP GMTPLERGQL LSDEAYLDAI EQHGDEFEAK MGAEAVLDLL KSLDMTGEAR TLREEIEGTN SESKIKRLSK RLKLIEAFLE SGNKPEWMIM DVLPVLPPDL RPLVPLDGGR FATSDLNDLY RRVINRNNRL KRLLELSAPD IIVRNEKRML QESVDALLDN GRRGRAITGT NKRPLKSLAD MIKGKQGRFR QNLLGKRVDY SGRSVIVVGP TLRLHQCGLP KRMALELFKP FIFSKLQLRG LATTIKAAKK MVERETGEVW DILSEVIREH PVMLNRAPTL HRLGIQAFEP VLIEGKAIQL HPLVCTAFNA DFDGDQMAVH VPLSLEAQLE ARALMMSTNN ILSPASGEPI IVPSQDVVLG LYYMTRERLD AKGRGMVFTD VQEVHRAHQN GVLDLGARVQ VRIREAVFDE NGGMNERVHR VETVAGRALL YEIVPDGLPF ELVDRDMTKK AISGLVNACY RRVGLKGTVV FADQLMYMGF SMSTGAGVSI GVNDMEVPAE KEKILADAEE EVKDIEEQYA SGLVTNGERY NKVVDIWSHT NEAVAKAMME KMGKDLVEVD GEQKEQKSFN SIFMMADSGA RGSAAQIRQL AGMRGLMAKP DGSIIETPIT ANFREGLNVL QYFISTHGAR KGLADTALKT ANSGYLTRRL VDVSQDLVVT EEDCGTTEGL HMTPIIEGGD VVETLADRVL GRVVAEDVYK PGTDEVVAAA GTLLDEEWVE HLEQQGVDEI RVRSPITCQT RHGVCAQCYG RDLARGHGVN VGEAVGVIAA QSIGEPGTQL TMRTFHIGGA ASRAASINNV QVRNSGSVRL HNVKVVKHHS GNYVAVSRSG EVTVMDDHGR ERERYKIPYG AVLSVADGDA VESGQIVANW DPHTHPIITE VEGRVRFYDF VEGVTVAREV DEVTGLSSLV VTDPKSRGNG EHRRMVTDAS GKQVEERVAY KDLRPMIKLV DEDGNDLNIA GTDIPAHYFL PAEAIISLED GAEVRVGDAL ARIPQESSKT RDITGGLPRV ADLFEARKPK EPAILAEVSG TVGFGKDTKG KQRLVITKAD GETYEELIPK WRTVTVFEGE HVEKGEVIAD GEPNPHDILR LLGVTALAAY VVKEIQDVYR LQGVKINDKH IEVICRQMLR KVGVKDPGES HFLRGEQVDR ARVLEANDAL EAADKTPATF EPLLLGITKA SLATESFISA ASFQETTRVL TEAATRGARD DLRGLKENVI VGRLIPAGTG FAYHEERRRA QADPIAAAES AIGLGGGEQP ATSETGAGGS DPSEEG
|
| |
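The sequences above are displayed in blocks of 10 characters separated by spaces, so they need to be normalized before any computation. The following is a minimal sketch, assuming the sequence has been copied out of the record as a plain string; `normalize_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical helper names. The stop codons listed are those of translation table 11.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two blocks and last two blocks of the gene sequence shown above.
head = normalize_sequence("ATGAAAGACT TGCTGAATCT")
tail = normalize_sequence("GAAGGGTA A")
print(head)                  # ATGAAAGACTTGCTGAATCT
print(gc_fraction(head))     # 0.35
print(ends_with_stop(tail))  # True
```

Applied to the full normalized gene sequence, `gc_fraction` gives a value that can be compared with the GC content field of the record, and `len()` gives a value to compare with the Gene Length field.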