Gene Information |
Locus tag | Mlg_2230 |
Symbol | |
ID | 4269445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2532778 |
End bp | 2534295 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126986 |
Product | RNA polymerase, sigma 54 subunit, RpoN |
Protein accession | YP_743062 |
Protein GI | 114321379 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
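The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions about how this record is built, although the listed values are consistent with them. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2532778, 2534295)
print(length, protein_length_aa(length))
```

With the record's coordinates this reproduces the listed Gene Length (1518 bp) and Protein Length (505 aa).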
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCAACGG TCATGAAACA ATCGCTGCAA CTTCGACTCG GGCAGCAACT CACTATGACG CCACAGCTGC AACAGGCCAT CCGCCTGCTG CAGCTCTCCA CCCTGGATCT GCAGATGGAG ATCCAGCAGC AGCTCGAGTC CAACGTCATG CTGGAACTCG CCGAAGAGGA CCAGCAGGAG ACCGAACAGC GCACCGAAGA GGGCGAGACC CAGGACACGG AGGGCGAGCG CGGGGACGAG GCCAGCACCG AGGAGGCCAC CGAGGCCGGG ACCGCGGGCG AGGAGACCGC CGCCCAGGAG CAGGCCGACA TCCCCGAGGA CCTGCCGCTG GACTCCAATT GGGATGACAT CTACGACGGC AGCACCCCCT GGAGCCAGCC GGACAGCGAG GACGAGGACC GCGACCCCTA CGCCAACCGC TCCGGTGGCG GTGAAACCCT GCACGACCAC CTGACCTGGC AGGCGGAGCT CACCCCCTTC ACCGATCGGG ACGCGGCCAT CGCACAGGTC ATCATCGACT CCGTGCGCGA CGACGGCTAC CTGGGCGCCG GCATCGAGGA GTTGATCTCC GCCCTGCCGG CGGAGTGGGC CGTGGAGCCC GATGAGGTCG AGGCGGTGCT CCGGCGCATC CAGCACTTCG ACCCGGTGGG CGTGGCCGCC CGCGATCCGC GTGAGGCGCT GCTCATCCAG CTCGAGCAGC TGCCGCCGGA CACCCCACTG CTGCCCGAGG CCCGGCGCCT GGTGGATCTG CACCTGGACA TGCTGGTACA GCGCCAGTAC GCCCAACTCT GTCGGCGCAT GAAGCTCAAT CAGGACCAGC TTCGCGAGGT CCTGGGCCTG ATCCAGACCC TCGATCCGCG GCCGGGCTCG CAAATCGGTG GCGACGAGAC CCAGTACGTG GTCCCCGACG TGGTGGTCCG GCGCAGCGAC GGCCGCTGGC AGGTGGAGCT CAATCCCGCG ACCGCCCCCC GGCTGCGGGT CAACAGCTAT TACGCCAGCC TGATCAAACG CGCCGACAAC AGCAGCGACA ACACCACCCT GCGCAACCAC CTGCAGGAGG CGCGCTGGTT CATCAAGAGC CTGCTTAGCC GCAACGACAC CCTGCTCAAG GTCGCCCGCT GCATCGTCGA GCGCCAGCAG GGCTACTTCG ACCACGGCGA AGAGGCCATG CAGCCGTTGG TCCTGCGCGA GGTGGCCGAG GCGGTGGACA TGCACGAGTC CACCATCTCC CGGATCACCA CCCGCAAGTA CATGCACACC CCGCGGGGCA CCCTGGAGTT CAAGTACTTC TTCTCCAGCC ATGTGCAGAC GGTGGACGGC GGCGAGTGCT CCGCCACCGC CATCCGTGCC CGCATCCGGC GTCTGATCGC CGATGAGAAC CCCACCAAAC CACTCAGTGA CAGTCGTATT GCCAATATCC TCCAGGAGGA GGGCATAAAC GTGGCAAGAC GGACCGTAGC CAAGTATCGT GAGGCTATGG CCATCGCGTC CTCGTCAGAG CGCAAGCGAC TGGCCTGA
|
Protein sequence | MATVMKQSLQ LRLGQQLTMT PQLQQAIRLL QLSTLDLQME IQQQLESNVM LELAEEDQQE TEQRTEEGET QDTEGERGDE ASTEEATEAG TAGEETAAQE QADIPEDLPL DSNWDDIYDG STPWSQPDSE DEDRDPYANR SGGGETLHDH LTWQAELTPF TDRDAAIAQV IIDSVRDDGY LGAGIEELIS ALPAEWAVEP DEVEAVLRRI QHFDPVGVAA RDPREALLIQ LEQLPPDTPL LPEARRLVDL HLDMLVQRQY AQLCRRMKLN QDQLREVLGL IQTLDPRPGS QIGGDETQYV VPDVVVRRSD GRWQVELNPA TAPRLRVNSY YASLIKRADN SSDNTTLRNH LQEARWFIKS LLSRNDTLLK VARCIVERQQ GYFDHGEEAM QPLVLREVAE AVDMHESTIS RITTRKYMHT PRGTLEFKYF FSSHVQTVDG GECSATAIRA RIRRLIADEN PTKPLSDSRI ANILQEEGIN VARRTVAKYR EAMAIASSSE RKRLA
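The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. This is expected under translation table 11, where certain alternative start codons are read as methionine when they initiate translation. The following is a rough sketch of such a translation, not the pipeline that produced this record. `translate_cds` and `START_CODONS` are hypothetical names, and the start-codon set is deliberately limited to three commonly used bacterial starts (an assumption, not the full table 11 list).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; table 11 assigns
# the same amino acids to sense codons as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of initiation codons for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGCAACGG TCATGAAACA ATCGCTGCAA"))
```

Applied to the first three 10-base blocks of the gene sequence, this yields the first ten residues of the protein sequence (MATVMKQSLQ). The final 18 bases (CGCAAGCGAC TGGCCTGA) yield RKRLA, matching the protein's C-terminus.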