Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2662 |
Symbol | |
ID | 4268795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 3014489 |
End bp | 3015346 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638127421 |
Product | RNA polymerase, sigma 32 subunit, RpoH |
Protein accession | YP_743492 |
Protein GI | 114321809 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0000148114 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCACTG CACTGGTACC GCTGAAACAC GGCAACCTCC CCACTCCCGT CGGGAGCGAG GAGGCGTATA TCCATGCGGT CAACCAGATC CCCGTGCTCA GCGCCGAGGA GGAGCACGAG CTCGCCGAGC GCTACCGCCT GCAGGGCGAC CTGGACGCGG CCCGCGCACT GGTCCTGTCC CACCTGCGCT TCGTGGTCCA CATCGCGCGC AGCTACCGCG GCTATGGCCT GCCGCTGGGC GACATCATCC AGGAGGGCAA CGTGGGGCTG ATGAAGGCCG TCAAGCGCTT CGATCCCGCC CAGGGGGTGC GCCTGGTCTC CTTCGCGGTG CACTGGATCC GCGCCGAGAT CCACGAGTAC GTGCTGCGCA ACTGGCGCAT CGTCAAGGTG GCCACCACCA AGGCCCAGCG CAAGCTATTC TTCAACCTGC GCAGCGGCCG CAAGCACCTG GGTTGGCTCA CCAGCGAGGA GGTGGACGCC ATGGCCAGGG ACCTGGGGGT CAAACCCGAG ACGGTGCGCG AGATGGAAGC GCGCATGACC GGCAACGACA CCACCTTCGA CCCGACCCCG GGCCAGGACG ACGAGGGCAT TCATGCCCCG GTCGCCTACC TGGAGGACAA GCGCTACGAC CCGGCCACCG CCGTCGAGGA GGCGGACTGG GAGCAGCACC GCGATCAGAA CCTGCACCAG GCCCTGGCAG GACTGGACGA GCGGAGCCAG GACATCCTGG CCCGGCGCTG GCTGTCGGAG CGCAAGGCGA CGCTGAAGGA GCTGGCCGAG CACTACCAGG TCTCGGCAGA GCGTATCCGG CAGTTGGAGA AAAACGCCAT GGGCCGGCTC AAGACCGCCC TGGCCTGA
|
Protein sequence | MTTALVPLKH GNLPTPVGSE EAYIHAVNQI PVLSAEEEHE LAERYRLQGD LDAARALVLS HLRFVVHIAR SYRGYGLPLG DIIQEGNVGL MKAVKRFDPA QGVRLVSFAV HWIRAEIHEY VLRNWRIVKV ATTKAQRKLF FNLRSGRKHL GWLTSEEVDA MARDLGVKPE TVREMEARMT GNDTTFDPTP GQDDEGIHAP VAYLEDKRYD PATAVEEADW EQHRDQNLHQ ALAGLDERSQ DILARRWLSE RKATLKELAE HYQVSAERIR QLEKNAMGRL KTALA
|
| |