Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1582 |
Symbol | |
ID | 6974992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 1757958 |
End bp | 1759358 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643391113 |
Product | RNA polymerase, sigma 54 subunit, RpoN |
Protein accession | YP_002275976 |
Protein GI | 209543747 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCCC AGCCCCGTCA GGAATTCAAG CAGAACCAGG GACTGTTCAT GAGCGCGCGC ATGCGCCAGT CCATCCAGAT CCTGCAATTG TCCAATGCCG AGCTCCATGA ATTCCTGGAC GCGGAGGTCG AGAAGAATCC GCTTCTGGAG CGCGATCCGA CCACCTCCGC CCCCCTGGCG GAGAGGCCCG GCCTTGCCGC GCCGCCGCCC ATGATGCGCG CGCCCGGCGC GACGGGGGAT GCCGGGCGCT GGACGTCCGA CAGCGGCCTG GATGATGACG GGACGGGGCG CCTGGCCGAA TCCGCCCCGG CGTTGCGCGA CGTGATGTTC GAACAACTGG TTCTGTCCGG CTGCACGACG ACGGAGCGGG AAATCGGCGC GCACCTGATC GCCGCACTCG ATCCGGCGGG ACGGCTGGCC GGCGGCGCCA CCGACGACAT CGCCGCCACG TTGCAGGTCA CGCTGCGGGA CGTCGAGGCC ATCCGGCAGC GCATGATGCG TTTCGACCCG CCCGGCCTGT TCGCCACGTC GCTTCGGGAA TGCCTGGCGG CCCAGTTGCA GGAACGCAAC CGTTTCGACC CGGCCATGGC GCGCCTGCTG GACAATCTGG ACCTTCTGGC CCGGCACGAC CTGAAGCGCC TGCGCCAGGC CTGCGACGTC AGCGCCACGG ACCTGGAGGA CATGATTGCC GAATTACGCA CACTCGACCC CAAGCCCGGG GCGGAACATG GCAGCACCCT GACCGTCATC GTCCCGGACA TCCTGATGGA GCAGACGGCC GAGGGCGGAT GGACGCTGGA GCTGAACCCC GAGAGCACCC CCCGTGTCGT GCTGAACAGC GCGCTGAGCA CCCGCATGTC GCTGCGCGCC CATGGGGCGG AGCGGACATA CCTGAATGAC ACTCTGACCA GCGCGAACTG GCTGATCCGC GCGCTGCAGC AGCGCAGCAT GACCATCCTG CGCGTCTCGA CCGAGATCCT GCGCCGACAG GAGGATTTCC TGCAGCACGG TCCGCAGGCC CTGCAGCCGC TCAACCTCCG GACGGTCGCC GAGGCGCTGA ACATTCACGA AAGCACGGTC AGCCGCGTGA CGGCGAACAA GTACGTCGCG ACCCCGCGCG GCGTGCTGCC GCTGAAATTC TTCTTCGTGG CGGCCATGAC CGGAAATGAC GGCGAAATCC GCAGCAACGT TGCGATCCAG AGCGTGATCC GGCGCATGAT CCAGGGCGAA CGCCCCGATG CCGTCCTGTC CGACGAGGCG ATTTCCCTGA CCCTGCGCCG GCAGGGCATC GACATCGCAC GCCGCACGGT CGCGAAATAT CGCGAGGCGA TGGGCTTTCC CAATTCCCTC CAGCGGCTTC AGCGCGCCGC CGAGACCGCC TCGCCCCCCA ACCGGCGATA G
|
Protein sequence | MSAQPRQEFK QNQGLFMSAR MRQSIQILQL SNAELHEFLD AEVEKNPLLE RDPTTSAPLA ERPGLAAPPP MMRAPGATGD AGRWTSDSGL DDDGTGRLAE SAPALRDVMF EQLVLSGCTT TEREIGAHLI AALDPAGRLA GGATDDIAAT LQVTLRDVEA IRQRMMRFDP PGLFATSLRE CLAAQLQERN RFDPAMARLL DNLDLLARHD LKRLRQACDV SATDLEDMIA ELRTLDPKPG AEHGSTLTVI VPDILMEQTA EGGWTLELNP ESTPRVVLNS ALSTRMSLRA HGAERTYLND TLTSANWLIR ALQQRSMTIL RVSTEILRRQ EDFLQHGPQA LQPLNLRTVA EALNIHESTV SRVTANKYVA TPRGVLPLKF FFVAAMTGND GEIRSNVAIQ SVIRRMIQGE RPDAVLSDEA ISLTLRRQGI DIARRTVAKY REAMGFPNSL QRLQRAAETA SPPNRR
|
| |