Gene Information |
Locus tag | Gdia_0474 |
Symbol | |
ID | 6973869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 519355 |
End bp | 520242 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643390007 |
Product | RNA polymerase factor sigma-32 |
Protein accession | YP_002274885 |
Protein GI | 209542656 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02392] alternative sigma factor RpoH; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
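The coordinate and length fields in the record above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information record.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 519355
end_bp = 520242

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # codon count, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 888 295
```

Under these assumptions the results agree with the listed Gene Length (888 bp) and Protein Length (295 aa).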
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.455996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00148932 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCTCTT CCGTTCTCAA TGTTGGTCCT GAATCCAACC TGTCAAGATA CCTTCAGGAC ATCCGTAAAT TTCCGATGCT GTCGCCCGAG GATGAACAGC GCCTGTCCCG GCGCTGGAAG GACAAGGGCG ATACCGAGGC CGCGCATGCG CTCGTCACCT CGCACCTGCG CCTGGTCGCC AAGATCGCCA TGGGCTATCG CGGCTATGGC CTGCCGCTGG GCGAGCTGAT CAGCGAAGGC AATATCGGGA TGATGCAGGC GGTCCGCCGC TTCGACCCGG ATCGCGGCTT CCGCCTGGCG ACCTATGCGA TGTGGTGGAT TCGGGCCGCC ATCCAGGAAT ATATCCTGCA TAGCTGGTCG CTGGTGAAGA TGGGCACCAC CGCCGCGCAG AAGAAGCTGT TCTTCAACCT GCGCCGCCTG AAGGGCCAGA TGCAGGCCAT CGACGACGGC GACCTGAAGC CCGAGCAGGT GAACAAGATC GCCGAATCGC TGGGTGTGCC CGAGCAGGAC GTCATCAACA TGAACCGCCG CCTGTCGGCG CCGGATCACA GCCTGAACGC CCCGCTGCGC GCCGACAGCG AAGGCGAATG GCAGGACTGG CTGGTCGACG AACACGACAA CCAGGAACAG ACCCTGGCGG AAAACGAGGA ATTCAGCGGA CGCAAGGCAT TGCTGGACAA TGCCATGAAG ACGCTGAACG ACCGTGAGCG CCACATCCTG ACCGAACGCC GCCTGAAGGA CGACCCGGCC ACGCTGGAAG AATTGTCGCA CACCTACAAC ATCTCGCGCG AGCGGGTGCG GCAGATCGAG GTCCGGGCGT TCGAGAAGGT GCAGGCGGCG ATGAAGGCGG AAGTCGAGGC CCATCGCGAA GCCCACGCCG CAAACTGA
|
Protein sequence | MASSVLNVGP ESNLSRYLQD IRKFPMLSPE DEQRLSRRWK DKGDTEAAHA LVTSHLRLVA KIAMGYRGYG LPLGELISEG NIGMMQAVRR FDPDRGFRLA TYAMWWIRAA IQEYILHSWS LVKMGTTAAQ KKLFFNLRRL KGQMQAIDDG DLKPEQVNKI AESLGVPEQD VINMNRRLSA PDHSLNAPLR ADSEGEWQDW LVDEHDNQEQ TLAENEEFSG RKALLDNAMK TLNDRERHIL TERRLKDDPA TLEELSHTYN ISRERVRQIE VRAFEKVQAA MKAEVEAHRE AHAAN
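As a consistency check between the two sequences above, the following minimal sketch translates the first and last stretches of the gene sequence. It builds the codon table by hand from the standard NCBI layout (base order T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons. The function and constant names are hypothetical, chosen for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 18 bases of the gene sequence, as grouped in the record.
head = "ATGGCCTCTT CCGTTCTCAA TGTTGGTCCT"
tail = "GCCCACGCCG CAAACTGA"

print(translate(head))  # MASSVLNVGP
print(translate(tail))  # AHAAN
```

Both fragments match the corresponding ends of the listed protein sequence. Passing the full gene sequence to `translate` would be the natural way to extend the check to the whole record.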