Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2831 |
Symbol | rpoA |
ID | 2686836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 3115118 |
End bp | 3116140 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637127520 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | NP_953874 |
Protein GI | 39997923 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.688016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAGAA ACTGGCGCGA CCTGATCAGT CCGAAGAAAC TTCAGGTTGA AAGCGAGACG CTTACCAATA AATATGGAAA ATTTTATGCA GAACCGTTTG AGCGCGGATT CGGCACGACT CTCGGCAACT CACTTCGCAG GGTCTTGCTT TCTTCGCTCC AAGGTGCAGG AATTACCTCT GTAAGGATCA AGGGGGTGCT TCACGAATTT TCTTCCATCC CCGGCGTAAC CGAGGATGTT ACCAATATCA TTCTCAACCT TAAGGGCGTC AGCCTCAAAA TGTACGGTAC TGAGCCTAAG ACCGTCCGGA TTATCCATAA GGGTGACGGT ATTATCACGG CAGGTGATAT CATTACCGAC CCCCAAGTCG AGATCCTGAA TCCGGAACAC CACATCGCCA CATGCTCCAA GGATGCAAAT CTTGAAATGG AGATGGTGGT GAAGGTTGGC AAGGGCTATG TGCCCGCAGA CCGGAATCGC GATGAGAAGG CTCCCGTTGG CACGATTCCG ATTGATGCCC TGTTTTCACC GATCCGCAAG GTTAACTTTA CGGTTTCAAA CGCCCGCGTC GGTCAGATGA CCGATTATGA CAAGCTGACG CTTGAGGTGT GGACCAACGG CAGCGTGATC CCGGAAGACG CGGTCGCCTT TGCCGCCAAG ATCCTCAAGG AACAACTGAG CATTTTCATT AACTTTGACG AGGAAGCTGA ACCGAGCGGC GAGGCTGAAG TTGGCGAAGG GGAAAGCCCC ATCAACGAGA ACCTCTATCG TTCGGTCGAT GAACTCGAGC TTTCCGTGCG CTCTGCTAAC TGCCTCAAGA ACGCAGGCAT TAAGCTTATT GGCGAGCTTG TGTCCCGGAC CGAGGCCGAG ATGCTCAAAA CGCAGAACTT CGGCCGCAAG TCGTTGAACG AGATCAAGGA TATACTCGCT GAGATGGGCC TTACGCTTGG TATGAAACTG GAAGGGTTCC CCGATCCGGA AGTCATGCGC AGGCTGCGCG GCGAGCGCAA GGATGAAGAA TAA
|
Protein sequence | MYRNWRDLIS PKKLQVESET LTNKYGKFYA EPFERGFGTT LGNSLRRVLL SSLQGAGITS VRIKGVLHEF SSIPGVTEDV TNIILNLKGV SLKMYGTEPK TVRIIHKGDG IITAGDIITD PQVEILNPEH HIATCSKDAN LEMEMVVKVG KGYVPADRNR DEKAPVGTIP IDALFSPIRK VNFTVSNARV GQMTDYDKLT LEVWTNGSVI PEDAVAFAAK ILKEQLSIFI NFDEEAEPSG EAEVGEGESP INENLYRSVD ELELSVRSAN CLKNAGIKLI GELVSRTEAE MLKTQNFGRK SLNEIKDILA EMGLTLGMKL EGFPDPEVMR RLRGERKDEE
|
| |