Gene GSU2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2831 
SymbolrpoA 
ID2686836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3115118 
End bp3116140 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content52% 
IMG OID637127520 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionNP_953874 
Protein GI39997923 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.688016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAGAA ACTGGCGCGA CCTGATCAGT CCGAAGAAAC TTCAGGTTGA AAGCGAGACG 
CTTACCAATA AATATGGAAA ATTTTATGCA GAACCGTTTG AGCGCGGATT CGGCACGACT
CTCGGCAACT CACTTCGCAG GGTCTTGCTT TCTTCGCTCC AAGGTGCAGG AATTACCTCT
GTAAGGATCA AGGGGGTGCT TCACGAATTT TCTTCCATCC CCGGCGTAAC CGAGGATGTT
ACCAATATCA TTCTCAACCT TAAGGGCGTC AGCCTCAAAA TGTACGGTAC TGAGCCTAAG
ACCGTCCGGA TTATCCATAA GGGTGACGGT ATTATCACGG CAGGTGATAT CATTACCGAC
CCCCAAGTCG AGATCCTGAA TCCGGAACAC CACATCGCCA CATGCTCCAA GGATGCAAAT
CTTGAAATGG AGATGGTGGT GAAGGTTGGC AAGGGCTATG TGCCCGCAGA CCGGAATCGC
GATGAGAAGG CTCCCGTTGG CACGATTCCG ATTGATGCCC TGTTTTCACC GATCCGCAAG
GTTAACTTTA CGGTTTCAAA CGCCCGCGTC GGTCAGATGA CCGATTATGA CAAGCTGACG
CTTGAGGTGT GGACCAACGG CAGCGTGATC CCGGAAGACG CGGTCGCCTT TGCCGCCAAG
ATCCTCAAGG AACAACTGAG CATTTTCATT AACTTTGACG AGGAAGCTGA ACCGAGCGGC
GAGGCTGAAG TTGGCGAAGG GGAAAGCCCC ATCAACGAGA ACCTCTATCG TTCGGTCGAT
GAACTCGAGC TTTCCGTGCG CTCTGCTAAC TGCCTCAAGA ACGCAGGCAT TAAGCTTATT
GGCGAGCTTG TGTCCCGGAC CGAGGCCGAG ATGCTCAAAA CGCAGAACTT CGGCCGCAAG
TCGTTGAACG AGATCAAGGA TATACTCGCT GAGATGGGCC TTACGCTTGG TATGAAACTG
GAAGGGTTCC CCGATCCGGA AGTCATGCGC AGGCTGCGCG GCGAGCGCAA GGATGAAGAA
TAA
 
Protein sequence
MYRNWRDLIS PKKLQVESET LTNKYGKFYA EPFERGFGTT LGNSLRRVLL SSLQGAGITS 
VRIKGVLHEF SSIPGVTEDV TNIILNLKGV SLKMYGTEPK TVRIIHKGDG IITAGDIITD
PQVEILNPEH HIATCSKDAN LEMEMVVKVG KGYVPADRNR DEKAPVGTIP IDALFSPIRK
VNFTVSNARV GQMTDYDKLT LEVWTNGSVI PEDAVAFAAK ILKEQLSIFI NFDEEAEPSG
EAEVGEGESP INENLYRSVD ELELSVRSAN CLKNAGIKLI GELVSRTEAE MLKTQNFGRK
SLNEIKDILA EMGLTLGMKL EGFPDPEVMR RLRGERKDEE