Gene BBta_5046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5046 
SymbolrpoA 
ID5149761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5275178 
End bp5276209 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content63% 
IMG OID640559824 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001240953 
Protein GI148256368 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0408631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.958591 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAAA CAGTGACGAT CCAGAAGAAT TGGCAAGAAT TGATTCGGCC GAACAAGCTC 
CAGGTCACGC CGGGCTCCGA CGCGACCCGT TTCGCCACCG TGGTTGCCGA GCCGCTGGAG
CGCGGTTTCG GCCAGACGCT GGGCAATGCG CTCCGCCGCA TCCTGCTGTC GTCGCTGCAG
GGCGCCGCAG TGCAGTCGGT GCACATCGAT GGCGTGCTGC ACGAGTTCTC CTCGATCGCG
GGCGTCCGTG AGGACGTCAC CGACATCGTG CTGAACATCA AGGATATCTC GATCAAGATG
CAGGGCGAGG GCCCGAAGCG GATGGTCGTG AAGAAGCAGG GTCCGGGCGC CGTCACCGCC
GGCGATATCC AGACCGTCGG CGACATCGTC GTGCTCAATC CCGACCTGCA GCTCTGCACC
CTGGACGAGG GCGCCGAGAT CCGCATGGAG TTCACGGTCG CGACCGGCAA GGGCTACGTG
CCGGCCGAGC GCAACCGTCC TGAGGACGCG CCGATCGGCC TGATCCCGAT CGACAGCCTG
TTCTCGCCGG TCCGCAAGGT CTCCTACAAG GTCGAGAACA CCCGCGAGGG CCAGATCCTC
GACTACGACA AGCTGACCAT GACGATCGAG ACCAACGGCG CGATCTCGCC GGAGGACGCG
GTGGCCTACG CCGCTCGCAT CCTGCAGGAT CAGCTCAACG TCTTCGTCAA CTTCGAAGAG
CCGCGCAAGG AAGTTGCCCA GGAGATCATT CCGGATCTCG CCTTCAATCC GGCGTTCCTC
AAGAAGGTGG ACGAACTCGA GCTGTCGGTG CGTTCGGCGA ACTGCCTGAA GAACGACAAC
ATCGTCTATA TCGGCGACCT CGTGCAGAAG TCGGAAGCGG AGATGCTGCG CACCCCGAAC
TTCGGCCGCA AGTCGCTGAA CGAGATCAAG GAAGTGCTGG CTCAGATGGG TCTGCATCTC
GGCATGGAAG TGCCTGGCTG GCCGCCGGAG AATATCGACG AACTGGCCAA GCGCTTCGAG
GATCACTACT GA
 
Protein sequence
MGETVTIQKN WQELIRPNKL QVTPGSDATR FATVVAEPLE RGFGQTLGNA LRRILLSSLQ 
GAAVQSVHID GVLHEFSSIA GVREDVTDIV LNIKDISIKM QGEGPKRMVV KKQGPGAVTA
GDIQTVGDIV VLNPDLQLCT LDEGAEIRME FTVATGKGYV PAERNRPEDA PIGLIPIDSL
FSPVRKVSYK VENTREGQIL DYDKLTMTIE TNGAISPEDA VAYAARILQD QLNVFVNFEE
PRKEVAQEII PDLAFNPAFL KKVDELELSV RSANCLKNDN IVYIGDLVQK SEAEMLRTPN
FGRKSLNEIK EVLAQMGLHL GMEVPGWPPE NIDELAKRFE DHY