Gene Ava_4207 details


Gene Information

Locus tag: Ava_4207
Symbol: rpoB
ID: 3680951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5269721
End bp: 5273074
Gene Length: 3354 bp
Protein Length: 1117 aa
Translation table: 11
GC content: 49%
IMG OID: 637719554
Product: DNA-directed RNA polymerase subunit beta
Protein accession: YP_324701
Protein GI: 75910405
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR02013] DNA-directed RNA polymerase, beta subunit
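
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the gene length includes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5269721
end_bp = 5273074
gene_length_bp = 3354
protein_length_aa = 1117

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which encodes no amino acid.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 3354 bp is 1118 codons, one of which is the stop codon, leaving 1117 amino acids.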


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAGCG AAAATTATAT TGAACCCGCC TTTCTGTTGC CCGACTTGAT TGAAATCCAG 
CGTTCAAGCT TTCGCTGGTT TTTAGAAGAA GGGCTGATCG AAGAACTTAA CTCCTTTAGT
CCTATTACAG ACTATACAGG CAAATTAGAA CTGCATTTTT TAGGACATAA TTACAAACTT
AAGGAGCCAA AGTACAGCGT TGAAGAAGCG AAACGCCGCG ACAGTACTTA CGCTGTACAG
ATGTATGTCC CTACCCGCCT ATTAAATAAA GAAACCGGGG ATATTAAAGA GCAGGAAGTA
TTTATCGGGG ATCTACCTTT GATGACCGAT CGCGGCACGT TTATTATTAA CGGAGCCGAA
AGGGTAATTG TCAATCAAAT TGTGCGATCG CCTGGAGTTT ACTACAAATC AGAAATCGAC
AAAAACGGTC GCCGGACATA TTCAGCCAGC CTCATCCCCA ACCGGGGCGC ATGGCTAAAA
TTTGAAACAG ACCGGAACGA TTTGGTATGG GTACGCATCG ACAAAACCCG CAAACTCTCA
GCCCAAGTAC TATTAAAAGC GCTCGGCTTA TCAGATAACG AAATTTTTGA TGCCCTGCGC
CACCCAGAGT ACTTCCAAAA AACCATCGAG AAAGAAGGGC AGTTTTCCGA AGAAGAAGCT
CTGATGGAGT TATATCGTAA ACTGCGACCA GGCGAACCAC CAACCGTACT AGGTGGGCAA
CAATTGTTAG ACTCCCGCTT CTTCGACCCC AAACGCTATG ACTTGGGTCG AGTCGGTAGA
TACAAACTCA ACAAAAAACT ACGCCTATCA GTCCCCGATA CCGTCCGCGT CCTCACCTCT
GGCGATATTT TGGCTGCGGT CGATTACCTG ATCAACCTGG AATATGACAT CGGTAGTATT
GATGACATCG ACCACTTAGG CAACCGTCGG GTAAGAAGCG TTGGTGAATT GCTACAAAAC
CAAGTCAGAG TAGGCTTAAA CCGCCTAGAA AGAATCATTC GGGAACGGAT GACCGTATCT
GATGCGGAAG TATTAACACC AGCATCCTTG GTTAACCCCA AACCATTGGT AGCCGCCATC
AAAGAATTCT TTGGCTCTAG CCAATTGAGT CAGTTCATGG ATCAAACCAA TCCTTTAGCC
GAACTGACCC ACAAACGCCG TCTGTCAGCC CTTGGCCCTG GTGGTTTAAC CAGAGAACGG
GCGGGTTTTG CTGTGCGAGA TATTCACCCC TCCCACTACG GACGGATTTG TCCCATTGAG
ACACCAGAAG GCCCCAACGC CGGTTTGATT GGTTCCTTAG CCACCCACGC CCGCGTTAAC
CAATACGGCT TCTTAGAAAC GCCATTTAGA CCAGTAGAAA ACGGCCGGGT GAGATTTGAT
CAACCCGCAG TCTACATGAC CGCCGACGAA GAGGACGACC TGCGGGTAGC ACCAGGGGAT
ATTCCTGTAG ATGAAAATGG CCACATTATT GGTCCCCAAG TACCAGTCCG CTATCGCCAG
GAATTCTCCA CCACAACACC GGAACAAGTA GACTACGTAG CCGTATCACC AGTACAGATC
GTCTCTGTAG CTACCAGTAT GATTCCCTTC TTGGAACATG ACGACGCGAA CCGCGCCCTC
ATGGGTTCCA ATATGCAACG GCAAGCTGTA CCTCTACTGA AACCAGAGCG TCCGCTAGTG
GGAACAGGCT TAGAAGCCCA AGGCGCGAGA GACTCAGGGA TGGTAATTGT CTCCCGTACC
GATGGGGACG TTGTCTACGT GGATGCTACA GAAATACGTG TCCGAGTTAG TGGGCAATTA
CCAGCAGCTA GCGGCAAAAG CACTGACAAC GGACAACTGA CCAGTCAGAA AGGACAAGAA
ATTCGCTATA CAGTATCCAA ATATCAGCGT TCTAACCAAG ATACCTGTCT CAACCAAAAA
CCCCTGGTAC GGATTGGTGA GCGTGTAGTT GCTGGTCAGG TGCTAGCTGA TGGCTCCTCC
ACGGAAGGCG GGGAATTGGC ACTGGGACAA AATATCGTCG TCGCTTATAT GCCTTGGGAA
GGCTATAACT ACGAAGACGC GATTTTAATT TCCGAGCGAC TGGTACAGGA TGATATTTAC
ACCTCAATTC ACATTGAAAA ATATGAAATT GAAGCCCGCC AGACAAAACT AGGCCCAGAA
GAAATTACCA GAGAAATTCC TAACGTCGGG GAAGATGCCC TACGTCAGTT AGATGAACAG
GGAATCATTC GGATTGGGGC TTGGGTAGAA GCTGGAGACA TACTGGTAGG CAAGGTGACA
CCAAAAGGTG AATCCGATCA GCCACCAGAA GAAAAGCTAC TACGGGCAAT TTTCGGGGAA
AAAGCACGGG ATGTAAGAGA TAACTCCCTC CGTGTACCCA ACGGCGAAAA AGGGCGGGTC
GTAGACGTGC GCTTGTTTAC CCGCGAACAA GGGGACGAAC TACCACCAGG AGCCAATATG
GTAGTCCGGG TGTATGTAGC CCAGAAGCGG AAAATCCAAG TAGGGGATAA AATGGCAGGT
CGCCACGGTA ATAAAGGGAT TATTTCCCGC ATATTGCCCA TAGAAGATAT GCCTTACTTA
CCCGACGGCT CCCCTGTGGA TATTGTCCTC AACCCCCTTG GTGTACCCAG CCGGATGAAC
GTGGGACAGG TATTTGAGTG TCTATTGGGT TGGGCTGGTC ATACCTTGGG TGTCAGGTTT
AAGATTACTC CCTTCGATGA AATGTACGGA GAAGAGTCAT CTCGCCGCAT TGTGCATGGC
AAATTGCAAG AAGCCAGAGA CGAGACAGGG AAAGATTGGG TTTATAACCC AGATGACCCA
GGCAAAATCA TGGTGTTCGA TGGTCGTACA GGGGAACCCT TTGATCGACC AGTAACTATC
GGCGTAGCTT ATATGCTGAA ACTGGTACAC CTAGTAGACG ATAAGATTCA CGCTCGTTCT
ACAGGCCCTT ACTCCTTGGT GACTCAGCAG CCATTGGGTG GAAAAGCCCA ACAAGGTGGT
CAGCGCTTTG GAGAAATGGA AGTATGGGCA TTGGAAGCTT TCGGTGCAGC TTATACCTTG
CAGGAATTGC TGACGGTGAA ATCAGACGAT ATGCAGGGAC GGAACGAAGC ATTAAATGCG
ATCGTTAAAG GCAAGGCCAT TCCTCGACCT GGAACACCAG AATCCTTCAA GGTATTGATG
CGAGAACTGC AATCCTTGGG GTTAGACATT GCCGTACATA AAGTAGAAAC CCAAGCTGAT
GGTAGTTCCT TGGATGTCGA AGTCGATTTA ATGGCAGACC AATTAGCTCG CCGTACACCA
CCCCGACCAA CCTATGAATC GCTATCCCGC GAATCATTGG ACGATGACGA ATAG
 
Protein sequence
MISENYIEPA FLLPDLIEIQ RSSFRWFLEE GLIEELNSFS PITDYTGKLE LHFLGHNYKL 
KEPKYSVEEA KRRDSTYAVQ MYVPTRLLNK ETGDIKEQEV FIGDLPLMTD RGTFIINGAE
RVIVNQIVRS PGVYYKSEID KNGRRTYSAS LIPNRGAWLK FETDRNDLVW VRIDKTRKLS
AQVLLKALGL SDNEIFDALR HPEYFQKTIE KEGQFSEEEA LMELYRKLRP GEPPTVLGGQ
QLLDSRFFDP KRYDLGRVGR YKLNKKLRLS VPDTVRVLTS GDILAAVDYL INLEYDIGSI
DDIDHLGNRR VRSVGELLQN QVRVGLNRLE RIIRERMTVS DAEVLTPASL VNPKPLVAAI
KEFFGSSQLS QFMDQTNPLA ELTHKRRLSA LGPGGLTRER AGFAVRDIHP SHYGRICPIE
TPEGPNAGLI GSLATHARVN QYGFLETPFR PVENGRVRFD QPAVYMTADE EDDLRVAPGD
IPVDENGHII GPQVPVRYRQ EFSTTTPEQV DYVAVSPVQI VSVATSMIPF LEHDDANRAL
MGSNMQRQAV PLLKPERPLV GTGLEAQGAR DSGMVIVSRT DGDVVYVDAT EIRVRVSGQL
PAASGKSTDN GQLTSQKGQE IRYTVSKYQR SNQDTCLNQK PLVRIGERVV AGQVLADGSS
TEGGELALGQ NIVVAYMPWE GYNYEDAILI SERLVQDDIY TSIHIEKYEI EARQTKLGPE
EITREIPNVG EDALRQLDEQ GIIRIGAWVE AGDILVGKVT PKGESDQPPE EKLLRAIFGE
KARDVRDNSL RVPNGEKGRV VDVRLFTREQ GDELPPGANM VVRVYVAQKR KIQVGDKMAG
RHGNKGIISR ILPIEDMPYL PDGSPVDIVL NPLGVPSRMN VGQVFECLLG WAGHTLGVRF
KITPFDEMYG EESSRRIVHG KLQEARDETG KDWVYNPDDP GKIMVFDGRT GEPFDRPVTI
GVAYMLKLVH LVDDKIHARS TGPYSLVTQQ PLGGKAQQGG QRFGEMEVWA LEAFGAAYTL
QELLTVKSDD MQGRNEALNA IVKGKAIPRP GTPESFKVLM RELQSLGLDI AVHKVETQAD
GSSLDVEVDL MADQLARRTP PRPTYESLSR ESLDDDE
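
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below is one possible way to do that with the Python standard library, and to compute GC content and translate the gene. The function names (`clean_sequence`, `gc_content`, `translate`) are hypothetical helpers, not part of any database tool. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, no special start handling is included.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 (bacterial) uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for 10-character block layout."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence above.
first_blocks = "ATGATTAGCG AAAATTATAT TGAACCCGCC"
print(translate(clean_sequence(first_blocks)))  # MISENYIEPA
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_content` is a way to check the reported GC content field.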