Gene Haur_0242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0242 
Symbol 
ID5732137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp276873 
End bp281504 
Gene Length4632 bp 
Protein Length1543 aa 
Translation table11 
GC content52% 
IMG OID641277366 
ProductDNA-directed RNA polymerase, beta' subunit 
Protein accessionYP_001543022 
Protein GI159896775 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGAAA TCAATGAATT CAATGCAATT CGTATCAGTC TCGCTTCACC AGAAGATATC 
CTGAGCTGGT CGCACGGTGA AGTAACCAAA CCAGAGACGA TTAATTATAG AACTCTGCGT
CCTGAACGCG ATGGCTTGTT CTGTGAAAAA ATCTTCGGAC CTACCCGCGA TTGGGAATGT
TACTGTGGGA AGTACAAACG TGTGCGTTAT AAGGGCATTG TATGCGATAA ATGTGGCGTA
GAAGTCACCC GATCCAAAGT GCGACGCGAT CGCATGGGTC ATATCAAACT TGCCTCGCCT
GTATCGCATA TTTGGTTTGT TAAAGGCACA CCATCGCGTT TGGGTTTACT TTTAGATATC
TCGCCACGTA ATCTTGAGCG AGTGCTTTAT TTTGCCTCGT ATATTATCAC CGATGTCGAT
GATCTGGCAT TGGGCAATGT ACGTGAGCAA ATGAAGACCG ACTTTGGCGT GCGGCGCAAA
GATCTCGAAG AAAAAATTAT CGAACAACGT GGCGAAAAGG CAACCCGTTT AAGCAAAGAC
CTTGCGGCGA TGGATAATGC CATGGAAGGC ACGTTGGAGC GAACGCATGA ACAATTTGCC
CGCCAACGCC AAGAAATCGA AGATGAAGCC AATGCATTGC GCGAACACCT CGAAGAACTG
ATCGGCATCG ACGCACTAGC CGATGAAGAT ATTGTTTATC GTGGCACGGT GCTGCTCGAA
GAAAACGAGC CAGTGCGCGA ACGCAGCTTG GAACAGCTCG AACAACTGAT CGATCAAGAG
CGCGAAAAGC TTGAACAACG CCGCGAATAC GAGATTGAAA ACGTGCGCTT GTTGGCCGAC
GGCGAGCGTG ATCAACGCCA ATCGGTGGCT GATGCTGAAC AAGAACGGCT AACCACGGCC
TTGATCAAAC AGCTTGAAGA TCTCAACAAA GAAGAAAAAG ACAAGCTTGA TCGCTTGGAT
GATATTCAGT TGCACCGGAT TATTTCGGAA AACGAATATC GGATTCTGCG TGATTTAGCG
CCCTACACCT TCAAGGCTGA TATGGGTGCT GGCGCGGTGC GCGACATCGT ATCGATGGTT
GATCTCGACG AATTGTCGAA CCAAATGCAA GCCGAAGTCC AAAGCTCATC GGGTCAACGC
CGCAAGAAAG CCACCAAACG GCTGCGCGTG GTTGAAGCAT TCCGCAAGAG CGGCAATCGT
CCTGAATGGA TGATTATGAC CGTCTTGCCA GTGATTCCAC CAGATTTGCG CCCGATGGTC
CAGCTTGATG GTGGTCGGTT TGCGACCTCT GACTTGAACG ACCTGTATCG ACGGGTGATC
AACCGCAACA ACCGGCTCAA GCGCTTGATG GAGTTGAACG CTCCTGAAAT CATCGTGCGC
AACGAAAAAC GTATGTTGCA AGAAGCGGTT GATGCCTTGA TCGACAACGG TCGCCGTGGC
CGCCCAGTCA GTGGCAAGGG CAAGCATCGC CTTAAGAGCC TGAGCGATAT GCTCAAAGGC
AAGCAAGGCC GCTTCCGCCA AAACCTCTTG GGTAAGCGGG TTGACTACTC AGGCCGTTCG
GTGATTGTGG TCGGACCAAC CCTGCAATTG CACCAATGTG GTTTGCCCAA GAAAATGGCC
TTGGAATTGT TCAAGCCATT CGTCATGCGC CGCTTGGTCG ATAAAGGCTT TGCCCACAAC
ATCAAATCAG CCAAGCGCTT CGTTGAGCGG GTTCGCCCCG AAGTGTGGGA TGTGCTCGAA
GAAGTCATCA AAGACTACTT GGTATTGCTG AACCGTGCGC CTTCGCTGCA CCGTTTGTCG
ATTCAAGCCT TTGAAGCCAA GCTGATCGAA GGCTCGGCGA TTCAATTGCA CCCCTTGGTG
TGTGCCGCAT TCAACGCCGA CTTTGACGGC GACCAAATGG CTGTGCACGT ACCATTGTCG
CGCAAAGCCC AAGAAGAAGC CCGTCGTCGT ATGATTTCAA CCTACAACCT GTTGTCACCG
GCAACTGGCG ACCCAATTAT TACGCCATCG CAAGACATCG TGTTGGGTTG TTTCTACCTG
ACCCAAGTTC GACCTGGGGC TAAGGGTGGC GGCAAGCGCT TTGGGTCAAT TGACGAAGCT
TTGTTGGCCT ACACCAATGG TGTTGTACAT ATTCAAGCAC CTGTTTGGAT TGTGATCGAA
GATTACATTC TGCCAGGCAG CGATTTGCGT GAAAAGGAAT TGCCATCGTT GGATGGCGTT
ACGCCACGCG TTTTGATCGA AACCAGCGTT GGGCGGATCA TCTTCAACAA TGCGTTGCGC
TACCAAGGCG AGCCAAAAGC TGGCGAAAAC GGCTATCGCT CACCATTGCA CTATCGCAAC
TTCTTGGTTG GCAAATCGGG ATTGAAAGCC TTGATCGCCG ACTGCTATCG CTTCCACTCA
CAGCGCGAGA ACATCATCGC CGACGTGTAT CAAGAATTAA TTGAACGTTT CGGCCCTGAA
ACCTCGGAAG AATCGCTGTT GCGTTTCTAT GCCTCAGAAC GGACGGCGCG TTTGGCCGAC
CGGATCAAGG CCTTGGGCTT CAAGCACGCG ACCTTGGGCG GGATGACCTT CTCGGCTTCG
GACGTAGAAG TGCCTGATAC CAAAGATGCA ATTGTGCAAG AAACCTACAA GAAAGTGCAA
GACATCGAAA AGATGCAACG TCGTGGTTTG ATTACCGACG ACGAGCGCTA TCGTGAAGTG
GTTACGGCGT GGCTCGATGC TACCAACCAA ATCAAGGTTG AAGTTCAACG CTCATTGAAT
CCATTCGGGC CAGTTTCGAT GATGTCAACC TCGGGTGCGC GTGGTAACGT CGAGCAAATT
CGCCAGATGG CGGGGATGCG GGGCTTGACC ACTGACCCAA CCGGACGGAT TATCGAATTG
CCGATTACCG CCAACTTCCG CGAAGGCTTG AGCGTTATCG AGTACTTCAT CTCAACCCAC
GGTGGTCGGA AAGGTTTGGC TGATACCGCC TTGCGGACAG CCGACGCTGG TTACTTGACC
CGCCGTTTGG TGGACGTGGC CCAAGATGTG ATCGTGACGA TCGAAGATTG TGGCACAACC
GAAGGTATGT GGATTCGGGT CAGTCGCGAT ATTTTGGCCT CGGTCGAAGA TCGGATCGTT
GGCCGCGTGA CGGTGGCTCC GGTCACTAAT CCCGTGACTG GTGAAGTCAT TTTCGATACC
GATAGCGAAA TCTTAGAAGA CGATGGCAAG CACATTGCGA ATGTGCTGAA ATCGTTGGAC
AAAGAGGCTC AAGCTGAATT TGGCATTTAT GTCCGTTCGG TACTGACCTG TAACGCCGAT
TATGGCATTT GTCGCAAGTG CTATGGCCGC AACCTTGCCA CTGGCAAGAT GGTCGAAATT
GGCGAAGCTG TCGGGATTAT CGCGGCGCAA TCGATTGGTG AGCCAGGGAC GCAGCTGACC
CTGCGGACGT TCCACACTGG TGGTGTGGCA ACTGATACCG ACATCACCCA AGGTTTGCCA
CGGGTGCAAG AAATCTTCGA AGCCCGGATT CCGAAAGGGA AAGCGGTGCT GGCTCAGATT
GCTGGCCGTG TCCAGATTGT GCGCGAAGAA GAAGGCATTC GGCGCTTGCG GATCGTCTCT
GAGGAAGTCT ATACCGACGA GCAAATGTTG CCCAAAGATT ACCGCGTGGT GGTCAAAAAC
GGCGATGCTG TCGAAATTGG CAACCTCTTG GCCGAAAGCA ATGTCGATGG CGATGGTCGT
GCACCATTGG TCGCAGGCCT AGCTGGCAAC GTCTATGTTG ACGATGATCG CTTGGTAATT
CAAGCCAAAG ACATCGAAGA ACATGAAGAA GTGATTGCCC ACGCTGCTCG TTTGCGGGTC
AAAGATGGCG ATTTGGTGCA AGTTGGCCAA CAAATGACCG AAGGCTCATC TGACCCTCAA
GAAATGCTGC AATTGCGTGG TCGCGAAGCA GTTCAAGAAT ACCTGACCAA CGAAGCCCAA
AAGGTCTATC GTTCGCAAGG TGTGGGGATC AACGATAAGC ACATTGAAGT GATTGTGCGC
CAAATGTTGC GACGGGTACG CATCGAAGAG CCAGGCGATA CTGAACTCTT GCCAGGCGAA
TTGGTCGAAC TCCACGAACT CAATCGGATC AATGCTTCGA TTGTGAGCCA AGGTGGTGAC
CCAGCATTGG CAGTGCCAGT TTTGTTGGGG ATTACCAAGG CCTCGTTGTC AACCGACTCA
TTCTTGTCGG CGGCCTCGTT CCAAGAAACC ACGCGCGTGC TGACTGAAGC TGCCGTCAAT
GGCAAGATCG ATTACCTGCG TGGCTTGAAA GAAAACGTGG TTATCGGCAA GCTGATTCCA
GCTGGTACGG GGATGGAGCA ACGGCGCAAA TTGGCTGAAG AAGCCGCCTT GCGGGTTGCC
CAAATCACCA GCACTCCCGC TGATCGCGAA GTTCCAGCAG CAACGCCTGC GCCAGCAGTG
ATGAGCGAAC CCAAGCCCGC GCCACCACGC TCATTCGACG AAGCCTTGAA CGCTGTCACC
AACATCGACA GTGGCAATGG ACCAAAGGAT GACTTGTTTG CCCAAGCCAT GGCTCGTTTG
CAAGCTGAAG AAGGTCGTAA ACCAACCCTG AGCGAACTCT TAGGCACCGA CGAAGACGAG
GAAAACGTCT AA
 
Protein sequence
MLEINEFNAI RISLASPEDI LSWSHGEVTK PETINYRTLR PERDGLFCEK IFGPTRDWEC 
YCGKYKRVRY KGIVCDKCGV EVTRSKVRRD RMGHIKLASP VSHIWFVKGT PSRLGLLLDI
SPRNLERVLY FASYIITDVD DLALGNVREQ MKTDFGVRRK DLEEKIIEQR GEKATRLSKD
LAAMDNAMEG TLERTHEQFA RQRQEIEDEA NALREHLEEL IGIDALADED IVYRGTVLLE
ENEPVRERSL EQLEQLIDQE REKLEQRREY EIENVRLLAD GERDQRQSVA DAEQERLTTA
LIKQLEDLNK EEKDKLDRLD DIQLHRIISE NEYRILRDLA PYTFKADMGA GAVRDIVSMV
DLDELSNQMQ AEVQSSSGQR RKKATKRLRV VEAFRKSGNR PEWMIMTVLP VIPPDLRPMV
QLDGGRFATS DLNDLYRRVI NRNNRLKRLM ELNAPEIIVR NEKRMLQEAV DALIDNGRRG
RPVSGKGKHR LKSLSDMLKG KQGRFRQNLL GKRVDYSGRS VIVVGPTLQL HQCGLPKKMA
LELFKPFVMR RLVDKGFAHN IKSAKRFVER VRPEVWDVLE EVIKDYLVLL NRAPSLHRLS
IQAFEAKLIE GSAIQLHPLV CAAFNADFDG DQMAVHVPLS RKAQEEARRR MISTYNLLSP
ATGDPIITPS QDIVLGCFYL TQVRPGAKGG GKRFGSIDEA LLAYTNGVVH IQAPVWIVIE
DYILPGSDLR EKELPSLDGV TPRVLIETSV GRIIFNNALR YQGEPKAGEN GYRSPLHYRN
FLVGKSGLKA LIADCYRFHS QRENIIADVY QELIERFGPE TSEESLLRFY ASERTARLAD
RIKALGFKHA TLGGMTFSAS DVEVPDTKDA IVQETYKKVQ DIEKMQRRGL ITDDERYREV
VTAWLDATNQ IKVEVQRSLN PFGPVSMMST SGARGNVEQI RQMAGMRGLT TDPTGRIIEL
PITANFREGL SVIEYFISTH GGRKGLADTA LRTADAGYLT RRLVDVAQDV IVTIEDCGTT
EGMWIRVSRD ILASVEDRIV GRVTVAPVTN PVTGEVIFDT DSEILEDDGK HIANVLKSLD
KEAQAEFGIY VRSVLTCNAD YGICRKCYGR NLATGKMVEI GEAVGIIAAQ SIGEPGTQLT
LRTFHTGGVA TDTDITQGLP RVQEIFEARI PKGKAVLAQI AGRVQIVREE EGIRRLRIVS
EEVYTDEQML PKDYRVVVKN GDAVEIGNLL AESNVDGDGR APLVAGLAGN VYVDDDRLVI
QAKDIEEHEE VIAHAARLRV KDGDLVQVGQ QMTEGSSDPQ EMLQLRGREA VQEYLTNEAQ
KVYRSQGVGI NDKHIEVIVR QMLRRVRIEE PGDTELLPGE LVELHELNRI NASIVSQGGD
PALAVPVLLG ITKASLSTDS FLSAASFQET TRVLTEAAVN GKIDYLRGLK ENVVIGKLIP
AGTGMEQRRK LAEEAALRVA QITSTPADRE VPAATPAPAV MSEPKPAPPR SFDEALNAVT
NIDSGNGPKD DLFAQAMARL QAEEGRKPTL SELLGTDEDE ENV