Gene Haur_2548 details


Gene Information

Locus tag: Haur_2548
Symbol:
ID: 5734426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3274623
End bp: 3275783
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 51%
IMG OID: 641279688
Product: RpoD family RNA polymerase sigma factor
Protein accession: YP_001545314
Protein GI: 159899067
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
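
The two length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon. The helper names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3274623, 3275783)
print(length, protein_length_aa(length))  # 1161 386
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.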


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000188671
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGACA AGCTACTGGA GCTTGCGGCA AAAAATCAAA ACCGCATCAA ACGCGAAGAC 
ATTCTCAGCG TGTTGCCAGA GCCAGAAACG AATATCGAAG CCCTCGAAAG CCTCTACGAA
CAAATCGAAG AAGCGGGGAT CGAGGTTGTT GATGCAGATG AAGCGCCCAA CGTTGCCGAC
CTCGATCTCG ATTTAGATCT TGATAGTGAT CACGACCTCG ACGGAGCCTT GCCTGATCTT
TCGGATATTG CGCTCGACGA CCCAGTGCGC ATGTACTTGC AAGAAATTGG TCAAGTACCC
TTGCTGACTG CGGCGGAAGA AGTCGATCTG GCCAAGAAGA TGGAAGTGGC TCACGCCGCC
AACGATACCT TGCTGCAAGA TGATGGCATG CTTGAACATA GTGATCGAAT TTCGTTAAAA
CGCCATATCG ATACTGGCCG AGCAGCCCGC CAACATTTGA TTCAGGCTAA CTTGCGCTTG
GTGGTTTCGA TTGCCAAAAA ATATACCTCG TATGGCTTGA CCATGATGGA TCTGGTGCAA
GAGGGCAATA TTGGCTTGAT GCGGGCAGTC GAGAAATTCG ATTACACCAA AGGCCATAAG
TTCTCGACCT ACGCGACGTG GTGGATTCGC CAAGCAATTA CGCGAGCGAT CGCCGACCAA
AGCCGCACCA TCCGCTTGCC TGTGCACATG GGCGAGGCAA TTAGCCAAGT TAAGCGAACG
TCGCATAAGT TGCAACAAAC CATGCAGCGC GAACCAACGC CCGAAGAAAT AGCCGAAGCC
ATGGGCATTA CCGCCTCGAA AGTGCGTCGC ACCTTGGAAG CCTCGATGCA CCCACTTTCG
TTGGAAATGC CGGTTGGTCA AGAAGGCGAA GGCCGCATGG GCGATTTCAT CGAGGATGAT
CGAGTTTCAA CCCCAGTTGA TGCTGCGGCC ATGACCATGT TGCGCGAACA AATTGAAGAA
GTTTTGCAAA AATTGCCTGA GCGCGAACGC AAAATAATTC AATTGCGCTA TGGCCTTAAA
GATGGCCGCT ACCGTACCCT CGAAGAAGTT GGCCTTGAAT TTGGCATTAC TCGCGAACGG
ATTCGCCAAA TCGAAGCGGT GGCCTTGCGT AAACTGCGTC ACCCCCACCT TGGCAAGAAA
TTACGTGGCT ATCTTGATTA A
 
Protein sequence
MIDKLLELAA KNQNRIKRED ILSVLPEPET NIEALESLYE QIEEAGIEVV DADEAPNVAD 
LDLDLDLDSD HDLDGALPDL SDIALDDPVR MYLQEIGQVP LLTAAEEVDL AKKMEVAHAA
NDTLLQDDGM LEHSDRISLK RHIDTGRAAR QHLIQANLRL VVSIAKKYTS YGLTMMDLVQ
EGNIGLMRAV EKFDYTKGHK FSTYATWWIR QAITRAIADQ SRTIRLPVHM GEAISQVKRT
SHKLQQTMQR EPTPEEIAEA MGITASKVRR TLEASMHPLS LEMPVGQEGE GRMGDFIEDD
RVSTPVDAAA MTMLREQIEE VLQKLPERER KIIQLRYGLK DGRYRTLEEV GLEFGITRER
IRQIEAVALR KLRHPHLGKK LRGYLD
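
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the snippet below builds a codon table from the standard amino acid assignments (which table 11 shares for codons inside the reading frame; the two tables differ only in permitted start codons) and translates a fragment copied from this record. The function name `translate` and the table layout are illustrative choices, not the database's own code.

```python
from itertools import product

# Standard genetic code listed in NCBI "TCAG" codon order.
# Assumption: table 11 assigns the same amino acids to internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) up to, not including, the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, kept in the page's 10-base grouping.
print(translate("ATGATTGACA AGCTACTGGA GCTTGCGGCA"))  # MIDKLLELAA
```

The same function applied to the last 21 bases of the gene (`TTACGTGGCT ATCTTGATTA A`) yields `LRGYLD`, the final six residues of the protein, with the trailing `TAA` read as the stop codon.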