Gene Haur_0015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0015 
Symbol 
ID5736849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp18166 
End bp19368 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content51% 
IMG OID641277136 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001542795 
Protein GI159896548 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0128011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCTTG ACGATGAAAT CCTCGACTCT GAGCTTGCGG GTGATGAAAC TGACGATTTT 
CCCGCTTTAC GCACACTCAG CGATCTTTTA GCCACTGGCA AAAAACGGGG CTATGTTACG
CCTGGCGAAA TTCAGCAAAT TATCGATTCG CAGCGTGACG AGGATAAACA ACTCGAAGAA
ATTCGCGATA CCTTGCAAAA GGCCAATATT CGAGTCGAGG AAACTGGCGA CGACGACGAT
GAAATTACGG TCAGCGAAAA CCCTGAAGCC GATTTTTCCA ATGAGGGCAT CAGTGTCAAC
GACACCGTGC GCTTATATTT GCGCGAAATC GGCTTAGTGC CTTTGCTCAA AGGCGAGCAA
GAGGTTGAGC TAGCCCGTGC TATGGAAGAT GGCGATAAAG CCCAACAAGA GTTGATCGAG
AACGATCATA CGCTTTCGGA TGTTGAACGC CTAGCCTTGC GCCGCCGCAT CGACCGTGGC
GAGCAAGCTC GTGAACACTT GACCACCGCC AACTTACGTT TGGTGGTCAG TGTGGCCAAA
AAATACATTG GCCGTGGGCT TTCGTTACTC GATTTGATTC AAGAAGGCAA TGTTGGCTTG
ATTCGCGCGG TCGAGAAATT CAACTACACC AAAGGCTTTA AGTTCTCTAC CTACGCCACT
TGGTGGATTC GCCAAGCAAT CACGCGGGCA ATCGCCGACC AAGCCCGTAC CATTCGGATT
CCGGTGCACA TGGTCGAAAC GATCAACCGT ATGATGCGCA CAGCACGGCG TTTAACCCAA
GAGAATGGCC GCGAACCTAG CGATGAAGAA CTGGCCCAGG CACTCGATCT GACGGTCGAG
AAAGTGCGAT CAATTCGCAA AACCTCGATG GAGCCAGTGT CGCTCGAAAC CCCAGTTGGT
CAAGAAGAAG ATAGCCAACT TGGCGACTTC TTGCCCGATG AAAAACTTGA AGCGCCTTCC
GATGCAGCTT CGCATCAGAT GTTGCGCGAA CAAGTTGCTC AGGTACTCGA TCAACTGACC
GAGCGTGAAA AGCGCGTGTT GAAGCTACGC TTTGGGCTAG AAGATGGAAC CCAACGCACA
CTCGAAGAAG TTGGCAAAGA ATTTGGAGTA ACTCGCGAAC GAATTCGCCA GATCGAGGTG
AAGGCGCTGC GCAAACTGCG TCACCCGCGC TTTGGCAAAA AACTCCGCGA TTATCTGGAA
TAA
 
Protein sequence
MALDDEILDS ELAGDETDDF PALRTLSDLL ATGKKRGYVT PGEIQQIIDS QRDEDKQLEE 
IRDTLQKANI RVEETGDDDD EITVSENPEA DFSNEGISVN DTVRLYLREI GLVPLLKGEQ
EVELARAMED GDKAQQELIE NDHTLSDVER LALRRRIDRG EQAREHLTTA NLRLVVSVAK
KYIGRGLSLL DLIQEGNVGL IRAVEKFNYT KGFKFSTYAT WWIRQAITRA IADQARTIRI
PVHMVETINR MMRTARRLTQ ENGREPSDEE LAQALDLTVE KVRSIRKTSM EPVSLETPVG
QEEDSQLGDF LPDEKLEAPS DAASHQMLRE QVAQVLDQLT EREKRVLKLR FGLEDGTQRT
LEEVGKEFGV TRERIRQIEV KALRKLRHPR FGKKLRDYLE