Gene Haur_0367 details


Gene Information

Locus tag: Haur_0367
Symbol:
ID: 5732218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 439342
End bp: 440517
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 52%
IMG OID: 641277490
Product: RpoD family RNA polymerase sigma factor
Protein accession: YP_001543146
Protein GI: 159896899
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
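
The start, end, and length fields in this record agree with one another, and a short sketch can make the arithmetic explicit. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(439342, 440517)   # 1176
length_aa = protein_length(length_bp)     # 391
```

Both results match the Gene Length and Protein Length fields of the record.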


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000117749
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTACCA GACGTAACGT AGTAGATTGC ATAGAGAATC CTGCGGGTTC AATAGATAGC 
TTTCGTGCGC CGCCAGCAGC GAAAGCCATA GTGGGCGTAG CCCAGCGAGC GTGGACGATG
ACAAAGTATG AAACGATGAA TGTAGCAAAA AACCAACCGG AACTCGAAGA TGCCCCCGAG
GGCTGGGAGC TTGAGCCGGG TTTTAATGAT GAAGACGTAG AATTAGCAAT TGTAAACCTA
CGCCGCAAAA AAACGCTCTA TTCCGAGGCT GTCGAAGATT CAGTCCAGAT GTATTTGCGT
GAAATCGGCG AGGTTGAGTT GCTCACCGCC GAGGAAGAAA TTACGCTGGC CAAGCAAATT
GAAACTGGCC GCAAAGCGGA ACATCAGTTG CAAACCGAGC AGTATGCAAC CTGGGATGAA
CGTACCGCCC TCGAACGCCG CGTCGATTTT GGTAATGAAG GCCGTCGCAA GCTGACCCAA
GCCAATTTGC GCTTGGTCGT GAGCGTCGCC AAAAAATATA TTGGCAATCA CATGTCGTTC
ATGGATTTGA TCCAAGAAGG CAATATTGGT TTGATGCGGG CAGTTGAGAA GTTCGATTAT
CGCAAGGGTA ATCGCTTTTC AACCTATGCT ACGTGGTGGA TTCGCCAAGC AGTTACCCGT
GCGATCGCTG AGCAAAGCCG CACCATCCGC TTGCCAGTCC ACTTGAGCGA ATCAATCGCC
CAAATGCGCC GCGTCGCTTA CCAGCTTGAG CAAGCTTTGC AACGTGAAGC AACGCCCGAT
GAATTGGCTG ATGCCTTGGG TGTAAGTTTG CGCAAGGTCA AGCGTATGCT GAATGCTTCG
GTACAGCCAG TGTCGCTTGA GCAACCAATT GGCAAAGAAG GCCAAGGCCG CGTTGGCGAA
TTTTTGGCCG ACGATACCGA TGAAGCACCG CTTGAGCAGG CCACCCGCAT GATGTTGCAA
GATGAATTGG CCGATGCGCT CTCGCAATTG CCCGACCGCG AACGCCAAAT CTTGTTGTTG
CGCTATGGTT TGGCCGATGG CAAACGCCGC ACGCTCGAAG AAGTTGGCGC TGAATTCGGG
ATTACTCGCG AACGCACCCG CCAAATCGAG GCTGAAGCGA TGCGTCACTT GCGCCAACCC
AATGTTGGTC AACACTTGCG AGCTTACCTT GATTAA
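
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to strip the layout and compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the input is assumed to contain only A, C, G, and T plus whitespace.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence, used only as a small illustration.
first_block = clean_sequence("ATGTTTACCA")
first_block_gc = gc_percent(first_block)  # 3 of 10 bases are G or C
```

Applying `gc_percent` to the full 1176 bp sequence after `clean_sequence` gives the whole-gene value. A single block, as in the illustration, is too short to be representative.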
 
Protein sequence
MFTRRNVVDC IENPAGSIDS FRAPPAAKAI VGVAQRAWTM TKYETMNVAK NQPELEDAPE 
GWELEPGFND EDVELAIVNL RRKKTLYSEA VEDSVQMYLR EIGEVELLTA EEEITLAKQI
ETGRKAEHQL QTEQYATWDE RTALERRVDF GNEGRRKLTQ ANLRLVVSVA KKYIGNHMSF
MDLIQEGNIG LMRAVEKFDY RKGNRFSTYA TWWIRQAVTR AIAEQSRTIR LPVHLSESIA
QMRRVAYQLE QALQREATPD ELADALGVSL RKVKRMLNAS VQPVSLEQPI GKEGQGRVGE
FLADDTDEAP LEQATRMMLQ DELADALSQL PDRERQILLL RYGLADGKRR TLEEVGAEFG
ITRERTRQIE AEAMRHLRQP NVGQHLRAYL D
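
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 uses the same amino acid assignments as the standard genetic code and differs only in its permitted start codons, so the standard assignments are used here. The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence (30 bases, 10 codons).
n_terminus = translate("ATGTTTACCA GACGTAACGT AGTAGATTGC")
```

Here `n_terminus` is `MFTRRNVVDC`, the first block of the protein sequence above. The final six bases of the gene, `GATTAA`, translate to `D` followed by a stop codon, consistent with the protein ending in `D`.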