Gene Haur_4672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4672 
Symbol 
ID5736519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5967897 
End bp5969024 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content53% 
IMG OID641281836 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001547431 
Protein GI159901184 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGGCAG CCCGTCAGCA TCGTTATCTA ACGCACGACC AAGTGCTGCT CCACCTTCCT 
CCCTCCGACA AAAACACAGC GCTGTTCGAC GAATTGTGTG TCCGCTGCCA AGCAGAAGAT
ATTCCAATTT TGGAAGAACC GCCAACCGCT GCAGTGCATC TCCGTTCACG CGATGTCGAT
GATGACGATC TCGAACGGTC TCTGATTCGG GAAGGGGTAA GCCTCGATGA TCCGGTGCGC
ATGTATTTGC GCGAAATTGG CCAAGTGCAT CTCTTAACCG CCCACGAAGA AACCATGTTG
GCCAAACAAT TGGAACGCGG CGAACGCGCA ATGATTCGGC TCGCCCGCCA CGAATATCTC
CTCGAAGAAA AAGCCCGCCT GCAACAGCAA GTTCTCGAAA GCGAAGCAGC CCGTCAACAT
CTCATCCAAG CCAATTTACG CCTCGTGGTG TCGATTGCAA AAAAATATGT TGGCCGCGGC
ATGTCGTTTT TAGATTTAAT TCAAGAGGGC AATATTGGCC TGATGCGGGC AACCGAGAAG
TTCGATTATC ACAAGGGCTT TAAGTTCTCA ACCTATGCAA CATGGTGGAT TCGCCAAGCA
ATTACGCGCT CGATCTCTGA TCATAGTCGC ACTATTCGCC TGCCAGTTCA CGTGGGCGAG
ACGATCAATC GGGTCAAGCG CACGGCCCAC AAGTTGCAAC AAGCCTTAGA GCGTGAGCCA
AGTGCCGAGG AGATTGGCGA AGCCCTTGGG TTGCCCGCCG ATAAAGTGCG GCGCGTGCTC
GAAGTTGCCC GTCAGCCAGT TTCGCTGGAA ACGCCCGTTG GCAACGAAGG CGATAGTGTG
TTAGGCGATT TTATCGAAGA TGATCGCTCA TCCGAGCCAC TTGATCATGC GACCGAGCAT
TTATTGCGCG AACAACTTGA TGAAGTGCTG TGCAAGCTCG ATGAGCGTGA ACGGCGGATT
ATTGAGCTAC GCTATGGCTT GGTCGATGGC AAATATCGTA CCCTTGAAGA AGTTGGCCGC
GAATTTGGCA TTACCCGCGA GCGAATTCGC CAAATTGAGG CCAAAGTACT GCGAAAACTG
CGCCACCCAG CGTTTGGTGG GCGCAAATTG CGAGCCTATC TTGAATAA
 
Protein sequence
MGAARQHRYL THDQVLLHLP PSDKNTALFD ELCVRCQAED IPILEEPPTA AVHLRSRDVD 
DDDLERSLIR EGVSLDDPVR MYLREIGQVH LLTAHEETML AKQLERGERA MIRLARHEYL
LEEKARLQQQ VLESEAARQH LIQANLRLVV SIAKKYVGRG MSFLDLIQEG NIGLMRATEK
FDYHKGFKFS TYATWWIRQA ITRSISDHSR TIRLPVHVGE TINRVKRTAH KLQQALEREP
SAEEIGEALG LPADKVRRVL EVARQPVSLE TPVGNEGDSV LGDFIEDDRS SEPLDHATEH
LLREQLDEVL CKLDERERRI IELRYGLVDG KYRTLEEVGR EFGITRERIR QIEAKVLRKL
RHPAFGGRKL RAYLE