Gene Haur_2366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2366 
Symbol 
ID5734247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3013766 
End bp3014923 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content54% 
IMG OID641279507 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_001545134 
Protein GI159898887 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000342445 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAAC GCCTCTTAGT TATCGACGAT GAAGCCAATT TACGCTGGGT GCTTAGCGAG 
GCCTTGAGCG ACCAAGGCTA CGACGTAGTG GTGGCCGCGA ATGCCAACGA TGGCTTGGCT
GCGATGAGCC GCCAACCTGC CGATGTGGTC ATTCTCGATC TCAAGTTGAA GGGCATGGAT
GGCTTGGCAA CCTTGGCCCG CTTGCGCGAA CGCTGGCCTG AAGTCGTTGT CTTGATCTTG
ACGGCGTATG GCACAGTGGC CAGCGCGGTT GAGGCTATGC AACTGGGCGC TGCTGATTAT
TTGCGCAAGC CCTTTGATTT GGAAGAAATT GGTTTCAAAT TGCAACGAGC CTTGGAACGA
GCTGCGCTAC AACAAGAACT ACGGCGTTTA CGCCAACAAC AGCAACAGCG CATGGTCAAC
GATTTGATCG GCAGTCATCC AGCATGGGTG GCTTGTCGTC AACAGCTTGA ACGCATGATC
GATCGCTTGC CCGTGTTGGT TTTGGTGGGA GATGCGGGCG TGGGCAAGGC CCAATTGGCG
CGGTATGCCC ATGCTATCAG CCAGCGCCAG CAGGCACCGC TGATTGAGCT TGATGCTGGC
TTATTGAACG AATCGATGCT TGAGGCGGCG CTGGACGAGG CGGGCCAAGG CAGTATCATT
ATTCGCCGTG GTTTAGGGTG GTTGGATTGG TTACTCGCTC GAAAACTTGC GGCATGTGTA
CTCTTGACTA GCCTTGAAGC GCCGAATCAA ACGGTTCCAA CGCTGCATCT CCCCACGCTT
AATCAGCGCC GTAGCGACAT TGGCTTGTTA GCGGATTATT GGCTTGGACA GCAGATGCTT
AGTCCTCAGG CGCTCCAAAA ACTAGAGCAA AGTCAATGGA ACGCCAATCT GCCCGAATTG
CGCCATGTCC TTGAACGTGC AGCTGTCGCG GCTAATGGTC AGCTAATTCA GTCCGAGCAT
TTGCCACACG ATTTGCCTAG TGCTACTGCC GAACCAATCA CACTGCCCGC AAGCGGCTTG
CAACTTGAGG TGGTCGAACG CAGTTTATTG CAACAGGCCT TGCAACAAGC CAATGGTAAT
AAAACCCGCG CCGCTGAATT ATTGGGCTTA TCGCGCCATC AATTACTCTA TCGGCTAGAA
AAACATGGCC TTAGCTAG
 
Protein sequence
MTKRLLVIDD EANLRWVLSE ALSDQGYDVV VAANANDGLA AMSRQPADVV ILDLKLKGMD 
GLATLARLRE RWPEVVVLIL TAYGTVASAV EAMQLGAADY LRKPFDLEEI GFKLQRALER
AALQQELRRL RQQQQQRMVN DLIGSHPAWV ACRQQLERMI DRLPVLVLVG DAGVGKAQLA
RYAHAISQRQ QAPLIELDAG LLNESMLEAA LDEAGQGSII IRRGLGWLDW LLARKLAACV
LLTSLEAPNQ TVPTLHLPTL NQRRSDIGLL ADYWLGQQML SPQALQKLEQ SQWNANLPEL
RHVLERAAVA ANGQLIQSEH LPHDLPSATA EPITLPASGL QLEVVERSLL QQALQQANGN
KTRAAELLGL SRHQLLYRLE KHGLS