Gene B21_02345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02345 
SymbolhyfR 
ID8113234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2475590 
End bp2477602 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content53% 
IMG OID644848547 
Producthypothetical protein 
Protein accessionYP_003000120 
Protein GI251785816 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATGT CAGACGAGGC GATGTTTGCC CCGCCGCAAG GAATAACAAT TGAAGCGGTA 
AACGGAATGC TCGCGGAGCG GTTAGCGCAG AAACACGGTA AGGCGTCTTT ATTACGCGCC
TTCATCCCGC TGCCGCCGCC GTTCAGCCCG GTACAACTTA TTGAACTGCA TGTTCTCAAA
AGCAACTTCT ATTACCGCTA CCATGATGAT GGCAGCGATG TGACGGCAAC AACAGAGTAT
CAGGGCGAGA TGGTCGATTA TTCGCGTCAC GCCGTCCTTC TCGGCAGTAG TGGAATGGCG
GAGCTACGCT TTATTCGCAC CCACGGCAGT CGTTTTACTC CCCAGGATTG CACACTGTTT
AACTGGCTGG CGCGGATAAT CACCCCGGTT CTGCAATCAT GGCTCAATGA TGAAGAACAG
CAGGTGGCGC TGCGTTTGCT GGAGAAAGAT CGCGATCATC ATCGGGTACT GGTTGATATT
ACTAATGCAG TGCTGTCACA TCTTGATCTC GACGATCTGA TCGCTGACGC CGCTCGTGAG
ATCCATCATT TTTTCGGTCT GGCTTCAGTC AGTATGGTAC TGGGCGATCA TCGAAAGAAC
GAGAAGTTTA GCCTGTGGTG CAGCGATCTT TCTGCCTCAC ATTGTGCGTG TCTGCCACGC
AATATGCCTG GCGACAGTGT ATTGCTGACA CAAACGCTAC AAACCCGACA ACCGACCTTG
ACGCACCGTG CAGACGATCT GTTTCTCTGG CAACGCGACC CGTTATTACT CTTACTTGCA
TCTAACGGCT GCGAATCTGC GCTCCTTATA CCGCTTACCT TTGGCAACCA TACACCGGGT
GCATTGTTGC TGGCGCATAC CTCTTCCACT CTCTTTAGTG AGGAAAACTG CCAGCTACTA
CAACACATAG CCGATCGCAT CGCTATTGCC GTTGGCAATG CCGATGCCTG GCGTAGCATG
ACCGATTTGC AGGAAAGTTT GCAGCAAGAA AACCACCAGC TTAGCGAGCA GCTCCTTTCG
AATCTGGGCA TCGGTGACAT TATCTATCAA AGCCAGGCAA TGGAAGACCT ACTCCAGCAG
GTAGATATTG TGGCGAAGAG CGACAGTACG GTGTTGATTT GCGGTGAAAC CGGAACCGGC
AAAGAGGTGA TCGCCAGAGC GATCCATCAA CTTAGCCCGC GACGCGACAA GCCGCTGGTC
AAAATCAACT GCGCTGCCAT CCCCGCCAGT CTTCTGGAAA GTGAGTTATT CGGTCATGAC
AAAGGGGCGT TTACTGGTGC GATTAATACC CATCGTGGTC GTTTTGAAAT TGCCGATGGC
GGCACGTTGT TTCTCGATGA AATTGGCGAT CTGCCGTTAG AACTTCAGCC TAAACTGCTG
CGCGTATTGC AGGAACGGGA GATTGAGCGT CTCGGCGGGA GTAGAACGAT CCCGGTAAAT
GTCAGAGTCA TTGCCGCCAC CAACCGTGAT TTGTGGCAAA TGGTTGAAGA TCGCCAGTTT
CGCAGCGATC TCTTTTATCG CCTGAATGTC TTCCCACTGG AATTGCCGCC GCTGCGCGAC
CGTCCGGAAG ATATCCCTCT TTTAGCAAAG CATTTCACGC AAAAAATGGC GCGCCATATG
AATCGCGCAA TTGACGCCAT CCCGACCGAG GCACTACGCC AGTTGATGTC GTGGGATTGG
CCGGGCAACG TGCGCGAGCT GGAAAACGTG ATTGAGCGGG CGGTACTGTT GACTCGTGGT
AACAGTCTGA ATTTACATCT AAATGTCCGA CAAAGCCGTT TACTGCCGAC GCTAAATGAA
GATTCAGCGC TTCGCAGTTC AATGGCGCAG TTGCTGCACC CGACGACGCC AGAGAATGAC
GAAGAAGAAC GTCAGCGCAT TGTTCAGGTA TTGCGAGAAA CCAATGGCAT TGTTGCCGGG
CCCCGTGGCG CGGCGACACG ATTAGGGATG AAGCGCACCA CGCTGCTGTC ACGAATGCAG
CGTCTGGGGA TCTCGGTTCG CGAGGTGTTG TAA
 
Protein sequence
MAMSDEAMFA PPQGITIEAV NGMLAERLAQ KHGKASLLRA FIPLPPPFSP VQLIELHVLK 
SNFYYRYHDD GSDVTATTEY QGEMVDYSRH AVLLGSSGMA ELRFIRTHGS RFTPQDCTLF
NWLARIITPV LQSWLNDEEQ QVALRLLEKD RDHHRVLVDI TNAVLSHLDL DDLIADAARE
IHHFFGLASV SMVLGDHRKN EKFSLWCSDL SASHCACLPR NMPGDSVLLT QTLQTRQPTL
THRADDLFLW QRDPLLLLLA SNGCESALLI PLTFGNHTPG ALLLAHTSST LFSEENCQLL
QHIADRIAIA VGNADAWRSM TDLQESLQQE NHQLSEQLLS NLGIGDIIYQ SQAMEDLLQQ
VDIVAKSDST VLICGETGTG KEVIARAIHQ LSPRRDKPLV KINCAAIPAS LLESELFGHD
KGAFTGAINT HRGRFEIADG GTLFLDEIGD LPLELQPKLL RVLQEREIER LGGSRTIPVN
VRVIAATNRD LWQMVEDRQF RSDLFYRLNV FPLELPPLRD RPEDIPLLAK HFTQKMARHM
NRAIDAIPTE ALRQLMSWDW PGNVRELENV IERAVLLTRG NSLNLHLNVR QSRLLPTLNE
DSALRSSMAQ LLHPTTPEND EEERQRIVQV LRETNGIVAG PRGAATRLGM KRTTLLSRMQ
RLGISVREVL