Gene EcHS_A2845 details

Gene Information

Locus tag: EcHS_A2845
Symbol: norR
ID: 5593623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2848193
End bp: 2849707
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 57%
IMG OID: 640921962
Product: anaerobic nitric oxide reductase transcription regulator
Protein accession: YP_001459473
Protein GI: 157162155
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
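
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2848193
END_BP = 2849707


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming one untranslated trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1515 504
```

Both results match the reported Gene Length (1515 bp) and Protein Length (504 aa).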


Plasmid Coverage information

Num covering plasmid clones: 62
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTT CCGTTGATGT GCTGGCGAAT ATCGCCATCG AATTGCAGCG TGGGATTGGT 
CACCAGGATC GTTTTCAGCG CCTGATCACC ACGCTACGTC AGGTGCTGGA GTGCGATGCG
TCTGCGTTGC TACGTTACGA TTCGCGGCAG TTTATTCCGC TTGCCATCGA CGGTCTGGCA
AAGGATGTAC TCGGTAGACG CTTTGCGCTG GAAGGGCATC CACGGCTTGA AGCGATTGCC
CGCGCCGGGG ACGTGGTGCG CTTTCCTGCA GACAGCGAAT TGCCCGATCC CTATGACGGT
TTGATTCCCG GGCAGGAGAG TCTGAAGGTT CACGCCTGCG TTGGTCTGCC ATTGTTTGCC
GGACAAAACC TGATCGGCGC ACTGACGCTC GACGGGATGC AGCCCGATCA GTTCGATGTT
TTCAGCGACG AAGAGTTACG GCTGATTGCC GCGCTGGCGG CGGGAGCGTT AAGCAATGCG
TTGCTGATTG AACAACTGGA AAGCCAGAAT ATGCTGCCAG GCGATGCCAC GCCGTTTGAA
GCGGTGAAAC AGACGCAGAT GATTGGCTTG TCCCCTGGCA TGACGCAACT GAAAAAAGAG
ATTGAGATTG TGGCGGCGTC CGATCTCAAC GTTCTGATCA GCGGTGAGAC GGGAACCGGT
AAGGAGCTGG TGGCGAAAGC GATTCATGAG GCCTCGCCAC GGGCGGTGAA TCCGCTGGTC
TATCTCAACT GTGCCGCACT GCCGGAAAGT GTGGCGGAAA GTGAGTTGTT CGGGCATGTG
AAAGGAGCGT TTACTGGCGC TATCAGTAAC CGCAGTGGGA AGTTTGAAAT GGCGGATAAC
GGCACGCTGT TTCTCGATGA GATCGGCGAG TTGTCGTTGG CATTGCAGGC CAAGCTGCTG
AGGGTGTTGC AGTATGGCGA TATTCAGCGC GTTGGCGATG ACCGCAGTTT GCGGGTCGAT
GTGCGCGTGC TGGCGGCGAC TAACCGCGAC TTACGCGAAG AGGTGCTGGC AGGGCGATTC
CGCGCCGATT TGTTTCATCG CCTGAGTGTG TTTCCACTTT CGGTGCCGCC GCTGCGTGAG
CGGGGCGATG ATGTCATTCT GCTGGCGGGG TATTTCTGCG AGCAGTGTCG TTTGCGGCTG
GGGCTCTCCC GCGTGGTATT AAGTGCCGGA GCGCGAAATT TACTGCAACA CTATCGTTTT
CCGGGGAACG TGCGCGAACT GGAACATGCT ATTCATCGGG CGGTAGTGCT GGCGAGAGCC
ACCCGCAACG GCGATGAAGT GATTCTTGAG GCGCAACATT TTGCTTTTCC TGAGGTGACG
TTGCCGCCGC CAGAAGCGGC GGCGGTGCCC GTTGTTAAGC AAAACCTGCG TGAAGCGACA
GAAGCGTTCC AGCGTGAAAC TATTCGCCAG GCACTGGCAC AAAATCATCA TAACTGGGCT
GCCTGCGCGC GGATGCTGGA AACCGACGTC GCCAACCTGC ATCGGCTGGC GAAACGTCTG
GGAATGAAGG ATTAA
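
The GC content field of this record (57%) can be recomputed from the gene sequence. A minimal sketch follows, assuming the sequence is pasted as plain text in the 10-base blocks shown above. The helper name `gc_content` is hypothetical, and the example runs only on the first two blocks, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as formatted in the record.
first_blocks = "ATGAGTTTTT CCGTTGATGT"
print(round(gc_content(first_blocks), 1))  # 35.0
```

Passing the full 1515 bp sequence to the same function gives the value to compare against the reported 57%.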
 
Protein sequence
MSFSVDVLAN IAIELQRGIG HQDRFQRLIT TLRQVLECDA SALLRYDSRQ FIPLAIDGLA 
KDVLGRRFAL EGHPRLEAIA RAGDVVRFPA DSELPDPYDG LIPGQESLKV HACVGLPLFA
GQNLIGALTL DGMQPDQFDV FSDEELRLIA ALAAGALSNA LLIEQLESQN MLPGDATPFE
AVKQTQMIGL SPGMTQLKKE IEIVAASDLN VLISGETGTG KELVAKAIHE ASPRAVNPLV
YLNCAALPES VAESELFGHV KGAFTGAISN RSGKFEMADN GTLFLDEIGE LSLALQAKLL
RVLQYGDIQR VGDDRSLRVD VRVLAATNRD LREEVLAGRF RADLFHRLSV FPLSVPPLRE
RGDDVILLAG YFCEQCRLRL GLSRVVLSAG ARNLLQHYRF PGNVRELEHA IHRAVVLARA
TRNGDEVILE AQHFAFPEVT LPPPEAAAVP VVKQNLREAT EAFQRETIRQ ALAQNHHNWA
ACARMLETDV ANLHRLAKRL GMKD
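
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon table from the standard compact amino-acid string (table 11 uses the same codon assignments as the standard code and differs only in the permitted start codons) and translates the first and last stretches of the gene. The function name `translate` is illustrative, and the code ignores alternative start codons.

```python
from itertools import product

# Codon assignments in TCAG order; identical for NCBI tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGAGTTTTT CCGTTGATGT GCTGGCGAAT"))  # MSFSVDVLAN
print(translate("GGAATGAAGG ATTAA"))                  # GMKD*
```

The outputs match the first ten and last four residues of the protein sequence, with the final TAA codon acting as the stop.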