Gene ECH74115_5314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5314 
SymbolglnG 
ID6967293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4952895 
End bp4954313 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content57% 
IMG OID643388975 
Productnitrogen regulation protein NR(I) 
Protein accessionYP_002273384 
Protein GI209399991 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.111299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.738695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTTTA TGCAACGAGG GATAGTCTGG GTAGTCGATG ACGATAGTTC CATCCGTTGG 
GTGCTTGAAC GTGCGCTCGC TGGAGCGGGT TTAACCTGTA CAACATTTGA GAACGGCGCG
GAGGTACTGG AGGCGCTGGC GAGCAAAACG CCGGATGTGC TGCTTTCAGA TATCCGTATG
CCGGGAATGG ACGGGCTGGC ACTGCTCAAG CAGATTAAAC AGCGCCATCC GATGCTTCCG
GTCATCATTA TGACCGCACA TTCCGATCTG GATGCTGCCG TCAGCGCCTA TCAACAAGGG
GCGTTTGATT ATCTGCCCAA ACCGTTTGAT ATCGACGAAG CCGTGGCGCT GGTTGAGCGC
GCCATCAGTC ATTACCAGGA ACAGCAGCAG CCGCGTAATG TTCAGCTTAA CGGCCCAACG
ACCGATATCA TCGGCGAAGC GCCAGCCATG CAGGACGTGT TCCGTATTAT CGGTCGGCTT
TCGCGTTCTT CTATTAGCGT GCTGATTAAC GGCGAATCCG GCACCGGTAA AGAACTGGTC
GCTCATGCCC TGCATCGCCA CAGTCCGCGA GCCAAAGCGC CATTTATCGC TCTGAATATG
GCGGCTATCC CGAAGGATTT GATCGAATCA GAACTGTTTG GTCACGAGAA AGGCGCGTTT
ACCGGCGCGA ATACCATTCG TCAGGGGCGT TTTGAACAGG CTGATGGCGG TACATTATTT
CTCGATGAAA TTGGCGATAT GCCGCTGGAT GTGCAGACGC GTTTGCTGCG CGTGCTGGCA
GACGGTCAGT TTTACCGCGT TGGCGGCTAT GCGCCGGTGA AAGTGGATGT GCGGATTATC
GCTGCCACTC ACCAGAATCT CGAACAGCGG GTGCAGGAAG GTAAGTTCCG TGAGGATCTG
TTCCACCGCC TGAACGTTAT CCGCGTTCAT CTGCCGCCGC TGCGTGAGCG TCGGGAAGAT
ATTCCCCGCC TGGCGCGCCA TTTTTTACAG GTTGCCGCGC GAGAACTGGG CGTAGAAGCG
AAGTTGCTGC ATCCGGAAAC CGAAGCCGCG TTGACGCGTC TGGCGTGGCC AGGCAACGTG
CGCCAGCTGG AAAACACCTG CCGCTGGCTA ACGGTAATGG CTGCCGGACA GGAAGTGTTG
ATTCAGGATT TGCCTGGCGA ACTGTTTGAA TCAACGGTTG CGGAGAGTAC TTCGCAAATG
CAACCGGACA GTTGGGCGAC GCTTCTTGCG CAGTGGGCAG ACAGAGCGCT ACGTTCCGGT
CATCAAAATC TGCTTTCCGA AGCGCAGCCA GAGCTGGAGC GGACGTTACT GACGACCGCG
TTGCGACATA CGCAGGGGCA TAAACAGGAG GCGGCGCGGC TACTCGGCTG GGGCCGCAAC
ACCCTGACGC GTAAGTTAAA AGAGCTGGGG ATGGAGTGA
 
Protein sequence
MTFMQRGIVW VVDDDSSIRW VLERALAGAG LTCTTFENGA EVLEALASKT PDVLLSDIRM 
PGMDGLALLK QIKQRHPMLP VIIMTAHSDL DAAVSAYQQG AFDYLPKPFD IDEAVALVER
AISHYQEQQQ PRNVQLNGPT TDIIGEAPAM QDVFRIIGRL SRSSISVLIN GESGTGKELV
AHALHRHSPR AKAPFIALNM AAIPKDLIES ELFGHEKGAF TGANTIRQGR FEQADGGTLF
LDEIGDMPLD VQTRLLRVLA DGQFYRVGGY APVKVDVRII AATHQNLEQR VQEGKFREDL
FHRLNVIRVH LPPLRERRED IPRLARHFLQ VAARELGVEA KLLHPETEAA LTRLAWPGNV
RQLENTCRWL TVMAAGQEVL IQDLPGELFE STVAESTSQM QPDSWATLLA QWADRALRSG
HQNLLSEAQP ELERTLLTTA LRHTQGHKQE AARLLGWGRN TLTRKLKELG ME