Gene EcSMS35_4253 details


Gene Information

Locus tag: EcSMS35_4253
Symbol: glnG
ID: 6146733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4348613
End bp: 4350031
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 57%
IMG OID: 641619074
Product: nitrogen regulation protein NR(I)
Protein accession: YP_001746198
Protein GI: 170682005
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR01818] nitrogen regulation protein NR(I)
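
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4348613
end_bp = 4350031

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length in base pairs
print(protein_length, "aa")   # protein length in amino acids
```

Under these assumptions the computed values agree with the 1419 bp and 472 aa listed in the record.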


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.177692
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.167125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTTTA TGCAACGAGG GATAGTCTGG GTAGTCGATG ACGATAGTTC CATCCGTTGG 
GTGCTTGAAC GTGCGCTCGC TGGAGCGGGT TTAACCTGTA CGACATTTGA GAACGGCGCG
GAAGTACTGG AGGCGCTGGC GAGCAAAACG CCGGATGTGC TGCTTTCAGA TATCCGTATG
CCGGGAATGG ACGGGCTGGC GCTGCTCAAG CAGATTAAAC AGCGCCATCC GATGCTTCCG
GTCATCATTA TGACCGCACA TTCCGATCTG GATGCTGCCG TCAGCGCCTA TCAACAAGGG
GCGTTTGATT ATCTGCCCAA ACCGTTTGAT ATCGACGAAG CCGTCGCGCT GGTTGAGCGC
GCCATCAGTC ATTACCAGGA ACAGCAGCAG CCGCGTAATG TTCAGCTTAA CGGCCCAACG
ACCGATATCA TCGGCGAAGC GCCAGCCATG CAGGACGTGT TCCGTATTAT CGGTCGGCTT
TCGCGTTCTT CTATTAGCGT GCTGATTAAC GGCGAATCCG GCACCGGTAA AGAACTGGTC
GCTCATGCCC TGCATCGCCA CAGTCCGCGA GCCAAAGCGC CATTTATCGC GCTGAATATG
GCTGCTATCC CGAAGGATTT GATCGAATCA GAACTGTTTG GTCACGAGAA AGGCGCATTT
ACCGGCGCGA ATACCATTCG TCAGGGGCGT TTTGAACAGG CTGATGGCGG TACATTATTC
CTCGATGAAA TTGGCGATAT GCCGCTGGAT GTGCAGACGC GTTTGCTGCG CGTGCTGGCA
GACGGTCAGT TTTATCGCGT TGGCGGCTAT GCGCCGGTGA AAGTGGATGT GCGGATTATC
GCTGCCACTC ACCAGAATCT TGAACAGCGG GTGCAGGAAG GTAAGTTCCG TGAGGATCTG
TTCCACCGCC TGAACGTTAT CCGCGTTCAT CTGCCGCCAT TGCGTGAGCG TCGGGAAGAT
ATTCCCCGTC TGGCACGCCA TTTTTTACAG GTTGCCGCGC GCGAACTGGG CGTAGAAGCG
AAGTTGCTGC ATCCGGAAAC CGAAGCCGCG CTGACGCGCC TGGCGTGGCC AGGCAACGTG
CGCCAGCTGG AAAACACCTG TCGCTGGCTA ACGGTGATGG CCGCCGGGCA GGAAGTGTTG
ATTCAGGATT TGCCCGGTGA ACTGTTTGAA TCAACGGTTG CGGAGAGTAC TTCGCAAATG
CAACCGGACA GTTGGGCGAC ACTTTTAGCG CAGTGGGCAG ACAGAGCGCT GCGTTCCGGT
CATCAAAACC TGCTTTCCGA AGCGCAGCCT GAGCTGGAGC GGACGTTACT GACGACCGCG
TTGCGACATA CGCAGGGGCA TAAACAGGAA GCGGCGCGGC TACTCGGCTG GGGCCGCAAC
ACCCTGACGC GTAAGTTAAA AGAGCTAGGG ATGGAGTGA
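
A GC content figure such as the one in the Gene Information fields can be recomputed from a nucleotide sequence. The helper below is a minimal sketch: `gc_percent` is a hypothetical name, it accepts the space-separated 10-base blocks used in this record, and it assumes the sequence contains only A, C, G and T.

```python
def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence (all lines) as one string.
print(round(gc_percent("ATGC ATGC")))
```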
 
Protein sequence
MTFMQRGIVW VVDDDSSIRW VLERALAGAG LTCTTFENGA EVLEALASKT PDVLLSDIRM 
PGMDGLALLK QIKQRHPMLP VIIMTAHSDL DAAVSAYQQG AFDYLPKPFD IDEAVALVER
AISHYQEQQQ PRNVQLNGPT TDIIGEAPAM QDVFRIIGRL SRSSISVLIN GESGTGKELV
AHALHRHSPR AKAPFIALNM AAIPKDLIES ELFGHEKGAF TGANTIRQGR FEQADGGTLF
LDEIGDMPLD VQTRLLRVLA DGQFYRVGGY APVKVDVRII AATHQNLEQR VQEGKFREDL
FHRLNVIRVH LPPLRERRED IPRLARHFLQ VAARELGVEA KLLHPETEAA LTRLAWPGNV
RQLENTCRWL TVMAAGQEVL IQDLPGELFE STVAESTSQM QPDSWATLLA QWADRALRSG
HQNLLSEAQP ELERTLLTTA LRHTQGHKQE AARLLGWGRN TLTRKLKELG ME
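
The protein sequence begins with M even though the gene sequence begins with GTG, which would normally encode valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons, and an initiating codon is translated as methionine. The sketch below illustrates this; `translate_cds` is a hypothetical helper, not the tool that produced this record, and it assumes the input starts at the first base of the CDS and contains only A, C, G and T.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in its allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, reading a valid first codon as M, stopping at '*'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGTTTA TGCAACGAGG GATAGTCTGG"))  # MTFMQRGIVW
```

The output matches the first ten residues of the protein sequence listed in this record.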