Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4253 |
Symbol | glnG |
ID | 6146733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4348613 |
End bp | 4350031 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641619074 |
Product | nitrogen regulation protein NR(I) |
Protein accession | YP_001746198 |
Protein GI | 170682005 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR01818] nitrogen regulation protein NR(I) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.177692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.167125 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTTTA TGCAACGAGG GATAGTCTGG GTAGTCGATG ACGATAGTTC CATCCGTTGG GTGCTTGAAC GTGCGCTCGC TGGAGCGGGT TTAACCTGTA CGACATTTGA GAACGGCGCG GAAGTACTGG AGGCGCTGGC GAGCAAAACG CCGGATGTGC TGCTTTCAGA TATCCGTATG CCGGGAATGG ACGGGCTGGC GCTGCTCAAG CAGATTAAAC AGCGCCATCC GATGCTTCCG GTCATCATTA TGACCGCACA TTCCGATCTG GATGCTGCCG TCAGCGCCTA TCAACAAGGG GCGTTTGATT ATCTGCCCAA ACCGTTTGAT ATCGACGAAG CCGTCGCGCT GGTTGAGCGC GCCATCAGTC ATTACCAGGA ACAGCAGCAG CCGCGTAATG TTCAGCTTAA CGGCCCAACG ACCGATATCA TCGGCGAAGC GCCAGCCATG CAGGACGTGT TCCGTATTAT CGGTCGGCTT TCGCGTTCTT CTATTAGCGT GCTGATTAAC GGCGAATCCG GCACCGGTAA AGAACTGGTC GCTCATGCCC TGCATCGCCA CAGTCCGCGA GCCAAAGCGC CATTTATCGC GCTGAATATG GCTGCTATCC CGAAGGATTT GATCGAATCA GAACTGTTTG GTCACGAGAA AGGCGCATTT ACCGGCGCGA ATACCATTCG TCAGGGGCGT TTTGAACAGG CTGATGGCGG TACATTATTC CTCGATGAAA TTGGCGATAT GCCGCTGGAT GTGCAGACGC GTTTGCTGCG CGTGCTGGCA GACGGTCAGT TTTATCGCGT TGGCGGCTAT GCGCCGGTGA AAGTGGATGT GCGGATTATC GCTGCCACTC ACCAGAATCT TGAACAGCGG GTGCAGGAAG GTAAGTTCCG TGAGGATCTG TTCCACCGCC TGAACGTTAT CCGCGTTCAT CTGCCGCCAT TGCGTGAGCG TCGGGAAGAT ATTCCCCGTC TGGCACGCCA TTTTTTACAG GTTGCCGCGC GCGAACTGGG CGTAGAAGCG AAGTTGCTGC ATCCGGAAAC CGAAGCCGCG CTGACGCGCC TGGCGTGGCC AGGCAACGTG CGCCAGCTGG AAAACACCTG TCGCTGGCTA ACGGTGATGG CCGCCGGGCA GGAAGTGTTG ATTCAGGATT TGCCCGGTGA ACTGTTTGAA TCAACGGTTG CGGAGAGTAC TTCGCAAATG CAACCGGACA GTTGGGCGAC ACTTTTAGCG CAGTGGGCAG ACAGAGCGCT GCGTTCCGGT CATCAAAACC TGCTTTCCGA AGCGCAGCCT GAGCTGGAGC GGACGTTACT GACGACCGCG TTGCGACATA CGCAGGGGCA TAAACAGGAA GCGGCGCGGC TACTCGGCTG GGGCCGCAAC ACCCTGACGC GTAAGTTAAA AGAGCTAGGG ATGGAGTGA
|
Protein sequence | MTFMQRGIVW VVDDDSSIRW VLERALAGAG LTCTTFENGA EVLEALASKT PDVLLSDIRM PGMDGLALLK QIKQRHPMLP VIIMTAHSDL DAAVSAYQQG AFDYLPKPFD IDEAVALVER AISHYQEQQQ PRNVQLNGPT TDIIGEAPAM QDVFRIIGRL SRSSISVLIN GESGTGKELV AHALHRHSPR AKAPFIALNM AAIPKDLIES ELFGHEKGAF TGANTIRQGR FEQADGGTLF LDEIGDMPLD VQTRLLRVLA DGQFYRVGGY APVKVDVRII AATHQNLEQR VQEGKFREDL FHRLNVIRVH LPPLRERRED IPRLARHFLQ VAARELGVEA KLLHPETEAA LTRLAWPGNV RQLENTCRWL TVMAAGQEVL IQDLPGELFE STVAESTSQM QPDSWATLLA QWADRALRSG HQNLLSEAQP ELERTLLTTA LRHTQGHKQE AARLLGWGRN TLTRKLKELG ME
|
| |