Gene EcolC_4148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4148 
SymbolglnG 
ID6066358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4576790 
End bp4578199 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content57% 
IMG OID641603569 
Productnitrogen regulation protein NR(I) 
Protein accessionYP_001727072 
Protein GI170022118 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.221467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGAG GGATAGTCTG GGTAGTCGAT GACGATAGTT CCATCCGTTG GGTGCTTGAA 
CGTGCGCTCG CTGGAGCGGG TTTAACCTGT ACGACATTTG AGAACGGCGC GGAAGTACTG
GAGGCGCTGG CGAGCAAAAC GCCGGATGTG CTGCTTTCAG ATATCCGTAT GCCGGGAATG
GACGGGCTGG CGCTGCTCAA GCAGATTAAA CAGCGCCATC CAATGCTTCC GGTCATCATT
ATGACCGCAC ATTCCGATCT GGATGCTGCC GTCAGCGCCT ATCAACAAGG GGCGTTTGAT
TATCTGCCCA AACCGTTTGA TATCGACGAA GCAGTGGCGC TGGTTGAGCG CGCTATCAGT
CATTACCAGG AACAGCAGCA GCCGCGTAAT ATTCAGCTTA ACGGCCCAAC GACCGATATC
ATCGGCGAAG CGCCAGCCAT GCAGGACGTG TTCCGTATTA TCGGTCGGCT TTCGCGTTCT
TCTATTAGCG TGCTGATTAA CGGCGAATCC GGCACCGGTA AAGAACTGGT CGCTCATGCC
CTGCATCGCC ACAGTCCGCG CGCCAAAGCG CCGTTTATCG CGCTGAATAT GGCAGCTATC
CCAAAAGATT TGATCGAATC AGAACTGTTT GGCCACGAGA AAGGCGCGTT TACTGGCGCG
AATACCATTC GTCAGGGGCG TTTTGAACAG GCCGATGGCG GTACATTATT CCTCGACGAA
ATTGGTGATA TGCCGCTGGA TGTGCAGACG CGTTTGCTGC GCGTGCTGGC AGACGGTCAG
TTTTACCGCG TTGGCGGCTA TGCGCCGGTG AAAGTGGATG TGCGGATTAT CGCTGCCACT
CACCAGAATC TCGAACAGCG AGTGCAGGAA GGTAAGTTCC GTGAGGATCT GTTCCACCGC
CTGAACGTTA TCCGCGTTCA TCTGCCGCCG CTGCGCGAAC GTCGGGAAGA TATTCCCCGT
CTGGCGCGCC ATTTTTTACA GGTTGCCGCG CGCGAACTGG GCGTAGAAGC GAAGTTGCTG
CATCCGGAAA CCGAAGCTGC TCTGACGCGT CTGGCGTGGC CAGGCAACGT ACGCCAGCTG
GAAAACACCT GCCGCTGGCT AACGGTGATG GCCGCCGGGC AGGAAGTGTT GATTCAGGAT
TTGCCCGGCG AACTGTTTGA ATCAACGGTT GCGGAGAGTA CTTCGCAAAT GCAACCGGAC
AGTTGGGCGA CACTTTTAGC GCAGTGGGCA GACAGAGCGC TGCGTTCCGG TCATCAAAAT
CTGCTTTCCG AAGCGCAGCC AGAGCTGGAG CGGACGTTAC TGACGACCGC GTTGCGACAT
ACGCAGGGGC ATAAACAGGA AGCGGCGCGG CTACTCGGCT GGGGCCGCAA CACCCTGACG
CGTAAGTTAA AAGAGCTGGG GATGGAGTGA
 
Protein sequence
MQRGIVWVVD DDSSIRWVLE RALAGAGLTC TTFENGAEVL EALASKTPDV LLSDIRMPGM 
DGLALLKQIK QRHPMLPVII MTAHSDLDAA VSAYQQGAFD YLPKPFDIDE AVALVERAIS
HYQEQQQPRN IQLNGPTTDI IGEAPAMQDV FRIIGRLSRS SISVLINGES GTGKELVAHA
LHRHSPRAKA PFIALNMAAI PKDLIESELF GHEKGAFTGA NTIRQGRFEQ ADGGTLFLDE
IGDMPLDVQT RLLRVLADGQ FYRVGGYAPV KVDVRIIAAT HQNLEQRVQE GKFREDLFHR
LNVIRVHLPP LRERREDIPR LARHFLQVAA RELGVEAKLL HPETEAALTR LAWPGNVRQL
ENTCRWLTVM AAGQEVLIQD LPGELFESTV AESTSQMQPD SWATLLAQWA DRALRSGHQN
LLSEAQPELE RTLLTTALRH TQGHKQEAAR LLGWGRNTLT RKLKELGME