Gene EcHS_A4095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4095 
SymbolglnG 
ID5593726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4084008 
End bp4085417 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content57% 
IMG OID640923199 
Productnitrogen regulation protein NR(I) 
Protein accessionYP_001460658 
Protein GI157163340 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGAG GGATAGTCTG GGTAGTCGAT GACGATAGTT CCATCCGTTG GGTGCTTGAA 
CGTGCGCTCG CTGGAGCGGG TTTAACCTGT ACGACATTTG AGAACGGCGC GGAAGTACTG
GAGGCGCTGG CGAGCAAAAC GCCGGATGTG CTGCTTTCAG ATATCCGTAT GCCGGGAATG
GACGGGCTGG CGCTGCTCAA GCAGATTAAA CAGCGCCATC CAATGCTTCC GGTCATCATT
ATGACCGCAC ATTCCGATCT GGATGCTGCC GTCAGCGCCT ATCAACAAGG GGCGTTTGAT
TATCTGCCCA AACCGTTTGA TATCGACGAA GCAGTGGCGC TGGTTGAGCG CGCTATCAGT
CATTACCAGG AACAGCAGCA GCCGCGTAAT ATTCAGCTTA ACGGCCCAAC GACCGATATC
ATCGGCGAAG CGCCAGCCAT GCAGGACGTG TTCCGTATTA TCGGTCGGCT TTCGCGTTCT
TCTATTAGCG TGCTGATTAA CGGCGAATCC GGCACCGGTA AAGAACTGGT CGCTCATGCC
CTGCATCGCC ACAGTCCGCG CGCCAAAGCG CCGTTTATCG CGCTGAATAT GGCAGCTATC
CCAAAAGATT TGATCGAATC AGAACTGTTT GGCCACGAGA AAGGCGCGTT TACTGGCGCG
AATACCATTC GTCAGGGGCG TTTTGAACAG GCCGATGGCG GTACATTATT CCTCGACGAA
ATTGGTGATA TGCCGCTGGA TGTGCAGACG CGTTTGCTGC GCGTGCTGGC AGACGGTCAG
TTTTACCGCG TTGGCGGCTA TGCGCCGGTG AAAGTGGATG TGCGGATTAT CGCTGCCACT
CACCAGAATC TCGAACAGCG AGTGCAGGAA GGTAAGTTCC GTGAGGATCT GTTCCACCGC
CTGAACGTTA TCCGCGTTCA TCTGCCGCCG CTGCGCGAAC GTCGGGAAGA TATTCCCCGT
CTGGCGCGCC ATTTTTTACA GGTTGCCGCG CACGAACTGG GCGTAGAAGC GAAGTTACTG
CATCCGGAAA CCGAAGCTGC TCTGACGCGT CTGGCGTGGC CAGGCAACGT GCGCCAGCTG
GAAAACACCT GCCGCTGGCT AACGGTGATG GCCGCCGGGC AGGAAGTGTT GATTCAGGAT
TTGCCCGGCG AACTGTTTGA ATCAACGGTT GCGGAGAGTA CTTCGCAAAT GCAACCGGAC
AGTTGGGCGA CGCTTCTTGC GCAGTGGGCA GACAGAGCGC TGCGTTCCGG TCATCAAAAT
CTGCTTTCCG AAGCGCAGCC AGAGCTGGAG CGGACGTTAC TGACGACCGC GTTGCGACAT
ACGCAGGGGC ATAAACAGGA AGCGGCGCGG CTACTCGGCT GGGGCCGCAA CACCCTGACG
CGTAAGTTAA AAGAGCTGGG GATGGAGTGA
 
Protein sequence
MQRGIVWVVD DDSSIRWVLE RALAGAGLTC TTFENGAEVL EALASKTPDV LLSDIRMPGM 
DGLALLKQIK QRHPMLPVII MTAHSDLDAA VSAYQQGAFD YLPKPFDIDE AVALVERAIS
HYQEQQQPRN IQLNGPTTDI IGEAPAMQDV FRIIGRLSRS SISVLINGES GTGKELVAHA
LHRHSPRAKA PFIALNMAAI PKDLIESELF GHEKGAFTGA NTIRQGRFEQ ADGGTLFLDE
IGDMPLDVQT RLLRVLADGQ FYRVGGYAPV KVDVRIIAAT HQNLEQRVQE GKFREDLFHR
LNVIRVHLPP LRERREDIPR LARHFLQVAA HELGVEAKLL HPETEAALTR LAWPGNVRQL
ENTCRWLTVM AAGQEVLIQD LPGELFESTV AESTSQMQPD SWATLLAQWA DRALRSGHQN
LLSEAQPELE RTLLTTALRH TQGHKQEAAR LLGWGRNTLT RKLKELGME