Gene EcDH1_4118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4118 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4459192 
End bp4460601 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content57% 
IMG OID 
Productnitrogen metabolism transcriptional regulator, NtrC, Fis Family 
Protein accessionACX41718 
Protein GI260451296 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGAG GGATAGTCTG GGTAGTCGAT GACGATAGTT CCATCCGTTG GGTGCTTGAA 
CGTGCGCTCG CTGGGGCAGG TTTAACCTGT ACGACGTTTG AGAACGGCGC AGAAGTGCTG
GAGGCGCTGG CGAGCAAAAC GCCGGATGTG CTGCTTTCAG ATATCCGTAT GCCGGGAATG
GACGGGCTGG CGCTGCTCAA GCAGATTAAA CAGCGCCATC CAATGCTTCC GGTCATCATT
ATGACCGCAC ATTCCGATCT GGATGCTGCC GTCAGCGCCT ATCAACAAGG GGCGTTTGAT
TATCTGCCCA AACCGTTTGA TATCGACGAA GCAGTGGCGC TGGTTGAGCG CGCTATCAGT
CATTACCAGG AACAGCAGCA GCCGCGTAAT GTTCAGCTTA ACGGCCCAAC GACCGATATC
ATCGGCGAAG CGCCAGCCAT GCAGGACGTG TTCCGTATTA TCGGTCGGCT TTCGCGTTCT
TCTATTAGCG TGCTGATTAA CGGCGAATCC GGCACCGGTA AAGAACTGGT CGCTCATGCC
CTGCATCGCC ACAGTCCGCG CGCCAAAGCG CCGTTTATCG CGCTGAATAT GGCAGCTATC
CCAAAAGATT TGATCGAATC AGAACTGTTT GGCCACGAGA AAGGCGCGTT TACTGGCGCG
AATACCATTC GTCAGGGGCG TTTTGAACAG GCCGATGGCG GTACATTATT CCTCGACGAA
ATTGGTGATA TGCCGCTGGA TGTGCAGACG CGTTTGCTGC GCGTGCTGGC AGACGGTCAG
TTTTACCGCG TTGGCGGCTA TGCGCCGGTG AAAGTGGATG TGCGGATTAT CGCTGCCACT
CACCAGAATC TCGAACAGCG AGTGCAGGAA GGTAAGTTCC GTGAGGATCT GTTCCACCGC
CTGAACGTTA TCCGCGTTCA TCTGCCGCCG CTGCGCGAAC GTCGGGAAGA TATTCCCCGT
CTGGCGCGCC ATTTTTTACA GGTTGCCGCG CGCGAACTGG GCGTAGAAGC GAAGTTACTG
CATCCGGAAA CCGAAGCTGC TCTGACGCGT CTGGCGTGGC CAGGCAACGT GCGCCAGCTG
GAAAACACCT GCCGCTGGCT AACGGTGATG GCCGCCGGGC AGGAAGTGTT GATTCAGGAT
TTGCCCGGCG AACTGTTTGA ATCAACGGTT GCGGAGAGTA CTTCGCAAAT GCAACCGGAC
AGCTGGGCGA CGCTTCTTGC GCAGTGGGCA GACAGAGCGC TGCGTTCCGG TCATCAAAAT
CTGCTTTCCG AAGCGCAGCC AGAGCTGGAG CGGACGTTAC TGACGACCGC GTTGCGACAT
ACGCAGGGGC ATAAACAGGA AGCGGCGCGG CTACTCGGCT GGGGCCGCAA CACCCTGACG
CGTAAGTTAA AAGAGCTGGG GATGGAGTGA
 
Protein sequence
MQRGIVWVVD DDSSIRWVLE RALAGAGLTC TTFENGAEVL EALASKTPDV LLSDIRMPGM 
DGLALLKQIK QRHPMLPVII MTAHSDLDAA VSAYQQGAFD YLPKPFDIDE AVALVERAIS
HYQEQQQPRN VQLNGPTTDI IGEAPAMQDV FRIIGRLSRS SISVLINGES GTGKELVAHA
LHRHSPRAKA PFIALNMAAI PKDLIESELF GHEKGAFTGA NTIRQGRFEQ ADGGTLFLDE
IGDMPLDVQT RLLRVLADGQ FYRVGGYAPV KVDVRIIAAT HQNLEQRVQE GKFREDLFHR
LNVIRVHLPP LRERREDIPR LARHFLQVAA RELGVEAKLL HPETEAALTR LAWPGNVRQL
ENTCRWLTVM AAGQEVLIQD LPGELFESTV AESTSQMQPD SWATLLAQWA DRALRSGHQN
LLSEAQPELE RTLLTTALRH TQGHKQEAAR LLGWGRNTLT RKLKELGME