Gene EcE24377A_0167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0167 
SymbolcdaR 
ID5587960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp183833 
End bp184963 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content53% 
IMG OID640923896 
Productcarbohydrate diacid transcriptional activator CdaR 
Protein accessionYP_001461333 
Protein GI157156858 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGG ATATCGTGGC ACGTACCATG CGCATCATCG ATACCAATAT CAACGTAATG 
GATGCCCGTG GGCGAATTAT CGGCAGCGGC GATCGTGAGC GTATTGGTGA ATTGCACGAA
GGTGCATTGC TGGTACTTTC ACAGGGACGA GTCGTCGATA TCGATGACGC GGTGGCACGT
CATCTGCACG GTGTGCGGCA AGGGATTAAT CTACCGTTAC GGCTGGAAGG TGAAATTGTC
GGCGTAATTG GCCTGACAGG TGAACCAGAG AATCTGCGTA AATATGGCGA ACTGGTCTGC
ATGACGGCTG AAATGATGCT GGAACAGTCG CGGTTGATGC ACTTGTTGGC TCAGGATAGC
CGTTTGCGGG AGGAACTGGT GATGAACCTG ATTCAGGCAG AGGAGAATAC TCCCGCACTT
ACTGAATGGG CGCAACGGCT GGGGATCGAT CTCAATCAAC CGCGAGTGGT GGCTATTGTT
GAGGTCGACA GCGGTCAGCT TGGTGTGGAC AGCGCAATGG CGGAGTTACA ACAACTGCAA
AACGCGCTGA CTACGCCCGA GCGTAATAAT CTGGTGGCGA TTGTCTCGCT AACCGAAATG
GTGGTGTTGA AACCGGCGTT GAACTCTTTT GGGCGCTGGG ATGCAGAAGA TCATCGTAAG
CGAGTTGAAC AACTGATTAC CCGCATGAAA GAGTACGGCC AGCTGCGTTT TCGCGTTTCA
CTGGGCAACT ATTTTACCGG TCCTGGCAGT ATTGCCCGAT CCTATCGTAC GGCGAAAACG
ACGATGGTGG TGGGTAAACA GCGGATGCCA GAAAGTCGCT GCTATTTTTA TCAGGATCTG
ATGTTACCTG TGTTACTCGA CAGTTTGCGT GGCGACTGGC AGGCCAACGA ACTGGCGCGA
CCGCTGGCGC GGCTGAAAGC GATGGACAAT AACGGCTTGC TGCGACGAAC GCTGGCGGCG
TGGTTTCGTC ACAATGTGCA ACCGCTGGCA ACGTCAAAGG CGTTGTTTAT TCATCGTAAT
ACTCTGGAGT ATCGGCTTAA TCGTATATCG GAACTGACCG GGCTTGATTT GGGTAATTTT
GATGACAGGT TGCTGCTGTA TGTGGCGTTG CAGCTGGATG AAGAGCGGTA G
 
Protein sequence
MAQDIVARTM RIIDTNINVM DARGRIIGSG DRERIGELHE GALLVLSQGR VVDIDDAVAR 
HLHGVRQGIN LPLRLEGEIV GVIGLTGEPE NLRKYGELVC MTAEMMLEQS RLMHLLAQDS
RLREELVMNL IQAEENTPAL TEWAQRLGID LNQPRVVAIV EVDSGQLGVD SAMAELQQLQ
NALTTPERNN LVAIVSLTEM VVLKPALNSF GRWDAEDHRK RVEQLITRMK EYGQLRFRVS
LGNYFTGPGS IARSYRTAKT TMVVGKQRMP ESRCYFYQDL MLPVLLDSLR GDWQANELAR
PLARLKAMDN NGLLRRTLAA WFRHNVQPLA TSKALFIHRN TLEYRLNRIS ELTGLDLGNF
DDRLLLYVAL QLDEER