Gene EcolC_3723 details


Gene Information

Locus tag: EcolC_3723
Symbol:
ID: 6068694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4075244
End bp: 4076656
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 55%
IMG OID: 641603140
Product: transcriptional regulator
Protein accession: YP_001726660
Protein GI: 170021706
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
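
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4075244
END_BP = 4076656
GENE_LENGTH_BP = 1413
PROTEIN_LENGTH_AA = 470


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


print(span_length(START_BP, END_BP))            # 1413
print(expected_protein_length(GENE_LENGTH_BP))  # 470
```

Both results agree with the listed gene length (1413 bp) and protein length (470 aa), which is consistent with the inclusive-coordinate assumption.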


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGTT ATCAACATCT GGCGACTCTA CTTGCCGAGC GGATTGAGCA AGGGCTGTAT 
CGTCACGGGG AGAAATTGCC GTCGGTGCGT AGCTTAAGTC GGGAGCACGG CGTCAGCATC
AGCACCGTGC AGCAGGCGTA TCAGACGCTG GAGACGATGA AGCTCATAAC CCCACAGCCG
CGTTCGGGTT ATTTTGTCGC ACAACGTAAA GCCCAGCCGC CAGTGCCGCC GATGACGCGT
CCGGTGCAGC GCCCGGTGGA AATTACCCAG TGGGATCAGG TGCTGGATAT GCTGGTGGCG
CATAGCGACA GTTCCATTGT TCCGTTAAGC AAAAGCACGC CGGATGTCGA AACGCCCAGC
CTGAAACCGC TCTGGCGGGA GCTAAGCCGG GTGGTGCAGC ATAATCTGCA AACCGTGCTC
GGTTATGACT TGTTAGCCGG TCAGCGGGTA TTGCGAGAGC AGATTGCCCG CCTGATGCTC
GACAGCGGCT CGGTGGTCAC CGCCGATGAC ATCATCATCA CCAGCGGCTG CCATAACTCG
ATGTCGCTGG CGTTAATGGC GGTGTGTAAA CCGGGCGATA TTGTCGCGGT CGAATCCCCC
TGTTATTACG GTTCGATGCA GATGCTGCGC GGCATGGGCG TGAAAGTGAT TGAAATCCCA
ACCGATCCAG AAACAGGCAT CAGCGTTGAA GCGCTGGAAC TGGCGCTGGA ACAGTGGCCG
ATTAAAGGCA TCATTCTGGT GCCAAACTGT AATAATCCGC TGGGATTTAT TATGCCGGAC
GCGCGCAAAC GGGCCGTTCT CTCTCTCGCT CAGCGTCATG ATATTGTGAT TTTTGAAGAT
GATGTCTACG GCGAACTGGC AACGGAGTAT CCGCGCCCGC GGACCATTCA TTCCTGGGAT
ATCGACGGGC GAGTGCTGTT GTGCAGCTCG TTCAGTAAAA GCATTGCCCC TGGCCTGCGC
GTGGGCTGGG TCGCACCGGG GCGCTATCAC GATAAACTGC TGCATATGAA ATATGCCATC
AGCAGCTTTA ATGTACCGTC CACGCAAATG GCGGCGGCAA CGTTTGTTCT GGAAGGTCAC
TATCATCGCC ATATCCGGCG GATGCGGCAG ATCTATCAGC GCAATTTGGC GCTTTATACC
TGCTGGATAC GGGAATATTT TCCCTGCGAA ATCTGTATTA CGCGCCCGAA AGGCGGATTT
TTACTGTGGA TAGAATTGCC TGAACAGGTC GATATGGTTT GCATTGCGCG GCAGCTGTGC
CGCATGAATA TCCAGGTGGC AGCAGGCTCG ATTTTCTCGG CTTCCGGCAA ATACCGTAAT
TGTCTACGCA TCAACTGCGC TTTGCCGCTC AGCGAAACCT ATCGCGAAGC ACTAAAGCAA
ATTGGCGAGG CCGTGTATCG GGCAATGGAA TAA
 
Protein sequence
MTRYQHLATL LAERIEQGLY RHGEKLPSVR SLSREHGVSI STVQQAYQTL ETMKLITPQP 
RSGYFVAQRK AQPPVPPMTR PVQRPVEITQ WDQVLDMLVA HSDSSIVPLS KSTPDVETPS
LKPLWRELSR VVQHNLQTVL GYDLLAGQRV LREQIARLML DSGSVVTADD IIITSGCHNS
MSLALMAVCK PGDIVAVESP CYYGSMQMLR GMGVKVIEIP TDPETGISVE ALELALEQWP
IKGIILVPNC NNPLGFIMPD ARKRAVLSLA QRHDIVIFED DVYGELATEY PRPRTIHSWD
IDGRVLLCSS FSKSIAPGLR VGWVAPGRYH DKLLHMKYAI SSFNVPSTQM AAATFVLEGH
YHRHIRRMRQ IYQRNLALYT CWIREYFPCE ICITRPKGGF LLWIELPEQV DMVCIARQLC
RMNIQVAAGS IFSASGKYRN CLRINCALPL SETYREALKQ IGEAVYRAME
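
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch rather than the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the amino acid string in NCBI's TCAG codon ordering. Translation table 11 assigns the same amino acids as the standard code and differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGACGCGTT ATCAACATCT GGCGACTCTA CTTGCCGAGC GGATTGAGCA AGGGCTGTAT"
last_line = "ATTGGCGAGG CCGTGTATCG GGCAATGGAA TAA"
print(translate(first_line))  # MTRYQHLATLLAERIEQGLY
print(translate(last_line))   # IGEAVYRAME
```

The two printed fragments match the first 20 and last 10 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.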