Gene EcolC_2256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2256 
Symbol 
ID6067050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2486401 
End bp2487351 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content51% 
IMG OID641601660 
ProductPaaX family transcriptional regulator 
Protein accessionYP_001725219 
Protein GI170020265 
COG category[K] Transcription 
COG ID[COG3327] Phenylacetic acid-responsive transcriptional repressor 
TIGRFAM ID[TIGR02277] phenylacetic acid degradation operon negative regulatory protein PaaX 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.203735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAC TTGATACTTT TATCCAACAT GCTGTAAACG CTGTTCCGGT CAGTGGCACA 
TCTTTGATCT CCTCTCTGTA TGGTGATTCG CTTTCCCATC GTGGTGGTGA AATCTGGCTT
GGTAGTCTGG CTGCTTTGCT GGAAGGGCTG GGATTTGGTG AGCGTTTCGT GCGCACCGCT
TTGTTTCGTC TTAATAAAGA AGGCTGGCTG GATGTTTCCC GCATCGGGCG ACGCAGTTTC
TATAGCCTCA GTGATAAAGG CTTGCGCCTG ACGCGACGGG CAGAAAGTAA AATTTATCGC
GCAGAGCAAC CTGCATGGGA TGGTAAATGG CTCCTGTTGC TCTCGGAAGG TTTAGATAAA
GCGACGCTGG CTGATGTCAA AAAGCAGTTG ATCTGGCAAG GTTTTGGCGC ACTGGCACCC
AGCCTGATGG CATCGCCGTC GCAAAAACTG GCGGATGTAC AGACACTTTT GCATGAAGCG
GGGGTGGCGG ATAACGTGAT TTGTTTTGAA GCGCAAATAC CACTGGCGCT TTCTCGCGCA
GCACTGCGTG CCAGAGTAGA AGAGTGCTGG CATTTAACTG AACAAAATGC CATGTACGAA
ACCTTTATTC AGTCATTCCG CCCGCTGGTG CCGCTGTTAA AAGAGACGGC AGACGAGTTA
ACCCCGGAGC GTGCATTTCA TATTCAGCTT TTACTGATCC ATTTTTATCG CCGTGTCGTC
CTTAAAGACC CATTGTTGCC GGAGGAGTTG CTTCCGGCTC ACTGGGCAGG GCATACGGCG
CGTCAGCTGT GTATCAACAT TTATCAGCGC GTAGCGCCTG CTGCTTTAGC GTTCGTTAGT
GAAAAAGGTG AAACCTCAGT CGGTGAACTG CCTGCGCCGG GAAGCCTGTA TTTTCAACGT
TTTGGCGGCT TGAATATTGA ACAGGAGGCG TTATGCCAAT TTACCAGATA G
 
Protein sequence
MSKLDTFIQH AVNAVPVSGT SLISSLYGDS LSHRGGEIWL GSLAALLEGL GFGERFVRTA 
LFRLNKEGWL DVSRIGRRSF YSLSDKGLRL TRRAESKIYR AEQPAWDGKW LLLLSEGLDK
ATLADVKKQL IWQGFGALAP SLMASPSQKL ADVQTLLHEA GVADNVICFE AQIPLALSRA
ALRARVEECW HLTEQNAMYE TFIQSFRPLV PLLKETADEL TPERAFHIQL LLIHFYRRVV
LKDPLLPEEL LPAHWAGHTA RQLCINIYQR VAPAALAFVS EKGETSVGEL PAPGSLYFQR
FGGLNIEQEA LCQFTR