Gene EcHS_A1486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1486 
SymbolpaaX 
ID5591576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1486732 
End bp1487682 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content51% 
IMG OID640920643 
Productphenylacetic acid degradation operon negative regulatory protein PaaX 
Protein accessionYP_001458199 
Protein GI157160881 
COG category[K] Transcription 
COG ID[COG3327] Phenylacetic acid-responsive transcriptional repressor 
TIGRFAM ID[TIGR02277] phenylacetic acid degradation operon negative regulatory protein PaaX 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAC TTGATACTTT TATCCAACAT GCTGTAAACG CTGTTCCGGT CAGTGGCACA 
TCTTTGATCT CCTCTCTGTA TGGTGATTCG CTTTCCCATC GTGGTGGTGA AATCTGGCTT
GGTAGTCTGG CTGCTTTGCT GGAAGGGCTG GGATTTGGTG AGCGTTTCGT GCGCACCGCT
TTGTTTCGTC TTAATAAAGA AGGCTGGCTG GATGTTTCCC GCATCGGGCG ACGCAGTTTC
TATAGCCTCA GTGATAAAGG CTTGCGCCTG ACGCGACGGG CAGAAAGTAA AATTTATCGC
GCAGAGCAAC CTGCATGGGA TGGTAAATGG CTCCTGTTGC TCTCGGAAGG TTTAGATAAA
GCGACGCTGG CTGATGTCAA AAAGCAGTTG ATCTGGCAAG GTTTTGGCGC ACTGGCACCC
AGCCTGATGG CATCGCCGTC GCAAAAACTG GCGGATGTAC AGACACTTTT GCATGAAGCG
GGGGTGGCGG ATAACGTGAT TTGTTTTGAA GCGCAAATAC CACTGGCGCT TTCTCGCGCA
GCACTGCGTG CCAGAGTAGA AGAGTGCTGG CATTTAACTG AACAAAATGC CATGTACGAA
ACCTTTATTC AGTCATTCCG CCCGCTGGTG CCGCTGTTAA AAGAGACGGC AGACGAGTTA
ACCCCGGAGC GTGCATTTCA TATTCAGCTT TTACTGATCC ATTTTTATCG CCGTGTCGTC
CTTAAAGACC CATTGTTGCC GGAGGAGTTG CTTCCGGCTC ACTGGGCAGG GCATACGGCG
CGTCAGCTGT GTATCAACAT TTATCAGCGC GTAGCGCCTG CTGCTTTAGC GTTCGTTAGT
GAAAAAGGTG AAACCTCAGT CGGTGAACTG CCTGCGCCGG GAAGCCTGTA TTTTCAACGT
TTTGGCGGCT TGAATATTGA ACAGGAGGCG TTATGCCAAT TTACCAGATA G
 
Protein sequence
MSKLDTFIQH AVNAVPVSGT SLISSLYGDS LSHRGGEIWL GSLAALLEGL GFGERFVRTA 
LFRLNKEGWL DVSRIGRRSF YSLSDKGLRL TRRAESKIYR AEQPAWDGKW LLLLSEGLDK
ATLADVKKQL IWQGFGALAP SLMASPSQKL ADVQTLLHEA GVADNVICFE AQIPLALSRA
ALRARVEECW HLTEQNAMYE TFIQSFRPLV PLLKETADEL TPERAFHIQL LLIHFYRRVV
LKDPLLPEEL LPAHWAGHTA RQLCINIYQR VAPAALAFVS EKGETSVGEL PAPGSLYFQR
FGGLNIEQEA LCQFTR