Gene EcHS_A0396 details

Gene Information

Locus tag: EcHS_A0396
Symbol: prpR
ID: 5591597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 412848
End bp: 414434
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 55%
IMG OID: 640919581
Product: propionate catabolism operon regulatory protein PrpR
Protein accession: YP_001457166
Protein GI: 157159848
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR02329] propionate catabolism operon regulatory protein PrpR
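
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 412848
end_bp = 414434

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1587 bp
print(protein_length, "aa")  # 528 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.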


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACATC CACCACGGCT TAATGACGAC AAACCGGTTA TCTGGACGGT ATCTGTAACG 
CGCCTGTTCG AGCTGTTTCG CGATATCAGC CTCGAGTTTG ATCACCTGGC GAACATTACC
CCTATCCAGC TTGGCTTTGA AAAAGCAGTG ACCTATATCC GCAAGAAACT GGCAAACGAA
CGCTGCGACG CCGTCATCGC CGCTGGCTCT AACGGCGCGT ACCTGAAAAG CCGCCTGTCA
GTGCCAGTTA TTTTGATTAA ACCGAGCGGC TACGATGTGT TACAGGCACT GGCAAAAGCC
GGAAAGCTCA CCTCTTCTAT CGGCGTTGTC ACTTATCAGG AAACTATTCC GGCACTGGTG
GCGTTTCAAA AAACCTTTAA TTTGCGCCTC GATCAACGTA GCTACATTAC CGAAGAAGAC
GCCCGCGGGC AGATTAACGA GCTCAAAGCT AACGGCACCG AAGCGGTGGT CGGTGCGGGG
CTGATTACCG ATCTGGCAGA AGAAGCCGGA ATGACCGGAA TTTTTATCTA TTCCGCCGTC
ACCGTGCGCC AGGCGTTCAG CGATGCGCTG GATATGACGC GCATGTCGTT ACGCCATAAC
ACTCACGATG CCACCCGCAA CGCCCTGAGA ACTCGTTACG TGCTGGGCGA TATGCTCGGT
CAATCACCAC AGATGGAACA AGTACGGCAG ACTATTTTGC TGTATGCCCG CTCCAGTGCG
GCGGTGTTGA TTGAAGGGGA AACGGGGACG GGCAAAGAGC TGGCGGCCCA GGCGATTCAT
CGGGAATATT TTGCCCGCCA CGATGCGCGA CAGGGCAAAA AGTCGCATCC GTTTGTTGCC
GTCAACTGCG GGGCGATAGC CGAATCGCTG CTGGAAGCGG AGCTGTTTGG CTATGAGGAA
GGGGCGTTTA CCGGCTCGCG ACGCGGAGGT CGCGCCGGGC TGTTCGAAAT TGCCCACGGC
GGTACGCTGT TTCTGGATGA GATTGGCGAA ATGCCGCTAC CTTTGCAGAC TCGCCTGTTA
CGGGTGCTGG AAGAAAAAGA GGTTACTCGC GTCGGCGGGC ATCAGCCTGT TCCGGTGGAT
GTGCGGGTCA TTAGCGCCAC TCACTGCAAT CTGGAAGAAG ATATGCGGCA AGGGCAGTTT
CGCCGTGATC TGTTTTATCG GCTGAGTATT TTGCGTCTGC AATTGCCACC ACTGCGCGAG
CGGGTGGCGG ATATTCTTCC GCTGGCGGAA AGCTTTTTGA AAGTGTCTCT GGCGGCGCTC
TCCGCCCCGT TTTCTGCCGC ATTACGCCAG GGATTACAGG CAAGCGAAAC CGTGCTGGTG
CACTACGACT GGCCAGGCAA TATTCGTGAA CTGCGCAATA TGATGGAACG ACTGGCGCTG
TTTTTAAGTG TGGAACCGAC GCCGGATTTA ACGCCGCAGT TTATGCAACT GCTACTGCCG
GAGCTGGCGC GCGAGTCGGC GAAAATTCCC GCTCCACGCT TACTGACACC ACAACAGGCA
CTGGAGAAAT TTAAAGGCGA TAAAACAGCA GCGGCGAATT ATTTAGGCAT AAGCCGGACG
ACGTTCTGGC GGCGGCTGAA AAACTGA
 
Protein sequence
MAHPPRLNDD KPVIWTVSVT RLFELFRDIS LEFDHLANIT PIQLGFEKAV TYIRKKLANE 
RCDAVIAAGS NGAYLKSRLS VPVILIKPSG YDVLQALAKA GKLTSSIGVV TYQETIPALV
AFQKTFNLRL DQRSYITEED ARGQINELKA NGTEAVVGAG LITDLAEEAG MTGIFIYSAV
TVRQAFSDAL DMTRMSLRHN THDATRNALR TRYVLGDMLG QSPQMEQVRQ TILLYARSSA
AVLIEGETGT GKELAAQAIH REYFARHDAR QGKKSHPFVA VNCGAIAESL LEAELFGYEE
GAFTGSRRGG RAGLFEIAHG GTLFLDEIGE MPLPLQTRLL RVLEEKEVTR VGGHQPVPVD
VRVISATHCN LEEDMRQGQF RRDLFYRLSI LRLQLPPLRE RVADILPLAE SFLKVSLAAL
SAPFSAALRQ GLQASETVLV HYDWPGNIRE LRNMMERLAL FLSVEPTPDL TPQFMQLLLP
ELARESAKIP APRLLTPQQA LEKFKGDKTA AANYLGISRT TFWRRLKN
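
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. It is an illustration only: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard one, whose amino-acid assignments are shared by translation table 11. Table 11 also allows alternative start codons, which this sketch does not handle; the gene here begins with ATG, so that case does not arise. Only the first line of the gene sequence is used as input.

```python
# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCACATC CACCACGGCT TAATGACGAC AAACCGGTTA TCTGGACGGT ATCTGTAACG"
fragment = clean(first_line)

print(len(fragment))        # 60
print(translate(fragment))  # MAHPPRLNDDKPVIWTVSVT
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through the same helpers would allow the Protein Length and GC content fields to be checked in the same way.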