Gene EcolC_3294 details

Gene Information

Locus tag: EcolC_3294
Symbol:
ID: 6067058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3608598
End bp: 3610184
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 56%
IMG OID: 641602709
Product: Fis family proprionate catabolism activator
Protein accession: YP_001726243
Protein GI: 170021289
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR02329] propionate catabolism operon regulatory protein PrpR
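
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one untranslated terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp 3608598, End bp 3610184.
gene_length = span_length(3608598, 3610184)
print(gene_length)                           # 1587
print(expected_protein_length(gene_length))  # 528
```

Both results agree with the Gene Length (1587 bp) and Protein Length (528 aa) fields.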


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACATC CACCACGGCT TAATGACGAC AAACCGGTTA TCTGGACGGT ATCTGTAACG 
CGCCTGTTCG AGCTGTTTCG CGATATCAGC CTCGAGTTTG ATCACCTGGC GAACATTACC
CCTATCCAGC TTGGCTTTGA AAAAGCGGTG ACCTACATCC ACAAGAAACT GGCAAACGAA
CGCTGTGACG CCATCATCGC CGCTGGATCT AACGGCGCGT ACCTGAAAAG CCGCCTGTCA
GTGCCAGTTA TTTTGATTAA ACCGAGCGGC TACGATGTGT TACAGGCACT GGCAAAAGCC
GGAAAACTCA CCTCTTCTAT CGGCGTTGTC ACTTATCAGG AAACTATTCC GGCACTGGTG
GCGTTTCAAA AAACCTTTAA TTTGCGCCTC GATCAACGTA GCTACATTAC CGAAGAAGAC
GCACGCGGGC AGATTAACGA GCTAAAAGCT AACGGCACCG AAGCGGTGGT CGGCGCGGGG
CTGATTACCG ATCTGGCAGA AGAAGCCGGA ATGACCGGAA TTTTTATCTA TTCCGCCGCC
ACCGTGCGCC AGGCGTTCAG CGATGCGCTG GATATGACGC GCATGTCGTT ACGCCATAAC
ACTCACGATG CCACCCGCAA CGCCCTGAGA ACTCGTTACG TGCTGGGCGA TATGCTCGGT
CAATCACCAC AGATGGAACA AGTACGGCAG ACTATTTTGC TGTATGCCCG CTCCAGTGCA
GCGGTGTTGA TTGAGGGGGA AACGGGGACG GGCAAAGAGC TGGCGGCCCA GGCGATTCAT
CGGGAATATT TTGCCCGCCA CGATGCGCGA CAGGGCAAAA AGTCGCATCC GTTTGTTGCA
GTCAACTGCG GGGCGATTGC CGAATCGCTG CTGGAAGCAG AACTGTTTGG CTATGAGGAA
GGGGCGTTTA CCGGCTCGCG ACGCGGCGGT CGCGCCGGGC TGTTTGAAAT CGCCCACGGA
GGTACGCTGT TTCTCGATGA GATTGGCGAA ATGCCGCTGC CGTTGCAGAC CCGGCTGCTG
CGGGTGCTGG AAGAAAAAGA GGTCACCCGC GTCGGCGGGC ATCAGCCTGT TCCGGTGGAT
GTGCGGGTCA TTAGCGCCAC TCACTGCAAT CTGGAAGAAG ATATGCGGCA AGGGCAGTTT
CGCCGTGACC TGTTTTATCG GCTGAGTATT TTGCGTCTGC AATTGCCACC ACTGCGCGAG
CGGGTGGCGG ATATTCTGCC ACTGGCGGAA AGCTTTTTGA AAGTGTCTCT GGCGGCGCTC
TCCGCCCCGT TTTCTGCCGC ATTACGCCAG GGATTACAGG CAAGCGAAAC CGTGCTGGTG
CACTACGACT GGCCGGGCAA TATTCGTGAA CTGCGCAATA TGATGGAGCG ACTGGCGCTA
TTTTTAAGTG TGGAACCGAC GCCGGATTTA ACGCCGCAAT TTTTGCAGCT GCTACTGCCG
GAACTGGCGC GCGAGTCGGC GAAAACTCCC GCTCCACGCT TACTGACACC ACAACAGGCA
CTGGAGAAAT TTAATGGCGA TAAAACAGCA GCGGCGAATT ATTTAGGCAT CAGCCGGACG
ACGTTCTGGC GGCGGCTGAA AAGCTGA
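
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way a GC percentage like the one reported in the record (56%) could be recomputed. The helper names are hypothetical, and the demo uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a small demo input.
first_line = "ATGGCACATC CACCACGGCT TAATGACGAC AAACCGGTTA TCTGGACGGT ATCTGTAACG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` would give the figure to compare against the GC content field.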
 
Protein sequence
MAHPPRLNDD KPVIWTVSVT RLFELFRDIS LEFDHLANIT PIQLGFEKAV TYIHKKLANE 
RCDAIIAAGS NGAYLKSRLS VPVILIKPSG YDVLQALAKA GKLTSSIGVV TYQETIPALV
AFQKTFNLRL DQRSYITEED ARGQINELKA NGTEAVVGAG LITDLAEEAG MTGIFIYSAA
TVRQAFSDAL DMTRMSLRHN THDATRNALR TRYVLGDMLG QSPQMEQVRQ TILLYARSSA
AVLIEGETGT GKELAAQAIH REYFARHDAR QGKKSHPFVA VNCGAIAESL LEAELFGYEE
GAFTGSRRGG RAGLFEIAHG GTLFLDEIGE MPLPLQTRLL RVLEEKEVTR VGGHQPVPVD
VRVISATHCN LEEDMRQGQF RRDLFYRLSI LRLQLPPLRE RVADILPLAE SFLKVSLAAL
SAPFSAALRQ GLQASETVLV HYDWPGNIRE LRNMMERLAL FLSVEPTPDL TPQFLQLLLP
ELARESAKTP APRLLTPQQA LEKFNGDKTA AANYLGISRT TFWRRLKS
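
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal standard-library sketch, assuming that the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which is not handled here; this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard codon ordering: bases in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Whitespace is ignored; characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCACATC CACCACGGCT TAATGACGAC"))  # MAHPPRLNDD
# Last line of the gene sequence, which starts in frame (base 1561).
print(translate("ACGTTCTGGC GGCGGCTGAA AAGCTGA"))     # TFWRRLKS
```

The two outputs match the first and last residues of the protein sequence listed in the record.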