Gene EcolC_0839 details

Gene Information

Locus tag: EcolC_0839
Symbol:
ID: 6067240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 906957
End bp: 908735
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 46%
IMG OID: 641600244
Product: PAS modulated sigma54 specific transcriptional regulator
Protein accession: YP_001723838
Protein GI: 170018884
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
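
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(906957, 908735)
print(length_bp)                  # 1779
print(protein_length(length_bp))  # 592
```

Both results agree with the Gene Length and Protein Length fields listed in the record.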


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCTTG CTACTACGCA GTCAGTATTG ATGCAAATTC AACCGACAAT TCAGCGTTTT 
GCCAGAATGC TTGCCAGCGT TTTGCAGCTT GAGGTTGAGA TCGTTGATGA AAACTTGTGT
CGCGTTGCCG GAACGGGCGC GTATGGGAAG TTTCTTGGTC GCCAGTTGAG CGGCAACTCA
CGCCTGCTCC GCCACGTCCT GGAAACGAAA ACTGAAAAAG TTGTGACACA GTCTCGCTTC
GATCCCCTTT GCGAAGGTTG CGATAGTAAA GAAAATTGCC GCGAAAAAGC ATTTCTGGGT
ACGCCTGTCA TTTTACAGGA TCGTTGTGTT GGGGTGATAA GTTTGATTGC CGTTACCCAC
GAGCAACAAG AGCATATCAG TGATAATTTA CGCGAATTTT CTGATTATGT TCGCCATATA
TCCACCATTT TTGTTTCGAA ACTTCTGGAG GATCAGGGGC CAGGAGATAA CATCAGTAAA
ATATTCGCGA CCATGATCGA TAATATGGAT CAGGGCGTAT TAGTTGTTGA TGATGAAAGT
CGGGTTCAGT TTGTTAATCA GACTGCCTTA AAAACACTTG GTGTTGTACA AAATAATATT
ATTGGGAAAC CTATCCGTTT CAGACCATTA ACATTTGAGA GTAATTTTAC TCATGGACAT
ATGCAGCATA TTGTTTCGTG GGACGATAAA AGTGAATTAA TCATTGGTCA ATTGCATAAC
ATTCAGGGCC GACAATTATT TTTAATGGCA TTTCACCAAT CGCATACCAG TTTTTCTGTA
GCAAATGCAC CTGATGAACC ACATATTGAA CAATTGGTTG GCGAGTGCCG TGTTATGCGG
CAATTAAAAC GACTCATTAG CCGTATTGCA CCCAGCCCAT CCAGCGTTAT GGTGGTTGGT
GAAAGCGGCA CGGGTAAAGA AGTCGTCGCC CGAGCAATCC ATAAGTTGAG CGGAAGACGG
AATAAACCCT TTATTGCTAT CAACTGTGCC GCGATTCCGG AGCAGCTTCT GGAAAGCGAA
CTGTTCGGTT ATGTTAAAGG CGCATTTACT GGCGCTTCTG CCAACGGTAA AACAGGGTTG
ATTCAGGCGG CGAATACGGG CACGCTGTTT CTCGATGAAA TAGGTGATAT GCCATTAATG
TTGCAGGCTA AATTACTGCG CGCTATTGAG GCGCGTGAAA TTCTGCCGAT TGGTGCCAGT
AGCCCAATAC AAGTCGACAT TCGCATCATT TCTGCAACTA ATCAGAATTT GGCCCAGTTC
ATTGCCGAAG GTAAATTCCG CGAAGATCTC TTCTACCGAC TTAATGTTAT CCCGATAACT
CTGCCACCGC TGCGTGAACG TCAGGAAGAT ATTGAACTAT TGGTGCATTA CTTTTTACAT
CTGCATACCC GTCGTCTGGG ATCGGTTTAT CCTGGCATTG CTCCCGATGT CGTCGAAATA
TTGCGTAAGC ATCGTTGGCC CGGAAACCTG CGCGAGTTAA GCAATTTGAT GGAATATCTG
GTTAACGTGG TTCCTTCAGG TGAAGTTATC GACAGCACGC TATTGCCGCC AAATCTGCTG
AATAATGGCA CAACGGAGCA AAGTGATGTA ACAGAGGTCA GTGAGGCGCA CCTGTCACTC
GATGATGCGG GCGGCACGGC GCTGGAGGAG ATGGAAAAGC AAATGATCCG CGAGGCGCTT
TCACGTCATA ACAGCAAGAA GCAAGTTGCT GATGAACTGG GCATCGGCAT TGCTACGCTC
TATCGCAAGA TTAAGAAATA TGAGTTGTTA AACACATAA
 
Protein sequence
MELATTQSVL MQIQPTIQRF ARMLASVLQL EVEIVDENLC RVAGTGAYGK FLGRQLSGNS 
RLLRHVLETK TEKVVTQSRF DPLCEGCDSK ENCREKAFLG TPVILQDRCV GVISLIAVTH
EQQEHISDNL REFSDYVRHI STIFVSKLLE DQGPGDNISK IFATMIDNMD QGVLVVDDES
RVQFVNQTAL KTLGVVQNNI IGKPIRFRPL TFESNFTHGH MQHIVSWDDK SELIIGQLHN
IQGRQLFLMA FHQSHTSFSV ANAPDEPHIE QLVGECRVMR QLKRLISRIA PSPSSVMVVG
ESGTGKEVVA RAIHKLSGRR NKPFIAINCA AIPEQLLESE LFGYVKGAFT GASANGKTGL
IQAANTGTLF LDEIGDMPLM LQAKLLRAIE AREILPIGAS SPIQVDIRII SATNQNLAQF
IAEGKFREDL FYRLNVIPIT LPPLRERQED IELLVHYFLH LHTRRLGSVY PGIAPDVVEI
LRKHRWPGNL RELSNLMEYL VNVVPSGEVI DSTLLPPNLL NNGTTEQSDV TEVSEAHLSL
DDAGGTALEE MEKQMIREAL SRHNSKKQVA DELGIGIATL YRKIKKYELL NT