Gene EcolC_0686 details


Gene Information

Locus tag: EcolC_0686
Symbol:
ID: 6066680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 739349
End bp: 740305
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 50%
IMG OID: 641600093
Product: AraC family transcriptional regulator
Protein accession: YP_001723689
Protein GI: 170018735
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
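
The coordinate and length fields above are consistent with each other, which a short calculation can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information record above.
START_BP = 739349
END_BP = 740305


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 957 318
```

Under these assumptions the result matches the reported gene length of 957 bp and protein length of 318 aa.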


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTACAAA ATTGCGCACA ATCAAATTGC CGCATTATTC CTAAGAAATT ACGCGATATG 
AAACGTGAAG AGATTTGCCG CTTGCTGGCG GATAAAGTTA ATAAACTGAA AAATAAAGAA
AATAGTTTGT CAGAACTGTT GCCCGATGTG CGTTTGTTGT ATGGCGAGAC ACCTTTCGCA
CGTACACCGG TGATGTACGA GCCTGGCATC ATAATTCTCT TTTCCGGGCA TAAAATCGGT
TATATCAATG AACGCGTGTT TCGTTATGAT GCCAATGAAT ACCTGCTGCT GACGGTGCCG
TTGCCGTTTG AGTGCGAAAC CTATGCCACG TCAGAGGTGC CGCTGGCAGG GTTGCGTCTC
AATGTCGATA TTTTGCAGTT ACAGGAACTG TTGATGGACA TTGGCGAAGA TGAGCATTTC
CAGCCGTCGA TGGCAGCCAG CGGGATTAAC TCCGCCACGT TATCAGAAGA GATTTTATGC
GCGGCGGAGC GGTTACTCGA CGTGATGGAG CGACCACTGG ATGCGCGTAT TCTCGGCAAA
CAGATCATCC GCGAAATTCT GTACTACGTG CTGACCGGAC CTTGCGGCGG CGCGTTACTG
GCGCTGGTCA GCCGCCAGAC TCACTTCAGT CTGATTAGCC GCGTGCTGAA ACGGATTGAG
AATAAATACA CCGAAAACCT GAGCGTCGAG CAACTGGCGG CAGAAGCCAA CATGAGCGTA
TCGGCGTTCC ACCATAATTT TAAGTCTGTC ACCAGCACCT CGCCGTTGCA GTATTTGAAG
AATTACCGTC TGCATAAGGC GCGGATGATG ATCATCCATG ACGGCATGAA GGCCAGCGCA
GCAGCGATGC GCGTCGGCTA TGAAAGCGCA TCGCAATTTA GCCGTGAGTT TAAACGTTAC
TTCGGTGTGA CGCCGGGGGA AGATGCGGCA AGAATGCGGG CGATGCAGGG GAATTAA
 
Protein sequence
MLQNCAQSNC RIIPKKLRDM KREEICRLLA DKVNKLKNKE NSLSELLPDV RLLYGETPFA 
RTPVMYEPGI IILFSGHKIG YINERVFRYD ANEYLLLTVP LPFECETYAT SEVPLAGLRL
NVDILQLQEL LMDIGEDEHF QPSMAASGIN SATLSEEILC AAERLLDVME RPLDARILGK
QIIREILYYV LTGPCGGALL ALVSRQTHFS LISRVLKRIE NKYTENLSVE QLAAEANMSV
SAFHHNFKSV TSTSPLQYLK NYRLHKARMM IIHDGMKASA AAMRVGYESA SQFSREFKRY
FGVTPGEDAA RMRAMQGN
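
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. It does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. The helper names `clean` and `translate` are hypothetical, and only the first and last few codons are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGCTACAAA ATTGCGCACA ATCAAATTGC"))  # MLQNCAQSNC
# Last three codons of the gene sequence, ending in the TAA stop codon.
print(translate("GGGAATTAA"))  # GN
```

Both results agree with the start (MLQNCAQSNC) and end (GN) of the protein sequence listed above. Passing the full gene sequence to `translate` would extend the same check to all 318 residues.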