Gene EcolC_0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0145 
Symbol 
ID6068299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp158669 
End bp159847 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content49% 
IMG OID641599545 
ProductAraC family transcriptional regulator 
Protein accessionYP_001723154 
Protein GI170018200 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACTA AACGTCACCG CATCACATTA CTGTTCAATG CCAATAAAGC CTATGACCGG 
CAGGTAGTAG AAGGCGTAGG GGAATATTTA CAGGCGTCAC AATCGGAATG GGATATTTTC
ATTGAAGAAG ATTTCCGCGC CCGCATTGAT AAAATCAAGG ACTGGTTAGG AGATGGCGTC
ATTGCCGACT TCGACGACAA ACAGATCGAG CAAGCGCTGG CTGATGTCGA CGTCCCCATT
GTTGGGGTTG GCGGCTCGTA TCACCTTGCA GAAAGTTACC CACCCGTTCA TTACATTGCC
ACCGATAACT ATGCGCTGGT TGAAAGCGCA TTTTTGCATT TAAAAGAGAA AGGCGTTAAC
CGCTTTGCTT TTTATGGTCT TCCGGAATCA AGCGGCAAAC GTTGGGCCAC TGAGCGCGAA
TATGCATTTC GTCAGCTTGT CGCCGAAGAA AAGTATCGCG GAGTGGTTTA TCAGGGGTTA
GAAACCGCGC CAGAGAACTG GCAACACGCG CAAAATCGGC TGGCAGACTG GCTACAAACG
CTACCACCGC AAACCGGGAT TATTGCCGTT ACTGACGCCC GAGCGCGGCA TATTCTGCAA
GTATGTGAAC ATCTACATAT TCCCGTACCG GAAAAATTAT GCGTGATTGG CATCGATAAC
GAAGAACTGA CCCGCTATCT GTCGCGTGTC GCCCTTTCTT CGGTCGCTCA GGGCGCGCGG
CAAATGGGCT ATCAGGCGGC AAAACTGTTG CATCGATTAT TAGATAAAGA AGAAATGCCG
CTACAGCGAA TTTTGGTCCC ACCAGTTCGC GTCATTGAAC GGCGCTCAAC AGATTATCGC
TCGCTGACCG ATCCCGCCGT TATTCAGGCC ATGCATTACA TTCGTAATCA CGCCTGTAAA
GGGATTAAAG TGGATCAGGT ACTGGATGCG GTCGGGATCT CGCGCTCCAA TCTTGAGAAG
CGTTTTAAAG AAGAGGTGGG TGAAACCATC CATGCCATGA TTCATGCCGA GAAGCTGGAG
AAAGCGCGCA GTCTGCTGAT TTCAACCACC TTGTCGATCA ATGAGATATC GCAAATGTGC
GGTTATCCAT CGCTGCAATA TTTCTACTCT GTTTTTAAAA AAGCATATGA CACGACGCCA
AAAGAGTATC GCGATGTAAA TAGCGAGGTC ATGTTGTAG
 
Protein sequence
MFTKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID KIKDWLGDGV 
IADFDDKQIE QALADVDVPI VGVGGSYHLA ESYPPVHYIA TDNYALVESA FLHLKEKGVN
RFAFYGLPES SGKRWATERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT
LPPQTGIIAV TDARARHILQ VCEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR
QMGYQAAKLL HRLLDKEEMP LQRILVPPVR VIERRSTDYR SLTDPAVIQA MHYIRNHACK
GIKVDQVLDA VGISRSNLEK RFKEEVGETI HAMIHAEKLE KARSLLISTT LSINEISQMC
GYPSLQYFYS VFKKAYDTTP KEYRDVNSEV ML